GenAI Workshop
What is GenAI?
Generative AI (Generative Artificial Intelligence)
refers to a class of AI models designed to create
new content, such as text, images, music, code, or
even videos, rather than just analyzing or
classifying data. These models learn from vast
datasets and generate human-like outputs based
on patterns they recognize.
Diagram: DL models split into Discriminative AI vs. Generative AI.
retriever.search_kwargs — filters retrieved documents; e.g. {"k": 3} retrieves the top 3 matches (see the sketch below).
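A minimal sketch of where this parameter fits, assuming the LangChain API (module paths vary across LangChain versions; the sample texts and embedding model are illustrative):

```python
from langchain_community.vectorstores import Chroma
from langchain_huggingface import HuggingFaceEmbeddings

# Build a tiny in-memory vector store (texts are illustrative).
vectorstore = Chroma.from_texts(
    ["RAG retrieves context before generating.",
     "Prompt engineering shapes model output.",
     "Embeddings map text to vectors."],
    embedding=HuggingFaceEmbeddings(model_name="sentence-transformers/all-MiniLM-L6-v2"),
)

# search_kwargs={"k": 3} tells the retriever to return the top 3 matches.
retriever = vectorstore.as_retriever(search_kwargs={"k": 3})
docs = retriever.invoke("What is RAG?")
```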
Prompt Engineering: The Art of Talking to AI
Optimizing AI Responses for Accuracy & Relevance
Definition:
The practice of designing input prompts
to guide an AI model’s response.
Why It Matters:
LLMs respond based on how the question
is framed.
Better prompts → Better responses.
Types of Prompts
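Two common styles, sketched below: a bare zero-shot ask versus a structured prompt that adds a role, constraints, and an output format (the prompt text is illustrative):

```python
# Zero-shot: the model gets no role, format, or constraints.
vague_prompt = "Tell me about RAG."

# Structured: same request, but framed for accuracy and relevance.
engineered_prompt = """You are a teaching assistant at an AI workshop.
Explain Retrieval-Augmented Generation (RAG) to a beginner.
- Use at most 3 sentences.
- Give one concrete example.
- End with a one-line summary."""
```

The second prompt typically yields a tighter, more on-topic answer because the model knows the audience, length, and format expected.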
Flow:
1. Load Data – Fetch documents from a source
(PDF, database, API, etc.).
2. Preprocess & Split – Convert into chunks for
better retrieval.
3. Embed & Store – Convert text into embeddings
(vector database).
4. Retrieve & Generate – Use RAG to fetch relevant
data and answer queries.
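The four steps above, sketched with LangChain (a sketch under assumed package layouts, which shift between LangChain versions; the file path and model name are placeholders):

```python
from langchain_community.document_loaders import PyPDFLoader
from langchain_text_splitters import RecursiveCharacterTextSplitter
from langchain_huggingface import HuggingFaceEmbeddings
from langchain_community.vectorstores import Chroma

# 1. Load data from a source (here, a placeholder PDF path).
docs = PyPDFLoader("notes.pdf").load()

# 2. Preprocess & split into overlapping chunks for better retrieval.
splitter = RecursiveCharacterTextSplitter(chunk_size=500, chunk_overlap=50)
chunks = splitter.split_documents(docs)

# 3. Embed & store the chunks in a vector database.
db = Chroma.from_documents(chunks, HuggingFaceEmbeddings(
    model_name="sentence-transformers/all-MiniLM-L6-v2"))

# 4. Retrieve relevant chunks; a RAG chain would pass these to an LLM.
relevant = db.as_retriever(search_kwargs={"k": 3}).invoke("Summarize the key points")
```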
Memory
| Memory Type | Use Case | Strengths | Weaknesses |
| --- | --- | --- | --- |
| ConversationBufferMemory | Full conversation history | Keeps everything | High token usage |
| ConversationBufferWindowMemory | Limited history (last k interactions) | Reduces token usage | Forgets older context |
| ConversationKGMemory | Fact storage | Structured knowledge | Not ideal for casual chats |
| VectorStoreMemory | Large-scale knowledge storage | Efficient for retrieval | Requires a vector database |
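For instance, windowed memory can be tried with the classic langchain.memory interface (deprecated in recent LangChain releases, but it matches the table above):

```python
from langchain.memory import ConversationBufferWindowMemory

# Keep only the last k=2 exchanges to cap token usage.
memory = ConversationBufferWindowMemory(k=2, return_messages=True)
memory.save_context({"input": "Hi"}, {"output": "Hello! How can I help?"})
memory.save_context({"input": "What is RAG?"}, {"output": "Retrieval-Augmented Generation."})
memory.save_context({"input": "And embeddings?"}, {"output": "Vector representations of text."})

# Only the last 2 exchanges survive; the greeting has been forgotten.
print(memory.load_memory_variables({}))
```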
Hugging Face
Hugging Face is a leading platform for open-source AI models, including LLMs, transformers,
embeddings, and datasets. It provides tools like:
🤗 Transformers (for NLP models)
🤗 Datasets (for data processing)
🤗 Spaces (for hosting AI apps)
🤗 Model Hub (for accessing pre-trained models)
LangChain integrates Hugging Face models for:
1. LLMs (HuggingFaceHub, Transformers)
2. Embeddings (Sentence Transformers, BERT, etc.)
3. Vector Databases (Chroma, Pinecone, etc.)
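A small sketch of the first two integration points, using the Hugging Face libraries directly (gpt2 and all-MiniLM-L6-v2 are example models, fetched from the Model Hub on first run):

```python
from transformers import pipeline
from sentence_transformers import SentenceTransformer

# 🤗 Transformers: a text-generation pipeline around a Model Hub LLM.
generator = pipeline("text-generation", model="gpt2")
print(generator("Generative AI is", max_new_tokens=20)[0]["generated_text"])

# 🤗 Sentence Transformers: embeddings for similarity search / RAG.
embedder = SentenceTransformer("all-MiniLM-L6-v2")
vectors = embedder.encode(["Hugging Face hosts open models.",
                           "LangChain wires them together."])
print(vectors.shape)  # (2, 384): two 384-dimensional embeddings
```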
RAG
(Retrieval-Augmented Generation)
Retrieval-Augmented Generation (RAG) is an advanced
AI technique that combines a large language model (LLM)
with an external knowledge retrieval system to
generate more accurate, fact-based, and up-to-date
responses.
Instead of relying only on pre-trained knowledge, RAG
fetches relevant information from external sources
(e.g., vector databases, documents, APIs) before
generating a response.
Fine-Tuning vs RAG
Fine-Tuning:
Adjusting the model’s weights by training it on new
data
Requires labeled datasets for supervised learning
RAG:
Enhancing responses by retrieving external
information at query time
Uses external knowledge bases (documents, APIs,
databases)
How does RAG work?
Retrieval – fetch the most relevant context from an external source (e.g., a vector database).
Generation – the LLM composes an answer grounded in the retrieved context.
RAG Pipeline Architecture
User Query →
Embedding Model (Vectorization) →
Retrieval from Vector Database →
Augmentation (Appending Context) →
LLM Response Generation
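The same pipeline, sketched end to end with the chromadb client (a sketch: Chroma's default embedding function handles vectorization, and the final LLM call is left as a placeholder prompt):

```python
import chromadb

# In-memory Chroma client; the default embedding function vectorizes text.
client = chromadb.Client()
collection = client.create_collection("rag_demo")

# Index a few documents (ids and texts are illustrative).
collection.add(
    ids=["1", "2", "3"],
    documents=["RAG fetches external context at query time.",
               "Fine-tuning changes a model's weights.",
               "Embeddings enable similarity search."],
)

# Retrieval: embed the user query and fetch the closest documents.
results = collection.query(query_texts=["How does RAG stay factual?"], n_results=2)

# Augmentation: append the retrieved context to the prompt for the LLM.
context = "\n".join(results["documents"][0])
prompt = f"Answer using this context:\n{context}\n\nQuestion: How does RAG stay factual?"
```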
Embedding Model
An embedding model converts text, images,
or other data into high-dimensional vectors
(numerical representations). These vectors
capture semantic meaning, enabling
efficient similarity search in RAG (Retrieval-
Augmented Generation).
Vectorization in RAG
Vectorization is the process of converting
text, images, or other data into numerical
vectors for machine learning, similarity
search, and retrieval in RAG (Retrieval-
Augmented Generation) systems.
Vector (toy example — each row is one embedding dimension):

| | KING | QUEEN | MAN | FOX |
| --- | --- | --- | --- | --- |
| Male | 1 | 0 | 1 | 0.5 |
| Female | 0 | 1 | 0 | 0.5 |
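The toy table above hand-picks two dimensions; a real embedding model learns hundreds. A short sketch with sentence-transformers (all-MiniLM-L6-v2 is an example model; exact similarity scores will vary):

```python
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")  # 384-dimensional vectors
vecs = model.encode(["king", "queen", "man", "fox"])

# Cosine similarity: semantically related words land closer together.
print(util.cos_sim(vecs[0], vecs[1]))  # king vs queen: relatively high
print(util.cos_sim(vecs[0], vecs[3]))  # king vs fox: lower
```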
Diagram: User Query → Embedding Model → Vector → Chroma DB (similarity search).