Task 2 - Optimising RAG

Optimizing a Retrieval Augmented Generation (RAG) model can significantly enhance its performance, making it more efficient and effective in retrieving relevant information and generating accurate responses. Here are two innovative techniques to optimize the RAG model developed in Task 1:

Technique 1: Dynamic Context Window Management

Description:
In a RAG model, the quality of the generated response heavily
depends on the context provided to the generation model. By
dynamically managing the context window, we can ensure that
the most relevant and recent information is included in the
prompt, thereby improving the quality of the responses.

Implementation Steps:

1. Relevance Scoring: Enhance the retrieval mechanism to not only fetch the top-k documents but also score them on their relevance to the query. This can be done with a more sophisticated relevance scoring function that considers factors such as keyword overlap, semantic similarity, and the recency of the information.

2. Dynamic Context Window: Instead of using a fixed number of documents for context, dynamically adjust the context window based on the query. For instance, provide a larger context for complex queries, while a smaller context might suffice for simple ones.

3. Chunk Prioritization: When the retrieved chunks exceed the token limit of the generation model, prioritize the chunks by their relevance score and include only the top-scoring chunks in the context, so that the most pertinent information is used for generating the response. A minimal code sketch combining these three steps follows this list.
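
Below is a minimal sketch, in Python, of how these three steps could fit together. It assumes each retrieved chunk comes back from the Task 1 vector store as a dict with precomputed "text", "embedding", and "age_days" fields; the scoring weights, the query-complexity heuristic, and the whitespace-based token estimate are illustrative assumptions, not a definitive implementation.

```python
import math

def cosine(a, b):
    # Cosine similarity between two embedding vectors.
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(x * x for x in b))
    return dot / norm if norm else 0.0

def relevance_score(query, query_embedding, chunk):
    # Step 1: blend keyword overlap, semantic similarity, and recency.
    query_terms = set(query.lower().split())
    chunk_terms = set(chunk["text"].lower().split())
    keyword_overlap = len(query_terms & chunk_terms) / max(len(query_terms), 1)
    semantic_sim = cosine(query_embedding, chunk["embedding"])
    recency = 1.0 / (1.0 + chunk["age_days"])  # newer chunks score higher
    return 0.3 * keyword_overlap + 0.5 * semantic_sim + 0.2 * recency  # illustrative weights

def context_window_size(query, base_k=3, max_k=10):
    # Step 2: widen the window for longer queries (a crude proxy for complexity).
    return min(max_k, base_k + len(query.split()) // 10)

def build_context(query, query_embedding, retrieved_chunks, token_budget=3000):
    # Step 3: keep only the top-scoring chunks that fit within the token budget.
    k = context_window_size(query)
    scored = sorted(retrieved_chunks,
                    key=lambda c: relevance_score(query, query_embedding, c),
                    reverse=True)[:k]
    context, used = [], 0
    for chunk in scored:
        tokens = len(chunk["text"].split())  # rough token estimate
        if used + tokens > token_budget:
            break
        context.append(chunk["text"])
        used += tokens
    return "\n\n".join(context)
```

The string returned by build_context would then be placed in the prompt ahead of the user's question before calling the generation model.
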
Technique 2: Fine-tuning the Generation Model

Description:
Fine-tuning the generation model on domain-specific data can
significantly improve its ability to generate accurate and
contextually appropriate responses. This involves training the
model on a dataset that closely resembles the type of questions
and answers it will encounter in the real world.

Implementation Steps:

1. Domain-Specific Dataset: Compile a dataset of question-answer pairs relevant to the business domain. This dataset should be diverse and cover the various topics that the QA bot is expected to handle.

2. Fine-Tuning: Fine-tune the GPT model on this dataset. This process involves training the model to minimize the loss on the question-answer pairs, thereby aligning the model's output more closely with the desired responses.

3. Evaluation and Iteration: Evaluate the fine-tuned model on a validation set to ensure it generalizes well to unseen questions. Iterate on the fine-tuning process if necessary, adjusting hyperparameters and the dataset as needed. A brief fine-tuning sketch follows this list.
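
As one possible route, the sketch below fine-tunes a small GPT-style model with the Hugging Face Transformers library. The gpt2 checkpoint, the toy question-answer pairs, and the training hyperparameters are placeholders for illustration; the actual Task 1 model, the full domain dataset, and a proper validation split would take their place.

```python
from datasets import Dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer, TrainingArguments)

# Step 1: a tiny, hypothetical domain-specific dataset of question-answer pairs.
qa_pairs = [
    {"question": "What is the refund window?",
     "answer": "Refunds are accepted within 30 days of purchase."},
    {"question": "How do I reset my password?",
     "answer": "Use the 'Forgot password' link on the login page."},
]

tokenizer = AutoTokenizer.from_pretrained("gpt2")
tokenizer.pad_token = tokenizer.eos_token  # GPT-2 has no pad token by default
model = AutoModelForCausalLM.from_pretrained("gpt2")

def to_text(example):
    # Format each pair as a single prompt/completion string for causal LM training.
    return {"text": f"Question: {example['question']}\nAnswer: {example['answer']}{tokenizer.eos_token}"}

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=256)

dataset = (Dataset.from_list(qa_pairs)
           .map(to_text)
           .map(tokenize, batched=True, remove_columns=["question", "answer", "text"]))

# Step 2: minimize the language-modelling loss on the question-answer pairs.
trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="qa-bot-finetuned",
                           num_train_epochs=3,
                           per_device_train_batch_size=2,
                           learning_rate=5e-5),
    train_dataset=dataset,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()

# Step 3: evaluate on a held-out validation set before deploying, and iterate
# on hyperparameters and data if generalization is poor.
trainer.save_model("qa-bot-finetuned")
```

If the QA bot instead relies on a hosted model accessed through an API, the same three steps apply, but the training itself would go through that provider's fine-tuning workflow rather than a local training run.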

By implementing these techniques, the RAG model can be optimized to provide more accurate, contextually relevant, and high-quality answers, thereby enhancing the overall performance and user satisfaction of the QA bot.
