0% found this document useful (0 votes)

36 views

AIM307_Retrieval-Augmented-Generation-with-Amazon-Bedrock

Uploaded by

khavan.work

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

36 views

AIM307_Retrieval-Augmented-Generation-with-Amazon-Bedrock

Uploaded by

khavan.work

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 15

© 2023, Amazon Web Services, Inc. or its affiliates.

All rights
reserved.
AIM307

Retrieval Augmented
Generation with Amazon
Bedrock
Rupinder Grewal Clay Elmore
(he/him) (he/him)
Sr. Specialist Solutions Architect Specialist Solutions Architect -
– ML AI/M L
AWS AWS

© 2023, Amazon Web Services, Inc. or its affiliates. All rights

reserved.© 2023, Amazon Web Services, Inc. or its affiliates. All
reserved.
rights reserved.
Agen
da
01 Introduction to Retrieval Augmented Generation
(RAG)
02 Workshop setup

03 Hands-on lab

04 Conclusions and next steps

© 2023, Amazon Web Services, Inc. or its affiliates. All rights

reserved.
Why RAG is a common solution in
generative AI apps

Challenges with RAG as a Components

LLM s solution • Retrieval mechanisms
• LLM training • Provides context to • Model choices
data is LLMs outside of
incomplete training data • Orchestration
frameworks
• Model • Reduces factual
hallucination inaccuracy
• Model • Limited complexity
customization
© 2023, Amazon Web Services, Inc. or its affiliates. All rights
reserved.
How RAG
works
User
input
Tex Use
Prompt
augmentati
Large
language
Respon
r se
t
generati on model

workflo
on Embeddin
gs
Contex
t
w model

Embeddin 0.89
0.17
-0.02 -0.53 0.95 -
0.38

Dat Semant
ingesti
a ic
search
on
workflo Vector Embeddings Document New
w store model store data
© 2023, Amazon Web Services, Inc. or its affiliates. All rights
reserved.
Amazon
Bedrock

Amazon Choose Use as is or Send Receive

an FM customize prompt response
Bedrock
Use the Fine-tune FMs as Use Bedrock API to Receive
Build generative AI playground to needed. Bedrock send your model
applications using experiment with will automatically prompts to the response in
foundation models FMs and select deploy the FM for model your
(FMs) through a the one that inference application
serverless suits your
API service needs

© 2023, Amazon Web Services, Inc. or its affiliates. All rights

reserved.
Titan
Embeddings
Amazon NE
W
Highlig
Titan hts
Embedding • Titan Embeddings offers fast,
s cost-effective, high-performance, and
accurate embeddings in 25
V2.0 languages.

Translates text inputs (words, phrases) into • Optimized for text retrieval tasks,
numerical representations (embeddings). semantic
Comparing embeddings produces more similarity, and clustering.
relevant and contextual responses than word
matching. • Applications of this model includes
semantic search and personalization.

Max tokens: 8,000

Output vectors: 1,536
Language: Multilingual (25 languages)

Model ID:Amazon
© 2023, amazon.titan-embed-text-v1
Web Services, Inc. or its affiliates. All rights
reserved.
Anthropic
Claude
Claud Highlig
e hts
Anthropic offers the Claude family of large • Long context window (100k)
language models purpose-built for allows for large amounts of text
conversations, summarization, Q&A, workflow to be processed at once.
automation, coding, and more. Claude can
also take direction on personality, tone, and • Available in two sizes to help
behavior. choose the right-sized model
based on latency, accuracy, and
cost considerations.
Max tokens: 100,000
Language: English and multiple other • Early customers report that
languages Claude is much less likely to
Model IDs: produce harmful outputs, easier
- anthropic.claude-instant-v1 to converse with, and more
- anthropic.claude-v2 steerable .

© 2023, Amazon Web Services, Inc. or its affiliates. All rights

reserved.
Let’s get
started!

© 2023, Amazon Web Services, Inc. or its affiliates. All rights

reserved.
Workshop Link

http://bit.ly/3T8B
LSi

© 2023, Amazon Web Services, Inc. or its affiliates. All rights

reserved.
Ne

Knowledge
w

base for
RAG
Connect FMs to data
sources, including vector
engine for Amazon
OpenSearch Serverless,
Pinecone, and Redis
Enterprise Cloud

Enable automatic data

source detection

Provide source attribution

1
reserved. 1
Agents enable generative AI
Ne
w

applications to complete tasks

in just a few clicks

1 2 3 4

Select your Provide Select Developer

foundation basic relevant specifies
model instructio data Lambda
ns sources functions
Breaks down and orchestrates tasks
Securely accesses and retrieves company
data Takes action by invoking API calls on
your behalf Provides fully managed
infrastructure support
© 2023, Amazon Web Services, Inc. or its affiliates. All rights
reserved.
Further
considerations
Prompt RAG Vector
engineering orchestration databases

reserved.
Continuing your Amazon Bedrock
journey
BePdrroomckptReAnGgiwnoe BeRdAroGcokrcohdeestsra VKencotworleddagtae
rekrisnhgop mtiopnles bbaassees

reserved.
Thank
Thank you! Please complete the
Please complete the
session
survey in the mobile app
session

you!
Rupinder Grewal Clay Elmore
survey in the mobile app

[email protected] [email protected]
om om

AI HAN: 8 May 2025
100% (1)
AI HAN: 8 May 2025
45 pages
Comparative Analysis of RAG Fine-Tuning and Prompt Engineering in Chatbot Development
No ratings yet
Comparative Analysis of RAG Fine-Tuning and Prompt Engineering in Chatbot Development
4 pages
Laser GRBL Software Instruction
No ratings yet
Laser GRBL Software Instruction
12 pages
Google Sketchup and Sketchup Pro 7 Bible PDF
No ratings yet
Google Sketchup and Sketchup Pro 7 Bible PDF
3 pages
DataStage Best Practices
No ratings yet
DataStage Best Practices
30 pages
Building Blocks of Rag Ebook Final
100% (1)
Building Blocks of Rag Ebook Final
9 pages
RAG Syllabus R&D
No ratings yet
RAG Syllabus R&D
6 pages
Implementing A Retrieval-Augmented Generation System
No ratings yet
Implementing A Retrieval-Augmented Generation System
3 pages
Hybrid Retrieval-Augmented Generation Approach For LLMs Query Response Enhancement
No ratings yet
Hybrid Retrieval-Augmented Generation Approach For LLMs Query Response Enhancement
5 pages
ControlNet For Stable Diffusion
No ratings yet
ControlNet For Stable Diffusion
4 pages
Stable Diffusion
No ratings yet
Stable Diffusion
6 pages
Building a Streamlit Chatbot with LangChain and Llama 3.1_ Exploring LLMs — 3 _ by Abou Zuhayr _ Sep, 2024 _ GoPenAI
No ratings yet
Building a Streamlit Chatbot with LangChain and Llama 3.1_ Exploring LLMs — 3 _ by Abou Zuhayr _ Sep, 2024 _ GoPenAI
15 pages
Y2 Autumn Block 2 SOL Addition and Subtraction
No ratings yet
Y2 Autumn Block 2 SOL Addition and Subtraction
67 pages
Stable Diffusion
No ratings yet
Stable Diffusion
58 pages
Weaviate Advanced RAG Techniques eBook
100% (1)
Weaviate Advanced RAG Techniques eBook
13 pages
RAG_Beyond_Text_Enhancing_Image_Retrieval_in_RAG_Systems
100% (1)
RAG_Beyond_Text_Enhancing_Image_Retrieval_in_RAG_Systems
6 pages
Number Bonds Activities
No ratings yet
Number Bonds Activities
17 pages
Model Context Protocol (MCP)- Landscape- Security Threatsand Future Research Directions
No ratings yet
Model Context Protocol (MCP)- Landscape- Security Threatsand Future Research Directions
20 pages
Newwhitepaper_Embeddings & vector stores
No ratings yet
Newwhitepaper_Embeddings & vector stores
51 pages
Neural Networks and Deep Learning
No ratings yet
Neural Networks and Deep Learning
19 pages
Streamlit PDF Application Setup All Commands in One Single File
No ratings yet
Streamlit PDF Application Setup All Commands in One Single File
8 pages
How Does Stable Diffusion Work
No ratings yet
How Does Stable Diffusion Work
79 pages
Build A Chatgpt For Youtube Videos With Langchain
No ratings yet
Build A Chatgpt For Youtube Videos With Langchain
10 pages
Langchain 101
100% (1)
Langchain 101
4 pages
Knowledge Graphs v Vector Databases and when not to use them!
No ratings yet
Knowledge Graphs v Vector Databases and when not to use them!
3 pages
Number Bond
No ratings yet
Number Bond
28 pages
ARTICLE- Is Agentic RAG Worth the Investment? Agentic RAG Pricing and ROI Breakdown
No ratings yet
ARTICLE- Is Agentic RAG Worth the Investment? Agentic RAG Pricing and ROI Breakdown
1 page
AI Privacy Risks and Mitigations in Large Language Models
No ratings yet
AI Privacy Risks and Mitigations in Large Language Models
102 pages
Machine Learning Crashcourse
No ratings yet
Machine Learning Crashcourse
233 pages
Day 2 Module 2 - Understanding LLMs
No ratings yet
Day 2 Module 2 - Understanding LLMs
14 pages
Retrieval Augmented Generation - Streamlining The Creation of Intelligent Natural Language Processing Models
No ratings yet
Retrieval Augmented Generation - Streamlining The Creation of Intelligent Natural Language Processing Models
8 pages
Marilyn Burns On The Language of Math
No ratings yet
Marilyn Burns On The Language of Math
6 pages
LangChain QuickStart With Llama 2
No ratings yet
LangChain QuickStart With Llama 2
16 pages
Graph RAG
No ratings yet
Graph RAG
7 pages
Intelligent Agents
No ratings yet
Intelligent Agents
42 pages
Langchain Retrieval Augmented Generation White Paper
100% (1)
Langchain Retrieval Augmented Generation White Paper
23 pages
React Developer: Nanodegree Program Syllabus
No ratings yet
React Developer: Nanodegree Program Syllabus
12 pages
64e8c37a3a32b1b85d479988 - AIPromptPlaybook v1
No ratings yet
64e8c37a3a32b1b85d479988 - AIPromptPlaybook v1
28 pages
ChatGPT - An Honest Manual
No ratings yet
ChatGPT - An Honest Manual
35 pages
An Introduction To Vision-Language Modeling: Aishwarya Agrawal Kate Saenko Asli Celikyilmaz Vikas Chandra
No ratings yet
An Introduction To Vision-Language Modeling: Aishwarya Agrawal Kate Saenko Asli Celikyilmaz Vikas Chandra
76 pages
Gen Ai Solutions
No ratings yet
Gen Ai Solutions
14 pages
Creative Genius
No ratings yet
Creative Genius
6 pages
Python Programming-Grade 9
No ratings yet
Python Programming-Grade 9
53 pages
IoT Frameworks, Tools, APIs and Architectures
No ratings yet
IoT Frameworks, Tools, APIs and Architectures
11 pages
LLM - A Introduction To Generative AI
100% (1)
LLM - A Introduction To Generative AI
31 pages
1GitHub - Modelcontextprotocol_python-sdk_ the Official Python SDK for Model Context Protocol Servers and Clients
No ratings yet
1GitHub - Modelcontextprotocol_python-sdk_ the Official Python SDK for Model Context Protocol Servers and Clients
9 pages
Analysis_on_Enhancing_Financial_Decision-making_Through_Prompt_Engineering
No ratings yet
Analysis_on_Enhancing_Financial_Decision-making_Through_Prompt_Engineering
5 pages
Classification Techniques
No ratings yet
Classification Techniques
99 pages
Unit 5. Invertebrates
No ratings yet
Unit 5. Invertebrates
8 pages
Generative Ai Explained
No ratings yet
Generative Ai Explained
28 pages
AI Coding Tools, LLM, ChatGPT, Copilot, Instructor Perspectives
No ratings yet
AI Coding Tools, LLM, ChatGPT, Copilot, Instructor Perspectives
16 pages
AI-ML Edited
100% (1)
AI-ML Edited
12 pages
How Generative Ai Could Revitalize Profitability For Telcos
No ratings yet
How Generative Ai Could Revitalize Profitability For Telcos
11 pages
jbpm3 2 2-Handsontutorial
No ratings yet
jbpm3 2 2-Handsontutorial
99 pages
Stable Diffusion Prompts Article
No ratings yet
Stable Diffusion Prompts Article
13 pages
Generative Ai With Python Harnessing the Power of Machine Learning and Deep Learning to Build Creative and Intelligent Systems
100% (1)
Generative Ai With Python Harnessing the Power of Machine Learning and Deep Learning to Build Creative and Intelligent Systems
239 pages
Artificial Intelligence Applied To Software Testing
No ratings yet
Artificial Intelligence Applied To Software Testing
7 pages
Responsive Web Design Tipsheet: Start Small
No ratings yet
Responsive Web Design Tipsheet: Start Small
3 pages
Essential Python Libraries and Frameworks
No ratings yet
Essential Python Libraries and Frameworks
170 pages
Matthew Lamons - Rahul Kumar - Abhishek Nagaraja Python Deep Learning Projects - Data PDF
No ratings yet
Matthew Lamons - Rahul Kumar - Abhishek Nagaraja Python Deep Learning Projects - Data PDF
130 pages
Patterns of Big Data Forrester
No ratings yet
Patterns of Big Data Forrester
74 pages
Chatgpt for python
No ratings yet
Chatgpt for python
192 pages
Architecture_patterns_for_building_generative_AI_applications
No ratings yet
Architecture_patterns_for_building_generative_AI_applications
29 pages
Mathematical Fundamentals For ML - 1
No ratings yet
Mathematical Fundamentals For ML - 1
1 page
Students' Profile Registration and Login Creation Andapplying For Exam
No ratings yet
Students' Profile Registration and Login Creation Andapplying For Exam
24 pages
Senior Design Program Spring 23
No ratings yet
Senior Design Program Spring 23
32 pages
Aaron's Recorded Messages
No ratings yet
Aaron's Recorded Messages
2 pages
Dana Internet Solutions Guide: Alphasmart, Inc
No ratings yet
Dana Internet Solutions Guide: Alphasmart, Inc
68 pages
GPRS Tunneling Protocol GTP
No ratings yet
GPRS Tunneling Protocol GTP
22 pages
Assignment-1 Theory
No ratings yet
Assignment-1 Theory
3 pages
Unit - 5
No ratings yet
Unit - 5
36 pages
Buyers Guide Enterprise GRC Management Solutions
No ratings yet
Buyers Guide Enterprise GRC Management Solutions
75 pages
TR Bamboo Production NC Ii
100% (1)
TR Bamboo Production NC Ii
101 pages
Amazon Pay Business Script
No ratings yet
Amazon Pay Business Script
2 pages
PROJECT REPORT on Digital Marketing(Akash Sambyal)
No ratings yet
PROJECT REPORT on Digital Marketing(Akash Sambyal)
50 pages
Core 6 Succinctly
No ratings yet
Core 6 Succinctly
102 pages
Draughtsman Mechanical 2nd Year (Volume I of II) TP
No ratings yet
Draughtsman Mechanical 2nd Year (Volume I of II) TP
102 pages
IGuard LM Manual ENG
No ratings yet
IGuard LM Manual ENG
92 pages
Accounts Project
0% (1)
Accounts Project
20 pages
You Are Here: Services Compute OS Kernel Updates
No ratings yet
You Are Here: Services Compute OS Kernel Updates
2 pages
Knowledge Management Process One Page Reference-V2
No ratings yet
Knowledge Management Process One Page Reference-V2
1 page
Version Control Systems
No ratings yet
Version Control Systems
6 pages
Paper_1
No ratings yet
Paper_1
19 pages
Arpan Jain: Linkedin Angellist Github
No ratings yet
Arpan Jain: Linkedin Angellist Github
1 page
01) System Configuration (IPECS-MG)
No ratings yet
01) System Configuration (IPECS-MG)
23 pages
Well Planning Release Notes
No ratings yet
Well Planning Release Notes
21 pages
Configure SAMBA Server
No ratings yet
Configure SAMBA Server
15 pages
DenA2542X100 Monitor AMI Phosphate-II
No ratings yet
DenA2542X100 Monitor AMI Phosphate-II
2 pages
Aiche 40 033
No ratings yet
Aiche 40 033
5 pages