AIM307_Retrieval-Augmented-Generation-with-Amazon-Bedrock
AIM307
Retrieval Augmented Generation with Amazon Bedrock
Rupinder Grewal (he/him)
Sr. Specialist Solutions Architect – ML
AWS
Clay Elmore (he/him)
Specialist Solutions Architect - AI/ML
AWS
03 Hands-on lab
[Diagram: data ingestion workflow. Labels: new data, document store, embeddings model, embeddings, vector store, semantic search, context. Example embedding values: 0.89, 0.17, -0.02, -0.53, 0.95, -0.38]
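The semantic search step in the diagram can be sketched with a toy in-memory vector store. This is an illustration only: the class name `InMemoryVectorStore`, the document texts, and the three-dimensional vectors are made up for the example (the numbers are borrowed from the slide's sample values), and a real workflow would use embeddings from a model and a managed vector store.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_a * norm_b)


class InMemoryVectorStore:
    """Hypothetical stand-in for a vector store: keeps (text, embedding) pairs in a list."""

    def __init__(self):
        self._items = []

    def add(self, text, embedding):
        self._items.append((text, list(embedding)))

    def search(self, query_embedding, k=2):
        """Return the k stored texts most similar to the query, best first."""
        scored = [
            (cosine_similarity(query_embedding, emb), text)
            for text, emb in self._items
        ]
        scored.sort(key=lambda pair: pair[0], reverse=True)
        return scored[:k]


# Toy data: the vectors are illustrative, not real model output.
store = InMemoryVectorStore()
store.add("doc about refunds", [0.89, 0.17, -0.02])
store.add("doc about shipping", [-0.53, 0.95, -0.38])

top_hits = store.search([0.80, 0.20, 0.00], k=1)
```

The texts returned by `search` are what would be passed to the language model as context.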
© 2023, Amazon Web Services, Inc. or its affiliates. All rights
reserved.
Amazon Bedrock
Translates text inputs (words, phrases) into numerical representations (embeddings). Comparing embeddings produces more relevant and contextual responses than word matching.
• Optimized for text retrieval tasks, semantic similarity, and clustering.
• Applications of this model include semantic search and personalization.
Model ID: amazon.titan-embed-text-v1
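A minimal sketch of requesting an embedding for the model ID above through the Bedrock runtime API in boto3. The helper names `build_embedding_request` and `embed_text`, the sample sentence, and the `us-east-1` region are assumptions for illustration; calling `embed_text` needs AWS credentials and access to the model, so only the request builder runs here.

```python
import json

TITAN_EMBED_MODEL_ID = "amazon.titan-embed-text-v1"


def build_embedding_request(text):
    """Build the keyword arguments for invoke_model (hypothetical helper)."""
    return {
        "modelId": TITAN_EMBED_MODEL_ID,
        "body": json.dumps({"inputText": text}),
        "accept": "application/json",
        "contentType": "application/json",
    }


def embed_text(text, region_name="us-east-1"):
    """Call Bedrock and return the embedding as a list of floats.

    Requires boto3, AWS credentials, and model access; the region is an assumption.
    """
    import boto3  # imported lazily so the request builder works without the SDK

    client = boto3.client("bedrock-runtime", region_name=region_name)
    response = client.invoke_model(**build_embedding_request(text))
    payload = json.loads(response["body"].read())
    return payload["embedding"]


# Building the request does not contact AWS.
request = build_embedding_request("What is retrieval augmented generation?")
```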
Anthropic Claude
Claude
Anthropic offers the Claude family of large language models purpose-built for conversations, summarization, Q&A, workflow automation, coding, and more. Claude can also take direction on personality, tone, and behavior.
Max tokens: 100,000
Language: English and multiple other languages
Model IDs:
- anthropic.claude-instant-v1
- anthropic.claude-v2
Highlights
• Long context window (100k) allows for large amounts of text to be processed at once.
• Available in two sizes to help choose the right-sized model based on latency, accuracy, and cost considerations.
• Early customers report that Claude is much less likely to produce harmful outputs, easier to converse with, and more steerable.
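One way the retrieved context and a question might be combined into a prompt for `anthropic.claude-v2` is sketched below. The prompt wording, the helper names, the sample question, and the parameter values (`max_tokens_to_sample=300`, `temperature=0.0`, region `us-east-1`) are assumptions for illustration, not the session's exact lab code. The body follows the Human/Assistant text-completion format that these model IDs accept on Bedrock.

```python
import json

CLAUDE_MODEL_ID = "anthropic.claude-v2"


def build_rag_prompt(question, context_chunks):
    """Wrap retrieved passages and the question in a Human/Assistant prompt."""
    context = "\n\n".join(context_chunks)
    return (
        "\n\nHuman: Use only the context below to answer the question.\n\n"
        f"<context>\n{context}\n</context>\n\n"
        f"Question: {question}"
        "\n\nAssistant:"
    )


def build_claude_request(prompt, max_tokens_to_sample=300):
    """Build the keyword arguments for invoke_model (hypothetical helper)."""
    body = {
        "prompt": prompt,
        "max_tokens_to_sample": max_tokens_to_sample,
        "temperature": 0.0,
    }
    return {
        "modelId": CLAUDE_MODEL_ID,
        "body": json.dumps(body),
        "accept": "application/json",
        "contentType": "application/json",
    }


def answer_with_claude(question, context_chunks, region_name="us-east-1"):
    """Send the prompt to Bedrock; requires boto3, credentials, and model access."""
    import boto3  # imported lazily so the builders work without the SDK

    client = boto3.client("bedrock-runtime", region_name=region_name)
    prompt = build_rag_prompt(question, context_chunks)
    response = client.invoke_model(**build_claude_request(prompt))
    return json.loads(response["body"].read())["completion"]


# Made-up example data; building the prompt does not contact AWS.
prompt = build_rag_prompt(
    "What is the refund window?",
    ["Refunds are accepted within 30 days."],
)
```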
http://bit.ly/3T8BLSi
Knowledge base for RAG
Connect FMs to data sources, including vector engine for Amazon OpenSearch Serverless, Pinecone, and Redis Enterprise Cloud
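Querying a knowledge base could look roughly like the sketch below, which uses the `retrieve_and_generate` operation of the `bedrock-agent-runtime` client in boto3. The knowledge base ID is a placeholder, and the model ARN, region, sample question, and helper names are assumptions; the slides do not show this call.

```python
def build_kb_request(question, knowledge_base_id, model_arn):
    """Build the keyword arguments for retrieve_and_generate (hypothetical helper)."""
    return {
        "input": {"text": question},
        "retrieveAndGenerateConfiguration": {
            "type": "KNOWLEDGE_BASE",
            "knowledgeBaseConfiguration": {
                "knowledgeBaseId": knowledge_base_id,
                "modelArn": model_arn,
            },
        },
    }


def ask_knowledge_base(question, knowledge_base_id, model_arn, region_name="us-east-1"):
    """Retrieve from the knowledge base and generate an answer in one call.

    Requires boto3, AWS credentials, and an existing knowledge base.
    """
    import boto3  # imported lazily so the request builder works without the SDK

    client = boto3.client("bedrock-agent-runtime", region_name=region_name)
    response = client.retrieve_and_generate(
        **build_kb_request(question, knowledge_base_id, model_arn)
    )
    return response["output"]["text"]


# Placeholder identifiers; replace with real values before calling AWS.
kb_request = build_kb_request(
    "What is the refund window?",
    knowledge_base_id="KB_ID_PLACEHOLDER",
    model_arn="arn:aws:bedrock:us-east-1::foundation-model/anthropic.claude-v2",
)
```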
Thank you!
Rupinder Grewal
Clay Elmore
Please complete the session survey in the mobile app