Generative AI Keynote
Generative AI Keynote
Amazon Bedrock
Ganesh Gella
Director of Engineering, Amazon Lex &
Amazon Bedrock Agents, AWS
© 2023, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon Confidential and Trademark.
तोहय 4छवा वारय സുഖമാേണാ
!ర# ఎల' ఉ)*+ర# भवान ् कथमDस सभ कुशल मंगल
थआ
ु ढ़ा के( हाल ऐ আেপানাৰ (কেন
Summarization Summarization
Chatbot Chatbot
Essentials for building a
generative AI application
CG1 G2 P2 G3 P3 G4 P4 G5 G5g P5
NVIDIA Tesla NVIDIA GRID NVIDIA NVIDIA NVIDIA V100 NVIDIA T4 NVIDIA A100 NVIDIA A10G NVIDIA T4G NVIDIA H100
M2050 “Fermi” GK104 “Kepler” K80 Tesla M60 Tensor Core Tensor Core Tensor Core Tensor Core Tensor Core Tensor Core
GPUs GPUs GPUs GPUs GPUs GPUs GPUs GPUs GPUs GPUs
Innovating at the silicon level
AWS AWS
Inferentia2
Claude 2.1
Amazon Amazon
Summarization, copywriting, Open-ended text generation,
and ideal for fine-tuning conversational chat, and RAG support
AI applications
Puppy Dog
Young
Supports semantic search, text retrieval,
Amazon Titan
and clustering
Text Embeddings
Supports a context length of up to 8k
tokens
Translates text into numerical
representations Works with 25+ languages
WITHOUT VECTOR EMBEDDINGS WITH VECTOR EMBEDDINGS
Golf
Golf Shoes
Shoes
GENERALLY AVAILABLE
Amazon Titan Foundation Models
TITAN TEXT TITAN TEXT TITAN TEXT TITAN MULTIMODAL TITAN IMAGE
EMBEDDINGS LITE EXPRESS EMBEDDINGS GENERATOR
Jurassic-2 Ultra Titan Text Embeddings Claude 2 Command + Embed Llama 2 Stable Diffusion XL1.0
Jurassic-2 Mid Titan Multimodal Embeddings Claude 2.1 Cohere Command Light Llama 2 13B
Titan Text Lite Claude Instant Cohere Embed English Llama 2 70B
Titan Text Express Cohere Embed Multilingual
Titan Image Generator
More than are using Amazon Bedrock
Essentials for building a
generative AI application
.NEW.
COST LATENCY
Is Finetuning the only option to make
Large Language Models work with my Data ?
CONVERT
data into embeddings
STORE
integrations with vector databases
Implementing RAG can
RETRIEVE
find relevant results of from vector database
based on user’s query
AUGMENT
add above results along with user’s query to
augment the prompt
GENERATE
instruct LLMs to generate responses based on
contextual data
Knowledge Bases for Automatically converts text documents into embeddings
1 2 3 4
AI Stack
Guardrails Agents Customization capabilities
AMAZON Q is AMAZON Q
Everything you need to accelerate your
Go Create
with AWS