SlideShare a Scribd company logo
1 | © Copyright 2024 Zilliz
1
1 | © Copyright 9/25/23 Zilliz
1 | © Copyright 9/25/23 Zilliz
Speaker
Jiang Chen
Ecosystem & AI Platform
jiang.chen@zilliz.com
@jiangc1010
2 | © Copyright 2024 Zilliz
2
Fantastic RAG Techniques
And Where to Find Them
Jiang Chen @ Zilliz
3 | © Copyright 2024 Zilliz
3
LLMs are great, but …
You still need to battle hallucination
with retriever, just like the Niffler
4 | © Copyright 2024 Zilliz
4
The evolution of AI made the semantic search of
unstructured data possible
Search by Probability
Statistical analyses of common
datasets established the foundation for
processing unstructured data, e.g. NLP,
and image classification
AI Model Breakthrough
The advancements in BERT, ViT, CBT
etc. have revolutionized semantic
analysis across unstructured data
Vectorization
Word2Vec, CNNs, Deep Speech pioneered
unstructured data embeddings, mapping the
words, images, videos into high-dimensional
vectors
5 | © Copyright 2024 Zilliz
5
01 Review of RAG basics
CONTENTS
02 Advanced RAG techniques
RAG in action with Milvus Lite
03
6 | © Copyright 2024 Zilliz
6
01 Review of RAG basics
7 | © Copyright 2024 Zilliz
7
Why RAG?
RAG vs. LLM
- Knowledge of LLM is out-of-date
- LLM can not get your private knowledge
- Hallucinations
- Transparency and interpretability
RAG vs. Fine-tune
- Fine-tune is expensive
- Fine-tune spent much time
- RAG is pluggable
8 | © Copyright 2024 Zilliz
8
9 | © Copyright 2024 Zilliz
9
02 Advanced RAG techniques
10 | © Copyright 2024 Zilliz
10
First thing first
Measure it before you attempts to improve it!
11 | © Copyright 2024 Zilliz
11
Indexing
Query Retrieval Prompt&
Generation
12 | © Copyright 2024 Zilliz
12
Types of RAG Enhancement Techniques
● Divide & Conquer
○ Query Enhancement: better express or process the query intent.
○ Indexing Enhancement: data cleanup, better parser and chunking
○ Retriever Enhancement: more retrievers and hybrid search strategy
○ Generator Enhancement: prompt engineering and more powerful LLM
● Thinking outside the box
○ Agents? Other tools than retriever?
13 | © Copyright 2024 Zilliz
13
Query Enhancement
14 | © Copyright 2024 Zilliz
14
15 | © Copyright 2024 Zilliz
15
16 | © Copyright 2024 Zilliz
16
What are the differences in features
between Milvus and Zilliz Cloud?
Sub query1: What are the features of Milvus?
Sub query2: What are the features of Zilliz Cloud?
17 | © Copyright 2024 Zilliz
17
18 | © Copyright 2024 Zilliz
18
Indexing Enhancement
19 | © Copyright 2024 Zilliz
19
Good dishes come from good ingredients
• Data collection
• Data cleaning
• Parsing & Chunking
• DNN-native data?
20 | © Copyright 2024 Zilliz
20
21 | © Copyright 2024 Zilliz
21
Retriever Enhancement
22 | © Copyright 2024 Zilliz
22
23 | © Copyright 2024 Zilliz
23
24 | © Copyright 2024 Zilliz
24
25 | © Copyright 2024 Zilliz
25
Generator Enhancement
26 | © Copyright 2024 Zilliz
26
27 | © Copyright 2024 Zilliz
27
28 | © Copyright 2024 Zilliz
28
Agents!
29 | © Copyright 2024 Zilliz
29
30 | © Copyright 2024 Zilliz
30
31 | © Copyright 2024 Zilliz
31
32 | © Copyright 2024 Zilliz
32
03 RAG in action with Milvus Lite
33 | © Copyright 2024 Zilliz
33
34 | © Copyright 2024 Zilliz
34
Seamless integration with all popular AI toolkits
35 | © Copyright 2024 Zilliz
35
35 | © Copyright 9/25/23 Zilliz
35 | © Copyright 9/25/23 Zilliz
Simplify and streamline
the conversion of
unstructured data into
state-of-the-art vector
embeddings, using
intuitive UI and Restful
APIs.
Pipelines
Easy. High-quality. Scalable.
Simplify the workflow
for developers, from
converting
unstructured data into
searchable vectors to
retrieving them from
vector databases
Deliver excellence in
every phase of vector
search pipeline
development and
deployment,
regardless of their
expertise
Ensure scalability for
managing large
datasets and
high-throughput
queries, maintaining
high performance with
min. customization or
infra changes
Zilliz Cloud Pipelines
36 | © Copyright 2024 Zilliz
36
T H A N K Y O U
Ad

More Related Content

What's hot (20)

The Twelve-Factor Appで考えるAWSのサービス開発
The Twelve-Factor Appで考えるAWSのサービス開発The Twelve-Factor Appで考えるAWSのサービス開発
The Twelve-Factor Appで考えるAWSのサービス開発
Amazon Web Services Japan
 
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.pdf
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.pdfRetrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.pdf
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.pdf
Po-Chuan Chen
 
Generative models
Generative modelsGenerative models
Generative models
Birger Moell
 
Gen AI Cognizant & AWS event presentation_12 Oct.pdf
Gen AI Cognizant & AWS event presentation_12 Oct.pdfGen AI Cognizant & AWS event presentation_12 Oct.pdf
Gen AI Cognizant & AWS event presentation_12 Oct.pdf
PhilipBasford
 
2024.05.01 RAG 세미나: 자연어처리의 정보검색 기법과 최신 RAG 모델
2024.05.01 RAG 세미나: 자연어처리의 정보검색 기법과 최신 RAG 모델2024.05.01 RAG 세미나: 자연어처리의 정보검색 기법과 최신 RAG 모델
2024.05.01 RAG 세미나: 자연어처리의 정보검색 기법과 최신 RAG 모델
Cheoneum Park
 
Generative AI - Responsible Path Forward.pdf
Generative AI - Responsible Path Forward.pdfGenerative AI - Responsible Path Forward.pdf
Generative AI - Responsible Path Forward.pdf
Saeed Al Dhaheri
 
Using MLOps to Bring ML to Production/The Promise of MLOps
Using MLOps to Bring ML to Production/The Promise of MLOpsUsing MLOps to Bring ML to Production/The Promise of MLOps
Using MLOps to Bring ML to Production/The Promise of MLOps
Weaveworks
 
Natural Language Processing NLP (Transformers)
Natural Language Processing NLP (Transformers)Natural Language Processing NLP (Transformers)
Natural Language Processing NLP (Transformers)
Hichem Felouat
 
Responsible AI
Responsible AIResponsible AI
Responsible AI
Data Con LA
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
Cloudera, Inc.
 
What is MLOps
What is MLOpsWhat is MLOps
What is MLOps
Henrik Skogström
 
UNLEASHING INNOVATION Exploring Generative AI in the Enterprise.pdf
UNLEASHING INNOVATION Exploring Generative AI in the Enterprise.pdfUNLEASHING INNOVATION Exploring Generative AI in the Enterprise.pdf
UNLEASHING INNOVATION Exploring Generative AI in the Enterprise.pdf
Hermes Romero
 
MLOps - The Assembly Line of ML
MLOps - The Assembly Line of MLMLOps - The Assembly Line of ML
MLOps - The Assembly Line of ML
Jordan Birdsell
 
分散トレーシングAWS:X-Rayとの上手い付き合い方
分散トレーシングAWS:X-Rayとの上手い付き合い方分散トレーシングAWS:X-Rayとの上手い付き合い方
分散トレーシングAWS:X-Rayとの上手い付き合い方
Recruit Lifestyle Co., Ltd.
 
Chicago AWS Solutions Architect Mehdy Haghy recaps the new AI/ML releases and...
Chicago AWS Solutions Architect Mehdy Haghy recaps the new AI/ML releases and...Chicago AWS Solutions Architect Mehdy Haghy recaps the new AI/ML releases and...
Chicago AWS Solutions Architect Mehdy Haghy recaps the new AI/ML releases and...
AWS Chicago
 
Building NLP applications with Transformers
Building NLP applications with TransformersBuilding NLP applications with Transformers
Building NLP applications with Transformers
Julien SIMON
 
Real World End to End machine Learning Pipeline
Real World End to End machine Learning PipelineReal World End to End machine Learning Pipeline
Real World End to End machine Learning Pipeline
Srivatsan Srinivasan
 
Vector Similarity Search & Indexing Methods
Vector Similarity Search & Indexing MethodsVector Similarity Search & Indexing Methods
Vector Similarity Search & Indexing Methods
Kate Shao
 
230309_LoRa
230309_LoRa230309_LoRa
230309_LoRa
YongSang Yoo
 
A comprehensive guide to Agentic AI Systems
A comprehensive guide to Agentic AI SystemsA comprehensive guide to Agentic AI Systems
A comprehensive guide to Agentic AI Systems
Debmalya Biswas
 
The Twelve-Factor Appで考えるAWSのサービス開発
The Twelve-Factor Appで考えるAWSのサービス開発The Twelve-Factor Appで考えるAWSのサービス開発
The Twelve-Factor Appで考えるAWSのサービス開発
Amazon Web Services Japan
 
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.pdf
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.pdfRetrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.pdf
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks.pdf
Po-Chuan Chen
 
Gen AI Cognizant & AWS event presentation_12 Oct.pdf
Gen AI Cognizant & AWS event presentation_12 Oct.pdfGen AI Cognizant & AWS event presentation_12 Oct.pdf
Gen AI Cognizant & AWS event presentation_12 Oct.pdf
PhilipBasford
 
2024.05.01 RAG 세미나: 자연어처리의 정보검색 기법과 최신 RAG 모델
2024.05.01 RAG 세미나: 자연어처리의 정보검색 기법과 최신 RAG 모델2024.05.01 RAG 세미나: 자연어처리의 정보검색 기법과 최신 RAG 모델
2024.05.01 RAG 세미나: 자연어처리의 정보검색 기법과 최신 RAG 모델
Cheoneum Park
 
Generative AI - Responsible Path Forward.pdf
Generative AI - Responsible Path Forward.pdfGenerative AI - Responsible Path Forward.pdf
Generative AI - Responsible Path Forward.pdf
Saeed Al Dhaheri
 
Using MLOps to Bring ML to Production/The Promise of MLOps
Using MLOps to Bring ML to Production/The Promise of MLOpsUsing MLOps to Bring ML to Production/The Promise of MLOps
Using MLOps to Bring ML to Production/The Promise of MLOps
Weaveworks
 
Natural Language Processing NLP (Transformers)
Natural Language Processing NLP (Transformers)Natural Language Processing NLP (Transformers)
Natural Language Processing NLP (Transformers)
Hichem Felouat
 
Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18Federated Learning: ML with Privacy on the Edge 11.15.18
Federated Learning: ML with Privacy on the Edge 11.15.18
Cloudera, Inc.
 
UNLEASHING INNOVATION Exploring Generative AI in the Enterprise.pdf
UNLEASHING INNOVATION Exploring Generative AI in the Enterprise.pdfUNLEASHING INNOVATION Exploring Generative AI in the Enterprise.pdf
UNLEASHING INNOVATION Exploring Generative AI in the Enterprise.pdf
Hermes Romero
 
MLOps - The Assembly Line of ML
MLOps - The Assembly Line of MLMLOps - The Assembly Line of ML
MLOps - The Assembly Line of ML
Jordan Birdsell
 
分散トレーシングAWS:X-Rayとの上手い付き合い方
分散トレーシングAWS:X-Rayとの上手い付き合い方分散トレーシングAWS:X-Rayとの上手い付き合い方
分散トレーシングAWS:X-Rayとの上手い付き合い方
Recruit Lifestyle Co., Ltd.
 
Chicago AWS Solutions Architect Mehdy Haghy recaps the new AI/ML releases and...
Chicago AWS Solutions Architect Mehdy Haghy recaps the new AI/ML releases and...Chicago AWS Solutions Architect Mehdy Haghy recaps the new AI/ML releases and...
Chicago AWS Solutions Architect Mehdy Haghy recaps the new AI/ML releases and...
AWS Chicago
 
Building NLP applications with Transformers
Building NLP applications with TransformersBuilding NLP applications with Transformers
Building NLP applications with Transformers
Julien SIMON
 
Real World End to End machine Learning Pipeline
Real World End to End machine Learning PipelineReal World End to End machine Learning Pipeline
Real World End to End machine Learning Pipeline
Srivatsan Srinivasan
 
Vector Similarity Search & Indexing Methods
Vector Similarity Search & Indexing MethodsVector Similarity Search & Indexing Methods
Vector Similarity Search & Indexing Methods
Kate Shao
 
A comprehensive guide to Agentic AI Systems
A comprehensive guide to Agentic AI SystemsA comprehensive guide to Agentic AI Systems
A comprehensive guide to Agentic AI Systems
Debmalya Biswas
 

Similar to Advanced Retrieval Augmented Generation Techniques (20)

2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...
Timothy Spann
 
Hands-on Tutorial: Building an Agent to Reason about Private Data with OpenAI...
Hands-on Tutorial: Building an Agent to Reason about Private Data with OpenAI...Hands-on Tutorial: Building an Agent to Reason about Private Data with OpenAI...
Hands-on Tutorial: Building an Agent to Reason about Private Data with OpenAI...
Zilliz
 
10-25-2024_BITS_NYC_Unstructured Data and LLM_ What, Why and How
10-25-2024_BITS_NYC_Unstructured Data and LLM_ What, Why and How10-25-2024_BITS_NYC_Unstructured Data and LLM_ What, Why and How
10-25-2024_BITS_NYC_Unstructured Data and LLM_ What, Why and How
Timothy Spann
 
What Makes "Deep Research"? A Dive into AI Agents
What Makes "Deep Research"? A Dive into AI AgentsWhat Makes "Deep Research"? A Dive into AI Agents
What Makes "Deep Research"? A Dive into AI Agents
Zilliz
 
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Zilliz
 
2025-04-05 - Block71 Event - The Landscape of GenAI and Ecosystem.pdf
2025-04-05 - Block71 Event - The Landscape of GenAI and Ecosystem.pdf2025-04-05 - Block71 Event - The Landscape of GenAI and Ecosystem.pdf
2025-04-05 - Block71 Event - The Landscape of GenAI and Ecosystem.pdf
Ivan Tang
 
17-October-2024 NYC AI Camp - Step-by-Step RAG 101
17-October-2024 NYC AI Camp - Step-by-Step RAG 10117-October-2024 NYC AI Camp - Step-by-Step RAG 101
17-October-2024 NYC AI Camp - Step-by-Step RAG 101
Timothy Spann
 
08-13-2024 NYC Meetup Unstructured Data Processing From Cloud to Edge (Milvus)
08-13-2024 NYC Meetup Unstructured Data Processing From Cloud to Edge (Milvus)08-13-2024 NYC Meetup Unstructured Data Processing From Cloud to Edge (Milvus)
08-13-2024 NYC Meetup Unstructured Data Processing From Cloud to Edge (Milvus)
Timothy Spann
 
NYC Meetup Unstructured Data Processing From Cloud to Edge (Milvus)
NYC Meetup Unstructured Data Processing From Cloud to Edge (Milvus)NYC Meetup Unstructured Data Processing From Cloud to Edge (Milvus)
NYC Meetup Unstructured Data Processing From Cloud to Edge (Milvus)
Zilliz
 
09-03-2024_UnstructuredDataAndAIDiscussion.pdf
09-03-2024_UnstructuredDataAndAIDiscussion.pdf09-03-2024_UnstructuredDataAndAIDiscussion.pdf
09-03-2024_UnstructuredDataAndAIDiscussion.pdf
Timothy Spann
 
NYCMeetup07-25-2024-Unstructured Data Processing From Cloud to Edge
NYCMeetup07-25-2024-Unstructured Data Processing From Cloud to EdgeNYCMeetup07-25-2024-Unstructured Data Processing From Cloud to Edge
NYCMeetup07-25-2024-Unstructured Data Processing From Cloud to Edge
Timothy Spann
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
Multimodal Embeddings (continued) - South Bay Meetup Slides
Multimodal Embeddings (continued) - South Bay Meetup SlidesMultimodal Embeddings (continued) - South Bay Meetup Slides
Multimodal Embeddings (continued) - South Bay Meetup Slides
Zilliz
 
2025-02-24 - AWS meetup - Zilliz presentation.pdf
2025-02-24 - AWS meetup - Zilliz presentation.pdf2025-02-24 - AWS meetup - Zilliz presentation.pdf
2025-02-24 - AWS meetup - Zilliz presentation.pdf
Ivan Tang
 
Multimodal Retrieval-Augmented Generation (RAG) with Vector Database
Multimodal Retrieval-Augmented Generation (RAG) with Vector DatabaseMultimodal Retrieval-Augmented Generation (RAG) with Vector Database
Multimodal Retrieval-Augmented Generation (RAG) with Vector Database
Zilliz
 
Agentic AI in Action: Real-Time Vision, Memory & Autonomy with Browser Use & ...
Agentic AI in Action: Real-Time Vision, Memory & Autonomy with Browser Use & ...Agentic AI in Action: Real-Time Vision, Memory & Autonomy with Browser Use & ...
Agentic AI in Action: Real-Time Vision, Memory & Autonomy with Browser Use & ...
Zilliz
 
Supercharge Spark: Unleashing Big Data Potential with Milvus for RAG systems
Supercharge Spark: Unleashing Big Data Potential with Milvus for RAG systemsSupercharge Spark: Unleashing Big Data Potential with Milvus for RAG systems
Supercharge Spark: Unleashing Big Data Potential with Milvus for RAG systems
Zilliz
 
09-12-2024 - Milvus, Vector database used for Sensor Data RAG
09-12-2024 - Milvus, Vector database used for Sensor Data RAG09-12-2024 - Milvus, Vector database used for Sensor Data RAG
09-12-2024 - Milvus, Vector database used for Sensor Data RAG
Timothy Spann
 
Introduction to Large Language Model Customization.pdf
Introduction to Large Language Model Customization.pdfIntroduction to Large Language Model Customization.pdf
Introduction to Large Language Model Customization.pdf
Zilliz
 
09-19-2024 AI Camp Hybrid Seach - Milvus for Vector Database
09-19-2024 AI Camp Hybrid Seach - Milvus for Vector Database09-19-2024 AI Camp Hybrid Seach - Milvus for Vector Database
09-19-2024 AI Camp Hybrid Seach - Milvus for Vector Database
Timothy Spann
 
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...
Timothy Spann
 
Hands-on Tutorial: Building an Agent to Reason about Private Data with OpenAI...
Hands-on Tutorial: Building an Agent to Reason about Private Data with OpenAI...Hands-on Tutorial: Building an Agent to Reason about Private Data with OpenAI...
Hands-on Tutorial: Building an Agent to Reason about Private Data with OpenAI...
Zilliz
 
10-25-2024_BITS_NYC_Unstructured Data and LLM_ What, Why and How
10-25-2024_BITS_NYC_Unstructured Data and LLM_ What, Why and How10-25-2024_BITS_NYC_Unstructured Data and LLM_ What, Why and How
10-25-2024_BITS_NYC_Unstructured Data and LLM_ What, Why and How
Timothy Spann
 
What Makes "Deep Research"? A Dive into AI Agents
What Makes "Deep Research"? A Dive into AI AgentsWhat Makes "Deep Research"? A Dive into AI Agents
What Makes "Deep Research"? A Dive into AI Agents
Zilliz
 
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Zilliz
 
2025-04-05 - Block71 Event - The Landscape of GenAI and Ecosystem.pdf
2025-04-05 - Block71 Event - The Landscape of GenAI and Ecosystem.pdf2025-04-05 - Block71 Event - The Landscape of GenAI and Ecosystem.pdf
2025-04-05 - Block71 Event - The Landscape of GenAI and Ecosystem.pdf
Ivan Tang
 
17-October-2024 NYC AI Camp - Step-by-Step RAG 101
17-October-2024 NYC AI Camp - Step-by-Step RAG 10117-October-2024 NYC AI Camp - Step-by-Step RAG 101
17-October-2024 NYC AI Camp - Step-by-Step RAG 101
Timothy Spann
 
08-13-2024 NYC Meetup Unstructured Data Processing From Cloud to Edge (Milvus)
08-13-2024 NYC Meetup Unstructured Data Processing From Cloud to Edge (Milvus)08-13-2024 NYC Meetup Unstructured Data Processing From Cloud to Edge (Milvus)
08-13-2024 NYC Meetup Unstructured Data Processing From Cloud to Edge (Milvus)
Timothy Spann
 
NYC Meetup Unstructured Data Processing From Cloud to Edge (Milvus)
NYC Meetup Unstructured Data Processing From Cloud to Edge (Milvus)NYC Meetup Unstructured Data Processing From Cloud to Edge (Milvus)
NYC Meetup Unstructured Data Processing From Cloud to Edge (Milvus)
Zilliz
 
09-03-2024_UnstructuredDataAndAIDiscussion.pdf
09-03-2024_UnstructuredDataAndAIDiscussion.pdf09-03-2024_UnstructuredDataAndAIDiscussion.pdf
09-03-2024_UnstructuredDataAndAIDiscussion.pdf
Timothy Spann
 
NYCMeetup07-25-2024-Unstructured Data Processing From Cloud to Edge
NYCMeetup07-25-2024-Unstructured Data Processing From Cloud to EdgeNYCMeetup07-25-2024-Unstructured Data Processing From Cloud to Edge
NYCMeetup07-25-2024-Unstructured Data Processing From Cloud to Edge
Timothy Spann
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
Multimodal Embeddings (continued) - South Bay Meetup Slides
Multimodal Embeddings (continued) - South Bay Meetup SlidesMultimodal Embeddings (continued) - South Bay Meetup Slides
Multimodal Embeddings (continued) - South Bay Meetup Slides
Zilliz
 
2025-02-24 - AWS meetup - Zilliz presentation.pdf
2025-02-24 - AWS meetup - Zilliz presentation.pdf2025-02-24 - AWS meetup - Zilliz presentation.pdf
2025-02-24 - AWS meetup - Zilliz presentation.pdf
Ivan Tang
 
Multimodal Retrieval-Augmented Generation (RAG) with Vector Database
Multimodal Retrieval-Augmented Generation (RAG) with Vector DatabaseMultimodal Retrieval-Augmented Generation (RAG) with Vector Database
Multimodal Retrieval-Augmented Generation (RAG) with Vector Database
Zilliz
 
Agentic AI in Action: Real-Time Vision, Memory & Autonomy with Browser Use & ...
Agentic AI in Action: Real-Time Vision, Memory & Autonomy with Browser Use & ...Agentic AI in Action: Real-Time Vision, Memory & Autonomy with Browser Use & ...
Agentic AI in Action: Real-Time Vision, Memory & Autonomy with Browser Use & ...
Zilliz
 
Supercharge Spark: Unleashing Big Data Potential with Milvus for RAG systems
Supercharge Spark: Unleashing Big Data Potential with Milvus for RAG systemsSupercharge Spark: Unleashing Big Data Potential with Milvus for RAG systems
Supercharge Spark: Unleashing Big Data Potential with Milvus for RAG systems
Zilliz
 
09-12-2024 - Milvus, Vector database used for Sensor Data RAG
09-12-2024 - Milvus, Vector database used for Sensor Data RAG09-12-2024 - Milvus, Vector database used for Sensor Data RAG
09-12-2024 - Milvus, Vector database used for Sensor Data RAG
Timothy Spann
 
Introduction to Large Language Model Customization.pdf
Introduction to Large Language Model Customization.pdfIntroduction to Large Language Model Customization.pdf
Introduction to Large Language Model Customization.pdf
Zilliz
 
09-19-2024 AI Camp Hybrid Seach - Milvus for Vector Database
09-19-2024 AI Camp Hybrid Seach - Milvus for Vector Database09-19-2024 AI Camp Hybrid Seach - Milvus for Vector Database
09-19-2024 AI Camp Hybrid Seach - Milvus for Vector Database
Timothy Spann
 
Ad

More from Zilliz (20)

Smarter RAG Pipelines: Scaling Search with Milvus and Feast
Smarter RAG Pipelines: Scaling Search with Milvus and FeastSmarter RAG Pipelines: Scaling Search with Milvus and Feast
Smarter RAG Pipelines: Scaling Search with Milvus and Feast
Zilliz
 
Webinar - Zilliz Cloud Monthly Demo - March 2025
Webinar - Zilliz Cloud Monthly Demo - March 2025Webinar - Zilliz Cloud Monthly Demo - March 2025
Webinar - Zilliz Cloud Monthly Demo - March 2025
Zilliz
 
Combining Lexical and Semantic Search with Milvus 2.5
Combining Lexical and Semantic Search with Milvus 2.5Combining Lexical and Semantic Search with Milvus 2.5
Combining Lexical and Semantic Search with Milvus 2.5
Zilliz
 
Bedrock Data Automation (Preview): Simplifying Unstructured Data Processing
Bedrock Data Automation (Preview): Simplifying Unstructured Data ProcessingBedrock Data Automation (Preview): Simplifying Unstructured Data Processing
Bedrock Data Automation (Preview): Simplifying Unstructured Data Processing
Zilliz
 
Deploying a Multimodal RAG System Using Open Source Milvus, LlamaIndex, and vLLM
Deploying a Multimodal RAG System Using Open Source Milvus, LlamaIndex, and vLLMDeploying a Multimodal RAG System Using Open Source Milvus, LlamaIndex, and vLLM
Deploying a Multimodal RAG System Using Open Source Milvus, LlamaIndex, and vLLM
Zilliz
 
February Product Demo: Discover the Power of Zilliz Cloud
February Product Demo: Discover the Power of Zilliz CloudFebruary Product Demo: Discover the Power of Zilliz Cloud
February Product Demo: Discover the Power of Zilliz Cloud
Zilliz
 
Full Text Search with Milvus 2.5 - UD Meetup Berlin Jan 23
Full Text Search with Milvus 2.5 - UD Meetup Berlin Jan 23Full Text Search with Milvus 2.5 - UD Meetup Berlin Jan 23
Full Text Search with Milvus 2.5 - UD Meetup Berlin Jan 23
Zilliz
 
Building the Next-Gen Apps with Multimodal Retrieval using Twelve Labs & Milvus
Building the Next-Gen Apps with Multimodal Retrieval using Twelve Labs & MilvusBuilding the Next-Gen Apps with Multimodal Retrieval using Twelve Labs & Milvus
Building the Next-Gen Apps with Multimodal Retrieval using Twelve Labs & Milvus
Zilliz
 
Voice-to-Value- LLM-Powered Customer Interaction Analysis.pdf
Voice-to-Value- LLM-Powered Customer Interaction Analysis.pdfVoice-to-Value- LLM-Powered Customer Interaction Analysis.pdf
Voice-to-Value- LLM-Powered Customer Interaction Analysis.pdf
Zilliz
 
Accelerate AI Agents with Multimodal RAG powered by Friendli Endpoints and Mi...
Accelerate AI Agents with Multimodal RAG powered by Friendli Endpoints and Mi...Accelerate AI Agents with Multimodal RAG powered by Friendli Endpoints and Mi...
Accelerate AI Agents with Multimodal RAG powered by Friendli Endpoints and Mi...
Zilliz
 
1 Table = 1000 Words? Foundation Models for Tabular Data
1 Table = 1000 Words? Foundation Models for Tabular Data1 Table = 1000 Words? Foundation Models for Tabular Data
1 Table = 1000 Words? Foundation Models for Tabular Data
Zilliz
 
How Milvus allows you to run Full Text Search
How Milvus allows you to run Full Text SearchHow Milvus allows you to run Full Text Search
How Milvus allows you to run Full Text Search
Zilliz
 
How to Optimize Your Embedding Model Selection and Development through TDA Cl...
How to Optimize Your Embedding Model Selection and Development through TDA Cl...How to Optimize Your Embedding Model Selection and Development through TDA Cl...
How to Optimize Your Embedding Model Selection and Development through TDA Cl...
Zilliz
 
Milvus: Scaling Vector Data Solutions for Gen AI
Milvus: Scaling Vector Data Solutions for Gen AIMilvus: Scaling Vector Data Solutions for Gen AI
Milvus: Scaling Vector Data Solutions for Gen AI
Zilliz
 
Keeping Data Fresh: Mastering Updates in Vector Databases
Keeping Data Fresh: Mastering Updates in Vector DatabasesKeeping Data Fresh: Mastering Updates in Vector Databases
Keeping Data Fresh: Mastering Updates in Vector Databases
Zilliz
 
GraphRAG Agents with Neo4j, Milvus and GPT4
GraphRAG Agents with Neo4j, Milvus and GPT4GraphRAG Agents with Neo4j, Milvus and GPT4
GraphRAG Agents with Neo4j, Milvus and GPT4
Zilliz
 
Using LLM Agents with Llama 3.2, LangGraph and Milvus
Using LLM Agents with Llama 3.2, LangGraph and MilvusUsing LLM Agents with Llama 3.2, LangGraph and Milvus
Using LLM Agents with Llama 3.2, LangGraph and Milvus
Zilliz
 
Milvus 2.5: Full-Text Search, More Powerful Metadata Filtering, and more!
Milvus 2.5: Full-Text Search, More Powerful Metadata Filtering, and more!Milvus 2.5: Full-Text Search, More Powerful Metadata Filtering, and more!
Milvus 2.5: Full-Text Search, More Powerful Metadata Filtering, and more!
Zilliz
 
Vector Databases for Enhanced Classification
Vector Databases for Enhanced ClassificationVector Databases for Enhanced Classification
Vector Databases for Enhanced Classification
Zilliz
 
Building an Accuracy Flywheel for your LLM RAG Apps
Building an Accuracy Flywheel for your LLM RAG AppsBuilding an Accuracy Flywheel for your LLM RAG Apps
Building an Accuracy Flywheel for your LLM RAG Apps
Zilliz
 
Smarter RAG Pipelines: Scaling Search with Milvus and Feast
Smarter RAG Pipelines: Scaling Search with Milvus and FeastSmarter RAG Pipelines: Scaling Search with Milvus and Feast
Smarter RAG Pipelines: Scaling Search with Milvus and Feast
Zilliz
 
Webinar - Zilliz Cloud Monthly Demo - March 2025
Webinar - Zilliz Cloud Monthly Demo - March 2025Webinar - Zilliz Cloud Monthly Demo - March 2025
Webinar - Zilliz Cloud Monthly Demo - March 2025
Zilliz
 
Combining Lexical and Semantic Search with Milvus 2.5
Combining Lexical and Semantic Search with Milvus 2.5Combining Lexical and Semantic Search with Milvus 2.5
Combining Lexical and Semantic Search with Milvus 2.5
Zilliz
 
Bedrock Data Automation (Preview): Simplifying Unstructured Data Processing
Bedrock Data Automation (Preview): Simplifying Unstructured Data ProcessingBedrock Data Automation (Preview): Simplifying Unstructured Data Processing
Bedrock Data Automation (Preview): Simplifying Unstructured Data Processing
Zilliz
 
Deploying a Multimodal RAG System Using Open Source Milvus, LlamaIndex, and vLLM
Deploying a Multimodal RAG System Using Open Source Milvus, LlamaIndex, and vLLMDeploying a Multimodal RAG System Using Open Source Milvus, LlamaIndex, and vLLM
Deploying a Multimodal RAG System Using Open Source Milvus, LlamaIndex, and vLLM
Zilliz
 
February Product Demo: Discover the Power of Zilliz Cloud
February Product Demo: Discover the Power of Zilliz CloudFebruary Product Demo: Discover the Power of Zilliz Cloud
February Product Demo: Discover the Power of Zilliz Cloud
Zilliz
 
Full Text Search with Milvus 2.5 - UD Meetup Berlin Jan 23
Full Text Search with Milvus 2.5 - UD Meetup Berlin Jan 23Full Text Search with Milvus 2.5 - UD Meetup Berlin Jan 23
Full Text Search with Milvus 2.5 - UD Meetup Berlin Jan 23
Zilliz
 
Building the Next-Gen Apps with Multimodal Retrieval using Twelve Labs & Milvus
Building the Next-Gen Apps with Multimodal Retrieval using Twelve Labs & MilvusBuilding the Next-Gen Apps with Multimodal Retrieval using Twelve Labs & Milvus
Building the Next-Gen Apps with Multimodal Retrieval using Twelve Labs & Milvus
Zilliz
 
Voice-to-Value- LLM-Powered Customer Interaction Analysis.pdf
Voice-to-Value- LLM-Powered Customer Interaction Analysis.pdfVoice-to-Value- LLM-Powered Customer Interaction Analysis.pdf
Voice-to-Value- LLM-Powered Customer Interaction Analysis.pdf
Zilliz
 
Accelerate AI Agents with Multimodal RAG powered by Friendli Endpoints and Mi...
Accelerate AI Agents with Multimodal RAG powered by Friendli Endpoints and Mi...Accelerate AI Agents with Multimodal RAG powered by Friendli Endpoints and Mi...
Accelerate AI Agents with Multimodal RAG powered by Friendli Endpoints and Mi...
Zilliz
 
1 Table = 1000 Words? Foundation Models for Tabular Data
1 Table = 1000 Words? Foundation Models for Tabular Data1 Table = 1000 Words? Foundation Models for Tabular Data
1 Table = 1000 Words? Foundation Models for Tabular Data
Zilliz
 
How Milvus allows you to run Full Text Search
How Milvus allows you to run Full Text SearchHow Milvus allows you to run Full Text Search
How Milvus allows you to run Full Text Search
Zilliz
 
How to Optimize Your Embedding Model Selection and Development through TDA Cl...
How to Optimize Your Embedding Model Selection and Development through TDA Cl...How to Optimize Your Embedding Model Selection and Development through TDA Cl...
How to Optimize Your Embedding Model Selection and Development through TDA Cl...
Zilliz
 
Milvus: Scaling Vector Data Solutions for Gen AI
Milvus: Scaling Vector Data Solutions for Gen AIMilvus: Scaling Vector Data Solutions for Gen AI
Milvus: Scaling Vector Data Solutions for Gen AI
Zilliz
 
Keeping Data Fresh: Mastering Updates in Vector Databases
Keeping Data Fresh: Mastering Updates in Vector DatabasesKeeping Data Fresh: Mastering Updates in Vector Databases
Keeping Data Fresh: Mastering Updates in Vector Databases
Zilliz
 
GraphRAG Agents with Neo4j, Milvus and GPT4
GraphRAG Agents with Neo4j, Milvus and GPT4GraphRAG Agents with Neo4j, Milvus and GPT4
GraphRAG Agents with Neo4j, Milvus and GPT4
Zilliz
 
Using LLM Agents with Llama 3.2, LangGraph and Milvus
Using LLM Agents with Llama 3.2, LangGraph and MilvusUsing LLM Agents with Llama 3.2, LangGraph and Milvus
Using LLM Agents with Llama 3.2, LangGraph and Milvus
Zilliz
 
Milvus 2.5: Full-Text Search, More Powerful Metadata Filtering, and more!
Milvus 2.5: Full-Text Search, More Powerful Metadata Filtering, and more!Milvus 2.5: Full-Text Search, More Powerful Metadata Filtering, and more!
Milvus 2.5: Full-Text Search, More Powerful Metadata Filtering, and more!
Zilliz
 
Vector Databases for Enhanced Classification
Vector Databases for Enhanced ClassificationVector Databases for Enhanced Classification
Vector Databases for Enhanced Classification
Zilliz
 
Building an Accuracy Flywheel for your LLM RAG Apps
Building an Accuracy Flywheel for your LLM RAG AppsBuilding an Accuracy Flywheel for your LLM RAG Apps
Building an Accuracy Flywheel for your LLM RAG Apps
Zilliz
 
Ad

Recently uploaded (20)

IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
organizerofv
 
ThousandEyes Partner Innovation Updates for May 2025
ThousandEyes Partner Innovation Updates for May 2025ThousandEyes Partner Innovation Updates for May 2025
ThousandEyes Partner Innovation Updates for May 2025
ThousandEyes
 
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep DiveDesigning Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
ScyllaDB
 
Splunk Security Update | Public Sector Summit Germany 2025
Splunk Security Update | Public Sector Summit Germany 2025Splunk Security Update | Public Sector Summit Germany 2025
Splunk Security Update | Public Sector Summit Germany 2025
Splunk
 
How analogue intelligence complements AI
How analogue intelligence complements AIHow analogue intelligence complements AI
How analogue intelligence complements AI
Paul Rowe
 
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptxIncreasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Anoop Ashok
 
HCL Nomad Web – Best Practices and Managing Multiuser Environments
HCL Nomad Web – Best Practices and Managing Multiuser EnvironmentsHCL Nomad Web – Best Practices and Managing Multiuser Environments
HCL Nomad Web – Best Practices and Managing Multiuser Environments
panagenda
 
DevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptx
DevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptxDevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptx
DevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptx
Justin Reock
 
Electronic_Mail_Attacks-1-35.pdf by xploit
Electronic_Mail_Attacks-1-35.pdf by xploitElectronic_Mail_Attacks-1-35.pdf by xploit
Electronic_Mail_Attacks-1-35.pdf by xploit
niftliyevhuseyn
 
Linux Support for SMARC: How Toradex Empowers Embedded Developers
Linux Support for SMARC: How Toradex Empowers Embedded DevelopersLinux Support for SMARC: How Toradex Empowers Embedded Developers
Linux Support for SMARC: How Toradex Empowers Embedded Developers
Toradex
 
Heap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and DeletionHeap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and Deletion
Jaydeep Kale
 
Technology Trends in 2025: AI and Big Data Analytics
Technology Trends in 2025: AI and Big Data AnalyticsTechnology Trends in 2025: AI and Big Data Analytics
Technology Trends in 2025: AI and Big Data Analytics
InData Labs
 
Dev Dives: Automate and orchestrate your processes with UiPath Maestro
Dev Dives: Automate and orchestrate your processes with UiPath MaestroDev Dives: Automate and orchestrate your processes with UiPath Maestro
Dev Dives: Automate and orchestrate your processes with UiPath Maestro
UiPathCommunity
 
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
SOFTTECHHUB
 
Into The Box Conference Keynote Day 1 (ITB2025)
Into The Box Conference Keynote Day 1 (ITB2025)Into The Box Conference Keynote Day 1 (ITB2025)
Into The Box Conference Keynote Day 1 (ITB2025)
Ortus Solutions, Corp
 
Enhancing ICU Intelligence: How Our Functional Testing Enabled a Healthcare I...
Enhancing ICU Intelligence: How Our Functional Testing Enabled a Healthcare I...Enhancing ICU Intelligence: How Our Functional Testing Enabled a Healthcare I...
Enhancing ICU Intelligence: How Our Functional Testing Enabled a Healthcare I...
Impelsys Inc.
 
Cyber Awareness overview for 2025 month of security
Cyber Awareness overview for 2025 month of securityCyber Awareness overview for 2025 month of security
Cyber Awareness overview for 2025 month of security
riccardosl1
 
Quantum Computing Quick Research Guide by Arthur Morgan
Quantum Computing Quick Research Guide by Arthur MorganQuantum Computing Quick Research Guide by Arthur Morgan
Quantum Computing Quick Research Guide by Arthur Morgan
Arthur Morgan
 
TrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business ConsultingTrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business Consulting
Trs Labs
 
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Aqusag Technologies
 
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
organizerofv
 
ThousandEyes Partner Innovation Updates for May 2025
ThousandEyes Partner Innovation Updates for May 2025ThousandEyes Partner Innovation Updates for May 2025
ThousandEyes Partner Innovation Updates for May 2025
ThousandEyes
 
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep DiveDesigning Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
ScyllaDB
 
Splunk Security Update | Public Sector Summit Germany 2025
Splunk Security Update | Public Sector Summit Germany 2025Splunk Security Update | Public Sector Summit Germany 2025
Splunk Security Update | Public Sector Summit Germany 2025
Splunk
 
How analogue intelligence complements AI
How analogue intelligence complements AIHow analogue intelligence complements AI
How analogue intelligence complements AI
Paul Rowe
 
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptxIncreasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Anoop Ashok
 
HCL Nomad Web – Best Practices and Managing Multiuser Environments
HCL Nomad Web – Best Practices and Managing Multiuser EnvironmentsHCL Nomad Web – Best Practices and Managing Multiuser Environments
HCL Nomad Web – Best Practices and Managing Multiuser Environments
panagenda
 
DevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptx
DevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptxDevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptx
DevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptx
Justin Reock
 
Electronic_Mail_Attacks-1-35.pdf by xploit
Electronic_Mail_Attacks-1-35.pdf by xploitElectronic_Mail_Attacks-1-35.pdf by xploit
Electronic_Mail_Attacks-1-35.pdf by xploit
niftliyevhuseyn
 
Linux Support for SMARC: How Toradex Empowers Embedded Developers
Linux Support for SMARC: How Toradex Empowers Embedded DevelopersLinux Support for SMARC: How Toradex Empowers Embedded Developers
Linux Support for SMARC: How Toradex Empowers Embedded Developers
Toradex
 
Heap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and DeletionHeap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and Deletion
Jaydeep Kale
 
Technology Trends in 2025: AI and Big Data Analytics
Technology Trends in 2025: AI and Big Data AnalyticsTechnology Trends in 2025: AI and Big Data Analytics
Technology Trends in 2025: AI and Big Data Analytics
InData Labs
 
Dev Dives: Automate and orchestrate your processes with UiPath Maestro
Dev Dives: Automate and orchestrate your processes with UiPath MaestroDev Dives: Automate and orchestrate your processes with UiPath Maestro
Dev Dives: Automate and orchestrate your processes with UiPath Maestro
UiPathCommunity
 
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
SOFTTECHHUB
 
Into The Box Conference Keynote Day 1 (ITB2025)
Into The Box Conference Keynote Day 1 (ITB2025)Into The Box Conference Keynote Day 1 (ITB2025)
Into The Box Conference Keynote Day 1 (ITB2025)
Ortus Solutions, Corp
 
Enhancing ICU Intelligence: How Our Functional Testing Enabled a Healthcare I...
Enhancing ICU Intelligence: How Our Functional Testing Enabled a Healthcare I...Enhancing ICU Intelligence: How Our Functional Testing Enabled a Healthcare I...
Enhancing ICU Intelligence: How Our Functional Testing Enabled a Healthcare I...
Impelsys Inc.
 
Cyber Awareness overview for 2025 month of security
Cyber Awareness overview for 2025 month of securityCyber Awareness overview for 2025 month of security
Cyber Awareness overview for 2025 month of security
riccardosl1
 
Quantum Computing Quick Research Guide by Arthur Morgan
Quantum Computing Quick Research Guide by Arthur MorganQuantum Computing Quick Research Guide by Arthur Morgan
Quantum Computing Quick Research Guide by Arthur Morgan
Arthur Morgan
 
TrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business ConsultingTrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business Consulting
Trs Labs
 
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Aqusag Technologies
 

Advanced Retrieval Augmented Generation Techniques

  • 1. 1 | © Copyright 2024 Zilliz 1 1 | © Copyright 9/25/23 Zilliz 1 | © Copyright 9/25/23 Zilliz Speaker Jiang Chen Ecosystem & AI Platform [email protected] @jiangc1010
  • 2. 2 | © Copyright 2024 Zilliz 2 Fantastic RAG Techniques And Where to Find Them Jiang Chen @ Zilliz
  • 3. 3 | © Copyright 2024 Zilliz 3 LLMs are great, but … You still need to battle hallucination with retriever, just like the Niffler
  • 4. 4 | © Copyright 2024 Zilliz 4 The evolution of AI made the semantic search of unstructured data possible Search by Probability Statistical analyses of common datasets established the foundation for processing unstructured data, e.g. NLP, and image classification AI Model Breakthrough The advancements in BERT, ViT, CBT etc. have revolutionized semantic analysis across unstructured data Vectorization Word2Vec, CNNs, Deep Speech pioneered unstructured data embeddings, mapping the words, images, videos into high-dimensional vectors
  • 5. 5 | © Copyright 2024 Zilliz 5 01 Review of RAG basics CONTENTS 02 Advanced RAG techniques RAG in action with Milvus Lite 03
  • 6. 6 | © Copyright 2024 Zilliz 6 01 Review of RAG basics
  • 7. 7 | © Copyright 2024 Zilliz 7 Why RAG? RAG vs. LLM - Knowledge of LLM is out-of-date - LLM can not get your private knowledge - Hallucinations - Transparency and interpretability RAG vs. Fine-tune - Fine-tune is expensive - Fine-tune spent much time - RAG is pluggable
  • 8. 8 | © Copyright 2024 Zilliz 8
  • 9. 9 | © Copyright 2024 Zilliz 9 02 Advanced RAG techniques
  • 10. 10 | © Copyright 2024 Zilliz 10 First thing first Measure it before you attempts to improve it!
  • 11. 11 | © Copyright 2024 Zilliz 11 Indexing Query Retrieval Prompt& Generation
  • 12. 12 | © Copyright 2024 Zilliz 12 Types of RAG Enhancement Techniques ● Divide & Conquer ○ Query Enhancement: better express or process the query intent. ○ Indexing Enhancement: data cleanup, better parser and chunking ○ Retriever Enhancement: more retrievers and hybrid search strategy ○ Generator Enhancement: prompt engineering and more powerful LLM ● Thinking outside the box ○ Agents? Other tools than retriever?
  • 13. 13 | © Copyright 2024 Zilliz 13 Query Enhancement
  • 14. 14 | © Copyright 2024 Zilliz 14
  • 15. 15 | © Copyright 2024 Zilliz 15
  • 16. 16 | © Copyright 2024 Zilliz 16 What are the differences in features between Milvus and Zilliz Cloud? Sub query1: What are the features of Milvus? Sub query2: What are the features of Zilliz Cloud?
  • 17. 17 | © Copyright 2024 Zilliz 17
  • 18. 18 | © Copyright 2024 Zilliz 18 Indexing Enhancement
  • 19. 19 | © Copyright 2024 Zilliz 19 Good dishes come from good ingredients • Data collection • Data cleaning • Parsing & Chunking • DNN-native data?
  • 20. 20 | © Copyright 2024 Zilliz 20
  • 21. 21 | © Copyright 2024 Zilliz 21 Retriever Enhancement
  • 22. 22 | © Copyright 2024 Zilliz 22
  • 23. 23 | © Copyright 2024 Zilliz 23
  • 24. 24 | © Copyright 2024 Zilliz 24
  • 25. 25 | © Copyright 2024 Zilliz 25 Generator Enhancement
  • 26. 26 | © Copyright 2024 Zilliz 26
  • 27. 27 | © Copyright 2024 Zilliz 27
  • 28. 28 | © Copyright 2024 Zilliz 28 Agents!
  • 29. 29 | © Copyright 2024 Zilliz 29
  • 30. 30 | © Copyright 2024 Zilliz 30
  • 31. 31 | © Copyright 2024 Zilliz 31
  • 32. 32 | © Copyright 2024 Zilliz 32 03 RAG in action with Milvus Lite
  • 33. 33 | © Copyright 2024 Zilliz 33
  • 34. 34 | © Copyright 2024 Zilliz 34 Seamless integration with all popular AI toolkits
  • 35. 35 | © Copyright 2024 Zilliz 35 35 | © Copyright 9/25/23 Zilliz 35 | © Copyright 9/25/23 Zilliz Simplify and streamline the conversion of unstructured data into state-of-the-art vector embeddings, using intuitive UI and Restful APIs. Pipelines Easy. High-quality. Scalable. Simplify the workflow for developers, from converting unstructured data into searchable vectors to retrieving them from vector databases Deliver excellence in every phase of vector search pipeline development and deployment, regardless of their expertise Ensure scalability for managing large datasets and high-throughput queries, maintaining high performance with min. customization or infra changes Zilliz Cloud Pipelines
  • 36. 36 | © Copyright 2024 Zilliz 36 T H A N K Y O U