SlideShare a Scribd company logo
4
Seamless integration with all popular AI toolkits
Most read
7
Basic RAG Architecture
Most read
19
Meta Storage
Root Query Data Index
Coordinator Service
Proxy
Proxy
etcd
Log Broker
SDK
Load Balancer
DDL/DCL
DML
NOTIFICATION
CONTROL SIGNAL
Object Storage
Minio / S3 / AzureBlob
Log Snapshot Delta File Index File
Worker Node QUERY DATA DATA
Message Storage
VECTOR
DATABASE
Access Layer
Query Node Data Node Index Node
Milvus Architecture
Most read
Stephen Batifol | Zilliz
Zilliz Webinar, July 11
Using LLM Agents with Llama
3, LangGraph and Milvus
Stephen Batifol
Developer Advocate, Zilliz/ Milvus
stephen.batifol@zilliz.com
linkedin.com/in/stephen-batifol/
@stephenbtl
Speaker
27K+
GitHub
Stars
25M+
Downloads
250+
Contributors
2,600
+
Forks
Milvus is an open-source vector database for GenAI projects. pip install on your
laptop, plug into popular AI dev tools, and push to production with a single line of
code.
Easy Setup
Pip-install to start
coding in a notebook
within seconds.
Reusable Code
Write once, and
deploy with one line
of code into the
production
environment
Integration
Plug into OpenAI,
Langchain,
LlmaIndex, and
many more
Feature-rich
Dense & sparse
embeddings,
filtering, reranking
and beyond
Seamless integration with all popular AI toolkits
| © Copyright 8/16/23 Zilliz
5
RAG
(Retrieval Augmented Generation)
Basic Idea
Use RAG to force the LLM to work with your data
by injecting it via a vector database like Milvus
Basic RAG Architecture
5 lines starter
9 | © Copyright 8/16/23 Zilliz
9 | © Copyright 8/16/23 Zilliz
01 Tech Stack
● Framework for building LLM Applications
● Focus on retrieving data and integrating with LLMs
● Integrations with most AI popular tools
🦜🔗 LangChain
🦜🕸 LangGraph by LangChain
● Build Stateful apps with LLMs and Multi-Agents workflow
● Cycles and Branching
● Human-in-the-Loop
● Persistence
Ollama
● Run LLMs anywhere
● Run Embedding Models
Using LLM Agents with Llama 3, LangGraph and Milvus
14 | © Copyright 8/16/23 Zilliz
14 | © Copyright 8/16/23 Zilliz
02 Agentic RAG
Agentic RAG
✅ Multi-turn
✅ Query / task planning layer
✅ Tool interface for external environment
✅ Reflection
✅ Memory for personalization
● Routing: Adaptive RAG
○ Route Questions to different retrieval approaches
● Fallback: Corrective RAG
○ Fallback to web search if docs are not relevant to query
● Self-Correction: Self-RAG
○ Try to fix answers with hallucinations or don’t address question
General Ideas
17 | © Copyright 8/16/23 Zilliz
17 | © Copyright 8/16/23 Zilliz
03 RAG in action with Milvus Lite
milvus.io
github.com/milvus-io/
@milvusio
@stephenbtl
/in/stephen-batifol
Thank you
Meta Storage
Root Query Data Index
Coordinator Service
Proxy
Proxy
etcd
Log Broker
SDK
Load Balancer
DDL/DCL
DML
NOTIFICATION
CONTROL SIGNAL
Object Storage
Minio / S3 / AzureBlob
Log Snapshot Delta File Index File
Worker Node QUERY DATA DATA
Message Storage
VECTOR
DATABASE
Access Layer
Query Node Data Node Index Node
Milvus Architecture

More Related Content

What's hot (20)

GENERATIVE AI, THE FUTURE OF PRODUCTIVITY
GENERATIVE AI, THE FUTURE OF PRODUCTIVITYGENERATIVE AI, THE FUTURE OF PRODUCTIVITY
GENERATIVE AI, THE FUTURE OF PRODUCTIVITY
Andre Muscat
 
How ChatGPT and AI-assisted coding changes software engineering profoundly
How ChatGPT and AI-assisted coding changes software engineering profoundlyHow ChatGPT and AI-assisted coding changes software engineering profoundly
How ChatGPT and AI-assisted coding changes software engineering profoundly
Pekka Abrahamsson / Tampere University
 
200109-Open AI Chat GPT-4-3.pptx
200109-Open AI Chat GPT-4-3.pptx200109-Open AI Chat GPT-4-3.pptx
200109-Open AI Chat GPT-4-3.pptx
andre241421
 
Generative AI at the edge.pdf
Generative AI at the edge.pdfGenerative AI at the edge.pdf
Generative AI at the edge.pdf
Qualcomm Research
 
Exploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdfExploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdf
Dung Hoang
 
Leveraging Generative AI & Best practices
Leveraging Generative AI & Best practicesLeveraging Generative AI & Best practices
Leveraging Generative AI & Best practices
DianaGray10
 
Simplified Introduction to AI
Simplified Introduction to AISimplified Introduction to AI
Simplified Introduction to AI
Deepu S Nath
 
Generative AI, WiDS 2023.pptx
Generative AI, WiDS 2023.pptxGenerative AI, WiDS 2023.pptx
Generative AI, WiDS 2023.pptx
Colleen Farrelly
 
Andy Roy - Conversational AI - Why We Must Build.pdf
Andy Roy - Conversational AI - Why We Must Build.pdfAndy Roy - Conversational AI - Why We Must Build.pdf
Andy Roy - Conversational AI - Why We Must Build.pdf
SOLTUIONSpeople, THINKubators, THINKathons
 
Generative AI
Generative AIGenerative AI
Generative AI
lutzsuarnaba1
 
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
Mihai Criveti
 
Large Language Models Bootcamp
Large Language Models BootcampLarge Language Models Bootcamp
Large Language Models Bootcamp
Data Science Dojo
 
Stanford AI Report 2023
Stanford AI Report 2023Stanford AI Report 2023
Stanford AI Report 2023
Kapil Khandelwal (KK)
 
Landscape of AI/ML in 2023
Landscape of AI/ML in 2023Landscape of AI/ML in 2023
Landscape of AI/ML in 2023
HyunJoon Jung
 
The Creative Ai storm
The Creative Ai stormThe Creative Ai storm
The Creative Ai storm
Leandro Righini
 
LLMs Bootcamp
LLMs BootcampLLMs Bootcamp
LLMs Bootcamp
Fiza987241
 
Intro to LLMs
Intro to LLMsIntro to LLMs
Intro to LLMs
Loic Merckel
 
The Future of AI is Generative not Discriminative 5/26/2021
The Future of AI is Generative not Discriminative 5/26/2021The Future of AI is Generative not Discriminative 5/26/2021
The Future of AI is Generative not Discriminative 5/26/2021
Steve Omohundro
 
Automate your Job and Business with ChatGPT #3 - Fundamentals of LLM/GPT
Automate your Job and Business with ChatGPT #3 - Fundamentals of LLM/GPTAutomate your Job and Business with ChatGPT #3 - Fundamentals of LLM/GPT
Automate your Job and Business with ChatGPT #3 - Fundamentals of LLM/GPT
Anant Corporation
 
Responsible Generative AI
Responsible Generative AIResponsible Generative AI
Responsible Generative AI
CMassociates
 
GENERATIVE AI, THE FUTURE OF PRODUCTIVITY
GENERATIVE AI, THE FUTURE OF PRODUCTIVITYGENERATIVE AI, THE FUTURE OF PRODUCTIVITY
GENERATIVE AI, THE FUTURE OF PRODUCTIVITY
Andre Muscat
 
How ChatGPT and AI-assisted coding changes software engineering profoundly
How ChatGPT and AI-assisted coding changes software engineering profoundlyHow ChatGPT and AI-assisted coding changes software engineering profoundly
How ChatGPT and AI-assisted coding changes software engineering profoundly
Pekka Abrahamsson / Tampere University
 
200109-Open AI Chat GPT-4-3.pptx
200109-Open AI Chat GPT-4-3.pptx200109-Open AI Chat GPT-4-3.pptx
200109-Open AI Chat GPT-4-3.pptx
andre241421
 
Generative AI at the edge.pdf
Generative AI at the edge.pdfGenerative AI at the edge.pdf
Generative AI at the edge.pdf
Qualcomm Research
 
Exploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdfExploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdf
Dung Hoang
 
Leveraging Generative AI & Best practices
Leveraging Generative AI & Best practicesLeveraging Generative AI & Best practices
Leveraging Generative AI & Best practices
DianaGray10
 
Simplified Introduction to AI
Simplified Introduction to AISimplified Introduction to AI
Simplified Introduction to AI
Deepu S Nath
 
Generative AI, WiDS 2023.pptx
Generative AI, WiDS 2023.pptxGenerative AI, WiDS 2023.pptx
Generative AI, WiDS 2023.pptx
Colleen Farrelly
 
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
Mihai Criveti
 
Large Language Models Bootcamp
Large Language Models BootcampLarge Language Models Bootcamp
Large Language Models Bootcamp
Data Science Dojo
 
Landscape of AI/ML in 2023
Landscape of AI/ML in 2023Landscape of AI/ML in 2023
Landscape of AI/ML in 2023
HyunJoon Jung
 
The Future of AI is Generative not Discriminative 5/26/2021
The Future of AI is Generative not Discriminative 5/26/2021The Future of AI is Generative not Discriminative 5/26/2021
The Future of AI is Generative not Discriminative 5/26/2021
Steve Omohundro
 
Automate your Job and Business with ChatGPT #3 - Fundamentals of LLM/GPT
Automate your Job and Business with ChatGPT #3 - Fundamentals of LLM/GPTAutomate your Job and Business with ChatGPT #3 - Fundamentals of LLM/GPT
Automate your Job and Business with ChatGPT #3 - Fundamentals of LLM/GPT
Anant Corporation
 
Responsible Generative AI
Responsible Generative AIResponsible Generative AI
Responsible Generative AI
CMassociates
 

Similar to Using LLM Agents with Llama 3, LangGraph and Milvus (20)

Using LLM Agents with Llama 3.2, LangGraph and Milvus
Using LLM Agents with Llama 3.2, LangGraph and MilvusUsing LLM Agents with Llama 3.2, LangGraph and Milvus
Using LLM Agents with Llama 3.2, LangGraph and Milvus
Zilliz
 
Building an Agentic RAG locally with Ollama and Milvus
Building an Agentic RAG locally with Ollama and MilvusBuilding an Agentic RAG locally with Ollama and Milvus
Building an Agentic RAG locally with Ollama and Milvus
Zilliz
 
GraphRAG Agents with Neo4j, Milvus and GPT4
GraphRAG Agents with Neo4j, Milvus and GPT4GraphRAG Agents with Neo4j, Milvus and GPT4
GraphRAG Agents with Neo4j, Milvus and GPT4
Zilliz
 
Multi-agent Systems with Mistral AI, Milvus and Llama-agents
Multi-agent Systems with Mistral AI, Milvus and Llama-agentsMulti-agent Systems with Mistral AI, Milvus and Llama-agents
Multi-agent Systems with Mistral AI, Milvus and Llama-agents
Zilliz
 
Multi-agent Systems with Mistral AI, Milvus and Llama-agents
Multi-agent Systems with Mistral AI, Milvus and Llama-agentsMulti-agent Systems with Mistral AI, Milvus and Llama-agents
Multi-agent Systems with Mistral AI, Milvus and Llama-agents
Zilliz
 
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...
Timothy Spann
 
17-October-2024 NYC AI Camp - Step-by-Step RAG 101
17-October-2024 NYC AI Camp - Step-by-Step RAG 10117-October-2024 NYC AI Camp - Step-by-Step RAG 101
17-October-2024 NYC AI Camp - Step-by-Step RAG 101
Timothy Spann
 
11-OCT-2024_AI_101_CryptoOracle_UnstructuredData
11-OCT-2024_AI_101_CryptoOracle_UnstructuredData11-OCT-2024_AI_101_CryptoOracle_UnstructuredData
11-OCT-2024_AI_101_CryptoOracle_UnstructuredData
Timothy Spann
 
2024 Dec 05 - PyData Global - Tutorial Its In The Air Tonight
2024 Dec 05 - PyData Global - Tutorial Its In The Air Tonight2024 Dec 05 - PyData Global - Tutorial Its In The Air Tonight
2024 Dec 05 - PyData Global - Tutorial Its In The Air Tonight
Timothy Spann
 
MultiModal RAG using vLLM and Pixtral - Stephen Batifol
MultiModal RAG using vLLM and Pixtral - Stephen BatifolMultiModal RAG using vLLM and Pixtral - Stephen Batifol
MultiModal RAG using vLLM and Pixtral - Stephen Batifol
Zilliz
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
Zilliz
 
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and OllamaTirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Zilliz
 
Evaluating Retrieval-Augmented Generation - Webinar
Evaluating Retrieval-Augmented Generation - WebinarEvaluating Retrieval-Augmented Generation - Webinar
Evaluating Retrieval-Augmented Generation - Webinar
Zilliz
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Zilliz
 
tspann06-NOV-2024_AI-Alliance_NYC_ intro to Data Prep Kit and Open Source RAG
tspann06-NOV-2024_AI-Alliance_NYC_ intro to Data Prep Kit and Open Source RAGtspann06-NOV-2024_AI-Alliance_NYC_ intro to Data Prep Kit and Open Source RAG
tspann06-NOV-2024_AI-Alliance_NYC_ intro to Data Prep Kit and Open Source RAG
Timothy Spann
 
Fact based Generative AI
Fact based Generative AIFact based Generative AI
Fact based Generative AI
Stefan Weber
 
Chat with your data, privately and locally
Chat with your data, privately and locallyChat with your data, privately and locally
Chat with your data, privately and locally
Zilliz
 
Supercharge Spark: Unleashing Big Data Potential with Milvus for RAG systems
Supercharge Spark: Unleashing Big Data Potential with Milvus for RAG systemsSupercharge Spark: Unleashing Big Data Potential with Milvus for RAG systems
Supercharge Spark: Unleashing Big Data Potential with Milvus for RAG systems
Zilliz
 
06-18-2024-Princeton Meetup-Introduction to Milvus
06-18-2024-Princeton Meetup-Introduction to Milvus06-18-2024-Princeton Meetup-Introduction to Milvus
06-18-2024-Princeton Meetup-Introduction to Milvus
Timothy Spann
 
Using LLM Agents with Llama 3.2, LangGraph and Milvus
Using LLM Agents with Llama 3.2, LangGraph and MilvusUsing LLM Agents with Llama 3.2, LangGraph and Milvus
Using LLM Agents with Llama 3.2, LangGraph and Milvus
Zilliz
 
Building an Agentic RAG locally with Ollama and Milvus
Building an Agentic RAG locally with Ollama and MilvusBuilding an Agentic RAG locally with Ollama and Milvus
Building an Agentic RAG locally with Ollama and Milvus
Zilliz
 
GraphRAG Agents with Neo4j, Milvus and GPT4
GraphRAG Agents with Neo4j, Milvus and GPT4GraphRAG Agents with Neo4j, Milvus and GPT4
GraphRAG Agents with Neo4j, Milvus and GPT4
Zilliz
 
Multi-agent Systems with Mistral AI, Milvus and Llama-agents
Multi-agent Systems with Mistral AI, Milvus and Llama-agentsMulti-agent Systems with Mistral AI, Milvus and Llama-agents
Multi-agent Systems with Mistral AI, Milvus and Llama-agents
Zilliz
 
Multi-agent Systems with Mistral AI, Milvus and Llama-agents
Multi-agent Systems with Mistral AI, Milvus and Llama-agentsMulti-agent Systems with Mistral AI, Milvus and Llama-agents
Multi-agent Systems with Mistral AI, Milvus and Llama-agents
Zilliz
 
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...
Timothy Spann
 
17-October-2024 NYC AI Camp - Step-by-Step RAG 101
17-October-2024 NYC AI Camp - Step-by-Step RAG 10117-October-2024 NYC AI Camp - Step-by-Step RAG 101
17-October-2024 NYC AI Camp - Step-by-Step RAG 101
Timothy Spann
 
11-OCT-2024_AI_101_CryptoOracle_UnstructuredData
11-OCT-2024_AI_101_CryptoOracle_UnstructuredData11-OCT-2024_AI_101_CryptoOracle_UnstructuredData
11-OCT-2024_AI_101_CryptoOracle_UnstructuredData
Timothy Spann
 
2024 Dec 05 - PyData Global - Tutorial Its In The Air Tonight
2024 Dec 05 - PyData Global - Tutorial Its In The Air Tonight2024 Dec 05 - PyData Global - Tutorial Its In The Air Tonight
2024 Dec 05 - PyData Global - Tutorial Its In The Air Tonight
Timothy Spann
 
MultiModal RAG using vLLM and Pixtral - Stephen Batifol
MultiModal RAG using vLLM and Pixtral - Stephen BatifolMultiModal RAG using vLLM and Pixtral - Stephen Batifol
MultiModal RAG using vLLM and Pixtral - Stephen Batifol
Zilliz
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
Zilliz
 
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and OllamaTirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Zilliz
 
Evaluating Retrieval-Augmented Generation - Webinar
Evaluating Retrieval-Augmented Generation - WebinarEvaluating Retrieval-Augmented Generation - Webinar
Evaluating Retrieval-Augmented Generation - Webinar
Zilliz
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...Building RAG with self-deployed Milvus vector database and Snowpark Container...
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Zilliz
 
tspann06-NOV-2024_AI-Alliance_NYC_ intro to Data Prep Kit and Open Source RAG
tspann06-NOV-2024_AI-Alliance_NYC_ intro to Data Prep Kit and Open Source RAGtspann06-NOV-2024_AI-Alliance_NYC_ intro to Data Prep Kit and Open Source RAG
tspann06-NOV-2024_AI-Alliance_NYC_ intro to Data Prep Kit and Open Source RAG
Timothy Spann
 
Fact based Generative AI
Fact based Generative AIFact based Generative AI
Fact based Generative AI
Stefan Weber
 
Chat with your data, privately and locally
Chat with your data, privately and locallyChat with your data, privately and locally
Chat with your data, privately and locally
Zilliz
 
Supercharge Spark: Unleashing Big Data Potential with Milvus for RAG systems
Supercharge Spark: Unleashing Big Data Potential with Milvus for RAG systemsSupercharge Spark: Unleashing Big Data Potential with Milvus for RAG systems
Supercharge Spark: Unleashing Big Data Potential with Milvus for RAG systems
Zilliz
 
06-18-2024-Princeton Meetup-Introduction to Milvus
06-18-2024-Princeton Meetup-Introduction to Milvus06-18-2024-Princeton Meetup-Introduction to Milvus
06-18-2024-Princeton Meetup-Introduction to Milvus
Timothy Spann
 
Ad

More from Zilliz (20)

Zilliz Cloud Monthly Technical Review: May 2025
Zilliz Cloud Monthly Technical Review: May 2025Zilliz Cloud Monthly Technical Review: May 2025
Zilliz Cloud Monthly Technical Review: May 2025
Zilliz
 
Smarter RAG Pipelines: Scaling Search with Milvus and Feast
Smarter RAG Pipelines: Scaling Search with Milvus and FeastSmarter RAG Pipelines: Scaling Search with Milvus and Feast
Smarter RAG Pipelines: Scaling Search with Milvus and Feast
Zilliz
 
Hands-on Tutorial: Building an Agent to Reason about Private Data with OpenAI...
Hands-on Tutorial: Building an Agent to Reason about Private Data with OpenAI...Hands-on Tutorial: Building an Agent to Reason about Private Data with OpenAI...
Hands-on Tutorial: Building an Agent to Reason about Private Data with OpenAI...
Zilliz
 
Agentic AI in Action: Real-Time Vision, Memory & Autonomy with Browser Use & ...
Agentic AI in Action: Real-Time Vision, Memory & Autonomy with Browser Use & ...Agentic AI in Action: Real-Time Vision, Memory & Autonomy with Browser Use & ...
Agentic AI in Action: Real-Time Vision, Memory & Autonomy with Browser Use & ...
Zilliz
 
Webinar - Zilliz Cloud Monthly Demo - March 2025
Webinar - Zilliz Cloud Monthly Demo - March 2025Webinar - Zilliz Cloud Monthly Demo - March 2025
Webinar - Zilliz Cloud Monthly Demo - March 2025
Zilliz
 
What Makes "Deep Research"? A Dive into AI Agents
What Makes "Deep Research"? A Dive into AI AgentsWhat Makes "Deep Research"? A Dive into AI Agents
What Makes "Deep Research"? A Dive into AI Agents
Zilliz
 
Combining Lexical and Semantic Search with Milvus 2.5
Combining Lexical and Semantic Search with Milvus 2.5Combining Lexical and Semantic Search with Milvus 2.5
Combining Lexical and Semantic Search with Milvus 2.5
Zilliz
 
Bedrock Data Automation (Preview): Simplifying Unstructured Data Processing
Bedrock Data Automation (Preview): Simplifying Unstructured Data ProcessingBedrock Data Automation (Preview): Simplifying Unstructured Data Processing
Bedrock Data Automation (Preview): Simplifying Unstructured Data Processing
Zilliz
 
Deploying a Multimodal RAG System Using Open Source Milvus, LlamaIndex, and vLLM
Deploying a Multimodal RAG System Using Open Source Milvus, LlamaIndex, and vLLMDeploying a Multimodal RAG System Using Open Source Milvus, LlamaIndex, and vLLM
Deploying a Multimodal RAG System Using Open Source Milvus, LlamaIndex, and vLLM
Zilliz
 
February Product Demo: Discover the Power of Zilliz Cloud
February Product Demo: Discover the Power of Zilliz CloudFebruary Product Demo: Discover the Power of Zilliz Cloud
February Product Demo: Discover the Power of Zilliz Cloud
Zilliz
 
Full Text Search with Milvus 2.5 - UD Meetup Berlin Jan 23
Full Text Search with Milvus 2.5 - UD Meetup Berlin Jan 23Full Text Search with Milvus 2.5 - UD Meetup Berlin Jan 23
Full Text Search with Milvus 2.5 - UD Meetup Berlin Jan 23
Zilliz
 
Building the Next-Gen Apps with Multimodal Retrieval using Twelve Labs & Milvus
Building the Next-Gen Apps with Multimodal Retrieval using Twelve Labs & MilvusBuilding the Next-Gen Apps with Multimodal Retrieval using Twelve Labs & Milvus
Building the Next-Gen Apps with Multimodal Retrieval using Twelve Labs & Milvus
Zilliz
 
Voice-to-Value- LLM-Powered Customer Interaction Analysis.pdf
Voice-to-Value- LLM-Powered Customer Interaction Analysis.pdfVoice-to-Value- LLM-Powered Customer Interaction Analysis.pdf
Voice-to-Value- LLM-Powered Customer Interaction Analysis.pdf
Zilliz
 
Accelerate AI Agents with Multimodal RAG powered by Friendli Endpoints and Mi...
Accelerate AI Agents with Multimodal RAG powered by Friendli Endpoints and Mi...Accelerate AI Agents with Multimodal RAG powered by Friendli Endpoints and Mi...
Accelerate AI Agents with Multimodal RAG powered by Friendli Endpoints and Mi...
Zilliz
 
1 Table = 1000 Words? Foundation Models for Tabular Data
1 Table = 1000 Words? Foundation Models for Tabular Data1 Table = 1000 Words? Foundation Models for Tabular Data
1 Table = 1000 Words? Foundation Models for Tabular Data
Zilliz
 
How Milvus allows you to run Full Text Search
How Milvus allows you to run Full Text SearchHow Milvus allows you to run Full Text Search
How Milvus allows you to run Full Text Search
Zilliz
 
How to Optimize Your Embedding Model Selection and Development through TDA Cl...
How to Optimize Your Embedding Model Selection and Development through TDA Cl...How to Optimize Your Embedding Model Selection and Development through TDA Cl...
How to Optimize Your Embedding Model Selection and Development through TDA Cl...
Zilliz
 
Milvus: Scaling Vector Data Solutions for Gen AI
Milvus: Scaling Vector Data Solutions for Gen AIMilvus: Scaling Vector Data Solutions for Gen AI
Milvus: Scaling Vector Data Solutions for Gen AI
Zilliz
 
Keeping Data Fresh: Mastering Updates in Vector Databases
Keeping Data Fresh: Mastering Updates in Vector DatabasesKeeping Data Fresh: Mastering Updates in Vector Databases
Keeping Data Fresh: Mastering Updates in Vector Databases
Zilliz
 
Milvus 2.5: Full-Text Search, More Powerful Metadata Filtering, and more!
Milvus 2.5: Full-Text Search, More Powerful Metadata Filtering, and more!Milvus 2.5: Full-Text Search, More Powerful Metadata Filtering, and more!
Milvus 2.5: Full-Text Search, More Powerful Metadata Filtering, and more!
Zilliz
 
Zilliz Cloud Monthly Technical Review: May 2025
Zilliz Cloud Monthly Technical Review: May 2025Zilliz Cloud Monthly Technical Review: May 2025
Zilliz Cloud Monthly Technical Review: May 2025
Zilliz
 
Smarter RAG Pipelines: Scaling Search with Milvus and Feast
Smarter RAG Pipelines: Scaling Search with Milvus and FeastSmarter RAG Pipelines: Scaling Search with Milvus and Feast
Smarter RAG Pipelines: Scaling Search with Milvus and Feast
Zilliz
 
Hands-on Tutorial: Building an Agent to Reason about Private Data with OpenAI...
Hands-on Tutorial: Building an Agent to Reason about Private Data with OpenAI...Hands-on Tutorial: Building an Agent to Reason about Private Data with OpenAI...
Hands-on Tutorial: Building an Agent to Reason about Private Data with OpenAI...
Zilliz
 
Agentic AI in Action: Real-Time Vision, Memory & Autonomy with Browser Use & ...
Agentic AI in Action: Real-Time Vision, Memory & Autonomy with Browser Use & ...Agentic AI in Action: Real-Time Vision, Memory & Autonomy with Browser Use & ...
Agentic AI in Action: Real-Time Vision, Memory & Autonomy with Browser Use & ...
Zilliz
 
Webinar - Zilliz Cloud Monthly Demo - March 2025
Webinar - Zilliz Cloud Monthly Demo - March 2025Webinar - Zilliz Cloud Monthly Demo - March 2025
Webinar - Zilliz Cloud Monthly Demo - March 2025
Zilliz
 
What Makes "Deep Research"? A Dive into AI Agents
What Makes "Deep Research"? A Dive into AI AgentsWhat Makes "Deep Research"? A Dive into AI Agents
What Makes "Deep Research"? A Dive into AI Agents
Zilliz
 
Combining Lexical and Semantic Search with Milvus 2.5
Combining Lexical and Semantic Search with Milvus 2.5Combining Lexical and Semantic Search with Milvus 2.5
Combining Lexical and Semantic Search with Milvus 2.5
Zilliz
 
Bedrock Data Automation (Preview): Simplifying Unstructured Data Processing
Bedrock Data Automation (Preview): Simplifying Unstructured Data ProcessingBedrock Data Automation (Preview): Simplifying Unstructured Data Processing
Bedrock Data Automation (Preview): Simplifying Unstructured Data Processing
Zilliz
 
Deploying a Multimodal RAG System Using Open Source Milvus, LlamaIndex, and vLLM
Deploying a Multimodal RAG System Using Open Source Milvus, LlamaIndex, and vLLMDeploying a Multimodal RAG System Using Open Source Milvus, LlamaIndex, and vLLM
Deploying a Multimodal RAG System Using Open Source Milvus, LlamaIndex, and vLLM
Zilliz
 
February Product Demo: Discover the Power of Zilliz Cloud
February Product Demo: Discover the Power of Zilliz CloudFebruary Product Demo: Discover the Power of Zilliz Cloud
February Product Demo: Discover the Power of Zilliz Cloud
Zilliz
 
Full Text Search with Milvus 2.5 - UD Meetup Berlin Jan 23
Full Text Search with Milvus 2.5 - UD Meetup Berlin Jan 23Full Text Search with Milvus 2.5 - UD Meetup Berlin Jan 23
Full Text Search with Milvus 2.5 - UD Meetup Berlin Jan 23
Zilliz
 
Building the Next-Gen Apps with Multimodal Retrieval using Twelve Labs & Milvus
Building the Next-Gen Apps with Multimodal Retrieval using Twelve Labs & MilvusBuilding the Next-Gen Apps with Multimodal Retrieval using Twelve Labs & Milvus
Building the Next-Gen Apps with Multimodal Retrieval using Twelve Labs & Milvus
Zilliz
 
Voice-to-Value- LLM-Powered Customer Interaction Analysis.pdf
Voice-to-Value- LLM-Powered Customer Interaction Analysis.pdfVoice-to-Value- LLM-Powered Customer Interaction Analysis.pdf
Voice-to-Value- LLM-Powered Customer Interaction Analysis.pdf
Zilliz
 
Accelerate AI Agents with Multimodal RAG powered by Friendli Endpoints and Mi...
Accelerate AI Agents with Multimodal RAG powered by Friendli Endpoints and Mi...Accelerate AI Agents with Multimodal RAG powered by Friendli Endpoints and Mi...
Accelerate AI Agents with Multimodal RAG powered by Friendli Endpoints and Mi...
Zilliz
 
1 Table = 1000 Words? Foundation Models for Tabular Data
1 Table = 1000 Words? Foundation Models for Tabular Data1 Table = 1000 Words? Foundation Models for Tabular Data
1 Table = 1000 Words? Foundation Models for Tabular Data
Zilliz
 
How Milvus allows you to run Full Text Search
How Milvus allows you to run Full Text SearchHow Milvus allows you to run Full Text Search
How Milvus allows you to run Full Text Search
Zilliz
 
How to Optimize Your Embedding Model Selection and Development through TDA Cl...
How to Optimize Your Embedding Model Selection and Development through TDA Cl...How to Optimize Your Embedding Model Selection and Development through TDA Cl...
How to Optimize Your Embedding Model Selection and Development through TDA Cl...
Zilliz
 
Milvus: Scaling Vector Data Solutions for Gen AI
Milvus: Scaling Vector Data Solutions for Gen AIMilvus: Scaling Vector Data Solutions for Gen AI
Milvus: Scaling Vector Data Solutions for Gen AI
Zilliz
 
Keeping Data Fresh: Mastering Updates in Vector Databases
Keeping Data Fresh: Mastering Updates in Vector DatabasesKeeping Data Fresh: Mastering Updates in Vector Databases
Keeping Data Fresh: Mastering Updates in Vector Databases
Zilliz
 
Milvus 2.5: Full-Text Search, More Powerful Metadata Filtering, and more!
Milvus 2.5: Full-Text Search, More Powerful Metadata Filtering, and more!Milvus 2.5: Full-Text Search, More Powerful Metadata Filtering, and more!
Milvus 2.5: Full-Text Search, More Powerful Metadata Filtering, and more!
Zilliz
 
Ad

Recently uploaded (20)

AI Agents in Logistics and Supply Chain Applications Benefits and Implementation
AI Agents in Logistics and Supply Chain Applications Benefits and ImplementationAI Agents in Logistics and Supply Chain Applications Benefits and Implementation
AI Agents in Logistics and Supply Chain Applications Benefits and Implementation
Christine Shepherd
 
Azure vs AWS Which Cloud Platform Is Best for Your Business in 2025
Azure vs AWS  Which Cloud Platform Is Best for Your Business in 2025Azure vs AWS  Which Cloud Platform Is Best for Your Business in 2025
Azure vs AWS Which Cloud Platform Is Best for Your Business in 2025
Infrassist Technologies Pvt. Ltd.
 
Jeremy Millul - A Talented Software Developer
Jeremy Millul - A Talented Software DeveloperJeremy Millul - A Talented Software Developer
Jeremy Millul - A Talented Software Developer
Jeremy Millul
 
cnc-drilling-dowel-inserting-machine-drillteq-d-510-english.pdf
cnc-drilling-dowel-inserting-machine-drillteq-d-510-english.pdfcnc-drilling-dowel-inserting-machine-drillteq-d-510-english.pdf
cnc-drilling-dowel-inserting-machine-drillteq-d-510-english.pdf
AmirStern2
 
Mastering AI Workflows with FME - Peak of Data & AI 2025
Mastering AI Workflows with FME - Peak of Data & AI 2025Mastering AI Workflows with FME - Peak of Data & AI 2025
Mastering AI Workflows with FME - Peak of Data & AI 2025
Safe Software
 
Domino IQ – Was Sie erwartet, erste Schritte und Anwendungsfälle
Domino IQ – Was Sie erwartet, erste Schritte und AnwendungsfälleDomino IQ – Was Sie erwartet, erste Schritte und Anwendungsfälle
Domino IQ – Was Sie erwartet, erste Schritte und Anwendungsfälle
panagenda
 
Down the Rabbit Hole – Solving 5 Training Roadblocks
Down the Rabbit Hole – Solving 5 Training RoadblocksDown the Rabbit Hole – Solving 5 Training Roadblocks
Down the Rabbit Hole – Solving 5 Training Roadblocks
Rustici Software
 
End-to-end Assurance for SD-WAN & SASE with ThousandEyes
End-to-end Assurance for SD-WAN & SASE with ThousandEyesEnd-to-end Assurance for SD-WAN & SASE with ThousandEyes
End-to-end Assurance for SD-WAN & SASE with ThousandEyes
ThousandEyes
 
FME Beyond Data Processing Creating A Dartboard Accuracy App
FME Beyond Data Processing Creating A Dartboard Accuracy AppFME Beyond Data Processing Creating A Dartboard Accuracy App
FME Beyond Data Processing Creating A Dartboard Accuracy App
Safe Software
 
How to Detect Outliers in IBM SPSS Statistics.pptx
How to Detect Outliers in IBM SPSS Statistics.pptxHow to Detect Outliers in IBM SPSS Statistics.pptx
How to Detect Outliers in IBM SPSS Statistics.pptx
Version 1 Analytics
 
Your startup on AWS - How to architect and maintain a Lean and Mean account J...
Your startup on AWS - How to architect and maintain a Lean and Mean account J...Your startup on AWS - How to architect and maintain a Lean and Mean account J...
Your startup on AWS - How to architect and maintain a Lean and Mean account J...
angelo60207
 
Establish Visibility and Manage Risk in the Supply Chain with Anchore SBOM
Establish Visibility and Manage Risk in the Supply Chain with Anchore SBOMEstablish Visibility and Manage Risk in the Supply Chain with Anchore SBOM
Establish Visibility and Manage Risk in the Supply Chain with Anchore SBOM
Anchore
 
Introduction to Typescript - GDG On Campus EUE
Introduction to Typescript - GDG On Campus EUEIntroduction to Typescript - GDG On Campus EUE
Introduction to Typescript - GDG On Campus EUE
Google Developer Group On Campus European Universities in Egypt
 
DevOps in the Modern Era - Thoughtfully Critical Podcast
DevOps in the Modern Era - Thoughtfully Critical PodcastDevOps in the Modern Era - Thoughtfully Critical Podcast
DevOps in the Modern Era - Thoughtfully Critical Podcast
Chris Wahl
 
FCF- Getting Started in Cybersecurity 3.0
FCF- Getting Started in Cybersecurity 3.0FCF- Getting Started in Cybersecurity 3.0
FCF- Getting Started in Cybersecurity 3.0
RodrigoMori7
 
ISOIEC 42005 Revolutionalises AI Impact Assessment.pptx
ISOIEC 42005 Revolutionalises AI Impact Assessment.pptxISOIEC 42005 Revolutionalises AI Impact Assessment.pptx
ISOIEC 42005 Revolutionalises AI Impact Assessment.pptx
AyilurRamnath1
 
Scaling GenAI Inference From Prototype to Production: Real-World Lessons in S...
Scaling GenAI Inference From Prototype to Production: Real-World Lessons in S...Scaling GenAI Inference From Prototype to Production: Real-World Lessons in S...
Scaling GenAI Inference From Prototype to Production: Real-World Lessons in S...
Anish Kumar
 
Boosting MySQL with Vector Search -THE VECTOR SEARCH CONFERENCE 2025 .pdf
Boosting MySQL with Vector Search -THE VECTOR SEARCH CONFERENCE 2025 .pdfBoosting MySQL with Vector Search -THE VECTOR SEARCH CONFERENCE 2025 .pdf
Boosting MySQL with Vector Search -THE VECTOR SEARCH CONFERENCE 2025 .pdf
Alkin Tezuysal
 
Dancing with AI - A Developer's Journey.pptx
Dancing with AI - A Developer's Journey.pptxDancing with AI - A Developer's Journey.pptx
Dancing with AI - A Developer's Journey.pptx
Elliott Richmond
 
Improving Developer Productivity With DORA, SPACE, and DevEx
Improving Developer Productivity With DORA, SPACE, and DevExImproving Developer Productivity With DORA, SPACE, and DevEx
Improving Developer Productivity With DORA, SPACE, and DevEx
Justin Reock
 
AI Agents in Logistics and Supply Chain Applications Benefits and Implementation
AI Agents in Logistics and Supply Chain Applications Benefits and ImplementationAI Agents in Logistics and Supply Chain Applications Benefits and Implementation
AI Agents in Logistics and Supply Chain Applications Benefits and Implementation
Christine Shepherd
 
Azure vs AWS Which Cloud Platform Is Best for Your Business in 2025
Azure vs AWS  Which Cloud Platform Is Best for Your Business in 2025Azure vs AWS  Which Cloud Platform Is Best for Your Business in 2025
Azure vs AWS Which Cloud Platform Is Best for Your Business in 2025
Infrassist Technologies Pvt. Ltd.
 
Jeremy Millul - A Talented Software Developer
Jeremy Millul - A Talented Software DeveloperJeremy Millul - A Talented Software Developer
Jeremy Millul - A Talented Software Developer
Jeremy Millul
 
cnc-drilling-dowel-inserting-machine-drillteq-d-510-english.pdf
cnc-drilling-dowel-inserting-machine-drillteq-d-510-english.pdfcnc-drilling-dowel-inserting-machine-drillteq-d-510-english.pdf
cnc-drilling-dowel-inserting-machine-drillteq-d-510-english.pdf
AmirStern2
 
Mastering AI Workflows with FME - Peak of Data & AI 2025
Mastering AI Workflows with FME - Peak of Data & AI 2025Mastering AI Workflows with FME - Peak of Data & AI 2025
Mastering AI Workflows with FME - Peak of Data & AI 2025
Safe Software
 
Domino IQ – Was Sie erwartet, erste Schritte und Anwendungsfälle
Domino IQ – Was Sie erwartet, erste Schritte und AnwendungsfälleDomino IQ – Was Sie erwartet, erste Schritte und Anwendungsfälle
Domino IQ – Was Sie erwartet, erste Schritte und Anwendungsfälle
panagenda
 
Down the Rabbit Hole – Solving 5 Training Roadblocks
Down the Rabbit Hole – Solving 5 Training RoadblocksDown the Rabbit Hole – Solving 5 Training Roadblocks
Down the Rabbit Hole – Solving 5 Training Roadblocks
Rustici Software
 
End-to-end Assurance for SD-WAN & SASE with ThousandEyes
End-to-end Assurance for SD-WAN & SASE with ThousandEyesEnd-to-end Assurance for SD-WAN & SASE with ThousandEyes
End-to-end Assurance for SD-WAN & SASE with ThousandEyes
ThousandEyes
 
FME Beyond Data Processing Creating A Dartboard Accuracy App
FME Beyond Data Processing Creating A Dartboard Accuracy AppFME Beyond Data Processing Creating A Dartboard Accuracy App
FME Beyond Data Processing Creating A Dartboard Accuracy App
Safe Software
 
How to Detect Outliers in IBM SPSS Statistics.pptx
How to Detect Outliers in IBM SPSS Statistics.pptxHow to Detect Outliers in IBM SPSS Statistics.pptx
How to Detect Outliers in IBM SPSS Statistics.pptx
Version 1 Analytics
 
Your startup on AWS - How to architect and maintain a Lean and Mean account J...
Your startup on AWS - How to architect and maintain a Lean and Mean account J...Your startup on AWS - How to architect and maintain a Lean and Mean account J...
Your startup on AWS - How to architect and maintain a Lean and Mean account J...
angelo60207
 
Establish Visibility and Manage Risk in the Supply Chain with Anchore SBOM
Establish Visibility and Manage Risk in the Supply Chain with Anchore SBOMEstablish Visibility and Manage Risk in the Supply Chain with Anchore SBOM
Establish Visibility and Manage Risk in the Supply Chain with Anchore SBOM
Anchore
 
DevOps in the Modern Era - Thoughtfully Critical Podcast
DevOps in the Modern Era - Thoughtfully Critical PodcastDevOps in the Modern Era - Thoughtfully Critical Podcast
DevOps in the Modern Era - Thoughtfully Critical Podcast
Chris Wahl
 
FCF- Getting Started in Cybersecurity 3.0
FCF- Getting Started in Cybersecurity 3.0FCF- Getting Started in Cybersecurity 3.0
FCF- Getting Started in Cybersecurity 3.0
RodrigoMori7
 
ISOIEC 42005 Revolutionalises AI Impact Assessment.pptx
ISOIEC 42005 Revolutionalises AI Impact Assessment.pptxISOIEC 42005 Revolutionalises AI Impact Assessment.pptx
ISOIEC 42005 Revolutionalises AI Impact Assessment.pptx
AyilurRamnath1
 
Scaling GenAI Inference From Prototype to Production: Real-World Lessons in S...
Scaling GenAI Inference From Prototype to Production: Real-World Lessons in S...Scaling GenAI Inference From Prototype to Production: Real-World Lessons in S...
Scaling GenAI Inference From Prototype to Production: Real-World Lessons in S...
Anish Kumar
 
Boosting MySQL with Vector Search -THE VECTOR SEARCH CONFERENCE 2025 .pdf
Boosting MySQL with Vector Search -THE VECTOR SEARCH CONFERENCE 2025 .pdfBoosting MySQL with Vector Search -THE VECTOR SEARCH CONFERENCE 2025 .pdf
Boosting MySQL with Vector Search -THE VECTOR SEARCH CONFERENCE 2025 .pdf
Alkin Tezuysal
 
Dancing with AI - A Developer's Journey.pptx
Dancing with AI - A Developer's Journey.pptxDancing with AI - A Developer's Journey.pptx
Dancing with AI - A Developer's Journey.pptx
Elliott Richmond
 
Improving Developer Productivity With DORA, SPACE, and DevEx
Improving Developer Productivity With DORA, SPACE, and DevExImproving Developer Productivity With DORA, SPACE, and DevEx
Improving Developer Productivity With DORA, SPACE, and DevEx
Justin Reock
 

Using LLM Agents with Llama 3, LangGraph and Milvus

  • 1. Stephen Batifol | Zilliz Zilliz Webinar, July 11 Using LLM Agents with Llama 3, LangGraph and Milvus
  • 2. Stephen Batifol Developer Advocate, Zilliz/ Milvus [email protected] linkedin.com/in/stephen-batifol/ @stephenbtl Speaker
  • 3. 27K+ GitHub Stars 25M+ Downloads 250+ Contributors 2,600 + Forks Milvus is an open-source vector database for GenAI projects. pip install on your laptop, plug into popular AI dev tools, and push to production with a single line of code. Easy Setup Pip-install to start coding in a notebook within seconds. Reusable Code Write once, and deploy with one line of code into the production environment Integration Plug into OpenAI, Langchain, LlmaIndex, and many more Feature-rich Dense & sparse embeddings, filtering, reranking and beyond
  • 4. Seamless integration with all popular AI toolkits
  • 5. | © Copyright 8/16/23 Zilliz 5 RAG (Retrieval Augmented Generation)
  • 6. Basic Idea Use RAG to force the LLM to work with your data by injecting it via a vector database like Milvus
  • 9. 9 | © Copyright 8/16/23 Zilliz 9 | © Copyright 8/16/23 Zilliz 01 Tech Stack
  • 10. ● Framework for building LLM Applications ● Focus on retrieving data and integrating with LLMs ● Integrations with most AI popular tools 🦜🔗 LangChain
  • 11. 🦜🕸 LangGraph by LangChain ● Build Stateful apps with LLMs and Multi-Agents workflow ● Cycles and Branching ● Human-in-the-Loop ● Persistence
  • 12. Ollama ● Run LLMs anywhere ● Run Embedding Models
  • 14. 14 | © Copyright 8/16/23 Zilliz 14 | © Copyright 8/16/23 Zilliz 02 Agentic RAG
  • 15. Agentic RAG ✅ Multi-turn ✅ Query / task planning layer ✅ Tool interface for external environment ✅ Reflection ✅ Memory for personalization
  • 16. ● Routing: Adaptive RAG ○ Route Questions to different retrieval approaches ● Fallback: Corrective RAG ○ Fallback to web search if docs are not relevant to query ● Self-Correction: Self-RAG ○ Try to fix answers with hallucinations or don’t address question General Ideas
  • 17. 17 | © Copyright 8/16/23 Zilliz 17 | © Copyright 8/16/23 Zilliz 03 RAG in action with Milvus Lite
  • 19. Meta Storage Root Query Data Index Coordinator Service Proxy Proxy etcd Log Broker SDK Load Balancer DDL/DCL DML NOTIFICATION CONTROL SIGNAL Object Storage Minio / S3 / AzureBlob Log Snapshot Delta File Index File Worker Node QUERY DATA DATA Message Storage VECTOR DATABASE Access Layer Query Node Data Node Index Node Milvus Architecture