We will showcase how you can build a RAG using Milvus. Retrieval-augmented generation (RAG) is a technique for enhancing the accuracy and reliability of generative AI models with facts fetched from external sources.
Want to create an inspiring consumer experience with your brand, but need a little inspiration?
This RetailOasis whitepaper is dedicated to those retailers from around the world creating inspiring store experiences, engaging digital environments and immersive brand interactions that create long lasting consumer love.
Rishabh Software is a technology services company that has been providing services to businesses globally for over a decade. They have experience with various Java and J2EE technologies and have completed many architecture-based software solutions. They utilize development methodologies like Waterfall and Scrum and have expertise in domains like BFSI, ecommerce, healthcare, and more. They have experience with various projects including an ecommerce shopping cart, AML compliance solutions, and mobile app development.
LWC Datatable LDV, Christian Knapp & Christian MenzingerCzechDreamin
This document summarizes a presentation about using Lightning Web Components (LWC) to build a datatable to search for products from a large dataset. It discusses challenges faced, such as platform restrictions, performance issues, and type coercion errors when trying to query fields. It provides tips for handling large data volumes, like using offsets, limits, and Apex for sorting. While promising, LWC datables have limitations and require special handling of features like selections and relationship fields. Open source tools can help generate test data to explore solutions before projects launch.
This document summarizes Docker security features as of release 1.12. It discusses key security modules like namespaces, cgroups, capabilities, seccomp, AppArmor/SELinux that provide access control and isolation in Docker containers. It also covers multi-tenant security, image signing, TLS for daemon access, and best practices like using official images and regular updates.
Static Analysis Security Testing for Dummies... and YouKevin Fealey
Most enterprise application security teams have at least one Static Analysis Security Testing (SAST) tool in their tool-belt; but for many, the tool never leaves the belt. SAST tools have gotten a reputation for being slow, error-prone, and difficult to use; and out of the box, many of them are – but with a little more knowledge behind how these tools are designed, a SAST tool can be a valuable part of any security program.
In this talk, we’ll help you understand the strengths and weaknesses of SAST tools by illustrating how they trace your code for vulnerabilities. You’ll see out-of-the-box rules for commercial and open-source SAST tools, and learn how to write custom rules for the widely-used open source SAST tool, PMD. We’ll explain the value of customizing tools for your organization; and you’ll learn how to integrate SAST technologies into your existing build and deployment pipelines. Lastly, we’ll describe many of the common challenges organizations face when deploying a new security tool to security or development teams, as well as some helpful hints to resolve these issues
This presentation discusses Splunk's observability solution and its benefits. It can provide complete business visibility across hybrid landscapes, predict and prevent problems before customers notice through leveraging historic knowledge, and AI-directed troubleshooting. The open telemetry-native approach allows for full control of data collection. Customers saw reduced downtime, improved efficiency, faster innovation, and better customer experiences through using Splunk's observability capabilities. Splunk is a leader in the observability market according to analysts.
Shift Left Security - The What, Why and HowDevOps.com
This document discusses shift-left security, which involves moving security practices earlier into the software development lifecycle to proactively address risks rather than reactively. It notes that only 20% of organizations consistently integrate security early in DevOps processes. Shift-left security is important because traditional security teams cannot keep up with development speeds. The document outlines how to implement shift-left security through automating security practices, using control gates, and learning from production environments. It argues containers help shift security left through their minimal, declarative, and predictable nature which simplifies security requirements and policy automation.
Retrieval Augmented Generation (RAG), is a popular method to use a large language model, a vector database, and some sort of prompt interface to build better chat bots. On the surface, it seems pretty simple to build a RAG app, but when it comes down to implementation, there are many details to hash out. These details include how to: chunk data, work with embeddings, and even how to select and use a vector database.
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...Timothy Spann
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Techniques
Timothy Spann
https://2024.allthingsopen.org/sessions/advanced-retrieval-augmented-generation-rag-techniques
In 2023, we saw many simple retrieval augmented generation (RAG) examples being built. However, most of these examples and frameworks built around them simplified the process too much. Businesses were unable to derive value from their implementations. That’s because there are many other techniques involved in tuning a basic RAG app to work for you. In this talk we will cover three of the techniques you need to understand and leverage to build better RAG: chunking, embedding model choice, and metadata structuring.
17-October-2024 NYC AI Camp - Step-by-Step RAG 101Timothy Spann
17-October-2024 NYC AI Camp - Step-by-Step RAG 101
https://github.com/tspannhw/AIM-BecomingAnAIEngineer
https://github.com/tspannhw/AIM-Ghosts
AIM - Becoming An AI Engineer
Step 1 - Start off local
Download Python (or use your local install)
https://www.python.org/downloads/
python3.11 -m venv yourenv
source yourenv/bin/activate
Create an environment
https://docs.python.org/3/library/venv.html
Use Pip
https://pip.pypa.io/en/stable/installation/
Setup a .env file for environment variables
Download Jupyter Lab
https://jupyter.org/
Run your notebook
jupyter lab --ip="0.0.0.0" --port=8881 --allow-root
Running on a Mac or Linux machine is optimal.
Setup environment variables
source .env
Alternatives
Download Conda
https://docs.conda.io/projects/conda/en/latest/index.html
https://colab.research.google.com/
Other languages: Java, .Net, Go, NodeJS
Other notebooks to try
https://zilliz.com/learn/milvus-notebooks
https://github.com/milvus-io/bootcamp/blob/master/bootcamp/tutorials/quickstart/build_RAG_with_milvus.ipynb
References
Guides
https://zilliz.com/learn
HuggingFace Friend
https://zilliz.com/learn/effortless-ai-workflows-a-beginners-guide-to-hugging-face-and-pymilvus
Milvus
https://zilliz.com/milvus-downloads
https://milvus.io/docs/quickstart.md
LangChain
https://zilliz.com/learn/LangChain
Notebook display
https://ipywidgets.readthedocs.io/en/stable/user_install.html
References
https://medium.com/@zilliz_learn/function-calling-with-ollama-llama-3-2-and-milvus-ac2bc2122538
https://github.com/milvus-io/bootcamp/tree/master/bootcamp/RAG/advanced_rag
https://zilliz.com/learn/Retrieval-Augmented-Generation
https://zilliz.com/blog/scale-search-with-milvus-handle-massive-datasets-with-ease
https://zilliz.com/learn/generative-ai
https://zilliz.com/learn/what-are-binary-vector-embedding
https://zilliz.com/learn/choosing-right-vector-index-for-your-project
Using LLM Agents with Llama 3.2, LangGraph and MilvusZilliz
We explore Agentic RAG (Internet search, check for hallucinations, correct answers). Don’t miss this deep dive into one of the hottest topics in AI today!
Building an Agentic RAG locally with Ollama and MilvusZilliz
With the rise of Open-Source LLMs like Llama, Mistral, Gemma, and more, it has become apparent that LLMs might also be useful even when run locally. In this talk, we will see how to deploy an Agentic Retrieval Augmented Generation (RAG) setup using Ollama, with Milvus as the vector database on your laptop. That way, you can also avoid being Rate Limited by OpenAI like I have been in the past.
Supercharge Spark: Unleashing Big Data Potential with Milvus for RAG systemsZilliz
Apache Spark dominates the big data processing world, but efficient vector similarity search on massive datasets remains a bottleneck. This talk will show how you can seamlessly integrate Milvus with Spark to unlock the true power of vector similarity search.
We'll explore how Milvus integrates with Spark, enabling efficient vector search within Spark workflows. Real-world applications showcasing the combined power of Spark and Milvus in tackling complex similarity search challenges will be presented. Finally, we'll shed light on the significant performance gains achieved through this integration.
Whether you're dealing with recommendation systems, image retrieval, or any other application requiring vector similarity search, this talk will equip you with the knowledge to leverage Spark and Milvus to their maximum potential.
Join us on this exploration of how Spark and Milvus can enhance your big data processing capabilities with fast similarity search even at scale!
LangGraph GraphRAG agent with Llama 3.1 and GPT4o
Let's build an Advanced RAG with a GraphRAG agent that will run a combination of Llama 3.1 and GPT4o, for Llama 3.1 we will use Ollama. The idea is that we use GPT4o for advanced tasks, like generating the Neo4j query and Llama3.1 for the rest.
Multi-agent Systems with Mistral AI, Milvus and Llama-agentsZilliz
With the recent release of Llama Agents, we can now build agents that are async first and run as their own service. During this webinar, Stephen will show you how to build an Agentic RAG System using Llama Agents and Milvus.
09-12-2024 - Milvus, Vector database used for Sensor Data RAGTimothy Spann
09-12-2024 - Milvus Sensor Data RAG
https://voxel51.com/blog/the-computer-vision-interface-for-vector-search/
https://www.linkedin.com/feed/update/urn:li:activity:7233322212370300929/
It’s in the Air Tonight. Sensor Data in RAG– Tim Spann 🥑 at Milvus
#computervision#ai#artificialintelligence#machinevision#machinelearning#datascience
https://voxel51.com/computer-vision-events/ai-machine-learning-computer-vision-meetup-sept-12-2024/
will do a quick overview of the basics of Vector Databases and Milvus and then dive into a practical example of how to use one as part of an application. I will demonstrate how to consume air quality data and ingest it into Milvus as vectors and scalars. We will then use our vector database of Air Quality readings to feed our LLM and get proper answers to Air Quality questions. I will show you how to all the steps to build a RAG application with Milvus, LangChain, Ollama, Python and Air Quality Reports. Preview the demo on Medium.
About the Speaker
Tim Spann is a Principal Developer Advocate for Zilliz and Milvus. He works with Milvus, Generative AI, HuggingFace, Python, Big Data, IoT, and Edge AI. Tim has over twelve years of experience with the IoT, big data, distributed computing, messaging, machine learning and streaming technologies.
Using LLM Agents with Llama 3, LangGraph and MilvusZilliz
RAG systems are talked about in detail, but usually stick to the basics. In this talk, Stephen will show you how to build an Agentic RAG System using Langchain and Milvus.
Retrieval Augmented Generation Evaluation with RagasZilliz
Retrieval Augmented Generation (RAG) enhances chatbots by incorporating custom data in the prompt. Using large language models (LLMs) as judge has gained prominence in modern RAG systems. This talk will demo Ragas, an open-source automation tool for RAG evaluations. Christy will talk about and demo evaluating a RAG pipeline using Milvus and RAG metrics like context F1-score and answer correctness.
Introduction to Open Source RAG and RAG EvaluationZilliz
You’ve heard good data matters in Machine Learning, but does it matter for Generative AI applications? Corporate data often differs significantly from the general Internet data used to train most foundation models. Join me for a demo on building an open source RAG (Retrieval Augmented Generation) stack using Milvus vector database for Retrieval, LangChain, Llama 3 with Ollama, Ragas RAG Eval, and optional Zilliz cloud, OpenAI.
Multi-agent Systems with Mistral AI, Milvus and Llama-agentsZilliz
Agentic systems are on the rise, helping developers create intelligent, autonomous systems. LLMs are becoming more and more capable of following diverse sets of instructions, making them ideal for managing these agents. This advancement opens up numerous possibilities for handling complex tasks with minimal human intervention in so many areas. In this talk, we will see how to build agents using llama-agents. We’ll also explore how combining different LLMs can enable various actions. For simpler tasks, we'll use Mistral Nemo, a smaller and more cost-effective model, and Mistral Large for orchestrating different agents.
Zilliz's presentation in AWS x Apache Doris meetup on 24th Feb 2025 in Singapore.
In this presentation, I shared a live demo on how you can outsource thinking and reasoning to Amazon Nova for generating a research report on any topic.
Code repo for live demo can be found here: https://github.com/zilliztech/deep-searcher
I've also shared about the hard tradeoffs you need to make when choosing vector indexes and lastly my top 5 favourite features of Zilliz Cloud.
06-18-2024-Princeton Meetup-Introduction to MilvusTimothy Spann
06-18-2024-Princeton Meetup-Introduction to Milvus
[email protected]
https://www.linkedin.com/in/timothyspann/
https://x.com/paasdev
https://github.com/tspannhw
https://github.com/milvus-io/milvus
Get Milvused!
https://milvus.io/
Read my Newsletter every week!
https://github.com/tspannhw/FLiPStackWeekly/blob/main/142-17June2024.md
For more cool Unstructured Data, AI and Vector Database videos check out the Milvus vector database videos here
https://www.youtube.com/@MilvusVectorDatabase/videos
Unstructured Data Meetups -
https://www.meetup.com/unstructured-data-meetup-new-york/
https://lu.ma/calendar/manage/cal-VNT79trvj0jS8S7
https://www.meetup.com/pro/unstructureddata/
https://zilliz.com/community/unstructured-data-meetup
https://zilliz.com/event
Twitter/X: https://x.com/milvusio https://x.com/paasdev
LinkedIn: https://www.linkedin.com/company/zilliz/ https://www.linkedin.com/in/timothyspann/
GitHub: https://github.com/milvus-io/milvus https://github.com/tspannhw
Invitation to join Discord: https://discord.com/invite/FjCMmaJng6
Blogs: https://milvusio.medium.com/ https://www.opensourcevectordb.cloud/ https://medium.com/@tspann
Expand LLMs' knowledge by incorporating external data sources into LLMs and your AI applications.
We will discuss the value of RAG architecture at a high level and dive into an example that we can demo. The idea is to first demo without context, giving horrible results. Then we will walk through the steps of building a RAG pipeline that we can use to generate better data.
Milvus: Scaling Vector Data Solutions for Gen AIZilliz
Milvus, an LF AI project, is an open-source vector database built to power Gen AI solutions. 80% of the data in the world is unstructured data, and vector databases are the databases that help you get valuable insights from unstructured data. With this in mind, we built Milvus as a distributed system on top of other open-source solutions, including MinIO and Kafka, to support vector collections that exceed billion-scale. This session will deeply dive into the architecture decisions that make this cloud-native vector database seamlessly scale horizontally, provide users with tunable consistency, orchestrate in-memory and on-disk indexing, and scalable search strategies.
Zilliz Cloud Monthly Technical Review: May 2025Zilliz
About this webinar
Join our monthly demo for a technical overview of Zilliz Cloud, a highly scalable and performant vector database service for AI applications
Topics covered
- Zilliz Cloud's scalable architecture
- Key features of the developer-friendly UI
- Security best practices and data privacy
- Highlights from recent product releases
This webinar is an excellent opportunity for developers to learn about Zilliz Cloud's capabilities and how it can support their AI projects. Register now to join our community and stay up-to-date with the latest vector database technology.
Smarter RAG Pipelines: Scaling Search with Milvus and FeastZilliz
About this webinar
Learn how Milvus and Feast can be used together to scale vector search and easily declare views for retrieval using open source. We’ll demonstrate how to integrate Milvus with Feast to build a customized RAG pipeline.
Topics Covered
- Leverage Feast for dynamic metadata and document storage and retrieval, ensuring that the correct data is always available at inference time
- Learn how to integrate Feast with Milvus to support vector-based retrieval in RAG systems
- Use Milvus for fast, high-dimensional similarity search, enhancing the retrieval phase of your RAG model
More Related Content
Similar to A Beginners Guide to Building a RAG App Using Open Source Milvus (20)
Retrieval Augmented Generation (RAG), is a popular method to use a large language model, a vector database, and some sort of prompt interface to build better chat bots. On the surface, it seems pretty simple to build a RAG app, but when it comes down to implementation, there are many details to hash out. These details include how to: chunk data, work with embeddings, and even how to select and use a vector database.
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...Timothy Spann
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Techniques
Timothy Spann
https://2024.allthingsopen.org/sessions/advanced-retrieval-augmented-generation-rag-techniques
In 2023, we saw many simple retrieval augmented generation (RAG) examples being built. However, most of these examples and frameworks built around them simplified the process too much. Businesses were unable to derive value from their implementations. That’s because there are many other techniques involved in tuning a basic RAG app to work for you. In this talk we will cover three of the techniques you need to understand and leverage to build better RAG: chunking, embedding model choice, and metadata structuring.
17-October-2024 NYC AI Camp - Step-by-Step RAG 101Timothy Spann
17-October-2024 NYC AI Camp - Step-by-Step RAG 101
https://github.com/tspannhw/AIM-BecomingAnAIEngineer
https://github.com/tspannhw/AIM-Ghosts
AIM - Becoming An AI Engineer
Step 1 - Start off local
Download Python (or use your local install)
https://www.python.org/downloads/
python3.11 -m venv yourenv
source yourenv/bin/activate
Create an environment
https://docs.python.org/3/library/venv.html
Use Pip
https://pip.pypa.io/en/stable/installation/
Setup a .env file for environment variables
Download Jupyter Lab
https://jupyter.org/
Run your notebook
jupyter lab --ip="0.0.0.0" --port=8881 --allow-root
Running on a Mac or Linux machine is optimal.
Setup environment variables
source .env
Alternatives
Download Conda
https://docs.conda.io/projects/conda/en/latest/index.html
https://colab.research.google.com/
Other languages: Java, .Net, Go, NodeJS
Other notebooks to try
https://zilliz.com/learn/milvus-notebooks
https://github.com/milvus-io/bootcamp/blob/master/bootcamp/tutorials/quickstart/build_RAG_with_milvus.ipynb
References
Guides
https://zilliz.com/learn
HuggingFace Friend
https://zilliz.com/learn/effortless-ai-workflows-a-beginners-guide-to-hugging-face-and-pymilvus
Milvus
https://zilliz.com/milvus-downloads
https://milvus.io/docs/quickstart.md
LangChain
https://zilliz.com/learn/LangChain
Notebook display
https://ipywidgets.readthedocs.io/en/stable/user_install.html
References
https://medium.com/@zilliz_learn/function-calling-with-ollama-llama-3-2-and-milvus-ac2bc2122538
https://github.com/milvus-io/bootcamp/tree/master/bootcamp/RAG/advanced_rag
https://zilliz.com/learn/Retrieval-Augmented-Generation
https://zilliz.com/blog/scale-search-with-milvus-handle-massive-datasets-with-ease
https://zilliz.com/learn/generative-ai
https://zilliz.com/learn/what-are-binary-vector-embedding
https://zilliz.com/learn/choosing-right-vector-index-for-your-project
Using LLM Agents with Llama 3.2, LangGraph and MilvusZilliz
We explore Agentic RAG (Internet search, check for hallucinations, correct answers). Don’t miss this deep dive into one of the hottest topics in AI today!
Building an Agentic RAG locally with Ollama and MilvusZilliz
With the rise of Open-Source LLMs like Llama, Mistral, Gemma, and more, it has become apparent that LLMs might also be useful even when run locally. In this talk, we will see how to deploy an Agentic Retrieval Augmented Generation (RAG) setup using Ollama, with Milvus as the vector database on your laptop. That way, you can also avoid being Rate Limited by OpenAI like I have been in the past.
Supercharge Spark: Unleashing Big Data Potential with Milvus for RAG systemsZilliz
Apache Spark dominates the big data processing world, but efficient vector similarity search on massive datasets remains a bottleneck. This talk will show how you can seamlessly integrate Milvus with Spark to unlock the true power of vector similarity search.
We'll explore how Milvus integrates with Spark, enabling efficient vector search within Spark workflows. Real-world applications showcasing the combined power of Spark and Milvus in tackling complex similarity search challenges will be presented. Finally, we'll shed light on the significant performance gains achieved through this integration.
Whether you're dealing with recommendation systems, image retrieval, or any other application requiring vector similarity search, this talk will equip you with the knowledge to leverage Spark and Milvus to their maximum potential.
Join us on this exploration of how Spark and Milvus can enhance your big data processing capabilities with fast similarity search even at scale!
LangGraph GraphRAG agent with Llama 3.1 and GPT4o
Let's build an Advanced RAG with a GraphRAG agent that will run a combination of Llama 3.1 and GPT4o, for Llama 3.1 we will use Ollama. The idea is that we use GPT4o for advanced tasks, like generating the Neo4j query and Llama3.1 for the rest.
Multi-agent Systems with Mistral AI, Milvus and Llama-agentsZilliz
With the recent release of Llama Agents, we can now build agents that are async first and run as their own service. During this webinar, Stephen will show you how to build an Agentic RAG System using Llama Agents and Milvus.
09-12-2024 - Milvus, Vector database used for Sensor Data RAGTimothy Spann
09-12-2024 - Milvus Sensor Data RAG
https://voxel51.com/blog/the-computer-vision-interface-for-vector-search/
https://www.linkedin.com/feed/update/urn:li:activity:7233322212370300929/
It’s in the Air Tonight. Sensor Data in RAG– Tim Spann 🥑 at Milvus
#computervision#ai#artificialintelligence#machinevision#machinelearning#datascience
https://voxel51.com/computer-vision-events/ai-machine-learning-computer-vision-meetup-sept-12-2024/
will do a quick overview of the basics of Vector Databases and Milvus and then dive into a practical example of how to use one as part of an application. I will demonstrate how to consume air quality data and ingest it into Milvus as vectors and scalars. We will then use our vector database of Air Quality readings to feed our LLM and get proper answers to Air Quality questions. I will show you how to all the steps to build a RAG application with Milvus, LangChain, Ollama, Python and Air Quality Reports. Preview the demo on Medium.
About the Speaker
Tim Spann is a Principal Developer Advocate for Zilliz and Milvus. He works with Milvus, Generative AI, HuggingFace, Python, Big Data, IoT, and Edge AI. Tim has over twelve years of experience with the IoT, big data, distributed computing, messaging, machine learning and streaming technologies.
Using LLM Agents with Llama 3, LangGraph and MilvusZilliz
RAG systems are talked about in detail, but usually stick to the basics. In this talk, Stephen will show you how to build an Agentic RAG System using Langchain and Milvus.
Retrieval Augmented Generation Evaluation with RagasZilliz
Retrieval Augmented Generation (RAG) enhances chatbots by incorporating custom data in the prompt. Using large language models (LLMs) as judge has gained prominence in modern RAG systems. This talk will demo Ragas, an open-source automation tool for RAG evaluations. Christy will talk about and demo evaluating a RAG pipeline using Milvus and RAG metrics like context F1-score and answer correctness.
Introduction to Open Source RAG and RAG EvaluationZilliz
You’ve heard good data matters in Machine Learning, but does it matter for Generative AI applications? Corporate data often differs significantly from the general Internet data used to train most foundation models. Join me for a demo on building an open source RAG (Retrieval Augmented Generation) stack using Milvus vector database for Retrieval, LangChain, Llama 3 with Ollama, Ragas RAG Eval, and optional Zilliz cloud, OpenAI.
Multi-agent Systems with Mistral AI, Milvus and Llama-agentsZilliz
Agentic systems are on the rise, helping developers create intelligent, autonomous systems. LLMs are becoming more and more capable of following diverse sets of instructions, making them ideal for managing these agents. This advancement opens up numerous possibilities for handling complex tasks with minimal human intervention in so many areas. In this talk, we will see how to build agents using llama-agents. We’ll also explore how combining different LLMs can enable various actions. For simpler tasks, we'll use Mistral Nemo, a smaller and more cost-effective model, and Mistral Large for orchestrating different agents.
Zilliz's presentation in AWS x Apache Doris meetup on 24th Feb 2025 in Singapore.
In this presentation, I shared a live demo on how you can outsource thinking and reasoning to Amazon Nova for generating a research report on any topic.
Code repo for live demo can be found here: https://github.com/zilliztech/deep-searcher
I've also shared about the hard tradeoffs you need to make when choosing vector indexes and lastly my top 5 favourite features of Zilliz Cloud.
06-18-2024-Princeton Meetup-Introduction to MilvusTimothy Spann
06-18-2024-Princeton Meetup-Introduction to Milvus
[email protected]
https://www.linkedin.com/in/timothyspann/
https://x.com/paasdev
https://github.com/tspannhw
https://github.com/milvus-io/milvus
Get Milvused!
https://milvus.io/
Read my Newsletter every week!
https://github.com/tspannhw/FLiPStackWeekly/blob/main/142-17June2024.md
For more cool Unstructured Data, AI and Vector Database videos check out the Milvus vector database videos here
https://www.youtube.com/@MilvusVectorDatabase/videos
Unstructured Data Meetups -
https://www.meetup.com/unstructured-data-meetup-new-york/
https://lu.ma/calendar/manage/cal-VNT79trvj0jS8S7
https://www.meetup.com/pro/unstructureddata/
https://zilliz.com/community/unstructured-data-meetup
https://zilliz.com/event
Twitter/X: https://x.com/milvusio https://x.com/paasdev
LinkedIn: https://www.linkedin.com/company/zilliz/ https://www.linkedin.com/in/timothyspann/
GitHub: https://github.com/milvus-io/milvus https://github.com/tspannhw
Invitation to join Discord: https://discord.com/invite/FjCMmaJng6
Blogs: https://milvusio.medium.com/ https://www.opensourcevectordb.cloud/ https://medium.com/@tspann
Expand LLMs' knowledge by incorporating external data sources into LLMs and your AI applications.
We will discuss the value of RAG architecture at a high level and dive into an example that we can demo. The idea is to first demo without context, giving horrible results. Then we will walk through the steps of building a RAG pipeline that we can use to generate better data.
Milvus: Scaling Vector Data Solutions for Gen AIZilliz
Milvus, an LF AI project, is an open-source vector database built to power Gen AI solutions. 80% of the data in the world is unstructured data, and vector databases are the databases that help you get valuable insights from unstructured data. With this in mind, we built Milvus as a distributed system on top of other open-source solutions, including MinIO and Kafka, to support vector collections that exceed billion-scale. This session will deeply dive into the architecture decisions that make this cloud-native vector database seamlessly scale horizontally, provide users with tunable consistency, orchestrate in-memory and on-disk indexing, and scalable search strategies.
Zilliz Cloud Monthly Technical Review: May 2025Zilliz
About this webinar
Join our monthly demo for a technical overview of Zilliz Cloud, a highly scalable and performant vector database service for AI applications
Topics covered
- Zilliz Cloud's scalable architecture
- Key features of the developer-friendly UI
- Security best practices and data privacy
- Highlights from recent product releases
This webinar is an excellent opportunity for developers to learn about Zilliz Cloud's capabilities and how it can support their AI projects. Register now to join our community and stay up-to-date with the latest vector database technology.
Smarter RAG Pipelines: Scaling Search with Milvus and FeastZilliz
About this webinar
Learn how Milvus and Feast can be used together to scale vector search and easily declare views for retrieval using open source. We’ll demonstrate how to integrate Milvus with Feast to build a customized RAG pipeline.
Topics Covered
- Leverage Feast for dynamic metadata and document storage and retrieval, ensuring that the correct data is always available at inference time
- Learn how to integrate Feast with Milvus to support vector-based retrieval in RAG systems
- Use Milvus for fast, high-dimensional similarity search, enhancing the retrieval phase of your RAG model
Hands-on Tutorial: Building an Agent to Reason about Private Data with OpenAI...Zilliz
In this tutorial, we build an agent from scratch to reason over the Milvus documentation and Discord server history. We demonstrate fundamental agentic concepts such as long-term memory, tool use, reflection, conditional execution flow, and reasoning models. Our agent’s design is informed by recent open-source attempts to reproduce Deep Research.
Agentic AI in Action: Real-Time Vision, Memory & Autonomy with Browser Use & ...Zilliz
About this webinar
Discover how to integrate Vision Language Models with Browser Use and Milvus to create an agentic system capable of real-time visual and textual analysis. Ideal for developers who want to learn how to use Agents that can see, take action, and remember what they saw.
This Session Will:
- Demonstrate a workflow where Browser Use extracts dynamic web data, while Milvus stores and retrieves the data, that way you can always come back to what the agent saw.
- Showcase practical use cases, such as querying live web content with AI agents that reason over historical and visual data.
- Explore balancing autonomy and control in agentic systems, including challenges like hallucination mitigation and performance optimization.
Webinar - Zilliz Cloud Monthly Demo - March 2025Zilliz
Join our monthly demo for a technical overview of Zilliz Cloud, a highly scalable and performant vector database service for AI applications
Topics covered
- Zilliz Cloud's scalable architecture
- Key features of the developer-friendly UI
- Security best practices and data privacy
- Highlights from recent product releases
- This webinar is an excellent opportunity for developers to learn about Zilliz Cloud's capabilities and how it can support their AI projects. Register now to join our community and stay up-to-date with the latest vector database technology.
What Makes "Deep Research"? A Dive into AI AgentsZilliz
About this webinar:
Unless you live under a rock, you will have heard about OpenAI’s release of Deep Research on Feb 2, 2025. This new product promises to revolutionize how we answer questions requiring the synthesis of large amounts of diverse information. But how does this technology work, and why is Deep Research a noticeable improvement over previous attempts? In this webinar, we will examine the concepts underpinning modern agents using our basic clone, Deep Searcher, as an example.
Topics covered:
Tool use
Structured output
Reflection
Reasoning models
Planning
Types of agentic memory
Combining Lexical and Semantic Search with Milvus 2.5Zilliz
In short, lexical search is a way to search your documents based on the keywords they contain, in contrast to semantic search, which compares the similarity of embeddings. We’ll be covering:
Why, when, and how should you use lexical search
What is the BM25 distance metric
How exactly does Milvus 2.5 implement lexical search
How to build an improved hybrid lexical + semantic search with Milvus 2.5
Bedrock Data Automation (Preview): Simplifying Unstructured Data ProcessingZilliz
Bedrock Data Automation (BDA) is a cloud-based service that simplifies the process of extracting valuable insights from unstructured content—such as documents, images, video, and audio. Come learn how BDA leverages generative AI to automate the transformation of multi-modal data into structured formats, enabling developers to build applications and automate complex workflows with greater speed and accuracy.
Deploying a Multimodal RAG System Using Open Source Milvus, LlamaIndex, and vLLMZilliz
About this webinar:
While text-based RAG systems have been everywhere in the last year and a half, there is so much more than text data. Images, audio, and documents often need to be processed together to provide meaningful insights, yet most RAG implementations focus solely on text. Think about automated visual inspection systems understanding manufacturing logs and production line images, or robotics systems correlating sensor data with visual feedback. These multimodal scenarios demand RAG systems that go beyond text-only processing.
In this talk, we'll walk through how to build a Multimodal RAG system that helps solve this problem. We'll explore the architecture that makes it possible to run such a system and demonstrate how to build one using Milvus, LlamaIndex, and vLLM for deploying open-source LLMs on your infrastructure.
Through a live demo, we'll showcase a real-world application processing both images and text queries. Whether you're looking to reduce API costs, maintain data privacy, or gain more control over your AI infrastructure, this session will provide you with actionable insights to implement Multimodal RAG in your organization.
Topics covered:
- vLLM and self hosting LLMs
- Multimodal RAG Demo: a real-world application processing both images and text queries
February Product Demo: Discover the Power of Zilliz CloudZilliz
Join our monthly demo for a technical overview of Zilliz Cloud, a highly scalable and performant vector database service for AI applications
Topics covered
- Zilliz Cloud's scalable architecture
- Key features of the developer-friendly UI
- Security best practices and data privacy
- Highlights from recent product releases
This webinar is an excellent opportunity for developers to learn about Zilliz Cloud's capabilities and how it can support their AI projects. Register now to join our community and stay up-to-date with the latest vector database technology.
Full Text Search with Milvus 2.5 - UD Meetup Berlin Jan 23Zilliz
"Milvus 2.5 introduces text search by introducing native full text search capabilities, seamlessly combining term-based matching with vector similarity in a single system. This feature automatically handles text-to-vector conversion and real-time BM25 scoring, eliminating the complexity of manual embedding generation and external processing pipelines.
Through a live demo, we'll showcase how easy we make it to use Full Text search now :D"
Building the Next-Gen Apps with Multimodal Retrieval using Twelve Labs & MilvusZilliz
"This session dives deep into the power of Multimodal Retrieval, a revolutionzing approach that enhances personalization by seamlessly integrating diverse data sources for more intuitive product interactions. Explore the foundational concepts of Multimodal Embedding and Any-to-Any Search, and learn how to leverage these technologies to build next generation products. Discover how to seamlessly integrate the Twelve Labs Embed API and Milvus into your projects.
Through live demos, you’ll see how Fashion Product Search is redefined with deeper insights into the architecture, and discover how this approach is revolutionizing user interactions, especially with bots. We’ll also explore real world case studies that demonstrate the ease and power of building multimodal apps."
"Explore the transformative potential of Voice AI in customer interaction analysis powered by LLMs. Learn how Gemini 2.0 enables transcription, summarization, and actionable insight extraction to streamline ticket resolution and enhance customer experiences.
This session delves into the architecture and practical applications of LLM-powered systems, showcasing how they revolutionize customer support workflows through real-world examples and insights"
Accelerate AI Agents with Multimodal RAG powered by Friendli Endpoints and Mi...Zilliz
AI agents are transforming industries, especially with recent vision-language models like Llama 3.2 Vision that enable AI agents to go beyond text-based understanding by integrating multimodal capabilities. Building such advanced AI agents can feel complex, but FriendliAI simplifies the process by offering end-to-end solutions, from creating your own custom models to deploying them in production. In this webinar, we’ll learn about the AI developer workflow from model fine-tuning to inference serving. We’ll also work through building a simple AI agent with advanced multimodal RAG capabilities using Friendli Serverless Endpoints and Milvus DB. This session is ideal for those looking to learn more about large-language model inference serving, start building AI agents with RAG capabilities, or explore multimodal RAG queries in greater depth
1 Table = 1000 Words? Foundation Models for Tabular DataZilliz
Tables form the backbone of modern data storage, powering everything from relational databases to enterprise systems. Yet despite their ubiquity, we've barely scratched the surface of their potential. While Deep Learning has revolutionized our ability to process text and images, its impact on tabular data has been surprisingly limited. This gap is now being bridged through groundbreaking research in multimodal modeling, particularly with innovations like the TableGPT2 model. In this talk, we'll explore how these new multimodal foundation models are trained to understand tabular data, and demonstrate practical ways to unlock hidden value in your organization's data assets.
How Milvus allows you to run Full Text SearchZilliz
Milvus 2.5 introduces text search by introducing native full text search capabilities, seamlessly combining term-based matching with vector similarity in a single system. This feature automatically handles text-to-vector conversion and real-time BM25 scoring, eliminating the complexity of manual embedding generation and external processing pipelines.
How to Optimize Your Embedding Model Selection and Development through TDA Cl...Zilliz
About this webinar:
Embedding models are a crucial layer in vector database applications, yet figuring out which embedding model is best for your dataset has been a notoriously difficult task. However, an efficient and intuitive approach for many use cases can be produced through Topological Data Analysis (TDA) on your evaluation dataset. Identifying patterns of weak performing behavior in your model is made easy and scalable through a table that reveals the performance of different semantic categories of queries being made to your vector database.
Topics covered:
- Risks and limitations of current evaluation approaches for embeddings
- Compare embedding models on your own dataset using Navigable TDA clusters
- ML lifecycle case studies in ecommerce: model selection, fine-tuning, and post-deployment
Keeping Data Fresh: Mastering Updates in Vector DatabasesZilliz
Managing and extracting value from unstructured data has become a critical challenge as the volume of data continues to grow. This virtual event brings together industry experts to explore the latest techniques in Retrieval Augmented Generation (RAG) and vector databases.
Discover how RAG systems are revolutionizing natural language processing by seamlessly integrating information retrieval techniques, enabling more accurate and contextual language generation. Gain practical insights into building and optimizing these applications.
This session will also cover how vector databases like Milvus, play a key role in RAG and working with unstructured data. Learn proven strategies for maintaining data freshness, accuracy, and efficiency, ensuring your organization stays ahead of the curve.
Milvus 2.5: Full-Text Search, More Powerful Metadata Filtering, and more!Zilliz
Milvus 2.5 introduces text search by introducing native full text search capabilities, seamlessly combining term-based matching with vector similarity in a single system. This feature automatically handles text-to-vector conversion and real-time BM25 scoring, eliminating the complexity of manual embedding generation and external processing pipelines.
Vector Databases for Enhanced ClassificationZilliz
What will you learn?
In this webinar, we dive into the use of Milvus as a high-performance vector database tailored for handling large-scale document collections, focusing on European Commission and Parliament acts. Our approach shifts from traditional RAG-based classification to a hybrid search method, leveraging K-Nearest Neighbor (KNN) for pinpointing top documents relevant to classification tasks. This session is ideal for those aiming to refine classification accuracy by leveraging vector-based indexing and hybrid retrieval in vast datasets.
Topics covered:
KNN and Sparse Search Integration: How KNN retrieval combined with sparse search helps extract top documents aligned with classification needs.
Versatile Embeddings for Multilingual and Multi-Domain Applications: The BGE M3-Embedding model is designed to provide robust, high-quality embeddings across multiple languages and domains, making it adaptable for diverse tasks in multilingual and cross-functional environments.
Real-World Application: Step-by-step demonstration using European legislative acts to showcase KNN-driven retrieval and classification workflows.
Scaling GenAI Inference From Prototype to Production: Real-World Lessons in S...Anish Kumar
Presented by: Anish Kumar
LinkedIn: https://www.linkedin.com/in/anishkumar/
This lightning talk dives into real-world GenAI projects that scaled from prototype to production using Databricks’ fully managed tools. Facing cost and time constraints, we leveraged four key Databricks features—Workflows, Model Serving, Serverless Compute, and Notebooks—to build an AI inference pipeline processing millions of documents (text and audiobooks).
This approach enables rapid experimentation, easy tuning of GenAI prompts and compute settings, seamless data iteration and efficient quality testing—allowing Data Scientists and Engineers to collaborate effectively. Learn how to design modular, parameterized notebooks that run concurrently, manage dependencies and accelerate AI-driven insights.
Whether you're optimizing AI inference, automating complex data workflows or architecting next-gen serverless AI systems, this session delivers actionable strategies to maximize performance while keeping costs low.
Presentation given at the LangChain community meetup London
https://lu.ma/9d5fntgj
Coveres
Agentic AI: Beyond the Buzz
Introduction to AI Agent and Agentic AI
Agent Use case and stats
Introduction to LangGraph
Build agent with LangGraph Studio V2
Domino IQ – Was Sie erwartet, erste Schritte und Anwendungsfällepanagenda
Webinar Recording: https://www.panagenda.com/webinars/domino-iq-was-sie-erwartet-erste-schritte-und-anwendungsfalle/
HCL Domino iQ Server – Vom Ideenportal zur implementierten Funktion. Entdecken Sie, was es ist, was es nicht ist, und erkunden Sie die Chancen und Herausforderungen, die es bietet.
Wichtige Erkenntnisse
- Was sind Large Language Models (LLMs) und wie stehen sie im Zusammenhang mit Domino iQ
- Wesentliche Voraussetzungen für die Bereitstellung des Domino iQ Servers
- Schritt-für-Schritt-Anleitung zur Einrichtung Ihres Domino iQ Servers
- Teilen und diskutieren Sie Gedanken und Ideen, um das Potenzial von Domino iQ zu maximieren
If You Use Databricks, You Definitely Need FMESafe Software
DataBricks makes it easy to use Apache Spark. It provides a platform with the potential to analyze and process huge volumes of data. Sounds awesome. The sales brochure reads as if it is a can-do-all data integration platform. Does it replace our beloved FME platform or does it provide opportunities for FME to shine? Challenge accepted
ELNL2025 - Unlocking the Power of Sensitivity Labels - A Comprehensive Guide....Jasper Oosterveld
Sensitivity labels, powered by Microsoft Purview Information Protection, serve as the foundation for classifying and protecting your sensitive data within Microsoft 365. Their importance extends beyond classification and play a crucial role in enforcing governance policies across your Microsoft 365 environment. Join me, a Data Security Consultant and Microsoft MVP, as I share practical tips and tricks to get the full potential of sensitivity labels. I discuss sensitive information types, automatic labeling, and seamless integration with Data Loss Prevention, Teams Premium, and Microsoft 365 Copilot.
Trends Artificial Intelligence - Mary MeekerClive Dickens
Mary Meeker’s 2024 AI report highlights a seismic shift in productivity, creativity, and business value driven by generative AI. She charts the rapid adoption of tools like ChatGPT and Midjourney, likening today’s moment to the dawn of the internet. The report emphasizes AI’s impact on knowledge work, software development, and personalized services—while also cautioning about data quality, ethical use, and the human-AI partnership. In short, Meeker sees AI as a transformative force accelerating innovation and redefining how we live and work.
National Fuels Treatments Initiative: Building a Seamless Map of Hazardous Fu...Safe Software
The National Fuels Treatments Initiative (NFT) is transforming wildfire mitigation by creating a standardized map of nationwide fuels treatment locations across all land ownerships in the United States. While existing state and federal systems capture this data in diverse formats, NFT bridges these gaps, delivering the first truly integrated national view. This dataset will be used to measure the implementation of the National Cohesive Wildland Strategy and demonstrate the positive impact of collective investments in hazardous fuels reduction nationwide. In Phase 1, we developed an ETL pipeline template in FME Form, leveraging a schema-agnostic workflow with dynamic feature handling intended for fast roll-out and light maintenance. This was key as the initiative scaled from a few to over fifty contributors nationwide. By directly pulling from agency data stores, oftentimes ArcGIS Feature Services, NFT preserves existing structures, minimizing preparation needs. External mapping tables ensure consistent attribute and domain alignment, while robust change detection processes keep data current and actionable. Now in Phase 2, we’re migrating pipelines to FME Flow to take advantage of advanced scheduling, monitoring dashboards, and automated notifications to streamline operations. Join us to explore how this initiative exemplifies the power of technology, blending FME, ArcGIS Online, and AWS to solve a national business problem with a scalable, automated solution.
Mastering AI Workflows with FME - Peak of Data & AI 2025Safe Software
Harness the full potential of AI with FME: From creating high-quality training data to optimizing models and utilizing results, FME supports every step of your AI workflow. Seamlessly integrate a wide range of models, including those for data enhancement, forecasting, image and object recognition, and large language models. Customize AI models to meet your exact needs with FME’s powerful tools for training, optimization, and seamless integration
Creating an Accessible Future-How AI-powered Accessibility Testing is Shaping...Impelsys Inc.
Web accessibility is a fundamental principle that strives to make the internet inclusive for all. According to the World Health Organization, over a billion people worldwide live with some form of disability. These individuals face significant challenges when navigating the digital landscape, making the quest for accessible web content more critical than ever.
Enter Artificial Intelligence (AI), a technological marvel with the potential to reshape the way we approach web accessibility. AI offers innovative solutions that can automate processes, enhance user experiences, and ultimately revolutionize web accessibility. In this blog post, we’ll explore how AI is making waves in the world of web accessibility.
Bridging the divide: A conversation on tariffs today in the book industry - T...BookNet Canada
A collaboration-focused conversation on the recently imposed US and Canadian tariffs where speakers shared insights into the current legislative landscape, ongoing advocacy efforts, and recommended next steps. This event was presented in partnership with the Book Industry Study Group.
Link to accompanying resource: https://bnctechforum.ca/sessions/bridging-the-divide-a-conversation-on-tariffs-today-in-the-book-industry/
Presented by BookNet Canada and the Book Industry Study Group on May 29, 2025 with support from the Department of Canadian Heritage.
Integration of Utility Data into 3D BIM Models Using a 3D Solids Modeling Wor...Safe Software
Jacobs has developed a 3D utility solids modelling workflow to improve the integration of utility data into 3D Building Information Modeling (BIM) environments. This workflow, a collaborative effort between the New Zealand Geospatial Team and the Australian Data Capture Team, employs FME to convert 2D utility data into detailed 3D representations, supporting enhanced spatial analysis and clash detection.
To enable the automation of this process, Jacobs has also developed a survey data standard that standardizes the capture of existing utilities. This standard ensures consistency in data collection, forming the foundation for the subsequent automated validation and modelling steps. The workflow begins with the acquisition of utility survey data, including attributes such as location, depth, diameter, and material of utility assets like pipes and manholes. This data is validated through a custom-built tool that ensures completeness and logical consistency, including checks for proper connectivity between network components. Following validation, the data is processed using an automated modelling tool to generate 3D solids from 2D geometric representations. These solids are then integrated into BIM models to facilitate compatibility with 3D workflows and enable detailed spatial analyses.
The workflow contributes to improved spatial understanding by visualizing the relationships between utilities and other infrastructure elements. The automation of validation and modeling processes ensures consistent and accurate outputs, minimizing errors and increasing workflow efficiency.
This methodology highlights the application of FME in addressing challenges associated with geospatial data transformation and demonstrates its utility in enhancing data integration within BIM frameworks. By enabling accurate 3D representation of utility networks, the workflow supports improved design collaboration and decision-making in complex infrastructure projects
Establish Visibility and Manage Risk in the Supply Chain with Anchore SBOMAnchore
Over 70% of any given software application consumes open source software (most likely not even from the original source) and only 15% of organizations feel confident in their risk management practices.
With the newly announced Anchore SBOM feature, teams can start safely consuming OSS while mitigating security and compliance risks. Learn how to import SBOMs in industry-standard formats (SPDX, CycloneDX, Syft), validate their integrity, and proactively address vulnerabilities within your software ecosystem.
For the full video of this presentation, please visit: https://www.edge-ai-vision.com/2025/06/state-space-models-vs-transformers-for-ultra-low-power-edge-ai-a-presentation-from-brainchip/
Tony Lewis, Chief Technology Officer at BrainChip, presents the “State-space Models vs. Transformers for Ultra-low-power Edge AI” tutorial at the May 2025 Embedded Vision Summit.
At the embedded edge, choices of language model architectures have profound implications on the ability to meet demanding performance, latency and energy efficiency requirements. In this presentation, Lewis contrasts state-space models (SSMs) with transformers for use in this constrained regime. While transformers rely on a read-write key-value cache, SSMs can be constructed as read-only architectures, enabling the use of novel memory types and reducing power consumption. Furthermore, SSMs require significantly fewer multiply-accumulate units—drastically reducing compute energy and chip area.
New techniques enable distillation-based migration from transformer models such as Llama to SSMs without major performance loss. In latency-sensitive applications, techniques such as precomputing input sequences allow SSMs to achieve sub-100 ms time-to-first-token, enabling real-time interactivity. Lewis presents a detailed side-by-side comparison of these architectures, outlining their trade-offs and opportunities at the extreme edge.
מכונות CNC קידוח אנכיות הן הבחירה הנכונה והטובה ביותר לקידוח ארונות וארגזים לייצור רהיטים. החלק נוסע לאורך ציר ה-x באמצעות ציר דיגיטלי מדויק, ותפוס ע"י צבת מכנית, כך שאין צורך לבצע setup (התאמות) לגדלים שונים של חלקים.
Data Virtualization: Bringing the Power of FME to Any ApplicationSafe Software
Imagine building web applications or dashboards on top of all your systems. With FME’s new Data Virtualization feature, you can deliver the full CRUD (create, read, update, and delete) capabilities on top of all your data that exploit the full power of FME’s all data, any AI capabilities. Data Virtualization enables you to build OpenAPI compliant API endpoints using FME Form’s no-code development platform.
In this webinar, you’ll see how easy it is to turn complex data into real-time, usable REST API based services. We’ll walk through a real example of building a map-based app using FME’s Data Virtualization, and show you how to get started in your own environment – no dev team required.
What you’ll take away:
-How to build live applications and dashboards with federated data
-Ways to control what’s exposed: filter, transform, and secure responses
-How to scale access with caching, asynchronous web call support, with API endpoint level security.
-Where this fits in your stack: from web apps, to AI, to automation
Whether you’re building internal tools, public portals, or powering automation – this webinar is your starting point to real-time data delivery.
Discover 7 best practices for Salesforce Data Cloud to clean, integrate, secure, and scale data for smarter decisions and improved customer experiences.
Interested in leveling up your JavaScript skills? Join us for our Introduction to TypeScript workshop.
Learn how TypeScript can improve your code with dynamic typing, better tooling, and cleaner architecture. Whether you're a beginner or have some experience with JavaScript, this session will give you a solid foundation in TypeScript and how to integrate it into your projects.
Workshop content:
- What is TypeScript?
- What is the problem with JavaScript?
- Why TypeScript is the solution
- Coding demo