The Case Against Vector Databases
Even VCs like Sequoia admit:
AI infrastructure is getting overbuilt.
They already see this with GPUs, and vector dbs are much easier to build in comparison, so even more prone to overbuilding.
VCs see vector databases as an investment in "picks and shovels" for AI,
with a proven business model (databases). 💸
With so much funding (and thus so much sponsored content),
and with all the hype around AI,
it's easy to assume you need a vector db in your AI project.
Why you don’t need a vector db:
2. You don't have enough data to use it anyway (see next slides)
3. Information retrieval is not your core focus, and it's better to
integrate an out-of-the-box solution
Way too fast…
Alternatives sufficient for most data needs:
1. An LLM alone can fit all your data in its context window - no need for vector search
2. Exhaustive vector search (brute-force), as in the sketch below
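A minimal sketch of exhaustive (brute-force) vector search in NumPy; the corpus size, dimensionality, and cosine-similarity scoring are illustrative assumptions, not figures from the deck.

```python
import numpy as np

# Illustrative assumptions: 100k documents with 384-dim embeddings.
rng = np.random.default_rng(0)
doc_vectors = rng.standard_normal((100_000, 384)).astype(np.float32)
query = rng.standard_normal(384).astype(np.float32)

# Normalize so a dot product equals cosine similarity.
doc_vectors /= np.linalg.norm(doc_vectors, axis=1, keepdims=True)
query /= np.linalg.norm(query)

# Exhaustive search: score every document, then take the top k.
scores = doc_vectors @ query           # one matrix-vector product
top_k = np.argsort(scores)[::-1][:5]   # indices of the 5 best matches
print(top_k, scores[top_k])
```

No index to build, no extra database to run: a few lines of NumPy cover surprisingly large corpora.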
A vector database comes with a cost.
Hidden costs of vector databases:
1. Yet another database to maintain
2. Need to sync data with other dbs
👉 Always verify your users' needs first
before proceeding with an ambitious implementation :)
Vector search is an optimization.
As engineers often say, "premature optimization is the root of all evil."
What vector search tries to optimize
[Diagram: vector search positioned between 🔎 keyword search and 🧠 LLMs]
Why keyword search rocks! 🎸
1. Often performs better than vector search
2. Generalizes well to unseen, out-of-domain data
3. Transparent search mechanics (see the BM25 sketch below)
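A minimal sketch of keyword search using BM25 via the rank_bm25 package; the toy corpus and whitespace tokenization are illustrative assumptions.

```python
from rank_bm25 import BM25Okapi  # pip install rank-bm25

# Illustrative toy corpus.
corpus = [
    "vector databases store embeddings for similarity search",
    "keyword search ranks documents by term overlap",
    "BM25 is a classic ranking function for keyword search",
]
tokenized_corpus = [doc.lower().split() for doc in corpus]

bm25 = BM25Okapi(tokenized_corpus)

query = "keyword search ranking".lower().split()
scores = bm25.get_scores(query)            # one relevance score per document
best = bm25.get_top_n(query, corpus, n=2)  # the 2 best-matching documents
print(scores, best)
```

Every score traces back to explicit term matches, which is exactly what makes keyword search easy to debug.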
Vector search needs an embedding model
Vector search relies on a neural net that encodes the data into vectors:
• a generic model can perform worse on data from a narrow domain
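A minimal sketch of generating embeddings with the sentence-transformers library; the model name all-MiniLM-L6-v2 is just a common general-purpose example, not the deck's recommendation.

```python
from sentence_transformers import SentenceTransformer  # pip install sentence-transformers

# A generic, general-purpose model; on narrow-domain data (legal, medical,
# internal jargon) such a model can embed poorly without fine-tuning.
model = SentenceTransformer("all-MiniLM-L6-v2")

docs = ["acute myocardial infarction", "heart attack", "stock market crash"]
embeddings = model.encode(docs, normalize_embeddings=True)

# Cosine similarities via dot products on normalized vectors.
print(embeddings @ embeddings.T)
```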
Embeddings are inherently limited
Most advanced solutions usually combine different techniques:
🔎 keyword search + 🧠 LLMs + 🗂️ vector search = HYBRID SEARCH
(a fusion sketch follows after the tips below)
1. Make sure to start simple, with the right components.
2. Focus on optimizing whatever provides the best boost to overall accuracy.
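A minimal sketch of one common way to fuse keyword and vector results, reciprocal rank fusion (RRF); the function and the constant k=60 are standard illustrative choices, not something the slides prescribe.

```python
# Hybrid search via reciprocal rank fusion (RRF): merge two ranked lists
# of document ids without having to calibrate their raw scores.

def rrf_fuse(rankings: list[list[str]], k: int = 60) -> list[str]:
    """Fuse ranked lists; k=60 is the conventional RRF constant."""
    scores: dict[str, float] = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank + 1)
    return sorted(scores, key=scores.get, reverse=True)

# Illustrative ranked results from each retriever (best first).
keyword_hits = ["doc3", "doc1", "doc7"]  # e.g. from BM25
vector_hits = ["doc1", "doc5", "doc3"]   # e.g. from embedding search

print(rrf_fuse([keyword_hits, vector_hits]))  # doc1 and doc3 rise to the top
```

RRF needs only ranks, not comparable scores, which is why it is a popular default for combining heterogeneous retrievers.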
Vector dbs x AutoGPT: an overkill solution
AutoGPT generates one embedding per LLM call. If each LLM call takes 10 seconds,
one embedding is generated every 10 s, so you reach 100k embeddings only after ~11 days.
Even then, brute-force vector search (np.dot) takes mere milliseconds.

[Diagram: a loop of 10 s LLM calls, each followed by a ~1 ms np.dot() lookup; even after 100k calls, np.dot() takes <100 ms]

Embeddings | np.dot time | AutoGPT time | AutoGPT cost
1          | <1 ms       | 10 s         | $0.27
10k        | <10 ms      | 27 h         | $2.7k
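A minimal sketch for checking the np.dot timing claim yourself; the 100k × 1536 shape (ada-002-sized embeddings) is an illustrative assumption.

```python
import time

import numpy as np

# Illustrative: 100k embeddings at 1536 dims (ada-002-sized).
rng = np.random.default_rng(0)
corpus = rng.standard_normal((100_000, 1536)).astype(np.float32)
query = rng.standard_normal(1536).astype(np.float32)

start = time.perf_counter()
scores = corpus @ query  # brute-force: score all 100k documents at once
elapsed_ms = (time.perf_counter() - start) * 1000
print(f"exhaustive search over 100k vectors: {elapsed_ms:.1f} ms")
```

On typical hardware this lands in the tens of milliseconds, dwarfed by a 10-second LLM call.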
Startups usually build value where no one else has before (blue ocean strategy).
Today, most novel value can be added by adopting generative LLMs.
AI strategy - seek 10x improvement
In the example, improving company search with vector search yields a 25 pp gain,
while building sales automation with LLMs would be much more disruptive compared to previous methods.

[Bar chart: % of automation (made-up data serving as an example)]
keyword search: 50%
vector search (fine-tuned): 60%
hybrid search: 75%
sales automation: 30%
sales automation with LLM: 80%

Some use cases might benefit much more from the current LLM revolution.
Future research 🔬
There's only so much meaning you can squeeze into a vector.
On the other hand, generative LLMs will keep getting better.
Think about your users' needs.
Simpler = better.
If you found this page helpful, go ahead and share it with friends.
@Dariusz Semba
Sources / further reading
1. Vector Search with OpenAI Embeddings: Lucene Is All You Need paper
2. SPLADE: Sparse Lexical and Expansion Model for First Stage Ranking paper
3. On Hybrid Search by Qdrant
4. Beware Tunnel Vision in AI Retrieval by Colin Harman
5. Emerging Architectures for LLM Applications by a16z
7. Auto-GPT Unmasked: The Hype and Hard Truths of Its Production Pitfalls by Jina.AI
8. Why AutoGPT engineers ditched vector databases by Dariusz Semba
9. Introducing Natural Language Search for Podcast Episodes by Spotify
10. Why You Shouldn’t Invest In Vector Databases? by Yingjun Wu