Langchain App Design

Summary:

Each function in the pipeline relies on the output of the previous step, creating a seamless flow
of data from loading the PDF to asking and answering questions about its content. The pipeline
utilizes embeddings and a pre-trained language model to enable question-answering on the
content of the PDF document.

Function Description:

● loadPDFFromLocal(pdf): This function takes the path of a PDF file as input and
attempts to load it using the UnstructuredPDFLoader class. If successful, it returns the
loaded PDF document. If an exception occurs during the loading process, it catches the
error and returns None.
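
A minimal sketch of this function, assuming the classic langchain (0.0.x) import paths and
the unstructured package installed; error handling is simplified:

    from langchain.document_loaders import UnstructuredPDFLoader

    def loadPDFFromLocal(pdf):
        try:
            loader = UnstructuredPDFLoader(pdf)
            return loader.load()  # returns a list of Document objects
        except Exception as e:
            print(f"Failed to load PDF: {e}")
            return None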

● splitDocument(loaded_docs): This function takes the loaded PDF document as input
and splits it into smaller chunks of text using the CharacterTextSplitter class. The chunks
are typically of a fixed size with an overlap. The purpose of splitting the document is to
create smaller units for processing, as processing large documents whole can be
computationally expensive. If successful, it returns the chunked documents. If an
exception occurs during the splitting process, it catches the error and returns None.
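
A sketch under the same assumptions; chunk_size and chunk_overlap are illustrative
values, not taken from the original:

    from langchain.text_splitter import CharacterTextSplitter

    def splitDocument(loaded_docs, chunk_size=1000, chunk_overlap=100):
        try:
            splitter = CharacterTextSplitter(chunk_size=chunk_size,
                                             chunk_overlap=chunk_overlap)
            return splitter.split_documents(loaded_docs)
        except Exception as e:
            print(f"Failed to split document: {e}")
            return None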

● createEmbeddings(chunked_docs): This function takes the chunked documents as
input and creates numerical embeddings using the HuggingFaceEmbeddings class.
Embeddings are dense representations of the text data, allowing for semantic
understanding and similarity comparison. The embeddings are then stored in a FAISS
vector store. If successful, it returns the vector store. If an exception occurs during the
embedding creation process, it catches the error and returns None.
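
A sketch, additionally assuming faiss-cpu and sentence-transformers are installed
(HuggingFaceEmbeddings defaults to a sentence-transformers model):

    from langchain.embeddings import HuggingFaceEmbeddings
    from langchain.vectorstores import FAISS

    def createEmbeddings(chunked_docs):
        try:
            embeddings = HuggingFaceEmbeddings()  # default sentence-transformers model
            return FAISS.from_documents(chunked_docs, embeddings)
        except Exception as e:
            print(f"Failed to create embeddings: {e}")
            return None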

● loadLLMModel(): This function loads a pre-trained language model using the Hugging
Face Hub. Specifically, it loads the "flan-alpaca-large" model with specific keyword
arguments like "temperature": 0 and "max_length": 512. Additionally, it creates a
question-answering chain using the load_qa_chain function, where the language model
is used to answer questions related to the input documents. If successful, it returns the
question-answering chain. If an exception occurs during the loading process, it catches
the error and returns None.
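
A sketch of this function. The Hub repo id (declare-lab/flan-alpaca-large) and the
chain_type="stuff" argument are assumptions, and a HUGGINGFACEHUB_API_TOKEN environment
variable is required:

    from langchain.llms import HuggingFaceHub
    from langchain.chains.question_answering import load_qa_chain

    def loadLLMModel():
        try:
            llm = HuggingFaceHub(
                repo_id="declare-lab/flan-alpaca-large",  # assumed repo id
                model_kwargs={"temperature": 0, "max_length": 512},
            )
            return load_qa_chain(llm, chain_type="stuff")  # chain type assumed
        except Exception as e:
            print(f"Failed to load LLM: {e}")
            return None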

● askQuestions(vector_store, chain, question): This function takes the vector store
(which contains the embeddings of the PDF chunks), the loaded question-answering
chain, and a specific question as input. It uses the vector store to find similar documents
related to the question (via similarity search). Then, it uses the question-answering chain
to answer the given question based on the most similar documents. If successful, it
returns the response (the answer to the question). If an exception occurs during the
process, it catches the error and returns None.
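
A sketch under the same assumptions; calling chain.run with input_documents and
question is the classic load_qa_chain convention:

    def askQuestions(vector_store, chain, question):
        try:
            # Retrieve the chunks most similar to the question, then answer from them.
            similar_docs = vector_store.similarity_search(question)
            return chain.run(input_documents=similar_docs, question=question)
        except Exception as e:
            print(f"Failed to answer question: {e}")
            return None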

● create_vector_store(pdf_file_path): This function acts as a pipeline to execute the
entire process. It first loads the PDF from a local file using loadPDFFromLocal, then
splits the document into chunks using splitDocument, and finally creates embeddings
using createEmbeddings. It then returns the resulting vector store.
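
A sketch consistent with the Function I/O section below, which lists pdf_file_path as this
function's input; each step returns None on failure, so the pipeline short-circuits:

    def create_vector_store(pdf_file_path):
        loaded_docs = loadPDFFromLocal(pdf_file_path)
        if loaded_docs is None:
            return None
        chunked_docs = splitDocument(loaded_docs)
        if chunked_docs is None:
            return None
        return createEmbeddings(chunked_docs)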

● run_ask_questions(vector_store): This function executes the question-answering
process using the previously created vector store. It loads the language model using
loadLLMModel, and then uses askQuestions to ask a specific question related to the
content of the PDF. It returns the response (answer) to the question.
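
A sketch; the question string below is a placeholder, since the original's predefined
question is not specified:

    def run_ask_questions(vector_store):
        chain = loadLLMModel()
        if chain is None:
            return None
        question = "What is this document about?"  # placeholder question
        return askQuestions(vector_store, chain, question)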

Function I/O

Loading PDF from Local (loadPDFFromLocal function):
● Input: Path to a PDF file (pdf_file_path).
● Output: loaded_docs (loaded PDF document).

Splitting Document (splitDocument function):
● Input: loaded_docs (loaded PDF document).
● Output: chunked_docs (list of smaller text chunks).

Creating Embeddings (createEmbeddings function):
● Input: chunked_docs (list of smaller text chunks).
● Output: vector_store (embedding representation of the chunks).

Loading LLM Model (loadLLMModel function):
● Input: None (no explicit input required).
● Output: chain (question-answering chain with a pre-trained language model).

Creating Vector Store (create_vector_store function):
● Input: pdf_file_path (path to a PDF file).
● Output: vector_store (embedding representation of the PDF content).

Asking Questions (askQuestions function):
● Input: vector_store (embedding representation of the chunks), chain (question-answering chain), question (user-provided question).
● Output: response (answer to the user's question).

Running Ask Questions (run_ask_questions function):
● Input: vector_store (embedding representation of the PDF content).
● Output: response (answer to the predefined question).
Low Level Flow Diagram

Vector Storages:

Elastic:
Elastic can store and index vector representations of the data, making it possible to perform
similarity search and analytics on the vectors. It can be used to store vector embeddings and
other associated metadata for later retrieval.

FAISS:
FAISS is not a data store but a library for similarity search and clustering of dense vectors. It is
specifically designed for handling high-dimensional vectors efficiently and can complement other
data stores for similarity search tasks.

Redis:
Redis, with the RedisAI module, can be used to store and retrieve vector embeddings efficiently.
It provides low-latency access to vectors, which can be beneficial for real-time applications with
large language models.

MongoDB:
MongoDB can be used to store vector embeddings along with associated metadata. It supports
JSON-like data structures, making it suitable for storing varying types of data, including vectors.

Milvus:
Milvus is designed explicitly for similarity search and AI applications, including storing and
managing large-scale vector embeddings. It provides high-performance indexing and retrieval of
vectors, making it an excellent choice for LLM-related applications.

Chroma:
Chroma is a lightweight vector storage and retrieval system and could potentially handle vector
embeddings, but it might not be as feature-rich or optimized as dedicated vector databases like
Milvus.

Supabase:
Supabase can handle JSON data, which can be used to store vector embeddings along with
other associated information. However, it may not have specialized features for handling
large-scale similarity search tasks.

scikit-learn (sklearn):
scikit-learn is not a data store but a machine learning library. While it can be used for vector
operations and dimensionality reduction, it's not designed for large-scale data storage and
retrieval.

Considering the above, when storing vectors to be used with large language models,
specialized vector databases like Milvus are tailored for handling similarity search tasks
efficiently. However, other data stores like Elastic, Redis (with RedisAI), and MongoDB can also
be used effectively depending on the specific requirements and use cases of your application.
Be sure to consider factors such as data volume, query performance, and scalability while
making your decision.
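
Because LangChain exposes a common vector-store interface, swapping FAISS for an
alternative backend is usually a small change. A sketch, assuming the chromadb package is
installed (Chroma is used here purely as an example of the swap):

    from langchain.embeddings import HuggingFaceEmbeddings
    from langchain.vectorstores import FAISS, Chroma

    def build_store(chunked_docs, backend="faiss"):
        # chunked_docs comes from splitDocument() above.
        embeddings = HuggingFaceEmbeddings()
        if backend == "faiss":
            return FAISS.from_documents(chunked_docs, embeddings)
        return Chroma.from_documents(chunked_docs, embeddings)
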
Comparison of Cloud Solutions

Amazon SageMaker (AWS):

Pros:
● Fully managed service with end-to-end machine learning workflow support.
● Provides pre-configured environments for popular machine learning frameworks.
● Supports distributed training and model deployment at scale.
● Seamless integration with other AWS services.
● Offers specialized instance types (e.g., GPU-based instances) for efficient handling of
large language models.
Cons:
● Pricing can be complex and may become costly for high usage.

Google Cloud AI Platform (GCP):

Pros:
● Fully managed service with similar capabilities to Amazon SageMaker.
● Integrates well with other Google Cloud services.
● Provides access to specialized hardware accelerators like TPUs for high-performance
machine learning.
Cons:
● Like other cloud solutions, pricing can be a consideration for resource-intensive
workloads.

Microsoft Azure Machine Learning:

Pros:
● Comprehensive machine learning platform with robust tools and services.
● Supports distributed training and large-scale model deployment.
● Integrates seamlessly with other Azure services.
● Offers GPU and FPGA support for acceleration.
Cons:
● Some users may find the user interface and workflow a bit complex.

IBM Watson Machine Learning:

Pros:
● Managed service with support for popular machine learning frameworks.
● Provides scalability for handling large datasets and models.
● Integration with other IBM Cloud services.
Cons:
● Feature set may be less extensive compared to the major cloud providers.

Paperspace Gradient:

Pros:
● A cloud-based platform focused on machine learning and AI.
● Provides pre-configured environments for deep learning and NLP tasks.
● Supports GPU and TPU instances for performance optimization.
Cons:
● May have a smaller user base and ecosystem compared to major cloud providers.

FloydHub:

Pros:
● Cloud platform specifically designed for machine learning and data science.
● Easy to set up and use for training large models.
● Provides GPU and TPU support.
Cons:
● Smaller in scale compared to major cloud providers, potentially affecting service
availability and pricing.

Databricks:

Pros:
● Offers a collaborative workspace for big data and machine learning tasks.
● Integrates well with Apache Spark for scalable data processing.
● Provides GPU support for machine learning tasks.
Cons:
● May have a steeper learning curve for users new to Apache Spark and distributed
computing.
