0% found this document useful (0 votes)

4 views

Introduction To Docs and Image Based Voice Chatbots

Uploaded by

shivaninaim

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views

Introduction To Docs and Image Based Voice Chatbots

Uploaded by

shivaninaim

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 17

Introduction to Docs

and Image-Based
Voice Chatbots
The project focuses on creating a voice chatbot that can read and understand
documents, like PDFs, Images and respond to voice queries.

It aims to enhance user interaction with technology through natural language

processing and optical character recognition, making the chatbot a smart
conversational agent for various applications

SUBMITTED BY :

RAKESH H R 1BM21EC413
SHIVANI S NAIK 1BM20EC142
VANSH JAIN 1BM20EC183
JAIDEEP A HEGWAD 1BM20EC059
Problem Definition
Integrating voice interaction Documents and images.
The project confronts the significant Traditional voice chatbots are adept at
challenge of integrating voice interaction handling spoken or written queries but fall
with the ability to process and interpret short when users need to extract and
both textual and visual data from PDF files discuss content from documents and
and images. images.

Accessibility Challenges
This limitation is particularly acute in sectors where information is conveyed through a
combination of text and visuals, such as academic research, technical manuals, and medical
imaging.
Proposed Solution
To address the problem of inefficient and time-consuming document and image
retrieval during voice-based interactions, we propose a comprehensive solution. This
innovative system will leverage advanced natural language processing and computer
vision techniques to seamlessly integrate textual and visual information into a voice
chatbot interface.

Voice Chatbot: Develop a sophisticated voice chatbot that can read, understand, and
interact based on the content of uploaded PDFs and other documents.

Technology Integration: Employ natural language processing (NLP) and optical

character recognition (OCR) to enable document comprehension and voice interaction.

Enhanced Interaction: The chatbot will provide a seamless, responsive conversational

experience, improving user engagement with digital content.

Broad Applicability: This solution has the potential to revolutionize industries such as
education, customer support, and accessibility by providing a more natural
communication interface.
Road maps
1 Landing Page & Navigation Page

2 Authentication

3 Functionality

4 Payments & Launch

Functionality
PDF Text Extraction: The system reads PDF documents and extracts text using
PyPDF2.

Text Chunking: The extracted text is split into manageable chunks using
Langchain’s Recursive Character Text Splitter.

Vector Store Creation: Text chunks are converted into embeddings and indexed
using FAISS for quick retrieval.

Conversational Chain: A conversational chain is established using Langchain and

Google Generative AI to generate context-aware responses.

User Interaction: Users can interact with the chatbot via a Streamlit interface,
asking questions that the chatbot answers based on the PDF content.
Flow Chart of Functionality
Project Flow

1. Upload PDF: User uploads PDF documents.

2. Extract Text: System extracts text from PDFs using PyPDF2.

3. Split Text: Text is split into chunks for processing.

4. Generate Embeddings: Convert text chunks into embeddings.

5. Create Vector Store: Embeddings are indexed in a vector store using FAISS.

6. User Query: User inputs a question to the chatbot.

7. Retrieve Documents: System retrieves relevant document sections based on the query.

8. Generate Response: Chatbot generates a response using the conversational chain.

9. Display Response: Response is displayed to the user.

10. End: End of the process.

Architecture

1.User Interface (UI): This is where users interact with the chatbot through voice commands or text
input. It’s designed to be intuitive and user-friendly.
2.Voice Recognition: When a user speaks, this component converts the spoken words into text using
speech-to-text technology.
3.Text Processing: This core part uses natural language processing (NLP) to understand the user’s
intent and context from the text.
4.Document Processing:
• PDF Processing: Extracts text from PDF files using OCR technology.
• Image Processing: Analyzes images to understand content like charts or graphs.
5.Dialogue Management: Manages the conversation flow, deciding how the chatbot should respond
based on the user’s queries and the information extracted from documents and images.
6.Response Generation: Uses NLP to create a natural and relevant response, which is then converted
from text to speech if needed.
7.Learning Component: Gathers data from interactions to improve the chatbot’s performance over
time.
Architecture of CHATBOT
Technologies Used
Streamlit PyPDF2 Langchain Google
Generative AI
For creating the web To read PDF files and For text splitting and
application interface. extract text. managing For generating
conversational chains. embeddings and
responses.

FAISS: Dotenv
For efficient similarity For managing
search and indexing of environment variables.
text chunks.
LITERATURE SURVEY
NO AUTHOR TITLE PAPERS OUTCOME DRAWBACK

1. M. A. Khadija Designing a PDF- 2023 1. Development of a PDF-Driven Chatbot 1. E-books are perceived
Driven Chatbot International using Generative AI. as uncomfortable for
A. Aziz, powered by OpenAI Conference 2. Utilization of LangChain Framework, prolonged reading
ChatGPT, on Computer Chat-GPT (GPT3.5 Turbo), and Pinecone for sessions.
response generation. 2. Potential limitations in
3. Successful demonstration of the chatbot's accessibility and
ability to provide coherent responses aligned readability for some
with the content of PDF documents. users.

2. Semmy Wellem AI-powered Chatbot 2023 5th 1. Introduction of Unklabot 1.0, showcasing 1. Dependency on
Taju, Andria for Information International innovative integration of advanced AI external API (OpenAI
Kusuma Wahyudi, Service at Klabat Conference technologies for information services within GPT-3) might lead to
Green Ferry University by on Klabat University. potential limitations or
Mandias, Reymon Integrating OpenAI Cybernetics 2. Improved accuracy and efficiency in disruptions in service if
Rotikan, Jimmy GPT-3 with Intent and question answering capabilities through the the API becomes
Herawan Recognition and Intelligent integration of intent recognition and semantic unavailable or undergoes
Semantic Search. System search techniques. changes. 2. Lack of
discussion on potential
privacy or security
concerns associated with
using an external AI
model for handling
NO AUTHOR TITLE PAPERS OUTCOME DRAWBACK

3 Max Dean, An AI Chatbot 2023 31st Irish Conference on 1. Development of a large 1. Dependency on
Michael F. for Interacting Artificial Intelligence language model (LLM) arXiv restricts the
McTear, Raymond with Academic augmentation chatbot diversity of papers
R. Bond and Research, tailored for computer and may limit the
Maurice D. science research queries. applicability of the
Mulvenna 2. Embedding of around chatbot to broader
200,000 computer science research domains.
research papers from arXiv, 2. Limited testing
resulting in ~11 million scope with only 30
vectors. sample questions
may not fully
capture the breadth
of inquiries in
computer science.
4 T. -H. Kim, S. Cho, S. Emotional Voice 2020 IEEE International Conference 1. Introduction of a voice 1. Previous VC methods
Choi, S. Park and S. -Y. Conversion Using converter using multitask learning based on seq2seq
Lee Multitask Learning with text-to-speech (TTS). models risk losing
with Text-To-Speech 2. Multitask learning aids in linguistic information.
capturing linguistic information 2. Textual supervision
and maintaining training stability. attempted to address this
but required explicit
alignment, nullifying the
benefits of seq2seq
models.
Efficient Indexing: Pinecone uses advanced indexing techniques
optimized for high-dimensional vector embeddings, enabling fast
similarity search. Scalability: Pinecone is built to handle large-scale
deployments, allowing you to store and search billions of vectors
with low latency. API Integration: Pinecone provides easy-to-use
APIs for inserting vectors, querying for nearest neighbors, and
managing indexes.
5 T. N. Thi, T. -H. Implementatio 2023 1. Successful development of 1. Limited exploration of
Do and M. Yoo n of OCR Internatio an OCR system tailored for alternative OCR models
system on nal Vietnamese book cover images. beyond those mentioned.
extracting Conferen 2. Demonstrated effectiveness 2. Lack of comparative
information ce of EAST and SAST for text analysis between different
from detection, and CRNN, SVTR, combinations of text
Vietnamese Transformer OCR for text detection and recognition
book cover recognition. models.
images

6 R. Vannala, S. AI Chatbot 2022 2. Proposed system bridges the 1. Reliance on human

B. Swathi and Y. For Answering IEEE 2nd gap between traditional FAQ agents for unsatisfactory
Puranam FAQ’s, Internatio systems and image-based responses may hinder
nal question answering. scalability.
Conferen 3. Enhancement of user 2. Potential limitations in
ce experience through seamless the chat bot's ability to
integration of AI-driven accurately interpret complex
responses and human agent image-based queries.
intervention.
7 V. Velasco, K. AI Chatbot 2023 4th 1. AI chatbots demonstrate 1.Limited inclusion of
Dedy Setiawan, Technology to Internatio significant potential in disease recent studies beyond 2020
R. Robert Predict nal prediction, offering valuable may overlook emerging
Sanjaya, M. Disease Conferen support for healthcare advancements.
Susan ce professionals. 2. Potential bias in the
Anggreainy and 2. Utilization of machine selection criteria of the
A. Kurniawan learning algorithms enhances reviewed journals may
accuracy and speed in disease affect the
diagnosis. comprehensiveness of the
analysis.

8 Hrushikesh Smart College Conferen 1. Implementation of an online 1. Dependency on a

Koundinya K.; Chatbot using ce July chatbot system for Matrusri database created by human
Ajay Krishna ML and 2020 Engineering College. experts limits scalability.
Palakurthi; Python 2. Investigation into the role of 2. Potential lack of
Vaishnavi AI and ML in improving service adaptability to diverse user
Putnala delivery, particularly through inputs beyond the trained
chatbots. responses.
Conclusion:
In conclusion, the voice-based chatbot project represents a significant
advancement in the field of human-computer interaction.

By integrating voice commands with the ability to process and understand content
from both images and PDF files, this chatbot transcends traditional text-based
systems.

It offers a versatile and dynamic tool that caters to a wide range of applications,
from educational resources to technical support and beyond.

The project’s success lies in its innovative approach to combining OCR and image
recognition with NLP, providing users with an intuitive and efficient way to access
and interact with information.

As we look to the future, the potential for further development and integration into
various industries holds the promise of transforming how we engage with digital
content

Learn IoT Programming Using Node-RED: Begin to Code Full Stack IoT Apps and Edge Devices with Raspberry Pi, NodeJS, and Grafana
From Everand
Learn IoT Programming Using Node-RED: Begin to Code Full Stack IoT Apps and Edge Devices with Raspberry Pi, NodeJS, and Grafana
Bernardo Ronquillo Japón
No ratings yet
Chatbot: International Journal of Trend in Scientific Research and Development (IJTSRD)
No ratings yet
Chatbot: International Journal of Trend in Scientific Research and Development (IJTSRD)
4 pages
College Enquiry Chat Bot
100% (2)
College Enquiry Chat Bot
47 pages
chatbot 4
No ratings yet
chatbot 4
6 pages
Ai Chatbot Bagchi 2020
No ratings yet
Ai Chatbot Bagchi 2020
6 pages
Development of A Natural Language Chatbot Interfac
No ratings yet
Development of A Natural Language Chatbot Interfac
10 pages
Res 5678
No ratings yet
Res 5678
5 pages
Paper 3128
No ratings yet
Paper 3128
6 pages
Iratj 08 00240
No ratings yet
Iratj 08 00240
6 pages
mini report-2
No ratings yet
mini report-2
20 pages
doc
No ratings yet
doc
5 pages
CPP Project Report
No ratings yet
CPP Project Report
15 pages
Encoder Decoder PDF
No ratings yet
Encoder Decoder PDF
12 pages
Published Paper
No ratings yet
Published Paper
7 pages
A Framework For Cognitive Chatbots Based On Abductiv - 2023 - Cognitive Systems
No ratings yet
A Framework For Cognitive Chatbots Based On Abductiv - 2023 - Cognitive Systems
16 pages
Chatbot Using A Knowledge in Database
No ratings yet
Chatbot Using A Knowledge in Database
7 pages
Using Chatbots As AI Conversational Part
No ratings yet
Using Chatbots As AI Conversational Part
16 pages
Applied Sciences: Using Chatbots As AI Conversational Partners in Language Learning
No ratings yet
Applied Sciences: Using Chatbots As AI Conversational Partners in Language Learning
16 pages
Designing and Implementing Conversationa
No ratings yet
Designing and Implementing Conversationa
12 pages
5bp
No ratings yet
5bp
7 pages
RP-4
No ratings yet
RP-4
6 pages
Inscription[2] (2)-1
No ratings yet
Inscription[2] (2)-1
16 pages
Major Project Synopsis
No ratings yet
Major Project Synopsis
14 pages
Novel Study On AI Based Chatbot ChatGPT Impacts On The Traditional Library Management
No ratings yet
Novel Study On AI Based Chatbot ChatGPT Impacts On The Traditional Library Management
4 pages
Edith PPT
No ratings yet
Edith PPT
22 pages
Chatbot: A Deep Neural Network Based Human To Machine Conversation Model
No ratings yet
Chatbot: A Deep Neural Network Based Human To Machine Conversation Model
7 pages
Fin Irjmets1687886863
No ratings yet
Fin Irjmets1687886863
4 pages
Project Phase 1 Progress 2
No ratings yet
Project Phase 1 Progress 2
15 pages
Futureinternet 15 00192
No ratings yet
Futureinternet 15 00192
24 pages
1nd-Progress-Presentation-2023-AI-1-update
No ratings yet
1nd-Progress-Presentation-2023-AI-1-update
15 pages
Journalsresaim Ijresm v3 I7 32
No ratings yet
Journalsresaim Ijresm v3 I7 32
3 pages
Synopsis Chatbot PDF
100% (1)
Synopsis Chatbot PDF
6 pages
The Role of Linear Algebra in Developing Scalable and Intelligent AI Chatbots
No ratings yet
The Role of Linear Algebra in Developing Scalable and Intelligent AI Chatbots
31 pages
Formatted Software Requirements Specification
No ratings yet
Formatted Software Requirements Specification
3 pages
Enhancing PDF Interaction For A More Engaging User Experience in Library: Introducing Chatpdf
No ratings yet
Enhancing PDF Interaction For A More Engaging User Experience in Library: Introducing Chatpdf
7 pages
1 s2.0 S1877050915020608 Main
No ratings yet
1 s2.0 S1877050915020608 Main
10 pages
Irjet V9i1268
No ratings yet
Irjet V9i1268
5 pages
doc-2
No ratings yet
doc-2
6 pages
Instant Download Programming Large Language Models With Azure Open Ai: Conversational Programming and Prompt Engineering With Llms (Developer Reference) 1st Edition Esposito PDF All Chapters
100% (4)
Instant Download Programming Large Language Models With Azure Open Ai: Conversational Programming and Prompt Engineering With Llms (Developer Reference) 1st Edition Esposito PDF All Chapters
40 pages
Major Project Ppt 3 2
No ratings yet
Major Project Ppt 3 2
13 pages
A Subject-Specific Chatbots For Primary Education End-Users Using Machine Learning Techniques
No ratings yet
A Subject-Specific Chatbots For Primary Education End-Users Using Machine Learning Techniques
10 pages
mini project docubot power point
No ratings yet
mini project docubot power point
17 pages
Adaptive_e_Learning_AI_Powered_Chatbot_b
No ratings yet
Adaptive_e_Learning_AI_Powered_Chatbot_b
10 pages
A Unique Approach Towards Image Publication and Provenance Using Blockchain
No ratings yet
A Unique Approach Towards Image Publication and Provenance Using Blockchain
4 pages
1 s2.0 S2667345223000317 Main
No ratings yet
1 s2.0 S2667345223000317 Main
10 pages
COLLEGE-ENQUIRY-CHAT-BOT-SYSTEM
No ratings yet
COLLEGE-ENQUIRY-CHAT-BOT-SYSTEM
5 pages
ZAX RESEARCH_PAPER_2
No ratings yet
ZAX RESEARCH_PAPER_2
8 pages
Chat 5
No ratings yet
Chat 5
23 pages
Research Paper
No ratings yet
Research Paper
4 pages
Chat With PDF
No ratings yet
Chat With PDF
18 pages
Conversation-to-Automation-in-Banking-Through-Chatbot-Using-Artificial-Machine-Intelligence-Language
No ratings yet
Conversation-to-Automation-in-Banking-Through-Chatbot-Using-Artificial-Machine-Intelligence-Language
9 pages
AI Voice Assistant Using NLP and Python Libraries (M.A.R.F)
No ratings yet
AI Voice Assistant Using NLP and Python Libraries (M.A.R.F)
8 pages
CUSTOM DATA-DRIVEN RAG CHATBOT USING API & LANGCHAIN FRAMEWORK
No ratings yet
CUSTOM DATA-DRIVEN RAG CHATBOT USING API & LANGCHAIN FRAMEWORK
18 pages
Deep Chit-Chat: Deep Learning For Chatbots: Wei Wu Rui Yan
No ratings yet
Deep Chit-Chat: Deep Learning For Chatbots: Wei Wu Rui Yan
1 page
Subin Raj Final158final
No ratings yet
Subin Raj Final158final
21 pages
Ieee Access Chatgpt
No ratings yet
Ieee Access Chatgpt
15 pages
Ijase 202106 18 2 007
No ratings yet
Ijase 202106 18 2 007
9 pages
Sample Paper 3
No ratings yet
Sample Paper 3
9 pages
Proposal Falcon AiCB
No ratings yet
Proposal Falcon AiCB
30 pages
Fin Irjmets1685011030
No ratings yet
Fin Irjmets1685011030
6 pages
U1-95 Severn ST Box Hill North - COS
No ratings yet
U1-95 Severn ST Box Hill North - COS
22 pages
Lessonplanmsc 4
No ratings yet
Lessonplanmsc 4
12 pages
Bottled Water vs. Tap Water - Pros and Cons
No ratings yet
Bottled Water vs. Tap Water - Pros and Cons
18 pages
Report
No ratings yet
Report
11 pages
CH: - 6 (The Sayyid and Lodi Dynasties) - Syed Dynasty
No ratings yet
CH: - 6 (The Sayyid and Lodi Dynasties) - Syed Dynasty
5 pages
Nife Cells User Instructions
No ratings yet
Nife Cells User Instructions
4 pages
Visual Management - An Overview
No ratings yet
Visual Management - An Overview
11 pages
210604draft Policy For The General Education Certificate GEC
No ratings yet
210604draft Policy For The General Education Certificate GEC
34 pages
Java Interview Questions
No ratings yet
Java Interview Questions
11 pages
400 Years of African American History Act
No ratings yet
400 Years of African American History Act
13 pages
WHITE_beet_E_datasheet_rev.1.02_20220630
No ratings yet
WHITE_beet_E_datasheet_rev.1.02_20220630
17 pages
International Business The Challenge of Global Competition 13th Edition Ball Solutions Manual - Read Now Or Download For A Complete Experience
100% (3)
International Business The Challenge of Global Competition 13th Edition Ball Solutions Manual - Read Now Or Download For A Complete Experience
40 pages
Week 1 - Foundations of Social Networking
No ratings yet
Week 1 - Foundations of Social Networking
10 pages
0.2 Some Basic Notions About Aerodynamics
No ratings yet
0.2 Some Basic Notions About Aerodynamics
27 pages
Nioec SP 00 03 PDF
No ratings yet
Nioec SP 00 03 PDF
11 pages
Elements With Answers
No ratings yet
Elements With Answers
6 pages
Holding Hands With An Angel - Chris Marcotte
No ratings yet
Holding Hands With An Angel - Chris Marcotte
8 pages
Class-3 NCO
No ratings yet
Class-3 NCO
2 pages
Sharqedges Catalogue 2013 WEB
No ratings yet
Sharqedges Catalogue 2013 WEB
32 pages
Fakultas Teknik, Universitas Pembangunan Nasional "Veteran" Jakarta Email
No ratings yet
Fakultas Teknik, Universitas Pembangunan Nasional "Veteran" Jakarta Email
6 pages
Physics Energy Booklet
No ratings yet
Physics Energy Booklet
53 pages
Brochure PSG Bheles
No ratings yet
Brochure PSG Bheles
2 pages
Natural Approach
No ratings yet
Natural Approach
17 pages
Grammar Unit 7 2star
No ratings yet
Grammar Unit 7 2star
1 page
AREA STATEMENT - Sheet1
No ratings yet
AREA STATEMENT - Sheet1
1 page
Full Engineering Chemistry Fundamentals and Applications 2nd Edition Shikha Agarwal PDF All Chapters
100% (5)
Full Engineering Chemistry Fundamentals and Applications 2nd Edition Shikha Agarwal PDF All Chapters
62 pages
Tenses - Study Notes
No ratings yet
Tenses - Study Notes
12 pages
12.02 The Working Limitations of Large Language Models
No ratings yet
12.02 The Working Limitations of Large Language Models
7 pages
Test 01 - Al Aqsa Mosque
No ratings yet
Test 01 - Al Aqsa Mosque
2 pages
[Ebooks PDF] download Deploy Container Applications Using Kubernetes: Implementations with microk8s and AWS EKS Shiva Subramanian full chapters
100% (3)
[Ebooks PDF] download Deploy Container Applications Using Kubernetes: Implementations with microk8s and AWS EKS Shiva Subramanian full chapters
41 pages