0% found this document useful (0 votes)

16 views

sem5_synopsis

The document is a synopsis report for a mini project titled 'VoiceMate: AI-powered personal assistant' submitted by students of Shivajirao S. Jondhale College of Engineering. It covers the project's objectives, literature survey, proposed system, and the significance of voice assistants in enhancing user interaction through AI and NLP technologies. The report also highlights the limitations of existing voice assistant systems and the potential for future improvements.

Uploaded by

Shivraj Chavan

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views

sem5_synopsis

Uploaded by

Shivraj Chavan

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 23

Synopsis Report On

VoiceMate:AI-powered personal assistant

Submitted in partial fulfillment of the requirements of

T.E ARTIFICIAL INTELLIGENCE & MACHINE

LEARNING ENGINEERING
By

Snehal Barkale 02
Shivraj Chavan 06
Omkar Chendge 07

Name of the Mentor

Prof. Anita Shirture

Department of Artificial Intelligence & Machine

Learning
Shivajirao S. Jondhale College of Engineering Dombivli
(E)

University of Mumbai
(AY 2024-25)
CERTIFICATE
This is to certify that the Synopsis on Mini Project entitled “VoiceMate:AI-
powered personal assistant” is a bonafide work of Snehal Barkale (02), Shivraj
Chavan (06), Omkar Chendge (07) submitted to the University of Mumbai in
partial fulfillment for TE (Artificial Intelligence & Machine Learning Engineering)
semester V during the academic year 2024-25 as prescribed by University of
Mumbai.

Mentor
Prof. Anita Shirture

Prof Anita Shirture Dr. Renuka Deshpande Dr. Pramod Rodge

Project Coordinator Head of Department Principal

Mini Project Approval

This Synopsis on Mini Project entitled “VoiceMate:AI-powered personal

assistant” by Snehal Barkale (02), Shivraj Chavan (06), Omkar Chendge (07) is
approved for T.E. (Artificial Intelligence & Machine Learning Engineering) for
the academic year 2024-25.

Examiners

1………………………………………
(Internal Examiner Name & Sign)

2…………………………………………
(External Examiner name & Sign)

Date:

Place:
Contents
Abstract i

Acknowledgments ii

List of Abbreviations iii

List of Figures iii

1 Introduction 1
1.1 Introduction
1.2 Motivation
1.3 Problem Statement & Objectives
1.4 Organization of the Report

2 Literature Survey 4

2.1 Survey of Existing System/SRS

2.2 Limitation Existing system or Research gap
2.3 Mini Project Contribution

3 Proposed System (e.g. New Approach of Data Summarization) 17

3.1 Introduction
3.2 Architecture/ Framework
3.3 Algorithm and Process Design
3.4 Details of Hardware & Software
3.4 Experiment and Results for Validation and Verification
3.5 Analysis
3.6 Conclusion and Future work.

4 References 24
Abstract

A voice assistant is a type of artificial intelligence (AI) software application or

virtual assistant that is designed to respond to voice commands and interact with
users using natural language processing (NLP) technology. Voice assistants are
typically integrated into various devices and platforms, such as smartphones,
smart speakers, tablets, and even certain appliances, to provide users with hands-
free access to information, perform tasks, and control connected devices. The rise
of voice assistants represents a significant advancement in artificial intelligence
and human-computer interaction. Virtual assistants are designed to mimic human
interactions, enabling users to engage in natural conversations with these digital
entities. They can perform a wide range of tasks, including setting reminders,
scheduling appointments, answering questions, managing emails, and even
controlling smart home devices. Their adaptability and versatility make them an
indispensable tool for both individual users and businesses. Virtual assistants are
also becoming increasingly integrated into various devices and platforms,
including smartphones, smart speakers, and chatbots. They can understand user
preferences and tailor responses to specific needs, which fosters a more user
centric experience. This abstract delves into the technologies that power virtual
assistants, including machine learning, deep learning, and data analytics, which
enable them to continuously improve their performance and expand their
capabilities.
Acknowledgment

We sincerely wish to thank the project guide Prof. Anita Shirture for her
encouraging and inspiring guidance helped us to make our project a success. Our
project guide makes us endure with her expert guidance, kind advice and timely
motivation which helped us to determine our project.

We would like to thank our project coordinator Prof. Anita Shirture for all the
support we needed from her for our project.

We also express our deepest thanks to our HOD Dr. Renuka Deshpande who’s
benevolent helps us making available the computer facilities to us for project in
our laboratory and making it true success. Without her kind and keen co-operation
our project would have been stifled to standstill.

Lastly, we would like to thank our college principal Dr. Pramod Rodge for
providing lab facilities and permitting to go on with our project. We would also
like to thank our colleagues who helped us directly or indirectly during our project.
List of figures

Figure no. Title Page no.

3.2 Architecture 11

3.3 Algorithm 12

List of Abbreviations

1.AI Artificial intelligence

2.NLP Natural language processing

3.IOT Internet of Things

4.WI Web Intelligence

5.IROS Intelligent Robots and Systems

6. GUI Graphical User Interface

7. ICASSP International Conference on Acoustics, Speech and Signal

Processing
8. ASR Automatic Speech Recognition

9. NLG Natural Language Generation

10. STT Speech-to-Text

11. NLTK Natural Language Processing toolkit

12. SMTP Simple Mail Transfer Protocol

1. Introduction

1.1 Introduction
A voice assistant is a type of artificial intelligence (AI) software application or
virtual assistant. In the fast-paced world of today, the demand for efficiency and
convenience has led to the rise of virtual assistants, revolutionizing the way we
interact with technology and manage our daily tasks. A virtual assistant is a
computer program or application that uses artificial intelligence (AI) and natural
language processing (NLP) to provide users with a wide range of services and
support, often mimicking the role of a human personal assistant. These digital
companions have transformed the way we work, stay organized, and access
information. The concept of a virtual assistant can be traced back to the advent of
speech recognition and text-to-speech technology. Over the years, advancements
in machine learning, data analytics, and AI have allowed virtual assistants to
become increasingly sophisticated and versatile. These digital helpers are now
integrated into various devices and platforms, including smartphones, smart
speakers, smartwatches, and even cars, making them accessible to a wide range
of users.

Virtual assistants come in various forms and are often tailored to specific
applications and ecosystems. Some of the most popular virtual assistants include
Apple's Siri, Amazon's Alexa, Google Assistant, and Microsoft's Cortana. These
platforms can perform a multitude of tasks, such as answering questions, setting
reminders, sending messages, playing music, providing directions, and
controlling smart home devices.

The future of virtual assistants is incredibly promising. As AI technology

continues to evolve, virtual assistants are expected to become more personalized
and context-aware, providing users with increasingly tailored and proactive
assistance. They are likely to play a pivotal role in the development of smart cities,
healthcare, education, and various other sectors, making our lives more efficient
and convenient.
1.2 Motivation
Voice assistants are increasingly popular due to the convenience and efficiency
they offer to users. They allow hands-free operation, making it easy to perform
tasks like setting reminders, sending messages, or controlling smart home devices
using just voice commands. This convenience is especially valuable for
multitasking, as users can interact with technology while cooking, driving, or
working. Additionally, voice assistants enhance accessibility for people with
disabilities, such as those with limited mobility or vision impairments, enabling
them to interact with devices more easily.

For many, voice assistants also offer a personalized experience, adapting to

individual preferences by suggesting content, music, or news based on user
behaviour. Their integration with smart home devices further drives their appeal,
as they act as a central hub for controlling lights, thermostats, or security systems.

From the perspective of developers and companies, voice assistants provide a

powerful platform to engage users. They help businesses foster greater brand
loyalty by embedding services into users' daily routines, while also collecting
valuable data on user preferences and behaviours. This data can be used to improve
products or offer personalized services. Additionally, voice assistants open new
monetization avenues, such as voice commerce and subscription-based services.
As a frontier of natural user interfaces, voice technology represents the future of
human-computer interaction, offering intuitive, seamless engagement and driving
advancements in artificial intelligence.

1.3 Problem Statement

Design and develop a basic voice assistant that can recognize user commands and
perform simple tasks using natural language processing (NLP). The system should
be able to understand spoken commands, process them, and provide appropriate
responses or actions.
1.4 Organization of Report
When organizing a report for a voice assistant project, it's important to ensure the
structure is clear and flows logically to cover all relevant aspects comprehensively.
The report should begin with a Title Page, which includes the project title, the
names of the team members, the submission date, and the organization or
institution involved. Following this, a Table of Contents should be provided to list
all sections and subsections with corresponding page numbers, allowing readers to
easily navigate through the document.
In the Introduction, provide background information on voice assistant technology
and explain its growing relevance in today’s digital world. Clearly state the
Problem Statement of the project, detailing what the voice assistant is intended to
achieve and the problems it aims to solve.

We have also included Flowcharts to visually represent processes like voice

recognition, task execution, and response generation.
In the Development Process section, describe the methodology used to guide the
project, such as Agile or Waterfall. Explain the steps taken during development,
from planning to implementation and testing. Include any challenges faced during
the development phase and how they were addressed.
The report should conclude with a Testing and Evaluation section that discusses
the methods used to test the voice assistant's performance, user satisfaction, and
overall functionality. Include the results of these tests, as well as feedback gathered
during user testing. After this, a Conclusion and Future Work section should
summarize the project’s outcomes, the lessons learned, and any potential
improvements or features that could be added in future iterations.
Finally, include a References section for citing any sources, research, or technical
materials used throughout the report.
2. Literature Survey

2.1 Survey of Existing System/SRS

A survey of existing voice assistant systems focuses on the comparison of

popular voice assistants such as Amazon Alexa, Google Assistant, Apple Siri,
and Microsoft Cortana. These systems are evaluated based on their
technological foundations, user experience, conversational intelligence, and
applications across different domains. The key components of these systems
include natural language processing (NLP), speech recognition, and machine
learning algorithms, all of which enable them to interpret and respond to user
queries effectively.
The comparative analysis of these systems shows that each voice assistant
excels in different domains. Amazon Alexa dominates the smart home market,
offering the largest number of third-party integrations and custom skills. Google
Assistant leads in conversational intelligence and contextual understanding,
making it superior for general knowledge queries and dynamic conversation
handling. Apple Siri stands out for its focus on user privacy and device
integration, while Microsoft Cortana has refocused on enterprise solutions
rather than consumer voice services.
Assistant generally outperforms its competitors with superior natural language
understanding and contextual awareness. However, all systems face challenges
in handling conversational depth and multi-turn interactions. Privacy and
security are crucial concerns, especially for Alexa and Google Assistant, which
engage in continuous listening and data collection, prompting ongoing debates
about ethical data usage. Overall, each voice assistant serves unique user needs,
with Alexa leading in smart home automation, Google Assistant in productivity,
Siri in privacy, and Cortana in enterprise solutions.
SR AUTHOR NAME PAPER NAME DESCRIPTION
NO
1 Petukhova, Volha; "Conversational This paper discusses the
Bunt, Harry; Agents: Goals, challenges faced in natural
McGlashan, Scott; Technologies, and language processing (NLP) and
Sitter, Nathaniel; Challenges" human-computer interaction
Alexandersson, Jan (HCI).
2 Ramírez-Alcaraz, "Personal The paper examines voice
Marcos; Berrocal, Assistant for the assistants such as Amazon Alexa
Jesús; Merino, Internet of Things and Google Assistant.
Patricia; Canal, Era"
Carlos
3 Fang, Yiling; Ma, "Natural This paper explores deep
Kevin; Ng, Andrew Language learning techniques used in NLP
Processing and to build more sophisticated
Machine Learning virtual assistants.
in the
Development of
Virtual
Assistants"
4 Hoy, Matthew B. "The Anatomy of This paper delves into
Voice-Based improving the interaction
Assistants: User between users and systems like
Interaction and Siri, Alexa, and Cortana.
Machine Learning
Perspectives"
5 Moore, Robert J.; "Voice User The paper provides a
Arar, R. Interface Design: comparative analysis of popular
A Comparative voice assistants Alexa, Google
Analysis of Assistant, and Siri.
Amazon Alexa,
Google Assistant,
and Apple Siri"
6 Arora, Manan; "Improving User This study focuses on improving
Shah, Jignesh Experience in the user experience in voice-
Voice-Activated activated AI systems by
AI Systems incorporating personalization
through features.
Personalization"
7 Lau, Josephine; "Privacy in This paper addresses privacy
Zimmerman, Voice-Activated concerns related to the use of
Benjamin; Schaub, Digital voice assistants, evaluating
Florian Assistants" potential risks like continuous
listening and data usage.
8 Parker, Jeffrey; "Ethical Issues in This paper investigates the
Harper, Richard the Use of Voice ethical issues surrounding the
Assistants: Case use of voice assistants, using
Study of Alexa" Amazon Alexa as a case study.
9 Singh, Karan; "Advances in The paper explores how voice
Ramesh, V. Voice Assistants assistants are being integrated
for Healthcare into healthcare systems and
Applications" improving accessibility for
individuals with disabilities.
10 Xu, Qianli; John, "Multimodal This research focus on how
Ramesh Interaction in visual feedback, combined with
Voice Assistants: voice commands, enhances the
Enhancing User user experience.
Experience
through Visual
Feedback"

2.2 Limitations of existing systems

1. Privacy and Security Concerns

 Always Listening: Voice assistants require constant activation or are

always listening for wake words, which raises privacy concerns. Users
are concerned about unintended recordings or potential data leaks.
 Data Collection: These systems often send voice data to cloud servers for
processing, and some users worry about how their personal information
is stored, shared, and used by companies.

2. Lack of Personalization

 Generic Responses: Although some voice assistants attempt to

personalize interactions by recognizing user preferences or patterns, they
are still largely designed to offer one-size-fits-all responses
 Limited Adaptation to User Behaviour: Voice assistants generally don’t
learn and adapt significantly from user interactions. Personalization
remains superficial, often confined to reminders, preferences for apps
rather than building a deep understanding of the user’s lifestyle.
3. High Resource Consumption
 Processing Power: Voice assistants require significant computing power,
both locally (on devices like smartphones and smart speakers) and in the
cloud. This demand increases when performing complex tasks like voice
recognition, natural language understanding, and real-time responses.
 Battery Drain on Devices: Devices with voice assistants, especially
smartphones, can experience higher battery usage due to the constant
listening mode, which keeps microphones active in the background,
waiting for activation commands.

4. Limited Customization and Control

 Inflexibility in Commands: Most voice assistants offer a set of

predefined commands and responses. Users cannot extensively
customize how the assistant responds to specific queries or alter its
behaviour beyond basic settings.
 Lack of User-Controlled Features: Advanced customization options are
often locked behind developer tools, making it difficult for average users
to tailor the voice assistant to their exact needs. For example, users may
not be able to change how tasks are performed or create custom
workflows without significant technical know-how.

5. Cost Barrier for Advanced Capabilities

 Premium Features Locked Behind Paywalls: Some advanced

functionalities, such as integrating voice assistants with home
automation systems, require purchasing premium hardware or software.
For instance, unlocking certain smart home capabilities may require
expensive hub devices or subscription services.
 High Initial Setup Costs: Setting up a comprehensive smart home with
voice assistant integration can be costly. Smart speakers, lights,
thermostats, security systems, and other devices that are compatible with
voice assistants often come with a high price tag.
2.3 Mini project contribution

The voice assistance has been developed for users for educational, business and
for personal use. It has achieved the objectives and scope that were stated in this
project the project will achieve some of the below objectives:

It will have a proper Graphical User Interface (GUI). It can open chrome,
YouTube, Wikipedia, all windows applications, etc to search information and read
2 or 3 lines for the user from Wikipedia. It can open power point presentation. It
can tell us the current time. It can send mails, SMS. It can make phone calls. It can
play online music. It can predict weather. It will have a chat history keeping
feature. It will have a Face authentication system which will allow the program to
run only when it detects a face.
3. Proposed system

3.1 Introduction

Virtual assistant is software program that helps you ease your day-to-day tasks,
such as showing weather forecasting, playing music, etc. They can take commands
as voice or text. Voice based intelligent assistant need an invoking words or wake
words to active the listener, followed by commands. For my project the wake, up
word is “SOFIA”. Our voice assistant is designed to be used efficiently for all
users. This personal assistant software improves user’s productivity by managing
day to day tasks & providing information from online sources to users.

3.2 Architecture / Framework

Creating a voice assistant involves several components, including speech

recognition, natural language processing, and interaction design. Here's a
high-level architecture for a voice assistant project:

• Audio Input/Output:

o Microphone: To capture user voice commands.

o Speaker: To provide audio responses.

• Speech Recognition:

o Use a Speech-to-Text engine (ASR - Automatic Speech Recognition)

to convert spoken words into text. Popular ASR engines include
Google's Speech-to-Text, Microsoft Azure Speech Service, or open-
source solutions like Mozilla Deep-Speech.
• Natural Language Processing (NLP):

o Intent Recognition: Identify the user's intent from the transcribed

text. This involves understanding what the user wants to do or know.
o Named Entity Recognition (NER): Identify important entities like
dates, locations, and proper nouns in the user's command.

o Dialog Management: Keep track of the conversation context,

including the user's previous requests and responses.
• Knowledge Base and Data Sources:

o Store information or connect to external data sources to provide

answers to user queries. This can include APIs, databases, or web
scraping.
• Response Generation:

o Use the NLP results and the context to generate a meaningful

response.

o You can use pre-defined templates for common responses or

generate responses dynamically using NLG (Natural Language
Generation) techniques.
• Text-to-Speech (TTS):

o Convert the generated text response into speech using a Text-to-

Speech engine. Examples include Amazon Polly, Google Text-to-
Speech, or opensource TTS solutions.
• User Interface:

o Choose the platform for your voice assistant. It can be a mobile app,
a web-based interface, a smart speaker, or a custom hardware device.
The diagram shows the main process flow of how Voice Assistant works.
Fig 3.2 Voice Assiatant Framework

3.3 Algorithm and Process Design

 Module 1: Speech to text

In this module, the person’s commands are converted from speech to text using the
Google Speech to Text Cloud API. Google Speech to Text Cloud API transcribes
the speech file using the most advanced deep learning neural network algorithms
for automatic speech recognition (ASR) and returns the text statement. Google
Speech to Text Cloud API is one of the simplest methods for recognizing speech
and can analyse up to 1 min of voice data.

 Module 2: Text analysis and classification

This module is responsible for understanding the correct command from the text
generated by the Google API and then confirming it with a human to execute the
desired action. Because of the uncertainties in human language, it is extremely
challenging to create software that correctly ascertains the text’s intended meaning,
so NLP is used in this module for manipulating and recognizing the text. NLP
deconstructs the text into small units to assist the computer in understanding the
ingesting text. Different libraries and algorithms are proposed for NLP, such as the
Natural Language Processing toolkit (NLTK).
 Module 3: Command execution

Command execution in a voice assistant follows a series of steps, starting with

voice input capture, where the user speaks a command into a microphone-equipped
device (e.g., "Turn on the living room lights"). This spoken input is then converted
into text through a Speech-to-Text (STT) engine. Once the speech is converted to
text, the system processes it using Natural Language Processing (NLP). The NLP
system performs intent recognition to understand the action the user wants to
perform (e.g., turning on a device) and entity extraction to identify any specific
details or parameters related to the command (e.g., identifying the "living room"
and "lights" as key details).

Fig 3.3 Voice Assistant Flowchart

3.4 Details of Hardware and Software

Software Requirements:

• Operating System
• Speech Recognition Software
• Text-to-Speech (TTS) Software
• Local Databases
• Development Environment

Hardware Requirements:

• Microphone
• Speakers
• Processing unit (CPU/GPU)
• Storage (HDD/SSD)
• Memory (RAM)
• Power supply

3.5 Experiment and actual results

The expected result of our project is we will be developing a voice assistant that
will be useful in educational purposes, business, personal use, etc.

• It can open google

• It can open YouTube
• It can search on Wikipedia and read 2 lines for the user
• It can play music
• It can send mails
• It can tell current time.
• It can open our presentation.
• It will have a proper Graphical User Interface (GUI).
• It can open all windows applications.
• It can make phone calls.
• It can predict weather.
• It will have a chat history keeping feature.
• It will have a Face authentication system.
Fig 3.5 GUI of Voice Assistant

3.6 Analysis

As we know Python is a suitable language for scriptwriters and developers. The

query for the assistant can be manipulated as per the user’s need.

Modules needed

 Pyttsx3: This module is used for the conversion of text to speech in a

program it works offline. To install this module type the below command in
the terminal.
 Wikipedia: As we all know Wikipedia is a great source of knowledge to get
information from Wikipedia or to perform a Wikipedia search. To install this
module type the below command in the terminal.
 Speech Recognition: Since we’re building an application of voice assistant,
one of the most important things in this is that your assistant recognizes your
voice (means what you want to say/ ask). To install this module type the
below command in the terminal.
 Web browser: To perform Web Search. This module comes built-in with
Python.
 Datetime: Date and Time are used to showing Date and Time. This module
comes built-in with Python.
 Smtplib: Simple Mail Transfer Protocol (SMTP) is used as a protocol to
handle the email transfer using Python. It is used to route emails between
email servers. It is an application layer protocol which allows to users to
send mail to another

We will set our engine to Pyttsx3 which is used for text to speech in Python and
sapi5 is a Microsoft speech application platform interface we will be using this for
text to speech function.

You can change the voice Id to “0” for the Male voice while using assistant here
we are using a Female voice i.e “1” for all text to speech.

3.6 Conclusion

In conclusion, voice assistants represent a transformative technology that has

reshaped the way we interact with devices, access information, and accomplish
tasks. Their significance lies in their ability to provide accessibility, convenience,
and efficiency in a wide range of applications. From simplifying daily tasks and
enhancing productivity to improving accessibility for individuals with disabilities,
voice assistants have become an integral part of our lives. As voice assistant
technology continues to advance, we can expect even more innovative
applications and improvements in natural language understanding, making these
virtual assistants increasingly valuable in our homes, workplaces, and
communities. Voice assistants are not merely a technological convenience; they
are a powerful tool that fosters accessibility, personalization, and safety while
driving innovation in AI and human-computer interaction. Their continued
evolution promises a future where seamless voice-powered interactions will
further enrich our lives and redefine how we interact with the digital world.
References

[1] Shaughnessy, IEEE, Interacting with Computers by Voice: Automatic Speech

Recognition and Synthesis proceedings of the IEEE, vol. 91, no. 9, september
2003.

[2] Patrick Nguyen, Georg Heigold, Geoffrey Zweig, Speech Recognition with
Flat Direct Models, IEEE Journal of Selected Topics in Signal Processing,
2010

[3] Mackworth (2019-2020), Python code for voice assistant: Foundations of

Computational Agents- David L. Poole and AlanK. Mackworth.

[4] Nil Goksel, CanbekMehmet, EminMutlu, On the track of Artificial

Intelligence: Learning with Intelligent Personal Assistant, proceedings of
International Journal of Human Sciences, 2016.

[5] Keerthana S, Meghana H, Priyanka K, Sahana V. Rao, Ashwini B Smart Home

Using Internet of Things, proceedings of Perspectives in Communication,
Embedded -systems and signal processing, 2017.

[6] Sutar Shekhar, P. Sameer, Kamad Neha, Prof. Devkate Laxman, An Intelligent
Voice Assistant Using Android Platform, IJARCSMS, ISSN: 232-7782, 2017.

[7] Rishabh Shah, Siddhant Lahoti, Prof. Lavanya. K, An Intelligent Chatbot using
Natural Language Processing, International Journal of Engineering Research,
Vol.6 , pp.281-286, 2017.

[8] Luis Javier RodrÃguez-Fuentes, Mikel PeÃagarikano, AparoVarona,

GermÃ¡n

Bordel, GTTS-EHU Systems for the Albayzin 2018 Search on Speech Evaluation.

The AI Wealth Creation Blueprint PDF
67% (3)
The AI Wealth Creation Blueprint PDF
50 pages
The Age of AI and Our Human Future (Henry Kissinger, Eric Schmidt Etc.) (Z-Library)
100% (8)
The Age of AI and Our Human Future (Henry Kissinger, Eric Schmidt Etc.) (Z-Library)
148 pages
How To Hack Atm
87% (15)
How To Hack Atm
1 page
Christopher Langan - CTMU, The Cognitive-Theoretic Model of The Universe, A New Kind of Reality Theory
88% (8)
Christopher Langan - CTMU, The Cognitive-Theoretic Model of The Universe, A New Kind of Reality Theory
56 pages
Virtual Assistant Project - REPORT
85% (20)
Virtual Assistant Project - REPORT
37 pages
Data Structure and Algorithmic Thinking With Python Data Structure and Algorithmic Puzzles PDF
95% (20)
Data Structure and Algorithmic Thinking With Python Data Structure and Algorithmic Puzzles PDF
471 pages
Gayle Laakmann McDowell - Cracking The Coding Interview - 189 Programming Questions and Solutions (2015, CareerCup)
81% (48)
Gayle Laakmann McDowell - Cracking The Coding Interview - 189 Programming Questions and Solutions (2015, CareerCup)
708 pages
Gödel, Escher, Bach - An Eternal Golden Braid (20th Anniversary Edition) by Douglas R. Hofstadter (Charm-Quark) PDF
100% (10)
Gödel, Escher, Bach - An Eternal Golden Braid (20th Anniversary Edition) by Douglas R. Hofstadter (Charm-Quark) PDF
821 pages
Cracking The Coding Interview - 189 Programming Questions and Solutions (6th Edition) (EnglishOnlineClub - Com)
100% (10)
Cracking The Coding Interview - 189 Programming Questions and Solutions (6th Edition) (EnglishOnlineClub - Com)
708 pages
Chris Bailey - Hyperfocus - The New Science of Attention, Productivity, and Creativity-Viking (2018)
100% (25)
Chris Bailey - Hyperfocus - The New Science of Attention, Productivity, and Creativity-Viking (2018)
306 pages
The Art of Asking ChatGPT For High-Quality Answers A Complete Guide To Prompt Engineering Techniques (Ibrahim John) (Z-Library)
100% (24)
The Art of Asking ChatGPT For High-Quality Answers A Complete Guide To Prompt Engineering Techniques (Ibrahim John) (Z-Library)
52 pages
Mca 6 Semester: A Major Project Report ON
75% (4)
Mca 6 Semester: A Major Project Report ON
50 pages
The Fabric of Reality
100% (1)
The Fabric of Reality
6 pages
Banana Pancakes - Ukulele Chord Chart
100% (1)
Banana Pancakes - Ukulele Chord Chart
2 pages
75 Productivity Hacks - System Sunday
100% (7)
75 Productivity Hacks - System Sunday
75 pages
Military Remote Viewing Manual
100% (5)
Military Remote Viewing Manual
72 pages
Cs 229, Autumn 2016 Problem Set #2: Naive Bayes, SVMS, and Theory
No ratings yet
Cs 229, Autumn 2016 Problem Set #2: Naive Bayes, SVMS, and Theory
20 pages
Machine Learning For Humans
100% (4)
Machine Learning For Humans
97 pages
Syn Back Merged
No ratings yet
Syn Back Merged
20 pages
chatbot-5-43
No ratings yet
chatbot-5-43
39 pages
voice sample (1)
No ratings yet
voice sample (1)
44 pages
AI Report
No ratings yet
AI Report
50 pages
Bala Report
No ratings yet
Bala Report
60 pages
Just A Rather Very Intelligent System (Virtual Personal Assistant)
No ratings yet
Just A Rather Very Intelligent System (Virtual Personal Assistant)
8 pages
Project Report AI
No ratings yet
Project Report AI
34 pages
doc2
No ratings yet
doc2
59 pages
"Virtual Assistant (Youtube Play) ": Prof. Irfan A. Chaugule
No ratings yet
"Virtual Assistant (Youtube Play) ": Prof. Irfan A. Chaugule
24 pages
VIRTUAL
No ratings yet
VIRTUAL
67 pages
Ai Assistant Major Project
No ratings yet
Ai Assistant Major Project
33 pages
ai assistant ppt1
No ratings yet
ai assistant ppt1
11 pages
Major Project Report
No ratings yet
Major Project Report
38 pages
Virtual Assistant: Green University of Bangladesh
No ratings yet
Virtual Assistant: Green University of Bangladesh
14 pages
Synopsis SEM4
No ratings yet
Synopsis SEM4
24 pages
Virtual Assistant Project REPORT
No ratings yet
Virtual Assistant Project REPORT
37 pages
Final Thesis
No ratings yet
Final Thesis
41 pages
Project ADARSHA ROUTRAY Nice Project
No ratings yet
Project ADARSHA ROUTRAY Nice Project
64 pages
VIRTUAL ASSISTANT (Minor)
No ratings yet
VIRTUAL ASSISTANT (Minor)
8 pages
VoiceAssistantMiniProject Report
No ratings yet
VoiceAssistantMiniProject Report
42 pages
Virtual Assistant
No ratings yet
Virtual Assistant
21 pages
Prathu
No ratings yet
Prathu
16 pages
From Data to Impact : How Artificial Intelligent is Driving Non-Profit Success
From Everand
From Data to Impact : How Artificial Intelligent is Driving Non-Profit Success
Ziheng Song
No ratings yet
REPORTVOICE
No ratings yet
REPORTVOICE
26 pages
Ai Virtual Assistant: Karthikeyan R (Urk18Cs120)
No ratings yet
Ai Virtual Assistant: Karthikeyan R (Urk18Cs120)
31 pages
Kiki Sample Doc 2
No ratings yet
Kiki Sample Doc 2
36 pages
Developing-A-Desktop-Voice project
No ratings yet
Developing-A-Desktop-Voice project
5 pages
miniproject synopsis
No ratings yet
miniproject synopsis
7 pages
Jarvis Report Editing
No ratings yet
Jarvis Report Editing
66 pages
07 Major
No ratings yet
07 Major
27 pages
Mega Project Report
No ratings yet
Mega Project Report
38 pages
Proposal
No ratings yet
Proposal
5 pages
Final Year Project
No ratings yet
Final Year Project
46 pages
Personal AI Desktop Assistant
No ratings yet
Personal AI Desktop Assistant
8 pages
SHALLETTTTT
No ratings yet
SHALLETTTTT
46 pages
Major Voice Project
No ratings yet
Major Voice Project
29 pages
Personal Ai - Mini Project Record Final PDF
No ratings yet
Personal Ai - Mini Project Record Final PDF
32 pages
personal assistant chatbot
No ratings yet
personal assistant chatbot
5 pages
1822 B.E Cse Batchno 10
No ratings yet
1822 B.E Cse Batchno 10
56 pages
Project Report GitHub
No ratings yet
Project Report GitHub
32 pages
Synopsis
No ratings yet
Synopsis
8 pages
Swag
No ratings yet
Swag
33 pages
Ai Virtual Assistant DD
No ratings yet
Ai Virtual Assistant DD
24 pages
Voice Assistant Project Report
0% (2)
Voice Assistant Project Report
20 pages
Aditya Blacbook
No ratings yet
Aditya Blacbook
58 pages
SOURCE BOOK (1) (1)
No ratings yet
SOURCE BOOK (1) (1)
15 pages
VIRTAUAL ASSISTANT BUJJI(college).pdf
No ratings yet
VIRTAUAL ASSISTANT BUJJI(college).pdf
39 pages
CPP Project Report
No ratings yet
CPP Project Report
15 pages
Voice Assistant
No ratings yet
Voice Assistant
36 pages
Virtual Assistant Using Python
No ratings yet
Virtual Assistant Using Python
9 pages
Kooky
No ratings yet
Kooky
26 pages
Personal Voice Assistant
100% (1)
Personal Voice Assistant
118 pages
Thilak Report1
No ratings yet
Thilak Report1
40 pages
Mini Project Jarvis
No ratings yet
Mini Project Jarvis
38 pages
Python
No ratings yet
Python
62 pages
The Today and Future of WSN, AI, and IoT: A Compass and Torchbearer for the Technocrats
From Everand
The Today and Future of WSN, AI, and IoT: A Compass and Torchbearer for the Technocrats
Dr.Chandrakant
No ratings yet
“Careers in Information Technology: IoT Solutions Engineer”: GoodMan, #1
From Everand
“Careers in Information Technology: IoT Solutions Engineer”: GoodMan, #1
Patrick Mukosha
No ratings yet
Mobile Agents in Networking and Distributed Computing
From Everand
Mobile Agents in Networking and Distributed Computing
Jiannong Cao
No ratings yet
The Secrets of A Slot Machine
No ratings yet
The Secrets of A Slot Machine
4 pages
Roadmap How To Learn AI in 2024 (Uncovered AI)
No ratings yet
Roadmap How To Learn AI in 2024 (Uncovered AI)
6 pages
Teas Topics To Study
100% (12)
Teas Topics To Study
6 pages
From Music To Mathematic
100% (1)
From Music To Mathematic
4 pages
My Ai Cheat List
100% (11)
My Ai Cheat List
3 pages
2045: The Year Man Becomes Immortal
No ratings yet
2045: The Year Man Becomes Immortal
9 pages
Wisc V Interpretation
100% (1)
Wisc V Interpretation
8 pages
Attention Is All You Need
67% (3)
Attention Is All You Need
11 pages
Rationality From AI To Zombies
86% (7)
Rationality From AI To Zombies
1,813 pages
Mind Control Patents
100% (1)
Mind Control Patents
41 pages
Tech Trend 2024 Report-2
No ratings yet
Tech Trend 2024 Report-2
11 pages
Python Programming and Maching Learning 2 in 1 B08Y5DPX32
100% (7)
Python Programming and Maching Learning 2 in 1 B08Y5DPX32
145 pages
Psych Unit 7a Practice Quiz
No ratings yet
Psych Unit 7a Practice Quiz
4 pages
Current and Future Trends on AI Applications - Mohammed A Al-Sharafi
No ratings yet
Current and Future Trends on AI Applications - Mohammed A Al-Sharafi
456 pages
Chapter 6
No ratings yet
Chapter 6
12 pages
Adoption of Cloud Computing Model For Managing E-Banking System in Banking Organizations
No ratings yet
Adoption of Cloud Computing Model For Managing E-Banking System in Banking Organizations
9 pages
Bacs Payment Services Reports
100% (1)
Bacs Payment Services Reports
7 pages
Ramesh Pagidala: Professional Summary
No ratings yet
Ramesh Pagidala: Professional Summary
9 pages
GMI - Associate - Research JD
No ratings yet
GMI - Associate - Research JD
2 pages
ISO27001 Policies MoreThanWords
No ratings yet
ISO27001 Policies MoreThanWords
19 pages
Industry 40 Digital Transformation
No ratings yet
Industry 40 Digital Transformation
50 pages
AA White Paper - Setu
No ratings yet
AA White Paper - Setu
22 pages
ICCCVo LII
No ratings yet
ICCCVo LII
446 pages
Compare Kony Vs OutSystems Vs Mendix
No ratings yet
Compare Kony Vs OutSystems Vs Mendix
8 pages
Activity Sheet in TLE ICT and ENTREPRENEURSHIP
No ratings yet
Activity Sheet in TLE ICT and ENTREPRENEURSHIP
15 pages
CTRM Briefing Note - ENVERUS Trading & Risk
No ratings yet
CTRM Briefing Note - ENVERUS Trading & Risk
2 pages
Nokia On The Slope: The Failure of A Hybrid Open/closed Source Model
No ratings yet
Nokia On The Slope: The Failure of A Hybrid Open/closed Source Model
8 pages
Allioth - Mi 11x - F3 CF Program - Update
No ratings yet
Allioth - Mi 11x - F3 CF Program - Update
25 pages
Stellantis Ex FCA Scorecard Quick Reference Guide July 2021
No ratings yet
Stellantis Ex FCA Scorecard Quick Reference Guide July 2021
2 pages
TDR 10.05.2022
No ratings yet
TDR 10.05.2022
3 pages
Thotobolo Morapedi - Orange
No ratings yet
Thotobolo Morapedi - Orange
16 pages
Entrepreneurial Mind-Module 4
No ratings yet
Entrepreneurial Mind-Module 4
9 pages
Configuration Management Itil Change Management Cmii
No ratings yet
Configuration Management Itil Change Management Cmii
8 pages
What Is End-to-End Recruiting AI?
No ratings yet
What Is End-to-End Recruiting AI?
22 pages
Business Proposal DGAS...
No ratings yet
Business Proposal DGAS...
6 pages
Usability of Computer Programming Language
No ratings yet
Usability of Computer Programming Language
13 pages
About Pentonix
No ratings yet
About Pentonix
12 pages
Data Analytics in Audit
100% (1)
Data Analytics in Audit
17 pages
Graphic Designer Services Sample Proposal
No ratings yet
Graphic Designer Services Sample Proposal
12 pages
Sales Invoice Management
No ratings yet
Sales Invoice Management
5 pages
Complete SDK Guide Book 2021 - UXCam
No ratings yet
Complete SDK Guide Book 2021 - UXCam
32 pages
IBM Netcool Operations Insight Version 1.4 Deployment Guide
100% (1)
IBM Netcool Operations Insight Version 1.4 Deployment Guide
292 pages
Unit 2
No ratings yet
Unit 2
51 pages
Engineering Company Training Topics 2
No ratings yet
Engineering Company Training Topics 2
1 page

sem5_synopsis

Uploaded by

sem5_synopsis

Uploaded by

Synopsis Report On

VoiceMate:AI-powered personal assistant

Submitted in partial fulfillment of the requirements of

T.E ARTIFICIAL INTELLIGENCE & MACHINE

Name of the Mentor

Department of Artificial Intelligence & Machine

Prof Anita Shirture Dr. Renuka Deshpande Dr. Pramod Rodge

Project Coordinator Head of Department Principal

This Synopsis on Mini Project entitled “VoiceMate:AI-powered personal

List of Abbreviations iii

List of Figures iii

2.1 Survey of Existing System/SRS

3 Proposed System (e.g. New Approach of Data Summarization) 17

A voice assistant is a type of artificial intelligence (AI) software application or

Figure no. Title Page no.

1.AI Artificial intelligence

2.NLP Natural language processing

3.IOT Internet of Things

4.WI Web Intelligence

5.IROS Intelligent Robots and Systems

6. GUI Graphical User Interface

7. ICASSP International Conference on Acoustics, Speech and Signal

9. NLG Natural Language Generation

10. STT Speech-to-Text

11. NLTK Natural Language Processing toolkit

12. SMTP Simple Mail Transfer Protocol

The future of virtual assistants is incredibly promising. As AI technology

For many, voice assistants also offer a personalized experience, adapting to

From the perspective of developers and companies, voice assistants provide a

1.3 Problem Statement

We have also included Flowcharts to visually represent processes like voice

2.1 Survey of Existing System/SRS

A survey of existing voice assistant systems focuses on the comparison of

2.2 Limitations of existing systems

1. Privacy and Security Concerns

 Always Listening: Voice assistants require constant activation or are

 Generic Responses: Although some voice assistants attempt to

4. Limited Customization and Control

 Inflexibility in Commands: Most voice assistants offer a set of

5. Cost Barrier for Advanced Capabilities

 Premium Features Locked Behind Paywalls: Some advanced

3.2 Architecture / Framework

Creating a voice assistant involves several components, including speech

o Microphone: To capture user voice commands.

o Speaker: To provide audio responses.

o Use a Speech-to-Text engine (ASR - Automatic Speech Recognition)

o Intent Recognition: Identify the user's intent from the transcribed

o Dialog Management: Keep track of the conversation context,

o Store information or connect to external data sources to provide

o Use the NLP results and the context to generate a meaningful

o You can use pre-defined templates for common responses or

o Convert the generated text response into speech using a Text-to-

3.3 Algorithm and Process Design

 Module 1: Speech to text

 Module 2: Text analysis and classification

Command execution in a voice assistant follows a series of steps, starting with

Fig 3.3 Voice Assistant Flowchart

3.4 Details of Hardware and Software

3.5 Experiment and actual results

• It can open google

As we know Python is a suitable language for scriptwriters and developers. The

 Pyttsx3: This module is used for the conversion of text to speech in a

In conclusion, voice assistants represent a transformative technology that has

[1] Shaughnessy, IEEE, Interacting with Computers by Voice: Automatic Speech

[3] Mackworth (2019-2020), Python code for voice assistant: Foundations of

[4] Nil Goksel, CanbekMehmet, EminMutlu, On the track of Artificial

[5] Keerthana S, Meghana H, Priyanka K, Sahana V. Rao, Ashwini B Smart Home

[8] Luis Javier RodrÃguez-Fuentes, Mikel PeÃagarikano, AparoVarona,

You might also like