35 End-To-End Conversion Speed Analysis of An FPT - AI-based Text-to-Speech Application

The document summarizes a study analyzing the end-to-end conversion speed of a text-to-speech (TTS) application based on FPT.AI. The application converts Vietnamese text to speech using 7 voices through an API. Conversion time for 400-500 character text was around 10 seconds initially and under 1.8 seconds for subsequent conversions. The proposed system was found to have advantages over existing systems by allowing download of converted audio, supporting multiple user requests, and using the FPT.AI engine.

Uploaded by

Balakrishna Chennamsetti

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

275 views4 pages

35 End-To-End Conversion Speed Analysis of An FPT - AI-based Text-to-Speech Application

Uploaded by

Balakrishna Chennamsetti

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 4

End-to-end Conversion Speed Analysis of an FPT.

AI-based Text-to-
Speech Application
In this project, an FPT.AI-based text-to-speech (TTS) application is developed that
converts Vietnamese text into spoken words. The FPT stands for Financing and
Promoting Technology. The application is developed based on Django for Python
and in the form of an interactive web page which is connected to an FPT.AI server
through its application programming interface (API). The application supports
conversion of text to seven different Vietnamese speeches. Four out of seven
voices can be used to convert up to 500 characters in a single transaction while the
others support that of 400 characters. Based on the results obtained, the first
conversion time takes up to 10 s to convert 400-character text into speech while the
subsequent times, given same text, it takes under 1.8 s for the conversion. This is
applicable to all voices. Speech synthesis is the fundamental component of many
artificial intelligence systems. With our own ambition, FPT Technology Innovation
Department has been working hard for nearly 8 years to launch FPT Speech
Synthesis. Being considered as the best integrated system of Vietnamese language
voice in the market today, FPT's new Vietnamese Speech Synthesis API is being
opened for free evaluation.

EXISTING SYSTEM:
In the existing system there are several parameters that can be used for indicating
the performance of a TTS system. The most common parameter is mean opinion
score (MOS) which is broadly used to measure the naturalness of the generated
speech. However, this is not enough to indicate the performance of a system from
customer perspective. The reason is that end-users only experience end-to-end TTS
conversion, therefore, even if the core engine is very fast, the intermediate
communication media may affect how fast the system can perform the conversion.
DISADVANTAGES OF EXISTING SYSTEM:
 Cloud Based solution not available for the MOS based TTS.
 The Converted file into mp3 is not downloadable to the user.
 It is simple Speech engine gtts
 Algorithm: mean opinion score (MOS)

PROPOSED SYSTEM:
The proposed System end-to-end conversion speed of an FPT.AI-based TTS
application is analyzed with the main focus on the relationship between the length
of the input text and its end-to-end conversion time. The main contributions of this
research are: (i) an FPT.AIbased TTS application using Django for Python, (ii)
performance analysis of the application for seven different supported voices and
several lengths of input text. In this work, FPT.AI API is used for interfacing
between local host and remote FPT TTS server. This represents an actual working
condition when an external user needs to use the API for converting text to speech.

ADVANTAGES OF PROPOSED SYSTEM:

 Each registered user is allowed to use the TTS service for multiple requests
amounting a total of 10,000 characters monthly.
 There are three main input parameters that user can key into the system: text
to convert to speech, the desired speed of generated speech and voice type.
 For each request, a response will be returned to host application by the
server. It has JavaScript Object Notation (JSON) format and contains a static
HTTP link to download the converted audio file in *.mp3 format. In
addition, the response has an error-or-success indicator.
 Algorithm: FPT.AI core engine

SYSTEM REQUIREMENTS:
HARDWARE REQUIREMENTS:

 System : Intel Core i3.

 Hard Disk : 1 TB.
 Monitor : 15’’ LED
 Input Devices : Keyboard, Mouse
 Ram : 8 GB.

SOFTWARE REQUIREMENTS:

 Operating system : Windows 10.

 Coding Language : Python
 Tool : PyCharm, Visual Studio Code
 Database : SQLite

REFERENCE:
Tran Duc Chung, Micheal Drieberg, Mohd Fadzil Bin Hassan, Alexandra
Khalyasmaa Centre for Research and Data Science (CeRDaS), Computer and
Information Science Department Automated Electrical Systems Department FPT
University, Hoa Lac Hi-Tech Park, Hanoi, Vietnam " End-to-end Conversion
Speed Analysis of an FPT.AI-based Text-to-Speech Application " Global
Conference on Life Sciences and Technologies Date Added to IEEE Xplore: 30
April 2020 INSPEC Accession Number: 19575875 DOI:
10.1109/LifeTech48969.2020.1570620448

Design and Implementation of Text to Speech audio System
No ratings yet
Design and Implementation of Text to Speech audio System
5 pages
Text To Speech Converter 25,26,27
No ratings yet
Text To Speech Converter 25,26,27
10 pages
real time voice translator
No ratings yet
real time voice translator
28 pages
Report Sample
No ratings yet
Report Sample
61 pages
Report
No ratings yet
Report
38 pages
TTS SRM Speech
No ratings yet
TTS SRM Speech
38 pages
Format of Mini_Project Report
No ratings yet
Format of Mini_Project Report
32 pages
AIspeaker
No ratings yet
AIspeaker
10 pages
PRJ Final
No ratings yet
PRJ Final
33 pages
[EN] FTI_Customer_Voicebot Estimation_Reference
No ratings yet
[EN] FTI_Customer_Voicebot Estimation_Reference
6 pages
Mini Project
No ratings yet
Mini Project
19 pages
Text To Speech Conversion
No ratings yet
Text To Speech Conversion
75 pages
Text to Speech
No ratings yet
Text to Speech
14 pages
Synopsis
No ratings yet
Synopsis
11 pages
Tamil Textual Image Reader
No ratings yet
Tamil Textual Image Reader
4 pages
Programmable Controller: c.pCO Sistema
No ratings yet
Programmable Controller: c.pCO Sistema
64 pages
Automatic Centre 2017 Christmas Catalog
100% (11)
Automatic Centre 2017 Christmas Catalog
92 pages
1.Modern Text Tool
No ratings yet
1.Modern Text Tool
8 pages
A Focus On Codemixing and Codeswitching in Tamil Speech To Text
No ratings yet
A Focus On Codemixing and Codeswitching in Tamil Speech To Text
12 pages
CXDI Software License Issue Manual Rev.07
100% (1)
CXDI Software License Issue Manual Rev.07
18 pages
Paper 5728
No ratings yet
Paper 5728
3 pages
Enroll Devices in Microsoft Intune
No ratings yet
Enroll Devices in Microsoft Intune
227 pages
imp tts
No ratings yet
imp tts
4 pages
TEXT - TO - SPEECH - CONVERSION - 22215a1211
No ratings yet
TEXT - TO - SPEECH - CONVERSION - 22215a1211
8 pages
Principles of Programming-2016
100% (1)
Principles of Programming-2016
90 pages
State Management in React - Kim Tran
No ratings yet
State Management in React - Kim Tran
92 pages
9S12 Datasheet PDF
No ratings yet
9S12 Datasheet PDF
136 pages
Abstract Final Year Project
No ratings yet
Abstract Final Year Project
1 page
QB_DEE501 microprocessor
No ratings yet
QB_DEE501 microprocessor
5 pages
6.python Text To Speech
No ratings yet
6.python Text To Speech
2 pages
5.installation of Web-e-TDS
No ratings yet
5.installation of Web-e-TDS
43 pages
SF Dump
No ratings yet
SF Dump
15 pages
LPC214x Architecture - Peripherals and Programming
No ratings yet
LPC214x Architecture - Peripherals and Programming
44 pages
mn06116 VLR Simulator for AccuLoad III
No ratings yet
mn06116 VLR Simulator for AccuLoad III
22 pages
Mobile Computing (KCS 713) unit-3
No ratings yet
Mobile Computing (KCS 713) unit-3
12 pages
System Pro M Compact InSite - Catalog Pages-Dpi
No ratings yet
System Pro M Compact InSite - Catalog Pages-Dpi
12 pages
JasperReports Server CP Install Guide
No ratings yet
JasperReports Server CP Install Guide
76 pages
Show Interface in Depth
No ratings yet
Show Interface in Depth
12 pages
MINI PROJECT REPORT New
No ratings yet
MINI PROJECT REPORT New
5 pages
Alcatel 1S 2021 (6025D) - EU ID Card
No ratings yet
Alcatel 1S 2021 (6025D) - EU ID Card
2 pages
Programmable Interrupt Controller (SUB: Microprocessor and Interfaces)
No ratings yet
Programmable Interrupt Controller (SUB: Microprocessor and Interfaces)
7 pages
Nokia Case Study
100% (1)
Nokia Case Study
11 pages
Kafka Cluster
No ratings yet
Kafka Cluster
11 pages
CSE ([email protected] II-Sem) EXP-7: Output
No ratings yet
CSE ([email protected] II-Sem) EXP-7: Output
4 pages
HP Laserjet Text Codes A-Z
No ratings yet
HP Laserjet Text Codes A-Z
6 pages
Introduction To Lad Sim
No ratings yet
Introduction To Lad Sim
7 pages
FPGA Implementation of Convolutional Encoder and Hard Decision Viterbi Decoder
No ratings yet
FPGA Implementation of Convolutional Encoder and Hard Decision Viterbi Decoder
5 pages
Tps400 Qs v2 1 0 English
No ratings yet
Tps400 Qs v2 1 0 English
4 pages
Applications of C++ - Uses of C++ - C++ Tutorial
100% (1)
Applications of C++ - Uses of C++ - C++ Tutorial
2 pages
VPLEX Uptime Bulletin Q211
No ratings yet
VPLEX Uptime Bulletin Q211
4 pages
How To Debug NW BPC
No ratings yet
How To Debug NW BPC
2 pages
Urus Partition Hard Disk Dengan EASEUS Partition Master
No ratings yet
Urus Partition Hard Disk Dengan EASEUS Partition Master
2 pages
Learning Python Network Programming
From Everand
Learning Python Network Programming
Dr. M. O. Faruque Sarker
5/5 (2)
AI Mastery in Python: Unleashing the Power of OpenAI API
From Everand
AI Mastery in Python: Unleashing the Power of OpenAI API
Dargslan
No ratings yet
Python: For Beginners A Crash Course Guide To Learn Python in 1 Week
From Everand
Python: For Beginners A Crash Course Guide To Learn Python in 1 Week
Timothy C. Needham
4/5 (25)
Learn Python in 10 Minutes
From Everand
Learn Python in 10 Minutes
Victor Ebai
4/5 (30)
LEARN PYTHON PROGRAMMING: A Comprehensive Guide for Beginners to Master Python Programming (2024)
From Everand
LEARN PYTHON PROGRAMMING: A Comprehensive Guide for Beginners to Master Python Programming (2024)
ELISE HARRISON
No ratings yet
Python Programming for Beginners: A Comprehensive Crash Course With Practical Exercises to Quickly Learn Coding and Programming for Data Analysis and Machine Learning
From Everand
Python Programming for Beginners: A Comprehensive Crash Course With Practical Exercises to Quickly Learn Coding and Programming for Data Analysis and Machine Learning
Anthony Adams
4.5/5 (6)
Building AI Applications with OpenAI APIs: Leverage ChatGPT, Whisper, and DALL-E APIs to build 10 innovative AI projects
From Everand
Building AI Applications with OpenAI APIs: Leverage ChatGPT, Whisper, and DALL-E APIs to build 10 innovative AI projects
Martin Yanev
No ratings yet
Python for Beginners
From Everand
Python for Beginners
Jo Foster
No ratings yet
Python Programming For Beginners: Learn The Basics Of Python Programming (Python Crash Course, Programming for Dummies)
From Everand
Python Programming For Beginners: Learn The Basics Of Python Programming (Python Crash Course, Programming for Dummies)
James Tudor
5/5 (1)
Mastering Python Networking - Third Edition: Your one-stop solution to using Python for network automation, programmability, and DevOps, 3rd Edition
From Everand
Mastering Python Networking - Third Edition: Your one-stop solution to using Python for network automation, programmability, and DevOps, 3rd Edition
Eric Chou
3/5 (2)
Mastering Python Programming for Beginners
From Everand
Mastering Python Programming for Beginners
gareth thomas
No ratings yet
A Guide to Python Mastery: Python
From Everand
A Guide to Python Mastery: Python
Ummed Singh
No ratings yet
Learn Python in 7 Days
From Everand
Learn Python in 7 Days
Mohit
No ratings yet
Learn PHP Programming in 7Days: Ultimate PHP Crash Course For Beginners
From Everand
Learn PHP Programming in 7Days: Ultimate PHP Crash Course For Beginners
Austin Myers
3/5 (11)
PYTHON FOR BEGINNERS: Master the Basics of Python Programming and Start Writing Your Own Code in No Time (2023 Guide for Beginners)
From Everand
PYTHON FOR BEGINNERS: Master the Basics of Python Programming and Start Writing Your Own Code in No Time (2023 Guide for Beginners)
Glen Jennings
No ratings yet
The ChatGPT Handbook
From Everand
The ChatGPT Handbook
PA BOOKS
4/5 (1)
Practical Guide to Python: From Basics to Advanced Programming
From Everand
Practical Guide to Python: From Basics to Advanced Programming
Arcadia J. Darell
No ratings yet
Profound Python
From Everand
Profound Python
Onder Teker
5/5 (1)
Mastering ServiceStack: Utilize ServiceStack as the rock solid foundation of your distributed system
From Everand
Mastering ServiceStack: Utilize ServiceStack as the rock solid foundation of your distributed system
Andreas Niedermair
No ratings yet
Learning .NET High-performance Programming
From Everand
Learning .NET High-performance Programming
Antonio Esposito
No ratings yet
Chat GPT Prompt Engineering With Tech Trends: Tech trends, #1
From Everand
Chat GPT Prompt Engineering With Tech Trends: Tech trends, #1
ATHEER Mahir
No ratings yet
Python for Engineers: Solving Real-World Technical Challenges
From Everand
Python for Engineers: Solving Real-World Technical Challenges
Robert Johnson
No ratings yet
Your First Python Program
From Everand
Your First Python Program
Alexander Paz
No ratings yet
PYTHON FOR BEGINNERS: A Comprehensive Guide to Learning Python Programming from Scratch (2023)
From Everand
PYTHON FOR BEGINNERS: A Comprehensive Guide to Learning Python Programming from Scratch (2023)
Denton Freeman
No ratings yet
Python Programming For Beginners: Python Programming Language Tutorial
From Everand
Python Programming For Beginners: Python Programming Language Tutorial
Joseph Joyner
No ratings yet
Essential Python 3
From Everand
Essential Python 3
Kevin Vans-Colina
No ratings yet
How to use ChatGPT
From Everand
How to use ChatGPT
Bernhard Gaum
No ratings yet
The 1 Page Python Book
From Everand
The 1 Page Python Book
Barani Kumar
2/5 (1)
ChatGPT Ultimate User Guide - How to Make Money Online Faster and More Precise Using AI Technology
From Everand
ChatGPT Ultimate User Guide - How to Make Money Online Faster and More Precise Using AI Technology
Maximus Wilson
1/5 (1)
The Best Python Programming Step-By-Step Beginners Guide Easily Master Software engineering with Machine Learning, Data Structures, Syntax, Django Object-Oriented Programming, and AI application
From Everand
The Best Python Programming Step-By-Step Beginners Guide Easily Master Software engineering with Machine Learning, Data Structures, Syntax, Django Object-Oriented Programming, and AI application
Chris Williamson
No ratings yet
Understanding Python: Beginner's Guide to Programming
From Everand
Understanding Python: Beginner's Guide to Programming
Sabry Fattah
No ratings yet
Learn PHP in 24 Hours
From Everand
Learn PHP in 24 Hours
Alex Nordeen
No ratings yet
Python for Beginners: Learn It as Easy as Pie
From Everand
Python for Beginners: Learn It as Easy as Pie
Yatin Bayya
No ratings yet
How To Program A Mobile Game
From Everand
How To Program A Mobile Game
Duong Tran
4/5 (1)
Python Programming Illustrated For Beginners & Intermediates: “Learn By Doing” Approach-Step By Step Ultimate Guide To Mastering Python: The Future Is Here!: The Future Is Here!
From Everand
Python Programming Illustrated For Beginners & Intermediates: “Learn By Doing” Approach-Step By Step Ultimate Guide To Mastering Python: The Future Is Here!: The Future Is Here!
William Sullivan
4/5 (2)
ASP.NET For Beginners: The Simple Guide to Learning ASP.NET Web Programming Fast!
From Everand
ASP.NET For Beginners: The Simple Guide to Learning ASP.NET Web Programming Fast!
Tim Warren
No ratings yet
Python Programming Illustrated For Beginners & Intermediates“Learn By Doing” Approach-Step By Step Ultimate Guide To Mastering Python: The Future Is Here!
From Everand
Python Programming Illustrated For Beginners & Intermediates“Learn By Doing” Approach-Step By Step Ultimate Guide To Mastering Python: The Future Is Here!
William Sullivan
3/5 (1)
Python Fundamentals
From Everand
Python Fundamentals
IntroBooks Team
No ratings yet

35 End-To-End Conversion Speed Analysis of An FPT - AI-based Text-to-Speech Application

Uploaded by

35 End-To-End Conversion Speed Analysis of An FPT - AI-based Text-to-Speech Application

Uploaded by

End-to-end Conversion Speed Analysis of an FPT.

ADVANTAGES OF PROPOSED SYSTEM:

 System : Intel Core i3.

 Operating system : Windows 10.

You might also like