
14_Transcribo (3)

Transcribo is a boardroom management system that automates real-time transcription and speaker diarization to improve meeting documentation accuracy and efficiency. It utilizes advanced technologies like Whisper-Medium for transcription, Picovoice's Falcon for speaker segmentation, and PyAnnote for speaker identification. By streamlining the process of generating meeting minutes and summaries, Transcribo enhances decision-making and productivity in organizations.


Transcribo

Siddesh Patil, Prasad Alai, Yuvraj Rathod


14
Department of Information Technology,
The Bombay Salesian Society's Don Bosco Institute of Technology, Mumbai - 400070
2024-2025

Abstract

In modern organizations, manual note-taking during board meetings often results in inefficiencies and inaccuracies. Transcribo addresses this problem by providing a comprehensive boardroom management system with real-time transcription and speaker diarization. Utilizing Whisper-Medium for transcription and Picovoice's Falcon for speaker segmentation, the system ensures accurate and efficient minutes-of-meeting generation. Additionally, speaker identification and labeling using PyAnnote enhances clarity. The solution automates the transcription process, summarizes discussions, and shares the minutes with participants, streamlining decision-making and boosting productivity.

System Flow

[Figure: flowchart. Labels in the speaker enrollment branch: Audio Enrollment, Extracting Necessary Features, Speaker Embeddings (Speaker Enrollment). Labels in the meeting branch: Meeting Audio, Feature Extraction, Pyannote Speaker Diarization, Falcon Speaker Identification, Whisper, Transcript, Llama 3.1 B, Summary, SMTP, Email Distribution.]
Fig 1. System Flow

Results

[Figure]
Fig 2. Transcription Generation from Recorded Meeting

[Figure]
Fig 3. Summary Generation from Transcript
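The step where speaker labels meet the transcript can be illustrated with a minimal sketch. It assumes that diarization yields speaker turns with start and end times and that the transcription step yields timestamped text segments. Each text segment is then assigned to the speaker whose turn overlaps it the most. The `Turn` and `Segment` classes, the function names, and the sample data are hypothetical illustrations, not part of the Whisper, Falcon, or PyAnnote APIs, and the poster does not state that Transcribo merges the two outputs this way.

```python
from dataclasses import dataclass


@dataclass
class Turn:
    """A diarization result: who spoke between start and end (seconds)."""
    speaker: str
    start: float
    end: float


@dataclass
class Segment:
    """A transcription result: what was said between start and end (seconds)."""
    text: str
    start: float
    end: float


def overlap(a_start, a_end, b_start, b_end):
    """Length in seconds of the intersection of two time intervals."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def label_segments(segments, turns, unknown="Unknown"):
    """Assign each text segment to the speaker turn that overlaps it most."""
    labeled = []
    for seg in segments:
        best_speaker, best_overlap = unknown, 0.0
        for turn in turns:
            ov = overlap(seg.start, seg.end, turn.start, turn.end)
            if ov > best_overlap:
                best_speaker, best_overlap = turn.speaker, ov
        labeled.append((best_speaker, seg.text))
    return labeled


def format_minutes(labeled):
    """Merge consecutive segments by the same speaker into one line each."""
    lines = []
    for speaker, text in labeled:
        if lines and lines[-1][0] == speaker:
            lines[-1] = (speaker, lines[-1][1] + " " + text)
        else:
            lines.append((speaker, text))
    return "\n".join(f"{speaker}: {text}" for speaker, text in lines)


if __name__ == "__main__":
    # Made-up sample data for illustration only.
    turns = [Turn("Alice", 0.0, 4.0), Turn("Bob", 4.0, 9.0)]
    segments = [
        Segment("Let's review the budget.", 0.2, 2.5),
        Segment("Any objections?", 2.6, 3.9),
        Segment("None from my side.", 4.3, 6.0),
    ]
    print(format_minutes(label_segments(segments, turns)))
```

Choosing the turn with the largest overlap keeps the rule simple. A segment that spans a speaker change is credited entirely to the dominant speaker, which a production system might handle by splitting at word-level timestamps instead.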

Introduction

In contemporary organizational environments, clear and accurate meeting records are essential for effective decision-making. However, traditional manual note-taking often leads to incomplete or inaccurate minutes, causing misunderstandings and inefficiencies. Existing methods lack the advanced technology needed for efficient documentation, particularly in speech-to-text transcription and room booking management. Transcribo offers a solution by automating the minutes of meetings, providing accurate speaker recognition and automated transcription.

By simplifying the documentation process, Transcribo minimizes administrative effort, enhances collaboration, and ensures actionable insights for more effective decision-making.

Technologies

Whisper-Medium:
Performs Automatic Speech Recognition (ASR) to convert speech into text.
Uses a transformer-based architecture to generate accurate transcriptions in real-time.

Picovoice's Falcon:
Conducts Speaker Diarization by detecting speaker boundaries.
Analyzes voice characteristics to identify when the speaker changes.

PyAnnote:
Performs Speaker Identification and labels each speaker.
Utilizes deep learning models trained on speaker embeddings to match and assign speaker identities.

LLaMA 3.1B:
Generates concise summaries of the transcribed meeting content.
Utilizes natural language understanding to extract key points and generate coherent summaries.

SMTP:
Sends transcriptions and summaries to participants via email.
Establishes a secure connection to the mail server and handles email transmission.

References

[1] S. K. Gaikwad, B. W. Gawali, and P. Yannawar, "A Review on Speech Recognition Technique," Dr. Babasaheb Ambedkar Marathwada University, Aurangabad, 2023.
[2] T. von Neumann, C. Boeddeker, T. Cord-Landwehr, M. Delcroix, and R. Haeb-Umbach, "Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization," 2024.
[3] K. M. Lyu, R. Y. Lyu, and H. T. Chang, "Real-time multilingual speech recognition and speaker diarization system based on Whisper segmentation," PeerJ Computer Science, vol. 10, 2024.
[4] L. E. Shafey, H. Soltau, and I. Shafran, "Joint speech recognition and speaker diarization via sequence transduction," arXiv preprint arXiv:1907.05337, 2019.
[5] D. Al-Fraihat, Y. Sharrab, F. Alzyoud, A. Qahmash, M. Tarawneh, and A. Maaita, "Speech recognition utilizing deep learning: A systematic review of the latest developments," Human-centric Computing and Information Sciences, vol. 14, 2024.
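The idea of matching speaker embeddings against enrolled speakers, mentioned under Technologies, can be sketched in plain Python. This is an assumption-laden illustration: it compares an embedding from the meeting audio with each enrolled embedding by cosine similarity and returns the closest name above a threshold. The function names, the three-dimensional toy vectors, and the 0.7 threshold are hypothetical. Real embeddings come from a trained model and have far more dimensions, and this is not the PyAnnote or Falcon API.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def identify_speaker(embedding, enrolled, threshold=0.7):
    """Return the enrolled name most similar to `embedding`, or "Unknown".

    `enrolled` maps a speaker name to a reference embedding captured at
    enrollment. The threshold is an illustrative assumption and would need
    tuning on real data.
    """
    best_name, best_score = "Unknown", threshold
    for name, reference in enrolled.items():
        score = cosine_similarity(embedding, reference)
        if score >= best_score:
            best_name, best_score = name, score
    return best_name


if __name__ == "__main__":
    # Toy enrollment data for illustration only.
    enrolled = {"Alice": [1.0, 0.0, 0.0], "Bob": [0.0, 1.0, 0.0]}
    print(identify_speaker([0.9, 0.1, 0.0], enrolled))
    print(identify_speaker([0.0, 0.0, 1.0], enrolled))
```

Returning "Unknown" below the threshold avoids forcing a label onto a voice that was never enrolled, such as a guest attending the meeting.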
