Available online at www.sciencedirect.com
ScienceDirect
Procedia Computer Science 199 (2022) 1432–1437
www.elsevier.com/locate/procedia
A Novel Machine Lip Reading Model

Hongyang Huang, Chai Song*, Jin Ting, Taoling Tian, Chen Hong, Zhang Di, Danni Gao

[email protected]
Southwest University for Nationalities
Chengdu, China
Abstract
Lip reading is the technology of obtaining language content by analyzing the changes in a speaker's lip shape and recognizing the information carried by the lip movement. Lip reading helps people with hearing disabilities understand what other people are saying, a task that is very difficult for humans. This paper proposes a novel lip reading model using the Transformer network to achieve high recognition accuracy. The main process of the model includes the processing of the data set, the extraction of lip features using a pre-trained neural network, and the input of these features into the Transformer network for training. Finally, our model achieves a word-level lip reading accuracy of 45.81% on the open-source GRID corpus.
© 2021 The Authors. Published by Elsevier B.V.
This is an open access article under the CC BY-NC-ND license (https://creativecommons.org/licenses/by-nc-nd/4.0)
Peer-review under responsibility of the scientific committee of The 8th International Conference on Information Technology and Quantitative Management (ITQM 2020 & 2021)
Keywords: Lip Reading, Transformer, Transfer learning
1. Introduction
In China, in the sixth national census, the number of people with hearing and language disabilities reached 20.7 million, accounting for 1.67% of the total population of the country [1]. Lip reading can assist the hearing impaired to communicate with others through lip movement. However, lip reading is very difficult for humans. In the study of Easton et al. [2], for hearing-impaired people without lip reading training, the recognition rate reaches only 29% on a corpus of just 30 syllables, and only 32% when the corpus consists of 30 compound words. Obviously, reading a language from the lips is a very difficult task.
In recent years, deep learning has developed rapidly, and it has become possible for machines to understand lips. In 2017, Google proposed a new model, the Transformer network [3], which is constructed using a self-attention mechanism instead of the CNN and RNN structures commonly used in deep learning. In a traditional RNN, the input of each state contains the output of the previous state, causing the RNN to be slow in some sequential processing tasks. The Transformer network adopts a self-attention mechanism, which effectively solves the problem that the RNN cannot be parallelized and greatly improves the speed of model training. Since then, the Transformer has been widely used in the field of NLP and has achieved remarkable results in machine translation, speech recognition and other directions.
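To make the self-attention mechanism concrete, the following is a minimal sketch of scaled dot-product self-attention in the sense of [3], written in PyTorch (the framework used later in this paper); the function name, tensor shapes and projection sizes are illustrative assumptions, not the authors' code.

```python
import torch
import torch.nn.functional as F

def self_attention(x, w_q, w_k, w_v):
    """Minimal scaled dot-product self-attention over one sequence.

    x: (seq_len, d_model) input; w_q, w_k, w_v: (d_model, d_k) projections.
    Every position attends to every other position in parallel, which is
    what removes the step-by-step dependency of an RNN.
    """
    q, k, v = x @ w_q, x @ w_k, x @ w_v                   # linear projections
    scores = q @ k.transpose(-2, -1) / q.size(-1) ** 0.5  # (seq_len, seq_len)
    weights = F.softmax(scores, dim=-1)                   # attention weights
    return weights @ v                                    # weighted sum of values

# Example with shapes matching the 75 x 512 lip features used later.
x = torch.randn(75, 512)
w_q, w_k, w_v = (torch.randn(512, 64) for _ in range(3))
out = self_attention(x, w_q, w_k, w_v)                    # (75, 64)
```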
In this paper, the pre-trained neural network VGG16 is used to extract the features of the lips in the video. As the extracted feature dimensions are too high, we adopt dimensionality reduction operations to process these features. After obtaining lip features with lower dimensions, the features are input into our Transformer network for training. The experiments show that training these features with our Transformer network can significantly reduce the training cost and improve the lip reading accuracy of the model.
The rest of this paper is organized as follows: Section II reviews studies related to lip reading recognition; Section III analyzes the model in detail; Section IV describes the detailed process of the experiment; Section V concludes the paper.
2. Related Work
Lip reading technology was first proposed by W. H. Sumby and I. Pollack in 1954 [4], but the first real Automatic Lipreading System was established by Petajan at the University of Illinois in 1984 [5]. In recent years, computer vision and computer speech technology have continued to make breakthroughs, and lip recognition, as a comprehensive reflection of image, speech and natural language processing technology, has also made great progress.
In terms of lip reading techniques based on traditional computer methods: in 1984, Petajan et al. proposed, for the first time, a lip reading system with the single word as the minimum recognition unit. It calculates the features of the lip image sequence, carries out a Nearest Neighbor search over all samples in the feature database, and outputs the most similar feature samples as the predicted results. In 1998, Gerasimos Potamianos et al. [6] studied a visual front end for automatic lip reading based on the Hidden Markov model and proposed two methods for extracting lip features: a feature method based on the lip contour and a method based on image change. In 2007, Zhao et al. [7] proposed a spatiotemporal local binary pattern recognition method to solve the problem of isolated phrase recognition, and used an SVM (Support Vector Machine) to recognize phrases.
In terms of deep learning based lip reading: Wand et al. introduced LSTM (Long Short-Term Memory) networks into lip reading research, and the recognition accuracy of their model reached 79.6% in word-level lip reading [8]. In 2016, Chung et al. from the University of Oxford published the LRW data set for the field of lip reading and established the WLAS (Watch, Listen, Attend and Spell) network, which achieved a classification accuracy of 61.1% [9]. In the same year, the Oxford Artificial Intelligence Laboratory, the DeepMind team, and the Canadian Institute for Advanced Research (CIFAR) jointly released the LipNet lip reading model [10], the first end-to-end sentence-level lip reading model that can simultaneously learn spatiotemporal visual features and a sequence model. It adopted STCNN (Spatiotemporal Convolution), LSTM and CTC loss (Connectionist Temporal Classification loss), and was the best lip reading model at that time.
3. Model

Our model construction mainly consists of three parts: the first part is dataset processing, the second part is feature extraction, and the third part is model training. The overall structure of the model is shown in Figure 1.
Figure 1. The overall structure of the model: the lip region is cropped from each video clip (75 frames of size 3 × 224 × 224), passed through the VGG16 feature layers (stacks of 3 × 3 convolutions and pooling that map 3 × 224 × 224 to 64 × 112 × 112, 128 × 56 × 56, 256 × 28 × 28, 512 × 14 × 14 and finally 512 × 7 × 7 per frame), reduced to a 75 × 512 feature sequence, and fed into the Transformer encoder and decoder, which output the label probabilities.

(2). Feature extraction
In this paper, the pre-trained neural network VGG16 [11] is used for transfer learning. After fine-tuning the VGG16 network structure, features are extracted from the lips in the video, which not only effectively extracts high-dimensional lip features but also avoids the large time cost of training a model from scratch. Here, we used all the feature layers of VGG16 to extract the features of the lips. Since the feature dimensions extracted by the VGG16 pre-trained model are too large, it is necessary to carry out a corresponding dimensionality reduction operation before inputting the features into our model for training. Two dimensionality reduction methods are adopted in this paper: the first is to convolve the extracted features, reducing the number of convolution kernels to achieve dimensionality reduction; the second is to reduce the feature dimension by adding a fully connected layer.
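As a hedged illustration of this step, the sketch below loads the torchvision VGG16 feature layers and shows one plausible form of each reduction option described above; the exact kernel sizes and layer names are assumptions based on the shapes in Figure 1, not the authors' released code.

```python
import torch
import torch.nn as nn
from torchvision import models

# Pre-trained VGG16 feature layers (transfer learning; weights frozen here).
vgg = models.vgg16(pretrained=True).features.eval()
for p in vgg.parameters():
    p.requires_grad = False

frames = torch.randn(75, 3, 224, 224)        # one clip: 75 lip-region frames
with torch.no_grad():
    feats = vgg(frames)                       # (75, 512, 7, 7) per-frame features

# Option 1: convolve the feature maps; a 7 x 7 convolution collapses the
# spatial grid so that each frame becomes a single 512-d vector.
reduce_conv = nn.Conv2d(512, 512, kernel_size=7)
seq1 = reduce_conv(feats).flatten(1)          # (75, 512)

# Option 2: flatten and apply a fully connected layer to reduce the dimension.
reduce_fc = nn.Linear(512 * 7 * 7, 512)
seq2 = reduce_fc(feats.flatten(1))            # (75, 512)
```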
(3). Transformer
Transformer is a model proposed by Google in 2017. The model follows the traditional encoder-decoder structure, but the Encoder and Decoder parts do not use structures such as RNN and CNN; instead, they use an attention mechanism to build the model. We have modified the Transformer network and added it to our model to train the lip reading dataset. The Transformer network structure is shown in Figure 4 below:
Figure 4. The structure of the Transformer network: the encoder takes the 75 × 512 lip features as input, the decoder applies masked multi-head attention to the label sequence, and the outputs are the label probabilities; each encoder and decoder block (repeated N times) contains multi-head attention, a feed-forward sub-layer and Add & Norm connections.

Encoder: The encoder is composed of N identical layers, and each layer has two sub-layers: a multi-head self-attention
mechanism and a position-wise fully connected feed-forward neural network. A residual connection is used around each of the two sub-layers, followed by layer normalization, so the output of each sub-layer is LayerNorm(x + Sublayer(x)).
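The sub-layer wiring can be summarized in a few lines; this is a minimal sketch of the LayerNorm(x + Sublayer(x)) pattern, with the class name chosen for illustration.

```python
import torch.nn as nn

class ResidualSublayer(nn.Module):
    """Wraps a sub-layer (attention or feed-forward) with the residual
    connection and layer normalization described above."""
    def __init__(self, sublayer, d_model=512):
        super().__init__()
        self.sublayer = sublayer
        self.norm = nn.LayerNorm(d_model)

    def forward(self, x):
        # Output of each sub-layer: LayerNorm(x + Sublayer(x))
        return self.norm(x + self.sublayer(x))
```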
Decoder: The decoder is also composed of N identical layers. In addition to the two sub-layers of each encoder layer, the decoder inserts a third sub-layer. This sub-layer is a multi-head attention mechanism that uses masking so that the prediction at the current position cannot be affected by subsequent positions.
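Such masking is commonly implemented as an upper-triangular mask over the attention scores; the small sketch below is illustrative rather than the authors' implementation.

```python
import torch

def subsequent_mask(size):
    """Boolean mask that blocks attention from position i to any j > i,
    so the prediction at the current position cannot see later states."""
    return torch.triu(torch.ones(size, size), diagonal=1).bool()

mask = subsequent_mask(5)
# mask[i, j] is True where attention must be suppressed, typically by
# setting the corresponding scores to -inf before the softmax.
```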
The Transformer network has achieved significant results in machine translation tasks, and we need to modify the structure of the Transformer network for the task we are dealing with: 1) the encoder input of the model is a sequence of high-dimensional lip features, which does not require word embedding; 2) the encoder input and the decoder input of the model are not padded to a fixed length.
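Under these two modifications the encoder consumes the 75 × 512 lip-feature sequence directly; the following is a hedged sketch using PyTorch's built-in Transformer encoder modules, with the layer count and head count as assumptions since the paper does not state them.

```python
import torch
import torch.nn as nn

# Assumed hyperparameters: 6 layers and 8 heads, as in the original
# Transformer; the paper does not report the values it used.
encoder_layer = nn.TransformerEncoderLayer(d_model=512, nhead=8)
encoder = nn.TransformerEncoder(encoder_layer, num_layers=6)

# The 75 x 512 lip features are fed in as-is: no word-embedding lookup,
# and no padding, since every clip already contains exactly 75 frames.
lip_features = torch.randn(75, 1, 512)    # (seq_len, batch, d_model)
memory = encoder(lip_features)            # (75, 1, 512)
```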
4. Experiment
The experimental model was trained on a Dell workstation running Ubuntu 18.04. The GPU is a 24 GB Quadro P6000, the processor is an Intel Xeon(R) Gold 512, and the memory is 128 GB. The model was built using the PyTorch (1.6.0) framework.
The optimizer used in the model is Adagrad, an adaptive optimization method that adaptively assigns a different learning rate to each parameter. In Adagrad optimization, we set the initial learning rate to 7e-4 and trained for 30 epochs. Finally, the accuracy of the model reached 45.81% in word-level lip reading. The change of the loss function and accuracy of the model is shown in Figure 5.
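The reported optimizer settings translate directly into a short training-loop sketch; the model, loss function, class count and data below are placeholders, since the paper does not specify them.

```python
import torch
import torch.nn as nn

model = nn.Linear(512, 52)                 # placeholder for the full network; 52 classes assumed
optimizer = torch.optim.Adagrad(model.parameters(), lr=7e-4)  # initial lr 7e-4
criterion = nn.CrossEntropyLoss()          # assumed loss; not stated in the paper

for epoch in range(30):                    # 30 epochs, as reported above
    features = torch.randn(75, 512)        # dummy batch of lip features
    labels = torch.randint(0, 52, (75,))   # dummy word labels
    optimizer.zero_grad()
    loss = criterion(model(features), labels)
    loss.backward()
    optimizer.step()
```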
5. Conclusion

The accuracy of our model reaches 45.81% in word-level lip reading, which forms a simple lip reading system. The recognition accuracy of the model still needs to be improved. We believe that with continuous debugging and improvement of the model in the future, a lip reading model with high accuracy can be achieved.
6. Acknowledgement
This paper is supported by the Key Research and Development Project of Sichuan Province (2021YFG0358)
and the Fundamental Research Funds for the Central Universities, Southwest Minzu University (2021PTJS24).
References
[1] http://www.stats.gov.cn/tjsj/zxfb/201104/t20110428_12705.html.
[2] Easton R D, Basala M. Perceptual dominance during lipreading[J]. Perception & Psychophysics, 1982, 32(6): 562-570.
[3] Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need[J]. arXiv preprint arXiv:1706.03762, 2017.
[4] Sumby W H, Pollack I. Visual contribution to speech intelligibility in noise[J]. The Journal of the Acoustical Society of America, 1954, 26(2): 212-215.
[5] Petajan E D. Automatic Lipreading to Enhance Speech Recognition (Speech Reading)[J]. 1985.
[6] Potamianos G, Graf H P, Cosatto E. An image transform approach for HMM based automatic lipreading[C]//Proceedings 1998
International Conference on Image Processing. ICIP98 (Cat. No. 98CB36269). IEEE, 1998: 173-177.
[7] Zhao G, Pietikäinen M, Hadid A. Local spatiotemporal descriptors for visual recognition of spoken phrases[C]//Proceedings of the
international workshop on Human-centered multimedia. 2007: 57-66.
[8] Wand M, Koutník J, Schmidhuber J. Lipreading with long short-term memory[C]//2016 IEEE International Conference on Acoustics,
Speech and Signal Processing (ICASSP). IEEE, 2016: 6115-6119.
[9] Chung J S, Zisserman A. Lip reading in the wild[C]//Asian Conference on Computer Vision. Springer, Cham, 2016: 87-103.
[10] Assael Y M, Shillingford B, Whiteson S, et al. Lipnet: End-to-end sentence-level lipreading[J]. arXiv preprint arXiv:1611.01599,
2016.
[11] Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition[J]. arXiv preprint arXiv:1409.1556,
2014.
[12] Maas A, Xie Z, Jurafsky D, et al. Lexicon-free conversational speech recognition with neural networks[C]//Proceedings of the 2015
Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2015:
345-354.
[13] Zhang C, Zhang S. Lip Reading using CNN Lip Deflection Classifier and GAN Two-Stage Lip Corrector[C]//Journal of Physics:
Conference Series. IOP Publishing, 2021, 1883(1): 012134.