Understanding Deep Learning

Deep learning is one of the most exciting areas of machine learning and AI. This presentation covers all the very basics of deep neural networks, from the concept down to applications and why this technology is so popular in today's business landscape. This presentation is provided by the Tesseract Academy, which provides executive education for deep technical subjects such as data science and blockchain. For a video of the presentation please visit https://www.youtube.com/watch?v=RiYGluH_cx0&t=0s&list=PLVce3C5Hi9BBfabvhEzYQTQDYEg2vtuxH&index=2 For an associated blog post about deep learning also visit http://thedatascientist.com/what-deep-learning-is-and-isnt/

Uploaded by

stelios

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

100% found this document useful (1 vote)

153 views

Understanding Deep Learning

Uploaded by

stelios

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 39

Understanding deep

learning
A COMPLETE NOVICE’S PERSPECTIVE
Deep learning overview
Why now?
1. Data deluge
2. Cheaper GPUs
3. New techniques
Why is it popular?
Amazing performance in many tasks like never before
1. Machine translation
2. Speech recognition
3. Computer vision
4. Reinforcement learning
5. Natural language processing
Machine translation: Before deep
learning
Rule-based machine translation (1970s)
◦ Bilingual dictionary and linguistic rules
◦ Interlingua
◦ Find a ‘universal language’ as a middle layer
◦ Impossible task, can’t handle exceptions

Example-based machine translation (1980s)

◦ 1984, Makoto Nago (University of Tokyo)
◦ Learn through translations

Statistical machine translation (1990s)

◦ Use corpora to extract statistical relationships
Machine translation: Deep learning
Paper in 2014 by Bengio’s Lab
◦ Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
◦ https://arxiv.org/abs/1406.1078

Basic idea: Recurrent Neural Network Encoder-Decoder

Machine translation: Deep learning
27 September, 2016
A Neural Network for Machine Translation, at Production Scale
◦ https://ai.googleblog.com/2016/09/a-neural-network-for-machine.html
A few years ago we started using Recurrent Neural Networks (RNNs) to directly learn the mapping
between an input sequence (e.g. a sentence in one language) to an output sequence (that same
sentence in another language) [2].
Whereas Phrase-Based Machine Translation (PBMT) breaks an input sentence into words and phrases
to be translated largely independently, Neural Machine Translation (NMT) considers the entire input
sentence as a unit for translation.
The advantage of this approach is that it requires fewer engineering design choices than previous
Phrase-Based translation systems. When it first came out, NMT showed equivalent accuracy with
existing Phrase-Based translation systems on modest-sized public benchmark data sets.
Machine translation: deep learning
Speech recognition
Object recognition
Automatic colouring
Style transfer
Automatic text generation
NLP with deep learning
Word embeddings
Turn text into numbers
◦ Word2Vec

Perform operations on them

Based on shallow neural networks (used as input to deep neural networks)
Intuition
Automatic hierarchical feature extraction
Types of neural networks
Simple feedforward neural networks
Most common type
◦ Input: 1 vector
◦ Output: probabily, real number, or multiple outputs
Recurrent neural network
Like feedforward, but signal feeds back into itself
Recurrent neural networks
Recurrent neural networks
Useful for sequences where the past can affect the future
◦ Natural language
◦ Time series (e.g. finance)

Provide ‘memory’ to neural networks

LSTM (Long-Short Term Memory)
◦ Longer dependencies
◦ Gated Recurrent Units
RNN: Neural machine translation
Seq2Seq model
◦ Deep recurrent architecture
◦ Je suis étudiant -> I am a student
RNN: Text generation
Feed a sequence of characters
◦ Predict the next character
◦ Recurrent units keep the context

Then feed the output back into itself!

Convolutional neural networks
Use a sliding window to capture parts of an image
◦ Then use pooling
◦ E.g. keep only 1 pixel out of 9, or average their values

Allows the extraction of higher level features

◦ By utilising feature locality
◦ And ignoring noise
Feature extraction
Image classification
VCG (right), inception module (bottom), Alexnet (Middle)
Reinforcement learning
Deep Q-learning
Approach by Google Deep Mind
◦ AI company in London

Create AI that can play video games

◦ Goal to extend to real environments

Current evolution
◦ Networks play against each other
◦ Managed to beat professional Go players
Generative Adversarial Network
Putting it all together
Image captioning
Combination of convolutional units and RNN
Same architecture (but with 3d convolution) can be used for video captioning
Style transfer
Feed random images to pretrained network
Dual loss (content and style)
Train to combine the two
Images colorization
Image generation
Through GAN (left – real, right – generated)
Image translation through GANs
Tools for deep learning
https://en.wikipedia.org/wiki/Comparison_of_deep_learning_software
Tensorflow
◦ Google
◦ Very flexible
PyTorch
◦ Open source
◦ Facebook, Nvidia, Twitter and other companies develop it
◦ Useful for research
Keras
◦ Python higher-level interface for Tensorflow
Caffe
◦ Berkley AI research
◦ Useful for computer vision
Commoditised services
Google Cloud AI
◦ https://cloud.google.com/products/machine-learning/
◦ Vision, speech-to-text, text-to-speech, translation, and other

IBM
◦ https://www.ibm.com/watson/products-services/
◦ Visual recognition, translation, sentiment analysis, entity extraction

Microsoft Azure
◦ https://azure.microsoft.com/en-gb/solutions/
◦ Vision, NLP, etc.
So when to use deep learning
Amazing for anything relating to
◦ Audio
◦ Computer vision
◦ NLP
Drawbacks
◦ Loads of data
◦ Lots of processing power
◦ 1000s of hyperparameter
◦ Months of training
When to use
◦ ML or stats better for many problems (especially when datasets are smaller)
◦ If you face a computer vision, audio, etc. problem then deep learning is the best bet
◦ Try using a commoditized service before developing your own
◦ Developing your own solution -> cost effective in the long run (plus IP)
Learn more
Tesseract Academy
◦ http://tesseract.academy
◦ https://www.youtube.com/playlist?list=PLVce3C5Hi9BBfabvhEzYQTQDYEg2vtuxH
◦ Data science, big data and blockchain for executives and managers.

The Data scientist

◦ Personal blog
◦ Covers data science, analytics, blockchain, tokenomics and many more subjects
◦ http://thedatascientist.com/what-deep-learning-is-and-isnt/

Instant Ebooks Textbook Deep Generative Modeling Jakub M. Tomczak Download All Chapters
No ratings yet
Instant Ebooks Textbook Deep Generative Modeling Jakub M. Tomczak Download All Chapters
49 pages
Generative AI Interview Questions
100% (1)
Generative AI Interview Questions
12 pages
Machine Learning For Humans
100% (4)
Machine Learning For Humans
97 pages
Deep Learning Interview Questions and Answers
No ratings yet
Deep Learning Interview Questions and Answers
21 pages
Adaline/Madaline:Applications
100% (1)
Adaline/Madaline:Applications
25 pages
36 Planters Development Bank v. Lopez
0% (1)
36 Planters Development Bank v. Lopez
1 page
Introduction To Google Analytics - A Guide For Absolute Beginners
100% (2)
Introduction To Google Analytics - A Guide For Absolute Beginners
148 pages
Management and Ethics
100% (1)
Management and Ethics
12 pages
101 Gen AI Cheat Sheets. "Perhaps The Best Test of A Man's - by Anushka Bajpai - Sep, 2024 - Medium
No ratings yet
101 Gen AI Cheat Sheets. "Perhaps The Best Test of A Man's - by Anushka Bajpai - Sep, 2024 - Medium
39 pages
Practices For Governing Agentic Ai Systems
No ratings yet
Practices For Governing Agentic Ai Systems
23 pages
Analysis of Different Text Features Using NLP
No ratings yet
Analysis of Different Text Features Using NLP
7 pages
Deep Learning University
No ratings yet
Deep Learning University
129 pages
Text Summarization
No ratings yet
Text Summarization
60 pages
TensorFlow Cheatsheet Zero To Mastery V1.01
No ratings yet
TensorFlow Cheatsheet Zero To Mastery V1.01
26 pages
LlamaIndex Prompt Engineering Tutorial (FlowGPT)
No ratings yet
LlamaIndex Prompt Engineering Tutorial (FlowGPT)
20 pages
Machine Learning Tutorial
100% (1)
Machine Learning Tutorial
44 pages
Mastering Chunking in RAG - Techniques and Strategies
No ratings yet
Mastering Chunking in RAG - Techniques and Strategies
12 pages
Top 100 Deep Learning Interview Questions
No ratings yet
Top 100 Deep Learning Interview Questions
157 pages
NLP Presentation
No ratings yet
NLP Presentation
20 pages
Follow Me On For More:: Steve Nouri
No ratings yet
Follow Me On For More:: Steve Nouri
39 pages
DL
No ratings yet
DL
9 pages
Altoros Tensorflow Cheat Sheet
100% (1)
Altoros Tensorflow Cheat Sheet
1 page
Regularization_for_Neural_Networks_1718966083
No ratings yet
Regularization_for_Neural_Networks_1718966083
9 pages
Deep Learning by Andrew NG
100% (1)
Deep Learning by Andrew NG
34 pages
Machine Learning Platforms: The Definitive Guide To
No ratings yet
Machine Learning Platforms: The Definitive Guide To
39 pages
Natural Language Processing
No ratings yet
Natural Language Processing
21 pages
A Novel Adoption of LSTM in Customer Touchpoint Prediction Problems Presentation 1
No ratings yet
A Novel Adoption of LSTM in Customer Touchpoint Prediction Problems Presentation 1
73 pages
Natural Language Processing - Semantic Aspects PDF
100% (3)
Natural Language Processing - Semantic Aspects PDF
343 pages
My Self-Created Artificial Intelligence Masters Degree
100% (1)
My Self-Created Artificial Intelligence Masters Degree
10 pages
Best Machine Learning Platform Comparison
No ratings yet
Best Machine Learning Platform Comparison
38 pages
Simplified Guide To Fingerprint Analysis
No ratings yet
Simplified Guide To Fingerprint Analysis
13 pages
MLOPS
No ratings yet
MLOPS
56 pages
Speech and Language Processing
100% (1)
Speech and Language Processing
623 pages
Math of Deep Learning Neural Networks
No ratings yet
Math of Deep Learning Neural Networks
9 pages
Building Effective Agents _ Anthropic
No ratings yet
Building Effective Agents _ Anthropic
16 pages
Deep Learning Book
100% (5)
Deep Learning Book
42 pages
Natural Language Processing
No ratings yet
Natural Language Processing
49 pages
LangGraph: multi-agent systems
No ratings yet
LangGraph: multi-agent systems
9 pages
Lecture 2 - AI Agents
100% (1)
Lecture 2 - AI Agents
38 pages
Machine Learning Interviews V 2 Week 11715787639480
No ratings yet
Machine Learning Interviews V 2 Week 11715787639480
49 pages
Deep Learning Notes
No ratings yet
Deep Learning Notes
110 pages
Transformers LLMs
No ratings yet
Transformers LLMs
163 pages
Fine Tuning LLM For Enterprise: Practical Guidelines and Recommendations
No ratings yet
Fine Tuning LLM For Enterprise: Practical Guidelines and Recommendations
17 pages
Crud Rag
No ratings yet
Crud Rag
31 pages
Deep Learning Handbook
No ratings yet
Deep Learning Handbook
33 pages
Nothing To Hide:: The Privacy Expert's Guide To Artificial Intelligence and Machine Learning
No ratings yet
Nothing To Hide:: The Privacy Expert's Guide To Artificial Intelligence and Machine Learning
32 pages
A Guide To Deep Learning and Neural Networks
No ratings yet
A Guide To Deep Learning and Neural Networks
15 pages
How To Build An AI-powered Recommendation System
100% (1)
How To Build An AI-powered Recommendation System
28 pages
Deep Learning For NLP and Speech Recogni
100% (5)
Deep Learning For NLP and Speech Recogni
640 pages
Machine Learning With Python PDF
No ratings yet
Machine Learning With Python PDF
5 pages
Deep Learning PPT Full Notes
No ratings yet
Deep Learning PPT Full Notes
105 pages
52SmallChangesForTheMind Worksheets
No ratings yet
52SmallChangesForTheMind Worksheets
23 pages
Explaining Vector Databases in 3 Levels of Difficulty - by Leonie Monigatti - Jul, 2023 - Towards Data Science
No ratings yet
Explaining Vector Databases in 3 Levels of Difficulty - by Leonie Monigatti - Jul, 2023 - Towards Data Science
12 pages
Introduction To Neural Networks
No ratings yet
Introduction To Neural Networks
25 pages
Convolutional Neural Networks
100% (1)
Convolutional Neural Networks
31 pages
Lecture 1 Introduction by Dr. Fazeel Abid
No ratings yet
Lecture 1 Introduction by Dr. Fazeel Abid
26 pages
Lang Chain
No ratings yet
Lang Chain
8 pages
Agent-Based Hybrid Intelligent System
No ratings yet
Agent-Based Hybrid Intelligent System
200 pages
T Thesis Topics in Machine Learning For Research Scholars
No ratings yet
T Thesis Topics in Machine Learning For Research Scholars
14 pages
Hybrid Neural Networks: Fundamentals and Applications for Interacting Biological Neural Networks with Artificial Neuronal Models
From Everand
Hybrid Neural Networks: Fundamentals and Applications for Interacting Biological Neural Networks with Artificial Neuronal Models
Fouad Sabry
No ratings yet
Hopfield Networks: Fundamentals and Applications of The Neural Network That Stores Memories
From Everand
Hopfield Networks: Fundamentals and Applications of The Neural Network That Stores Memories
Fouad Sabry
No ratings yet
Alternating Decision Tree: Fundamentals and Applications
From Everand
Alternating Decision Tree: Fundamentals and Applications
Fouad Sabry
No ratings yet
Deep_Learning_1737909076
No ratings yet
Deep_Learning_1737909076
29 pages
Pump Life Cycle Cost
100% (3)
Pump Life Cycle Cost
19 pages
Game Theory and Business Strategy PDF
No ratings yet
Game Theory and Business Strategy PDF
22 pages
Business Model Canvas
No ratings yet
Business Model Canvas
2 pages
Coa sm0441
No ratings yet
Coa sm0441
3 pages
Resume of Zinnsqt
No ratings yet
Resume of Zinnsqt
6 pages
Advanced Accounting Baker Test Bank - Chap013
100% (2)
Advanced Accounting Baker Test Bank - Chap013
47 pages
Design and Construction of Diaphragm Walls Embedded in Rock For A Metro Project
No ratings yet
Design and Construction of Diaphragm Walls Embedded in Rock For A Metro Project
27 pages
Conditional Formatting in Excel
No ratings yet
Conditional Formatting in Excel
23 pages
(Ebook) Get It, Set It, Move It, Prove It: 60 Ways To Get Real Results In Your Organization by Mark Graham Brown ISBN 9780203519837, 0203519833 - Download the ebook today and own the complete content
100% (1)
(Ebook) Get It, Set It, Move It, Prove It: 60 Ways To Get Real Results In Your Organization by Mark Graham Brown ISBN 9780203519837, 0203519833 - Download the ebook today and own the complete content
49 pages
Year 8 Revision Questions Towards First Test
No ratings yet
Year 8 Revision Questions Towards First Test
4 pages
Basic Input Devices Table
No ratings yet
Basic Input Devices Table
4 pages
DRAM
No ratings yet
DRAM
24 pages
Smart Watch PPT Btech Project
No ratings yet
Smart Watch PPT Btech Project
21 pages
Exercises E2 42b On Cogm With Solution
No ratings yet
Exercises E2 42b On Cogm With Solution
3 pages
CL 0272
No ratings yet
CL 0272
39 pages
Gardacid X: Safety Data Sheet
No ratings yet
Gardacid X: Safety Data Sheet
6 pages
Level 1 NISM
No ratings yet
Level 1 NISM
13 pages
Tms 320 DM 6467
No ratings yet
Tms 320 DM 6467
355 pages
Gca 1
No ratings yet
Gca 1
9 pages
CPAR- WEEK 9-10
No ratings yet
CPAR- WEEK 9-10
13 pages
IELTS Reading Practice
No ratings yet
IELTS Reading Practice
5 pages
PT19 1300 Series
100% (1)
PT19 1300 Series
112 pages
1 - Introduction To Orcad
No ratings yet
1 - Introduction To Orcad
4 pages
Browser Navigation
No ratings yet
Browser Navigation
26 pages
BS Zoology 25092023
No ratings yet
BS Zoology 25092023
2 pages
Literature Review of Financial Analysis of HDFC Bank
100% (1)
Literature Review of Financial Analysis of HDFC Bank
5 pages
Tourism Operations) - Terminologies
100% (2)
Tourism Operations) - Terminologies
12 pages

Understanding Deep Learning

Uploaded by

Understanding Deep Learning

Uploaded by

Understanding deep

Example-based machine translation (1980s)

Statistical machine translation (1990s)

Basic idea: Recurrent Neural Network Encoder-Decoder

Perform operations on them

Provide ‘memory’ to neural networks

Then feed the output back into itself!

Allows the extraction of higher level features

Create AI that can play video games

The Data scientist

You might also like