SlideShare a Scribd company logo
Deep Learning for Developers
Julien Simon, AI Evangelist, EMEA
@julsimon
What	to	expect
• An introduction to Deep Learning
• An introduction to Apache MXNet
• Demos using Jupyter notebooks on Amazon SageMaker
• Resources
• Artificial	Intelligence:	design	software	applications	which	exhibit	
human-like	behavior,	e.g.	speech,	natural	language	processing,	
reasoning	or	intuition
• Machine	Learning:	teach	machines	to	learn	without	being	
explicitly	programmed
• Deep	Learning:	using	neural	networks,	teach	machines	to	learn	
from	complex	data	where	features	cannot	be	explicitly	expressed
Amazon Echo
Amazon Alexa is based on Deep Learning
Amazon AI is based on Deep Learning
https://www.oreilly.com/ideas/self-driving-trucks-enter-the-fast-lane-using-deep-learning
Last June, tuSimple drove an autonomous truck
for 200 miles from Yuma, AZ to San Diego, CA
An	introduction	to	Deep	Learning
Activation functions
The neuron
! xi ∗ wi
&
'()
= u
”Multiply and Accumulate”
Source: Wikipedia
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
x =
x11, x12, …. x1I
x21, x22, …. x2I
… … …
xm1, xm2, …. xmI
I features
m samples
y =
y1
y2
…
ym
m labels,
N2 outputs
0,0,1,0,0,…,0
1,0,0,0,0,…,0
…
0,0,0,0,1,…,0
One-hot encoding
Neural networks
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
x =
x11, x12, …. x1I
x21, x22, …. x2I
… … …
xm1, xm2, …. xmI
I features
m samples
y =
y1
y2
…
ym
m labels,
N2 categories
Total number of predictions
Accuracy =
Number of correct predictions
0,0,1,0,0,…,0
1,0,0,0,0,…,0
…
0,0,0,0,1,…,0
One-hot encoding
Neural networks
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Neural networks
Initially, the network will not predict correctly
f(X1) = Y’1
A loss function measures the difference between
the real label Y1 and the predicted label Y’1
error = loss(Y1, Y’1)
For a batch of samples:
! loss(Yi,	Y’i)	
,-./0	2345
3()
= batch	error
The purpose of the training process is to
minimize error by gradually adjusting weights
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Training
Training data set Training
Trained
neural network
Batch size
Learning rate
Number of epochs
Hyper parameters
Backpropagation
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Validation
Validation data set Trained
neural network
Validation
accuracy
Prediction at
the end of
each epoch
Save the model at the end of each epoch
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Early stopping
Training accuracy
Loss function
Accuracy
100%
Epochs
Validation accuracy
Loss
Best	checkpoint
OVERFITTING
Apache	MXNet
Apache MXNet: Open Source library for Deep Learning
Programmable Portable High Performance
Near linear scaling
across hundreds of
GPUs
Highly efficient
models for
mobile
and IoT
Simple syntax,
multiple
languages
Most Open Best On AWS
Optimized for
Deep Learning on AWS
Accepted into the
Apache Incubator
MXNet 1.0 released on December 4th
Object Detection
https://github.com/precedenceguo/mx-rcnn https://github.com/zhreshold/mxnet-yolo
Object Segmentation
https://github.com/TuSimple/mx-maskrcnn
Text Detection and Recognition
https://github.com/Bartzi/stn-ocr
Face Detection
https://github.com/tornadomeet/mxnet-face
Real-Time Pose Estimation
https://github.com/dragonfly90/mxnet_Realtime_Multi-Person_Pose_Estimation
Machine Translation
https://github.com/awslabs/sockeye
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
The Apache MXNet API
• Storing and accessing data in multi-dimensional arrays
àNDArray API
• Building models (layers, weights, activation functions)
à Symbol API
• Serving data during training and validation
à Iterators
• Training and using models
à Module API
Demos	
https://github.com/juliensimon/dlnotebooks
1) Synthetic	data	set
2) Classify	images	with	pre-trained	models
3) Learn	MNIST	with	a	Multi-Layer	Perceptron
4) Learn	MNIST	with	the	LeNet CNN
5) Predict	handmade	MNIST	samples
6) Train	and	host	a	MNIST	model	with	Amazon	SageMaker
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Resources
https://aws.amazon.com/machine-learning
https://aws.amazon.com/blogs/ai
https://mxnet.incubator.apache.org
https://github.com/apache/incubator-mxnet
https://github.com/gluon-api
https://github.com/awslabs/sockeye
An overview of Amazon SageMaker https://www.youtube.com/watch?v=ym7NEYEx9x4
https://medium.com/@julsimon
Thank you!
Julien Simon, AI Evangelist, EMEA
@julsimon

More Related Content

What's hot (13)

PPTX
Build, train, and deploy Machine Learning models at scale (May 2018)
Julien SIMON
 
PPTX
Machine Learning inference at the Edge
Julien SIMON
 
PPTX
Build, train, and deploy Machine Learning models at scale (May 2018)
Julien SIMON
 
PPTX
Machine Learning: From Notebook to Production with Amazon Sagemaker (January ...
Julien SIMON
 
PDF
New AI/ML services at AWS re:Invent 2017
Julien SIMON
 
PDF
IoT & Deep Learning on AWS
Julien SIMON
 
PPTX
AI on a PI
Julien SIMON
 
PPTX
Picking the right AWS backend for your application (September 2017)
Julien SIMON
 
PDF
Accelerate your Machine Learning workflows with Amazon SageMaker
Julien SIMON
 
PDF
Amazon SageMaker workshop
Julien SIMON
 
PPTX
Optimize your Machine Learning Workloads on AWS (July 2019)
Julien SIMON
 
PDF
Scalable Deep Learning on AWS with Apache MXNet
Julien SIMON
 
PDF
Picking the right AWS backend for your Java application
Julien SIMON
 
Build, train, and deploy Machine Learning models at scale (May 2018)
Julien SIMON
 
Machine Learning inference at the Edge
Julien SIMON
 
Build, train, and deploy Machine Learning models at scale (May 2018)
Julien SIMON
 
Machine Learning: From Notebook to Production with Amazon Sagemaker (January ...
Julien SIMON
 
New AI/ML services at AWS re:Invent 2017
Julien SIMON
 
IoT & Deep Learning on AWS
Julien SIMON
 
AI on a PI
Julien SIMON
 
Picking the right AWS backend for your application (September 2017)
Julien SIMON
 
Accelerate your Machine Learning workflows with Amazon SageMaker
Julien SIMON
 
Amazon SageMaker workshop
Julien SIMON
 
Optimize your Machine Learning Workloads on AWS (July 2019)
Julien SIMON
 
Scalable Deep Learning on AWS with Apache MXNet
Julien SIMON
 
Picking the right AWS backend for your Java application
Julien SIMON
 

Similar to Deep Learning for Developers (December 2017) (14)

PPTX
Emotion recognition in images: from idea to a model in production - Nordic DS...
Hagay Lupesko
 
PDF
Emotion Recognition in Images
Apache MXNet
 
PDF
An Introduction to Amazon AI Services
Dinah Barrett
 
PDF
Artificial Intelligence (Machine Learning) on AWS: How to Start
Vladimir Simek
 
PDF
Maschinelles Lernen auf AWS für Entwickler, Data Scientists und Experten
AWS Germany
 
PDF
Time series modeling workd AMLD 2018 Lausanne
Sunil Mallya
 
PPTX
What is deep learning (and why you should care) - Talk at SJSU Oct 2018
Hagay Lupesko
 
PDF
Julien Simon, Principal Technical Evangelist at Amazon - Machine Learning: Fr...
Codiax
 
PPTX
Amazon SageMaker (December 2018)
Julien SIMON
 
PDF
Scalable Deep Learning on AWS using Apache MXNet (May 2017)
Julien SIMON
 
PDF
Machine Learning on the Cloud with Apache MXNet
delagoya
 
PDF
Scalable Deep Learning on AWS using Apache MXNet (May 2017)
Julien SIMON
 
PDF
Introducing aws deep lens
Richard Harvey
 
PDF
엔터프라이즈를 위한 머신러닝 그리고 AWS (김일호 솔루션즈 아키텍트, AWS) :: AWS Techforum 2018
Amazon Web Services Korea
 
Emotion recognition in images: from idea to a model in production - Nordic DS...
Hagay Lupesko
 
Emotion Recognition in Images
Apache MXNet
 
An Introduction to Amazon AI Services
Dinah Barrett
 
Artificial Intelligence (Machine Learning) on AWS: How to Start
Vladimir Simek
 
Maschinelles Lernen auf AWS für Entwickler, Data Scientists und Experten
AWS Germany
 
Time series modeling workd AMLD 2018 Lausanne
Sunil Mallya
 
What is deep learning (and why you should care) - Talk at SJSU Oct 2018
Hagay Lupesko
 
Julien Simon, Principal Technical Evangelist at Amazon - Machine Learning: Fr...
Codiax
 
Amazon SageMaker (December 2018)
Julien SIMON
 
Scalable Deep Learning on AWS using Apache MXNet (May 2017)
Julien SIMON
 
Machine Learning on the Cloud with Apache MXNet
delagoya
 
Scalable Deep Learning on AWS using Apache MXNet (May 2017)
Julien SIMON
 
Introducing aws deep lens
Richard Harvey
 
엔터프라이즈를 위한 머신러닝 그리고 AWS (김일호 솔루션즈 아키텍트, AWS) :: AWS Techforum 2018
Amazon Web Services Korea
 
Ad

More from Julien SIMON (20)

PDF
Arcee AI - building and working with small language models (06/25)
Julien SIMON
 
PDF
deep_dive_multihead_latent_attention.pdf
Julien SIMON
 
PDF
Deep Dive: Model Distillation with DistillKit
Julien SIMON
 
PDF
Deep Dive: Parameter-Efficient Model Adaptation with LoRA and Spectrum
Julien SIMON
 
PDF
Building High-Quality Domain-Specific Models with Mergekit
Julien SIMON
 
PDF
Tailoring Small Language Models for Enterprise Use Cases
Julien SIMON
 
PDF
Tailoring Small Language Models for Enterprise Use Cases
Julien SIMON
 
PDF
Julien Simon - Deep Dive: Compiling Deep Learning Models
Julien SIMON
 
PDF
Tailoring Small Language Models for Enterprise Use Cases
Julien SIMON
 
PDF
Julien Simon - Deep Dive - Optimizing LLM Inference
Julien SIMON
 
PDF
Julien Simon - Deep Dive - Accelerating Models with Better Attention Layers
Julien SIMON
 
PDF
Julien Simon - Deep Dive - Quantizing LLMs
Julien SIMON
 
PDF
Julien Simon - Deep Dive - Model Merging
Julien SIMON
 
PDF
An introduction to computer vision with Hugging Face
Julien SIMON
 
PDF
Reinventing Deep Learning
 with Hugging Face Transformers
Julien SIMON
 
PDF
Building NLP applications with Transformers
Julien SIMON
 
PPTX
Building Machine Learning Models Automatically (June 2020)
Julien SIMON
 
PDF
Starting your AI/ML project right (May 2020)
Julien SIMON
 
PPTX
Scale Machine Learning from zero to millions of users (April 2020)
Julien SIMON
 
PPTX
An Introduction to Generative Adversarial Networks (April 2020)
Julien SIMON
 
Arcee AI - building and working with small language models (06/25)
Julien SIMON
 
deep_dive_multihead_latent_attention.pdf
Julien SIMON
 
Deep Dive: Model Distillation with DistillKit
Julien SIMON
 
Deep Dive: Parameter-Efficient Model Adaptation with LoRA and Spectrum
Julien SIMON
 
Building High-Quality Domain-Specific Models with Mergekit
Julien SIMON
 
Tailoring Small Language Models for Enterprise Use Cases
Julien SIMON
 
Tailoring Small Language Models for Enterprise Use Cases
Julien SIMON
 
Julien Simon - Deep Dive: Compiling Deep Learning Models
Julien SIMON
 
Tailoring Small Language Models for Enterprise Use Cases
Julien SIMON
 
Julien Simon - Deep Dive - Optimizing LLM Inference
Julien SIMON
 
Julien Simon - Deep Dive - Accelerating Models with Better Attention Layers
Julien SIMON
 
Julien Simon - Deep Dive - Quantizing LLMs
Julien SIMON
 
Julien Simon - Deep Dive - Model Merging
Julien SIMON
 
An introduction to computer vision with Hugging Face
Julien SIMON
 
Reinventing Deep Learning
 with Hugging Face Transformers
Julien SIMON
 
Building NLP applications with Transformers
Julien SIMON
 
Building Machine Learning Models Automatically (June 2020)
Julien SIMON
 
Starting your AI/ML project right (May 2020)
Julien SIMON
 
Scale Machine Learning from zero to millions of users (April 2020)
Julien SIMON
 
An Introduction to Generative Adversarial Networks (April 2020)
Julien SIMON
 
Ad

Recently uploaded (20)

PPTX
Building a Production-Ready Barts Health Secure Data Environment Tooling, Acc...
Barts Health
 
PPTX
UI5Con 2025 - Beyond UI5 Controls with the Rise of Web Components
Wouter Lemaire
 
PDF
Empowering Cloud Providers with Apache CloudStack and Stackbill
ShapeBlue
 
PDF
2025-07-15 EMEA Volledig Inzicht Dutch Webinar
ThousandEyes
 
PDF
SFWelly Summer 25 Release Highlights July 2025
Anna Loughnan Colquhoun
 
PDF
Human-centred design in online workplace learning and relationship to engagem...
Tracy Tang
 
PDF
Productivity Management Software | Workstatus
Lovely Baghel
 
PDF
HydITEx corporation Booklet 2025 English
Георгий Феодориди
 
PDF
TrustArc Webinar - Data Privacy Trends 2025: Mid-Year Insights & Program Stra...
TrustArc
 
PDF
Wojciech Ciemski for Top Cyber News MAGAZINE. June 2025
Dr. Ludmila Morozova-Buss
 
PDF
Impact of IEEE Computer Society in Advancing Emerging Technologies including ...
Hironori Washizaki
 
PDF
Bitcoin+ Escalando sin concesiones - Parte 1
Fernando Paredes García
 
PDF
Sustainable and comertially viable mining process.pdf
Avijit Kumar Roy
 
PDF
Rethinking Security Operations - Modern SOC.pdf
Haris Chughtai
 
PDF
Women in Automation Presents: Reinventing Yourself — Bold Career Pivots That ...
DianaGray10
 
PDF
Building Resilience with Digital Twins : Lessons from Korea
SANGHEE SHIN
 
PPTX
Lifting and Rigging Safety AQG-2025-2.pptx
farrukhkhan658034
 
PDF
How Current Advanced Cyber Threats Transform Business Operation
Eryk Budi Pratama
 
PDF
Are there government-backed agri-software initiatives in Limerick.pdf
giselawagner2
 
PDF
Market Wrap for 18th July 2025 by CIFDAQ
CIFDAQ
 
Building a Production-Ready Barts Health Secure Data Environment Tooling, Acc...
Barts Health
 
UI5Con 2025 - Beyond UI5 Controls with the Rise of Web Components
Wouter Lemaire
 
Empowering Cloud Providers with Apache CloudStack and Stackbill
ShapeBlue
 
2025-07-15 EMEA Volledig Inzicht Dutch Webinar
ThousandEyes
 
SFWelly Summer 25 Release Highlights July 2025
Anna Loughnan Colquhoun
 
Human-centred design in online workplace learning and relationship to engagem...
Tracy Tang
 
Productivity Management Software | Workstatus
Lovely Baghel
 
HydITEx corporation Booklet 2025 English
Георгий Феодориди
 
TrustArc Webinar - Data Privacy Trends 2025: Mid-Year Insights & Program Stra...
TrustArc
 
Wojciech Ciemski for Top Cyber News MAGAZINE. June 2025
Dr. Ludmila Morozova-Buss
 
Impact of IEEE Computer Society in Advancing Emerging Technologies including ...
Hironori Washizaki
 
Bitcoin+ Escalando sin concesiones - Parte 1
Fernando Paredes García
 
Sustainable and comertially viable mining process.pdf
Avijit Kumar Roy
 
Rethinking Security Operations - Modern SOC.pdf
Haris Chughtai
 
Women in Automation Presents: Reinventing Yourself — Bold Career Pivots That ...
DianaGray10
 
Building Resilience with Digital Twins : Lessons from Korea
SANGHEE SHIN
 
Lifting and Rigging Safety AQG-2025-2.pptx
farrukhkhan658034
 
How Current Advanced Cyber Threats Transform Business Operation
Eryk Budi Pratama
 
Are there government-backed agri-software initiatives in Limerick.pdf
giselawagner2
 
Market Wrap for 18th July 2025 by CIFDAQ
CIFDAQ
 

Deep Learning for Developers (December 2017)