SlideShare a Scribd company logo
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Julien Simon
Principal Technical Evangelist, AI and Machine Learning
@julsimon
Adding Image and Video Analysis
to your Applications
Put Machine Learning in the hands of every developer
and data scientist
Our mission
Application
Services
Platform
Services
Frameworks
& Infrastructure
API-driven services:Vision, Language & Speech Services, Chatbots
AWS ML Stack
Deploy machine learning models with high-performance machine learning algorithms,
broad framework support, and one-click training, tuning, and inference.
Develop sophisticated models with any framework, create managed, auto-scaling
clusters of GPUs for large scale training, or run prediction
on trained models.
Application
Services
Platform
Services
Frameworks
& Infrastructure
API-driven services:Vision, Language & Speech Services, Chatbots
AWS ML Stack
Deploy machine learning models with high-performance machine learning algorithms,
broad framework support, and one-click training, tuning, and inference.
Develop sophisticated models with any framework, create managed, auto-scaling
clusters of GPUs for large scale training, or run prediction
on trained models.
© 2018, Amazon Web Services, Inc. or Its Affiliates. All rights reserved.
Amazon Rekognition
Amazon Rekognition
Fully
managed
service
Easy-to-use
API
Low costImage
analysis
Video
analysis
Easy-to-use deep learning-based computer vision analysis
Real-time and
batch analysis
Amazon Rekognition Image
Object and Scene
Detection
Facial
Analysis
Face
Recognition
Text in ImageUnsafe Image
Detection
Celebrity
Recognition
Detect objects, scenes, and faces, extract text, recognize celebrities, and identify unsafe
content in images
Face Comparison
Object & Scene Detection
Facial Analysis
Crowd Detection – up to 100 faces
Facial Search
Explicit Nudity
Nudity
Graphic Male Nudity
Graphic Female Nudity
Sexual Activity
Partial Nudity
Suggestive
Female Swimwear or Underwear
Male Swimwear or Underwear
Revealing Clothes
Image Moderation
Celebrity Recognition
Text in Image
Marinus Analytics
Marinus Analytics provides law enforcement
with tools founded in artificial intelligence.
Traffic Jam, is a suite of tools for use by law
enforcement agencies on sex trafficking
investigations.
Before using Amazon Rekognition, their only
recourse was manual processing; this was
time-intensive or not possible.
Now, investigators are able to take effective
action by searching through millions of
records in seconds to find victims.
http://www.marinusanalytics.com/articles/2017/
10/17/amazon-rekognition-helps-marinus-
analytics-fight-human-trafficking
Rekognition demo
© 2018, Amazon Web Services, Inc. or Its Affiliates. All rights reserved.
Amazon Rekognition Video
Manually intensive Slow and error-prone Expensive
Challenges of video analysis
Temporal information lost Motion context lost
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon Rekognition Video
Analyze activity, recognize, and track people in stored and live video
Object and activity
detection
Person tracking
Face recognition
Real-time
live stream
Unsafe video
detection
Celebrity
recognition
Facial
Analysis
Sky News will use Amazon
Rekognition to perform real-
time identification of guests
as they enter St. George’s
Chapel
Use case: video search index
Video Amazon S3 AWS Lambda Amazon Rekognition Video
Amazon Elasticsearch Amazon DynamoDB
1. Video is uploaded
and stored to S3
2. Amazon Rekognition Video creates
metadata for celebrities, emotions, key
topics in video with time segments for
search
4. Lambda also pushes
the metadata and confidence scores
into Elasticsearch
3. The output is persisted as
metadata into DynamoDB to
ensure durability
Indexed 99,000 people
Saves ~9,000 hours a year in labor
CSPAN: automatic footage tagging
Amazon RekognitionVideo
Public safety
Recognize a person of interest across a collection
of millions in real time across hundreds of cameras
Track a person of interest across a video
Create alerts by detecting objects and activities
of interest, such as car, license plate, running, etc.
Use case: public safety immediate response
Live Street Camera Amazon Kinesis Video Streams Amazon Rekognition Video Face collection
1. Camera-captured video
streams are processed by Kinesis
Video Streams
2. Amazon Rekognition Video analyses the
video and searches faces on screen against
a collection of millions of faces
User
3. User is notified
in case of face matches
Amazon SNS AWS Lambda Amazon Kinesis
Streams
Orlando: real-time public safety
Real-time video analysis
”The City of Orlando is excited to work with
Amazon to pilot the latest in public safety
software through a unique, first-of-it's-kind public-
private partnership.Through the pilot, Orlando will
utilize Amazon Rekognition Video and Amazon
KinesisVideo Streams technology in a way that
will use existing City resources to provide real-time
detection and notification of persons-of-interests,
further increasing public safety, and operational
efficiency opportunities for the City of Orlando and
other cities across the nation”.
John Mina, Police Chief, City of Orlando
Rekognition Video demo
https://aws.amazon.com/blogs/machine-learning/easily-perform-facial-analysis-on-live-feeds-by-creating-a-
serverless-video-analytics-environment-with-amazon-rekognition-video-and-amazon-kinesis-video-streams
FRAMEWORKS AND INTERFACES
PLATFORM SERVICES
APPLICATION SERVICES
Amazon
Rekognition
Amazon
Polly
Amazon
Lex
Democratization of ML
Amazon
Rekognition
Video
Amazon
Transcribe
Amazon
Comprehend
Amazon
SageMaker
AWS DeepLens Amazon EMR
Deep Learning
AMI
Amazon
Translate
Thank you!
Julien Simon
PrincipalTechnical Evangelist, AI and Machine Learning
@julsimon
https://aws.amazon.com/machine-learning
https://aws.amazon.com/rekognition
https://aws.amazon.com/blogs/ai
https://medium.com/@julsimon
https://youtube.com/juliensimonfr
Ad

More Related Content

Similar to Adding Image and Video Analysis to your Applications (May 2018) (10)

AWS의 인공지능 기반 비디오 분석 서비스 소개 l 남궁영환 솔루션즈 아키텍트
AWS의 인공지능 기반 비디오 분석 서비스 소개 l 남궁영환 솔루션즈 아키텍트AWS의 인공지능 기반 비디오 분석 서비스 소개 l 남궁영환 솔루션즈 아키텍트
AWS의 인공지능 기반 비디오 분석 서비스 소개 l 남궁영환 솔루션즈 아키텍트
Amazon Web Services Korea
 
Amazon Rekognition
Amazon RekognitionAmazon Rekognition
Amazon Rekognition
Amazon Web Services Japan
 
AI for developers
AI for developersAI for developers
AI for developers
Julien SIMON
 
AWS 기반 인공지능 비디오 분석 서비스 소개::Ranju Das::AWS Summit Seoul 2018
AWS 기반 인공지능 비디오 분석 서비스 소개::Ranju Das::AWS Summit Seoul 2018AWS 기반 인공지능 비디오 분석 서비스 소개::Ranju Das::AWS Summit Seoul 2018
AWS 기반 인공지능 비디오 분석 서비스 소개::Ranju Das::AWS Summit Seoul 2018
Amazon Web Services Korea
 
Integrando Machine Learning - da ingestão à persistência - AWS
Integrando Machine Learning - da ingestão à persistência - AWS Integrando Machine Learning - da ingestão à persistência - AWS
Integrando Machine Learning - da ingestão à persistência - AWS
Hugo Rozestraten
 
Visão Geral de Inteligência Artificial
Visão Geral de Inteligência ArtificialVisão Geral de Inteligência Artificial
Visão Geral de Inteligência Artificial
Amazon Web Services LATAM
 
AI now ron tolido, capgemini cwin18_toulouse
AI now ron tolido, capgemini cwin18_toulouseAI now ron tolido, capgemini cwin18_toulouse
AI now ron tolido, capgemini cwin18_toulouse
Capgemini
 
[AWS Media Symposium 2019] Enhancing your Media Workflows with Amazon Machine...
[AWS Media Symposium 2019] Enhancing your Media Workflows with Amazon Machine...[AWS Media Symposium 2019] Enhancing your Media Workflows with Amazon Machine...
[AWS Media Symposium 2019] Enhancing your Media Workflows with Amazon Machine...
Amazon Web Services Korea
 
Machine Learning on AWS (December 2018)
Machine Learning on AWS (December 2018)Machine Learning on AWS (December 2018)
Machine Learning on AWS (December 2018)
Julien SIMON
 
Artificial Intelligence on the AWS Platform
Artificial Intelligence on the AWS PlatformArtificial Intelligence on the AWS Platform
Artificial Intelligence on the AWS Platform
Adrian Hornsby
 
AWS의 인공지능 기반 비디오 분석 서비스 소개 l 남궁영환 솔루션즈 아키텍트
AWS의 인공지능 기반 비디오 분석 서비스 소개 l 남궁영환 솔루션즈 아키텍트AWS의 인공지능 기반 비디오 분석 서비스 소개 l 남궁영환 솔루션즈 아키텍트
AWS의 인공지능 기반 비디오 분석 서비스 소개 l 남궁영환 솔루션즈 아키텍트
Amazon Web Services Korea
 
AWS 기반 인공지능 비디오 분석 서비스 소개::Ranju Das::AWS Summit Seoul 2018
AWS 기반 인공지능 비디오 분석 서비스 소개::Ranju Das::AWS Summit Seoul 2018AWS 기반 인공지능 비디오 분석 서비스 소개::Ranju Das::AWS Summit Seoul 2018
AWS 기반 인공지능 비디오 분석 서비스 소개::Ranju Das::AWS Summit Seoul 2018
Amazon Web Services Korea
 
Integrando Machine Learning - da ingestão à persistência - AWS
Integrando Machine Learning - da ingestão à persistência - AWS Integrando Machine Learning - da ingestão à persistência - AWS
Integrando Machine Learning - da ingestão à persistência - AWS
Hugo Rozestraten
 
AI now ron tolido, capgemini cwin18_toulouse
AI now ron tolido, capgemini cwin18_toulouseAI now ron tolido, capgemini cwin18_toulouse
AI now ron tolido, capgemini cwin18_toulouse
Capgemini
 
[AWS Media Symposium 2019] Enhancing your Media Workflows with Amazon Machine...
[AWS Media Symposium 2019] Enhancing your Media Workflows with Amazon Machine...[AWS Media Symposium 2019] Enhancing your Media Workflows with Amazon Machine...
[AWS Media Symposium 2019] Enhancing your Media Workflows with Amazon Machine...
Amazon Web Services Korea
 
Machine Learning on AWS (December 2018)
Machine Learning on AWS (December 2018)Machine Learning on AWS (December 2018)
Machine Learning on AWS (December 2018)
Julien SIMON
 
Artificial Intelligence on the AWS Platform
Artificial Intelligence on the AWS PlatformArtificial Intelligence on the AWS Platform
Artificial Intelligence on the AWS Platform
Adrian Hornsby
 

More from Julien SIMON (20)

deep_dive_multihead_latent_attention.pdf
deep_dive_multihead_latent_attention.pdfdeep_dive_multihead_latent_attention.pdf
deep_dive_multihead_latent_attention.pdf
Julien SIMON
 
Deep Dive: Model Distillation with DistillKit
Deep Dive: Model Distillation with DistillKitDeep Dive: Model Distillation with DistillKit
Deep Dive: Model Distillation with DistillKit
Julien SIMON
 
Deep Dive: Parameter-Efficient Model Adaptation with LoRA and Spectrum
Deep Dive: Parameter-Efficient Model Adaptation with LoRA and SpectrumDeep Dive: Parameter-Efficient Model Adaptation with LoRA and Spectrum
Deep Dive: Parameter-Efficient Model Adaptation with LoRA and Spectrum
Julien SIMON
 
Building High-Quality Domain-Specific Models with Mergekit
Building High-Quality Domain-Specific Models with MergekitBuilding High-Quality Domain-Specific Models with Mergekit
Building High-Quality Domain-Specific Models with Mergekit
Julien SIMON
 
Tailoring Small Language Models for Enterprise Use Cases
Tailoring Small Language Models for Enterprise Use CasesTailoring Small Language Models for Enterprise Use Cases
Tailoring Small Language Models for Enterprise Use Cases
Julien SIMON
 
Tailoring Small Language Models for Enterprise Use Cases
Tailoring Small Language Models for Enterprise Use CasesTailoring Small Language Models for Enterprise Use Cases
Tailoring Small Language Models for Enterprise Use Cases
Julien SIMON
 
Julien Simon - Deep Dive: Compiling Deep Learning Models
Julien Simon - Deep Dive: Compiling Deep Learning ModelsJulien Simon - Deep Dive: Compiling Deep Learning Models
Julien Simon - Deep Dive: Compiling Deep Learning Models
Julien SIMON
 
Tailoring Small Language Models for Enterprise Use Cases
Tailoring Small Language Models for Enterprise Use CasesTailoring Small Language Models for Enterprise Use Cases
Tailoring Small Language Models for Enterprise Use Cases
Julien SIMON
 
Julien Simon - Deep Dive - Optimizing LLM Inference
Julien Simon - Deep Dive - Optimizing LLM InferenceJulien Simon - Deep Dive - Optimizing LLM Inference
Julien Simon - Deep Dive - Optimizing LLM Inference
Julien SIMON
 
Julien Simon - Deep Dive - Accelerating Models with Better Attention Layers
Julien Simon - Deep Dive - Accelerating  Models with Better Attention LayersJulien Simon - Deep Dive - Accelerating  Models with Better Attention Layers
Julien Simon - Deep Dive - Accelerating Models with Better Attention Layers
Julien SIMON
 
Julien Simon - Deep Dive - Quantizing LLMs
Julien Simon - Deep Dive - Quantizing LLMsJulien Simon - Deep Dive - Quantizing LLMs
Julien Simon - Deep Dive - Quantizing LLMs
Julien SIMON
 
Julien Simon - Deep Dive - Model Merging
Julien Simon - Deep Dive - Model MergingJulien Simon - Deep Dive - Model Merging
Julien Simon - Deep Dive - Model Merging
Julien SIMON
 
An introduction to computer vision with Hugging Face
An introduction to computer vision with Hugging FaceAn introduction to computer vision with Hugging Face
An introduction to computer vision with Hugging Face
Julien SIMON
 
Reinventing Deep Learning
 with Hugging Face Transformers
Reinventing Deep Learning
 with Hugging Face TransformersReinventing Deep Learning
 with Hugging Face Transformers
Reinventing Deep Learning
 with Hugging Face Transformers
Julien SIMON
 
Building NLP applications with Transformers
Building NLP applications with TransformersBuilding NLP applications with Transformers
Building NLP applications with Transformers
Julien SIMON
 
Building Machine Learning Models Automatically (June 2020)
Building Machine Learning Models Automatically (June 2020)Building Machine Learning Models Automatically (June 2020)
Building Machine Learning Models Automatically (June 2020)
Julien SIMON
 
Scale Machine Learning from zero to millions of users (April 2020)
Scale Machine Learning from zero to millions of users (April 2020)Scale Machine Learning from zero to millions of users (April 2020)
Scale Machine Learning from zero to millions of users (April 2020)
Julien SIMON
 
An Introduction to Generative Adversarial Networks (April 2020)
An Introduction to Generative Adversarial Networks (April 2020)An Introduction to Generative Adversarial Networks (April 2020)
An Introduction to Generative Adversarial Networks (April 2020)
Julien SIMON
 
AIM410R1 Deep learning applications with TensorFlow, featuring Fannie Mae (De...
AIM410R1 Deep learning applications with TensorFlow, featuring Fannie Mae (De...AIM410R1 Deep learning applications with TensorFlow, featuring Fannie Mae (De...
AIM410R1 Deep learning applications with TensorFlow, featuring Fannie Mae (De...
Julien SIMON
 
AIM361 Optimizing machine learning models with Amazon SageMaker (December 2019)
AIM361 Optimizing machine learning models with Amazon SageMaker (December 2019)AIM361 Optimizing machine learning models with Amazon SageMaker (December 2019)
AIM361 Optimizing machine learning models with Amazon SageMaker (December 2019)
Julien SIMON
 
deep_dive_multihead_latent_attention.pdf
deep_dive_multihead_latent_attention.pdfdeep_dive_multihead_latent_attention.pdf
deep_dive_multihead_latent_attention.pdf
Julien SIMON
 
Deep Dive: Model Distillation with DistillKit
Deep Dive: Model Distillation with DistillKitDeep Dive: Model Distillation with DistillKit
Deep Dive: Model Distillation with DistillKit
Julien SIMON
 
Deep Dive: Parameter-Efficient Model Adaptation with LoRA and Spectrum
Deep Dive: Parameter-Efficient Model Adaptation with LoRA and SpectrumDeep Dive: Parameter-Efficient Model Adaptation with LoRA and Spectrum
Deep Dive: Parameter-Efficient Model Adaptation with LoRA and Spectrum
Julien SIMON
 
Building High-Quality Domain-Specific Models with Mergekit
Building High-Quality Domain-Specific Models with MergekitBuilding High-Quality Domain-Specific Models with Mergekit
Building High-Quality Domain-Specific Models with Mergekit
Julien SIMON
 
Tailoring Small Language Models for Enterprise Use Cases
Tailoring Small Language Models for Enterprise Use CasesTailoring Small Language Models for Enterprise Use Cases
Tailoring Small Language Models for Enterprise Use Cases
Julien SIMON
 
Tailoring Small Language Models for Enterprise Use Cases
Tailoring Small Language Models for Enterprise Use CasesTailoring Small Language Models for Enterprise Use Cases
Tailoring Small Language Models for Enterprise Use Cases
Julien SIMON
 
Julien Simon - Deep Dive: Compiling Deep Learning Models
Julien Simon - Deep Dive: Compiling Deep Learning ModelsJulien Simon - Deep Dive: Compiling Deep Learning Models
Julien Simon - Deep Dive: Compiling Deep Learning Models
Julien SIMON
 
Tailoring Small Language Models for Enterprise Use Cases
Tailoring Small Language Models for Enterprise Use CasesTailoring Small Language Models for Enterprise Use Cases
Tailoring Small Language Models for Enterprise Use Cases
Julien SIMON
 
Julien Simon - Deep Dive - Optimizing LLM Inference
Julien Simon - Deep Dive - Optimizing LLM InferenceJulien Simon - Deep Dive - Optimizing LLM Inference
Julien Simon - Deep Dive - Optimizing LLM Inference
Julien SIMON
 
Julien Simon - Deep Dive - Accelerating Models with Better Attention Layers
Julien Simon - Deep Dive - Accelerating  Models with Better Attention LayersJulien Simon - Deep Dive - Accelerating  Models with Better Attention Layers
Julien Simon - Deep Dive - Accelerating Models with Better Attention Layers
Julien SIMON
 
Julien Simon - Deep Dive - Quantizing LLMs
Julien Simon - Deep Dive - Quantizing LLMsJulien Simon - Deep Dive - Quantizing LLMs
Julien Simon - Deep Dive - Quantizing LLMs
Julien SIMON
 
Julien Simon - Deep Dive - Model Merging
Julien Simon - Deep Dive - Model MergingJulien Simon - Deep Dive - Model Merging
Julien Simon - Deep Dive - Model Merging
Julien SIMON
 
An introduction to computer vision with Hugging Face
An introduction to computer vision with Hugging FaceAn introduction to computer vision with Hugging Face
An introduction to computer vision with Hugging Face
Julien SIMON
 
Reinventing Deep Learning
 with Hugging Face Transformers
Reinventing Deep Learning
 with Hugging Face TransformersReinventing Deep Learning
 with Hugging Face Transformers
Reinventing Deep Learning
 with Hugging Face Transformers
Julien SIMON
 
Building NLP applications with Transformers
Building NLP applications with TransformersBuilding NLP applications with Transformers
Building NLP applications with Transformers
Julien SIMON
 
Building Machine Learning Models Automatically (June 2020)
Building Machine Learning Models Automatically (June 2020)Building Machine Learning Models Automatically (June 2020)
Building Machine Learning Models Automatically (June 2020)
Julien SIMON
 
Scale Machine Learning from zero to millions of users (April 2020)
Scale Machine Learning from zero to millions of users (April 2020)Scale Machine Learning from zero to millions of users (April 2020)
Scale Machine Learning from zero to millions of users (April 2020)
Julien SIMON
 
An Introduction to Generative Adversarial Networks (April 2020)
An Introduction to Generative Adversarial Networks (April 2020)An Introduction to Generative Adversarial Networks (April 2020)
An Introduction to Generative Adversarial Networks (April 2020)
Julien SIMON
 
AIM410R1 Deep learning applications with TensorFlow, featuring Fannie Mae (De...
AIM410R1 Deep learning applications with TensorFlow, featuring Fannie Mae (De...AIM410R1 Deep learning applications with TensorFlow, featuring Fannie Mae (De...
AIM410R1 Deep learning applications with TensorFlow, featuring Fannie Mae (De...
Julien SIMON
 
AIM361 Optimizing machine learning models with Amazon SageMaker (December 2019)
AIM361 Optimizing machine learning models with Amazon SageMaker (December 2019)AIM361 Optimizing machine learning models with Amazon SageMaker (December 2019)
AIM361 Optimizing machine learning models with Amazon SageMaker (December 2019)
Julien SIMON
 
Ad

Recently uploaded (20)

Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptxIncreasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Anoop Ashok
 
AI and Data Privacy in 2025: Global Trends
AI and Data Privacy in 2025: Global TrendsAI and Data Privacy in 2025: Global Trends
AI and Data Privacy in 2025: Global Trends
InData Labs
 
Role of Data Annotation Services in AI-Powered Manufacturing
Role of Data Annotation Services in AI-Powered ManufacturingRole of Data Annotation Services in AI-Powered Manufacturing
Role of Data Annotation Services in AI-Powered Manufacturing
Andrew Leo
 
Procurement Insights Cost To Value Guide.pptx
Procurement Insights Cost To Value Guide.pptxProcurement Insights Cost To Value Guide.pptx
Procurement Insights Cost To Value Guide.pptx
Jon Hansen
 
What is Model Context Protocol(MCP) - The new technology for communication bw...
What is Model Context Protocol(MCP) - The new technology for communication bw...What is Model Context Protocol(MCP) - The new technology for communication bw...
What is Model Context Protocol(MCP) - The new technology for communication bw...
Vishnu Singh Chundawat
 
Semantic Cultivators : The Critical Future Role to Enable AI
Semantic Cultivators : The Critical Future Role to Enable AISemantic Cultivators : The Critical Future Role to Enable AI
Semantic Cultivators : The Critical Future Role to Enable AI
artmondano
 
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In France
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In FranceManifest Pre-Seed Update | A Humanoid OEM Deeptech In France
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In France
chb3
 
TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...
TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...
TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...
TrustArc
 
How Can I use the AI Hype in my Business Context?
How Can I use the AI Hype in my Business Context?How Can I use the AI Hype in my Business Context?
How Can I use the AI Hype in my Business Context?
Daniel Lehner
 
TrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business ConsultingTrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business Consulting
Trs Labs
 
Heap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and DeletionHeap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and Deletion
Jaydeep Kale
 
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Aqusag Technologies
 
HCL Nomad Web – Best Practices and Managing Multiuser Environments
HCL Nomad Web – Best Practices and Managing Multiuser EnvironmentsHCL Nomad Web – Best Practices and Managing Multiuser Environments
HCL Nomad Web – Best Practices and Managing Multiuser Environments
panagenda
 
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
organizerofv
 
Big Data Analytics Quick Research Guide by Arthur Morgan
Big Data Analytics Quick Research Guide by Arthur MorganBig Data Analytics Quick Research Guide by Arthur Morgan
Big Data Analytics Quick Research Guide by Arthur Morgan
Arthur Morgan
 
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
BookNet Canada
 
Drupalcamp Finland – Measuring Front-end Energy Consumption
Drupalcamp Finland – Measuring Front-end Energy ConsumptionDrupalcamp Finland – Measuring Front-end Energy Consumption
Drupalcamp Finland – Measuring Front-end Energy Consumption
Exove
 
DevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptx
DevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptxDevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptx
DevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptx
Justin Reock
 
#StandardsGoals for 2025: Standards & certification roundup - Tech Forum 2025
#StandardsGoals for 2025: Standards & certification roundup - Tech Forum 2025#StandardsGoals for 2025: Standards & certification roundup - Tech Forum 2025
#StandardsGoals for 2025: Standards & certification roundup - Tech Forum 2025
BookNet Canada
 
Complete Guide to Advanced Logistics Management Software in Riyadh.pdf
Complete Guide to Advanced Logistics Management Software in Riyadh.pdfComplete Guide to Advanced Logistics Management Software in Riyadh.pdf
Complete Guide to Advanced Logistics Management Software in Riyadh.pdf
Software Company
 
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptxIncreasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Increasing Retail Store Efficiency How can Planograms Save Time and Money.pptx
Anoop Ashok
 
AI and Data Privacy in 2025: Global Trends
AI and Data Privacy in 2025: Global TrendsAI and Data Privacy in 2025: Global Trends
AI and Data Privacy in 2025: Global Trends
InData Labs
 
Role of Data Annotation Services in AI-Powered Manufacturing
Role of Data Annotation Services in AI-Powered ManufacturingRole of Data Annotation Services in AI-Powered Manufacturing
Role of Data Annotation Services in AI-Powered Manufacturing
Andrew Leo
 
Procurement Insights Cost To Value Guide.pptx
Procurement Insights Cost To Value Guide.pptxProcurement Insights Cost To Value Guide.pptx
Procurement Insights Cost To Value Guide.pptx
Jon Hansen
 
What is Model Context Protocol(MCP) - The new technology for communication bw...
What is Model Context Protocol(MCP) - The new technology for communication bw...What is Model Context Protocol(MCP) - The new technology for communication bw...
What is Model Context Protocol(MCP) - The new technology for communication bw...
Vishnu Singh Chundawat
 
Semantic Cultivators : The Critical Future Role to Enable AI
Semantic Cultivators : The Critical Future Role to Enable AISemantic Cultivators : The Critical Future Role to Enable AI
Semantic Cultivators : The Critical Future Role to Enable AI
artmondano
 
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In France
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In FranceManifest Pre-Seed Update | A Humanoid OEM Deeptech In France
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In France
chb3
 
TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...
TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...
TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...
TrustArc
 
How Can I use the AI Hype in my Business Context?
How Can I use the AI Hype in my Business Context?How Can I use the AI Hype in my Business Context?
How Can I use the AI Hype in my Business Context?
Daniel Lehner
 
TrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business ConsultingTrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business Consulting
Trs Labs
 
Heap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and DeletionHeap, Types of Heap, Insertion and Deletion
Heap, Types of Heap, Insertion and Deletion
Jaydeep Kale
 
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...
Aqusag Technologies
 
HCL Nomad Web – Best Practices and Managing Multiuser Environments
HCL Nomad Web – Best Practices and Managing Multiuser EnvironmentsHCL Nomad Web – Best Practices and Managing Multiuser Environments
HCL Nomad Web – Best Practices and Managing Multiuser Environments
panagenda
 
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
IEDM 2024 Tutorial2_Advances in CMOS Technologies and Future Directions for C...
organizerofv
 
Big Data Analytics Quick Research Guide by Arthur Morgan
Big Data Analytics Quick Research Guide by Arthur MorganBig Data Analytics Quick Research Guide by Arthur Morgan
Big Data Analytics Quick Research Guide by Arthur Morgan
Arthur Morgan
 
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
BookNet Canada
 
Drupalcamp Finland – Measuring Front-end Energy Consumption
Drupalcamp Finland – Measuring Front-end Energy ConsumptionDrupalcamp Finland – Measuring Front-end Energy Consumption
Drupalcamp Finland – Measuring Front-end Energy Consumption
Exove
 
DevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptx
DevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptxDevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptx
DevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptx
Justin Reock
 
#StandardsGoals for 2025: Standards & certification roundup - Tech Forum 2025
#StandardsGoals for 2025: Standards & certification roundup - Tech Forum 2025#StandardsGoals for 2025: Standards & certification roundup - Tech Forum 2025
#StandardsGoals for 2025: Standards & certification roundup - Tech Forum 2025
BookNet Canada
 
Complete Guide to Advanced Logistics Management Software in Riyadh.pdf
Complete Guide to Advanced Logistics Management Software in Riyadh.pdfComplete Guide to Advanced Logistics Management Software in Riyadh.pdf
Complete Guide to Advanced Logistics Management Software in Riyadh.pdf
Software Company
 
Ad

Adding Image and Video Analysis to your Applications (May 2018)

  • 1. © 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Julien Simon Principal Technical Evangelist, AI and Machine Learning @julsimon Adding Image and Video Analysis to your Applications
  • 2. Put Machine Learning in the hands of every developer and data scientist Our mission
  • 3. Application Services Platform Services Frameworks & Infrastructure API-driven services:Vision, Language & Speech Services, Chatbots AWS ML Stack Deploy machine learning models with high-performance machine learning algorithms, broad framework support, and one-click training, tuning, and inference. Develop sophisticated models with any framework, create managed, auto-scaling clusters of GPUs for large scale training, or run prediction on trained models.
  • 4. Application Services Platform Services Frameworks & Infrastructure API-driven services:Vision, Language & Speech Services, Chatbots AWS ML Stack Deploy machine learning models with high-performance machine learning algorithms, broad framework support, and one-click training, tuning, and inference. Develop sophisticated models with any framework, create managed, auto-scaling clusters of GPUs for large scale training, or run prediction on trained models.
  • 5. © 2018, Amazon Web Services, Inc. or Its Affiliates. All rights reserved. Amazon Rekognition
  • 6. Amazon Rekognition Fully managed service Easy-to-use API Low costImage analysis Video analysis Easy-to-use deep learning-based computer vision analysis Real-time and batch analysis
  • 7. Amazon Rekognition Image Object and Scene Detection Facial Analysis Face Recognition Text in ImageUnsafe Image Detection Celebrity Recognition Detect objects, scenes, and faces, extract text, recognize celebrities, and identify unsafe content in images Face Comparison
  • 8. Object & Scene Detection
  • 10. Crowd Detection – up to 100 faces
  • 12. Explicit Nudity Nudity Graphic Male Nudity Graphic Female Nudity Sexual Activity Partial Nudity Suggestive Female Swimwear or Underwear Male Swimwear or Underwear Revealing Clothes Image Moderation
  • 15. Marinus Analytics Marinus Analytics provides law enforcement with tools founded in artificial intelligence. Traffic Jam, is a suite of tools for use by law enforcement agencies on sex trafficking investigations. Before using Amazon Rekognition, their only recourse was manual processing; this was time-intensive or not possible. Now, investigators are able to take effective action by searching through millions of records in seconds to find victims. http://www.marinusanalytics.com/articles/2017/ 10/17/amazon-rekognition-helps-marinus- analytics-fight-human-trafficking
  • 17. © 2018, Amazon Web Services, Inc. or Its Affiliates. All rights reserved. Amazon Rekognition Video
  • 18. Manually intensive Slow and error-prone Expensive Challenges of video analysis Temporal information lost Motion context lost
  • 19. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
  • 20. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
  • 21. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
  • 22. Amazon Rekognition Video Analyze activity, recognize, and track people in stored and live video Object and activity detection Person tracking Face recognition Real-time live stream Unsafe video detection Celebrity recognition Facial Analysis
  • 23. Sky News will use Amazon Rekognition to perform real- time identification of guests as they enter St. George’s Chapel
  • 24. Use case: video search index Video Amazon S3 AWS Lambda Amazon Rekognition Video Amazon Elasticsearch Amazon DynamoDB 1. Video is uploaded and stored to S3 2. Amazon Rekognition Video creates metadata for celebrities, emotions, key topics in video with time segments for search 4. Lambda also pushes the metadata and confidence scores into Elasticsearch 3. The output is persisted as metadata into DynamoDB to ensure durability
  • 25. Indexed 99,000 people Saves ~9,000 hours a year in labor CSPAN: automatic footage tagging
  • 26. Amazon RekognitionVideo Public safety Recognize a person of interest across a collection of millions in real time across hundreds of cameras Track a person of interest across a video Create alerts by detecting objects and activities of interest, such as car, license plate, running, etc.
  • 27. Use case: public safety immediate response Live Street Camera Amazon Kinesis Video Streams Amazon Rekognition Video Face collection 1. Camera-captured video streams are processed by Kinesis Video Streams 2. Amazon Rekognition Video analyses the video and searches faces on screen against a collection of millions of faces User 3. User is notified in case of face matches Amazon SNS AWS Lambda Amazon Kinesis Streams
  • 28. Orlando: real-time public safety Real-time video analysis ”The City of Orlando is excited to work with Amazon to pilot the latest in public safety software through a unique, first-of-it's-kind public- private partnership.Through the pilot, Orlando will utilize Amazon Rekognition Video and Amazon KinesisVideo Streams technology in a way that will use existing City resources to provide real-time detection and notification of persons-of-interests, further increasing public safety, and operational efficiency opportunities for the City of Orlando and other cities across the nation”. John Mina, Police Chief, City of Orlando
  • 30. FRAMEWORKS AND INTERFACES PLATFORM SERVICES APPLICATION SERVICES Amazon Rekognition Amazon Polly Amazon Lex Democratization of ML Amazon Rekognition Video Amazon Transcribe Amazon Comprehend Amazon SageMaker AWS DeepLens Amazon EMR Deep Learning AMI Amazon Translate
  • 31. Thank you! Julien Simon PrincipalTechnical Evangelist, AI and Machine Learning @julsimon https://aws.amazon.com/machine-learning https://aws.amazon.com/rekognition https://aws.amazon.com/blogs/ai https://medium.com/@julsimon https://youtube.com/juliensimonfr

Editor's Notes

  • #8: Let’s take a look at what Rekognition Image can do and how it has evolved over the past year Last re:Invent we launched Rekognition with Face detection, Face recognition and Object and Scene Detection In April, we added Unsafe Image detection. . In June, we launched Celebrity Recognition. In November, we launched Text In Image. And we also updated Face Detection, added real-time Face recognition, and updated our Moderation models …and we are continually improving based on customer feedback.
  • #13: You can use the ‘MinConfidence’ parameter in your API requests to balance detection of content (recall) vs the accuracy of detection (precision). 
  • #15: You can use the ‘MinConfidence’ parameter in your API requests to balance detection of content (recall) vs the accuracy of detection (precision). 
  • #17: SageMaker is going to make it much easier for everyday developers to build machine-learning models. But, people and developers are still really interested in learning more about how they can use machine learning. They want to do it, so they're reading all kinds of literature, and there are some code samples they can play around with. But, for any of us who've had to learn something new that has any kind of complexity, there's no substitute for hands-on training and application. And so we thought about: What can we do that would allow our builders and our developers to get this hands-on training? Our teams worked on this problem and developed AWS DeepLens, which is the world's first wireless deep-learning-enabled video-camera for developers.
  • #19: Current solutions mostly rely on having humans review and catalog the most important footage. It is expensive, slow, and error-prone, but it also leaves potential value on the table with the remaining uncatalogued video. Some people have tried applying traditional image analysis solutions to videos by sampling individual frames, but this loses important context. For example, one frame may detect a person standing, and another person lying down. However, the analysis would miss the context that this is the same person who has had a sudden fall. This solution also doesn’t support analysis of real-time video streaming. Existing solutions for extracting data on streaming video is expensive and does not scale. For example, Home IoT cameras providers are seeking a high quality analytics solution that provides output at low latency and does not trigger false alerts.
  • #23: We heard a need from customers they would like to have a deep learning-based Video analysis service Introducing Amazon Rekognition Video: a deep learning-based video recognition and analysis service for live or stored video that offers: Object and Activity Detection: With Rekognition Video, you can detect thousands of objects and activities accurately, and extract motion-based context from a video. For example, your application can quickly analyze a consumer video of a baby's first walk, and generate labels like "baby”, "crawling”, "falling”, and "hugging” with timestamps and confidence scores. This allows you to generate search indexes for consumer video archives. Person Tracking: With Rekognition Video motion-based video analysis capabilities, you can track people through the video even when their faces are not visible, or as the whole person might go in and out of the scene. This makes investigation and monitoring of individuals easy and accurate. Face Recognition: Rekognition Video allows you to identify persons from a large existing collection of faces. This allows you to build applications that locate a Person of Interest in real time from a live stream or in batch from Amazon S3. Streaming mode by native integration with Amazon Kinesis Video Streams: In streaming mode, the face detection and recognition APIs natively integrate with video stream from Kinesis Video Streams to provide output at low latency. Kinesis Video Streams enables developers to transmit thousands of live feeds and associated metadata to persist them on Amazon S3 and Glacier. Unsafe Video Detection: Rekognition Video enables easy filtering of video for explicit and suggestive content, providing fine-grained detection of labels associated with inappropriate / NSFW content at a video frame level. Celebrity Recognition: With Rekognition Video, you can detect and recognize celebrities in a video and track in which video frames each of them appears. This allows you to index and search digital video libraries for celebrities based on your particular interest. Feature capabilities summary: Contextual insight - motion and real-time Fully managed, scalable, and easy-to-use video analysis service Deep learning-based -> Continuously improving Integrated with Amazon S3, Amazon Kinesis Video Streams, AWS Lambda – get started with video analysis right out-of-the-box
  • #25: A mobile app uploads new videos to S3 An S3 notification triggers a Lambda function which calls Rekognition Video’s APIs with an S3 url DetectLabels analyzes the video and returns labels for objects and scenes detected in the video CelebrityRecognition returns celebrities recognized in the videos This output is persisted as metadata into DynamoDB to ensure durability Lambda writes the metadata to Elasticsearch The application provides a smart search index (powered by Elasticsearch), and serves the selected videos directly from S3
  • #27: Live streaming for immediate response Accurate analysis of a massive amount of data for investigative response Recognize a person across a collection of millions at low latency and high accuracy in streaming mode, across hundreds of cameras. Track a person of interest by outputting time segments when the person appeared in the video. Create alerts by detecting objects and activities of interest, such as car, license plate, running etc.
  • #31: So far, we've discussed the bottom and middle layers of the machine learning stack – first we talked about the frameworks and the deep learning AMI for expert practitioners. Then, SageMaker and DeepLens in the middle layer to bring ML capabilities to all developers. Now, at the top of the stack, we serve developers and companies who want to add solution-oriented intelligence to their applications through an API call rather than developing and training their own models. These are services that exhibit artificial intelligence that emulates a human’s cognitive skills. Last year, we announced three services in this area: Amazon Rekognition (image analysis), Amazon Polly (text-to-speech), and Amazon Lex (conversational applications).