Image Recognition on Streaming Data
Neil Dahlke, Senior Solutions Engineer
21 August 2017
Me at a Glance
AT MEMSQL
Senior Solutions Engineer, San Francisco
BEFORE MEMSQL
I worked on Globus, a high-performance data transfer tool for research scientists, out of the University of Chicago in coordination with Argonne National Lab.
PREVIOUS TALKS
Real Time, Geospatial, Maps (slides)
Streaming in the Enterprise (slides)
Real Time Analytics with Spark and MemSQL (slides)
The future of computing is visual…
Mapping Social Imagery Handwriting
and many more.
and it is also mathematical.
But first, let’s create a shared vocabulary.
MemSQL at a Glance
Streaming Data Ingest: Easy-to-set-up real-time data pipelines with exactly-once semantics
Live Data: Memory-optimized tables for analyzing real-time events
Historical Data: Disk-optimized tables with up to 10x compression and vectorized queries for fast analytics
Data Loading: FAST
▪ Stream data
▪ Real-time loading
▪ Full data access
Query Latency: LOW
▪ Vectorized queries
▪ Real-time dashboards
▪ Live data access
Concurrency: HIGH
▪ Multi-threaded processing
▪ Transactions and Analytics
▪ Scalable performance
MemSQL in One Slide
• Distributed ANSI SQL database
• Full ACID features
• Lock-free, shared-nothing
• Compiled queries
• Massively parallel
• Geospatial and JSON
• In-memory and on-disk
• MySQL protocol
• Streaming
• HTAP (rowstore and columnstore)
Architecture: MemSQL Building Blocks
memsqld
Architecture: Aggregators and Leaves
Agg 1 Agg 2
Leaf 1 Leaf 2 Leaf 3 Leaf 4
Architecture: Aggregators Aggregate
Agg 1 Agg 2
Leaf 1 Leaf 2 Leaf 3 Leaf 4
Architecture: Leaves Hold Partitions
Agg 1 Agg 2
Leaf 1 Leaf 2 Leaf 3 Leaf 4
Architecture: It’s SQL All The Way Down
Agg 1 Agg 2
agg1> select avg(price) from orders;
leaf1> using memsql_demo_0 select count(1), sum(price) from orders;
leaf2> using memsql_demo_12 select count(1), sum(price) from orders;
...
Leaf 1 Leaf 2 Leaf 3 Leaf 4
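The queries on this slide suggest how a distributed average can be computed: each leaf returns a partial count and sum for its partition, and the aggregator combines them. The Python sketch below illustrates that idea only. The function names and the sample prices are hypothetical, and it is not a description of MemSQL's actual query planner.

```python
from typing import Iterable, List, Tuple


def leaf_partial(prices: Iterable[float]) -> Tuple[int, float]:
    """What one leaf returns: the equivalent of SELECT COUNT(1), SUM(price)."""
    values = list(prices)
    return len(values), sum(values)


def aggregate_avg(partials: List[Tuple[int, float]]) -> float:
    """What the aggregator does: total sum divided by total count."""
    total_count = sum(count for count, _ in partials)
    total_sum = sum(partial_sum for _, partial_sum in partials)
    if total_count == 0:
        raise ValueError("AVG over zero rows is undefined (SQL would return NULL)")
    return total_sum / total_count


# Made-up partition contents, for illustration only.
partitions = [[10.0, 20.0], [30.0], [40.0, 50.0, 60.0]]
partials = [leaf_partial(p) for p in partitions]
print(aggregate_avg(partials))  # 35.0
```

Shipping count and sum, rather than a per-leaf average, matters because partitions can hold different numbers of rows: averaging the three per-partition averages above would give about 31.67 instead of 35.0.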
Now, back to image recognition.
DIGITAL DEFENDERS
OF CHILDREN
How does it work?
Real-Time Image Recognition Workflow
▪ Train a model with Spark and TensorFlow
▪ Use the Model to extract feature vectors from images
• Model + Image => FV
▪ You can store every feature vector in a MemSQL table
CREATE TABLE features (
  id bigint(11) NOT NULL,
  image binary(4096) DEFAULT NULL,
  KEY id (id) USING CLUSTERED COLUMNSTORE
);
4,996 POINTS
▪ CLASSIFICATION
▪ DE-DUPLICATION
▪ MATCHING
[Slide 26: sample feature vector text, shown as index:value pairs]
949:0.026740,961:0.011758,962:0.01 ...
Working with Feature Vectors
For every image, we store an ID and a normalized feature vector in a MemSQL table called features.
ID | Feature Vector
x | 4KB
To find similar images using cosine similarity, we use this SQL query:
SELECT id
FROM features
WHERE DOT_PRODUCT(image, 0xDEADBEEF) > 0.9;
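To make the query concrete, here is a minimal Python sketch of the same filter applied to an in-memory dictionary instead of a table. It assumes the 4KB column holds 1,024 little-endian 32-bit floats. That layout, the helper names, and the three sample rows are assumptions for illustration and are not taken from the slides.

```python
import struct
from typing import Dict, List

DIMENSIONS = 1024  # assumption: 4096-byte column / 4 bytes per float32


def pack_vector(values: List[float]) -> bytes:
    """Encode floats as little-endian float32, zero-padded to DIMENSIONS."""
    if len(values) > DIMENSIONS:
        raise ValueError("vector has too many dimensions")
    padded = list(values) + [0.0] * (DIMENSIONS - len(values))
    return struct.pack(f"<{DIMENSIONS}f", *padded)


def dot_product(a: bytes, b: bytes) -> float:
    """Dot product of two packed vectors."""
    xs = struct.unpack(f"<{DIMENSIONS}f", a)
    ys = struct.unpack(f"<{DIMENSIONS}f", b)
    return sum(x * y for x, y in zip(xs, ys))


def similar_ids(table: Dict[int, bytes], query: bytes, threshold: float = 0.9) -> List[int]:
    """Mimic: SELECT id ... WHERE DOT_PRODUCT(image, <query>) > threshold."""
    return [row_id for row_id, image in table.items()
            if dot_product(image, query) > threshold]


# Hypothetical rows; each vector already has length 1.
features = {
    1: pack_vector([1.0, 0.0]),
    2: pack_vector([0.6, 0.8]),
    3: pack_vector([0.8, 0.6]),
}
query = pack_vector([0.6, 0.8])
print(len(query))                    # 4096
print(similar_ids(features, query))  # [2, 3]
```

Row 1 scores 0.6 against the query and is filtered out. Rows 2 and 3 score about 1.0 and 0.96, so both pass the 0.9 threshold.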
100s of millions of images to match
Understanding Dot Product
▪ Dot Product is an algebraic operation
• X = (x1, …, xN), Y = (y1, …, yN)
• (X*Y) = SUM(Xi * Yi)
▪ With this specific model and normalized feature vectors, the dot product yields a similarity score.
• The closer the score is to 1, the more similar the images are
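The link between normalization and the similarity score can be checked with a few lines of Python. This is a sketch with hypothetical function names and toy two-dimensional vectors: it divides each vector by its Euclidean length and then takes the dot product, which for unit-length vectors equals the cosine of the angle between them.

```python
import math
from typing import List, Sequence


def dot(x: Sequence[float], y: Sequence[float]) -> float:
    """(X*Y) = SUM(Xi * Yi)"""
    if len(x) != len(y):
        raise ValueError("vectors must have the same length")
    return sum(a * b for a, b in zip(x, y))


def normalize(x: Sequence[float]) -> List[float]:
    """Scale a vector to length 1 by dividing each element by its norm."""
    length = math.sqrt(dot(x, x))
    if length == 0:
        raise ValueError("cannot normalize the zero vector")
    return [a / length for a in x]


def cosine_similarity(x: Sequence[float], y: Sequence[float]) -> float:
    """Dot product of the two normalized vectors."""
    return dot(normalize(x), normalize(y))


print(round(cosine_similarity([3.0, 4.0], [6.0, 8.0]), 6))  # 1.0 (same direction)
print(round(cosine_similarity([1.0, 0.0], [0.0, 1.0]), 6))  # 0.0 (orthogonal)
```

Storing vectors already normalized means the per-row work at query time is only the dot product, with no division or square root.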
Performance numbers
▪ Memory speed: ~50 GB/sec
▪ Vector size: 4 KB
▪ ~12.5 million images a second per node
▪ ~1 billion images a second on a 100-node cluster
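The per-node figure follows from dividing memory bandwidth by vector size. The back-of-the-envelope sketch below reproduces it under two assumptions that are not stated on the slide: "4KB" is taken as 4,000 bytes (which matches the slide's 12.5 million), and throughput is assumed to scale linearly with node count.

```python
MEMORY_BANDWIDTH_BYTES_PER_SEC = 50e9  # ~50 GB/sec, from the slide
VECTOR_SIZE_BYTES = 4_000              # assumption: "4KB" rounded to 4,000 bytes
NODES = 100


def images_per_second(bandwidth: float, vector_bytes: int, nodes: int = 1) -> float:
    """Upper bound on vectors scanned per second if memory bandwidth is the limit."""
    return bandwidth / vector_bytes * nodes


per_node = images_per_second(MEMORY_BANDWIDTH_BYTES_PER_SEC, VECTOR_SIZE_BYTES)
cluster = images_per_second(MEMORY_BANDWIDTH_BYTES_PER_SEC, VECTOR_SIZE_BYTES, NODES)
print(f"{per_node:,.0f}")  # 12,500,000
print(f"{cluster:,.0f}")   # 1,250,000,000
```

With 4,096-byte vectors the per-node bound drops to roughly 12.2 million, so the slide's numbers read best as rounded estimates.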
Performance Enhancing Techniques
Achieving best-in-class dot product implementation
▪ SIMD-powered
▪ Data compression
▪ Query parallelism
▪ Scale out
▪ Result: Processing at Memory Bandwidth Speed
MemSQL gives us…
▪ Performance
▪ Scalability
▪ High concurrency
▪ Real-time (operational)
▪ Compatibility (BI, Spark, Kafka, ETL, etc)
▪ Hybrid deployment
▪ Robustness, durability, security
▪ And also…
MemSQL Streaming
Extract: Ingest from Apache Kafka, Amazon S3, or Kinesis
Transform: Map and enrich data with user-defined or Apache Spark transformations
Load: Guarantee message delivery with exactly-once semantics
Simple Streaming Setup with CREATE PIPELINE
memsql> CREATE PIPELINE features_pipeline AS
    -> LOAD DATA KAFKA "public-kafka.memcompute.com:9092/features"
    -> INTO TABLE features
    -> FIELDS TERMINATED BY ','
    -> (id, @image)
    -> SET image = UNHEX(SUBSTRING(@image, 3));
Query OK, (0.89 sec)
memsql> START PIPELINE features_pipeline;
Query OK, (0.01 sec)
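The FIELDS TERMINATED BY ',' and UNHEX(SUBSTRING(@image, 3)) clauses imply that each Kafka message is a line of the form id,0x<hex>. That message format is an inference from the statement, not something the slides spell out. The Python sketch below, with hypothetical helper names, builds such a line and decodes it the way the SET clause would.

```python
from typing import Tuple


def encode_message(image_id: int, vector_blob: bytes) -> str:
    """Build one CSV line: '<id>,0x<hex of the vector bytes>'."""
    return f"{image_id},0x{vector_blob.hex().upper()}"


def decode_message(line: str) -> Tuple[int, bytes]:
    """Mimic the pipeline: split on ',', then UNHEX(SUBSTRING(@image, 3))."""
    id_text, image_text = line.split(",", 1)
    # SQL SUBSTRING is 1-indexed, so position 3 skips the two-character "0x" prefix.
    return int(id_text), bytes.fromhex(image_text[2:])


blob = bytes.fromhex("DEADBEEF")  # stand-in for a full 4KB feature vector
line = encode_message(42, blob)
print(line)                        # 42,0xDEADBEEF
print(decode_message(line) == (42, blob))  # True
```

A real producer would send one such line per image to the Kafka topic, and a real feature vector would be 8,192 hex characters long rather than 8.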
MemSQL Pipelines Sequence
Data Sources MemSQL
1. Extract 2. Transform extracted data 3. Load into Database tables
Pipelines
MemSQL Pipelines Architecture: Kafka
[Diagram: three Kafka brokers, each paired with Pipelines on a MemSQL leaf running 1. Extract, 2. Transform, 3. Load; Pipelines on the MemSQL aggregator; labels: Data reshuffle, Metadata query]
Streaming Real-Time Image Recognition Workflow
[Diagram labels: New Image Stream, Real-Time Processing, TensorFlow, Reference Image Store, Real-Time Application]
Q&A
The Data Warehouse Blueprint for ML, AI and Hybrid Cloud
GARY ORENSTEIN, SVP PRODUCT
Tuesday, August 22 | 1:45 p.m.-2:05 p.m. | TechZone Theater

Gartner Catalyst 2017: Image Recognition on Streaming Data

Editor's Notes

  • #26: Normalize each feature vector by dividing each element by the length of the vector (its Euclidean norm), which gives a vector of length 1. The closer the dot product of two normalized vectors is to 1, the more similar the images. The dot product is an algebraic operation that takes two equal-length sequences of numbers (usually coordinate vectors) and returns a single number. In Euclidean geometry, the dot product of the Cartesian coordinates of two vectors is widely used and often called the inner product (or, rarely, the projection product); see also inner product space. Algebraically, the dot product is the sum of the products of the corresponding entries of the two sequences of numbers.