SlideShare a Scribd company logo
APAC Big Data &
Cloud Summit 2013
Girish Juneja
GM, Big Data Software
Software & Services Group
Data Fab Transistor System Enablement Optimization Intelligence
Data
30 million
networked
sensors growing
at 30% a year
Computing
1 trillion devices
connected to the
Internet by 2015
Experience
500 million smart
phone users
increasing 20% a year
Social
Machine Generated
User Generated
Feedback loops driving exponential growth
Evolving towards end-to-end real-time analytics
Decade Paradigm Architecture Platform
• Reporting / Data Mining
• High Cost / Isolated use
90s
2000s
Today
• Model-based discovery
• High Cost / Dept Use
• Unbounded Map Reduce Query
• Low Cost / Enterprise Use
• Arrival of vast amounts of
unstructured data
• Batch – “sales reports”
• Sequential SQL queries
• Batch-ie correlated buying pattern
• No SQL. parallel analysis
• Shared disk/memory
Unlimited
Linear Scale
RDMS
Proprietary MPP/
DW Appliance
Open Source SW loosely
coupled to commodity HW
No SQL RDMS
Scale
Scale NodeNode
• Real-time - ie recommend engine
• Process @ storage node
• Built-in data replication/reliability
• Shared nothing, in memory
Distributed node addition
NodeNode Node
Multi-core
Node
Make big data work for you
Amount of data your enterprise will need to ingest: 50X
Proportion of data that is useful to you: 10%
Projected increase in your IT budget: 10%
=> Business as usual is not an option
Software
Global
Ecosystem
Security
Systems
Architecture
Energy
Efficient
Performance
Manufacturing
Leadership
Benefit from Intel’s long-standing investments
Using volume economics to drive innovation
Intel
Fabricating silicon for big data
22nm
A Revolutionary Leap
in Process
Technology
37%
Performance Gain at Low
Voltage1
>50%
Active Power Reduction at
Constant Performance1
Intel lead vs. Industry
3.5 years
2007
45 nm
2009
32 nm
2011
22 nm
High-k Metal Gate Tri Gate
Intel lead vs. Industry
4 years
Intel® Xeon® Processor E5-4600
Product Family
Highest reliability & scalability
Highest memory capacity
Highest enterprise & database performance
Density-optimized
Cost-optimized
Improved HPC performance
1 Source: Published results as of 8 May 2012. See http://www.intel.com/performance/server/xeonE7/summary.htm for full list of benchmarks and configuration details.
Pumping the heart of the open datacenter
Intel® Xeon® Processor E7-4800
Product Family
Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components,
software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the
performance of that product when combined with other products.
Enabling open source solutions
Optimize software to take advantage of Intel® architecture
AES-NI SSD, 10GbE TXTMCAVT-*
3x performance in
3 years
Mission Critical
deployments
Accelerates Crypto
in JBoss
30x throughput Trusted Compute
Pools
Contributing to Apache Hadoop
• File based encryption for Hadoop jobs
• ACLs for HDFS and HBase at cell level
• Flash storage for MapReduce shuffle data
• Caching and non-volatile memory for increased throughput
• HDFS adaptive replication of hot-files
• HBase distributed tables across data centers
• HDFS data replication across data centers
• Archival storage support for cold data on HDFS
• SSE Instructions
• JVM Enhancements
• Infiniband RDMA Support
Supporting Intel Distribution for Apache Hadoop
Data Mining
Graph Analytics
Full Text SearchFull SQL
Batch Analytics
Security
Intel® Distribution for Apache Hadoop* software
Granular access control in HBase
Up to 20X faster crypto with AES-NI*
30X faster Terasort on Intel® Xeon
processors, Intel 10GbE, and SSD
Up to 8.5X faster queries in Hive*
Job profiling and configuration,
automated by Intel® Active Tuner
*Based on internal testing
Rhino
Cloud
HPC
Common authentication,
access control, auditing
Bringing MapReduce to
data on Lustre FS
Enabling real-time 100%
SQL on Hadoop
Optimizing Hadoop for
virtualization & cloud
Backed by portfolio of datacenter products
Software
NetworkStorage & MemoryServer
Cache
Acceleration
Software
With broad support from the ecosystem
* Other names and brands may be claimed as the property of others.
Proven in the enterprise
Using the Intel® Distribution to gain tremendous results
* Other names and brands may be claimed as the property of others.
IT
Putting advanced capabilities at work…
• Expose new data
• Dashboard/historical reporting
• Real-time campaigns
• Vertical apps
• Predictive data services
• Graph visualization
• Log analysis
to solve real use cases
• Fraud & threat detection
• Life sciences research
• Behavioral analysis
• Warranty analysis
• Customer segmentation
• Infrastructure optimization
From Hype to High Performance
Data-Driven Business: Customer Service
Value
• Enable subscriber access to billing data
• 30X gain in performance; lower TCO
Analytics
• Provides real-time retrieval of 6 months data
• Supports new BI with 15 types of queries
• Enables targeted ad serving and promotions
Data Management
• 30 TB/month of billing data
• 300K reads/second; 800K inserts/second
• 133-node cluster / Intel Xeon E5 processors
CDR
Subscriber Self Service
Intel Distribution
Value
Enable researchers to discover biomarkers and drug
targets by correlating genomic data sets
90% gain in throughput; 6X data compression
Analytics
Provide curated data sets with pre-computed analysis
(classification, correlation, biomarkers)
Provide APIs for applications to combine and analyze
public and private data sets
Data Management
Use Hive and Hadoop for query and search
Dynamically partition and scale Hbase
10-node cluster / Intel Xeon E5 processors / 10GbE
Data-Intensive Discovery: Genomics
Intel Distribution
Data-Rich Communities: Smart City
Value
• Enforce traffic laws and detect license fraud
• Monitor and predict traffic patterns
• In a city of 31 million people
Analytics
• Detect traffic law violations automatically
• Detect driver license fraud by data mining
• Forecast traffic with predictive analytics
Data Management
• 30,000 cameras
• 6Mb/s stream rate per camera
• 15 PB of images in use / 2B records in HBase
Detection Prevention
Regional
Local
Catalyzing the ecosystem
Foster the ecosystem and develop new markets for Intel and its partners
Resources
Content
Case Studies
Whitepapers
Demos
http://hadoop.intel.com
Contacts
Girish Juneja
RK Hiremane
Eddie Toh
hadoop@intel.com
Girish Juneja - Intel Big Data & Cloud Summit 2013

More Related Content

PPTX
Apache Hadoop India Summit 2011 talk "Data Infrastructure on Hadoop" by Venka...
PDF
Vortrag ralph behrens_ibm-data
PPTX
Data infrastructure and Hadoop at LinkedIn
PDF
Big Data/Hadoop Infrastructure Considerations
PDF
VMworld 2013: Big Data Extensions: Advanced Features and Customer Case Study
PPTX
Big data cloud architecture
PDF
The datascientists workplace of the future, IBM developerDays 2014, Vienna by...
PDF
Big Data Taiwan 2014 Track2-2: Informatica Big Data Solution
Apache Hadoop India Summit 2011 talk "Data Infrastructure on Hadoop" by Venka...
Vortrag ralph behrens_ibm-data
Data infrastructure and Hadoop at LinkedIn
Big Data/Hadoop Infrastructure Considerations
VMworld 2013: Big Data Extensions: Advanced Features and Customer Case Study
Big data cloud architecture
The datascientists workplace of the future, IBM developerDays 2014, Vienna by...
Big Data Taiwan 2014 Track2-2: Informatica Big Data Solution

What's hot (20)

PPTX
SQLSaturday #230 - Introduction to Microsoft Big Data (Part 1)
PDF
Big Data Taiwan 2014 Keynote 4: Monetize Enterprise Data – Big Data 在台灣的經典應用與行動
PDF
Architecting Virtualized Infrastructure for Big Data
PPTX
Dev Lakhani, Data Scientist at Batch Insights "Real Time Big Data Applicatio...
PPTX
Interactive query in hadoop
PDF
GPU databases - How to use them and what the future holds
PPTX
Hadoop and big data
PPTX
Big Data Analytics with Hadoop, MongoDB and SQL Server
PDF
20100806 cloudera 10 hadoopable problems webinar
PPTX
Real-Time Analytics in Transactional Applications by Brian Bulkowski
PPTX
Interactive query using hadoop
PPTX
Big data analytics with hadoop volume 2
PPTX
C* Summit EU 2013: Leveraging the Power of Cassandra: Operational Reporting a...
PDF
Introduction to Big Data Analytics on Apache Hadoop
PDF
Hive
PPTX
Hadoop Reporting and Analysis - Jaspersoft
PPTX
Hadoop and BigData - July 2016
PDF
Introduction to Big Data and Hadoop
PPTX
Hd insight overview
PPTX
Powering Real-Time Big Data Analytics with a Next-Gen GPU Database
SQLSaturday #230 - Introduction to Microsoft Big Data (Part 1)
Big Data Taiwan 2014 Keynote 4: Monetize Enterprise Data – Big Data 在台灣的經典應用與行動
Architecting Virtualized Infrastructure for Big Data
Dev Lakhani, Data Scientist at Batch Insights "Real Time Big Data Applicatio...
Interactive query in hadoop
GPU databases - How to use them and what the future holds
Hadoop and big data
Big Data Analytics with Hadoop, MongoDB and SQL Server
20100806 cloudera 10 hadoopable problems webinar
Real-Time Analytics in Transactional Applications by Brian Bulkowski
Interactive query using hadoop
Big data analytics with hadoop volume 2
C* Summit EU 2013: Leveraging the Power of Cassandra: Operational Reporting a...
Introduction to Big Data Analytics on Apache Hadoop
Hive
Hadoop Reporting and Analysis - Jaspersoft
Hadoop and BigData - July 2016
Introduction to Big Data and Hadoop
Hd insight overview
Powering Real-Time Big Data Analytics with a Next-Gen GPU Database
Ad

Viewers also liked (14)

PDF
Intel’s Strategy in IoT Standards
PDF
intel-it-annual-performance-report-2014-15-paper-l
PDF
Ron Kasabian - Intel Big Data & Cloud Summit 2013
PPTX
Internet of things - 2016 trends.
PDF
Austin Cherian: Big data and HPC technologies - intel
PPTX
Intel Cloud Foundry and OpenStack
PDF
Workplace Transformation: Intel’s Vision for Embracing Change and Innovation
PDF
Big Data Intel® Platform
PDF
Day 2 aziz apj aziz_big_datakeynote_press
PDF
Three Steps to Making a Digital Workplace a Reality
PDF
Intel IT Cloud Strategy
PDF
Your Favourite Quotes
PPTX
The Vortex of Change - Digital Transformation (Presented by Intel)
PPTX
Evolution of Big Data at Intel - Crawl, Walk and Run Approach
Intel’s Strategy in IoT Standards
intel-it-annual-performance-report-2014-15-paper-l
Ron Kasabian - Intel Big Data & Cloud Summit 2013
Internet of things - 2016 trends.
Austin Cherian: Big data and HPC technologies - intel
Intel Cloud Foundry and OpenStack
Workplace Transformation: Intel’s Vision for Embracing Change and Innovation
Big Data Intel® Platform
Day 2 aziz apj aziz_big_datakeynote_press
Three Steps to Making a Digital Workplace a Reality
Intel IT Cloud Strategy
Your Favourite Quotes
The Vortex of Change - Digital Transformation (Presented by Intel)
Evolution of Big Data at Intel - Crawl, Walk and Run Approach
Ad

Similar to Girish Juneja - Intel Big Data & Cloud Summit 2013 (20)

PPTX
Hadoop in the Cloud: Common Architectural Patterns
PDF
Analytics&IoT
PDF
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
PDF
Machine Data Analytics
PPTX
Skilwise Big data
PPTX
Skillwise Big Data part 2
PDF
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
PPTX
Derfor skal du bruge en DataLake
PPTX
Big data analytics and machine intelligence v5.0
PDF
Ibm pure data system for analytics n3001
PDF
J1 - Keynote Data Platform - Rohan Kumar
PDF
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
PPTX
Delivering fast, powerful and scalable analytics
PDF
Customer value analysis of big data products
PDF
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
PPTX
PDF
Future of Power: Power Strategy and Offerings for Denmark - Steve Sibley
PPTX
Unlocking Operational Intelligence from the Data Lake
PDF
Webinar: Faster Big Data Analytics with MongoDB
PDF
Data & Analytics with CIS & Microsoft Platforms
Hadoop in the Cloud: Common Architectural Patterns
Analytics&IoT
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
Machine Data Analytics
Skilwise Big data
Skillwise Big Data part 2
Sudhir Rawat, Sr Techonology Evangelist at Microsoft SQL Business Intelligenc...
Derfor skal du bruge en DataLake
Big data analytics and machine intelligence v5.0
Ibm pure data system for analytics n3001
J1 - Keynote Data Platform - Rohan Kumar
Cortana Analytics Workshop: The "Big Data" of the Cortana Analytics Suite, Pa...
Delivering fast, powerful and scalable analytics
Customer value analysis of big data products
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
Future of Power: Power Strategy and Offerings for Denmark - Steve Sibley
Unlocking Operational Intelligence from the Data Lake
Webinar: Faster Big Data Analytics with MongoDB
Data & Analytics with CIS & Microsoft Platforms

More from IntelAPAC (20)

PDF
Intel apj cloud big data summit sdi press briefing - panhorst
PDF
Netweb flytxt-big-data-case-study
PDF
2 pc enterprise summit cronin newfinal aug 18
PDF
5 Cronin Steen - IOT Smart Cities
PDF
Gab Genai Cloudera - Going Beyond Traditional Analytic
PDF
1 RK Hiremane
PDF
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
PDF
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
PDF
Intel APJ Enterprise Day - Intel puts Automotive Innovation into High Gear
PDF
Intel APJ Enterprise Day - Intro to Intel Collaboration Centre
PDF
Intel APJ Enterprise Day - Strategic IT, A New Way of Business
PDF
Intel APJ Enterprise Day - Keynote by RK Hiremane
PDF
Intel APJ Enterprise Day - Introduction to Intel Kabushiki Kaisha
PDF
RedHat - Intel Big Data & Cloud Summit 2013
PDF
Greg Brown - Intel Big Data & Cloud Summit 2013
PDF
TWSE - Intel Big Data & Cloud Summit 2013
PDF
Lynn Comp - Intel Big Data & Cloud Summit 2013 (2)
PDF
Lynn Comp - Big Data & Cloud Summit 2013
PDF
Designed in Asia: Intel's Manufacturing Powerhouse in Asia
PDF
Designed in Asia: Welcome
Intel apj cloud big data summit sdi press briefing - panhorst
Netweb flytxt-big-data-case-study
2 pc enterprise summit cronin newfinal aug 18
5 Cronin Steen - IOT Smart Cities
Gab Genai Cloudera - Going Beyond Traditional Analytic
1 RK Hiremane
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
Intel APJ Enterprise Day - Synopses of Demos at Intel Collaboration Center
Intel APJ Enterprise Day - Intel puts Automotive Innovation into High Gear
Intel APJ Enterprise Day - Intro to Intel Collaboration Centre
Intel APJ Enterprise Day - Strategic IT, A New Way of Business
Intel APJ Enterprise Day - Keynote by RK Hiremane
Intel APJ Enterprise Day - Introduction to Intel Kabushiki Kaisha
RedHat - Intel Big Data & Cloud Summit 2013
Greg Brown - Intel Big Data & Cloud Summit 2013
TWSE - Intel Big Data & Cloud Summit 2013
Lynn Comp - Intel Big Data & Cloud Summit 2013 (2)
Lynn Comp - Big Data & Cloud Summit 2013
Designed in Asia: Intel's Manufacturing Powerhouse in Asia
Designed in Asia: Welcome

Recently uploaded (20)

PPTX
Telecom Fraud Prevention Guide | Hyperlink InfoSystem
PDF
DevOps & Developer Experience Summer BBQ
PDF
Sensors and Actuators in IoT Systems using pdf
PDF
Google’s NotebookLM Unveils Video Overviews
PDF
Building High-Performance Oracle Teams: Strategic Staffing for Database Manag...
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
KodekX | Application Modernization Development
PPTX
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
PPTX
Web Security: Login Bypass, SQLi, CSRF & XSS.pptx
PDF
Event Presentation Google Cloud Next Extended 2025
PDF
creating-agentic-ai-solutions-leveraging-aws.pdf
PDF
Automating ArcGIS Content Discovery with FME: A Real World Use Case
PDF
How AI Agents Improve Data Accuracy and Consistency in Due Diligence.pdf
PPTX
How to Build Crypto Derivative Exchanges from Scratch.pptx
PDF
How Onsite IT Support Drives Business Efficiency, Security, and Growth.pdf
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
ABU RAUP TUGAS TIK kelas 8 hjhgjhgg.pptx
PDF
Top Generative AI Tools for Patent Drafting in 2025.pdf
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Telecom Fraud Prevention Guide | Hyperlink InfoSystem
DevOps & Developer Experience Summer BBQ
Sensors and Actuators in IoT Systems using pdf
Google’s NotebookLM Unveils Video Overviews
Building High-Performance Oracle Teams: Strategic Staffing for Database Manag...
Understanding_Digital_Forensics_Presentation.pptx
KodekX | Application Modernization Development
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
Web Security: Login Bypass, SQLi, CSRF & XSS.pptx
Event Presentation Google Cloud Next Extended 2025
creating-agentic-ai-solutions-leveraging-aws.pdf
Automating ArcGIS Content Discovery with FME: A Real World Use Case
How AI Agents Improve Data Accuracy and Consistency in Due Diligence.pdf
How to Build Crypto Derivative Exchanges from Scratch.pptx
How Onsite IT Support Drives Business Efficiency, Security, and Growth.pdf
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
ABU RAUP TUGAS TIK kelas 8 hjhgjhgg.pptx
Top Generative AI Tools for Patent Drafting in 2025.pdf
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy

Girish Juneja - Intel Big Data & Cloud Summit 2013

  • 1. APAC Big Data & Cloud Summit 2013 Girish Juneja GM, Big Data Software Software & Services Group
  • 2. Data Fab Transistor System Enablement Optimization Intelligence
  • 3. Data 30 million networked sensors growing at 30% a year Computing 1 trillion devices connected to the Internet by 2015 Experience 500 million smart phone users increasing 20% a year Social Machine Generated User Generated Feedback loops driving exponential growth
  • 4. Evolving towards end-to-end real-time analytics Decade Paradigm Architecture Platform • Reporting / Data Mining • High Cost / Isolated use 90s 2000s Today • Model-based discovery • High Cost / Dept Use • Unbounded Map Reduce Query • Low Cost / Enterprise Use • Arrival of vast amounts of unstructured data • Batch – “sales reports” • Sequential SQL queries • Batch-ie correlated buying pattern • No SQL. parallel analysis • Shared disk/memory Unlimited Linear Scale RDMS Proprietary MPP/ DW Appliance Open Source SW loosely coupled to commodity HW No SQL RDMS Scale Scale NodeNode • Real-time - ie recommend engine • Process @ storage node • Built-in data replication/reliability • Shared nothing, in memory Distributed node addition NodeNode Node Multi-core Node
  • 5. Make big data work for you Amount of data your enterprise will need to ingest: 50X Proportion of data that is useful to you: 10% Projected increase in your IT budget: 10% => Business as usual is not an option
  • 7. Using volume economics to drive innovation Intel
  • 8. Fabricating silicon for big data 22nm A Revolutionary Leap in Process Technology 37% Performance Gain at Low Voltage1 >50% Active Power Reduction at Constant Performance1 Intel lead vs. Industry 3.5 years 2007 45 nm 2009 32 nm 2011 22 nm High-k Metal Gate Tri Gate Intel lead vs. Industry 4 years
  • 9. Intel® Xeon® Processor E5-4600 Product Family Highest reliability & scalability Highest memory capacity Highest enterprise & database performance Density-optimized Cost-optimized Improved HPC performance 1 Source: Published results as of 8 May 2012. See http://www.intel.com/performance/server/xeonE7/summary.htm for full list of benchmarks and configuration details. Pumping the heart of the open datacenter Intel® Xeon® Processor E7-4800 Product Family Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products.
  • 10. Enabling open source solutions Optimize software to take advantage of Intel® architecture AES-NI SSD, 10GbE TXTMCAVT-* 3x performance in 3 years Mission Critical deployments Accelerates Crypto in JBoss 30x throughput Trusted Compute Pools
  • 11. Contributing to Apache Hadoop • File based encryption for Hadoop jobs • ACLs for HDFS and HBase at cell level • Flash storage for MapReduce shuffle data • Caching and non-volatile memory for increased throughput • HDFS adaptive replication of hot-files • HBase distributed tables across data centers • HDFS data replication across data centers • Archival storage support for cold data on HDFS • SSE Instructions • JVM Enhancements • Infiniband RDMA Support
  • 12. Supporting Intel Distribution for Apache Hadoop Data Mining Graph Analytics Full Text SearchFull SQL Batch Analytics Security
  • 13. Intel® Distribution for Apache Hadoop* software Granular access control in HBase Up to 20X faster crypto with AES-NI* 30X faster Terasort on Intel® Xeon processors, Intel 10GbE, and SSD Up to 8.5X faster queries in Hive* Job profiling and configuration, automated by Intel® Active Tuner *Based on internal testing Rhino Cloud HPC Common authentication, access control, auditing Bringing MapReduce to data on Lustre FS Enabling real-time 100% SQL on Hadoop Optimizing Hadoop for virtualization & cloud
  • 14. Backed by portfolio of datacenter products Software NetworkStorage & MemoryServer Cache Acceleration Software
  • 15. With broad support from the ecosystem * Other names and brands may be claimed as the property of others.
  • 16. Proven in the enterprise Using the Intel® Distribution to gain tremendous results * Other names and brands may be claimed as the property of others. IT
  • 17. Putting advanced capabilities at work… • Expose new data • Dashboard/historical reporting • Real-time campaigns • Vertical apps • Predictive data services • Graph visualization • Log analysis to solve real use cases • Fraud & threat detection • Life sciences research • Behavioral analysis • Warranty analysis • Customer segmentation • Infrastructure optimization From Hype to High Performance
  • 18. Data-Driven Business: Customer Service Value • Enable subscriber access to billing data • 30X gain in performance; lower TCO Analytics • Provides real-time retrieval of 6 months data • Supports new BI with 15 types of queries • Enables targeted ad serving and promotions Data Management • 30 TB/month of billing data • 300K reads/second; 800K inserts/second • 133-node cluster / Intel Xeon E5 processors CDR Subscriber Self Service Intel Distribution
  • 19. Value Enable researchers to discover biomarkers and drug targets by correlating genomic data sets 90% gain in throughput; 6X data compression Analytics Provide curated data sets with pre-computed analysis (classification, correlation, biomarkers) Provide APIs for applications to combine and analyze public and private data sets Data Management Use Hive and Hadoop for query and search Dynamically partition and scale Hbase 10-node cluster / Intel Xeon E5 processors / 10GbE Data-Intensive Discovery: Genomics Intel Distribution
  • 20. Data-Rich Communities: Smart City Value • Enforce traffic laws and detect license fraud • Monitor and predict traffic patterns • In a city of 31 million people Analytics • Detect traffic law violations automatically • Detect driver license fraud by data mining • Forecast traffic with predictive analytics Data Management • 30,000 cameras • 6Mb/s stream rate per camera • 15 PB of images in use / 2B records in HBase Detection Prevention Regional Local
  • 21. Catalyzing the ecosystem Foster the ecosystem and develop new markets for Intel and its partners