SlideShare a Scribd company logo
© ALTOROS Systems | CONFIDENTIAL
“The norm for data analytics is now to run them on commodity clusters with
MapReduce-like abstractions. One only needs to read the popular blogs to see the
evidence of this. We believe that we could now say that
“nobody ever got fired
for using Hadoop on a cluster”!
© ALTOROS Systems | CONFIDENTIAL
Breaking
News
IBM Keynote at JavaOne 2013: Java Flies in Blue Skies and Open Clouds
Java and GPUs open up a world of new opportunities
for GPU accelerators and Java programmers alike.
© ALTOROS Systems | CONFIDENTIAL
Breaking
News
Duimovich showed an example of GPU acceleration
of sorting using standard NVIDIA CUDA libraries
that are already available!
The speedups are phenomenal — ranging from 2x to 48x faster!
© ALTOROS Systems | CONFIDENTIAL
Breaking
News?
© ALTOROS Systems | CONFIDENTIAL
Breaking
News?
© ALTOROS Systems | CONFIDENTIAL
Breaking
Hadoop
© ALTOROS Systems | CONFIDENTIAL
Breaking
Hadoop
10 000x faster
© ALTOROS Systems | CONFIDENTIAL
Breaking
Hadoop
10 000x faster
© ALTOROS Systems | CONFIDENTIAL
Hadoop vs GPU
Hadoop & GPU
Hadoop + GPU
HPC
Big Data
GPGPU in Java
Heterogeneous systems
Horizontal and vertical scalability
© ALTOROS Systems | CONFIDENTIAL
Hadoop horizontal scalability
file01 file02 file03
© ALTOROS Systems | CONFIDENTIAL
Hadoop horizontal scalability
file01 file02 file03
© ALTOROS Systems | CONFIDENTIAL
Hadoop horizontal scalability
file01 file02 file03
Node 1 Node 2 Node 3
01 02 03 04 05 06 07 08 09 10
01
02
03
04
05 0607 0809 10
© ALTOROS Systems | CONFIDENTIAL
Hadoop horizontal scalability
file01 file02 file03
Node 1 Node 2 Node 3
01 02 03 04 05 06 07 08 09 10
01
02
03
04
05 0607 0809 10
3 4 3
© ALTOROS Systems | CONFIDENTIAL
Hadoop horizontal scalability
file01 file02 file03
Node 1 Node 2 Node 3
01 02 03 04 05 06 07 08 09 10
01
02
03
04
05 0607 0809 10
3 4 3
Node 1 Node 2 Node 3
01 02
03 04
05 06
07 08
09 10
Node 4 Node 5 Node 6
01 02 03
04
05 06 07
08 09 10
© ALTOROS Systems | CONFIDENTIAL
Hadoop horizontal scalability
file01 file02 file03
Node 1 Node 2 Node 3
01 02 03 04 05 06 07 08 09 10
01
02
03
04
05 0607 0809 10
3 4 3
Node 1 Node 2 Node 3
01 02
03 04
05 06
07 08
09 10
Node 4 Node 5 Node 6
01 02 03
04
05 06 07
08 09 10
221 1 2 2
© ALTOROS Systems | CONFIDENTIAL
Hadoop horizontal scalability
Node 1 Node 2 Node 3
01 02
03 04
05 06
07 08
09 10
Node 4 Node 5 Node 6
01 02 03
04
05 06 07
08 09 10
221 1 2 2
© ALTOROS Systems | CONFIDENTIAL
Hadoop horizontal scalability
Node 1 Node 2 Node 3
01 02
03 04
05 06
07 08
09 10
Node 4 Node 5 Node 6
01 02 03
04
05 06 07
08 09 10
221 1 2 2
© ALTOROS Systems | CONFIDENTIAL
Use GPU to scale vertically
Node 1 Node 2 Node 3
01 02
03 04
05 06
07 08
09 10
Node 4 Node 5 Node 6
01 02 03
04
05 06 07
08 09 10
221 1 2 20.5 1 1 0.5 1 1
© ALTOROS Systems | CONFIDENTIAL
Profit estimation
“Debunking the 100X GPU vs. CPU myth: an evaluation of throughput computing on CPU and GPU” by Intel
NVidia GTX280
vs
Intel Core i7-960
© ALTOROS Systems | CONFIDENTIAL
Profit estimation
“Debunking the 100X GPU vs. CPU myth: an evaluation of throughput computing on CPU and GPU” by Intel
“OpenCL: the advantages of heterogeneous approach” by Intel
NVidia GTX280
vs
Intel Core i7-960
© ALTOROS Systems | CONFIDENTIAL
How to use OpenCL?
© ALTOROS Systems | CONFIDENTIAL
How to use OpenCL?
© ALTOROS Systems | CONFIDENTIAL
How to use OpenCL?
Hadoop streaming
© ALTOROS Systems | CONFIDENTIAL
Aparapi
Expands Java's “Write Once Run Anywhere” to include APU and GPU devices
by expressing data parallel algorithm through extending Kernel base class.
MyKernel.class
© ALTOROS Systems | CONFIDENTIAL
Aparapi
Expands Java's “Write Once Run Anywhere” to include APU and GPU devices
by expressing data parallel algorithm through extending Kernel base class.
MyKernel.class
Platform
Supports
OpenCL?
© ALTOROS Systems | CONFIDENTIAL
Aparapi
Expands Java's “Write Once Run Anywhere” to include APU and GPU devices
by expressing data parallel algorithm through extending Kernel base class.
MyKernel.class
Platform
Supports
OpenCL?
Execute using
Java Thread Pool
© ALTOROS Systems | CONFIDENTIAL
Aparapi
Expands Java's “Write Once Run Anywhere” to include APU and GPU devices
by expressing data parallel algorithm through extending Kernel base class.
MyKernel.class
Platform
Supports
OpenCL?
Bytecode can
be converted
to OpenCL?
Execute using
Java Thread Pool
© ALTOROS Systems | CONFIDENTIAL
Aparapi
Expands Java's “Write Once Run Anywhere” to include APU and GPU devices
by expressing data parallel algorithm through extending Kernel base class.
MyKernel.class
Platform
Supports
OpenCL?
Bytecode can
be converted
to OpenCL?
Convert it
Execute OpenCL
Kernel on DeviceExecute using
Java Thread Pool
© ALTOROS Systems | CONFIDENTIAL
Aparapi
Expands Java's “Write Once Run Anywhere” to include APU and GPU devices
by expressing data parallel algorithm through extending Kernel base class.
© ALTOROS Systems | CONFIDENTIAL
Aparapi
Expands Java's “Write Once Run Anywhere” to include APU and GPU devices
by expressing data parallel algorithm through extending Kernel base class.
© ALTOROS Systems | CONFIDENTIAL
Aparapi
Expands Java's “Write Once Run Anywhere” to include APU and GPU devices
by expressing data parallel algorithm through extending Kernel base class.
© ALTOROS Systems | CONFIDENTIAL
Aparapi
Expands Java's “Write Once Run Anywhere” to include APU and GPU devices
by expressing data parallel algorithm through extending Kernel base class.
lambda
© ALTOROS Systems | CONFIDENTIAL
Aparapi
Expands Java's “Write Once Run Anywhere” to include APU and GPU devices
by expressing data parallel algorithm through extending Kernel base class.
lambda
HSA
© ALTOROS Systems | CONFIDENTIAL
Aparapi
Characteristics of ideal data parallel workload
© ALTOROS Systems | CONFIDENTIAL
Aparapi
Characteristics of ideal data parallel workload
Code which iterates over large arrays of primitives
- 32/64 bit data types preferred
- where the order of iterations is not critical
avoid data dependencies between iterations
- each iteration contains sequential code (few branches)
© ALTOROS Systems | CONFIDENTIAL
Aparapi
Characteristics of ideal data parallel workload
Code which iterates over large arrays of primitives
- 32/64 bit data types preferred
- where the order of iterations is not critical
avoid data dependencies between iterations
- each iteration contains sequential code (few branches)
Balance between data size (low) and compute (high)
- data transfer to/from the GPU can be costly
- trivial compute not worth the transfer cost
- may still benefit by freeing up CPU for other work(?)
© ALTOROS Systems | CONFIDENTIAL
HadoopCL
Rice University, AMD
© ALTOROS Systems | CONFIDENTIAL
HadoopCL
© ALTOROS Systems | CONFIDENTIAL
HadoopCL
© ALTOROS Systems | CONFIDENTIAL
HadoopCL
© ALTOROS Systems | CONFIDENTIAL
HadoopCL
2 six-core Intel X5660
(48 GB mem)
2 NVidia Tesla M2050
(2*2.5 GB mem)
AMD A10-5800K APU
(16 GB mem)
© ALTOROS Systems | CONFIDENTIAL
HadoopCL
2 six-core Intel X5660
(48 GB mem)
2 NVidia Tesla M2050
(2*2.5 GB mem)
AMD A10-5800K APU
(16 GB mem)
WHY?
© ALTOROS Systems | CONFIDENTIAL
HadoopCL
© ALTOROS Systems | CONFIDENTIAL
Back to OpenCL, Aparapi and heterogeneous computing
© ALTOROS Systems | CONFIDENTIAL
OpenCL, Aparapi and heterogeneous computing
GPU cache
GPU GDDR5
CPU cache
SATA 3.0 (HDD)
SATA 2.0 (SSD)
1 GBit networkFormula in terms of time:
(CPU calc1) + disk read + disk write
>
(CPU calc2 + GPU calc + GPU-write + GPU-read) + disk read + disk write
© ALTOROS Systems | CONFIDENTIAL
OpenCL future
© ALTOROS Systems | CONFIDENTIAL
OpenCL future
http://streamcomputing.eu/
© ALTOROS Systems | CONFIDENTIAL
Questions?
Big Data Experts FB group
Ad

More Related Content

What's hot (20)

LCU13: GPGPU on ARM Experience Report
LCU13: GPGPU on ARM Experience ReportLCU13: GPGPU on ARM Experience Report
LCU13: GPGPU on ARM Experience Report
Linaro
 
Distributed Deep Learning with Apache Spark and TensorFlow with Jim Dowling
Distributed Deep Learning with Apache Spark and TensorFlow with Jim DowlingDistributed Deep Learning with Apache Spark and TensorFlow with Jim Dowling
Distributed Deep Learning with Apache Spark and TensorFlow with Jim Dowling
Databricks
 
JMI Techtalk: 한재근 - How to use GPU for developing AI
JMI Techtalk: 한재근 - How to use GPU for developing AIJMI Techtalk: 한재근 - How to use GPU for developing AI
JMI Techtalk: 한재근 - How to use GPU for developing AI
Lablup Inc.
 
Leveraging GPU-Accelerated Analytics on top of Apache Spark with Todd Mostak
Leveraging GPU-Accelerated Analytics on top of Apache Spark with Todd MostakLeveraging GPU-Accelerated Analytics on top of Apache Spark with Todd Mostak
Leveraging GPU-Accelerated Analytics on top of Apache Spark with Todd Mostak
Databricks
 
Parallel Linear Regression in Interative Reduce and YARN
Parallel Linear Regression in Interative Reduce and YARNParallel Linear Regression in Interative Reduce and YARN
Parallel Linear Regression in Interative Reduce and YARN
DataWorks Summit
 
Making Hardware Accelerator Easier to Use
Making Hardware Accelerator Easier to UseMaking Hardware Accelerator Easier to Use
Making Hardware Accelerator Easier to Use
Kazuaki Ishizaki
 
GPU Support In Spark And GPU/CPU Mixed Resource Scheduling At Production Scale
GPU Support In Spark And GPU/CPU Mixed Resource Scheduling At Production ScaleGPU Support In Spark And GPU/CPU Mixed Resource Scheduling At Production Scale
GPU Support In Spark And GPU/CPU Mixed Resource Scheduling At Production Scale
Spark Summit
 
MIT's experience on OpenPOWER/POWER 9 platform
MIT's experience on OpenPOWER/POWER 9 platformMIT's experience on OpenPOWER/POWER 9 platform
MIT's experience on OpenPOWER/POWER 9 platform
Ganesan Narayanasamy
 
GPU Computing With Apache Spark And Python
GPU Computing With Apache Spark And PythonGPU Computing With Apache Spark And Python
GPU Computing With Apache Spark And Python
Jen Aman
 
MEW22 22nd Machine Evaluation Workshop Microsoft
MEW22 22nd Machine Evaluation Workshop MicrosoftMEW22 22nd Machine Evaluation Workshop Microsoft
MEW22 22nd Machine Evaluation Workshop Microsoft
Lee Stott
 
Optimizing High Performance Computing Applications for Energy
Optimizing High Performance Computing Applications for EnergyOptimizing High Performance Computing Applications for Energy
Optimizing High Performance Computing Applications for Energy
David Lecomber
 
Distributed Model Training using MXNet with Horovod
Distributed Model Training using MXNet with HorovodDistributed Model Training using MXNet with Horovod
Distributed Model Training using MXNet with Horovod
Lin Yuan
 
Which Is Deeper - Comparison Of Deep Learning Frameworks On Spark
 Which Is Deeper - Comparison Of Deep Learning Frameworks On Spark Which Is Deeper - Comparison Of Deep Learning Frameworks On Spark
Which Is Deeper - Comparison Of Deep Learning Frameworks On Spark
Spark Summit
 
Overview of Scientific Workflows - Why Use Them?
Overview of Scientific Workflows - Why Use Them?Overview of Scientific Workflows - Why Use Them?
Overview of Scientific Workflows - Why Use Them?
inside-BigData.com
 
Open Source RAPIDS GPU Platform to Accelerate Predictive Data Analytics
Open Source RAPIDS GPU Platform to Accelerate Predictive Data AnalyticsOpen Source RAPIDS GPU Platform to Accelerate Predictive Data Analytics
Open Source RAPIDS GPU Platform to Accelerate Predictive Data Analytics
inside-BigData.com
 
Project Hydrogen: Unifying State-of-the-Art AI and Big Data in Apache Spark w...
Project Hydrogen: Unifying State-of-the-Art AI and Big Data in Apache Spark w...Project Hydrogen: Unifying State-of-the-Art AI and Big Data in Apache Spark w...
Project Hydrogen: Unifying State-of-the-Art AI and Big Data in Apache Spark w...
Databricks
 
RAPIDS – Open GPU-accelerated Data Science
RAPIDS – Open GPU-accelerated Data ScienceRAPIDS – Open GPU-accelerated Data Science
RAPIDS – Open GPU-accelerated Data Science
Data Works MD
 
Ac922 cdac webinar
Ac922 cdac webinarAc922 cdac webinar
Ac922 cdac webinar
Ganesan Narayanasamy
 
Google Cloud Platform Empowers TensorFlow and Machine Learning
Google Cloud Platform Empowers TensorFlow and Machine LearningGoogle Cloud Platform Empowers TensorFlow and Machine Learning
Google Cloud Platform Empowers TensorFlow and Machine Learning
DataWorks Summit/Hadoop Summit
 
Accelerated Machine Learning with RAPIDS and MLflow, Nvidia/RAPIDS
Accelerated Machine Learning with RAPIDS and MLflow, Nvidia/RAPIDSAccelerated Machine Learning with RAPIDS and MLflow, Nvidia/RAPIDS
Accelerated Machine Learning with RAPIDS and MLflow, Nvidia/RAPIDS
Databricks
 
LCU13: GPGPU on ARM Experience Report
LCU13: GPGPU on ARM Experience ReportLCU13: GPGPU on ARM Experience Report
LCU13: GPGPU on ARM Experience Report
Linaro
 
Distributed Deep Learning with Apache Spark and TensorFlow with Jim Dowling
Distributed Deep Learning with Apache Spark and TensorFlow with Jim DowlingDistributed Deep Learning with Apache Spark and TensorFlow with Jim Dowling
Distributed Deep Learning with Apache Spark and TensorFlow with Jim Dowling
Databricks
 
JMI Techtalk: 한재근 - How to use GPU for developing AI
JMI Techtalk: 한재근 - How to use GPU for developing AIJMI Techtalk: 한재근 - How to use GPU for developing AI
JMI Techtalk: 한재근 - How to use GPU for developing AI
Lablup Inc.
 
Leveraging GPU-Accelerated Analytics on top of Apache Spark with Todd Mostak
Leveraging GPU-Accelerated Analytics on top of Apache Spark with Todd MostakLeveraging GPU-Accelerated Analytics on top of Apache Spark with Todd Mostak
Leveraging GPU-Accelerated Analytics on top of Apache Spark with Todd Mostak
Databricks
 
Parallel Linear Regression in Interative Reduce and YARN
Parallel Linear Regression in Interative Reduce and YARNParallel Linear Regression in Interative Reduce and YARN
Parallel Linear Regression in Interative Reduce and YARN
DataWorks Summit
 
Making Hardware Accelerator Easier to Use
Making Hardware Accelerator Easier to UseMaking Hardware Accelerator Easier to Use
Making Hardware Accelerator Easier to Use
Kazuaki Ishizaki
 
GPU Support In Spark And GPU/CPU Mixed Resource Scheduling At Production Scale
GPU Support In Spark And GPU/CPU Mixed Resource Scheduling At Production ScaleGPU Support In Spark And GPU/CPU Mixed Resource Scheduling At Production Scale
GPU Support In Spark And GPU/CPU Mixed Resource Scheduling At Production Scale
Spark Summit
 
MIT's experience on OpenPOWER/POWER 9 platform
MIT's experience on OpenPOWER/POWER 9 platformMIT's experience on OpenPOWER/POWER 9 platform
MIT's experience on OpenPOWER/POWER 9 platform
Ganesan Narayanasamy
 
GPU Computing With Apache Spark And Python
GPU Computing With Apache Spark And PythonGPU Computing With Apache Spark And Python
GPU Computing With Apache Spark And Python
Jen Aman
 
MEW22 22nd Machine Evaluation Workshop Microsoft
MEW22 22nd Machine Evaluation Workshop MicrosoftMEW22 22nd Machine Evaluation Workshop Microsoft
MEW22 22nd Machine Evaluation Workshop Microsoft
Lee Stott
 
Optimizing High Performance Computing Applications for Energy
Optimizing High Performance Computing Applications for EnergyOptimizing High Performance Computing Applications for Energy
Optimizing High Performance Computing Applications for Energy
David Lecomber
 
Distributed Model Training using MXNet with Horovod
Distributed Model Training using MXNet with HorovodDistributed Model Training using MXNet with Horovod
Distributed Model Training using MXNet with Horovod
Lin Yuan
 
Which Is Deeper - Comparison Of Deep Learning Frameworks On Spark
 Which Is Deeper - Comparison Of Deep Learning Frameworks On Spark Which Is Deeper - Comparison Of Deep Learning Frameworks On Spark
Which Is Deeper - Comparison Of Deep Learning Frameworks On Spark
Spark Summit
 
Overview of Scientific Workflows - Why Use Them?
Overview of Scientific Workflows - Why Use Them?Overview of Scientific Workflows - Why Use Them?
Overview of Scientific Workflows - Why Use Them?
inside-BigData.com
 
Open Source RAPIDS GPU Platform to Accelerate Predictive Data Analytics
Open Source RAPIDS GPU Platform to Accelerate Predictive Data AnalyticsOpen Source RAPIDS GPU Platform to Accelerate Predictive Data Analytics
Open Source RAPIDS GPU Platform to Accelerate Predictive Data Analytics
inside-BigData.com
 
Project Hydrogen: Unifying State-of-the-Art AI and Big Data in Apache Spark w...
Project Hydrogen: Unifying State-of-the-Art AI and Big Data in Apache Spark w...Project Hydrogen: Unifying State-of-the-Art AI and Big Data in Apache Spark w...
Project Hydrogen: Unifying State-of-the-Art AI and Big Data in Apache Spark w...
Databricks
 
RAPIDS – Open GPU-accelerated Data Science
RAPIDS – Open GPU-accelerated Data ScienceRAPIDS – Open GPU-accelerated Data Science
RAPIDS – Open GPU-accelerated Data Science
Data Works MD
 
Google Cloud Platform Empowers TensorFlow and Machine Learning
Google Cloud Platform Empowers TensorFlow and Machine LearningGoogle Cloud Platform Empowers TensorFlow and Machine Learning
Google Cloud Platform Empowers TensorFlow and Machine Learning
DataWorks Summit/Hadoop Summit
 
Accelerated Machine Learning with RAPIDS and MLflow, Nvidia/RAPIDS
Accelerated Machine Learning with RAPIDS and MLflow, Nvidia/RAPIDSAccelerated Machine Learning with RAPIDS and MLflow, Nvidia/RAPIDS
Accelerated Machine Learning with RAPIDS and MLflow, Nvidia/RAPIDS
Databricks
 

Viewers also liked (20)

IS-4011, Accelerating Analytics on HADOOP using OpenCL, by Zubin Dowlaty and ...
IS-4011, Accelerating Analytics on HADOOP using OpenCL, by Zubin Dowlaty and ...IS-4011, Accelerating Analytics on HADOOP using OpenCL, by Zubin Dowlaty and ...
IS-4011, Accelerating Analytics on HADOOP using OpenCL, by Zubin Dowlaty and ...
AMD Developer Central
 
PG-Strom - GPGPU meets PostgreSQL, PGcon2015
PG-Strom - GPGPU meets PostgreSQL, PGcon2015PG-Strom - GPGPU meets PostgreSQL, PGcon2015
PG-Strom - GPGPU meets PostgreSQL, PGcon2015
Kohei KaiGai
 
PG-Strom
PG-StromPG-Strom
PG-Strom
Kohei KaiGai
 
GPUs in Big Data - StampedeCon 2014
GPUs in Big Data - StampedeCon 2014GPUs in Big Data - StampedeCon 2014
GPUs in Big Data - StampedeCon 2014
StampedeCon
 
GPU Ecosystem
GPU EcosystemGPU Ecosystem
GPU Ecosystem
Ofer Rosenberg
 
SIGGRAPH 2012: GPU-Accelerated 2D and Web Rendering
SIGGRAPH 2012: GPU-Accelerated 2D and Web RenderingSIGGRAPH 2012: GPU-Accelerated 2D and Web Rendering
SIGGRAPH 2012: GPU-Accelerated 2D and Web Rendering
Mark Kilgard
 
PG-Strom - GPU Accelerated Asyncr
PG-Strom - GPU Accelerated AsyncrPG-Strom - GPU Accelerated Asyncr
PG-Strom - GPU Accelerated Asyncr
Kohei KaiGai
 
Deep learning on spark
Deep learning on sparkDeep learning on spark
Deep learning on spark
Satyendra Rana
 
Computational Techniques for the Statistical Analysis of Big Data in R
Computational Techniques for the Statistical Analysis of Big Data in RComputational Techniques for the Statistical Analysis of Big Data in R
Computational Techniques for the Statistical Analysis of Big Data in R
herbps10
 
GTC 2012: GPU-Accelerated Path Rendering
GTC 2012: GPU-Accelerated Path RenderingGTC 2012: GPU-Accelerated Path Rendering
GTC 2012: GPU-Accelerated Path Rendering
Mark Kilgard
 
Accelerating Machine Learning Applications on Spark Using GPUs
Accelerating Machine Learning Applications on Spark Using GPUsAccelerating Machine Learning Applications on Spark Using GPUs
Accelerating Machine Learning Applications on Spark Using GPUs
IBM
 
Enabling Graph Analytics at Scale: The Opportunity for GPU-Acceleration of D...
Enabling Graph Analytics at Scale:  The Opportunity for GPU-Acceleration of D...Enabling Graph Analytics at Scale:  The Opportunity for GPU-Acceleration of D...
Enabling Graph Analytics at Scale: The Opportunity for GPU-Acceleration of D...
odsc
 
Heterogeneous System Architecture Overview
Heterogeneous System Architecture OverviewHeterogeneous System Architecture Overview
Heterogeneous System Architecture Overview
inside-BigData.com
 
PyData Amsterdam - Name Matching at Scale
PyData Amsterdam - Name Matching at ScalePyData Amsterdam - Name Matching at Scale
PyData Amsterdam - Name Matching at Scale
GoDataDriven
 
Deep Learning on Hadoop
Deep Learning on HadoopDeep Learning on Hadoop
Deep Learning on Hadoop
DataWorks Summit
 
From Machine Learning to Learning Machines: Creating an End-to-End Cognitive ...
From Machine Learning to Learning Machines: Creating an End-to-End Cognitive ...From Machine Learning to Learning Machines: Creating an End-to-End Cognitive ...
From Machine Learning to Learning Machines: Creating an End-to-End Cognitive ...
Spark Summit
 
DeepLearning4J and Spark: Successes and Challenges - François Garillot
DeepLearning4J and Spark: Successes and Challenges - François GarillotDeepLearning4J and Spark: Successes and Challenges - François Garillot
DeepLearning4J and Spark: Successes and Challenges - François Garillot
sparktc
 
How to Solve Real-Time Data Problems
How to Solve Real-Time Data ProblemsHow to Solve Real-Time Data Problems
How to Solve Real-Time Data Problems
IBM Power Systems
 
Containerizing GPU Applications with Docker for Scaling to the Cloud
Containerizing GPU Applications with Docker for Scaling to the CloudContainerizing GPU Applications with Docker for Scaling to the Cloud
Containerizing GPU Applications with Docker for Scaling to the Cloud
Subbu Rama
 
Tallinn Estonia Advanced Java Meetup Spark + TensorFlow = TensorFrames Oct 24...
Tallinn Estonia Advanced Java Meetup Spark + TensorFlow = TensorFrames Oct 24...Tallinn Estonia Advanced Java Meetup Spark + TensorFlow = TensorFrames Oct 24...
Tallinn Estonia Advanced Java Meetup Spark + TensorFlow = TensorFrames Oct 24...
Chris Fregly
 
IS-4011, Accelerating Analytics on HADOOP using OpenCL, by Zubin Dowlaty and ...
IS-4011, Accelerating Analytics on HADOOP using OpenCL, by Zubin Dowlaty and ...IS-4011, Accelerating Analytics on HADOOP using OpenCL, by Zubin Dowlaty and ...
IS-4011, Accelerating Analytics on HADOOP using OpenCL, by Zubin Dowlaty and ...
AMD Developer Central
 
PG-Strom - GPGPU meets PostgreSQL, PGcon2015
PG-Strom - GPGPU meets PostgreSQL, PGcon2015PG-Strom - GPGPU meets PostgreSQL, PGcon2015
PG-Strom - GPGPU meets PostgreSQL, PGcon2015
Kohei KaiGai
 
GPUs in Big Data - StampedeCon 2014
GPUs in Big Data - StampedeCon 2014GPUs in Big Data - StampedeCon 2014
GPUs in Big Data - StampedeCon 2014
StampedeCon
 
SIGGRAPH 2012: GPU-Accelerated 2D and Web Rendering
SIGGRAPH 2012: GPU-Accelerated 2D and Web RenderingSIGGRAPH 2012: GPU-Accelerated 2D and Web Rendering
SIGGRAPH 2012: GPU-Accelerated 2D and Web Rendering
Mark Kilgard
 
PG-Strom - GPU Accelerated Asyncr
PG-Strom - GPU Accelerated AsyncrPG-Strom - GPU Accelerated Asyncr
PG-Strom - GPU Accelerated Asyncr
Kohei KaiGai
 
Deep learning on spark
Deep learning on sparkDeep learning on spark
Deep learning on spark
Satyendra Rana
 
Computational Techniques for the Statistical Analysis of Big Data in R
Computational Techniques for the Statistical Analysis of Big Data in RComputational Techniques for the Statistical Analysis of Big Data in R
Computational Techniques for the Statistical Analysis of Big Data in R
herbps10
 
GTC 2012: GPU-Accelerated Path Rendering
GTC 2012: GPU-Accelerated Path RenderingGTC 2012: GPU-Accelerated Path Rendering
GTC 2012: GPU-Accelerated Path Rendering
Mark Kilgard
 
Accelerating Machine Learning Applications on Spark Using GPUs
Accelerating Machine Learning Applications on Spark Using GPUsAccelerating Machine Learning Applications on Spark Using GPUs
Accelerating Machine Learning Applications on Spark Using GPUs
IBM
 
Enabling Graph Analytics at Scale: The Opportunity for GPU-Acceleration of D...
Enabling Graph Analytics at Scale:  The Opportunity for GPU-Acceleration of D...Enabling Graph Analytics at Scale:  The Opportunity for GPU-Acceleration of D...
Enabling Graph Analytics at Scale: The Opportunity for GPU-Acceleration of D...
odsc
 
Heterogeneous System Architecture Overview
Heterogeneous System Architecture OverviewHeterogeneous System Architecture Overview
Heterogeneous System Architecture Overview
inside-BigData.com
 
PyData Amsterdam - Name Matching at Scale
PyData Amsterdam - Name Matching at ScalePyData Amsterdam - Name Matching at Scale
PyData Amsterdam - Name Matching at Scale
GoDataDriven
 
From Machine Learning to Learning Machines: Creating an End-to-End Cognitive ...
From Machine Learning to Learning Machines: Creating an End-to-End Cognitive ...From Machine Learning to Learning Machines: Creating an End-to-End Cognitive ...
From Machine Learning to Learning Machines: Creating an End-to-End Cognitive ...
Spark Summit
 
DeepLearning4J and Spark: Successes and Challenges - François Garillot
DeepLearning4J and Spark: Successes and Challenges - François GarillotDeepLearning4J and Spark: Successes and Challenges - François Garillot
DeepLearning4J and Spark: Successes and Challenges - François Garillot
sparktc
 
How to Solve Real-Time Data Problems
How to Solve Real-Time Data ProblemsHow to Solve Real-Time Data Problems
How to Solve Real-Time Data Problems
IBM Power Systems
 
Containerizing GPU Applications with Docker for Scaling to the Cloud
Containerizing GPU Applications with Docker for Scaling to the CloudContainerizing GPU Applications with Docker for Scaling to the Cloud
Containerizing GPU Applications with Docker for Scaling to the Cloud
Subbu Rama
 
Tallinn Estonia Advanced Java Meetup Spark + TensorFlow = TensorFrames Oct 24...
Tallinn Estonia Advanced Java Meetup Spark + TensorFlow = TensorFrames Oct 24...Tallinn Estonia Advanced Java Meetup Spark + TensorFlow = TensorFrames Oct 24...
Tallinn Estonia Advanced Java Meetup Spark + TensorFlow = TensorFrames Oct 24...
Chris Fregly
 
Ad

Similar to Hadoop + GPU (20)

HPC and cloud distributed computing, as a journey
HPC and cloud distributed computing, as a journeyHPC and cloud distributed computing, as a journey
HPC and cloud distributed computing, as a journey
Peter Clapham
 
Simplify IT: Oracle SuperCluster
Simplify IT: Oracle SuperCluster Simplify IT: Oracle SuperCluster
Simplify IT: Oracle SuperCluster
Fran Navarro
 
Introduction to Distributed Computing & Distributed Databases
Introduction to Distributed Computing & Distributed DatabasesIntroduction to Distributed Computing & Distributed Databases
Introduction to Distributed Computing & Distributed Databases
Shankar Iyer
 
OpenStack Preso: DevOps on Hybrid Infrastructure
OpenStack Preso: DevOps on Hybrid InfrastructureOpenStack Preso: DevOps on Hybrid Infrastructure
OpenStack Preso: DevOps on Hybrid Infrastructure
rhirschfeld
 
2016 - Open Mic - IGNITE - Open Infrastructure = ANY Infrastructure
2016 - Open Mic - IGNITE - Open Infrastructure = ANY Infrastructure2016 - Open Mic - IGNITE - Open Infrastructure = ANY Infrastructure
2016 - Open Mic - IGNITE - Open Infrastructure = ANY Infrastructure
devopsdaysaustin
 
AMD It's Time to ROC
AMD It's Time to ROCAMD It's Time to ROC
AMD It's Time to ROC
inside-BigData.com
 
Cloudera Analytics and Machine Learning Platform - Optimized for Cloud
Cloudera Analytics and Machine Learning Platform - Optimized for Cloud Cloudera Analytics and Machine Learning Platform - Optimized for Cloud
Cloudera Analytics and Machine Learning Platform - Optimized for Cloud
Stefan Lipp
 
How to Win When Migrating to Azure
How to Win When Migrating to AzureHow to Win When Migrating to Azure
How to Win When Migrating to Azure
Kellyn Pot'Vin-Gorman
 
Cloud comparison - AWS vs Azure vs Google
Cloud comparison - AWS vs Azure vs GoogleCloud comparison - AWS vs Azure vs Google
Cloud comparison - AWS vs Azure vs Google
Patrick Pierson
 
Oracle Cloud Infrastructure
Oracle Cloud InfrastructureOracle Cloud Infrastructure
Oracle Cloud Infrastructure
MarketingArrowECS_CZ
 
Computer Vision Powered by Heterogeneous System Architecture (HSA) by Dr. Ha...
Computer Vision Powered by Heterogeneous System Architecture (HSA) by  Dr. Ha...Computer Vision Powered by Heterogeneous System Architecture (HSA) by  Dr. Ha...
Computer Vision Powered by Heterogeneous System Architecture (HSA) by Dr. Ha...
AMD Developer Central
 
Migrating enterprise workloads to AWS
Migrating enterprise workloads to AWS Migrating enterprise workloads to AWS
Migrating enterprise workloads to AWS
Tom Laszewski
 
Sanger, upcoming Openstack for Bio-informaticians
Sanger, upcoming Openstack for Bio-informaticiansSanger, upcoming Openstack for Bio-informaticians
Sanger, upcoming Openstack for Bio-informaticians
Peter Clapham
 
Flexible compute
Flexible computeFlexible compute
Flexible compute
Peter Clapham
 
Deep Learning Frameworks Using Spark on YARN by Vartika Singh
Deep Learning Frameworks Using Spark on YARN by Vartika SinghDeep Learning Frameworks Using Spark on YARN by Vartika Singh
Deep Learning Frameworks Using Spark on YARN by Vartika Singh
Data Con LA
 
Helix core on aws webinar
Helix core on aws webinar Helix core on aws webinar
Helix core on aws webinar
Perforce
 
MySQL Fabric - High Availability & Automated Sharding for MySQL
MySQL Fabric - High Availability & Automated Sharding for MySQLMySQL Fabric - High Availability & Automated Sharding for MySQL
MySQL Fabric - High Availability & Automated Sharding for MySQL
Ted Wennmark
 
Oracle Cloud : Big Data Use Cases and Architecture
Oracle Cloud : Big Data Use Cases and ArchitectureOracle Cloud : Big Data Use Cases and Architecture
Oracle Cloud : Big Data Use Cases and Architecture
Riccardo Romani
 
High Performance Computing Pitch Deck
High Performance Computing Pitch DeckHigh Performance Computing Pitch Deck
High Performance Computing Pitch Deck
Nicholas Vossburg
 
Introduction to HPC & Supercomputing in AI
Introduction to HPC & Supercomputing in AIIntroduction to HPC & Supercomputing in AI
Introduction to HPC & Supercomputing in AI
Tyrone Systems
 
HPC and cloud distributed computing, as a journey
HPC and cloud distributed computing, as a journeyHPC and cloud distributed computing, as a journey
HPC and cloud distributed computing, as a journey
Peter Clapham
 
Simplify IT: Oracle SuperCluster
Simplify IT: Oracle SuperCluster Simplify IT: Oracle SuperCluster
Simplify IT: Oracle SuperCluster
Fran Navarro
 
Introduction to Distributed Computing & Distributed Databases
Introduction to Distributed Computing & Distributed DatabasesIntroduction to Distributed Computing & Distributed Databases
Introduction to Distributed Computing & Distributed Databases
Shankar Iyer
 
OpenStack Preso: DevOps on Hybrid Infrastructure
OpenStack Preso: DevOps on Hybrid InfrastructureOpenStack Preso: DevOps on Hybrid Infrastructure
OpenStack Preso: DevOps on Hybrid Infrastructure
rhirschfeld
 
2016 - Open Mic - IGNITE - Open Infrastructure = ANY Infrastructure
2016 - Open Mic - IGNITE - Open Infrastructure = ANY Infrastructure2016 - Open Mic - IGNITE - Open Infrastructure = ANY Infrastructure
2016 - Open Mic - IGNITE - Open Infrastructure = ANY Infrastructure
devopsdaysaustin
 
Cloudera Analytics and Machine Learning Platform - Optimized for Cloud
Cloudera Analytics and Machine Learning Platform - Optimized for Cloud Cloudera Analytics and Machine Learning Platform - Optimized for Cloud
Cloudera Analytics and Machine Learning Platform - Optimized for Cloud
Stefan Lipp
 
Cloud comparison - AWS vs Azure vs Google
Cloud comparison - AWS vs Azure vs GoogleCloud comparison - AWS vs Azure vs Google
Cloud comparison - AWS vs Azure vs Google
Patrick Pierson
 
Computer Vision Powered by Heterogeneous System Architecture (HSA) by Dr. Ha...
Computer Vision Powered by Heterogeneous System Architecture (HSA) by  Dr. Ha...Computer Vision Powered by Heterogeneous System Architecture (HSA) by  Dr. Ha...
Computer Vision Powered by Heterogeneous System Architecture (HSA) by Dr. Ha...
AMD Developer Central
 
Migrating enterprise workloads to AWS
Migrating enterprise workloads to AWS Migrating enterprise workloads to AWS
Migrating enterprise workloads to AWS
Tom Laszewski
 
Sanger, upcoming Openstack for Bio-informaticians
Sanger, upcoming Openstack for Bio-informaticiansSanger, upcoming Openstack for Bio-informaticians
Sanger, upcoming Openstack for Bio-informaticians
Peter Clapham
 
Deep Learning Frameworks Using Spark on YARN by Vartika Singh
Deep Learning Frameworks Using Spark on YARN by Vartika SinghDeep Learning Frameworks Using Spark on YARN by Vartika Singh
Deep Learning Frameworks Using Spark on YARN by Vartika Singh
Data Con LA
 
Helix core on aws webinar
Helix core on aws webinar Helix core on aws webinar
Helix core on aws webinar
Perforce
 
MySQL Fabric - High Availability & Automated Sharding for MySQL
MySQL Fabric - High Availability & Automated Sharding for MySQLMySQL Fabric - High Availability & Automated Sharding for MySQL
MySQL Fabric - High Availability & Automated Sharding for MySQL
Ted Wennmark
 
Oracle Cloud : Big Data Use Cases and Architecture
Oracle Cloud : Big Data Use Cases and ArchitectureOracle Cloud : Big Data Use Cases and Architecture
Oracle Cloud : Big Data Use Cases and Architecture
Riccardo Romani
 
High Performance Computing Pitch Deck
High Performance Computing Pitch DeckHigh Performance Computing Pitch Deck
High Performance Computing Pitch Deck
Nicholas Vossburg
 
Introduction to HPC & Supercomputing in AI
Introduction to HPC & Supercomputing in AIIntroduction to HPC & Supercomputing in AI
Introduction to HPC & Supercomputing in AI
Tyrone Systems
 
Ad

Recently uploaded (20)

Metamorphosis: Life's Transformative Journey
Metamorphosis: Life's Transformative JourneyMetamorphosis: Life's Transformative Journey
Metamorphosis: Life's Transformative Journey
Arshad Shaikh
 
Quality Contril Analysis of Containers.pdf
Quality Contril Analysis of Containers.pdfQuality Contril Analysis of Containers.pdf
Quality Contril Analysis of Containers.pdf
Dr. Bindiya Chauhan
 
SPRING FESTIVITIES - UK AND USA -
SPRING FESTIVITIES - UK AND USA            -SPRING FESTIVITIES - UK AND USA            -
SPRING FESTIVITIES - UK AND USA -
Colégio Santa Teresinha
 
Stein, Hunt, Green letter to Congress April 2025
Stein, Hunt, Green letter to Congress April 2025Stein, Hunt, Green letter to Congress April 2025
Stein, Hunt, Green letter to Congress April 2025
Mebane Rash
 
Operations Management (Dr. Abdulfatah Salem).pdf
Operations Management (Dr. Abdulfatah Salem).pdfOperations Management (Dr. Abdulfatah Salem).pdf
Operations Management (Dr. Abdulfatah Salem).pdf
Arab Academy for Science, Technology and Maritime Transport
 
Biophysics Chapter 3 Methods of Studying Macromolecules.pdf
Biophysics Chapter 3 Methods of Studying Macromolecules.pdfBiophysics Chapter 3 Methods of Studying Macromolecules.pdf
Biophysics Chapter 3 Methods of Studying Macromolecules.pdf
PKLI-Institute of Nursing and Allied Health Sciences Lahore , Pakistan.
 
How to Subscribe Newsletter From Odoo 18 Website
How to Subscribe Newsletter From Odoo 18 WebsiteHow to Subscribe Newsletter From Odoo 18 Website
How to Subscribe Newsletter From Odoo 18 Website
Celine George
 
LDMMIA Reiki Master Spring 2025 Mini Updates
LDMMIA Reiki Master Spring 2025 Mini UpdatesLDMMIA Reiki Master Spring 2025 Mini Updates
LDMMIA Reiki Master Spring 2025 Mini Updates
LDM Mia eStudios
 
To study Digestive system of insect.pptx
To study Digestive system of insect.pptxTo study Digestive system of insect.pptx
To study Digestive system of insect.pptx
Arshad Shaikh
 
New Microsoft PowerPoint Presentation.pptx
New Microsoft PowerPoint Presentation.pptxNew Microsoft PowerPoint Presentation.pptx
New Microsoft PowerPoint Presentation.pptx
milanasargsyan5
 
Political History of Pala dynasty Pala Rulers NEP.pptx
Political History of Pala dynasty Pala Rulers NEP.pptxPolitical History of Pala dynasty Pala Rulers NEP.pptx
Political History of Pala dynasty Pala Rulers NEP.pptx
Arya Mahila P. G. College, Banaras Hindu University, Varanasi, India.
 
How to Customize Your Financial Reports & Tax Reports With Odoo 17 Accounting
How to Customize Your Financial Reports & Tax Reports With Odoo 17 AccountingHow to Customize Your Financial Reports & Tax Reports With Odoo 17 Accounting
How to Customize Your Financial Reports & Tax Reports With Odoo 17 Accounting
Celine George
 
YSPH VMOC Special Report - Measles Outbreak Southwest US 4-30-2025.pptx
YSPH VMOC Special Report - Measles Outbreak  Southwest US 4-30-2025.pptxYSPH VMOC Special Report - Measles Outbreak  Southwest US 4-30-2025.pptx
YSPH VMOC Special Report - Measles Outbreak Southwest US 4-30-2025.pptx
Yale School of Public Health - The Virtual Medical Operations Center (VMOC)
 
Geography Sem II Unit 1C Correlation of Geography with other school subjects
Geography Sem II Unit 1C Correlation of Geography with other school subjectsGeography Sem II Unit 1C Correlation of Geography with other school subjects
Geography Sem II Unit 1C Correlation of Geography with other school subjects
ProfDrShaikhImran
 
World war-1(Causes & impacts at a glance) PPT by Simanchala Sarab(BABed,sem-4...
World war-1(Causes & impacts at a glance) PPT by Simanchala Sarab(BABed,sem-4...World war-1(Causes & impacts at a glance) PPT by Simanchala Sarab(BABed,sem-4...
World war-1(Causes & impacts at a glance) PPT by Simanchala Sarab(BABed,sem-4...
larencebapu132
 
P-glycoprotein pamphlet: iteration 4 of 4 final
P-glycoprotein pamphlet: iteration 4 of 4 finalP-glycoprotein pamphlet: iteration 4 of 4 final
P-glycoprotein pamphlet: iteration 4 of 4 final
bs22n2s
 
Presentation of the MIPLM subject matter expert Erdem Kaya
Presentation of the MIPLM subject matter expert Erdem KayaPresentation of the MIPLM subject matter expert Erdem Kaya
Presentation of the MIPLM subject matter expert Erdem Kaya
MIPLM
 
To study the nervous system of insect.pptx
To study the nervous system of insect.pptxTo study the nervous system of insect.pptx
To study the nervous system of insect.pptx
Arshad Shaikh
 
CBSE - Grade 8 - Science - Chemistry - Metals and Non Metals - Worksheet
CBSE - Grade 8 - Science - Chemistry - Metals and Non Metals - WorksheetCBSE - Grade 8 - Science - Chemistry - Metals and Non Metals - Worksheet
CBSE - Grade 8 - Science - Chemistry - Metals and Non Metals - Worksheet
Sritoma Majumder
 
Unit 6_Introduction_Phishing_Password Cracking.pdf
Unit 6_Introduction_Phishing_Password Cracking.pdfUnit 6_Introduction_Phishing_Password Cracking.pdf
Unit 6_Introduction_Phishing_Password Cracking.pdf
KanchanPatil34
 
Metamorphosis: Life's Transformative Journey
Metamorphosis: Life's Transformative JourneyMetamorphosis: Life's Transformative Journey
Metamorphosis: Life's Transformative Journey
Arshad Shaikh
 
Quality Contril Analysis of Containers.pdf
Quality Contril Analysis of Containers.pdfQuality Contril Analysis of Containers.pdf
Quality Contril Analysis of Containers.pdf
Dr. Bindiya Chauhan
 
Stein, Hunt, Green letter to Congress April 2025
Stein, Hunt, Green letter to Congress April 2025Stein, Hunt, Green letter to Congress April 2025
Stein, Hunt, Green letter to Congress April 2025
Mebane Rash
 
How to Subscribe Newsletter From Odoo 18 Website
How to Subscribe Newsletter From Odoo 18 WebsiteHow to Subscribe Newsletter From Odoo 18 Website
How to Subscribe Newsletter From Odoo 18 Website
Celine George
 
LDMMIA Reiki Master Spring 2025 Mini Updates
LDMMIA Reiki Master Spring 2025 Mini UpdatesLDMMIA Reiki Master Spring 2025 Mini Updates
LDMMIA Reiki Master Spring 2025 Mini Updates
LDM Mia eStudios
 
To study Digestive system of insect.pptx
To study Digestive system of insect.pptxTo study Digestive system of insect.pptx
To study Digestive system of insect.pptx
Arshad Shaikh
 
New Microsoft PowerPoint Presentation.pptx
New Microsoft PowerPoint Presentation.pptxNew Microsoft PowerPoint Presentation.pptx
New Microsoft PowerPoint Presentation.pptx
milanasargsyan5
 
How to Customize Your Financial Reports & Tax Reports With Odoo 17 Accounting
How to Customize Your Financial Reports & Tax Reports With Odoo 17 AccountingHow to Customize Your Financial Reports & Tax Reports With Odoo 17 Accounting
How to Customize Your Financial Reports & Tax Reports With Odoo 17 Accounting
Celine George
 
Geography Sem II Unit 1C Correlation of Geography with other school subjects
Geography Sem II Unit 1C Correlation of Geography with other school subjectsGeography Sem II Unit 1C Correlation of Geography with other school subjects
Geography Sem II Unit 1C Correlation of Geography with other school subjects
ProfDrShaikhImran
 
World war-1(Causes & impacts at a glance) PPT by Simanchala Sarab(BABed,sem-4...
World war-1(Causes & impacts at a glance) PPT by Simanchala Sarab(BABed,sem-4...World war-1(Causes & impacts at a glance) PPT by Simanchala Sarab(BABed,sem-4...
World war-1(Causes & impacts at a glance) PPT by Simanchala Sarab(BABed,sem-4...
larencebapu132
 
P-glycoprotein pamphlet: iteration 4 of 4 final
P-glycoprotein pamphlet: iteration 4 of 4 finalP-glycoprotein pamphlet: iteration 4 of 4 final
P-glycoprotein pamphlet: iteration 4 of 4 final
bs22n2s
 
Presentation of the MIPLM subject matter expert Erdem Kaya
Presentation of the MIPLM subject matter expert Erdem KayaPresentation of the MIPLM subject matter expert Erdem Kaya
Presentation of the MIPLM subject matter expert Erdem Kaya
MIPLM
 
To study the nervous system of insect.pptx
To study the nervous system of insect.pptxTo study the nervous system of insect.pptx
To study the nervous system of insect.pptx
Arshad Shaikh
 
CBSE - Grade 8 - Science - Chemistry - Metals and Non Metals - Worksheet
CBSE - Grade 8 - Science - Chemistry - Metals and Non Metals - WorksheetCBSE - Grade 8 - Science - Chemistry - Metals and Non Metals - Worksheet
CBSE - Grade 8 - Science - Chemistry - Metals and Non Metals - Worksheet
Sritoma Majumder
 
Unit 6_Introduction_Phishing_Password Cracking.pdf
Unit 6_Introduction_Phishing_Password Cracking.pdfUnit 6_Introduction_Phishing_Password Cracking.pdf
Unit 6_Introduction_Phishing_Password Cracking.pdf
KanchanPatil34
 

Hadoop + GPU

  • 1. © ALTOROS Systems | CONFIDENTIAL “The norm for data analytics is now to run them on commodity clusters with MapReduce-like abstractions. One only needs to read the popular blogs to see the evidence of this. We believe that we could now say that “nobody ever got fired for using Hadoop on a cluster”!
  • 2. © ALTOROS Systems | CONFIDENTIAL Breaking News IBM Keynote at JavaOne 2013: Java Flies in Blue Skies and Open Clouds Java and GPUs open up a world of new opportunities for GPU accelerators and Java programmers alike.
  • 3. © ALTOROS Systems | CONFIDENTIAL Breaking News Duimovich showed an example of GPU acceleration of sorting using standard NVIDIA CUDA libraries that are already available! The speedups are phenomenal — ranging from 2x to 48x faster!
  • 4. © ALTOROS Systems | CONFIDENTIAL Breaking News?
  • 5. © ALTOROS Systems | CONFIDENTIAL Breaking News?
  • 6. © ALTOROS Systems | CONFIDENTIAL Breaking Hadoop
  • 7. © ALTOROS Systems | CONFIDENTIAL Breaking Hadoop 10 000x faster
  • 8. © ALTOROS Systems | CONFIDENTIAL Breaking Hadoop 10 000x faster
  • 9. © ALTOROS Systems | CONFIDENTIAL Hadoop vs GPU Hadoop & GPU Hadoop + GPU HPC Big Data GPGPU in Java Heterogeneous systems Horizontal and vertical scalability
  • 10. © ALTOROS Systems | CONFIDENTIAL Hadoop horizontal scalability file01 file02 file03
  • 11. © ALTOROS Systems | CONFIDENTIAL Hadoop horizontal scalability file01 file02 file03
  • 12. © ALTOROS Systems | CONFIDENTIAL Hadoop horizontal scalability file01 file02 file03 Node 1 Node 2 Node 3 01 02 03 04 05 06 07 08 09 10 01 02 03 04 05 0607 0809 10
  • 13. © ALTOROS Systems | CONFIDENTIAL Hadoop horizontal scalability file01 file02 file03 Node 1 Node 2 Node 3 01 02 03 04 05 06 07 08 09 10 01 02 03 04 05 0607 0809 10 3 4 3
  • 14. © ALTOROS Systems | CONFIDENTIAL Hadoop horizontal scalability file01 file02 file03 Node 1 Node 2 Node 3 01 02 03 04 05 06 07 08 09 10 01 02 03 04 05 0607 0809 10 3 4 3 Node 1 Node 2 Node 3 01 02 03 04 05 06 07 08 09 10 Node 4 Node 5 Node 6 01 02 03 04 05 06 07 08 09 10
  • 15. © ALTOROS Systems | CONFIDENTIAL Hadoop horizontal scalability file01 file02 file03 Node 1 Node 2 Node 3 01 02 03 04 05 06 07 08 09 10 01 02 03 04 05 0607 0809 10 3 4 3 Node 1 Node 2 Node 3 01 02 03 04 05 06 07 08 09 10 Node 4 Node 5 Node 6 01 02 03 04 05 06 07 08 09 10 221 1 2 2
  • 16. © ALTOROS Systems | CONFIDENTIAL Hadoop horizontal scalability Node 1 Node 2 Node 3 01 02 03 04 05 06 07 08 09 10 Node 4 Node 5 Node 6 01 02 03 04 05 06 07 08 09 10 221 1 2 2
  • 17. © ALTOROS Systems | CONFIDENTIAL Hadoop horizontal scalability Node 1 Node 2 Node 3 01 02 03 04 05 06 07 08 09 10 Node 4 Node 5 Node 6 01 02 03 04 05 06 07 08 09 10 221 1 2 2
  • 18. © ALTOROS Systems | CONFIDENTIAL Use GPU to scale vertically Node 1 Node 2 Node 3 01 02 03 04 05 06 07 08 09 10 Node 4 Node 5 Node 6 01 02 03 04 05 06 07 08 09 10 221 1 2 20.5 1 1 0.5 1 1
  • 19. © ALTOROS Systems | CONFIDENTIAL Profit estimation “Debunking the 100X GPU vs. CPU myth: an evaluation of throughput computing on CPU and GPU” by Intel NVidia GTX280 vs Intel Core i7-960
  • 20. © ALTOROS Systems | CONFIDENTIAL Profit estimation “Debunking the 100X GPU vs. CPU myth: an evaluation of throughput computing on CPU and GPU” by Intel “OpenCL: the advantages of heterogeneous approach” by Intel NVidia GTX280 vs Intel Core i7-960
  • 21. © ALTOROS Systems | CONFIDENTIAL How to use OpenCL?
  • 22. © ALTOROS Systems | CONFIDENTIAL How to use OpenCL?
  • 23. © ALTOROS Systems | CONFIDENTIAL How to use OpenCL? Hadoop streaming
  • 24. © ALTOROS Systems | CONFIDENTIAL Aparapi Expands Java's “Write Once Run Anywhere” to include APU and GPU devices by expressing data parallel algorithm through extending Kernel base class. MyKernel.class
  • 25. © ALTOROS Systems | CONFIDENTIAL Aparapi Expands Java's “Write Once Run Anywhere” to include APU and GPU devices by expressing data parallel algorithm through extending Kernel base class. MyKernel.class Platform Supports OpenCL?
  • 26. © ALTOROS Systems | CONFIDENTIAL Aparapi Expands Java's “Write Once Run Anywhere” to include APU and GPU devices by expressing data parallel algorithm through extending Kernel base class. MyKernel.class Platform Supports OpenCL? Execute using Java Thread Pool
  • 27. © ALTOROS Systems | CONFIDENTIAL Aparapi Expands Java's “Write Once Run Anywhere” to include APU and GPU devices by expressing data parallel algorithm through extending Kernel base class. MyKernel.class Platform Supports OpenCL? Bytecode can be converted to OpenCL? Execute using Java Thread Pool
  • 28. © ALTOROS Systems | CONFIDENTIAL Aparapi Expands Java's “Write Once Run Anywhere” to include APU and GPU devices by expressing data parallel algorithm through extending Kernel base class. MyKernel.class Platform Supports OpenCL? Bytecode can be converted to OpenCL? Convert it Execute OpenCL Kernel on DeviceExecute using Java Thread Pool
  • 29. © ALTOROS Systems | CONFIDENTIAL Aparapi Expands Java's “Write Once Run Anywhere” to include APU and GPU devices by expressing data parallel algorithm through extending Kernel base class.
  • 30. © ALTOROS Systems | CONFIDENTIAL Aparapi Expands Java's “Write Once Run Anywhere” to include APU and GPU devices by expressing data parallel algorithm through extending Kernel base class.
  • 31. © ALTOROS Systems | CONFIDENTIAL Aparapi Expands Java's “Write Once Run Anywhere” to include APU and GPU devices by expressing data parallel algorithm through extending Kernel base class.
  • 32. © ALTOROS Systems | CONFIDENTIAL Aparapi Expands Java's “Write Once Run Anywhere” to include APU and GPU devices by expressing data parallel algorithm through extending Kernel base class. lambda
  • 33. © ALTOROS Systems | CONFIDENTIAL Aparapi Expands Java's “Write Once Run Anywhere” to include APU and GPU devices by expressing data parallel algorithm through extending Kernel base class. lambda HSA
  • 34. © ALTOROS Systems | CONFIDENTIAL Aparapi Characteristics of ideal data parallel workload
  • 35. © ALTOROS Systems | CONFIDENTIAL Aparapi Characteristics of ideal data parallel workload Code which iterates over large arrays of primitives - 32/64 bit data types preferred - where the order of iterations is not critical avoid data dependencies between iterations - each iteration contains sequential code (few branches)
  • 36. © ALTOROS Systems | CONFIDENTIAL Aparapi Characteristics of ideal data parallel workload Code which iterates over large arrays of primitives - 32/64 bit data types preferred - where the order of iterations is not critical avoid data dependencies between iterations - each iteration contains sequential code (few branches) Balance between data size (low) and compute (high) - data transfer to/from the GPU can be costly - trivial compute not worth the transfer cost - may still benefit by freeing up CPU for other work(?)
  • 37. © ALTOROS Systems | CONFIDENTIAL HadoopCL Rice University, AMD
  • 38. © ALTOROS Systems | CONFIDENTIAL HadoopCL
  • 39. © ALTOROS Systems | CONFIDENTIAL HadoopCL
  • 40. © ALTOROS Systems | CONFIDENTIAL HadoopCL
  • 41. © ALTOROS Systems | CONFIDENTIAL HadoopCL 2 six-core Intel X5660 (48 GB mem) 2 NVidia Tesla M2050 (2*2.5 GB mem) AMD A10-5800K APU (16 GB mem)
  • 42. © ALTOROS Systems | CONFIDENTIAL HadoopCL 2 six-core Intel X5660 (48 GB mem) 2 NVidia Tesla M2050 (2*2.5 GB mem) AMD A10-5800K APU (16 GB mem) WHY?
  • 43. © ALTOROS Systems | CONFIDENTIAL HadoopCL
  • 44. © ALTOROS Systems | CONFIDENTIAL Back to OpenCL, Aparapi and heterogeneous computing
  • 45. © ALTOROS Systems | CONFIDENTIAL OpenCL, Aparapi and heterogeneous computing GPU cache GPU GDDR5 CPU cache SATA 3.0 (HDD) SATA 2.0 (SSD) 1 GBit networkFormula in terms of time: (CPU calc1) + disk read + disk write > (CPU calc2 + GPU calc + GPU-write + GPU-read) + disk read + disk write
  • 46. © ALTOROS Systems | CONFIDENTIAL OpenCL future
  • 47. © ALTOROS Systems | CONFIDENTIAL OpenCL future http://streamcomputing.eu/
  • 48. © ALTOROS Systems | CONFIDENTIAL Questions? Big Data Experts FB group