www.edureka.co/talend-for-big-data
ETL using Big Data Talend
View Talend For Big Data course details at www.edureka.co/talend-for-big-data
Slide 2 www.edureka.co/talend-for-big-data
Objectives
At the end of this session, you will be able to:
• Understand how ETL complements the Hadoop ecosystem
• Adapt to the ETL-Big Data industry
• Understand why Talend is used with Big Data
• Learn Big Data not in months but in minutes
• Understand the use case: banking industry
• Implement a Talend job with Hadoop
Slide 3 www.edureka.co/talend-for-big-data
ETL with Big Data
• A graphical abstraction layer on top of Hadoop applications makes life much easier in the Big Data world
• What is surprising about the current buzz, and about the questions heralding the end of ETL and even data warehousing, is the lack of pushback on and analysis of some of the outlandish comments being made
The typical assertion is that "Hadoop eliminates the need for ETL"... Seriously?
» What no one seems to question in response to these sorts of comments is the naive assumptions these statements are based on!
» Is it realistic for most companies to move all of their data into Hadoop?
Slide 4 www.edureka.co/talend-for-big-data
ETL with Big Data
[Diagram: machine data, transactional data, and business apps data feed an ETL workflow, which extracts and loads into Big Data]
Slide 5 www.edureka.co/talend-for-big-data
ETL with Big Data (Contd.)
• Is writing ETL scripts in MapReduce code still ETL? Yes
• Is ETL running faster (in a few cases, and slower in others) on Hadoop eliminating ETL? No
• Is the introduction of Hadoop changing when, where, and how ETL happens? Yes
The question isn't really whether we are eliminating ETL, but where ETL takes place and how we are changing its definition
Slide 6 www.edureka.co/talend-for-big-data
Defining ETL
• E (Extract) represents the ability to consistently and reliably extract data with high performance and minimal impact on the source system
• T (Transform) represents the ability to transform one or more data sets, in batch or in real time, into a consumable format
• L (Load) stands for loading data into a persistent or virtual data store
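The three definitions above can be sketched as a tiny pipeline. This is only an illustration under stated assumptions: the sample CSV, its column names, the city values, and the `city_totals` table are all hypothetical, and an in-memory SQLite database stands in for the "persistent or virtual data store". It is not how Talend implements a job.

```python
import csv
import io
import sqlite3

# Hypothetical source data; a real job would read from a file, API, or database.
SOURCE_CSV = """customer_id,city,amount
1,Pune,120.50
2,Mumbai,80.00
3,Pune,40.25
"""


def extract(text):
    """E: read rows from the source without modifying it."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """T: convert types and aggregate into a consumable format (total per city)."""
    totals = {}
    for row in rows:
        totals[row["city"]] = totals.get(row["city"], 0.0) + float(row["amount"])
    return sorted(totals.items())


def load(records, conn):
    """L: write the transformed records into a data store (SQLite here)."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS city_totals (city TEXT PRIMARY KEY, total REAL)"
    )
    conn.executemany("INSERT OR REPLACE INTO city_totals VALUES (?, ?)", records)
    conn.commit()


def run_etl(text, conn):
    """Run extract, transform, and load in order, then read back what was loaded."""
    load(transform(extract(text)), conn)
    return conn.execute(
        "SELECT city, total FROM city_totals ORDER BY city"
    ).fetchall()


if __name__ == "__main__":
    connection = sqlite3.connect(":memory:")
    for city, total in run_etl(SOURCE_CSV, connection):
        print(city, total)
```

Keeping each stage as a separate function mirrors the E/T/L split: any stage can be swapped (for example, loading into Hive instead of SQLite) without touching the other two.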
Slide 7 www.edureka.co/talend-for-big-data
Why ETL + Hadoop?
How is learning ETL (along with Big Data) addressing major business problems?
[Diagram: Talend Unified Platform, spanning Big Data, Data Integration, Data Quality, MDM, ESB, and BPM]
Slide 8 www.edureka.co/talend-for-big-data
One Stop Solution!!
• Improves the efficiency of big data job design with a graphical interface
• Abstracts and generates code
• Runs transforms inside Hadoop
• Native support for HDFS, Sqoop, HBase, Mahout, Pig, Hive, and MapReduce code generation
• Apache License 2.0
• Embedded in Hortonworks Data Platform
• Certified with Cloudera, MapR, and Greenplum
• An open source ecosystem
Slide 9 www.edureka.co/talend-for-big-data
Talend
Q. Why Talend?
Ans. Because the more connected the world becomes, the more quickly a business must adapt
Slide 10 www.edureka.co/talend-for-big-data
Why Talend?
• Talend is the only graphical user interface tool capable of "translating" an ETL job into a MapReduce job. A Talend ETL job is therefore executed as a MapReduce job on Hadoop and gets the big data work done in minutes
• This is a key innovation that lowers the entry barrier to Big Data technology and allows ETL job developers (beginners and advanced) to carry out data warehouse offloading to a greater extent
• With its Eclipse-based graphical workspace, Talend Open Studio for Big Data enables developers and data scientists to leverage Hadoop loading and processing technologies like HDFS, HBase, Hive, and Pig without having to write Hadoop application code
• Hadoop applications are integrated seamlessly within minutes using Talend
Slide 11 www.edureka.co/talend-for-big-data
Why Talend? (Contd.)
• By simply selecting graphical components from a palette, then arranging and configuring them, you can create Hadoop jobs
For example:
1. Load data into HDFS (Hadoop Distributed File System)
2. Use Hadoop Pig to transform data in HDFS
3. Load data into a Hadoop Hive based data warehouse
4. Perform ELT (extract, load, transform) aggregations in Hive
5. Leverage Sqoop to integrate relational databases and Hadoop
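For comparison, the five example steps above roughly correspond to the following hand-written command lines. This is a hedged sketch, not what Talend generates: the function name `build_pipeline`, all paths, the table names, the grouping column, and the JDBC URL are hypothetical, and the commands are only built as argument lists, not executed.

```python
def build_pipeline(local_file, hdfs_dir, pig_script, hive_table,
                   group_col, jdbc_url, db_table):
    """Return one command (as an argument list) per example step.

    All arguments are placeholders chosen by the caller; nothing is executed here.
    """
    hdfs_dir = hdfs_dir.rstrip("/")
    hdfs_file = hdfs_dir + "/" + local_file.split("/")[-1]
    return [
        # 1. Load data into HDFS
        ["hdfs", "dfs", "-put", local_file, hdfs_dir],
        # 2. Use Pig to transform data in HDFS (the script itself is not shown)
        ["pig", "-f", pig_script],
        # 3. Load data into a Hive table
        ["hive", "-e",
         f"LOAD DATA INPATH '{hdfs_file}' INTO TABLE {hive_table}"],
        # 4. Perform an ELT-style aggregation inside Hive
        ["hive", "-e",
         f"SELECT {group_col}, COUNT(*) FROM {hive_table} GROUP BY {group_col}"],
        # 5. Use Sqoop to bring a relational table into Hadoop
        ["sqoop", "import", "--connect", jdbc_url,
         "--table", db_table, "--target-dir", hdfs_dir + "/" + db_table],
    ]


if __name__ == "__main__":
    commands = build_pipeline(
        local_file="/tmp/weblog.csv",          # hypothetical local file
        hdfs_dir="/user/demo/weblog",          # hypothetical HDFS directory
        pig_script="transform.pig",            # hypothetical Pig script
        hive_table="weblog",                   # hypothetical Hive table
        group_col="city",                      # hypothetical column
        jdbc_url="jdbc:mysql://localhost/bank",  # hypothetical database
        db_table="customers",                  # hypothetical source table
    )
    for command in commands:
        print(" ".join(command))
```

On a machine with the Hadoop, Pig, Hive, and Sqoop clients installed, each list could be passed to `subprocess.run`; in Talend the same steps are configured as graphical components instead.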
Slide 12 www.edureka.co/talend-for-big-data
Talend Hadoop Integration
Slide 13 www.edureka.co/talend-for-big-data
Talend Hadoop Integration (Contd.)
• For Hadoop applications to be truly accessible to your organization, they need to be smoothly integrated into your overall data flows
• Talend Open Studio for Big Data is the ideal tool for integrating Hadoop applications into your broader data architecture
• Talend provides more built-in connector components than any other data integration solution available, with more than 800 connectors that make it easy to read from or write to any major file format, database, or packaged enterprise application
For example, in Talend Open Studio for Big Data, you can use drag-and-drop configurable components to create data integration flows that move data from delimited log files into Hadoop Hive, perform operations in Hive, and extract data from Hive into a MySQL database (or Oracle, Sybase, SQL Server, and so on)
Slide 14 www.edureka.co/talend-for-big-data
Talend Hadoop Integration (Contd.)
• More and more enterprises want to scale up in Hadoop/Big Data technologies using their existing pool of talent, and to reduce overspending on MapReduce programmers (a skill that is fairly new and expensive)
• Job trends for Data Scientist/Data Analysis roles are rising sharply (Talend also comes with basic BI transformations, which reduce your dependency on simple Excel dashboards and BI tools)
• Gartner is featuring Talend as the best technology in the market for Data Integration and Big Data
• Three major players in the Big Data industry, Hortonworks, Cloudera, and MapR, have already tied up with Talend for big data solutions
• People at almost any level in the industry can get started quickly, without many prerequisites
Myth: "I don't know Java programming, so how would this course help me learn and excel in Big Data?" The biggest advantage of Talend for Big Data is that there is no prerequisite for learning it. Whether or not you come with prior knowledge of Hadoop, this course has something to offer
Slide 15 www.edureka.co/talend-for-big-data
Big Data in 10 minutes
Learn Big Data not in months but in minutes!! Sounds too good? But true
[Logos: Hadoop, Hortonworks, MapR, Cloudera]
Go from zero to big data in under 10 minutes
Get big data without coding. The Talend Big Data Sandbox is a ready-to-run virtual environment that includes Talend Platform for Big Data, popular Hadoop distributions, and data examples
Slide 16 www.edureka.co/talend-for-big-data
Who can use "Talend for Big Data"!!
Slide 17 www.edureka.co/talend-for-big-data
We are just about to see the Bigger Picture
Let us quickly see what Talend can do in minutes, reducing the man-hours spent on MapReduce programming in Hadoop, shall we?
Slide 18 www.edureka.co/talend-for-big-data
Real-time Use Case: ETL + Big Data
In this section, you will be able to sense the true power of Talend + Big Data
A banking industry use case: "Addressing the challenges in growing the business with the use of Big Data". We will use customer-filled web-log data (collected by the bank) and, with the help of a Pig ETL job, answer the question "Where should the bank hold marketing campaigns for a new product launch to get more business?", in ETL-Big Data analytics style
Slide 19 www.edureka.co/talend-for-big-data
Environment Setup
Our use case setup uses the following:
» Hortonworks Sandbox 1.3
» Talend Open Studio for Big Data 5.5
» Windows 7 (64-bit OS)
» Machine: 4 GB RAM, i3 processor
Slide 20 www.edureka.co/talend-for-big-data
Use Case Design
The use-case demonstration is divided into the following steps:
» Step 1: Generate a large volume of web-log data (we generate our own sample source data to simulate real-time data)
» Step 2: Load the data from the local file system into HDFS (Hadoop) in seconds
» Step 3: Read from HDFS, process via Pig scripts, and obtain the results
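To make the shape of this design concrete, here is a small local sketch of Step 1 and of the kind of group-and-count that Step 3 performs. Everything in it is an assumption for illustration: the slides do not show the web-log schema, so the record layout (`id,city,product`), the city and product values, and the function names are hypothetical, and plain Python stands in for the Pig script. Step 2 (the HDFS load) is omitted because it needs a cluster.

```python
import random
from collections import Counter

# Hypothetical values; the real web-log fields are not given in the slides.
CITIES = ["CityA", "CityB", "CityC"]
PRODUCTS = ["loan", "card", "savings"]


def generate_weblog(n, seed=0):
    """Step 1: generate n synthetic web-log lines in the form 'id,city,product'."""
    rng = random.Random(seed)  # seeded so that runs are repeatable
    return [
        f"{i},{rng.choice(CITIES)},{rng.choice(PRODUCTS)}"
        for i in range(n)
    ]


def top_cities(lines, k=2):
    """Step 3 (local stand-in): group records by city and count them,
    then return the k cities with the most records."""
    counts = Counter(line.split(",")[1] for line in lines)
    return counts.most_common(k)


if __name__ == "__main__":
    weblog = generate_weblog(1000)
    for city, hits in top_cities(weblog):
        print(city, hits)
```

The cities with the most web-log records are the candidate locations for a campaign; in the actual demonstration this aggregation runs as a Pig job over data stored in HDFS.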
Slide 21 www.edureka.co/talend-for-big-data
Use-case Snapshot
Slide 22 www.edureka.co/talend-for-big-data
Salary Trend
Slide 23 www.edureka.co/talend-for-big-data
References
• https://www.talend.com/resource/hadoop-applications.html
• http://www.edureka.co/blog/big-data-and-etl-are-family/
Slide 24 www.edureka.co/talend-for-big-data
Course Topics
• Module 1
» Role of Open Source ETL Technologies in Big Data
• Module 2
» Talend: A Revolution in Big Data
• Module 3
» Talend: Read & Write Various Types of Source/Target Systems
• Module 4
» Talend: How to Transform your Business: Basic
• Module 5
» Talend: How to Transform your Business: Advanced 1
• Module 6
» Talend: How to Transform your Business: Advanced 2
• Module 7
» Big Data Concepts: Required for Talend for Big Data
• Module 8
» Introduction to Talend for Big Data
• Module 9
» Hive in Talend for Big Data
• Module 10
» Pig in Talend for Big Data and Project
Slide 25 www.edureka.co/talend-for-big-data
How it Works?
• LIVE Online Class
• Class Recording in LMS
• 24/7 Post Class Support
• Module Wise Quiz
• Project Work
• Verifiable Certificate
Slide 26 www.edureka.co/talend-for-big-data
Questions
Slide 27 Course Url
Introduction to DevOps | Edureka
Edureka!
ย 

ETL using Big Data Talend

  • 1. ETL using Big Data Talend. View Talend for Big Data course details at www.edureka.co/talend-for-big-data
  • 2. Objectives. At the end of this session, you will be able to: understand how ETL complements the Hadoop ecosystem; adapt to the ETL-Big Data industry; understand why Talend is used with Big Data; learn Big Data not in months but in minutes; understand the use case (banking industry); implement a Talend job with Hadoop.
  • 3. ETL with Big Data. A graphical abstraction layer on top of Hadoop applications makes life much easier in the Big Data buzz world. The surprising thing about the current buzz, and about the questions heralding the end of ETL and even data warehousing, is the lack of pushback on and analysis of some of the outlandish comments made. The typical assertion is that "Hadoop eliminates the need for ETL". Seriously? What no one seems to question in response to these sorts of comments is the naive assumptions these statements are based on. Is it realistic for most companies to move all of their data into Hadoop?
  • 4. ETL with Big Data (diagram). Machine Data, Transactional Data, Business Apps Data; ETL Workflow; Big Data; Extract and Load.
  • 5. ETL with Big Data (Contd.). Is writing ETL scripts in MapReduce code still ETL? Yes. Does ETL running faster (in a few cases, and slower in others) on Hadoop eliminate ETL? No. Is the introduction of Hadoop changing when, where, and how ETL happens? Yes. The question isn't really whether we are eliminating ETL, but where ETL takes place and how we are changing its definition.
  • 6. Defining ETL. E (Extract) represents the ability to consistently and reliably extract data with high performance and minimal impact on the source system. T (Transform) represents the ability to transform one or more data sets, in batch or real time, into a consumable format. L (Load) stands for loading data into a persistent or virtual data store.
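The E, T, and L stages defined in slide 6 can be illustrated with a minimal Python sketch. This is not how Talend implements a job; it is a hand-written stand-in that uses only the standard library. The sample data, the `payments` table, and the function names are all hypothetical, and an in-memory SQLite database stands in for the target data store.

```python
import csv
import io
import sqlite3

# Hypothetical sample source data; a real job would read from a source system.
SOURCE_CSV = """customer_id,city,amount
1,Pune,120.50
2,Mumbai,80.00
3,Pune,not_a_number
4,Delhi,42.25
"""


def extract(text):
    """E: read rows from the source without modifying it."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """T: convert types, normalise city names, drop rows with a bad amount."""
    clean = []
    for row in rows:
        try:
            amount = float(row["amount"])
        except ValueError:
            continue  # skip rows whose amount is not numeric
        clean.append((int(row["customer_id"]), row["city"].strip().upper(), amount))
    return clean


def load(records, conn):
    """L: write the transformed records into the target store."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS payments "
        "(customer_id INTEGER, city TEXT, amount REAL)"
    )
    conn.executemany("INSERT INTO payments VALUES (?, ?, ?)", records)
    conn.commit()


conn = sqlite3.connect(":memory:")  # assumption: SQLite stands in for the target
load(transform(extract(SOURCE_CSV)), conn)
loaded_count = conn.execute("SELECT COUNT(*) FROM payments").fetchone()[0]
```

Keeping the three stages as separate functions mirrors the slide's definition: each stage can be swapped (a different source, a different store) without touching the others.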
  • 7. Why ETL + Hadoop? How is learning ETL (along with Big Data) addressing major business problems? Talend Unified Platform: Big Data, Data Integration, Data Quality, MDM, ESB, BPM.
  • 8. One-stop solution. Improves the efficiency of big data job design with a graphical interface. Abstracts and generates code. Runs transforms inside Hadoop. Native support for HDFS, Sqoop, HBase, Mahout, Pig, and Hive, with MapReduce code generation. An open source ecosystem: Apache License 2.0; embedded in Hortonworks Data Platform; certified with Cloudera, MapR, and Greenplum.
  • 9. Talend. Q. Why Talend? Ans. Because the more connected the world becomes, the more quickly a business must adapt.
  • 10. Why Talend? Talend is the only graphical user interface tool capable of "translating" an ETL job into a MapReduce job. Thus, a Talend ETL job gets executed as a MapReduce job on Hadoop and gets the big data work done in minutes. This is a key innovation that helps reduce entry barriers in Big Data technology and allows ETL job developers (beginners and advanced) to carry out data warehouse offloading to a greater extent. With its Eclipse-based graphical workspace, Talend Open Studio for Big Data enables developers and data scientists to leverage Hadoop loading and processing technologies like HDFS, HBase, Hive, and Pig without having to write Hadoop application code. Hadoop applications get integrated seamlessly within minutes using Talend.
  • 11. Why Talend? (Contd.) By simply selecting graphical components from a palette, then arranging and configuring them, you can create Hadoop jobs. For example: 1. Load data into HDFS (Hadoop Distributed File System). 2. Use Hadoop Pig to transform data in HDFS. 3. Load data into a Hadoop Hive-based data warehouse. 4. Perform ELT (extract, load, transform) aggregations in Hive. 5. Leverage Sqoop to integrate relational databases and Hadoop.
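Item 4 on slide 11 mentions ELT, where raw data is loaded first and the transformation then runs inside the data store. The sketch below illustrates only that ordering. It is an assumption-laden stand-in: an in-memory SQLite database plays the role of Hive, and the rows and the table names `raw_visits` and `visits_by_city` are hypothetical.

```python
import sqlite3

# Hypothetical raw rows, loaded as-is: the "L" happens before the "T".
RAW_ROWS = [
    ("Pune", "loan"),
    ("Pune", "card"),
    ("Delhi", "loan"),
    ("Pune", "loan"),
]

conn = sqlite3.connect(":memory:")  # assumption: SQLite stands in for Hive
conn.execute("CREATE TABLE raw_visits (city TEXT, product TEXT)")
conn.executemany("INSERT INTO raw_visits VALUES (?, ?)", RAW_ROWS)

# The transform runs inside the store as SQL, over data that is already loaded.
conn.execute(
    "CREATE TABLE visits_by_city AS "
    "SELECT city, COUNT(*) AS visits FROM raw_visits GROUP BY city"
)
visits_by_city = dict(conn.execute("SELECT city, visits FROM visits_by_city"))
```

The design point is that no row leaves the store during the aggregation, which is what distinguishes ELT from the extract-transform-load ordering.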
  • 13. Talend Hadoop Integration (Contd.). For Hadoop applications to be truly accessible to your organization, they need to be smoothly integrated into your overall data flows. Talend Open Studio for Big Data is the ideal tool for integrating Hadoop applications into your broader data architecture. Talend provides more built-in connector components than any other data integration solution available, with more than 800 connectors that make it easy to read from or write to any major file format, database, or packaged enterprise application. For example, in Talend Open Studio for Big Data, you can use drag-and-drop configurable components to create data integration flows that move data from delimited log files into Hadoop Hive, perform operations in Hive, and extract data from Hive into a MySQL database (or Oracle, Sybase, SQL Server, and so on).
  • 14. Talend Hadoop Integration (Contd.). More and more enterprises want to scale up in Hadoop/Big Data technologies using their existing pool of talent, and to reduce overspending on MapReduce programmers (a skill that is fairly new and expensive). Job trends for Data Scientist and Data Analysis roles are rising sharply (Talend also comes with basic BI transformations, which reduce your dependency on simple Excel dashboards and BI tools). Gartner features Talend as the best technology on the market for Data Integration and Big Data. Three major players in the Big Data industry, Hortonworks, Cloudera, and MapR, have already tied up with Talend for big data solutions. People at almost any level in the industry can get started quickly without many prerequisites. Myth: "I don't know Java programming, so how would this course help me learn and excel in Big Data?" The biggest advantage of Talend for Big Data is that there is no prerequisite for learning it. Whether or not you come with prior knowledge of Hadoop, this course has something to offer.
  • 15. Big Data in 10 minutes. Learn Big Data not in months but in minutes! Sounds too good? But it is true. Hadoop, Hortonworks, MapR, Cloudera. Go from zero to big data in under 10 minutes. Get big data without coding. The Talend Big Data Sandbox is a ready-to-run virtual environment that includes Talend Platform for Big Data, popular Hadoop distributions, and data examples.
  • 16. Who can use "Talend for Big Data"?
  • 17. We are just about to see the bigger picture. Let us quickly see what Talend can do in minutes, reducing the man-hours spent on MapReduce programming in Hadoop, shall we?
  • 18. Real-time Use Case: ETL + Big Data. A banking industry use case: "Addressing the challenges in growing the business with the use of Big Data". We will use customer-filled web-log data (collected by the bank) and, with the help of a Pig ETL job, answer the question "Where should the bank hold marketing campaigns for a new product launch to get more business?", in ETL-Big Data analytics style. In this section, you will be able to sense the true power of Talend + Big Data.
  • 19. Environment Setup. Our use case setup uses the following: Hortonworks Sandbox 1.3; Talend Open Studio for Big Data 5.5; Windows 7 (64-bit OS); machine with 4 GB RAM and an i3 processor.
  • 20. Use Case Design. The use-case demonstration is divided into the following steps. Step 1: Generate a large volume of web-log data (we generate our own sample source data to simulate real-time data). Step 2: Load the data from the local file system to HDFS (Hadoop) in seconds. Step 3: Read from HDFS, process via Pig scripts, and obtain results.
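Steps 1 and 3 of the use case on slide 20 can be approximated locally in plain Python, as a rough sketch of the logic rather than the Talend or Pig job itself. Step 2 (the HDFS load) is omitted, and the filter, group, count, and order operations stand in for what a Pig script would do. The city and product values, the column names, and both function names are hypothetical.

```python
import csv
import io
import random
from collections import Counter

CITIES = ["Pune", "Mumbai", "Delhi", "Chennai"]  # hypothetical values
PRODUCTS = ["home_loan", "credit_card", "savings"]  # hypothetical values


def generate_weblog(n_rows, seed=0):
    """Step 1: simulate customer-filled web-log records as CSV text."""
    rng = random.Random(seed)  # seeded so that runs are repeatable
    buf = io.StringIO()
    writer = csv.writer(buf)
    writer.writerow(["visitor_id", "city", "product"])
    for visitor_id in range(n_rows):
        writer.writerow([visitor_id, rng.choice(CITIES), rng.choice(PRODUCTS)])
    return buf.getvalue()


def rank_cities(weblog_csv, product):
    """Step 3: filter by product, group by city, count, order descending."""
    counts = Counter(
        row["city"]
        for row in csv.DictReader(io.StringIO(weblog_csv))
        if row["product"] == product
    )
    return counts.most_common()  # list of (city, visits), largest first


weblog = generate_weblog(1000)
ranking = rank_cities(weblog, "home_loan")
```

The first entries of `ranking` are the cities with the most interest in the chosen product, which is the kind of answer the campaign question in the use case is after.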
  • 23. References. https://www.talend.com/resource/hadoop-applications.html ; http://www.edureka.co/blog/big-data-and-etl-are-family/
  • 24. Course Topics. Module 1: Role of Open Source ETL Technologies in Big Data. Module 2: Talend: A Revolution in Big Data. Module 3: Talend: Read & Write Various Types of Source/Target Systems. Module 4: Talend: How to Transform your Business: Basic. Module 5: Talend: How to Transform your Business: Advanced 1. Module 6: Talend: How to Transform your Business: Advanced 2. Module 7: Big Data Concepts: Required for Talend for Big Data. Module 8: Introduction to Talend for Big Data. Module 9: Hive in Talend for Big Data. Module 10: Pig in Talend for Big Data and Project.
  • 25. How it Works? LIVE Online Class; Class Recording in LMS; 24/7 Post Class Support; Module Wise Quiz; Project Work; Verifiable Certificate.