SlideShare a Scribd company logo
WHAT IS ETL?
“EXTRACT … TRANSFORM … LOAD”



Eng. Ismail El Gayar
Software Engineer
WHY ETL?

 Companies need a way to analyze their data
  for critical business decisions.
 Transactional Database can’t answer
  complex business questions.
 A data warehouse provide a common data
  repository.
 ETL provide a method of moving the data
  from various source into a data warehouse.
ETL CONCEPT
 A Company data may be scattered in
  different locations and in different formats.
 ETL Allows you to:
     Migrate the data into a data warehouse.
     Convert the various formats and types to adhere
      to one consistent system.
   ETL is a predefined process for access and
    manipulate source data and loading it into a
    target database.
ETL REQUIREMENTS
   Any ETL Architecture must meet the following
    requirements:
       Business Requirement
       Compliance Requirement
       Data Profiling
       Data Security
       Data Integration
       Right Data at Right Time
       Archiving & Uneage
       Final End User Delivery Interface
       Available Skills
       Legacy License
       Alignment with overall Enterprise Architecture
THE ETL PROCESS



                                                    Load
                                                 The process of
                            Transform            writing data into
                                                    the target
                            The process of
                                                     database
                          converting data from
                          one form to another
         Extract
      The process of
    reading data from a
         database
EXTRACT

   Gathering the data
     Raw  data that was written directly into the disk
     Data written to flat files or relational tables from
      structured source systems
     Data can be read multiple times, if needed.

   Cleansing the data
     Eliminateduplicates or fragmented data
     Exclude unwanted / unneeded information
TRANSFORM

 Preparing the data to be housed in the data
  warehouse.
 Converting the extracted data
     Using  rules and lookup tables
     Combining data

     Verification/Validity checks

     Standardization
LOAD

   Storing the transformed data in the data
    warehouse.

   Batch/Real-time processing

   Can follow star schema and snowflake
    schema
ETL FLOW
ADVANTAGE OF ETL TOOL
 Simple, faster and cheaper development
 Most ETL tools provide a metadata
  repository, synchronizing metadata from
  various sources.
 Most ETL tools deliver good performance,
  even for very large dataset.
 Most ETL tools provide impact analysis tools
  for any proposed schema changes.
 Most ETL tools have built-in connectors for
  all the major RDBMS systems
ADVANTAGE OF ETL TOOL
 Most ETL tools allow reuse of the existing
  complex programs.
 Several ETL tools offers visual Development
  Environment.
 Most ETL tools offers built-in scheduler
  sequencers and documentation.
 Several ETL tools offer various performance
  optimization options such as (parallel
  processing, complex load balancing etc)
POPULAR ETL TOOLS

                 Tools                             Company
Infosphere Datastage               IBM
Informatica                        Informatica Corp
DT/Studio                          Embarcadero Technologies
Ab Inito                           Ab Inito Software Corp
Oracle Warehouse Builder           ORACLE
Microsoft SQL Server Integration   Microsoft
Transformation Manager             ETL Solutions
THANK YOU
Ad

More Related Content

What's hot (20)

Etl overview training
Etl overview trainingEtl overview training
Etl overview training
Mondy Holten
 
Data warehouse architecture
Data warehouse architectureData warehouse architecture
Data warehouse architecture
pcherukumalla
 
Introduction to Data Engineering
Introduction to Data EngineeringIntroduction to Data Engineering
Introduction to Data Engineering
Vivek Aanand Ganesan
 
Introduction to Oracle Database
Introduction to Oracle DatabaseIntroduction to Oracle Database
Introduction to Oracle Database
puja_dhar
 
Data warehousing
Data warehousingData warehousing
Data warehousing
Shruti Dalela
 
Ms sql-server
Ms sql-serverMs sql-server
Ms sql-server
Md.Mojibul Hoque
 
DATA WAREHOUSING
DATA WAREHOUSINGDATA WAREHOUSING
DATA WAREHOUSING
King Julian
 
ETL Testing Overview
ETL Testing OverviewETL Testing Overview
ETL Testing Overview
Chetan Gadodia
 
Data Engineering Basics
Data Engineering BasicsData Engineering Basics
Data Engineering Basics
Catherine Kimani
 
Oracle
OracleOracle
Oracle
JIGAR MAKHIJA
 
Oltp vs olap
Oltp vs olapOltp vs olap
Oltp vs olap
Mr. Fmhyudin
 
Etl process in data warehouse
Etl process in data warehouseEtl process in data warehouse
Etl process in data warehouse
Komal Choudhary
 
Dimensional Modelling
Dimensional ModellingDimensional Modelling
Dimensional Modelling
Prithwis Mukerjee
 
Introduction to Data Warehouse
Introduction to Data WarehouseIntroduction to Data Warehouse
Introduction to Data Warehouse
SOMASUNDARAM T
 
Data Models
Data ModelsData Models
Data Models
RituBhargava7
 
SQL(DDL & DML)
SQL(DDL & DML)SQL(DDL & DML)
SQL(DDL & DML)
Sharad Dubey
 
Introduction of ssis
Introduction of ssisIntroduction of ssis
Introduction of ssis
deepakk073
 
Introduction to SQL
Introduction to SQLIntroduction to SQL
Introduction to SQL
Ehsan Hamzei
 
Rdbms
RdbmsRdbms
Rdbms
Muhammad Adeel Rajput
 
Ppt
PptPpt
Ppt
bullsrockr666
 

Similar to What is ETL? (20)

ETL Technologies.pptx
ETL Technologies.pptxETL Technologies.pptx
ETL Technologies.pptx
Gaurav Bhatnagar
 
Building the DW - ETL
Building the DW - ETLBuilding the DW - ETL
Building the DW - ETL
ganblues
 
ETL Process
ETL ProcessETL Process
ETL Process
Rohin Rangnekar
 
ETL Tools Ankita Dubey
ETL Tools Ankita DubeyETL Tools Ankita Dubey
ETL Tools Ankita Dubey
Ankita Dubey
 
Etl with talend (data integeration)
Etl with talend (data integeration)Etl with talend (data integeration)
Etl with talend (data integeration)
pomishra
 
Chapter 4-ETL
Chapter 4-ETLChapter 4-ETL
Chapter 4-ETL
teenoooo
 
Why shift from ETL to ELT?
Why shift from ETL to ELT?Why shift from ETL to ELT?
Why shift from ETL to ELT?
HEXANIKA
 
CERN_DIS_ODI_OGG_final_oracle_golde.pptx
CERN_DIS_ODI_OGG_final_oracle_golde.pptxCERN_DIS_ODI_OGG_final_oracle_golde.pptx
CERN_DIS_ODI_OGG_final_oracle_golde.pptx
camyla81
 
oracle_soultion_oracledataintegrator_goldengate_2021
oracle_soultion_oracledataintegrator_goldengate_2021oracle_soultion_oracledataintegrator_goldengate_2021
oracle_soultion_oracledataintegrator_goldengate_2021
ssuser8ccb5a
 
Basha_ETL_Developer
Basha_ETL_DeveloperBasha_ETL_Developer
Basha_ETL_Developer
basha shaik
 
Datawarehousing & DSS
Datawarehousing & DSSDatawarehousing & DSS
Datawarehousing & DSS
Deepali Raut
 
Data junction tool
Data junction toolData junction tool
Data junction tool
Sara shall
 
Optimizing ETL Workflows With Advanced Tools.pptx
Optimizing ETL Workflows With Advanced Tools.pptxOptimizing ETL Workflows With Advanced Tools.pptx
Optimizing ETL Workflows With Advanced Tools.pptx
Innovative Routines International
 
ETL VS ELT.pdf
ETL VS ELT.pdfETL VS ELT.pdf
ETL VS ELT.pdf
BOSupport
 
Lecture13- Extract Transform Load presentation.pptx
Lecture13- Extract Transform Load presentation.pptxLecture13- Extract Transform Load presentation.pptx
Lecture13- Extract Transform Load presentation.pptx
AyeshaAsad26
 
How Ask On Data Simplifies Data Engineering with Chat Based Interactions.docx
How Ask On Data Simplifies Data Engineering with Chat Based Interactions.docxHow Ask On Data Simplifies Data Engineering with Chat Based Interactions.docx
How Ask On Data Simplifies Data Engineering with Chat Based Interactions.docx
Varsha Nayak
 
How Ask On Data Simplifies Data Engineering with Chat Based Interactions (1)....
How Ask On Data Simplifies Data Engineering with Chat Based Interactions (1)....How Ask On Data Simplifies Data Engineering with Chat Based Interactions (1)....
How Ask On Data Simplifies Data Engineering with Chat Based Interactions (1)....
Varsha Nayak
 
How Ask On Data Simplifies Data Engineering with Chat Based Interactions (1)....
How Ask On Data Simplifies Data Engineering with Chat Based Interactions (1)....How Ask On Data Simplifies Data Engineering with Chat Based Interactions (1)....
How Ask On Data Simplifies Data Engineering with Chat Based Interactions (1)....
Varsha Nayak
 
How Ask On Data Simplifies Data Engineering with Chat Based Interactions (1).pdf
How Ask On Data Simplifies Data Engineering with Chat Based Interactions (1).pdfHow Ask On Data Simplifies Data Engineering with Chat Based Interactions (1).pdf
How Ask On Data Simplifies Data Engineering with Chat Based Interactions (1).pdf
Varsha Nayak
 
How Ask On Data Simplifies Data Engineering with Chat Based Interactions.docx
How Ask On Data Simplifies Data Engineering with Chat Based Interactions.docxHow Ask On Data Simplifies Data Engineering with Chat Based Interactions.docx
How Ask On Data Simplifies Data Engineering with Chat Based Interactions.docx
Varsha Nayak
 
Building the DW - ETL
Building the DW - ETLBuilding the DW - ETL
Building the DW - ETL
ganblues
 
ETL Tools Ankita Dubey
ETL Tools Ankita DubeyETL Tools Ankita Dubey
ETL Tools Ankita Dubey
Ankita Dubey
 
Etl with talend (data integeration)
Etl with talend (data integeration)Etl with talend (data integeration)
Etl with talend (data integeration)
pomishra
 
Chapter 4-ETL
Chapter 4-ETLChapter 4-ETL
Chapter 4-ETL
teenoooo
 
Why shift from ETL to ELT?
Why shift from ETL to ELT?Why shift from ETL to ELT?
Why shift from ETL to ELT?
HEXANIKA
 
CERN_DIS_ODI_OGG_final_oracle_golde.pptx
CERN_DIS_ODI_OGG_final_oracle_golde.pptxCERN_DIS_ODI_OGG_final_oracle_golde.pptx
CERN_DIS_ODI_OGG_final_oracle_golde.pptx
camyla81
 
oracle_soultion_oracledataintegrator_goldengate_2021
oracle_soultion_oracledataintegrator_goldengate_2021oracle_soultion_oracledataintegrator_goldengate_2021
oracle_soultion_oracledataintegrator_goldengate_2021
ssuser8ccb5a
 
Basha_ETL_Developer
Basha_ETL_DeveloperBasha_ETL_Developer
Basha_ETL_Developer
basha shaik
 
Datawarehousing & DSS
Datawarehousing & DSSDatawarehousing & DSS
Datawarehousing & DSS
Deepali Raut
 
Data junction tool
Data junction toolData junction tool
Data junction tool
Sara shall
 
ETL VS ELT.pdf
ETL VS ELT.pdfETL VS ELT.pdf
ETL VS ELT.pdf
BOSupport
 
Lecture13- Extract Transform Load presentation.pptx
Lecture13- Extract Transform Load presentation.pptxLecture13- Extract Transform Load presentation.pptx
Lecture13- Extract Transform Load presentation.pptx
AyeshaAsad26
 
How Ask On Data Simplifies Data Engineering with Chat Based Interactions.docx
How Ask On Data Simplifies Data Engineering with Chat Based Interactions.docxHow Ask On Data Simplifies Data Engineering with Chat Based Interactions.docx
How Ask On Data Simplifies Data Engineering with Chat Based Interactions.docx
Varsha Nayak
 
How Ask On Data Simplifies Data Engineering with Chat Based Interactions (1)....
How Ask On Data Simplifies Data Engineering with Chat Based Interactions (1)....How Ask On Data Simplifies Data Engineering with Chat Based Interactions (1)....
How Ask On Data Simplifies Data Engineering with Chat Based Interactions (1)....
Varsha Nayak
 
How Ask On Data Simplifies Data Engineering with Chat Based Interactions (1)....
How Ask On Data Simplifies Data Engineering with Chat Based Interactions (1)....How Ask On Data Simplifies Data Engineering with Chat Based Interactions (1)....
How Ask On Data Simplifies Data Engineering with Chat Based Interactions (1)....
Varsha Nayak
 
How Ask On Data Simplifies Data Engineering with Chat Based Interactions (1).pdf
How Ask On Data Simplifies Data Engineering with Chat Based Interactions (1).pdfHow Ask On Data Simplifies Data Engineering with Chat Based Interactions (1).pdf
How Ask On Data Simplifies Data Engineering with Chat Based Interactions (1).pdf
Varsha Nayak
 
How Ask On Data Simplifies Data Engineering with Chat Based Interactions.docx
How Ask On Data Simplifies Data Engineering with Chat Based Interactions.docxHow Ask On Data Simplifies Data Engineering with Chat Based Interactions.docx
How Ask On Data Simplifies Data Engineering with Chat Based Interactions.docx
Varsha Nayak
 
Ad

More from Ismail El Gayar (7)

Neural Networks
Neural NetworksNeural Networks
Neural Networks
Ismail El Gayar
 
Why computer engineering
Why computer engineeringWhy computer engineering
Why computer engineering
Ismail El Gayar
 
Geographic Information System for Egyptian Railway System(GIS)
Geographic Information System for Egyptian Railway System(GIS)Geographic Information System for Egyptian Railway System(GIS)
Geographic Information System for Egyptian Railway System(GIS)
Ismail El Gayar
 
System science documentation
System science documentationSystem science documentation
System science documentation
Ismail El Gayar
 
Prolog & lisp
Prolog & lispProlog & lisp
Prolog & lisp
Ismail El Gayar
 
Parallel architecture &programming
Parallel architecture &programmingParallel architecture &programming
Parallel architecture &programming
Ismail El Gayar
 
Object oriented methodology & unified modeling language
Object oriented methodology & unified modeling languageObject oriented methodology & unified modeling language
Object oriented methodology & unified modeling language
Ismail El Gayar
 
Why computer engineering
Why computer engineeringWhy computer engineering
Why computer engineering
Ismail El Gayar
 
Geographic Information System for Egyptian Railway System(GIS)
Geographic Information System for Egyptian Railway System(GIS)Geographic Information System for Egyptian Railway System(GIS)
Geographic Information System for Egyptian Railway System(GIS)
Ismail El Gayar
 
System science documentation
System science documentationSystem science documentation
System science documentation
Ismail El Gayar
 
Parallel architecture &programming
Parallel architecture &programmingParallel architecture &programming
Parallel architecture &programming
Ismail El Gayar
 
Object oriented methodology & unified modeling language
Object oriented methodology & unified modeling languageObject oriented methodology & unified modeling language
Object oriented methodology & unified modeling language
Ismail El Gayar
 
Ad

Recently uploaded (20)

2025-05-Q4-2024-Investor-Presentation.pptx
2025-05-Q4-2024-Investor-Presentation.pptx2025-05-Q4-2024-Investor-Presentation.pptx
2025-05-Q4-2024-Investor-Presentation.pptx
Samuele Fogagnolo
 
What is Model Context Protocol(MCP) - The new technology for communication bw...
What is Model Context Protocol(MCP) - The new technology for communication bw...What is Model Context Protocol(MCP) - The new technology for communication bw...
What is Model Context Protocol(MCP) - The new technology for communication bw...
Vishnu Singh Chundawat
 
TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...
TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...
TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...
TrustArc
 
Big Data Analytics Quick Research Guide by Arthur Morgan
Big Data Analytics Quick Research Guide by Arthur MorganBig Data Analytics Quick Research Guide by Arthur Morgan
Big Data Analytics Quick Research Guide by Arthur Morgan
Arthur Morgan
 
Linux Professional Institute LPIC-1 Exam.pdf
Linux Professional Institute LPIC-1 Exam.pdfLinux Professional Institute LPIC-1 Exam.pdf
Linux Professional Institute LPIC-1 Exam.pdf
RHCSA Guru
 
Drupalcamp Finland – Measuring Front-end Energy Consumption
Drupalcamp Finland – Measuring Front-end Energy ConsumptionDrupalcamp Finland – Measuring Front-end Energy Consumption
Drupalcamp Finland – Measuring Front-end Energy Consumption
Exove
 
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In France
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In FranceManifest Pre-Seed Update | A Humanoid OEM Deeptech In France
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In France
chb3
 
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdfThe Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
Abi john
 
tecnologias de las primeras civilizaciones.pdf
tecnologias de las primeras civilizaciones.pdftecnologias de las primeras civilizaciones.pdf
tecnologias de las primeras civilizaciones.pdf
fjgm517
 
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
BookNet Canada
 
Quantum Computing Quick Research Guide by Arthur Morgan
Quantum Computing Quick Research Guide by Arthur MorganQuantum Computing Quick Research Guide by Arthur Morgan
Quantum Computing Quick Research Guide by Arthur Morgan
Arthur Morgan
 
Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...
Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...
Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...
Noah Loul
 
Build Your Own Copilot & Agents For Devs
Build Your Own Copilot & Agents For DevsBuild Your Own Copilot & Agents For Devs
Build Your Own Copilot & Agents For Devs
Brian McKeiver
 
Dev Dives: Automate and orchestrate your processes with UiPath Maestro
Dev Dives: Automate and orchestrate your processes with UiPath MaestroDev Dives: Automate and orchestrate your processes with UiPath Maestro
Dev Dives: Automate and orchestrate your processes with UiPath Maestro
UiPathCommunity
 
Semantic Cultivators : The Critical Future Role to Enable AI
Semantic Cultivators : The Critical Future Role to Enable AISemantic Cultivators : The Critical Future Role to Enable AI
Semantic Cultivators : The Critical Future Role to Enable AI
artmondano
 
Cyber Awareness overview for 2025 month of security
Cyber Awareness overview for 2025 month of securityCyber Awareness overview for 2025 month of security
Cyber Awareness overview for 2025 month of security
riccardosl1
 
Generative Artificial Intelligence (GenAI) in Business
Generative Artificial Intelligence (GenAI) in BusinessGenerative Artificial Intelligence (GenAI) in Business
Generative Artificial Intelligence (GenAI) in Business
Dr. Tathagat Varma
 
Into The Box Conference Keynote Day 1 (ITB2025)
Into The Box Conference Keynote Day 1 (ITB2025)Into The Box Conference Keynote Day 1 (ITB2025)
Into The Box Conference Keynote Day 1 (ITB2025)
Ortus Solutions, Corp
 
Linux Support for SMARC: How Toradex Empowers Embedded Developers
Linux Support for SMARC: How Toradex Empowers Embedded DevelopersLinux Support for SMARC: How Toradex Empowers Embedded Developers
Linux Support for SMARC: How Toradex Empowers Embedded Developers
Toradex
 
Special Meetup Edition - TDX Bengaluru Meetup #52.pptx
Special Meetup Edition - TDX Bengaluru Meetup #52.pptxSpecial Meetup Edition - TDX Bengaluru Meetup #52.pptx
Special Meetup Edition - TDX Bengaluru Meetup #52.pptx
shyamraj55
 
2025-05-Q4-2024-Investor-Presentation.pptx
2025-05-Q4-2024-Investor-Presentation.pptx2025-05-Q4-2024-Investor-Presentation.pptx
2025-05-Q4-2024-Investor-Presentation.pptx
Samuele Fogagnolo
 
What is Model Context Protocol(MCP) - The new technology for communication bw...
What is Model Context Protocol(MCP) - The new technology for communication bw...What is Model Context Protocol(MCP) - The new technology for communication bw...
What is Model Context Protocol(MCP) - The new technology for communication bw...
Vishnu Singh Chundawat
 
TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...
TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...
TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...
TrustArc
 
Big Data Analytics Quick Research Guide by Arthur Morgan
Big Data Analytics Quick Research Guide by Arthur MorganBig Data Analytics Quick Research Guide by Arthur Morgan
Big Data Analytics Quick Research Guide by Arthur Morgan
Arthur Morgan
 
Linux Professional Institute LPIC-1 Exam.pdf
Linux Professional Institute LPIC-1 Exam.pdfLinux Professional Institute LPIC-1 Exam.pdf
Linux Professional Institute LPIC-1 Exam.pdf
RHCSA Guru
 
Drupalcamp Finland – Measuring Front-end Energy Consumption
Drupalcamp Finland – Measuring Front-end Energy ConsumptionDrupalcamp Finland – Measuring Front-end Energy Consumption
Drupalcamp Finland – Measuring Front-end Energy Consumption
Exove
 
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In France
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In FranceManifest Pre-Seed Update | A Humanoid OEM Deeptech In France
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In France
chb3
 
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdfThe Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
Abi john
 
tecnologias de las primeras civilizaciones.pdf
tecnologias de las primeras civilizaciones.pdftecnologias de las primeras civilizaciones.pdf
tecnologias de las primeras civilizaciones.pdf
fjgm517
 
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
Transcript: #StandardsGoals for 2025: Standards & certification roundup - Tec...
BookNet Canada
 
Quantum Computing Quick Research Guide by Arthur Morgan
Quantum Computing Quick Research Guide by Arthur MorganQuantum Computing Quick Research Guide by Arthur Morgan
Quantum Computing Quick Research Guide by Arthur Morgan
Arthur Morgan
 
Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...
Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...
Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...
Noah Loul
 
Build Your Own Copilot & Agents For Devs
Build Your Own Copilot & Agents For DevsBuild Your Own Copilot & Agents For Devs
Build Your Own Copilot & Agents For Devs
Brian McKeiver
 
Dev Dives: Automate and orchestrate your processes with UiPath Maestro
Dev Dives: Automate and orchestrate your processes with UiPath MaestroDev Dives: Automate and orchestrate your processes with UiPath Maestro
Dev Dives: Automate and orchestrate your processes with UiPath Maestro
UiPathCommunity
 
Semantic Cultivators : The Critical Future Role to Enable AI
Semantic Cultivators : The Critical Future Role to Enable AISemantic Cultivators : The Critical Future Role to Enable AI
Semantic Cultivators : The Critical Future Role to Enable AI
artmondano
 
Cyber Awareness overview for 2025 month of security
Cyber Awareness overview for 2025 month of securityCyber Awareness overview for 2025 month of security
Cyber Awareness overview for 2025 month of security
riccardosl1
 
Generative Artificial Intelligence (GenAI) in Business
Generative Artificial Intelligence (GenAI) in BusinessGenerative Artificial Intelligence (GenAI) in Business
Generative Artificial Intelligence (GenAI) in Business
Dr. Tathagat Varma
 
Into The Box Conference Keynote Day 1 (ITB2025)
Into The Box Conference Keynote Day 1 (ITB2025)Into The Box Conference Keynote Day 1 (ITB2025)
Into The Box Conference Keynote Day 1 (ITB2025)
Ortus Solutions, Corp
 
Linux Support for SMARC: How Toradex Empowers Embedded Developers
Linux Support for SMARC: How Toradex Empowers Embedded DevelopersLinux Support for SMARC: How Toradex Empowers Embedded Developers
Linux Support for SMARC: How Toradex Empowers Embedded Developers
Toradex
 
Special Meetup Edition - TDX Bengaluru Meetup #52.pptx
Special Meetup Edition - TDX Bengaluru Meetup #52.pptxSpecial Meetup Edition - TDX Bengaluru Meetup #52.pptx
Special Meetup Edition - TDX Bengaluru Meetup #52.pptx
shyamraj55
 

What is ETL?

  • 1. WHAT IS ETL? “EXTRACT … TRANSFORM … LOAD” Eng. Ismail El Gayar Software Engineer
  • 2. WHY ETL?  Companies need a way to analyze their data for critical business decisions.  Transactional Database can’t answer complex business questions.  A data warehouse provide a common data repository.  ETL provide a method of moving the data from various source into a data warehouse.
  • 3. ETL CONCEPT  A Company data may be scattered in different locations and in different formats.  ETL Allows you to:  Migrate the data into a data warehouse.  Convert the various formats and types to adhere to one consistent system.  ETL is a predefined process for access and manipulate source data and loading it into a target database.
  • 4. ETL REQUIREMENTS  Any ETL Architecture must meet the following requirements:  Business Requirement  Compliance Requirement  Data Profiling  Data Security  Data Integration  Right Data at Right Time  Archiving & Uneage  Final End User Delivery Interface  Available Skills  Legacy License  Alignment with overall Enterprise Architecture
  • 5. THE ETL PROCESS Load The process of Transform writing data into the target The process of database converting data from one form to another Extract The process of reading data from a database
  • 6. EXTRACT  Gathering the data  Raw data that was written directly into the disk  Data written to flat files or relational tables from structured source systems  Data can be read multiple times, if needed.  Cleansing the data  Eliminateduplicates or fragmented data  Exclude unwanted / unneeded information
  • 7. TRANSFORM  Preparing the data to be housed in the data warehouse.  Converting the extracted data  Using rules and lookup tables  Combining data  Verification/Validity checks  Standardization
  • 8. LOAD  Storing the transformed data in the data warehouse.  Batch/Real-time processing  Can follow star schema and snowflake schema
  • 10. ADVANTAGE OF ETL TOOL  Simple, faster and cheaper development  Most ETL tools provide a metadata repository, synchronizing metadata from various sources.  Most ETL tools deliver good performance, even for very large dataset.  Most ETL tools provide impact analysis tools for any proposed schema changes.  Most ETL tools have built-in connectors for all the major RDBMS systems
  • 11. ADVANTAGE OF ETL TOOL  Most ETL tools allow reuse of the existing complex programs.  Several ETL tools offers visual Development Environment.  Most ETL tools offers built-in scheduler sequencers and documentation.  Several ETL tools offer various performance optimization options such as (parallel processing, complex load balancing etc)
  • 12. POPULAR ETL TOOLS Tools Company Infosphere Datastage IBM Informatica Informatica Corp DT/Studio Embarcadero Technologies Ab Inito Ab Inito Software Corp Oracle Warehouse Builder ORACLE Microsoft SQL Server Integration Microsoft Transformation Manager ETL Solutions