SlideShare a Scribd company logo
The process of updating the data warehouse.
ETL
Defining ETL
ETL stands for extract, transform, and load. In many
organizations, the setup of the IT department happened long time
back. So in most of the cases the way each department handles the
data is different. For example, in a retail organization you may
have different departments such as sales, marketing, logistics, etc.
Each of them is handling the customer information but way they
store that data could be quite different. The sales department
store it by name where as marketing department has it in the
number format. ETL can take all this data from different sources
and transform it into a uniform presentation, such as for storing in
a database or data warehouse.
Another challenge of the old IT infrastructure is the different
data format used by different departments. So it might
happen for HRD department somebody is using SAP where
as the sales department you have Oracle Apps. So to take a
business critical decision it is difficult for the higher
management to get data from different platforms and
consolidate them. ETL can do that job easily. It will take data
from different sources and transform it to a uniform format
and store it into DB tables. From the DB, you can generate
the required reports.
Sample ETL Tools
Teradata Warehouse Builder from Teradata
DataStage from Ascential Software
SAS System from SAS Institute
Power Mart/Power Center from Informatica
Sagent Solution from Sagent Software
Hummingbird Genio Suite from Hummingbird
Communications
ETL Testing Tools
Query Surge
Talend
SQL Server Integration Services (SSIS) 
Purpose of ETL Testing:
From the above points, clearly accuracy of the data after transformation is critical
in ETL testing because if the data is not accurate, then the business decision will be
wrong.
So following test types are commonly used in ETL testing:
No Data Losses: In this testing type, we first determine an N number of
entities in the source system. Say the total number of employee is one such
entity. If I have 2000 employees in the source system, then after ETL
transformation and data store I should still get 2000 employees in the
destination database. Only exception is when we have some business rule
applied to the transformation. An example could be in the new system the
business does not want to keep data for people who no longer works for the
organization. So in that situation the destination system will discard the record
of those employee and store number less than 2000.
 This testing is done by mainly querying the source and designation database. Depending
on the complexity of the system these SQL queries might be very complex and beyond
the skill of a tester. However, running the query and checking the result is still the
responsibility of the QA team.
Validation of Transformation rules: This is the second big validation point. In ETL
testing, you will get the requirements in terms of transformation rules. One example
could be the data format in the source was yyyy/dd/mm where as in the destination
database this is mm/dd/yy. So you need to check them by taking some sample data.
Here you apply all the good conventional test theories like boundary value analysis,
equivalent partition, etc. All the database field level and record level data integrity must
be tested here. So you need to select the test data in such a way that at the end all the
fields are touched into the destination database after transformation.
Business processes testing: Once the transformation is done these data feeds will go
to many consumer systems (sales, marketing, HRD deferment applications). You need
to ensure by testing from those systems that they can receive the data in the preformed
format. All the critical end to end business flows must be tested from the GUI.
Purpose of ETL Testing :
Data Warehousing
Data Warehousing?
A data warehouse can be considered as a storage area
where relevant data is stored irrespective of the source.
Data warehousing merges data from multiple sources into an easy
and complete form.
Virtual Data Warehousing?
A virtual data warehouse provides a collective view of the
completed data. I t can be considered as a logical data model
of the containing metadata
Active data warehousing?
An active data warehouse represents a single state of the
business. It considers the analytic perspectives of customers
and suppliers. It helps to deliver the updated data through
reports.
Data Warehousing Cont…
Snapshot in a data warehouse:
Snapshot refers to a complete visualization of data at the time of
extraction. It occupies less space and can be used to back up and
restore data quickly.
ETL process in data warehousing:
ETL stands for Extraction, transformation and loading.
Extracting data from different sources such as flat files, databases or
XML data, transforming this data depending on the application’s
needs and load this data into a data warehouse.
Difference between data mining and data warehousing:
Data mining is a method for comparing large amounts of data for the
purpose of finding patterns. It is normally used for models and forecasting.
Data warehousing is the central repository for the data of several business
systems in an enterprise. Data from various resources extracted and
organized in the data warehouse selectively for analysis and accessibility.
Data Modelling and Data Mining
Data Modeling is a technique used to define and analyze the
requirements of data that supports organization’s business
process. In simple terms, it is used for the analysis of data
objects in order to identify the relationships among these
data objects in any business.
Data Mining is a technique used to analyze datasets to derive
useful insights/information. It is mainly used in retail,
consumer goods, telecommunication and financial
organizations that have a strong consumer orientation in
order to determine the impact on sales, customer satisfaction
and profitability.
Data Warehousing Vs Business
Intelligence
Data warehousing relates to all aspects of data management
starting from the development, implementation and
operation of the data sets. It is a back up of all data relevant
to business.(  data store).
Business Intelligence is used to analyze the data from the
point of business to measure any organization’s success.
The factors like sales, profitability, marketing campaign
effectiveness, market shares and operational efficiency etc
are analyzed using Business Intelligence tools like Cognos,
Informatica etc.
OLTP Vs OLAP
OLTP = OnLine Transaction Processing.
Applications that supports and manages transactions which
involve high volumes of data are supported by OLTP
system. OLTP is based on client-server architecture and
supports transactions across networks.
OLAP = OnLine Analytical Processing.
Business data analysis and complex calculations on low
volumes of data are performed by OLAP. An insight of
data coming from various resources can be gained by a
user with the support of OLAP.
Types of ETL Testing
   1)      Constraint Testing:
In the phase of constraint testing, the test engineers identifies whether the data is mapped
from source to target or not.
The Test Engineer follows the below scenarios in ETL Testing process.
a)      NOT NULL
b)      UNIQUE
c)       Primary Key
d)      Foreign key
e)      Check
f)       Default
g)      NULL
2)      Source to Target Count Testing:
In the Source to Target data is matched or not. A Tester can check in this view whether it
is ascending order or descending order it doesn’t matter .Only count is required for Tester.
Due to lack of time a tester can follow this type of Testing.
3)      Source to Target Data Validation Testing:
In this Testing, a tester can validate the each and every point of the source to target data.
Most of the financial projects, a tester can identify the decimal factors.
Types of ETL Testing Cont..
  4)      Threshold/Data Integrated Testing:
In this Testing, the Ranges of the data, A test Engineer can usually identifies
the population calculation and share marketing and business finance analysis
(quarterly, halferly, Yearly)  
MIN MAX RANGE
4 10 6
5)      Field to Field Testing:
In the field to field testing, a test engineer can identify that how much
space is occupied in the database. The data is integrated in the table cum data
types.
  NOTE: To check the order of the columns and source column to target
column.
Types of ETL Testing Cont..
6)      Duplicate Check Testing:
In this phase of ETL Testing, a Tester can face duplicate value very frequently so,
at that time the tester follows database queries why because huge amount of
data is present in source and Target tables.
Select ENO, ENAME, SAL, COUNT (*) FROM EMP GROUP BY ENO, ENAME,
SAL HAVING COUNT (*) >1;
Note:
1)  There are no mistakes in Primary Key or no Primary Key is allotted then the
duplicates may arise.
2)  Sometimes, a developer can do mistakes while transferring the data from
source to target at that time duplicates may arise.
3)  Due to Environment Mistakes also duplicates arise (Due to improper plugins in
the tool).
7)      Error/Exception Logical Testing:
1)      Delimiter is available in Valid Tables
2)      Delimiter is not available in invalid tables(Exception Tables)
  8)      Incremental and Historical Process Testing:
In the Incremental data, the historical data is not corrupted. When the historical data is
corrupted then this is the condition where bugs raise.
Types of ETL Testing Cont..
   9)      Control Columns and Defect Values Testing:
This is introduced by IBM
   10)   Navigation Testing:
Navigation Testing is the End user point of view testing. An end user cannot
follow the friendly of the application that navigation is called as bad or poor
Navigation.
At the time of Testing, A tester can identify this type of navigation scenarios to
avoid unnecessary navigation.
11)   Initialization testing:
A combination of hardware and software installed in platform is called
the Initialization Testing
12)   Transformation Testing:
At the time of mapping from source table to target table, Transformation
is not in mapping condition, then the Test Engineer raises bugs.
Types of ETL Testing Cont..
 13)   Regression Testing:
Code modification to fix a bug or to implement a new functionality which
makes us to to find errors.
These introduced errors are called regression. Identifying for regression
effect is called regression testing.
14)   Retesting:
Re executing the failed test cases after fixing the bug.
   15)   System Integration Testing:
Integration testing: After the completion of programming process.
Developer can integrate the modules there are 3 models
a)  Top Down
b)  Bottom Up
c)  Hybrid
Test Scenarios for ETL Testing
    1. To  validate  the  data in table (emp)
    2. To validate the table structure.
    3. To validate the null values  of the table.
    4. To validate the null values of very attribute.
    5. To check the duplicate values of  the table.
    6. To check the duplicate values of each attribute of the table
    7. To check the field value or space (length of the field size)  
    8. To check the constraints (foreign ,primary key)
    9. To check the name of the employer who has not earned any commission
    10. To check the all employers who are work in dept no (Account dept,sales dept)
    11. To check the row count of each attribute.
    12. To check the row count of the table.
    13. To check the max salary from emp table.
    14. To check the min salary from emp table.
Types of ETL Bugs
1. User interface bugs/cosmetic bugs:-
 Related to GUI of application
 Navigation, spelling mistakes, font style, font size, colors, alignment.
2. BVA Related bug:-
 Minimum and maximum values
3. EP Related bug:-
 Valid and invalid type
4. Input/output bugs:-
 Valid values not accepted
 Invalid values accepted 
5. Calculation bugs:-
 Mathematical errors
 Final output is wrong 
 
Types of ETL Bugs Cont…
6. Load condition bugs:-
 Does not allows multiple users
 Does not allows customer expected load
  7. Race condition bugs:-
 System crash & hang
 System cannot run client plat forms
  8. Version control bugs:-
 No logo matching
 No version information available
 This occurs usually in regression testing
9. H/W bugs:-
 Device is not responding to the application 
  10. Source bugs:-
 Mistakes in help documents
SQL Queries for ETL Testing
1.Create a table:
CREATE TABLE customer First_Name char(50),Last_Name char(50),Address char(50), City
char(50),Country char(25),Birth_Date date);
2. Add a column to a table:
ALTER TABLE customer ADD SO_INSURANCE_PROVIDER Varchar2(35);
3. DROP a column to a table
ALTER TABLE customer DROP column SO_INSURANCE_PROVIDER Varchar2(35);
4. Add a default value to a column
ALTER TABLE customer MODIFY SO_INSURANCE_PROVIDER Varchar2(35) DEFAULT
'ABC Ins';
5. Renaming a table:
ALTER TABLE suppliers RENAME TO vendors;
6. Modifying column(s) in a table:
ALTER TABLE supplier MODIFY supplier_namevarchar2(100) not null;
7. Drop column(s) in a table:
ALTER TABLE supplier DROP COLUMN supplier_name;
8. Primary key:
CREATE TABLE supplier( supplier_id numeric(10) not null,supplier_namevarchar2(50) not
null,contact_namevarchar2(50),CONSTRAINT supplier_pk PRIMARY KEY (supplier_id,
supplier_name));
9.Add primary key:
ALTER TABLE supplier add CONSTRAINT supplier_pk PRIMARY KEY
(supplier_id);
10.Drop primary key:
ALTER TABLE supplier drop CONSTRAINT supplier_pk;
11.Disable primary key:
ALTER TABLE supplier disable CONSTRAINT supplier_pk;
12.Enable primary key:
ALTER TABLE supplier enable CONSTRAINT supplier_pk;
13.Foreign key creation:
CREATE TABLE supplier
(
supplier_id numeric(10) not null,
supplier_namevarchar2(50) not null,
contact_namevarchar2(50),
CONSTRAINT supplier_pk PRIMARY KEY (supplier_id, supplier_name)
);
CREATE TABLE products
(
Product_id numeric (10) not null,
Supplier_id numeric (10) not null,
supplier_name varchar2 (50) not null,
CONSTRAINT fk_supplier_comp
FOREIGN KEY (supplier_id, supplier_name)
REFERENCES supplier (supplier_id, supplier_name)
);
14. Alter foreign key:
ALTER TABLE products add CONSTRAINT fk_supplier FOREIGN KEY
(supplier_id) REFERENCES supplier (supplier_id);
15. Drop foreign key:
ALTER TABLE SALES_ORDER_LINE DROP FOREIGN KEY
FK_SALES_ORDER_LINE_PRODUCT
16. Check constraint:
ALTER TABLE EMPLOYEE ADD CONSTRAINT REVENUE CHECK
(SALARY + COMM > 25000)
17. Drop check constraint:
ALTER TABLE EMPLOYEE DROP CONSTRAINT REVENUE CHECK
(SALARY + COMM > 25000)
18. Drop Table:
DROP TABLE customer;
19. Truncate Statement:
Truncate table customer;
*********************************************************************End of
DDL
Statements*********************************************************************
**
****************************************************************** DML
Statements
***************************************************************************
20. Insert rows in table:
1) INSERT INTO Store_Information (store_name, Sales, Date)
VALUES ('Los Angeles', 900, 'Jan-10-1999')
2) INSERT INTO Store_Information (store_name, Sales, Date) SELECT
store_name, Sales, Date FROM Sales_Info WHERE Year
(Date) = 1998
21. Update Statement in table:
UPDATE suppliers SET name = 'HP' WHERE name = 'IBM';
UPDATE suppliers
SET supplier_name =
(SELECT customers.name
FROM customers
WHERE customers. customer_id = suppliers. supplier_id);
22. Delete Statement in table:
DELETE FROM suppliers WHERE supplier_name = 'IBM';
DELETE FROM suppliers
WHERE EXISTS
( select customers.name
from customers
wherecustomers.customer_id = suppliers.supplier_id
andcustomers.customer_name = 'IBM' );
************************** select statement
23.Select Statement in table:
1. SELECT LastName, FirstName FROM Persons;
2. SELECT * FROM Persons;
The SELECT DISTINCT Statement:
SELECT DISTINCT Company FROM Orders;
The WHERE Clause:
SELECT * FROM Persons WHERE City='Sandnes‘
Using LIKE
SELECT * FROM Persons WHERE FirstName LIKE 'O%'
Arithmetic Operation:
Operator Description
= Equal
<> Not equal
> Greater than
< Less than
>= Greater than or equal
<= Less than or equal
BETWEEN Between an inclusive range
LIKE Search for a pattern
IN If you know the exact value you
want to return for at least one of the
columns
BETWEEN ... AND:
SELECT * FROM Persons WHERE LastName BETWEEN 'Hansen' AND 'Pettersen';
IN
SELECT * FROM Persons WHERE LastName IN 'Hansen','Pettersen');
Column Name Alias
SELECT LastName AS Family, FirstName AS Name FROM Persons
AND & OR
SELECT * FROM Persons WHERE FirstName='Tove' AND
LastName='Svendson'
SELECT * FROM Persons WHERE firstname='Tove' OR
lastname='Svendson'
ORDER BY
SELECT Company, OrderNumber FROM Orders ORDER BY Company
SELECT Company, OrderNumber FROM Orders ORDER BY Company
DESC, OrderNumber ASC
Group by Clause:
SELECT Company, SUM (Amount) FROM Sales GROUP BY Company;
Having Clause:
SELECT Company, SUM (Amount) FROM Sales GROUP BY Company HAVING
SUM (Amount)>10000;
Using UNION Clause:
SELECT E_Name FROM Employees_Norway UNION SELECT E_Name FROM
Employees_USA
UNION ALL Clause:
SELECT E_Name FROM Employees_Norway
UNION ALL
SELECT E_Name FROM Employees_USA
JOINS:
Referring to Two Tables:
SELECT Employees.Name, Orders.Product FROM Employees, Orders WHERE
Employees.Employee_ID=Orders.Employee_ID
INNER JOIN:
SELECT Employees.Name, Orders.Product FROM Employees INNER JOIN Orders ON
Employees.Employee_ID=Orders.Employee_ID;
LEFT JOIN:
SELECT Employees.Name, Orders.Product FROM Employees LEFT JOIN Orders
ON Employees.Employee_ID=Orders.Employee_ID;
RIGHT JOIN:
SELECT Employees.Name, Orders.Product FROM Employees RIGHT JOIN Orders
ON Employees.Employee_ID=Orders.Employee_ID;
Subqueries:
1) Select distinct country from Northwind.dbo.Customers where country not in (select
distinct country from Northwind.dbo.Suppliers);
2) Select top 1 OrderId, convert (char (10), OrderDate, 121) Last_Paris_Order,
(Select convert (char (10), max (OrderDate), 121) from Northwind.dbo.Orders)
Last_OrderDate,datediff(dd,OrderDate, (select Max(OrderDate) from
Northwind.dbo.Orders))Day_Diff fromNorthwind.dbo.Orders whereShipCity = 'Paris'
order by OrderDatedesc;
Commit & Rollback Statements:
1) UPDATE suppliers SET name = 'HP' WHERE name = 'IBM';
Commit;
2) UPDATE suppliers SET name = 'HP' WHERE name = 'IBM';
Rollback;
SavepointStatement:
INSERT INTO DEPARTMENT VALUES ('A20', 'MARKETING', 301);
SAVEPOINT SAVEPOINT1;
INSERT INTO DEPARTMENT VALUES ('B30', 'FINANCE', 520);
SAVEPOINT SAVEPOINT2;
ETL Tool QuerySurge at a Glance
Thank You!
Ad

More Related Content

What's hot (20)

Etl testing
Etl testingEtl testing
Etl testing
Sandip Patil
 
ETL Testing Interview Questions and Answers
ETL Testing Interview Questions and AnswersETL Testing Interview Questions and Answers
ETL Testing Interview Questions and Answers
H2Kinfosys
 
ETL QA
ETL QAETL QA
ETL QA
dillip kar
 
What is ETL?
What is ETL?What is ETL?
What is ETL?
Ismail El Gayar
 
Etl testing strategies
Etl testing strategiesEtl testing strategies
Etl testing strategies
sivam_1
 
Etl - Extract Transform Load
Etl - Extract Transform LoadEtl - Extract Transform Load
Etl - Extract Transform Load
ABDUL KHALIQ
 
Dimensional Modelling
Dimensional ModellingDimensional Modelling
Dimensional Modelling
Prithwis Mukerjee
 
Etl And Data Test Guidelines For Large Applications
Etl And Data Test Guidelines For Large ApplicationsEtl And Data Test Guidelines For Large Applications
Etl And Data Test Guidelines For Large Applications
Wayne Yaddow
 
Database testing
Database testingDatabase testing
Database testing
Hrushikesh Wakhle
 
Testing data warehouse applications by Kirti Bhushan
Testing data warehouse applications by Kirti BhushanTesting data warehouse applications by Kirti Bhushan
Testing data warehouse applications by Kirti Bhushan
Kirti Bhushan
 
Etl process in data warehouse
Etl process in data warehouseEtl process in data warehouse
Etl process in data warehouse
Komal Choudhary
 
What is ETL testing & how to enforce it in Data Wharehouse
What is ETL testing & how to enforce it in Data WharehouseWhat is ETL testing & how to enforce it in Data Wharehouse
What is ETL testing & how to enforce it in Data Wharehouse
BugRaptors
 
Data Warehouse (ETL) testing process
Data Warehouse (ETL) testing processData Warehouse (ETL) testing process
Data Warehouse (ETL) testing process
Rakesh Hansalia
 
Informatica PowerCenter
Informatica PowerCenterInformatica PowerCenter
Informatica PowerCenter
Ramy Mahrous
 
Talend Data Integration Tutorial | Talend Tutorial For Beginners | Talend Onl...
Talend Data Integration Tutorial | Talend Tutorial For Beginners | Talend Onl...Talend Data Integration Tutorial | Talend Tutorial For Beginners | Talend Onl...
Talend Data Integration Tutorial | Talend Tutorial For Beginners | Talend Onl...
Edureka!
 
ETL Technologies.pptx
ETL Technologies.pptxETL Technologies.pptx
ETL Technologies.pptx
Gaurav Bhatnagar
 
An Introduction To Oracle Database
An Introduction To Oracle DatabaseAn Introduction To Oracle Database
An Introduction To Oracle Database
Meysam Javadi
 
Query Optimization in SQL Server
Query Optimization in SQL ServerQuery Optimization in SQL Server
Query Optimization in SQL Server
Rajesh Gunasundaram
 
Introduction to sql
Introduction to sqlIntroduction to sql
Introduction to sql
VARSHAKUMARI49
 
Sql Tutorials
Sql TutorialsSql Tutorials
Sql Tutorials
Priyabrat Kar
 
ETL Testing Interview Questions and Answers
ETL Testing Interview Questions and AnswersETL Testing Interview Questions and Answers
ETL Testing Interview Questions and Answers
H2Kinfosys
 
Etl testing strategies
Etl testing strategiesEtl testing strategies
Etl testing strategies
sivam_1
 
Etl - Extract Transform Load
Etl - Extract Transform LoadEtl - Extract Transform Load
Etl - Extract Transform Load
ABDUL KHALIQ
 
Etl And Data Test Guidelines For Large Applications
Etl And Data Test Guidelines For Large ApplicationsEtl And Data Test Guidelines For Large Applications
Etl And Data Test Guidelines For Large Applications
Wayne Yaddow
 
Testing data warehouse applications by Kirti Bhushan
Testing data warehouse applications by Kirti BhushanTesting data warehouse applications by Kirti Bhushan
Testing data warehouse applications by Kirti Bhushan
Kirti Bhushan
 
Etl process in data warehouse
Etl process in data warehouseEtl process in data warehouse
Etl process in data warehouse
Komal Choudhary
 
What is ETL testing & how to enforce it in Data Wharehouse
What is ETL testing & how to enforce it in Data WharehouseWhat is ETL testing & how to enforce it in Data Wharehouse
What is ETL testing & how to enforce it in Data Wharehouse
BugRaptors
 
Data Warehouse (ETL) testing process
Data Warehouse (ETL) testing processData Warehouse (ETL) testing process
Data Warehouse (ETL) testing process
Rakesh Hansalia
 
Informatica PowerCenter
Informatica PowerCenterInformatica PowerCenter
Informatica PowerCenter
Ramy Mahrous
 
Talend Data Integration Tutorial | Talend Tutorial For Beginners | Talend Onl...
Talend Data Integration Tutorial | Talend Tutorial For Beginners | Talend Onl...Talend Data Integration Tutorial | Talend Tutorial For Beginners | Talend Onl...
Talend Data Integration Tutorial | Talend Tutorial For Beginners | Talend Onl...
Edureka!
 
An Introduction To Oracle Database
An Introduction To Oracle DatabaseAn Introduction To Oracle Database
An Introduction To Oracle Database
Meysam Javadi
 
Query Optimization in SQL Server
Query Optimization in SQL ServerQuery Optimization in SQL Server
Query Optimization in SQL Server
Rajesh Gunasundaram
 

Viewers also liked (8)

SQL for ETL Testing
SQL for ETL TestingSQL for ETL Testing
SQL for ETL Testing
Garuda Trainings
 
Ibm informatica interview question answers
Ibm informatica interview question answersIbm informatica interview question answers
Ibm informatica interview question answers
Sweta Singh
 
Informatica interview questions and answers|Informatica Faqs 2014
Informatica interview questions and answers|Informatica Faqs 2014Informatica interview questions and answers|Informatica Faqs 2014
Informatica interview questions and answers|Informatica Faqs 2014
BigClasses.com
 
ETIS09 - Data Quality: Common Problems & Checks - Presentation
ETIS09 -  Data Quality: Common Problems & Checks - PresentationETIS09 -  Data Quality: Common Problems & Checks - Presentation
ETIS09 - Data Quality: Common Problems & Checks - Presentation
David Walker
 
Introduction to ETL process
Introduction to ETL process Introduction to ETL process
Introduction to ETL process
Omid Vahdaty
 
Accenture informatica interview question answers
Accenture informatica interview question answersAccenture informatica interview question answers
Accenture informatica interview question answers
Sweta Singh
 
Big Data Warehousing Meetup: Dimensional Modeling Still Matters!!!
Big Data Warehousing Meetup: Dimensional Modeling Still Matters!!!Big Data Warehousing Meetup: Dimensional Modeling Still Matters!!!
Big Data Warehousing Meetup: Dimensional Modeling Still Matters!!!
Caserta
 
What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?
RTTS
 
Ibm informatica interview question answers
Ibm informatica interview question answersIbm informatica interview question answers
Ibm informatica interview question answers
Sweta Singh
 
Informatica interview questions and answers|Informatica Faqs 2014
Informatica interview questions and answers|Informatica Faqs 2014Informatica interview questions and answers|Informatica Faqs 2014
Informatica interview questions and answers|Informatica Faqs 2014
BigClasses.com
 
ETIS09 - Data Quality: Common Problems & Checks - Presentation
ETIS09 -  Data Quality: Common Problems & Checks - PresentationETIS09 -  Data Quality: Common Problems & Checks - Presentation
ETIS09 - Data Quality: Common Problems & Checks - Presentation
David Walker
 
Introduction to ETL process
Introduction to ETL process Introduction to ETL process
Introduction to ETL process
Omid Vahdaty
 
Accenture informatica interview question answers
Accenture informatica interview question answersAccenture informatica interview question answers
Accenture informatica interview question answers
Sweta Singh
 
Big Data Warehousing Meetup: Dimensional Modeling Still Matters!!!
Big Data Warehousing Meetup: Dimensional Modeling Still Matters!!!Big Data Warehousing Meetup: Dimensional Modeling Still Matters!!!
Big Data Warehousing Meetup: Dimensional Modeling Still Matters!!!
Caserta
 
What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?
RTTS
 
Ad

Similar to ETL Testing Training Presentation (20)

Data warehouse-testing
Data warehouse-testingData warehouse-testing
Data warehouse-testing
raianup
 
Extract, Transform and Load.pptx
Extract, Transform and Load.pptxExtract, Transform and Load.pptx
Extract, Transform and Load.pptx
JesusaEspeleta
 
4_etl_testing_tutorial_till_chapter3-merged-compressed.pdf
4_etl_testing_tutorial_till_chapter3-merged-compressed.pdf4_etl_testing_tutorial_till_chapter3-merged-compressed.pdf
4_etl_testing_tutorial_till_chapter3-merged-compressed.pdf
abhaybansal43
 
Data Ware House Testing
Data Ware House TestingData Ware House Testing
Data Ware House Testing
manojpmat
 
Top 20 ETL Testing Interview Questions.pdf
Top 20 ETL Testing Interview Questions.pdfTop 20 ETL Testing Interview Questions.pdf
Top 20 ETL Testing Interview Questions.pdf
AnanthReddy38
 
ETL_Methodology.pptx
ETL_Methodology.pptxETL_Methodology.pptx
ETL_Methodology.pptx
yogeshsuryawanshi47
 
Creating a Data validation and Testing Strategy
Creating a Data validation and Testing StrategyCreating a Data validation and Testing Strategy
Creating a Data validation and Testing Strategy
RTTS
 
What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....
kzayra69
 
Datawarehousing & DSS
Datawarehousing & DSSDatawarehousing & DSS
Datawarehousing & DSS
Deepali Raut
 
Etl testing
Etl testingEtl testing
Etl testing
Krishna Prasad
 
A Comparitive Study Of ETL Tools
A Comparitive Study Of ETL ToolsA Comparitive Study Of ETL Tools
A Comparitive Study Of ETL Tools
Rhonda Cetnar
 
IRJET- Testing Improvement in Business Intelligence Area
IRJET- Testing Improvement in Business Intelligence AreaIRJET- Testing Improvement in Business Intelligence Area
IRJET- Testing Improvement in Business Intelligence Area
IRJET Journal
 
Data Management
Data ManagementData Management
Data Management
Mufaddal Nullwala
 
Data quality and bi
Data quality and biData quality and bi
Data quality and bi
jeffd00
 
What are the benefits of learning ETL Development and where to start learning...
What are the benefits of learning ETL Development and where to start learning...What are the benefits of learning ETL Development and where to start learning...
What are the benefits of learning ETL Development and where to start learning...
kzayra69
 
Data warehouse
Data warehouseData warehouse
Data warehouse
_123_
 
What are the characteristics and objectives of ETL testing_.docx
What are the characteristics and objectives of ETL testing_.docxWhat are the characteristics and objectives of ETL testing_.docx
What are the characteristics and objectives of ETL testing_.docx
Technogeeks
 
Lecture13- Extract Transform Load presentation.pptx
Lecture13- Extract Transform Load presentation.pptxLecture13- Extract Transform Load presentation.pptx
Lecture13- Extract Transform Load presentation.pptx
AyeshaAsad26
 
ETL VS ELT.pdf
ETL VS ELT.pdfETL VS ELT.pdf
ETL VS ELT.pdf
BOSupport
 
definign etl process extract transform load.ppt
definign etl process extract transform load.pptdefinign etl process extract transform load.ppt
definign etl process extract transform load.ppt
smritiibansal
 
Data warehouse-testing
Data warehouse-testingData warehouse-testing
Data warehouse-testing
raianup
 
Extract, Transform and Load.pptx
Extract, Transform and Load.pptxExtract, Transform and Load.pptx
Extract, Transform and Load.pptx
JesusaEspeleta
 
4_etl_testing_tutorial_till_chapter3-merged-compressed.pdf
4_etl_testing_tutorial_till_chapter3-merged-compressed.pdf4_etl_testing_tutorial_till_chapter3-merged-compressed.pdf
4_etl_testing_tutorial_till_chapter3-merged-compressed.pdf
abhaybansal43
 
Data Ware House Testing
Data Ware House TestingData Ware House Testing
Data Ware House Testing
manojpmat
 
Top 20 ETL Testing Interview Questions.pdf
Top 20 ETL Testing Interview Questions.pdfTop 20 ETL Testing Interview Questions.pdf
Top 20 ETL Testing Interview Questions.pdf
AnanthReddy38
 
Creating a Data validation and Testing Strategy
Creating a Data validation and Testing StrategyCreating a Data validation and Testing Strategy
Creating a Data validation and Testing Strategy
RTTS
 
What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....What are the key points to focus on before starting to learn ETL Development....
What are the key points to focus on before starting to learn ETL Development....
kzayra69
 
Datawarehousing & DSS
Datawarehousing & DSSDatawarehousing & DSS
Datawarehousing & DSS
Deepali Raut
 
A Comparitive Study Of ETL Tools
A Comparitive Study Of ETL ToolsA Comparitive Study Of ETL Tools
A Comparitive Study Of ETL Tools
Rhonda Cetnar
 
IRJET- Testing Improvement in Business Intelligence Area
IRJET- Testing Improvement in Business Intelligence AreaIRJET- Testing Improvement in Business Intelligence Area
IRJET- Testing Improvement in Business Intelligence Area
IRJET Journal
 
Data quality and bi
Data quality and biData quality and bi
Data quality and bi
jeffd00
 
What are the benefits of learning ETL Development and where to start learning...
What are the benefits of learning ETL Development and where to start learning...What are the benefits of learning ETL Development and where to start learning...
What are the benefits of learning ETL Development and where to start learning...
kzayra69
 
Data warehouse
Data warehouseData warehouse
Data warehouse
_123_
 
What are the characteristics and objectives of ETL testing_.docx
What are the characteristics and objectives of ETL testing_.docxWhat are the characteristics and objectives of ETL testing_.docx
What are the characteristics and objectives of ETL testing_.docx
Technogeeks
 
Lecture13- Extract Transform Load presentation.pptx
Lecture13- Extract Transform Load presentation.pptxLecture13- Extract Transform Load presentation.pptx
Lecture13- Extract Transform Load presentation.pptx
AyeshaAsad26
 
ETL VS ELT.pdf
ETL VS ELT.pdfETL VS ELT.pdf
ETL VS ELT.pdf
BOSupport
 
definign etl process extract transform load.ppt
definign etl process extract transform load.pptdefinign etl process extract transform load.ppt
definign etl process extract transform load.ppt
smritiibansal
 
Ad

Recently uploaded (20)

Why Orangescrum Is a Game Changer for Construction Companies in 2025
Why Orangescrum Is a Game Changer for Construction Companies in 2025Why Orangescrum Is a Game Changer for Construction Companies in 2025
Why Orangescrum Is a Game Changer for Construction Companies in 2025
Orangescrum
 
How Valletta helped healthcare SaaS to transform QA and compliance to grow wi...
How Valletta helped healthcare SaaS to transform QA and compliance to grow wi...How Valletta helped healthcare SaaS to transform QA and compliance to grow wi...
How Valletta helped healthcare SaaS to transform QA and compliance to grow wi...
Egor Kaleynik
 
Adobe Master Collection CC Crack Advance Version 2025
Adobe Master Collection CC Crack Advance Version 2025Adobe Master Collection CC Crack Advance Version 2025
Adobe Master Collection CC Crack Advance Version 2025
kashifyounis067
 
Douwan Crack 2025 new verson+ License code
Douwan Crack 2025 new verson+ License codeDouwan Crack 2025 new verson+ License code
Douwan Crack 2025 new verson+ License code
aneelaramzan63
 
Societal challenges of AI: biases, multilinguism and sustainability
Societal challenges of AI: biases, multilinguism and sustainabilitySocietal challenges of AI: biases, multilinguism and sustainability
Societal challenges of AI: biases, multilinguism and sustainability
Jordi Cabot
 
PDF Reader Pro Crack Latest Version FREE Download 2025
PDF Reader Pro Crack Latest Version FREE Download 2025PDF Reader Pro Crack Latest Version FREE Download 2025
PDF Reader Pro Crack Latest Version FREE Download 2025
mu394968
 
Maxon CINEMA 4D 2025 Crack FREE Download LINK
Maxon CINEMA 4D 2025 Crack FREE Download LINKMaxon CINEMA 4D 2025 Crack FREE Download LINK
Maxon CINEMA 4D 2025 Crack FREE Download LINK
younisnoman75
 
Adobe Illustrator Crack FREE Download 2025 Latest Version
Adobe Illustrator Crack FREE Download 2025 Latest VersionAdobe Illustrator Crack FREE Download 2025 Latest Version
Adobe Illustrator Crack FREE Download 2025 Latest Version
kashifyounis067
 
Automation Techniques in RPA - UiPath Certificate
Automation Techniques in RPA - UiPath CertificateAutomation Techniques in RPA - UiPath Certificate
Automation Techniques in RPA - UiPath Certificate
VICTOR MAESTRE RAMIREZ
 
Get & Download Wondershare Filmora Crack Latest [2025]
Get & Download Wondershare Filmora Crack Latest [2025]Get & Download Wondershare Filmora Crack Latest [2025]
Get & Download Wondershare Filmora Crack Latest [2025]
saniaaftab72555
 
Revolutionizing Residential Wi-Fi PPT.pptx
Revolutionizing Residential Wi-Fi PPT.pptxRevolutionizing Residential Wi-Fi PPT.pptx
Revolutionizing Residential Wi-Fi PPT.pptx
nidhisingh691197
 
Top 10 Client Portal Software Solutions for 2025.docx
Top 10 Client Portal Software Solutions for 2025.docxTop 10 Client Portal Software Solutions for 2025.docx
Top 10 Client Portal Software Solutions for 2025.docx
Portli
 
EASEUS Partition Master Crack + License Code
EASEUS Partition Master Crack + License CodeEASEUS Partition Master Crack + License Code
EASEUS Partition Master Crack + License Code
aneelaramzan63
 
Salesforce Data Cloud- Hyperscale data platform, built for Salesforce.
Salesforce Data Cloud- Hyperscale data platform, built for Salesforce.Salesforce Data Cloud- Hyperscale data platform, built for Salesforce.
Salesforce Data Cloud- Hyperscale data platform, built for Salesforce.
Dele Amefo
 
LEARN SEO AND INCREASE YOUR KNOWLDGE IN SOFTWARE INDUSTRY
LEARN SEO AND INCREASE YOUR KNOWLDGE IN SOFTWARE INDUSTRYLEARN SEO AND INCREASE YOUR KNOWLDGE IN SOFTWARE INDUSTRY
LEARN SEO AND INCREASE YOUR KNOWLDGE IN SOFTWARE INDUSTRY
NidaFarooq10
 
How to Optimize Your AWS Environment for Improved Cloud Performance
How to Optimize Your AWS Environment for Improved Cloud PerformanceHow to Optimize Your AWS Environment for Improved Cloud Performance
How to Optimize Your AWS Environment for Improved Cloud Performance
ThousandEyes
 
Not So Common Memory Leaks in Java Webinar
Not So Common Memory Leaks in Java WebinarNot So Common Memory Leaks in Java Webinar
Not So Common Memory Leaks in Java Webinar
Tier1 app
 
Adobe After Effects Crack FREE FRESH version 2025
Adobe After Effects Crack FREE FRESH version 2025Adobe After Effects Crack FREE FRESH version 2025
Adobe After Effects Crack FREE FRESH version 2025
kashifyounis067
 
Expand your AI adoption with AgentExchange
Expand your AI adoption with AgentExchangeExpand your AI adoption with AgentExchange
Expand your AI adoption with AgentExchange
Fexle Services Pvt. Ltd.
 
Download Wondershare Filmora Crack [2025] With Latest
Download Wondershare Filmora Crack [2025] With LatestDownload Wondershare Filmora Crack [2025] With Latest
Download Wondershare Filmora Crack [2025] With Latest
tahirabibi60507
 
Why Orangescrum Is a Game Changer for Construction Companies in 2025
Why Orangescrum Is a Game Changer for Construction Companies in 2025Why Orangescrum Is a Game Changer for Construction Companies in 2025
Why Orangescrum Is a Game Changer for Construction Companies in 2025
Orangescrum
 
How Valletta helped healthcare SaaS to transform QA and compliance to grow wi...
How Valletta helped healthcare SaaS to transform QA and compliance to grow wi...How Valletta helped healthcare SaaS to transform QA and compliance to grow wi...
How Valletta helped healthcare SaaS to transform QA and compliance to grow wi...
Egor Kaleynik
 
Adobe Master Collection CC Crack Advance Version 2025
Adobe Master Collection CC Crack Advance Version 2025Adobe Master Collection CC Crack Advance Version 2025
Adobe Master Collection CC Crack Advance Version 2025
kashifyounis067
 
Douwan Crack 2025 new verson+ License code
Douwan Crack 2025 new verson+ License codeDouwan Crack 2025 new verson+ License code
Douwan Crack 2025 new verson+ License code
aneelaramzan63
 
Societal challenges of AI: biases, multilinguism and sustainability
Societal challenges of AI: biases, multilinguism and sustainabilitySocietal challenges of AI: biases, multilinguism and sustainability
Societal challenges of AI: biases, multilinguism and sustainability
Jordi Cabot
 
PDF Reader Pro Crack Latest Version FREE Download 2025
PDF Reader Pro Crack Latest Version FREE Download 2025PDF Reader Pro Crack Latest Version FREE Download 2025
PDF Reader Pro Crack Latest Version FREE Download 2025
mu394968
 
Maxon CINEMA 4D 2025 Crack FREE Download LINK
Maxon CINEMA 4D 2025 Crack FREE Download LINKMaxon CINEMA 4D 2025 Crack FREE Download LINK
Maxon CINEMA 4D 2025 Crack FREE Download LINK
younisnoman75
 
Adobe Illustrator Crack FREE Download 2025 Latest Version
Adobe Illustrator Crack FREE Download 2025 Latest VersionAdobe Illustrator Crack FREE Download 2025 Latest Version
Adobe Illustrator Crack FREE Download 2025 Latest Version
kashifyounis067
 
Automation Techniques in RPA - UiPath Certificate
Automation Techniques in RPA - UiPath CertificateAutomation Techniques in RPA - UiPath Certificate
Automation Techniques in RPA - UiPath Certificate
VICTOR MAESTRE RAMIREZ
 
Get & Download Wondershare Filmora Crack Latest [2025]
Get & Download Wondershare Filmora Crack Latest [2025]Get & Download Wondershare Filmora Crack Latest [2025]
Get & Download Wondershare Filmora Crack Latest [2025]
saniaaftab72555
 
Revolutionizing Residential Wi-Fi PPT.pptx
Revolutionizing Residential Wi-Fi PPT.pptxRevolutionizing Residential Wi-Fi PPT.pptx
Revolutionizing Residential Wi-Fi PPT.pptx
nidhisingh691197
 
Top 10 Client Portal Software Solutions for 2025.docx
Top 10 Client Portal Software Solutions for 2025.docxTop 10 Client Portal Software Solutions for 2025.docx
Top 10 Client Portal Software Solutions for 2025.docx
Portli
 
EASEUS Partition Master Crack + License Code
EASEUS Partition Master Crack + License CodeEASEUS Partition Master Crack + License Code
EASEUS Partition Master Crack + License Code
aneelaramzan63
 
Salesforce Data Cloud- Hyperscale data platform, built for Salesforce.
Salesforce Data Cloud- Hyperscale data platform, built for Salesforce.Salesforce Data Cloud- Hyperscale data platform, built for Salesforce.
Salesforce Data Cloud- Hyperscale data platform, built for Salesforce.
Dele Amefo
 
LEARN SEO AND INCREASE YOUR KNOWLDGE IN SOFTWARE INDUSTRY
LEARN SEO AND INCREASE YOUR KNOWLDGE IN SOFTWARE INDUSTRYLEARN SEO AND INCREASE YOUR KNOWLDGE IN SOFTWARE INDUSTRY
LEARN SEO AND INCREASE YOUR KNOWLDGE IN SOFTWARE INDUSTRY
NidaFarooq10
 
How to Optimize Your AWS Environment for Improved Cloud Performance
How to Optimize Your AWS Environment for Improved Cloud PerformanceHow to Optimize Your AWS Environment for Improved Cloud Performance
How to Optimize Your AWS Environment for Improved Cloud Performance
ThousandEyes
 
Not So Common Memory Leaks in Java Webinar
Not So Common Memory Leaks in Java WebinarNot So Common Memory Leaks in Java Webinar
Not So Common Memory Leaks in Java Webinar
Tier1 app
 
Adobe After Effects Crack FREE FRESH version 2025
Adobe After Effects Crack FREE FRESH version 2025Adobe After Effects Crack FREE FRESH version 2025
Adobe After Effects Crack FREE FRESH version 2025
kashifyounis067
 
Expand your AI adoption with AgentExchange
Expand your AI adoption with AgentExchangeExpand your AI adoption with AgentExchange
Expand your AI adoption with AgentExchange
Fexle Services Pvt. Ltd.
 
Download Wondershare Filmora Crack [2025] With Latest
Download Wondershare Filmora Crack [2025] With LatestDownload Wondershare Filmora Crack [2025] With Latest
Download Wondershare Filmora Crack [2025] With Latest
tahirabibi60507
 

ETL Testing Training Presentation

  • 1. The process of updating the data warehouse. ETL
  • 2. Defining ETL ETL stands for extract, transform, and load. In many organizations, the setup of the IT department happened long time back. So in most of the cases the way each department handles the data is different. For example, in a retail organization you may have different departments such as sales, marketing, logistics, etc. Each of them is handling the customer information but way they store that data could be quite different. The sales department store it by name where as marketing department has it in the number format. ETL can take all this data from different sources and transform it into a uniform presentation, such as for storing in a database or data warehouse.
  • 3. Another challenge of the old IT infrastructure is the different data format used by different departments. So it might happen for HRD department somebody is using SAP where as the sales department you have Oracle Apps. So to take a business critical decision it is difficult for the higher management to get data from different platforms and consolidate them. ETL can do that job easily. It will take data from different sources and transform it to a uniform format and store it into DB tables. From the DB, you can generate the required reports.
  • 4. Sample ETL Tools Teradata Warehouse Builder from Teradata DataStage from Ascential Software SAS System from SAS Institute Power Mart/Power Center from Informatica Sagent Solution from Sagent Software Hummingbird Genio Suite from Hummingbird Communications
  • 5. ETL Testing Tools Query Surge Talend SQL Server Integration Services (SSIS) 
  • 6. Purpose of ETL Testing: From the above points, clearly accuracy of the data after transformation is critical in ETL testing because if the data is not accurate, then the business decision will be wrong. So following test types are commonly used in ETL testing: No Data Losses: In this testing type, we first determine an N number of entities in the source system. Say the total number of employee is one such entity. If I have 2000 employees in the source system, then after ETL transformation and data store I should still get 2000 employees in the destination database. Only exception is when we have some business rule applied to the transformation. An example could be in the new system the business does not want to keep data for people who no longer works for the organization. So in that situation the destination system will discard the record of those employee and store number less than 2000.
  • 7.  This testing is done by mainly querying the source and designation database. Depending on the complexity of the system these SQL queries might be very complex and beyond the skill of a tester. However, running the query and checking the result is still the responsibility of the QA team. Validation of Transformation rules: This is the second big validation point. In ETL testing, you will get the requirements in terms of transformation rules. One example could be the data format in the source was yyyy/dd/mm where as in the destination database this is mm/dd/yy. So you need to check them by taking some sample data. Here you apply all the good conventional test theories like boundary value analysis, equivalent partition, etc. All the database field level and record level data integrity must be tested here. So you need to select the test data in such a way that at the end all the fields are touched into the destination database after transformation. Business processes testing: Once the transformation is done these data feeds will go to many consumer systems (sales, marketing, HRD deferment applications). You need to ensure by testing from those systems that they can receive the data in the preformed format. All the critical end to end business flows must be tested from the GUI. Purpose of ETL Testing :
  • 8. Data Warehousing Data Warehousing? A data warehouse can be considered as a storage area where relevant data is stored irrespective of the source. Data warehousing merges data from multiple sources into an easy and complete form. Virtual Data Warehousing? A virtual data warehouse provides a collective view of the completed data. I t can be considered as a logical data model of the containing metadata Active data warehousing? An active data warehouse represents a single state of the business. It considers the analytic perspectives of customers and suppliers. It helps to deliver the updated data through reports.
  • 9. Data Warehousing Cont… Snapshot in a data warehouse: Snapshot refers to a complete visualization of data at the time of extraction. It occupies less space and can be used to back up and restore data quickly. ETL process in data warehousing: ETL stands for Extraction, transformation and loading. Extracting data from different sources such as flat files, databases or XML data, transforming this data depending on the application’s needs and load this data into a data warehouse. Difference between data mining and data warehousing: Data mining is a method for comparing large amounts of data for the purpose of finding patterns. It is normally used for models and forecasting. Data warehousing is the central repository for the data of several business systems in an enterprise. Data from various resources extracted and organized in the data warehouse selectively for analysis and accessibility.
  • 10. Data Modelling and Data Mining Data Modeling is a technique used to define and analyze the requirements of data that supports organization’s business process. In simple terms, it is used for the analysis of data objects in order to identify the relationships among these data objects in any business. Data Mining is a technique used to analyze datasets to derive useful insights/information. It is mainly used in retail, consumer goods, telecommunication and financial organizations that have a strong consumer orientation in order to determine the impact on sales, customer satisfaction and profitability.
  • 11. Data Warehousing Vs Business Intelligence Data warehousing relates to all aspects of data management starting from the development, implementation and operation of the data sets. It is a back up of all data relevant to business.(  data store). Business Intelligence is used to analyze the data from the point of business to measure any organization’s success. The factors like sales, profitability, marketing campaign effectiveness, market shares and operational efficiency etc are analyzed using Business Intelligence tools like Cognos, Informatica etc.
  • 12. OLTP Vs OLAP OLTP = OnLine Transaction Processing. Applications that supports and manages transactions which involve high volumes of data are supported by OLTP system. OLTP is based on client-server architecture and supports transactions across networks. OLAP = OnLine Analytical Processing. Business data analysis and complex calculations on low volumes of data are performed by OLAP. An insight of data coming from various resources can be gained by a user with the support of OLAP.
  • 13. Types of ETL Testing    1)      Constraint Testing: In the phase of constraint testing, the test engineers identifies whether the data is mapped from source to target or not. The Test Engineer follows the below scenarios in ETL Testing process. a)      NOT NULL b)      UNIQUE c)       Primary Key d)      Foreign key e)      Check f)       Default g)      NULL 2)      Source to Target Count Testing: In the Source to Target data is matched or not. A Tester can check in this view whether it is ascending order or descending order it doesn’t matter .Only count is required for Tester. Due to lack of time a tester can follow this type of Testing. 3)      Source to Target Data Validation Testing: In this Testing, a tester can validate the each and every point of the source to target data. Most of the financial projects, a tester can identify the decimal factors.
  • 14. Types of ETL Testing Cont..   4)      Threshold/Data Integrated Testing: In this Testing, the Ranges of the data, A test Engineer can usually identifies the population calculation and share marketing and business finance analysis (quarterly, halferly, Yearly)   MIN MAX RANGE 4 10 6 5)      Field to Field Testing: In the field to field testing, a test engineer can identify that how much space is occupied in the database. The data is integrated in the table cum data types.   NOTE: To check the order of the columns and source column to target column.
  • 15. Types of ETL Testing Cont.. 6)      Duplicate Check Testing: In this phase of ETL Testing, a Tester can face duplicate value very frequently so, at that time the tester follows database queries why because huge amount of data is present in source and Target tables. Select ENO, ENAME, SAL, COUNT (*) FROM EMP GROUP BY ENO, ENAME, SAL HAVING COUNT (*) >1; Note: 1)  There are no mistakes in Primary Key or no Primary Key is allotted then the duplicates may arise. 2)  Sometimes, a developer can do mistakes while transferring the data from source to target at that time duplicates may arise. 3)  Due to Environment Mistakes also duplicates arise (Due to improper plugins in the tool). 7)      Error/Exception Logical Testing: 1)      Delimiter is available in Valid Tables 2)      Delimiter is not available in invalid tables(Exception Tables)   8)      Incremental and Historical Process Testing: In the Incremental data, the historical data is not corrupted. When the historical data is corrupted then this is the condition where bugs raise.
  • 16. Types of ETL Testing Cont..    9)      Control Columns and Defect Values Testing: This is introduced by IBM    10)   Navigation Testing: Navigation Testing is the End user point of view testing. An end user cannot follow the friendly of the application that navigation is called as bad or poor Navigation. At the time of Testing, A tester can identify this type of navigation scenarios to avoid unnecessary navigation. 11)   Initialization testing: A combination of hardware and software installed in platform is called the Initialization Testing 12)   Transformation Testing: At the time of mapping from source table to target table, Transformation is not in mapping condition, then the Test Engineer raises bugs.
  • 17. Types of ETL Testing Cont..  13)   Regression Testing: Code modification to fix a bug or to implement a new functionality which makes us to to find errors. These introduced errors are called regression. Identifying for regression effect is called regression testing. 14)   Retesting: Re executing the failed test cases after fixing the bug.    15)   System Integration Testing: Integration testing: After the completion of programming process. Developer can integrate the modules there are 3 models a)  Top Down b)  Bottom Up c)  Hybrid
  • 18. Test Scenarios for ETL Testing     1. To  validate  the  data in table (emp)     2. To validate the table structure.     3. To validate the null values  of the table.     4. To validate the null values of very attribute.     5. To check the duplicate values of  the table.     6. To check the duplicate values of each attribute of the table     7. To check the field value or space (length of the field size)       8. To check the constraints (foreign ,primary key)     9. To check the name of the employer who has not earned any commission     10. To check the all employers who are work in dept no (Account dept,sales dept)     11. To check the row count of each attribute.     12. To check the row count of the table.     13. To check the max salary from emp table.     14. To check the min salary from emp table.
  • 19. Types of ETL Bugs 1. User interface bugs/cosmetic bugs:-  Related to GUI of application  Navigation, spelling mistakes, font style, font size, colors, alignment. 2. BVA Related bug:-  Minimum and maximum values 3. EP Related bug:-  Valid and invalid type 4. Input/output bugs:-  Valid values not accepted  Invalid values accepted  5. Calculation bugs:-  Mathematical errors  Final output is wrong   
  • 20. Types of ETL Bugs Cont… 6. Load condition bugs:-  Does not allows multiple users  Does not allows customer expected load   7. Race condition bugs:-  System crash & hang  System cannot run client plat forms   8. Version control bugs:-  No logo matching  No version information available  This occurs usually in regression testing 9. H/W bugs:-  Device is not responding to the application    10. Source bugs:-  Mistakes in help documents
  • 21. SQL Queries for ETL Testing
  • 22. 1.Create a table: CREATE TABLE customer First_Name char(50),Last_Name char(50),Address char(50), City char(50),Country char(25),Birth_Date date); 2. Add a column to a table: ALTER TABLE customer ADD SO_INSURANCE_PROVIDER Varchar2(35); 3. DROP a column to a table ALTER TABLE customer DROP column SO_INSURANCE_PROVIDER Varchar2(35); 4. Add a default value to a column ALTER TABLE customer MODIFY SO_INSURANCE_PROVIDER Varchar2(35) DEFAULT 'ABC Ins'; 5. Renaming a table: ALTER TABLE suppliers RENAME TO vendors;
  • 23. 6. Modifying column(s) in a table: ALTER TABLE supplier MODIFY supplier_namevarchar2(100) not null; 7. Drop column(s) in a table: ALTER TABLE supplier DROP COLUMN supplier_name; 8. Primary key: CREATE TABLE supplier( supplier_id numeric(10) not null,supplier_namevarchar2(50) not null,contact_namevarchar2(50),CONSTRAINT supplier_pk PRIMARY KEY (supplier_id, supplier_name)); 9.Add primary key: ALTER TABLE supplier add CONSTRAINT supplier_pk PRIMARY KEY (supplier_id); 10.Drop primary key: ALTER TABLE supplier drop CONSTRAINT supplier_pk; 11.Disable primary key: ALTER TABLE supplier disable CONSTRAINT supplier_pk; 12.Enable primary key: ALTER TABLE supplier enable CONSTRAINT supplier_pk;
  • 24. 13.Foreign key creation: CREATE TABLE supplier ( supplier_id numeric(10) not null, supplier_namevarchar2(50) not null, contact_namevarchar2(50), CONSTRAINT supplier_pk PRIMARY KEY (supplier_id, supplier_name) ); CREATE TABLE products ( Product_id numeric (10) not null, Supplier_id numeric (10) not null, supplier_name varchar2 (50) not null, CONSTRAINT fk_supplier_comp FOREIGN KEY (supplier_id, supplier_name) REFERENCES supplier (supplier_id, supplier_name) );
  • 25. 14. Alter foreign key: ALTER TABLE products add CONSTRAINT fk_supplier FOREIGN KEY (supplier_id) REFERENCES supplier (supplier_id); 15. Drop foreign key: ALTER TABLE SALES_ORDER_LINE DROP FOREIGN KEY FK_SALES_ORDER_LINE_PRODUCT 16. Check constraint: ALTER TABLE EMPLOYEE ADD CONSTRAINT REVENUE CHECK (SALARY + COMM > 25000) 17. Drop check constraint: ALTER TABLE EMPLOYEE DROP CONSTRAINT REVENUE CHECK (SALARY + COMM > 25000) 18. Drop Table: DROP TABLE customer;
  • 26. 19. Truncate Statement: Truncate table customer; *********************************************************************End of DDL Statements********************************************************************* ** ****************************************************************** DML Statements *************************************************************************** 20. Insert rows in table: 1) INSERT INTO Store_Information (store_name, Sales, Date) VALUES ('Los Angeles', 900, 'Jan-10-1999') 2) INSERT INTO Store_Information (store_name, Sales, Date) SELECT store_name, Sales, Date FROM Sales_Info WHERE Year (Date) = 1998
  • 27. 21. Update Statement in table: UPDATE suppliers SET name = 'HP' WHERE name = 'IBM'; UPDATE suppliers SET supplier_name = (SELECT customers.name FROM customers WHERE customers. customer_id = suppliers. supplier_id); 22. Delete Statement in table: DELETE FROM suppliers WHERE supplier_name = 'IBM'; DELETE FROM suppliers WHERE EXISTS ( select customers.name from customers wherecustomers.customer_id = suppliers.supplier_id andcustomers.customer_name = 'IBM' ); ************************** select statement
  • 28. 23.Select Statement in table: 1. SELECT LastName, FirstName FROM Persons; 2. SELECT * FROM Persons; The SELECT DISTINCT Statement: SELECT DISTINCT Company FROM Orders; The WHERE Clause: SELECT * FROM Persons WHERE City='Sandnes‘ Using LIKE SELECT * FROM Persons WHERE FirstName LIKE 'O%'
  • 29. Arithmetic Operation: Operator Description = Equal <> Not equal > Greater than < Less than >= Greater than or equal <= Less than or equal BETWEEN Between an inclusive range LIKE Search for a pattern IN If you know the exact value you want to return for at least one of the columns
  • 30. BETWEEN ... AND: SELECT * FROM Persons WHERE LastName BETWEEN 'Hansen' AND 'Pettersen'; IN SELECT * FROM Persons WHERE LastName IN 'Hansen','Pettersen'); Column Name Alias SELECT LastName AS Family, FirstName AS Name FROM Persons AND & OR SELECT * FROM Persons WHERE FirstName='Tove' AND LastName='Svendson' SELECT * FROM Persons WHERE firstname='Tove' OR lastname='Svendson' ORDER BY SELECT Company, OrderNumber FROM Orders ORDER BY Company SELECT Company, OrderNumber FROM Orders ORDER BY Company DESC, OrderNumber ASC
  • 31. Group by Clause: SELECT Company, SUM (Amount) FROM Sales GROUP BY Company; Having Clause: SELECT Company, SUM (Amount) FROM Sales GROUP BY Company HAVING SUM (Amount)>10000; Using UNION Clause: SELECT E_Name FROM Employees_Norway UNION SELECT E_Name FROM Employees_USA UNION ALL Clause: SELECT E_Name FROM Employees_Norway UNION ALL SELECT E_Name FROM Employees_USA
  • 32. JOINS: Referring to Two Tables: SELECT Employees.Name, Orders.Product FROM Employees, Orders WHERE Employees.Employee_ID=Orders.Employee_ID INNER JOIN: SELECT Employees.Name, Orders.Product FROM Employees INNER JOIN Orders ON Employees.Employee_ID=Orders.Employee_ID; LEFT JOIN: SELECT Employees.Name, Orders.Product FROM Employees LEFT JOIN Orders ON Employees.Employee_ID=Orders.Employee_ID; RIGHT JOIN: SELECT Employees.Name, Orders.Product FROM Employees RIGHT JOIN Orders ON Employees.Employee_ID=Orders.Employee_ID;
  • 33. Subqueries: 1) Select distinct country from Northwind.dbo.Customers where country not in (select distinct country from Northwind.dbo.Suppliers); 2) Select top 1 OrderId, convert (char (10), OrderDate, 121) Last_Paris_Order, (Select convert (char (10), max (OrderDate), 121) from Northwind.dbo.Orders) Last_OrderDate,datediff(dd,OrderDate, (select Max(OrderDate) from Northwind.dbo.Orders))Day_Diff fromNorthwind.dbo.Orders whereShipCity = 'Paris' order by OrderDatedesc;
  • 34. Commit & Rollback Statements: 1) UPDATE suppliers SET name = 'HP' WHERE name = 'IBM'; Commit; 2) UPDATE suppliers SET name = 'HP' WHERE name = 'IBM'; Rollback; SavepointStatement: INSERT INTO DEPARTMENT VALUES ('A20', 'MARKETING', 301); SAVEPOINT SAVEPOINT1; INSERT INTO DEPARTMENT VALUES ('B30', 'FINANCE', 520); SAVEPOINT SAVEPOINT2;
  • 35. ETL Tool QuerySurge at a Glance

Editor's Notes

  • #5: You might go to the vendors’ web sites to find a good demo to show your students.