0% found this document useful (0 votes)

17 views

Partisions Types

The document discusses performance tuning in Informatica mappings that are running slowly. Some key points discussed include: 1. Partitioning sessions into multiple threads to process data in parallel which can improve performance. There are different types of partitioning including key range, hash, round robin, and pass through. 2. Steps for performance tuning include designing mappings with minimal transformations, filtering unwanted data in the source, and checking session statistics and logs. 3. Increasing the number of partitions allows more concurrent processing but can overload the system if too many partitions are used. Partition points control how data is distributed among partitions.

Uploaded by

malleswari Ch

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views

Partisions Types

Uploaded by

malleswari Ch

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

You are on page 1/ 5

Requirement:How to do performance tuning in informatica if mapping is taking

long time

Long running sessions, Timeout, session failured , CPU consuption.

one thing is viewdged volume of data and another thing number of column and
complex logic.
Above sennarios we are doing performance tuning in informatica.
PARTITION: Parallel processing in the sence we have to incresing the number of
threads
inserte of one thread we are going for number of parallel threads.
1. DBlevel partition(At DB level and not in informatica),
2.Range : Ifwe have thedata rangelike 1 to 10 and 10 to 20 like thiswe have diff
range of data.
3.Round robin: Based on thread avilability.
4. Hash:used on hash alogorithm like
sorter,aggregater, lookup, rank used hash

Steps:
 As per best practices we need to design a mapping with minimum transformations
 As much as possible we need to filtered out unwanted data in source qualifier
itself
 If any mapping taking long time ,then we need to get session statistics in
monitor by
get run properties
 If source reading very less records like throughput 5,10,100 etc less records we
need
to look session log .
 Check busy percentages of reader,Writer and trasformation threads.
Partitioning Sessions
 Performance can be improved by processing data in parallel in a single session
bycreating
multiple partitions of the pipeline.
 By default session will have one partition that is pass through partition it will
create 1 reader and 1 writer thread.
 Rather than processing larger volumes of data through single reader and single
writer,
we willshare with multiple reader and multiple writer using partitions.
 Increasing the number of partitions allows the Integration Service to create
multiple
connections to sources and process partitions of source data concurrently.
 We have 4 types of partiotions at session level
 Key range
 Hash
 Pass through
 Round Robin
 If we create key range then we need to specify range values for each partion
based on
key column.
 If it is pass through then we need to specify SQ queries at session for each
partiotion.
Round robin partition is used to when we want to distributes rows of data evenly to
all
p artitions.
Hash auto-keys: The Integration Service uses a hash function to group rows of data
among
partitions. The Integration Service groups the data based on a partition key.
Informatica PowerCenter
Session Partitioning
Type of Informatica Partitions

Af ter tuning all the performance bottlenecks we can further improve the
performance by
addition partitions.

We can either go for

Dynamic partitioning (number of partition passed as parameter)
or Non-dynamic partition (number of partition are fixed while coding).
Apart from used for optimizing the session, Informatica partition become useful in
situations
where we need to load huge volume of data or when we are using Informatica source
which already has partitions defined, and using those partitions will allow to
improve
the session performance.
The partition attributes include setting the partition point, the number of
partitions,
and the partition types.

Partition Point:
There can be one or more pipelines inside a mapping.
Adding a partition point will divide this pipeline into many pipeline stages.
Informatica will create one partition by default for every pipeline stage.
As we increase the partition points it increases the number of threads.
Informatica has mainly three types of threads –Reader, Writer and Transformation
Thread.

The number of partitions can be set at any partition point.

We can define up to 64 partitions at any partition point in a pipeline.
When you increase the number of partitions, you increase the number of processing
threads,
which can improve session performance. However, if you create a large number of
partitions or
partition points in a session that processes large amounts of data, you can
overload the
system.

You cannot create partition points for the following transformations:

• Source definition
• Sequence Generator
• XMLParser
• XML target
• Unconnected transformations

The partition type controls how the Integration Service distributes data among
partitions
at partition points.
The Integration Service creates a default partition type at each partition point.

Type of partitions are :

1. Database partitioning,
2. Hash auto-keys
3. Hash user keys
4. Key range
5. Pass-through
6. Round-robin.

Database Partitioning
For Source Database Partitioning, Informatica will check the database system for
the
partition information
if any and fetches data from corresponding node in the database into the session
partitions.
When you use Target database partitioning, the Integration Service loads data into
corresponding database partition nodes.
Use database partitioning for Oracle and IBM DB2 sources and IBM DB2 targets.

Pass through
Using Pass through partition will not affect the distribution of data across
partitions
instead it will run in single pipeline.which is by default for all your sessions.
The Integration Service processes data without redistributing rows among
partitions.
Hence all rows in a single partition stay in the partition after crossing a pass-
through
partition point.

Key range
Used when we want to partition the data based on upper and lower limit.
The Integration Service will distribute the rows of data based on a port or set of
ports
that we define as the partition key. For each port, we define a range of values.
Based on the range that we define the rows are send to different partitions.
To define the upper and lower

Round robin partition is used to when we want to distributes rows of data evenly to
all
partitions.
To distributes the rows evenly amoung the partition.

Hash auto-keys: The Integration Service uses a hash function to group rows of data
among
partitions.
The Integration Service groups the data based on a partition key.

Hash user keys: The Integration Service uses a hash function to group rows of data
among
partitions. We define the number of ports to generate the partition key.

Informatica PowerCenter
Session Partitioning
Type of Informatica Partitions

After tuning all the performance bottlenecks we can further improve the performance
by addition partitions.

We can either go for

Dynamic partitioning (number of partition passed as parameter)
or Non-dynamic partition (number of partition are fixed while coding).
Apart from used for optimizing the session, Informatica partition become useful in
situations
where we need to load huge volume of data or when we are using Informatica source
which already has partitions defined, and using those partitions will allow to
improve the session performance.
The partition attributes include setting the partition point, the number of
partitions, and the partition types.

The number of partitions can be set at any partition point.

You cannot create partition points for the following transformations:

• Source definition
• Sequence Generator
• XMLParser
• XML target
• Unconnected transformations

The partition type controls how the Integration Service distributes data among
partitions at partition points.
The Integration Service creates a default partition type at each partition point.

Type of partitions are :

1. Database partitioning,
2. Hash auto-keys
3. Hash user keys
4. Key range
5. Pass-through
6. Round-robin.

Database Partitioning
For Source Database Partitioning, Informatica will check the database system for
the
partition information if any and fetches data from corresponding node in the
database
into the session partitions.
When you use Target database partitioning, the Integration Service loads data into
corresponding database partition
nodes.
Use database partitioning for Oracle and IBM DB2 sources and IBM DB2 targets.

Pass through
Using Pass through partition will not affect the distribution of data across
partitions instead
it will run in single pipeline.which is by default for all your sessions.
The Integration Service processes data without redistributing rows among
partitions.
Hence all rows in a single partition stay in the partition after crossing a pass-
through partition point.

Round robin partition is used to when we want to distributes rows of data evenly to
all
partitions

Hash auto-keys: The Integration Service uses a hash function to group rows of data
among
partitions. The Integration Service groups the data based on a partition key.

Hash user keys: The Integration Service uses a hash function to group rows of data
among
partitions. W e define the number of ports to generate the partition key.

CREATE TABLE Sales_Range

( salesman_id NUMBER(5),
salesman_nameVARCHAR2(30),
sales_amount NUMBER(10),
sales_date DATE
) PARTITION BY RANGE(sales_date)
(
PARTITION sales_jan2000 VALUES LESS
THAN(TO_DATE('02/01/2000','DD/MM/YYYY')),

CASE ( ) or DECODE ( )
Case( ) : Case is similar to decode but easier to understand while going through
coding.
Example:
SQL> SELECT Salary,
CASE Salary
WHEN 2500 THEN ‘Low’
WHEN 4000 THEN ‘High’
ELSE ‘Medium’
END CASE
FROM EMPLOYEES;

Decode( ) :
Example:
SQL> SELECT Salary,
DECODE(Salary, 2500,‘Low’,
4000,‘High’,
‘Medium’) AS GRADE
FROM EMPLOYEES;

Learn SAP Basis in 24 Hours
From Everand
Learn SAP Basis in 24 Hours
Alex Nordeen
4.5/5 (2)
You Are Asked To Write A MapReduce Program With Py...
No ratings yet
You Are Asked To Write A MapReduce Program With Py...
5 pages
Interview Questions and Answers Informatica Powercenter
No ratings yet
Interview Questions and Answers Informatica Powercenter
14 pages
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
From Everand
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
Wei Liu
No ratings yet
Learn HANA in 24 Hours
From Everand
Learn HANA in 24 Hours
Alex Nordeen
5/5 (1)
Sys/types.h Sys/stat.h FCNTL.H Stdio.h Stdlib.h String.h
No ratings yet
Sys/types.h Sys/stat.h FCNTL.H Stdio.h Stdlib.h String.h
12 pages
Dynamic Partitioning To Increase Parallelism in PowerCenter
No ratings yet
Dynamic Partitioning To Increase Parallelism in PowerCenter
3 pages
Pipeline Partitioning Overview Informatica
80% (5)
Pipeline Partitioning Overview Informatica
3 pages
Partitioning Attributes: 1. Partition Points
No ratings yet
Partitioning Attributes: 1. Partition Points
4 pages
Partition Types Overview
No ratings yet
Partition Types Overview
13 pages
Informatica Partitions
No ratings yet
Informatica Partitions
11 pages
Dynamic Partitioning in Informatca 8.X
No ratings yet
Dynamic Partitioning in Informatca 8.X
32 pages
Informatica Partioning
No ratings yet
Informatica Partioning
33 pages
Partitioning Oracle Sources in PowerCenter
No ratings yet
Partitioning Oracle Sources in PowerCenter
12 pages
Session and Data Partititioning
No ratings yet
Session and Data Partititioning
4 pages
Partitioning in Informatica Cloud (IICS) - ThinkETL
No ratings yet
Partitioning in Informatica Cloud (IICS) - ThinkETL
14 pages
Partitioned Tables and Indexes
100% (1)
Partitioned Tables and Indexes
24 pages
Informatica Pipeline Partitioning: WWW - Thinkittraining.in
No ratings yet
Informatica Pipeline Partitioning: WWW - Thinkittraining.in
7 pages
2 Parallel Databases
No ratings yet
2 Parallel Databases
71 pages
Parallel Databases
No ratings yet
Parallel Databases
19 pages
Partitioning PDF
No ratings yet
Partitioning PDF
5 pages
Deep dive Dynamo DB
No ratings yet
Deep dive Dynamo DB
88 pages
SAP interface programming with RFC and VBA: Edit SAP data with MS Access
From Everand
SAP interface programming with RFC and VBA: Edit SAP data with MS Access
Karl Josef Hensel
No ratings yet
Cap 4
No ratings yet
Cap 4
49 pages
Partitioning in Datastage
No ratings yet
Partitioning in Datastage
27 pages
Partitioned Tables and Indexes: Introduction To Partitioning
No ratings yet
Partitioned Tables and Indexes: Introduction To Partitioning
18 pages
Oracle 11g Partitioning
No ratings yet
Oracle 11g Partitioning
11 pages
Ab Initio - V1.6
No ratings yet
Ab Initio - V1.6
50 pages
SAS Interview Questions You'll Most Likely Be Asked
From Everand
SAS Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
Oracle Performance Tuning - Oracle Partitioning - Introduction
No ratings yet
Oracle Performance Tuning - Oracle Partitioning - Introduction
57 pages
18 Partitioned Tables and Indexes: Introduction To Partitioning
No ratings yet
18 Partitioned Tables and Indexes: Introduction To Partitioning
84 pages
SAS Programming Guidelines Interview Questions You'll Most Likely Be Asked
From Everand
SAS Programming Guidelines Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
IO Parallelism
No ratings yet
IO Parallelism
4 pages
Information Technology HandBook
From Everand
Information Technology HandBook
Duong Tran
3/5 (1)
Informatica
100% (1)
Informatica
45 pages
Microsoft - SQL Scripts For Partitioned Tables and Indexes
No ratings yet
Microsoft - SQL Scripts For Partitioned Tables and Indexes
12 pages
InformaticaQ&A
100% (1)
InformaticaQ&A
18 pages
SAP HANA Database - Partitioning and Distribution of Large Tables PDF
No ratings yet
SAP HANA Database - Partitioning and Distribution of Large Tables PDF
14 pages
Informatica PowerCenter (8.6.1) Performance Tuning
No ratings yet
Informatica PowerCenter (8.6.1) Performance Tuning
37 pages
DataStage Stages 12-Dec-2013 12PM
No ratings yet
DataStage Stages 12-Dec-2013 12PM
47 pages
Microsoft - Strategies For Partitioning Relational Data Warehouses in SQL Server
No ratings yet
Microsoft - Strategies For Partitioning Relational Data Warehouses in SQL Server
27 pages
Visual Basic 2010 Coding Briefs Data Access
From Everand
Visual Basic 2010 Coding Briefs Data Access
Kevin Hough
5/5 (1)
ASP.NET For Beginners: The Simple Guide to Learning ASP.NET Web Programming Fast!
From Everand
ASP.NET For Beginners: The Simple Guide to Learning ASP.NET Web Programming Fast!
Tim Warren
No ratings yet
Teradata PPI
No ratings yet
Teradata PPI
14 pages
Hack into your Friends Computer
From Everand
Hack into your Friends Computer
Magelan Cyber Security
No ratings yet
Nios4 FIRST STEPS
From Everand
Nios4 FIRST STEPS
Gessica Monteforte
No ratings yet
In For Ma Tic A
No ratings yet
In For Ma Tic A
21 pages
5 Partitioning
No ratings yet
5 Partitioning
23 pages
FAQs On Informatica Final
No ratings yet
FAQs On Informatica Final
55 pages
Identifying Bottlenecks
No ratings yet
Identifying Bottlenecks
7 pages
Database Partitioning With MySQL
No ratings yet
Database Partitioning With MySQL
6 pages
Column - Partitioned Tables and Join Indexes
No ratings yet
Column - Partitioned Tables and Join Indexes
10 pages
Optimizing SQL Server 2014: Course: Database Administration Effective Period: September 2015
No ratings yet
Optimizing SQL Server 2014: Course: Database Administration Effective Period: September 2015
52 pages
Informatica Interview Questions
No ratings yet
Informatica Interview Questions
27 pages
Learn Hive in 24 Hours
From Everand
Learn Hive in 24 Hours
Alex Nordeen
No ratings yet
Oracle Partitioning
No ratings yet
Oracle Partitioning
6 pages
Database Partitioning A Review Paper
No ratings yet
Database Partitioning A Review Paper
4 pages
Introduction To DBMS
No ratings yet
Introduction To DBMS
37 pages
Oracle Partitioning For Developers
No ratings yet
Oracle Partitioning For Developers
70 pages
Kafka Up and Running for Network DevOps: Set Your Network Data in Motion
From Everand
Kafka Up and Running for Network DevOps: Set Your Network Data in Motion
Eric Chou
No ratings yet
Siebel Remote Administration 8 Blackbook
From Everand
Siebel Remote Administration 8 Blackbook
Mohammed Azizuddin Aamer
No ratings yet
Cloud Infrastructure and Data Center
From Everand
Cloud Infrastructure and Data Center
Duong Tran
No ratings yet
Sessionboss: Visual Paradigm For Uml Standard Edition (K.U.Leuven)
No ratings yet
Sessionboss: Visual Paradigm For Uml Standard Edition (K.U.Leuven)
1 page
Opportunistic Lock Examples
No ratings yet
Opportunistic Lock Examples
6 pages
CC103 - Intermediate Programming / Computer Programming 2 Weekly Instruction Navigator
No ratings yet
CC103 - Intermediate Programming / Computer Programming 2 Weekly Instruction Navigator
2 pages
Cs6801 - Multicore Architectures and Programming 2 Marks Q & A Unit Iv - Distributed Memory Programming With Mpi
No ratings yet
Cs6801 - Multicore Architectures and Programming 2 Marks Q & A Unit Iv - Distributed Memory Programming With Mpi
15 pages
COMSATS University Islamabad, Lahore Campus Sessional-1 Exam - Semester FALL 2020
No ratings yet
COMSATS University Islamabad, Lahore Campus Sessional-1 Exam - Semester FALL 2020
3 pages
OLEDrag and Drop With VB6
No ratings yet
OLEDrag and Drop With VB6
7 pages
Design and Analysis of Algorithms: Israr Ali
No ratings yet
Design and Analysis of Algorithms: Israr Ali
79 pages
Developper CASE en API
No ratings yet
Developper CASE en API
26 pages
Unit j276 02 Computational Thinking Algorithms and Programming Sample Assessment Materials
No ratings yet
Unit j276 02 Computational Thinking Algorithms and Programming Sample Assessment Materials
28 pages
SQL FAANG Questionnaire
No ratings yet
SQL FAANG Questionnaire
17 pages
Sap Abap Quick Guide
No ratings yet
Sap Abap Quick Guide
93 pages
Data Abstraction and Basic Data Structures: Improving Efficiency by Building Better Object IN
No ratings yet
Data Abstraction and Basic Data Structures: Improving Efficiency by Building Better Object IN
12 pages
JAVA
No ratings yet
JAVA
21 pages
Computer Practise
No ratings yet
Computer Practise
7 pages
Section 3 OBJECTIVES: at The End of The Session, The Student Is Expected To Be Able To
No ratings yet
Section 3 OBJECTIVES: at The End of The Session, The Student Is Expected To Be Able To
7 pages
97 Things Every Programmer Should Know
100% (4)
97 Things Every Programmer Should Know
24 pages
PYTHON Assignment
No ratings yet
PYTHON Assignment
11 pages
OWSSDM Front Office Tutorial V1.0
No ratings yet
OWSSDM Front Office Tutorial V1.0
8 pages
JS Interview Questions-1
No ratings yet
JS Interview Questions-1
25 pages
1.1 Programming Paradigm: Language? What Do We Need To Know To Program in A Language? There Are Three Crucial
No ratings yet
1.1 Programming Paradigm: Language? What Do We Need To Know To Program in A Language? There Are Three Crucial
17 pages
To Calculate Total Marks & Percentage
No ratings yet
To Calculate Total Marks & Percentage
17 pages
Date Bluej Programs ISC
100% (1)
Date Bluej Programs ISC
23 pages
DS PPT Unit - 2
No ratings yet
DS PPT Unit - 2
119 pages
Web User Interface Development
No ratings yet
Web User Interface Development
4 pages
Arrays & Strings in Java
No ratings yet
Arrays & Strings in Java
26 pages
CST 205 - OOP Using Java
100% (1)
CST 205 - OOP Using Java
25 pages
Reverse Posting Code
No ratings yet
Reverse Posting Code
7 pages
Program 6 Algorithm: Void Input
No ratings yet
Program 6 Algorithm: Void Input
15 pages

Partisions Types

Uploaded by

Partisions Types

Uploaded by

Requirement:How to do performance tuning in informatica if mapping is taking

Long running sessions, Timeout, session failured , CPU consuption.

We can either go for

The number of partitions can be set at any partition point.

You cannot create partition points for the following transformations:

Type of partitions are :

We can either go for

The number of partitions can be set at any partition point.

You cannot create partition points for the following transformations:

Type of partitions are :

CREATE TABLE Sales_Range

You might also like