Dynamic Partitioning in Informatica 8.X

Dynamic Partitioning in PowerCenter allows sessions to run in parallel across multiple partitions for improved performance. There are several partition types, including hash, key range, round-robin, and pass-through. Dynamic partitioning determines the number of partitions at run time based on factors like source database partitions or grid nodes. This allows the partitions to scale automatically with increasing data volumes or resources to optimize performance without reconfiguring the session.

Uploaded by

hot_job_hunt

Dynamic Partitioning

integration * intelligence * insight

AGENDA

High Availability
Grid Computing
Dynamic Partitioning


Introduction

PowerCenter Domains

• PowerCenter introduces a service-oriented architecture.
• PowerCenter introduces a domain, which serves as the primary unit of administration for the PowerCenter environment.
• A domain is a collection of nodes and services in the PowerCenter environment.
• The first time you install Informatica Services, you create a domain and add a node to the domain.

Administration Console

• The Administration Console is a browser-based utility that enables you to view domain properties and perform basic domain administration tasks.
• The Navigator displays the following types of objects:
• Domain. You can view one domain in the Administration Console.
• Node. A node represents a machine in the domain.
• Grid. Create a grid to run the Integration Service on multiple nodes.

High Availability

• High availability is the PowerCenter option that eliminates a single point of failure in the PowerCenter environment.
• High availability provides the following functionality:
• Resilience.
• Failover.
• Recovery.

The Partitioning Option

• The Partitioning Option increases PowerCenter’s performance through parallel data processing.
• When the Integration Service runs the session, it can achieve higher performance by partitioning the pipeline and performing the extract, transformation, and load for each partition in parallel.
• Partition types:
• Database partitioning.
• Hash auto-keys.
• Hash user keys.
• Key range.
• Pass-through.
• Round-robin.

Configuring Partitioning

• Create or edit a session.
• Update partitioning information using the Partitions view on the Mapping tab of session properties.
• Add, delete, or edit partition points on the Partitions view of session properties.

Configuring a Partition Point

• You can configure the following information when you edit or add a partition point:
• Specify the partition type at the partition point.
• Add and delete partitions.
• Enter a description for each partition.


Hash user keys

• The Integration Service uses a hash function to group rows of data among partitions.
• This can improve session performance. The hash function usually processes numerical data more quickly than string data.
• With hash user keys, you specify the ports that form the hash key.
• In our sample mapping (m_orders_scd3), the session takes 37 seconds to run when partitioning is not configured.
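
As a rough illustration of how a hash key groups rows, the Python sketch below assigns each row to a partition from a hash of a user-chosen key. The column name `customer_id`, the sample rows, and the use of `zlib.crc32` are assumptions made for illustration; the hash function that the Integration Service actually uses is not described in these slides.

```python
import zlib
from collections import defaultdict


def hash_partition(rows, key, num_partitions):
    """Group rows so that equal key values always land in the same partition."""
    partitions = defaultdict(list)
    for row in rows:
        # crc32 is a stand-in hash (assumption), chosen because it is stable across runs.
        digest = zlib.crc32(str(row[key]).encode("utf-8"))
        partitions[digest % num_partitions].append(row)
    return dict(partitions)


# Hypothetical order rows; "customer_id" plays the role of the user-defined hash key.
orders = [
    {"order_id": 1, "customer_id": 101},
    {"order_id": 2, "customer_id": 102},
    {"order_id": 3, "customer_id": 101},
    {"order_id": 4, "customer_id": 103},
]

for partition_id, rows in sorted(hash_partition(orders, "customer_id", 3).items()):
    print(partition_id, [r["order_id"] for r in rows])
```

Orders 1 and 3 share a key value, so they always end up in the same partition, whichever partition that turns out to be.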


Hash user keys (contd.)

• With hash user keys partitioning, the session completes in 22 seconds, as shown in the figure below.

Key range partition

• With key range partitioning, the Integration Service distributes rows of data based on a port that you define as the partition key.
• You define a range of values for each partition.
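
The idea of routing rows by key range can be sketched in a few lines of Python. The ranges, the order-id values, and the choice to treat the end of each range as exclusive are assumptions for illustration only; how the Integration Service treats rows that fall outside every range is not covered in these slides.

```python
def key_range_partition(value, ranges):
    """Return the index of the first partition whose [start, end) range holds value."""
    for index, (start, end) in enumerate(ranges):
        if start <= value < end:
            return index
    return None  # value falls outside every configured range


# Hypothetical ranges on an order-id port, one range per partition.
ranges = [(1, 1000), (1000, 2000), (2000, 3000)]

for order_id in (5, 1000, 2999, 4000):
    print(order_id, key_range_partition(order_id, ranges))
```

Because the ranges are fixed in advance, the balance between partitions depends on how the key values are actually distributed.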


Key range partition (contd.)

• With key range partitioning, the session completes in 33 seconds, as shown in the figure below.

Partition details

• Source/target statistics


Hash auto-keys

• Use hash auto-keys partitioning at or before Rank, Sorter, Joiner, and unsorted Aggregator transformations.
• The Integration Service distributes rows to each partition by group before they enter the Sorter and Aggregator transformations, so that rows belonging to the same group reach the same partition.

Pass-Through Partition Type

• In pass-through partitioning, the Integration Service processes data without redistributing rows among partitions.
• Use it to increase data throughput without increasing the number of partitions.

Round-Robin Partition Type

• In round-robin partitioning, the Integration Service distributes rows of data evenly to all partitions.
• For example, consider a session that reads item information from three flat files of different sizes:
• Source file 1: 80,000 rows
• Source file 2: 5,000 rows
• Source file 3: 15,000 rows
• When the Integration Service reads the source data, the first partition begins processing 80% of the data, the second partition processes 5% of the data, and the third partition processes 15% of the data.
• To distribute the workload more evenly, set a partition point at the Filter transformation and set the partition type to round-robin. The Integration Service distributes the data so that each partition processes approximately one-third of the data.
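
The arithmetic in this example can be checked with a short Python sketch. It uses the row counts from the three source files and a simple deal-one-row-at-a-time loop as a model of round-robin distribution; the function name and the modelling choice are assumptions, not the Integration Service's implementation.

```python
from itertools import cycle


def round_robin(rows, num_partitions):
    """Deal rows out to partitions one at a time, in turn."""
    partitions = [[] for _ in range(num_partitions)]
    for row, target in zip(rows, cycle(range(num_partitions))):
        partitions[target].append(row)
    return partitions


# Row counts taken from the three source files in the example.
source_sizes = [80_000, 5_000, 15_000]
all_rows = range(sum(source_sizes))  # 100,000 rows in total

before = [size / sum(source_sizes) for size in source_sizes]
after = [len(p) for p in round_robin(all_rows, 3)]

print(before)  # [0.8, 0.05, 0.15]
print(after)   # [33334, 33333, 33333]
```

Without repartitioning, one partition carries 80% of the rows; after round-robin, no partition differs from another by more than one row.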


Dynamic Partitioning

• If the volume of data grows or you add more CPUs, you might need to adjust
partitioning so the session run time does not increase.

• When you use dynamic partitioning, you can configure the partition information so
the Integration Service determines the number of partitions to create at run time.

• The Integration Service scales the number of session partitions at run time based on
factors such as source database partitions or the number of nodes in a grid.


Configuring Dynamic Partitioning

• Configure dynamic partitioning using one of the following methods:
• Disabled. Do not use dynamic partitioning. You define the number of partitions on the Mapping tab.
• Based on number of partitions. Sets the partitions to a number that you define in the Number of Partitions attribute. Use the $DynamicPartitionCount session parameter, or enter a number greater than 1.
• Based on number of nodes in grid. Sets the partitions to the number of nodes in the grid running the session. If you configure this option for sessions that do not run on a grid, the session runs in one partition and logs a message in the session log.
• Based on source partitioning. Determines the number of partitions using database partition information. The number of partitions is the maximum of the number of partitions at the source.
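
The four methods can be summarized as a small decision function. The Python sketch below is only a model of the rules listed above: the function, its parameter names, and the method strings are hypothetical and do not correspond to any PowerCenter API, and `source_partitions` is assumed to be a list holding the partition count of each source.

```python
def resolve_partition_count(method, configured=1, number_of_partitions=None,
                            grid_nodes=None, source_partitions=None):
    """Model how the partition count is chosen for each dynamic partitioning method."""
    if method == "disabled":
        return configured  # count defined on the Mapping tab
    if method == "based_on_number_of_partitions":
        if number_of_partitions is None or number_of_partitions <= 1:
            raise ValueError("enter a number greater than 1")
        return number_of_partitions
    if method == "based_on_number_of_nodes_in_grid":
        # A session that does not run on a grid runs in one partition.
        return grid_nodes if grid_nodes else 1
    if method == "based_on_source_partitioning":
        # Maximum of the partition counts found at the source(s).
        return max(source_partitions)
    raise ValueError(f"unknown method: {method}")


print(resolve_partition_count("disabled", configured=2))
print(resolve_partition_count("based_on_number_of_partitions", number_of_partitions=3))
print(resolve_partition_count("based_on_number_of_nodes_in_grid", grid_nodes=4))
print(resolve_partition_count("based_on_number_of_nodes_in_grid"))  # not on a grid
print(resolve_partition_count("based_on_source_partitioning", source_partitions=[2, 5, 3]))
```

Only the first method fixes the count at design time; the other three defer it to run time, which is what lets the session scale without being edited.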


Based on number of partitions

• Edit the session task and go to the Config Object tab. Set Dynamic Partitioning to Based on number of partitions, and set Number of Partitions to 3.

Based on number of partitions (contd.)

• With dynamic partitioning based on number of partitions, the session completes in 32 seconds, as shown in the figure below.

Partition details

• Source/target statistics


Based on number of nodes in grid

• Edit the session task and go to the Config Object tab. Set Dynamic Partitioning to Based on number of nodes in grid.

Based on number of nodes in grid (contd.)

• With dynamic partitioning based on number of nodes in grid, the session completes in 25 seconds, as shown in the figure below.

Based on source partitioning

• Edit the session task and go to the Config Object tab. Set Dynamic Partitioning to Based on source partitioning.

Based on source partitioning (contd.)

• With dynamic partitioning based on source partitioning, the session completes in 20 seconds, as shown in the figure below.
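
To put the reported run times side by side, the Python sketch below computes how much shorter each run is than the 37-second run without partitioning. It assumes that every timing on the slides refers to the same sample mapping (m_orders_scd3) in the same environment; these are single runs reported by the presenter, not a benchmark.

```python
# Run times in seconds, as reported on the slides.
baseline = 37  # no partitioning
run_times = {
    "hash user keys": 22,
    "key range": 33,
    "dynamic: number of partitions": 32,
    "dynamic: nodes in grid": 25,
    "dynamic: source partitioning": 20,
}


def reduction_percent(seconds, reference=baseline):
    """Percentage by which a run is shorter than the reference run."""
    return round(100 * (reference - seconds) / reference, 1)


for name, seconds in sorted(run_times.items(), key=lambda item: item[1]):
    print(f"{name}: {seconds}s, {reduction_percent(seconds)}% shorter than {baseline}s")
```

On these figures, source-based dynamic partitioning gives the shortest run, at roughly 46% less than the unpartitioned session.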


Advantages of Dynamic Partitioning

• Session run time does not have to increase as the volume of data grows, and the session can take advantage of added CPUs without being reconfigured.
• Scales cost-effectively to handle large data volumes.
• Enhances developer productivity.
• Optimizes system performance in response to changing business requirements.
• With grid computing, the session can still complete even if a node fails.

LIMITATIONS OF DYNAMIC PARTITION

• You cannot use dynamic partitioning with XML sources and targets.
• You cannot use dynamic partitioning with the Debugger.

Thanks
