Power Center 8 6 Update UG 4 Nov 2008
Power Center 8 6 Update UG 4 Nov 2008
6
Data Quality 8.6
Update Session
Identity Resolution
4 november 2008
1
• Informatica
- upgrades/updates vs. support
• PowerCenter 8.6
- new architecture
- new features
* CacheCalc
* Parameters
* GRID/High Availability
* Data Masking
2
• Informatica
- upgrades/updates vs. support
• PowerCenter 8.6
- new architecture
- new features
* CacheCalc
* Parameters
* GRID/High Availability
* Data Masking
3
PowerCenter 6 and 7 Architecture
Sources Targets
Client Tools
Designer Rep. Manager
Standards, Standards,
Messaging, Messaging,
Web Services WF Manager WF Monitor Web Services
Relational/Flat Relational/Flat
Files
Files
Data Servers (pmserver)
PowerCenter
Connects
Mainframe/
PowerExchange Mainframe/
Midrange Midrange
4
PowerCenter 8 Architecture
Client Tools
Repository Manager Workflow Manger
Core Services
PowerCenter
Connects Repository Service
Domain/Gateway Services Log Service
• Administration & Authorization
• Configuration
PowerExchange • Domain
• Licensing
5
Domain Overview Dashboard
6
7
• Informatica
- upgrades/updates vs. support
• PowerCenter 8.6
- new architecture
- new features
* CacheCalc
* Parameters
* GRID/High Availability
* Data Masking
8
9
• Informatica
- upgrades/updates vs. support
• PowerCenter 8.6
- new architecture
- new features
* CacheCalc
* Parameters
* GRID/High Availability
* Data Masking
10
Additional levels of specification
(in parameter files)
high low
GLOBAL
PRECEDENCE
SERVICE
SCOPE
SERVICE PROCESS
WORKFLOW
SESSION
low high
11
Example parameter file
Global level
Service level
Service process
level
Workflow level
takes precedence
over higher order levels
such as Global / Service
/ Service Process.
12
Expanded areas of parametrization
• Pre-defined server variables can be overridden in parameter file now
• Use of workflow variables in any task in the workflow as a parameter
• Owner name of relational source
• Table name prefix of relational target
• E-mail address in e-mail tasks and pre/post session e-mails
• File name parameterization
Workflow and session log file names
FTP remote file name
Persistent cache file name
• Use of workflow/session/mapping variables and parameters in
E-mail subject/ body
Pre/post SQL for source & target
Environment SQL
Transaction SQL
Pre/post session commands
Call text of stored procedures
13
• Informatica
- upgrades/updates vs. support
• PowerCenter 8.6
- new architecture
- new features
* CacheCalc
* Parameters
* GRID/High Availability
* Data Masking
14
PowerCenter 8 Architecture
Client Tools
Repository Manager Workflow Manger
Core Services
PowerCenter
Connects Repository Service
Domain/Gateway Services Log Service
• Administration & Authorization
• Configuration
PowerExchange • Domain
• Licensing
15
Domain Overview Dashboard
16
High Availability in PowerCenter V8
• Failover
• Restart for data integration, repository and other services
• Primary and backup servers
• Recovery
• Workflow and sessions will be recovered on running servers on
the grid during server failure
• Checkpoint recovery
• Repository recovery
• Resilience
• PowerCenter jobs will sustain transient failure
• Network errors
• DB connection failures
17
Simple High Availability/Failover Scenario
Integration
Service Primary Backup
Repository
Service Backup Primary
18
Simple High/Failover Availability Scenario
Node A Node B
Integration
Service Unavailable Backup
Repository
Service Unavailable Primary
19
• Informatica
- upgrades/updates vs. support
• PowerCenter 8.6
- new architecture
- new features
* CacheCalc
* Parameters
* GRID/High Availability
* Data Masking
20
Protecting Sensitive Data
Restrict Access, Mask Private Data
• Support
Glen Carter
21
What is Data Masking?
22
Masking Production Data for Test Environment
Packaged Packaged
Applications Applications
PowerCenter +
Relational and Relational and
Flat Files Data Masking Option Flat Files
23
CUSTOMERS_PROD CUSTOMERS_TEST
PowerCenter +
Data Masking Option
CUSTOMER_ACCOUNTS CUSTOMER_ACCOUNTS
_PROD _TEST
24
25
• Informatica
- upgrades/updates vs. support
• PowerCenter 8.6
- new architecture
- new features
* CacheCalc
* Parameters
* GRID/High Availability
* Data Masking
26
Reference Table Manager (RTM) Overview
27
Typical Use Cases
• Data Migrations
• Define default values for an application (e.g. the default value to
load into a field when there is no corresponding data from the
source system)
• Define valid values for an application (e.g. the list of acceptable
geography codes for an ERP application)
• Cross-reference data values from one application to another
(e.g. how the accounting codes from a legacy financial system
map to the codes in the new finance application)
28
RTM empowers the business SME to create, edit and
manage reference tables directly
29
RTM maintains an audit trail on changes to the
reference data
30
RTM is integrated with PowerCenter Admin Console
31
32
• Informatica
- upgrades/updates vs. support
• PowerCenter 8.6
- new architecture
- new features
* CacheCalc
* Parameters
* GRID/High Availability
* Data Masking
33
Mapping Analyst for Excel
Current Approach
DI Developer
1. Translate mapping specifications into
PowerCenter Mappings
34
Mapping Analyst for Excel
Mapping Specification: Issues with Current Practice
Each analyst has his
own Excel format
35
Mapping Analyst for Excel
The Informatica Solution
DI Developer
1. Augment & Tune Generated
PowerCenter Mappings
2. Re-use mappings for other projects
36
Mapping Analyst for Excel
Keep Specifications Updated
• Define business terms
• Specify associated sources, target • Jumpstart mappings from specification
• Define transformation rules • Modify and optimize mappings
Analyst • Standardize Excel format Developer • Synchronize mappings w/ spec.
37
Mapping Specification Lifecycle
38
39