Data Stage
Course Description
Pre-training Test
Introduction to DataStage
• IBM Information Server architecture
• DataStage within the IBM Information Server architecture
• Difference between Server Jobs and Parallel Jobs
• Difference between Pipeline Parallelism and Partition Parallelism
• Partition techniques (Round Robin, Random, Hash, Entire, Same, Modulus, Range, DB2, Auto)
• Configuration file
• Difference between SMP and MPP architecture
• DataStage components (Server components / Client components)
• Package installer
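
As a rough illustration of the partition techniques listed above, the following Python sketch mimics four of them (Round Robin, Modulus, Hash, Entire) on an in-memory list of rows. It is not DataStage code: the function names, the `ROWS` sample data, and the use of CRC32 as the hash function are all assumptions made for this example only.

```python
from zlib import crc32

# Hypothetical sample rows; the column names are illustrative only.
ROWS = [
    {"id": 0, "city": "Pune"},
    {"id": 1, "city": "Delhi"},
    {"id": 2, "city": "Pune"},
    {"id": 3, "city": "Chennai"},
    {"id": 4, "city": "Delhi"},
]


def round_robin(rows, n):
    """Deal rows to partitions in turn: row 0 -> p0, row 1 -> p1, ..."""
    parts = [[] for _ in range(n)]
    for i, row in enumerate(rows):
        parts[i % n].append(row)
    return parts


def modulus(rows, n, key):
    """Partition number = integer key value mod number of partitions."""
    parts = [[] for _ in range(n)]
    for row in rows:
        parts[row[key] % n].append(row)
    return parts


def hash_partition(rows, n, key):
    """Rows with the same key value always land in the same partition.

    CRC32 is an assumption here; any deterministic hash would do.
    """
    parts = [[] for _ in range(n)]
    for row in rows:
        h = crc32(str(row[key]).encode("utf-8"))
        parts[h % n].append(row)
    return parts


def entire(rows, n):
    """Every partition receives a full copy of the data."""
    return [list(rows) for _ in range(n)]
```

Round Robin gives evenly sized partitions but ignores the data, while Hash and Modulus keep rows with equal key values together, which is what key-based stages such as joins and aggregations generally need.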
Datastage Administrator
• Creating project, Editing project and Deleting project
• Permissions to user
• APT configuration file (.apt)
• Environment variable creation, permission
Datastage Director
• Introduction to DataStage Director
• Job Status View
• View logs
• Scheduling
• Batches Creation
Designer
• Introduction to Designer
• Repository
• Palette
• Type of Links
• File Stages
• Sequential file
• Dataset file
• File set
• Lookup file set
• Difference between Sequential file/Dataset/File set
• Database stages
o Dynamic RDBMS
o Oracle Enterprise
o ODBC Enterprise
o Stored Procedure
Processing stage
• Change Capture stage
• Compare stage
• Difference stage
• Aggregator stage
• Transformer stage
• Surrogate Key Generator stage
• Join stage
• Merge stage
• Lookup stage
• Difference between Join/Lookup/Merge
• Difference between Join/Lookup
• Remove Duplicates
• Switch
• Pivot
• Modify
• Funnel
Debugging Stage
• Head
• Tail
• Peek
• Row Generator
• Column Generator
• Sample
• Job Parameters
Manager
• Introduction to DataStage Manager
• Importing the Job
• Exporting the Job
• Importing Table Definition
• Importing Flat File Definition
• Routines
Containers
• Difference between local container and shared container
• Local Container
• Shared Container
Job sequencer
• Arrange job activities in sequencer
• Triggers in Sequencer
• Restartability
• Recoverability
• Notification activity
• Terminator Activity
• Wait for file activity
• Start Loop activity
• Execute command activity
• Nested Condition activity
• Routine activity
• Exception Handler activity
• User variable activity
• End loop activity
• Adding Checkpoints
Information Analyzer
• IBM WebSphere Information Analyzer overview
• Data profiling process
• Column analysis
• Primary key analysis
• Foreign key analysis
• Cross-domain analysis
• Baseline analysis
• Analysis result publication
• Deleting analysis results
• Reports for information analysis
• Column analysis summary statistics reports
• Baseline analysis reports
• Cross-domain analysis reports
• Primary key reports
• Foreign key analysis reports
WebSphere QualityStage
• About Data Quality
• QualityStage stages
• Investigate stage
• Standardize stage
• Match Frequency stage
• Unduplicate Match stage
• Reference Match stage
• Survive stage
IBM Information Server Administration Guide
• IBM WebSphere DataStage administration
• Opening the IBM Information Server Web console
• Setting up a project in the console
• Customizing the project dashboard
• Setting up security
• Creating users in the console
• Assigning security roles to users and groups
• Managing licenses
• Managing active sessions
• Managing logs
• Managing schedules
• Backing up and restoring IBM Information Server
Conclusion
• Real Time Scenario
• Documents
Datastage details
