IBM QualityStage V11.5.x Standardizing Dat v2

The document discusses the standardization process in IBM QualityStage. It can identify new fields based on underlying data, such as setting flags. The standardize stage creates additional columns that can be used for blocking and matching. Phonetic codes like NYSIIS are used in matching. Classification overrides can modify rule sets and take precedence over classification tables. Rule sets can contain lookup tables that are called from pattern action files. The standardization process involves parsing free-form fields, assigning tokens to fields, and creating addressable output.

Uploaded by

Antonio Blanco


IBM QualityStage V11.5.x Standardizing Data

1. The Standardize Stage identifies new fields based on underlying data. Which of the following are examples of this process?

Select one or more:

A. Setting a name-type flag to differentiate between an individual address and an organization address

B. Creating phonetic representations of data that can be used in the matching process

C. Creating transformation rules for matching and for creating the load file

D. Setting an address-type flag

The Standardize stage has only one output link. This link can send the raw input
and the standardized output to any other stage. The Standardize stage creates
multiple columns that you can send along with the input columns to the output
link. Any columns from the original input can be written to the output along with
additional data created by the Standardize stage based on the input data (such
as SOUNDEX or NYSIIS phonetic codes). The Match stage and other stages
can use the output from the Standardize stage: you can use any of the
additional data for blocking and matching columns in Match stages (more on this
in 1.8, “Match stage” on page 82).
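To make the idea of a phonetic code concrete, here is a minimal sketch of the standard American Soundex algorithm in Python. It is not QualityStage's own implementation, and the exact variant the Standardize stage emits may differ. The function name `soundex` is chosen for this illustration only. NYSIIS is a different, more involved algorithm and is not shown.

```python
# Letter-to-digit table for standard American Soundex.
_CODES = {}
for _letters, _digit in (("BFPV", "1"), ("CGJKQSXZ", "2"), ("DT", "3"),
                         ("L", "4"), ("MN", "5"), ("R", "6")):
    for _ch in _letters:
        _CODES[_ch] = _digit


def soundex(name: str) -> str:
    """Return the 4-character Soundex code of a name ('' if it has no letters)."""
    letters = [c for c in name.upper() if "A" <= c <= "Z"]
    if not letters:
        return ""
    result = [letters[0]]                 # the first letter is kept as-is
    prev = _CODES.get(letters[0], "")     # its code still suppresses a repeat
    for ch in letters[1:]:
        if ch in "HW":
            continue                      # H and W do not break a run of equal codes
        code = _CODES.get(ch, "")         # vowels and Y have no code
        if code and code != prev:
            result.append(code)
        prev = code                       # a vowel resets the run
        if len(result) == 4:
            break
    return "".join(result).ljust(4, "0")  # pad short codes with zeros


print(soundex("Robert"), soundex("Rupert"))  # both names share one code
```

Because similar-sounding names collapse to the same short code, such a column is a natural candidate for a blocking column in a match.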

2. Which of the following are phonetic codes used in Matching?

Select one or more:

A. NYSIIS

B. Marlboro

C. USSUBURBS

D. EU

E. Soundex

3. Which of the following are TRUE statements about Classification Overrides?

Select one or more:

A. Word Investigation Word Frequency reports are useful in implementing Classification Overrides

B. User Overrides cannot override or add token values

C. Classification overrides do not take precedence over a classification table

D. Classification overrides are available in domain pre-processor rule sets and in domain-specific rule sets

Classifications
The Classification Table is used in the standardization process to identify and classify key words
such as titles, street names, street types, and directions. The Classification Table includes the name
of the rule set and the classification legend. Click CLS in Figure 1-33 to view its contents. The
partial contents of the Classification Table for the USPREP rule set are shown in Example 1-1 on
page 33. As already mentioned, you can gain valuable insight by browsing the Classification tables
to determine the classification codes, as well as the different literals supported, such as ZQMIXAZQ
and ZQNAMEZQ.

Selecting override object types to modify rule sets
In the Rules Management window, as shown in Figure 1-33 on page 62, Overrides provides editing
windows to customize rule sets for your business requirements.
IBM WebSphere QualityStage provides five methods of rule set overrides as follows:
Classification
You can modify the classification table of any rule set using the Designer client. Figure 1-34 on
page 65 and Figure 1-35 on page 66 show the classification table override for the domain-specific
USADDR rule set.
In the Input Token field, type the word (AVEDUE) for which you want to override the classification
as it appears in the input file. In the Standard Form field, type the standardized spelling (AVE) of
the token. From the Classification menu, select the one-character tag (T - Street Types) that
indicates the class of the token word. In the Comparison Threshold field, type a value (850) that
defines the degree of uncertainty to tolerate in the spelling of the token word. Click Add in Figure
1-34 to add the override to the pane at the bottom of the window, as shown in Figure 1-35 on page
66.
After you create the override (and provision it), the next time you run the rule set, the word
tokens are classified with the designations you specified and appear with the appropriate standard
form.
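The effect of a classification override with a comparison threshold can be sketched roughly as follows. This is an illustration only and rests on several assumptions: the lookup is modelled as an exact pass followed by a fuzzy pass, overrides are merged over the base table, and Python's `difflib.SequenceMatcher` ratio scaled to 0..900 stands in for QualityStage's own string comparison, which is not described here. The `BASE_TABLE` entry and its threshold of 800 are hypothetical. Only the AVEDUE / AVE / T / 850 override comes from the text.

```python
from difflib import SequenceMatcher

# token -> (standard form, class, comparison threshold or None)
BASE_TABLE = {"AVENUE": ("AVE", "T", 800)}   # hypothetical base entry
OVERRIDES = {"AVEDUE": ("AVE", "T", 850)}    # the override from the text

# Overrides win over the base table when both define the same token.
TABLE = {**BASE_TABLE, **OVERRIDES}


def similarity(a: str, b: str) -> int:
    """Stand-in similarity score, scaled so that 900 means identical."""
    return round(SequenceMatcher(None, a, b).ratio() * 900)


def classify(token: str):
    """Return (standard form, class) for a token, or None if unclassified."""
    word = token.upper()
    # Pass 1: exact match on the input token.
    if word in TABLE:
        standard, cls, _ = TABLE[word]
        return standard, cls
    # Pass 2: fuzzy match, only for entries that carry a threshold.
    best = None
    for entry, (standard, cls, threshold) in TABLE.items():
        if threshold is None:
            continue
        score = similarity(word, entry)
        if score >= threshold and (best is None or score > best[0]):
            best = (score, standard, cls)
    return (best[1], best[2]) if best else None


print(classify("avedue"))   # exact hit on the override
print(classify("AVENUEE"))  # misspelling caught by the fuzzy pass
print(classify("XYZ"))      # no entry is close enough
```

A higher threshold tolerates less spelling variation, so a value such as 850 accepts only near-identical tokens under this stand-in measure.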

With the input pattern override, you can specify token overrides that are based on the input
pattern. The input pattern overrides take precedence over the pattern-action file. Input pattern
overrides are specified for the entire input pattern.

4. Where can you launch the SRD?

Select one or more:
A. From the Director client
B. From any web browser, with the exception of Firefox
C. From the Information Server Launch Pad
D. From the QualityStage Rules Management dialog in the Designer Client

5. Which statement below is a valid Pattern Action File statement?

Select one:

A. COPY_A [3] (StreetType}

B. COPY_S [2] {StreetName)

C. COPY [1] (HouseNumber)

D. COPY [1] {HouseNumber}

^|D|+|T
COPY [1] {HouseNumber}
COPY [2] {StreetPrefixDirectional}
COPY [3] {StreetName}
COPY [4] {StreetSuffixType}
EXIT
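The pattern-action set above can be read as "if the token classes are numeric, direction, single word, street type, copy each operand into a named dictionary field". The following Python sketch mimics that behaviour for one rule. It is not the Pattern-Action language itself. The helper names `token_class` and `apply_rule`, the small direction and street-type sets, and the `UNK` placeholder are all hypothetical and exist only for this illustration.

```python
# Hypothetical stand-ins for classification-table lookups.
DIRECTIONS = {"N", "S", "E", "W"}
STREET_TYPES = {"ST", "AVE", "RD"}

RULE_PATTERN = "^|D|+|T"
# Each COPY [n] {Field} line becomes (operand number, dictionary field).
RULE_ACTIONS = [
    (1, "HouseNumber"),
    (2, "StreetPrefixDirectional"),
    (3, "StreetName"),
    (4, "StreetSuffixType"),
]


def token_class(token: str) -> str:
    """Assign one class symbol per token, covering only the four classes used above."""
    if token.isdigit():
        return "^"          # numeric token
    if token in DIRECTIONS:
        return "D"          # direction
    if token in STREET_TYPES:
        return "T"          # street type
    if token.isalpha():
        return "+"          # single unclassified alphabetic word
    return "UNK"            # placeholder, not a real class symbol


def apply_rule(text: str):
    """Return the populated fields if the input matches the pattern, else None."""
    tokens = text.upper().split()
    pattern = "|".join(token_class(t) for t in tokens)
    if pattern != RULE_PATTERN:
        return None         # the rule does not fire for this input
    # Operands are 1-based, like [1]..[4] in the COPY statements.
    return {field: tokens[operand - 1] for operand, field in RULE_ACTIONS}


print(apply_rule("123 N Main St"))
```

Note the syntax the quiz question turns on: the operand sits in square brackets and the target field in matching curly braces.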

6. Which of the statements below is TRUE about the Comparison Threshold?

Select one:

A. It cannot be used in the Classification Table

B. The second pass through the classification table looks for a fuzzy match based on the threshold
level

C. It is always required

D. The second pass through the classification table looks for an exact match

7. Which of the following statements about Overrides is TRUE?

Select one:

A. Overrides cannot be tested with Rules Analyzer

B. Overrides are used to customize rule sets by applying changes to the Pattern Action File

C. Overrides are used to correct problems found during standardization

D. Administrator status is always required to create Overrides

8. Which of the following are methods used to standardize international data?

Select one or more:

A. Use a country pre-processor with a domain pre-processor and domain-specific rules

B. Use a default country code designated by ZC…default value…ZC

C. Use a Multinational Standardize or Address Verification Interface

D. Use a four-byte ISO country code

You can also apply rule sets for international stages such as Worldwide Address Verification and
Enhancement System (WAVES) and Multinational Standardize Stage (MNS). With all of these
stages, you can use rules management (that is, modify existing rules and add new rules).

9. Which statements below are TRUE about Lookup Tables?

Select one or more:

A. QualityStage does not use Lookup Tables

B. Rule sets are being phased out in recent versions of QualityStage

C. Rule sets can contain Lookup Tables

D. They are called from the Pattern Action File

Lookup Tables
Click Reference Tables in the Rules Management window in Figure 1-33 on
page 62 to view information about the rule set.

10. Which statements are TRUE about Rule set revision?

Select one or more:

A. Unpublished changes can be used in the Standardize stage

B. Changes are saved in the SRD database

C. It is a way to save and revert changes to rule sets

D. You can roll back changes by resetting a revision

11. When using Rule Sets, which of the following is an optional file?

Select one:

A. Lookup table

B. Dictionary file

C. Pattern action file

D. Classification table

12. Which statements are TRUE about Text Overrides?

Select one or more:

A. Input Text Overrides apply to the original text string

B. Text overrides must not include character sets with UTF-8 encoding

C. Unhandled Text Overrides only apply to short strings (less than 20 characters)

D. Text Overrides can use partial string matching

E. Text Overrides are used for special cases and specific handling of a string of text

13. Which of the following are TRUE about the Standardization Transformation process?

Select one or more:

A. It may execute a Dictionary File script

B. It may use a comparison threshold for classifying like words

C. It may involve parsing free-form fields

D. It may involve bucketing data tokens

E. It involves decomposing free-form fields into single-component fields and assigning data to its appropriate metadata field

The Standardize stage processes the data with the following outcome:
- Creates fixed-column, addressable data
- Facilitates effective matching
- Enables output formatting
The Standardize stage parses free-form and fixed-format columns into
single-domain columns to create a consistent representation of the input data.
- Free-form columns contain alphanumeric information of any length as long as
  it is less than or equal to the maximum column length defined for that column.
- Fixed-format columns contain only one specific type of information, such as
  only numeric, character, or alphanumeric, and have a specific format.
