CS614MCQs_Spring13_solvedbyDrTariqhanif1
CS614MCQs_Spring13_solvedbyDrTariqhanif1
SPRING-2013
OLAP 30page
OLTP
Data Cleansing
ETL
2. The confusion created by data redundancy makes it difficult for companies to
Unpredictable pg 62
Predictable
Conventional
Unsurprising
04. OLAP is a (n) ___________ of application.
Classification pg74
Amalgamation
Unification
Blending
5. DOLAP model facilitates ___________ computing paradigm.
Mobile pg 78
Permanent
Rigid
Strict
6. ______ is the lowest level of detail or the atomic level of data stored in the warehouse.
Cube
Grain pg 111
Virtual Cube
Page 1
MIDTERM-
SPRING-2013
Aggregate
7. Extract, Transform, Load (ETL) process consist of steps which are ______________.
Independent and Interrelated 131
Independent or Interrelated
Dependent and Interrelated
Dependent or Interrelated
8. In _________ system, the contents change with time.
OLTP pg 20
DSS
ATM
OLAP
9. ________ is an application of intelligence and experience.
Skill
Power
Wisdom pg 11
Knowledge
10. 3NF removes even more data redundancy than 2NF but it is at the cost of
Only One-to-One
Only Many-to-Many
Only One-to-Many
Both One-to-One and Many-to-Many pg 52
12. Transactional fact tables do not have records for events that do not occur. These are
called
Page 2
MIDTERM-
SPRING-2013
Backup pg 131
Cube
Load
Schema
19. "Change Data Capture" is one of the challenging technical issues in _____________
Extraction
Loading
Cleansing pg 168
Join
CS614 Data Warehousing
1. Taken jointly, the extract programs or naturally evolving systems formed a spider web,
also known as
Page 3
MIDTERM-
SPRING-2013
Date pg 66
Most redundant column
Fact
Dimensions
4. OLAP is a (n) ___________ of application.
Classification pg 74
Amalgamation
Unification
Blending
5. ER is a _______ design technique that seeks to remove the redundancy in data.
Logical pg 98
Physical
Data Dependent
Transaction Dependent
6. ______ is the lowest level of detail or the atomic level of data stored in the warehouse.
Cube
Grain pg 111
Virtual Cube
Aggregate
7. It is called a ______ violation, if we have null values for attributes where NOT NULL
constraint exists.
Load
Transform
Constraint pg 161
Extraction
8. In the Information Age, the _________ learning organization is at a distinct disadvantage.
This term means "impaired functioning."
Functional
Dysfunctional pg 181
Purposeful
Serviceable
9. In _________ system, the contents change with time.
OLTP pg 20
DSS
ATM
Page 4
MIDTERM-
SPRING-2013
OLAP
10. It is observed that every year the amount of data recorded in an organization
Doubles pg 15
Triples
Quartiles
Remains same as previous year
11. Normalization is the process of efficiently organizing data in a database by ________ a
relational table into smaller tables by projection.
Composing
Joining / Merging
Combining
Decomposing pg 41
12. 3NF removes even more data redundancy than 2NF but it is at the cost of
1. Block Insert pg
2. Block Slamming
Page 5
MIDTERM-
SPRING-2013
3. Bulk Insert
4. Bulk Slamming
Which of the following option is true?
Option 1 & 3
Aggregates pg 111
Facts
Dimensions
Primary Keys
20. Single value attributes during recording of a transaction are __________
Dimensions pg 115
Facts
Aggregates
Constraints
Page 6
MIDTERM-
SPRING-2013
Selection Anomalies
Update Anomalies 43
SQL Anomalies
Data Warehouse Anomalies
5. 3NF removes even more data redundancy than 2NF but it is at the cost of
Multi-level aggregates 74
Record level access
Data level access
Row level access
9. The cube clause which is a part of SQL: 1999 is
Redundancy pg 98
Normalization
Anomalies
11. Non recording facts have a disadvantage that it has
Page 7
MIDTERM-
SPRING-2013
Loading 139
Transformation
Quality
Indexing
13. Syntactically Dirty Data class of anomalies includes which of the following:
1. Lexical Errors
2. Integrity Constraints Violation
3. Business Rule Contradiction
4. Irregularities
5. Duplication
None of these
Operational
Page 8
MIDTERM-
SPRING-2013
Internal
External pg 21
18. Source systems for extraction are typically OLTP systems. Extraction is a very complex
task due to reasons:
1. Very complex and poorly documented source system.
2. Data has to extracted not once but many times
3. People extracting data have limited expertise
Which of the following option represents correct reason?
1 & 2 only pg 132
1 & 3 only
2 & 3 only
All 1, 2 and 3
19. ______________ is about taking/collecting data from different heterogeneous sources.
Data Warehouse pg 21
Data Mart
Data Mining
20. In ROLAP access to information is provided via relational database using _________
standard SQL.
ANSI pg 78
Microsoft
Oracle
SAP
CS614 Data Warehousing
1. A typical example of the crisis in credibility in the naturally evolving architecture is the
decision of CEO based on politics and personalities on receiving two different reports for
the same query. We say CEO is
Very Subjective and Non-Scientific pg 14
Very Objective and Non-Scientific
Very Subjective and Scientific
Very Objective and Scientific
2. Development of data warehouse is hard because data sources are
Page 9
MIDTERM-
SPRING-2013
Selection Anomalies
Update Anomalies pg 43
SQL Anomalies
Data Warehouse Anomalies
5. Normalization is the process of efficiently organizing data in a database by decomposing
/ splitting a relational table into ______ tables by projection.
Smaller pg 41
Larger
Combined
Joined
6. One major goal of horizontal splitting is
Splitting rows for exploiting parallelism pg 54
Splitting columns for exploiting parallelism
Splitting schema for exploiting parallelism
7. The most common use of range partitioning in data warehouse is on
Date pg 66
Most redundant column
Fact
Dimensions
8. ER Model can be simplified in -------- ways
One
Two pg 103
Three
Four
9. ______ is the lowest level of detail or the atomic level of data stored in the warehouse.
Cube
Grain pg 111
Virtual Cube
Aggregate
10. A company has implemented data warehouse for analytical purpose. Quantity sold is
stored as a fact. This quantity sold is
Page 10
MIDTERM-
SPRING-2013
None of these
13. Rearranging the grouping of source data, delivering it to the destination database, and
ensuring the quality of data are crucial to the process of loading the data warehouse. Data
____________ is vitally important to the overall health of a warehouse project.
1. Cleansing
2. Cleaning
3. Scrubbing
Which of the following options is true?
Option 1 only pg 158
Option 2 only
Option 1 & 2 only
Option 1, 2 & 3
14. Syntactically Dirty Data class of anomalies includes which of the following:
6. Lexical Errors
7. Integrity Constraints Violation
8. Business Rule Contradiction
9. Irregularities
10. Duplication
Option 1 and 4 pg 160
Option 2 and 3
Option 2, 3, and 5
Option 1, 4, and 5
15. It is called a ______ violation, if we have null values for attributes where NOT NULL
constraint exists.
Load
Transform
Constraint
Extraction
16. As consumers, human beings judge the quality of things during their life-time.
I Consciously
II Subconsciously
III Unconsciously
I An Abstraction
II A Representation
Page 11
MIDTERM-
SPRING-2013
None of I & II
18. __________queries deal with number of variables spanning across number of tables (i.e.
join operations) and looking at lots of historical data.
OLTP
DBMS
DSS pg 21
None of these
19. Collapsing tables can be done on the ___________ relationships
Many-to-Many
Both One-to-One and Many-to-Many pg 52
None of these
One-to-One
20. In data warehouse, a query results in retrieval of hundreds of records from very large
table. The ratio of number of records retrieved to total number of record present is high
and selectivity is
Low
High pg 22
Average
Can not be calculated
CS614 Data Warehousing
OLTP
Data Cleansing
ETL
Unpredictable 62
Predictable
Conventional
Unsurprising
Page 12
MIDTERM-
SPRING-2013
Classification pg 74
Amalgamation
Unification
Blending
Mobile pg 97
Permanent
Rigid
Strict
6. ______ is the lowest level of detail or the atomic level of data stored in the warehouse.
Cube
Grain pg 111
Virtual Cube
Aggregate
7. Extract, Transform, Load (ETL) process consist of steps which are ______________.
Dependent or Interrelated
OLTP pg 20
DSS
ATM
OLAP
Skill
Power
Wisdom
Knowledge pg 11
Page 13
MIDTERM-
SPRING-2013
10. 3NF removes even more data redundancy than 2NF but it is at the cost of
Complexity
Number of tables
Relations
Only Many-to-Many
Only One-to-Many
Both One-to-One and Many-to-Many pg 52
12. Transactional fact tables do not have records for events that do not occur. These are
called
Fact-less Facts
Null Facts
Empty Facts
13. Semantically "Dirty Data" class of anomalies includes which of the following:
I) Lexical Errors
II) Integrity Constraints Violation
IV) Irregularities
V) Duplication
Page 14
MIDTERM-
SPRING-2013
Any Direction pg 19
Two Direction
Partitions
Microsoft
Oracle
SAP
17. A company has implemented data warehouse for analytical purpose. Quantity sold is
stored as a fact. This quantity sold is
Associative Fact
Non-Associative Fact
18. Typically a data mart is much smaller to data warehouse and it is pretty easy to take its
______ as compare to data warehouse.
Backup
Cube pg 131
Load
Schema
19. "Change Data Capture" is one of the challenging technical issues in _____________
Data Loading
Data Transformation
Data Cleansing
Page 15
MIDTERM-
SPRING-2013
20. Within the data warehousing domain, data ________ is applied especially when several
databases are merged.
Extraction
Loading
Cleansing pg 168
Join
CS614 Data Warehousing
1. Taken jointly, the extract programs or naturally evolving systems formed a spider web,
also known as
Distributed Systems Architecture
2. Suppose the amount of data recorded in an organization is doubled every year. This
increase is
Linear
Quadratic
Logarithmic
Exponential 15
3. The most common use of range partitioning in data warehouse is on
Date pg 66
Fact
Dimensions
Classification pg 74
Amalgamation
Unification
Blending
Physical
Data Dependent
Transaction Dependent
6. ______ is the lowest level of detail or the atomic level of data stored in the warehouse.
Cube
Grain pg 111
Virtual Cube
Aggregate
7. It is called a ______ violation, if we have null values for attributes where NOT NULL
constraint exists.
Load
Transform
Constraint pg 161
Extraction
Functional
Dysfunctional pg 181
Purposeful
Serviceable
DSS
ATM
OLAP
10. It is observed that every year the amount of data recorded in an organization
Doubles pg 15
Triples
Quartiles
Page 17
MIDTERM-
SPRING-2013
Composing
Joining / Merging
Combining
Decomposing pg 41
12. 3NF removes even more data redundancy than 2NF but it is at the cost of
Number of tables
Relations
Redundant data is a performance liability at both query time and update time.
15. Source systems for extraction are typically OLTP systems. Extraction is a very complex
task due to reasons:
Page 18
MIDTERM-
SPRING-2013
2 & 3 only
All 1, 2 and 3
16. When tables are populated for the first time, it is a full data refresh. This may be called
as:
1. Block Insert
2. Block Slamming
3. Bulk Insert
4. Bulk Slamming
Which of the following option is true?
Option 1 & 3
Option 1 & 2
17. The TQM philosophy of management is __________. All members of a total quality
management organization strive to systematically manage the improvement of the
organization through the ongoing participation of all employees in problem solving efforts
across functional and hierarchical boundaries.
Customer-Oriented pg 182
Employee-Oriented
Employer-Oriented
Organization-Oriented
18. Identify the correct option. One Petabyte (PB) equals to ____
Aggregates pg 111
Facts
Dimensions
Primary Keys
Page 19
MIDTERM-
SPRING-2013
Dimensions pg 115
Facts
Aggregates
Constraints
1. Suppose the amount of data recorded in an organization is doubled every year. This
increase is
Linear
Quadratic
Logarithmic
Exponential pg 15
2. _________ is one class of decision support environment.
OLAP pg 30
OLTP
Data Cleansing
ETL
3. De-Normalization normally speeds up
Data Retrieval pg 51
Data Modification
Development Cycle
Data Replication
4. In horizontal splitting, we split a relation into multiple tables on the basis of
Common Column Values
Common Row Values
Different Index Values
Value resulted by ad-hoc query
5. The most common use of range partitioning in data warehouse is on
Date pg 66
Most redundant column
Fact
Dimensions
6. OLAP is a (n) ___________ of application.
Blending
Characterization pg 74
Amalgamation
Unification
7. One of the OLAP characteristics is Multi-dimensional, which is ________ for OLAP.
Essential 76
Optional
Discretionary
Not Obligatory
Page 20
MIDTERM-
SPRING-2013
I An Abstraction pg 180
II A Representation
Which of the following option is true?
I Only
II Only
Both I & II
None of I & II
11. _______ is an application of information and data.
Skill
Knowledge pg 11
Intelligence
Power
12. In data warehouse, a query results in retrieval of hundreds of records from very large
table. The ratio of number of records retrieved to total number of records present is high
and selectivity is:
Low
High 22
Average
Non computable
13. "The environment is smart enough to develop or compute higher level aggregates
using lower level or more detailed aggregates". Which of the following approach is
described by the above statement?
Aggregate awareness pg 87
Cube partitioning
Indexing
MOLAP cube aggregation
14. The goal of star schema design is to simplify ________
Page 21
MIDTERM-
SPRING-2013
Aggregates pg 111
Facts
Dimensions
Primary Keys
18. Single value attributes during recording of a transaction are __________
Dimensions pg 115
Facts
Aggregates
Constraints
19. In full extraction, data is extracted completely from the source system. Therefore there
is no need to keep track of changes to the ________
Extraction
Loading
Cleansing pg 168
Join
Page 22