CS614 Merged
CS614 Merged
90 bytes……..confirm
One to many…..confirm
Star Schema…..confirm
Transitive dependency….confirm
Date…..confirm
Dimension…..confirm
Reduce or decrease……confirm
Parallelism…..confirm
11. In _________ index, the ith bit is set to “I” if the ith of the base table
has the value for the indexed column
Bitmap indexes……confirm
Speed-up…..confirm
Scale-up….confirm
FROM…..confirm
Independent…..confirm
16. As the number of processors increase the speedup should also increase.
Thus we should have linear speedup. Which of the following is NOT one
of the barriers to achieve this linear.
No interface…..confirm
17. The online high performance transaction process was evolved in _____.
1975….confirm
Total view….confirm
Interactive system which enable decision makers to use databases and solve
un-structured
Sales amount
HOLISTIC……confirm
Associative……confirm
24. _________ is a logical design technique that seeks to present the data in
a standard, instinctive structure that supports high-performance and
ease of understanding.
DM…..confirm
ER modeling…..confirm
Aggregation…..confirm
Annual salary…..confirm
Conversation or enrichment…confirm
0304-1659294 1
AL-JUNAID TECH INSTIUTE
• Flat files are not required
• Limited query interface is required for data extraction page 152
• No incremental on-line I/O required for log tape
• Extraction of changed data occurs immediately
8. The most common range partitioning is on
• Color
• Date page 66
• PhoneNo
• Name
9. A relation is said to be in first normal form(1NF), if it does not contain ________
• Single value column
• Multi-valued column page 43
• Derived column
• Composite column
10. In a fully normalized database, too many ____________are required
• Values
• Joins page 49
• Queries
• Conditions
0304-1659294 2
AL-JUNAID TECH INSTIUTE
• Wisdom page 11
• Education
• Power
• Information
15. 1In the data transformation, ---------- is the rearrangement and simplification of individual
• Aggregation
• Enrichment page 136
• Splitting joining
• Conversion
16. Grain of a fact table means :
• The meaning of one fact table row page 109
• The meaning of one dimensional table row
• Summary of aggregates in all fact tables
• Summary of aggregates in all dimension tables
17. Normalization ----------------- :
• Reduces redundancy page 41
• Increases redundancy
• Reduces joins
• Reduces tables
0304-1659294 3
AL-JUNAID TECH INSTIUTE
• ROLAP
21. Data Warehouse provides the best support for analysis while OLAP carries out the --------
-----
task:
• Mandatory
• Whole
• Analysis page 69
• Prediction
22. ----------------- involves splitting a table by columns so that a group of columns is placed
into the new table and the remaining columns are placed in another new table:
• Vertical splitting page 56
• Horizontal splitting
• Adding redundant column
• None of the given option
23. OLAP implementations are highly/completely ------------------ :
• Normalized
• Demoralized page 69
• Predictive
• Additive
24. If each cell of Relation R contains a single value ( no repeating values) then it is
confirmed that :
• Relation R is in 1st Normal Form page 43
• Relation R is in 2nd Normal Form
• Relation R is in 3rd Normal Form
• Relation R is in 3rd Normal Form but not in 2nd Normal Form
25. Which kind of relationships is captured by Fact less fact table:
• Many- to- Many page 121
• One-to-many
• One-to-one
• None of the given option
26. Which of the following is not an example of dimension:
• Product
• Date
• Region
• Sales volume page 78
27. Which people criticize Dimensional Modeling (DM) as being a data mart oriented
approach?
0304-1659294 4
AL-JUNAID TECH INSTIUTE
• Those that consider ER models as Data marts
• Those that consider Business processes as Data marts page 110
• Those that consider Data marts as Data warehouse
• Those that consider dimensional model
• Those that consider dimensional modeling as de-normalization approach
0304-1659294 5
AL-JUNAID TECH INSTIUTE
• Expression indexing
• None of the given option
34. Taken jointly, the extract programs or naturally evolving systems formed a spider
web, also known as
• Distributed Systems Architecture
• Legacy System Architecture page 14
• Online System Architecture
• Intranet System Architecture
35. The data has to be checked , cleaned and transformed into a --------------- format to
allow easy and fast access
• Unified page 20
• Predicated
• Qualified
• Proactive
36. Suppose in a system A, the values of “PhoneNo” attribute were stored in “countrycode-
phone-extension” format, however after transformation into data warehouse the
separate columns were used for “countrycode”,”phone” and “extension”. The above
scenario is an example of :
• One-to-one scalar transformation
• One-to-many element transformation page 144+conceptual
• Many-to-one element transformation
• Many-to-many element transformation
37. In decision support system ease of use in achieved by:
• Normalization
• Denormalization page no 49
• Drill up
• Drill down
38. Which of the following is one of the methods to simplify an ER model?
• Normalization
• Denormalization page no 103
• HOLAP
• Hybrid schema
39. In ETL process data transformation includes ----------------
• Data cleansing page 129
• Data aggregation
• Behavior checking
0304-1659294 6
AL-JUNAID TECH INSTIUTE
• Pattern recognition
40. Non-uniform use of abbreviations, units, and values refers to:
• Syntactically dirty data page 160
• Semantically dirty data
• Coverage anomaly
• Extraction issue
41. Suppose the size of the attribute “Computerized National Card (CNIC) no. is changed
in NADRA database. This transformation refers to:
• Format revision page 153
• Field splitting
• Field decoding
• Calculation of derived value
42. The divide and conquer cube partitioning approach helps alleviate the ------------
limitations of MOLAP implementation:
• Flexibility
• Maintainability
• Security
• Scalability page 85
43. identify the TRUE statement:
• DM is inherently dimensional in nature
• DM comprises of a single central fact table
• DM comprises of a set of dimensional tables
• All of the given option Page 103
44. ------------- can be used when some columns are rarely accessed rather than other
columns or when the table has wide rows or header or both:
• Horizontal splitting
• Pre-joining
• Vertical splitting page 56
• Derived attributes
45. Which of the following is an example of derived attributes?
• Age page 61
• Size
• Color
• Length
46. The online high performance transaction processing was evolved in --------------:
• 1980
• 1975 page 12
• 1977
0304-1659294 7
AL-JUNAID TECH INSTIUTE
• 1965
47. Cube is a logical entity containing values of a certain aggregation level at an
intersection of a combination of -------------------- :
• Facts
• Dimension page 88
• Summary tables
• Primary and foreign key
48. Which of the following is TRUE regarding Entity relationship modeling?
• It does not really model business, but models the micro relationships
among data elements.
• ER modeling does not have “business rules,” it has “data rules
• ER modeling helps retrieval of individual records having certain critical
identifiers.
• All of the given option page 102
49. ------Facilitates a mobile computing paradiagramn:
• HOLAP
• DOLAP page78
• ROLAP
• MOLAP
50. The main reason(s )for the increase in cube size may be:
• Increase in the number of dimensions
• Increase in the cardinality of the dimensions
• Increase in the amount of detail data
• All of the given options page 87
51. Suppose the amount of data recorded in an organization is doubled in year. This increase
in ------
• Linear
• Quadratic
• Exponential page 15
• Logarithmic
52. The data in the data warehouse is ----------- :
• Volatile
• Non-volatile page 69
• Static
• Non-structured
0304-1659294 8
AL-JUNAID TECH INSTIUTE
53. --------------- models the macro relationships among data elements with an overall
deterministic strategy:
• Dimensional model page102
• Entity relationship model
• Object oriented model
• Structured model
54. ----------------- technique requires a separate column to specify the time and date when
the last modification was occurred:
• Checkmarks
• Timestamps page 150
• Just-in-Time
• Real Time extraction
55. Which of the de-normalization technique squeezes master table into detail?
• Pre-joining page 58
• Horizontal splitting
• Vertical splitting
• Adding redundant column
56. De-normalization can help:
• Minimize joins
• Minimize foreign keys
• Resolve aggregates
• All of the given options page 51
57. The domain of the “gender” field in some database may be (‘F’,’M’) or as (“Female”,
“Male”) or even as (1, 0). This is:
• Primary key problem
• Non primary key problem page 163
• Normalization problem
• All of the given option
58. Increasing level of normalization --- number of
------------- tables:
• Increases page 51
• Decreases
• Does not effect
• None of the given option
59. Which of the following is not a Data Quality Validation Technique:
• Referential integrity
0304-1659294 9
AL-JUNAID TECH INSTIUTE
• Using Data Quality Rules
• Data Histograming
• Indexes page 189
60. This technique can be used when column from one table is frequently accessed in a
large scale join in conjunction with a column from another table:
• Horizontal splitting
• Pre-joining
• Adding redundant column page 58
• Derived attributes
61. Data cleansing requires involvement of domain expert because:
• Domain expert has deep knowledge of data aggregation
• Change Data captures requires involvement of domain expert
• Domain knowledge is required to correct anomalies page 158
• Domain expert has deep knowledge of data summarization
62. Relational databases allow you to navigate the data in ------------- that is appropriate
using the primary , foreign key structure with in the data model:
• Only One Direction
• Any Direction page 19
• Two Direction
• None of these
63. History is excellent predicator of the ------------:
• Past
• Present
• Future page 15
• History
64. De- normalization is the process of selectively transforming normalized relations
into un-normalized physical record specifications, with the aim to:
0304-1659294 10
AL-JUNAID TECH INSTIUTE
66. Suppose in system A, the possible values of “Gender” attribute were “Male”& “Female”,
however in data warehouse ,the values stored were “M” for male and “F” for female.
This above scenario is an example of :
• One-to-one scalar transformation page 144
• One-to-many element transformation
• Many-to-one element transformation
• Many-to-many element transformation
67. Enrichment is one of the basic tasks in data ---------------- :
• Extraction
• Transformation page 138
• Loading
• Summarization
68. Which of the following is not a technique of De-normalization?
• Pre-joining
• Splitting tables
• Adding redundant columns
• ER modeling page 52
69. Which of the following is an example of Additive Facts?
• Sales Amount page 119
• Average
• Discount
• Ratios
70. Robotic libraries are needed for -------------------------:
• Cubes
• Data marts
• Data warehouse page 131
• Aggregates
71. Normally ROLAP is implemented using ----------------
• Star schema page 87
• Hybrid schema
• Pre-defined aggregate
• All of the given options
72. The relation R will be in 2nd Normal Form if
• It is in 1NF and each cell contains single value
• It is in 1NF and each non key attribute is dependent upon entire primary key
page 44
• It is in 1NF and non key attribute is dependent upon a single column of
composite primary key
0304-1659294 11
AL-JUNAID TECH INSTIUTE
• It is in 1NF and Primary key is composite
73. In --
----- ested loop join of quadratic time complexity does not hurt the
---- performance
0304-1659294 12
AL-JUNAID TECH INSTIUTE
• Ease-of-use
• All of the given options page 62
79. If actual data structure does not conform to documented formats then it is called:
• Syntactically dirty data page 160
• Semantically dirty data
• Coverage anomaly
• Extraction issue
80. “Header size is reduced, allowing more rows per back , thus reducing I/O” .The above
statement is TRUE with respect to:
• Vertical splitting page 56
• Horizontal splitting
• Adding redundant column
• None of the given options
---------------- Breaks a table into multiple tables based upon common column values
page
• Horizontal splitting 54
• Vertical splitting
• Adding redundant column
• None of the given option
81. Which of the following is NOT an example of derived attribute?
• Age
• CGPA
• Area of rectangle
• Height (Conceptual)
83. If a table is expected to have six columns but some or all of the records do not have six
columns then it is example of:
• Syntactically dirty data page 160
• Semantically dirty data
• Coverage anomaly
• Extraction issue
0304-1659294 13
AL-JUNAID TECH INSTIUTE
84. MDX by Microsoft is an example of ------------------------:
• HOLAP
• DOLAP
• ROLAP
• None of the given options page 79
85. The growth of master files and magnetic tapes exploded around the mid- ---------------
• 1950s
• 1960s page 12
• 1970s
• 1980s
86. If one or more records in a relational table do not satisfy one or more integrity constraint
, then the data:
• Is syntactically dirty
• Is semantically dirty page 160
• Has Coverage anomaly
• Has extraction issue
87. OLAP is:
• Analytical processing page 69
• Transaction processing
• Additive processing
• Active processing
88. One of the possible issues faced by web scrapping is that:
• Web pages may contain junk data page 141
• Web pages do not contain multiple facts
• Web pages do not contain multiple dimensions
• Web pages does not support transformation
0304-1659294 14
AL-JUNAID TECH INSTIUTE
• Maintaining historic data
91. In case of multiple sources for the same data element , we need to prioritize the source
systems per element based, the process is called:
• Ranking page 143
• Prioritization
• Element selection
• Measurement selection
92. One feature of Change Data Capture (CDC) is that:
• It pre-calculates changed aggregates
• It loads the transformed data in real time
• It only processes the data has been changed
• It can automate the transformation of extracted data page 149
93. In ------------------ SQL generation in vastly simplified for front-end tools when the
data is highly structure:
• MOLAP
• Star Schema page 107
• Hybrid schema
• Object oriented schema
94. Dirty data means:
• Data cannot be aggregated
• Data contains non-additive facts
• Data does not fulfill dimensional modeling rules
• Data does not conform to proper domain definitions page 158
95. In Context of Change Data Capture (CDC) sometimes a ------------- object can be used
to store recently modified data:
• Buffer table
• Change table page 149
• Checkmark table
• Change control table
96. “Sometimes during data collection complete entities are missed”. This statement is an
example of :
• Missing tuple page 161
• Missing attribute
• Missing aggregates
0304-1659294 15
AL-JUNAID TECH INSTIUTE
• Semantically dirty data
97. Table collapsing technique is applied in case of:
• One-by-one relation or many-to –many relation page 52
• One-to-many relation
• Many-to-many relation
• None of the given option
98. Which of the following is an example of dimension?
• Product
• Region
• Date
• All of the given option page 78
99. Data warehouse stores -------------------:
• Operational data
• Historical data page 24
• Meta data
• Log files data
100. The business process covered by ER diagrams:
• Do not co-exist in time and space page 109
• Co-exist in time and space
• Do not physically exist in real time context
• None of the given options
101. The main goal of normalization is to eliminate -----------:
• Data redundancy page 41
• Data sharing
• Data security
• Data consistency
102. Serious ---- involves decomposing and resembling the data:
• Data cleansing page 168
• Data transformation
• Data loading
• Data extraction
103. In the data warehouse environment the data is ------------
• Subject- oriented page 69
• Time- oriented
• Both subject and time oriented
0304-1659294 16
AL-JUNAID TECH INSTIUTE
• Neither time-oriented nor subject- oriented
104. For large record spaces and large number of records , the run time of the
clustering algorithms:
• Prohibitive page 164
• Static
• Exponential
• Numerical
105. 1------------- can result in costly errors, such as , False frequency distributions and
incorrect aggregates due to double counting:
• Data duplication page 165
• Data reduction
• Data anomaly
• Data transformation
106. The degree to which values are present in the attributes that require them is
known as -
---------------------:
• Completeness page 185
• Uniqueness
• Accessibility
• Consistency
107. Time complexity of Key Creation process in basic Sorted Neighborhood (BSN)
Method is
----------------------:
• O(n log n)
• O(log n)
• O(n) page 171
• O(2n)
108. Which of the following is an example of slowly changing dimensions?
• Inheritance page 124
• Aggregation
• Association
• Asset disposal
109. The ------------ operator proves useful in more complex metrices
applicable to the dimensions and accessibility:
• Max page 188
• Min
• Max and Min
• None of the given
0304-1659294 17
AL-JUNAID TECH INSTIUTE
110. In OLAP , the typical write operation is ------------- :
• Bulk insertion page 75
• Single insertion
• Sequential insertion
• No insertion
111. 1The issue(s) of “ Adding redundant column” includes(s):
• Increase in table size
• Maintenance
• Loss of information
• All of the given option page 65
113. The hardware (CPU) utilization in data warehouse environment is full or ----------
-:
• Fixed
• Partial
• Not at all page 24
• Slow
114. 1Time variant is a characteristics of data warehouse which means:
• Data loaded in data warehouse will be time stamped page 20
• Data can be loaded in data warehouse anytime
• Data can be loaded in data warehouse only at a particular time
• Data cannot be loaded in data warehouse with respect to time
0304-1659294 18
AL-JUNAID TECH INSTIUTE
• Emp_ID
• Emp_Name
• Emp_skills page 43(conceptual)
• Emp_Designation
117. The application of data and information leads to -------------
• Intelligence
• Experience
• Knowledge page 11
• Power
118. 1--------------- segregate data into separate partitions so that queries do not need to
examine all data in a table when WHERE clause filters specify only a subset of the
partitions.
• Pre-joining technique
• Collapsing table technique
• Horizontal splitting technique page 56
• Vertical splitting technique
119. -------------should not be present in a relation, so that it would be in second
normal form (2NF).
• Partial dependency page 44 (conceptual)
• Full functional dependency
• Multivalued dependency
• Transitive dependency
120. 1Records referring to the same entity are represented in different formulas in
the different data sets or are represented erroneously. Thus duplicate records will
appear in the merged database. This problem is known as------------.
• Merge/purge problem page 168
• Duplication problem
• Redundant duplication problem
• Redundant problem
121. The data perspective in OLTP system is operational, while that in data warehouse
is:
• Fully normalized
• Fully de-normalized
• Fully summarized
• Historical and detailed page 30
0304-1659294 19
AL-JUNAID TECH INSTIUTE
122. 1Simple scalar transformation is a--------------mapping from one set of values
to another set of values using straightforward rules.
• One-to-one page 144
• One-to-many
• Many-to-many
• Many-to-one
123. ---------------can be created in operational systems to keep tracks of
recently updated records.
• Triggers page 150
• Timestamps
• Partitioning
• ELT
124. 1Development of data warehouse is hard because data sources are usually--------
• Structured and homogeneous
• Unstructured and heterogeneous page 31
• Structured and heterogeneous
• Unstructured and homogeneous
125. In a decision support environment, the decision maker is interested in -------------.
• Only limited organizational data
• Big picture of organizational data page 21
• Only sale related data
• Only customer related data
126. Information can answer question like “what”, “who” and “when”
while knowledge can answer question like----------------.
• Why
• Where
• Which
• How page 11
127. OLTP implementations are fully-------------.
• Normalized page 69
• Denormalized
• Predictive
• Additive
128. Which logical data extraction has significant performance impacts on the data
warehouse server?
• Incremental Extraction page 133
• Online Extraction
0304-1659294 20
AL-JUNAID TECH INSTIUTE
• Offline Extraction
• Legacy Vs OLTP
129. 1Consider the following Student table and identify the column which causes
that the table is not in first normal form(1NF).
Student(Std_ID, Std_Name ,Std_CGPA ,Std_Hobbies)
• Std_ID
• Std_Name
• Std_CGPA
• Std_Hobbies page 43(Conceptual)
130. Analytical processing uses ---------------
• Multi-level aggregates page 74
• Record level aggregates
• Table level aggregates
• All of the given options
131. Which is not a class of anomalies in following?
• Dirty anomalies page 160
• Syntactically dirty data
• Semantically dirty data
• Coverage anomalies
0304-1659294 21
AL-JUNAID TECH INSTIUTE
• Flat tables
• Data marts
135. 1In OLTP environment the selectivity is ------------ and ---------- in data
warehouse environment.
• High, Low page 22
• Low, High
• High, Fixed
• Fixed, Low
136. Which is not a/an characteristics of data quality?
• Reliability page 186
• Uniqueness
• Accessibility
• Consistency
137. 1If a product meets formally defined “requirement specifications”, yet fails to be
a quality product from the customer’s perspective , this means the requirements were -----
------.
• Defective page 180
• Unclear
• Unrefined
• Undefined
0304-1659294 22
AL-JUNAID TECH INSTIUTE
• Reduces tables
141. 1The end user of data ware house are---------------.
• Programmers
• Database developers
• Data entry operator
• Business executives page 18 + 19
0304-1659294 23
AL-JUNAID TECH INSTIUTE
• Volatile page 69
148. Which is not a/an step of data cleansing procedure?
• Aggregation page 168
• Elementizing
• Standardizing
• Verifying
149. 1Instance matching between different sources is then achieved by a standard ------
-------
on identifying attribute(s), if you are very, very, very lucky.
• Equi-join page 169
• Inner join
• Outer join
• Fuller join
150. Ad-hoc access of data warehouse means:
• That have predefined database access pattern
• That does not have predefined database access pattern page 18
• That could be accessed by any user
• That could not be accessed by any user
151. In OLTP environment, the size of tables is relatively-------------------
• Large
• Fixed
• Moderate
• Small page 22
152. 1------------- is a/an measure of how current or up to date the data is:
• Timeliness page 185
• Completeness
• Accessibility
• Consistency
153. 1The process of converting entity relationship model in to dimensional model of -
---------
steps:
• Two
• Three
• Four page 109
0304-1659294 24
AL-JUNAID TECH INSTIUTE
• Five
154. A ------------- Is defined by a group of records that have similar characteristics
(“behavior”) for p% of the fields in the data set, where p is a user- defined
value(usually above 90).
• Pattern page 164
• Cluster
• Entity
• Attribute
155. ------------------ is known as state of being only one of its kind or being without an
equal or
parallel.
• Completeness
• Uniqueness page 185
• Accessibility
• Consistency
156. Which of the following is not an example of fact?
• Account no page 74
• Sales quantity
• Per unit sales amount
• Sales amount
157. 1------------------is the degree to which data is accurately reflects the real world
object that the data represents.
• Intrinsic data quality page 181
• Realistic data quality
• Strong data quality
• Weak data quality
158. 1Which one among the following data warehouse stores data containing long
period?
• Telecommunication data warehouse
• Financial data warehouse
• Human resource data warehouse
• Insurance data warehouse page 36
0304-1659294 25
AL-JUNAID TECH INSTIUTE
______ dimension is simply a structure that provides a convenient place to
store the ______ attributes.
• Junk
• Time
• Parallel
• None of these
160. 1Data Warehouse is about taking / collecting data from different ________
sources.
• Harmonized
• Identical
• Homogeneous NOT CONFIRM
• Heterogeneous
161. 1Taken jointly, the extract programs or naturally evolving systems formed a
spider web, also known as
• Distributed Systems Architecture
• Legacy Systems Architecture
• Online Systems Architecture
• Intranet Systems Architecture
162. It is observed that every year the amount of data recorded in an organization
• Doubles
• Triples
• Quartiles
163. The users of data warehouse are knowledge workers in other words they
are _________ in the organization.
• DWH Analyst
• Decision maker
• Database Administrator
• Manager
164. Node of a B-Tree is stored in memory block and traversing a B-Tree involves
______ page faults.
• O (n lg n)
• O (log n) { O(log n) it’s the real answer}
• O (n)
• O (n2)
165. In _________ system, the contents change with time.
• OLTP
• ATM
• DSS
• OLAP
166. The growth of master files and magnetic tapes exploded around the mid-
_______.
• 1950s.
• 1960s.
0304-1659294 26
AL-JUNAID TECH INSTIUTE
• 1970s.
• 1980s.
167. Relational databases allow you to navigate the data in ____________ that is
appropriate using the primary, foreign key structure within the data model
• Only One Direction
• Any Direction
• Two Direction
• None of these
168. 1Naturally Evolving architecture occurred when an organization had a _______
approach to handling the whole process of hardware and software architecture.
• Relaxed
• Good
• Not Relaxed
• None
169. ________ gives total view of an organization
• OLAP
• OLTP
• Data Warehouse
• Database
170. Suppose the amount of data recorded in an organization is doubled every year.
This increase is
__________ .
• Linear
• Quadratic
• Exponential
• Logarithmic
171. ________is the application and experience to get common goals.
Wisdom
Power
Information
Education
172. Normally ROLAP is implemented using___________.
Star Schema
Pre-defined aggregates
Hybrid Schema
All of the given options
173. Identify data warehouse query from the following.
Number of students studying CS614 course
Factors which can effect students’ performance
List of students belong to Lahore city
Total number of students that have 3.5 CGPA
0304-1659294 27
AL-JUNAID TECH INSTIUTE
174. In _________ SQL generation is vastly simplified for front-end tools when
the data is highly structured.
Hybrid Schema
Star Schema
MOLAP
Object oriented schema
175. __________ models the Marco relationships among data elements with an
overall deterministic strategy.
Dimensional model
Structured model
Object oriented model
Entity relationship model
176. The _________ task is typically performed after most other transformation
and cleaning steps have taken place.
Data extraction
Data duplicate elimination
Data transformation
Data loading
177. __________ is known as state of being only one of its kind or being without
an equal or parallel.
Completeness
Consistency
Accessibility
Uniqueness
178. Relational modeling techniques are used to develop__________ systems.
OLTP
MOLAP
OLAP
ROLAP
179. Which of the following is not CUBE operation?
ANSI SQL
Drill Down
Pivoting
Roll UP
180. The hybrid OLAP(HOLAP) solution is a mix of ___________ architecture
that supports queries against summary and transaction in an integrated.
ROLAP
0304-1659294 28
AL-JUNAID TECH INSTIUTE
OLTP
MOLAP and ROLAP
MOLAP
181. The response time of OLTP queries are in milliseconds to seconds. While
that of data warehouse queries are in:
Days of weeks
Hours to days
Milliseconds to microseconds
Minutes to hours
182. __________ allows download of “cube” structures to a desktop platform
without the need for shared relational or cube server.
ROLAP
HOLAP
DOLAP
MOLAP
183. In case of multiple sources for the same data element. We need to prioritize
the source systems per element bases, the process is called:
Ranking
Measurement selection
Element selection
Prioritization
184. OLAP is:
Implementation technique
Framework
None of the given options
Physical database design
185. Which of the following is not a task of Data Transformation?
Summarization
Enrichment
Conversion
Full data refresh
186. Simple scalar transformation is a ____________ mapping from one set of
values to another set of values using straightforward rules.
Many-to-one
Many-to-many
One-to-one
One-to-many
0304-1659294 29
AL-JUNAID TECH INSTIUTE
187. Which of the following is not a data quality validation technique?
Indexes
Using data quality rules
Referential integrity
Data Histograming
188. __________ are created out from the data warehouse to service different
needs of departments in the organization.
Databases
Data marts
MIS
Tables
189. An OLTP system is always good at___________.
Maintaining historical data
Keeping static data
Tracking past data
Evolving data
190. _________ incorporates the concept of product quality, process control,
quality improvement.
Intrinsic data quality management
Realistic Data Quality Management
Total Quality Management
Strong Data Quality Management
191. “Sometimes during data collection complete entities are missed”. This
statement is an example of:
Missing aggregate
Missing tuple
Semantically dirty data
Missing attribute
192. The last step of Software Development Life Cycle (SDLC) is implementation.
While that of data warehouse is:
Integration
Understanding requirements
Testing
Analysis
193. In which class of aggregates Max function can be places?
Algebraic
Associative
0304-1659294 30
AL-JUNAID TECH INSTIUTE
Holistic
Distributive
194. Telecommunication data warehouse is dominated by the__________ volume
of data generated at the call level.
Incomplete
Semi-incomplete
Sheer
Partial
195. If a table is expected to have six columns but some or all of the records do
not have six columns then it is example of:
Semantically dirty data
Syntactically dirty data
Extraction issue
Coverage anomaly
196. In the data warehouse environment the date is__________.
Both subject and time oriented
Neither time-oriented nor subject-oriented
Subject-oriented
time-oriented
197. dirty data means that:
Data control be aggregated
Data contain non-additive facts
Data does not fulfill dimensional modeling rules
Data does not conform to proper domain definitions
198. One of the main reason for the failure of DWH deployments is________.
Data anomaly
Data integrity
Data quality
Data duplication
199. The data in the data warehouse is__________.
Non-structured
Volatile
Static
Non-volatile
200. Which of the following is NOT an example of a typical grain?
Daily aggregates
Monthly aggregates
0304-1659294 31
AL-JUNAID TECH INSTIUTE
Individual transaction
Normalized attribute
201. Data warehouse stores ___
Operational data
Historical data
Meta data
Log files data
202. The ___ dimension represents data correctness
Free-of-error
Completeness
Consistency
Correctness
203. Which of the following is not a Data Quality Validation Technique?
Referential Integrity
Using Data Quality Rules
Data Histograming
Indexes
204. Which of the following is an example of Non-Additive Facts?
Quantity sold
Total Sale in Rs.
Discount Percentage
Count of orders in a store
205. Which is the most complex type of transformation in the following?
Many-to-many element transformation
One-to-one scalar transformation
One-to-many element transformation
All of the given
206. Serious ___ involves decomposing and reassembling the data
Data cleansing
Data transformation
Data loading
Data Extraction
207. ___ is the degree of utility and value the data has to support the enterprise
processes that enable accomplishing enterprise objectives
Intrinsic Data Quality
Realistic Data Quality
Strong Data Quality
Weak Data Quality
208. In a decision support system ease of use is achieved by:
Normalization
Denormalization
Drill up
0304-1659294 32
AL-JUNAID TECH INSTIUTE
Drill down
209. Assume a company with a multi-million row customer table i.e. n rows.
Checking for Referential Integrity (RI), using a smart technique with some kind of tree
data structure would require ___ time
O(log n)
O(n)
O(1)
None
210. Which of the following is NOT an example of a typical grain?
Individual Transactions
Daily aggregates
Monthly aggregates
Normalized attributes
211. Most DWH implementations today do not use ___ enforced by the database, but
as TQM methods improve overall data quality and database optimizers
Consistency Integrity
Referential Integrity
Attribute domain
Using Data Quality Rules
212. Suppose in system A, the possible values of “Gender” attribute were “Male” &
“Female”, however in data warehouse, the values stored were “M” for male and “F” for
female. The above scenario is an example of:
One-to-one scalar transformation
One-to-many element transformation
Many-to-one element transformation
Many-to-many element transformation
213. Development of data warehouse is hard because data sources are usually ___
Structured and homogeneous
Unstructured and heterogeneous
Structured and heterogeneous
Unstructured and homogeneous
214. A/an ___ is a collection of random transactional codes, flags and/or text attributes
that are unrelated to any particular dimension
Junk dimension
Slowly changing dimension
Multi-valued dimension
Simple dimensions
215. ROLAP provides access to information via a relational database using
ANSI standard SQL
Proprietary file format
Comma Separated Values
All of the given
0304-1659294 33
AL-JUNAID TECH INSTIUTE
216. The typical availability of OLTP system is 24/7, while that of data warehouse is
___
6/12
7/12
1/24
Twice a week
217. In ___ nested-loop join of quadratic time complexity does not hurt the
performance
Typical OLTP environments
Data warehouse
DSS
OLAP
218. If actual data structure does not conform to documented formats then it is called:
Syntactically dirty data
Semantically dirty data
Coverage anomaly
Extraction issue
219. Which of the following is not a CUBE operation?
ANSI SQL
Roll UP
Drill down
pivoting
220. The data has to be checked, cleansed and transformed into a ___ format to allow
easy and fast access
unified
predicted
qualified
proactive
221. Which is not a/an characteristics of data quality?
Reliability
Uniqueness
Accessibility
Consistency
222. The extent to which data is in appropriate languages, symbols, and units, and the
definitions are clear is known as ___
Interpretability
Uniqueness
Accessibility
Consistency
223. In case of multiple sources for the same data element, we need to prioritize the
source systems per element bases, the process is called
Ranking
Prioritization
0304-1659294 34
AL-JUNAID TECH INSTIUTE
Element Selection
Measurement Selection
224. In OLTP environments, the size of tables is relatively ___
Large
Fixed
Moderate
Small
225. Change Data Capture (CDC) can be challenging task because
Aggregates don’t change in real time
Transformation of extracted data is difficult
Identifying the recently modified data may be difficult
Source systems may not support extraction of changed aggregates
226. ___ is the extent to which data is regarded as true and credible
Believability
Completeness
Accessibility
Consistency
227. The relation R will be in 2nd Normal Form if
It is in 1NF and each cell contains single value
It is in 1NF and each non key attribute is dependent upon entire primary key
It is in 1NF and each non key attribute is dependent upon a single column of composite
primary key
It is in 1NF and Primary key is composite
228. ___ is the degree to which data accurately reflects the real-world object that the
data represents
Intrinsic Data Quality
Realistic Data Quality
Strong Data Quality
Weak Data Quality
229. Web scrapping is a process of applying ___ techniques to the web
Screen scrapping
Data scrapping
Text scrapping
Meta scrapping
230. In which class of aggregates AVERAGE function can be placed?
Algebraic
Distributive
Associative
Holistic
231. Which of the following is not an “Orr’s Law of Data Quality”?
“Data that is not used cannot be correct!”
“Data quality is a function of its use, not its collection”
“Data will be no better than its most stringent use!”
0304-1659294 35
AL-JUNAID TECH INSTIUTE
“Data duplication can be harmful for the organization!”
232. The ___ operator proves useful in more complex metrics applicable to the
dimensions of timeliness and accessibility
Max
Min
Min or Max
None
233. Which is not a/an Data Quality Validation Technique?
Consistency Integrity
Referential Integrity
Attribute Domain
Using Data Quality Rules
234. Assume a company with a multi-million row customer table i.e. n rows.
Checking for Referential Integrity (RI) using a naïve approach would take ___ time.
O(n)
O(1)
O(log n)
None
235. ___ breaks a table into multiple tables based upon common column values
Horizontal splitting
Vertical splitting
Both
None of these
236. Companies collect and record their own operational data, but at the same time
they also use reference data obtained from ___ sources such as codes, prices etc.
Operational
None
Internal
External
237. Ad-hoc access means to run such queries which are known already
True
False
238. Relational databases allow you to navigate the data in ____ that is appropriate
using the primary, foreign key structure within the data model.
Only One direction
Any direction
Two direction
None
239. DSS queries do not involve a primary key
True
False
0304-1659294 36
AL-JUNAID TECH INSTIUTE
240. The need to synchronize data upon update is called
Data Manipulation
Data Replication
Data Coherency
Data imitation
241. Taken jointly, the extract programs or naturally evolving systems formed a spider
web, also known as
Distributed Systems Architecture
Legacy Systems Architecture
Online Systems Architecture
Intranet Systems Architecture
242. Node of a B-Tree is stored in memory block and traversing a B-tree involves ___
page faults
O(n)
O(n^2)
O(n log n)
O(log n)
243. Which statement is true for De-Normalization?
Redundant data is a performance liability at query time, but is a performance benefit at
update time
Redundant data is a performance liability at both query time and update time
Redundant data is a performance benefit at both query time and update time
Redundant data is a performance benefit at query time, but is a performance liability at
update time
244. De-normalization normally speeds up
Data Retrieval
Data modification
Development cycle
Data replication
245. In horizontal splitting, we split a relation into multiple tables on the basis of
Common column values
Common row values
Different index values
Value resulted by ad-hoc query
246. For good decision making, data should be integrated across the organization to
cross the LoB (Line of Business). This is to give the total view of organization from:
Owner’s perspective
Customer’s perspective
Decision Maker’s perspective
Employee’s Perspective
247. A data warehouse may include
Legacy systems
Only internal data sources
0304-1659294 37
AL-JUNAID TECH INSTIUTE
Privacy restrictions
Small data mart
248. Multidimensional databases typically use proprietary ___ format to store pre-
summarized cube structures
File
Application
Aggregate
Database
249. All data is ___ of something real
I An abstraction
II A representation
250. Which of the following option is true?
I only
II only
Both I and II
None
0304-1659294 38
CS614- Data Warehousing
Solved MCQ(S) Oct
From Midterm Papers (1 TO 22 Lectures) 26,2017
BY Arslan
V-U For Updated Files Visit Our Site : Www.pakeducation4u.com Updated.
1/N PG # 22
N/2
lg(N)
OLAP PG # 30
OLTP
Data Cleansing
ETL
Decision makers
Knowledge workers
Both Knowledge workers and Decision makers PG # 18
The users of data warehouse are knowledge workers in other words they are decision makers in the organization.
4. The users of data warehouse are knowledge workers in other words they are _________ in the organization.
Decision maker PG # 18
Manager
Database Administrator
DWH Analyst
Unpredictable PG # 62
Predictable
Conventional
Unsurprising
Multi-level aggregates PG # 74
7. Multidimensional databases typically use proprietary __________ format to store pre- summarized cube
structures.
File PG # 79
Application
Aggregate
Database
8. Normally Selectivity of query in OLTP system is
High PG # 30
Low
Not measured
High
Low PG # 30
Not measured
Data Retrieval PG # 51
Data Modification
Development Cycle
Data Replication
Data Retrieval
Data Modification PG # 51
Development Cycle
Data Replication
De-normalization usually speeds up data retrieval, but it can slow the data modification processes.
12. Geography is a good example of
One-dimensional Hierarchy
Multidimensional Hierarchy PG # 52
Non-Dimensional
Linear Hierarchy
Round-Robin PG # 66
De-normalization
Normalization
OLAP is Analytical Processing instead of Transaction Processing. It is also NOT a physical database design or
implementation technique, but a framework.
15. ER is a logical design technique that seeks to remove the _____________ in data.
Redundancy PG # 98
Normalization
Anomalies
16. ER is a _______ design technique that seeks to remove the redundancy in data.
Logical PG # 98
Physical
Data Dependent
Transaction Dependent
17. Pre-computed _______ can solve performance problems
Aggregates PG # 111
Facts
Dimensions
18. Extract, Transform, Load (ETL) process consist of steps which are _________________.
19. In full extraction, data is extracted completely from the source system. Therefore there is no need to keep track
of changes to the ________
20. Data Extraction from source systems is a difficult task because source systems are ___________
Homogeneous
Heterogeneous PG # 140
Identical
Standardized
Coverage Anomalies
0 and 1 PG # 169
0 and 10
0 and 100
0 and 99
23. As consumers, human beings judge the quality of things during their life-time.
I Consciously
II Subconsciously
III Unconsciously
I Only
II Only
III Only
24. The growth of master files and magnetic tapes exploded around the mid- _______.
1950s.
1960s. PG # 12
1970s.
1980s.
25. Relational databases allow you to navigate the data in ____________ that is appropriate using the primary,
foreign key structure within the data model.
26. In ROLAP access to information is provided via relational database using _________ standard SQL.
ANSI PG # 78
Microsoft
Oracle
SAP
O (n)
O (w)
O (w n) PG # 171
O (w log n)
29. The telecommunications data warehouse is dominated by the sheer volume of data generated at the call level
______ area.
Subject PG # 35
Object
Aggregate
Detail
30. “Decision making is an iterative process; which must involve the users” is a classic statement of
OLAP PG # 69
ATM
OLTP
Data Mining
31. Cube is a logical entity containing values of a certain fact at a certain aggregation level at _____________ of a
combination of dimensions.
An Intersection PG # 88
A Union
A Subtraction
A Subset
32. After implementing Change Data Capture, the advantage we have is that, data is able to be integrated and
transformed __________
Finally data is able to be integrated and transformed "in-flight”. Once the update/transaction data has been pulled from
the log tape, the DWH is free to re-sequence, reformat, convert, merge, summarize, etc.
I Only PG # 180
II Only
Both I & II
None of I & II
34. _____________ queries deal with number of variables spanning across number of tables (i.e. join operations)
OLTP
DBMS
DSS PG # 21
ETL
35. In data warehouse, a query results in retrieval of hundreds of records from very large table. the ratio of number
of records retrieved to total number of records present is high and selectivity is:
Low PG # 22
High
Average
Non computable
36. At small scale e.g. hundreds of rows, difference between performance of complexity algorithms “O(n lg n)”
2
and “O(n ) is not _____________.
Noticeable
Small
Symmetric
Asymmetric PG # 32
Only One-to-One
Only Many-to-Many
Only One-to-Many
38. HOLAP provides a combination of relational database access and cube data structures. The goal is to get the
best of both MOLAP and ROLAP i.e.
Horizontal
Vertical PG # 56
Cross
Diagonal
40. “Change Data Capture” is one of the challenging technical issues in __________
Data merging is part of data transformation where multiple values are summarized into single summarized value.
42. Which of the following is NOT an example of a “Dimension”?
Date
Region
Product
Units sold PG # 74
44. Which of the following is not one of the characteristics of a Data warehouse?
Ad-hoc access
Complete repository
Historical data
Volatile PG # 18
A complete repository of historical corporate data extracted from transaction systems that is available for ad-hoc
access by knowledge workers
45. To handle dimensions that require the aggregation of multiple data quality indicators, which of the following
Complex Ratio
Aggregate Average
Weighted Ratio
Physical Extraction
Semantic Extraction
The two logical data extraction types are full and incremental extraction techniques.
47. “The environment is smart enough to develop or compute higher level aggregates using lower level or more
detailed aggregates”. Which of the following approach is described by the above statement?
Aggregate awareness PG # 87
Cube partitioning
Indexing
Linear
Quadratic
Logarithmic
Exponential PG # 15
One
Two PG # 103
Three
Four
There are actually two ways of “simplifying” the ER model i.e. (i) De -normalization and (ii) Dimensional Modeling.
51. Fact-less fact table is a fact table without numeric fact columns. It is used to capture relationship between
__________
Dimensions PG # 121
Attributes
Tables
Facts
52. A cube is not a data warehouse, it is a _______________
53. If we remove the modification anomalies the table comes in ________ form
1NF
2NF
3NF PG # 47
4NF
1NF
2NF PG # 45
3NF
BCNF
55. In MOLAP physically build cubes for direct access, support is not available for ______ SQL.
ANSI PG # 78
Microsoft
Oracle
SAP
MOLAP physically builds “cubes” for direct access - usually in the proprietary file format of a multi-dimensional
database (MDD) or a user defined data structure. Therefore ANSI SQL is not supported.
56. ______ is the lowest level of detail or the atomic level of data stored in the warehouse.
Aggregate
Cube
Grain PG # 111
Virtual Cube
Update PG # 43
Select
Both Update & Select
None of these
59. Node of a B-Tree is stored in memory block and traversing a B-Tree involves ______ page faults.
O (n)
O (n2)
O (n lg n)
O (log n) PG # 22
60. One major goal of horizontal splitting is
61. Fact-less fact table is a fact table without numeric fact columns. It is used to capture relationship between
__________
Dimensions PG # 121
Attributes
Tables
Facts
62. ________ gives total view of an organization
OLTP
Data warehouse PG # 16
OLAP
Data base
63. Experience showed that for a single pass magnetic tape that scanned 100% of the records, only _________ of
the records, sometimes even were actually required.
5% PG # 12
30%
50%
80%
OLTP PG # 20
DSS
ATM
OLAP
65. It is observed that every year the amount of data recorded in an organization
Doubles PG # 15
Triples
Quartiles
Remains same as previous year
66. Normalized design is likely to perform much faster than de-normalized design for queries that probe
Round-Robin PG # 66
De-normalization
Normalization
Pre-joining
Splitting Tables
Adding Redundant Column
ER Modeling PG # 52
mid-1970s
mid-1980s
mid-1960s PG # 12
mid-1950s
70. If each cell of Relation R contains a single value (no repeating values) then it is confirmed that
73. ____________ involves splitting a table by columns so that a group of columns is placed into the new table and
the remaining columns are placed in another new table.
Vertical splitting PG # 56
Horizontal splitting
Adding redundant column
None of the given options
74. In ____________ nested-loop join of quadratic time complexity does not hurt the performance
Note: Give me a feedback and your Suggestion also If you find any mistake in mcqz plz
inform me Viva Contact us Page on our Site. And tell me your answer with references.
For More Solved Papers By Arslan Visit Our Website :
Www.pakeducation4u.com