0% found this document useful (0 votes)
47 views

CS614 Merged

cs614 quiz 2

Uploaded by

1jaanali676
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
47 views

CS614 Merged

cs614 quiz 2

Uploaded by

1jaanali676
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 70

ORANGE MONKEY Page 1

CS614 ( Midterm Mcqs)

1. In pre-joining de-normalization technique, the header


size of Master table is 40 bytes and Detail table is 50
bytes. If the size of reference column is 8 bytes, then
the header size of de-normalized table will be _________.

90 bytes……..confirm

2. Pre-joining de-normalization technique is generally


used when there is __________ relationship between two
or more tables.

One to many…..confirm

3. ROLAP tools will query the relational database using


SQL generated to conform to a framework using the
facts and dimensions paradigm using the.

Star Schema…..confirm

ORANGE MONKEY Page 2


4. _________Should not be present in a relation, so that it
would be in third normal form(3NF)

Transitive dependency….confirm

5. The most common use of range partitioning is on ______.

Date…..confirm

6. In dimension modeling _________ can be more than


simple hierarchies i.e item, subcategory,etc

Dimension…..confirm

7. Which type of dependency is represented by the


following functional dependencies?
Book_Name -> Author_Name. Author_Age
Author_Name -> Author_Age

Full functional dependencies


8. Which one among the following is not an advantage of
horizontal splitting?

Increase I/O Overhead…..confirm

ORANGE MONKEY Page 3


9. One of the fundamental purpose of de-normalization is to _______ a
number of physical tables which ultimately reduce the number of joins
to answer a query.

Reduce or decrease……confirm

10. One goal of horizontal splitting is spreading rows of a table for


exploiting.

Parallelism…..confirm

11. In _________ index, the ith bit is set to “I” if the ith of the base table
has the value for the indexed column

Bitmap indexes……confirm

ORANGE MONKEY Page 4


12. More resources means proportionally less time for given amount of
data. This statement refers to.

Speed-up…..confirm

13. If resources increased in proportion to increase in data size time is


constant. This statement refers to

Scale-up….confirm

14. A join is identified by multiple tables in the ______ clause.

FROM…..confirm

ORANGE MONKEY Page 5


15. In context of nested-loop join actual number of matching rows
returned as a result of the join would be _______ of the order of tables.

Independent…..confirm

16. As the number of processors increase the speedup should also increase.
Thus we should have linear speedup. Which of the following is NOT one
of the barriers to achieve this linear.

No interface…..confirm

17. The online high performance transaction process was evolved in _____.

1975….confirm

ORANGE MONKEY Page 6


18. For good decision making, one should be able to integrate data across
the organization to give the _______ of organization.

Total view….confirm

19. Decision support system (DSS) is/an _____________.

Interactive system which enable decision makers to use databases and solve
un-structured

20. Which of the following is not an example of dimension?

Sales amount

21. In which class of aggregates median function can be placed?

HOLISTIC……confirm

22. In extract, load, Transformation (ELT) process, you don’t need to


purchase extra devices to achieve parallelism because:

You already have parallel data warehouse servers……confirm

ORANGE MONKEY Page 7


23. Which is not a class of aggregation functions?

Associative……confirm

24. _________ is a logical design technique that seeks to present the data in
a standard, instinctive structure that supports high-performance and
ease of understanding.

DM…..confirm

25. The _______ technique is a discipline used to highlight the microscopic


relationships among data elements or entities.

ER modeling…..confirm

ORANGE MONKEY Page 8


26. Which is not a basic task of data transformation?

Aggregation…..confirm

27. Which of the following is not an example of derived attributes

Annual salary…..confirm

28. In data transformation, __________ is the rearrangement and


simplification of individual fields to make them more useful for the data
warehouse environment.

Conversation or enrichment…confirm

ORANGE MONKEY Page 9


29. There are often multiple ways to represent the same piece of
information. FAST National Universty Fast Nu and not univ of
computers. This problem is known as __________.

Non primary key problem….confirm

30. In Extract,Load,Transform (ELT) process data transformation _____.

Takes place on a data warehouse server….confirm

31. In Extract Transform load(ETL) in which data transformation


____________

Takes place on a separate transformation server…..confirm

ORANGE MONKEY Page 10


AL-JUNAID TECH INSTIUTE
1. Onlline Extraction is a kind of---------------------------------data extraction.
o Logical
o Dimensional

o Physical page 132


o Multi valued
2. The ---------------- saw the advent of disk storage, or DASD( direct Access Storage
Device) :
• 1960s
• 1970s page 13
• 1950s
• 1990s

3. In context of data warehouse, normally it becomes difficult to extract data from


different sources because these sources are normally.
• Heterogeneous page 140
• Homogeneous
• Centralized
• Baseline

4. 5Which of the following is not a task of Data Transformation?


• Conversion
• Summarization
• Enrichment
• Full Data Refresh page 135

5. . Which of the following is not an Orr’s Law of Data Quality”?


• “Data that is not used cannot be corrected!”
• “Data quality is a function of its use, not its collection!”
• “Data will be no better than its most stringent use!”
• “Data duplication can be harmful for the organization! ” page 181
6. Flat files are one of the prevalent structures used in ------------------- data extraction:
• Online
• Offline page 134
• Incremental
• Full
7. Which of the following is NOT one of the advantages of changed data capture (CDC)
technique?

0304-1659294 1
AL-JUNAID TECH INSTIUTE
• Flat files are not required
• Limited query interface is required for data extraction page 152
• No incremental on-line I/O required for log tape
• Extraction of changed data occurs immediately
8. The most common range partitioning is on
• Color
• Date page 66
• PhoneNo
• Name
9. A relation is said to be in first normal form(1NF), if it does not contain ________
• Single value column
• Multi-valued column page 43
• Derived column
• Composite column
10. In a fully normalized database, too many ____________are required
• Values
• Joins page 49
• Queries
• Conditions

11. In the data warehouse, data is collection from -------------------- sources:


• Homogeneous
• Heterogeneous page 21
• External
• Internal
12. De-normalization is more like a “controlled crash” with the aim to ------------
without loss of information:
• Check
• Balance
• Decrease
• Enhance page 49
13. 1----------------- is making all efforts to increase effectiveness and efficiency in
meeting and accepted customer expectation:
• Quality assurance
• Quality improvement page 183
• Quality maintenance
• Quality Establishment
14. ------------- is the application of intelligence and experience to get common goals.

0304-1659294 2
AL-JUNAID TECH INSTIUTE
• Wisdom page 11
• Education
• Power
• Information
15. 1In the data transformation, ---------- is the rearrangement and simplification of individual
• Aggregation
• Enrichment page 136
• Splitting joining
• Conversion
16. Grain of a fact table means :
• The meaning of one fact table row page 109
• The meaning of one dimensional table row
• Summary of aggregates in all fact tables
• Summary of aggregates in all dimension tables
17. Normalization ----------------- :
• Reduces redundancy page 41
• Increases redundancy
• Reduces joins
• Reduces tables

18. 1Which of the following is not an example of a typical grain :


• Individual transaction
• Daily aggregates
• Monthly aggregates
• Normalized attributes page 111

19. Multi-dimensional databases(MDDs) typically use -------------------- formats to


store pre-summarized cube structures:
• SQL
• Proprietary file page 79
• Object oriented
• Non-proprietary file
20. ------------ provides a combination of “relational databases access” and “cube” data
structures within a single framework:
• HOLAP page 78
• DOLAP
• MOLAP

0304-1659294 3
AL-JUNAID TECH INSTIUTE
• ROLAP
21. Data Warehouse provides the best support for analysis while OLAP carries out the --------
-----
task:
• Mandatory
• Whole
• Analysis page 69
• Prediction
22. ----------------- involves splitting a table by columns so that a group of columns is placed
into the new table and the remaining columns are placed in another new table:
• Vertical splitting page 56
• Horizontal splitting
• Adding redundant column
• None of the given option
23. OLAP implementations are highly/completely ------------------ :
• Normalized
• Demoralized page 69
• Predictive
• Additive
24. If each cell of Relation R contains a single value ( no repeating values) then it is
confirmed that :
• Relation R is in 1st Normal Form page 43
• Relation R is in 2nd Normal Form
• Relation R is in 3rd Normal Form
• Relation R is in 3rd Normal Form but not in 2nd Normal Form
25. Which kind of relationships is captured by Fact less fact table:
• Many- to- Many page 121
• One-to-many
• One-to-one
• None of the given option
26. Which of the following is not an example of dimension:
• Product
• Date
• Region
• Sales volume page 78

27. Which people criticize Dimensional Modeling (DM) as being a data mart oriented
approach?

0304-1659294 4
AL-JUNAID TECH INSTIUTE
• Those that consider ER models as Data marts
• Those that consider Business processes as Data marts page 110
• Those that consider Data marts as Data warehouse
• Those that consider dimensional model
• Those that consider dimensional modeling as de-normalization approach

28. In a fully normalized form:


• To many joins are required page 49
• Relationships lose their significance
• No joins are required
• Data integrity becomes an issue
29. Which of the following is an example of Non-Additive Facts:
• Quality sold
• Total sale in Rs.
• Discount in percentage page 119
• Count of orders in a store
30. Which of the following is not a CUBE operation?
• ANSI SQL page 81
• Roll Up
• Drill Down
• Pivoting
31. -------------------- allows download of “cube” structures to a desktop platform without
the need for shared or cube server:
• MPLAP
• ROLAP
• DOLAP page 78
• HOLAP

32. ROLAP provides access to information via a relational database using:


• ANSI standard SQL page 78
• Proprietary file format
• Comma Separated Values
• All of the given option
33. -------------------- is usually deployed when expression can be used to group data together
in such a way that access can be targeted to a small set of partitions:
• Expression elimination
• Expression partitioning page 67

0304-1659294 5
AL-JUNAID TECH INSTIUTE
• Expression indexing
• None of the given option
34. Taken jointly, the extract programs or naturally evolving systems formed a spider
web, also known as
• Distributed Systems Architecture
• Legacy System Architecture page 14
• Online System Architecture
• Intranet System Architecture

35. The data has to be checked , cleaned and transformed into a --------------- format to
allow easy and fast access
• Unified page 20
• Predicated
• Qualified
• Proactive
36. Suppose in a system A, the values of “PhoneNo” attribute were stored in “countrycode-
phone-extension” format, however after transformation into data warehouse the
separate columns were used for “countrycode”,”phone” and “extension”. The above
scenario is an example of :
• One-to-one scalar transformation
• One-to-many element transformation page 144+conceptual
• Many-to-one element transformation
• Many-to-many element transformation
37. In decision support system ease of use in achieved by:
• Normalization
• Denormalization page no 49
• Drill up
• Drill down
38. Which of the following is one of the methods to simplify an ER model?
• Normalization
• Denormalization page no 103
• HOLAP
• Hybrid schema
39. In ETL process data transformation includes ----------------
• Data cleansing page 129
• Data aggregation
• Behavior checking

0304-1659294 6
AL-JUNAID TECH INSTIUTE
• Pattern recognition
40. Non-uniform use of abbreviations, units, and values refers to:
• Syntactically dirty data page 160
• Semantically dirty data
• Coverage anomaly
• Extraction issue
41. Suppose the size of the attribute “Computerized National Card (CNIC) no. is changed
in NADRA database. This transformation refers to:
• Format revision page 153
• Field splitting
• Field decoding
• Calculation of derived value
42. The divide and conquer cube partitioning approach helps alleviate the ------------
limitations of MOLAP implementation:
• Flexibility
• Maintainability
• Security
• Scalability page 85
43. identify the TRUE statement:
• DM is inherently dimensional in nature
• DM comprises of a single central fact table
• DM comprises of a set of dimensional tables
• All of the given option Page 103
44. ------------- can be used when some columns are rarely accessed rather than other
columns or when the table has wide rows or header or both:
• Horizontal splitting
• Pre-joining
• Vertical splitting page 56
• Derived attributes
45. Which of the following is an example of derived attributes?
• Age page 61
• Size
• Color
• Length
46. The online high performance transaction processing was evolved in --------------:
• 1980
• 1975 page 12
• 1977

0304-1659294 7
AL-JUNAID TECH INSTIUTE
• 1965
47. Cube is a logical entity containing values of a certain aggregation level at an
intersection of a combination of -------------------- :
• Facts
• Dimension page 88
• Summary tables
• Primary and foreign key
48. Which of the following is TRUE regarding Entity relationship modeling?
• It does not really model business, but models the micro relationships
among data elements.
• ER modeling does not have “business rules,” it has “data rules
• ER modeling helps retrieval of individual records having certain critical
identifiers.
• All of the given option page 102
49. ------Facilitates a mobile computing paradiagramn:

• HOLAP
• DOLAP page78
• ROLAP
• MOLAP

50. The main reason(s )for the increase in cube size may be:
• Increase in the number of dimensions
• Increase in the cardinality of the dimensions
• Increase in the amount of detail data
• All of the given options page 87
51. Suppose the amount of data recorded in an organization is doubled in year. This increase
in ------
• Linear
• Quadratic
• Exponential page 15
• Logarithmic
52. The data in the data warehouse is ----------- :
• Volatile
• Non-volatile page 69
• Static
• Non-structured

0304-1659294 8
AL-JUNAID TECH INSTIUTE
53. --------------- models the macro relationships among data elements with an overall
deterministic strategy:
• Dimensional model page102
• Entity relationship model
• Object oriented model
• Structured model
54. ----------------- technique requires a separate column to specify the time and date when
the last modification was occurred:
• Checkmarks
• Timestamps page 150
• Just-in-Time
• Real Time extraction
55. Which of the de-normalization technique squeezes master table into detail?
• Pre-joining page 58
• Horizontal splitting
• Vertical splitting
• Adding redundant column
56. De-normalization can help:
• Minimize joins
• Minimize foreign keys
• Resolve aggregates
• All of the given options page 51

57. The domain of the “gender” field in some database may be (‘F’,’M’) or as (“Female”,
“Male”) or even as (1, 0). This is:
• Primary key problem
• Non primary key problem page 163
• Normalization problem
• All of the given option
58. Increasing level of normalization --- number of
------------- tables:

• Increases page 51
• Decreases
• Does not effect
• None of the given option
59. Which of the following is not a Data Quality Validation Technique:
• Referential integrity

0304-1659294 9
AL-JUNAID TECH INSTIUTE
• Using Data Quality Rules
• Data Histograming
• Indexes page 189
60. This technique can be used when column from one table is frequently accessed in a
large scale join in conjunction with a column from another table:
• Horizontal splitting
• Pre-joining
• Adding redundant column page 58
• Derived attributes
61. Data cleansing requires involvement of domain expert because:
• Domain expert has deep knowledge of data aggregation
• Change Data captures requires involvement of domain expert
• Domain knowledge is required to correct anomalies page 158
• Domain expert has deep knowledge of data summarization
62. Relational databases allow you to navigate the data in ------------- that is appropriate
using the primary , foreign key structure with in the data model:
• Only One Direction
• Any Direction page 19
• Two Direction
• None of these
63. History is excellent predicator of the ------------:
• Past
• Present
• Future page 15
• History
64. De- normalization is the process of selectively transforming normalized relations
into un-normalized physical record specifications, with the aim to:

• Well structure the data


• Well model the data
• Reduce query processing time page 50
• None of the given option
65. ----------------- gives total view of an organization:
• OLAP
• OLTP
• Data Warehouse page 16
• Database

0304-1659294 10
AL-JUNAID TECH INSTIUTE
66. Suppose in system A, the possible values of “Gender” attribute were “Male”& “Female”,
however in data warehouse ,the values stored were “M” for male and “F” for female.
This above scenario is an example of :
• One-to-one scalar transformation page 144
• One-to-many element transformation
• Many-to-one element transformation
• Many-to-many element transformation
67. Enrichment is one of the basic tasks in data ---------------- :
• Extraction
• Transformation page 138
• Loading
• Summarization
68. Which of the following is not a technique of De-normalization?
• Pre-joining
• Splitting tables
• Adding redundant columns
• ER modeling page 52
69. Which of the following is an example of Additive Facts?
• Sales Amount page 119
• Average
• Discount
• Ratios
70. Robotic libraries are needed for -------------------------:
• Cubes
• Data marts
• Data warehouse page 131
• Aggregates
71. Normally ROLAP is implemented using ----------------
• Star schema page 87
• Hybrid schema
• Pre-defined aggregate
• All of the given options
72. The relation R will be in 2nd Normal Form if
• It is in 1NF and each cell contains single value
• It is in 1NF and each non key attribute is dependent upon entire primary key
page 44
• It is in 1NF and non key attribute is dependent upon a single column of
composite primary key

0304-1659294 11
AL-JUNAID TECH INSTIUTE
• It is in 1NF and Primary key is composite
73. In --
----- ested loop join of quadratic time complexity does not hurt the
---- performance

Typical OLTP page


• environments 22
• Data warehouse
• DSS
• OLAP
74. In Extract, Load, Transform(ELT) process, data transformation ---------------:
• Takes place on the data warehouse server page 147
• Takes place on a separate transformation server
• Depends on the nature of the source database
• Does not take place
75. Node of a B-Tree is stored in memory block and traversing a B-Tree involves ----------
----- page faults:
• O(n log n)
• O(log n) page 22
• O(n)
• O(n2)
76. As dimensions get less detailed (e.g. , year vs. day) cubes get --------------------
• Smaller page 84
• Larger
• Partitioned
• Merged
77. Which of the following is not a technique of “ Changed Data Capture” in currently used
Modren
Source System?
• Timestamps
• Partitioning
• Triggers
• Dimensional Modeling page 150

78. The trade-offs of de-normalization is/are:


• Storage
• Performance

0304-1659294 12
AL-JUNAID TECH INSTIUTE
• Ease-of-use
• All of the given options page 62
79. If actual data structure does not conform to documented formats then it is called:
• Syntactically dirty data page 160
• Semantically dirty data
• Coverage anomaly
• Extraction issue
80. “Header size is reduced, allowing more rows per back , thus reducing I/O” .The above
statement is TRUE with respect to:
• Vertical splitting page 56
• Horizontal splitting
• Adding redundant column
• None of the given options
---------------- Breaks a table into multiple tables based upon common column values

page
• Horizontal splitting 54
• Vertical splitting
• Adding redundant column
• None of the given option
81. Which of the following is NOT an example of derived attribute?
• Age
• CGPA
• Area of rectangle
• Height (Conceptual)

82. Which of the following is NOT an example of derived attribute?


• Age
• CGPA
• Annual Salary

83. If a table is expected to have six columns but some or all of the records do not have six
columns then it is example of:
• Syntactically dirty data page 160
• Semantically dirty data
• Coverage anomaly
• Extraction issue

0304-1659294 13
AL-JUNAID TECH INSTIUTE
84. MDX by Microsoft is an example of ------------------------:
• HOLAP
• DOLAP
• ROLAP
• None of the given options page 79

85. The growth of master files and magnetic tapes exploded around the mid- ---------------
• 1950s
• 1960s page 12
• 1970s
• 1980s
86. If one or more records in a relational table do not satisfy one or more integrity constraint
, then the data:
• Is syntactically dirty
• Is semantically dirty page 160
• Has Coverage anomaly
• Has extraction issue
87. OLAP is:
• Analytical processing page 69
• Transaction processing
• Additive processing
• Active processing
88. One of the possible issues faced by web scrapping is that:
• Web pages may contain junk data page 141
• Web pages do not contain multiple facts
• Web pages do not contain multiple dimensions
• Web pages does not support transformation

89. Which of the following is\are example of dimension:


• Product page 79
• Region
• Data
• None of the given
90. An OLTP system is always good at ------------------------:
• Evolving data page 122
• Keeping static data
• Tracking past data

0304-1659294 14
AL-JUNAID TECH INSTIUTE
• Maintaining historic data

91. In case of multiple sources for the same data element , we need to prioritize the source
systems per element based, the process is called:
• Ranking page 143
• Prioritization
• Element selection
• Measurement selection
92. One feature of Change Data Capture (CDC) is that:
• It pre-calculates changed aggregates
• It loads the transformed data in real time
• It only processes the data has been changed
• It can automate the transformation of extracted data page 149
93. In ------------------ SQL generation in vastly simplified for front-end tools when the
data is highly structure:
• MOLAP
• Star Schema page 107
• Hybrid schema
• Object oriented schema
94. Dirty data means:
• Data cannot be aggregated
• Data contains non-additive facts
• Data does not fulfill dimensional modeling rules
• Data does not conform to proper domain definitions page 158
95. In Context of Change Data Capture (CDC) sometimes a ------------- object can be used
to store recently modified data:
• Buffer table
• Change table page 149
• Checkmark table
• Change control table
96. “Sometimes during data collection complete entities are missed”. This statement is an
example of :
• Missing tuple page 161
• Missing attribute
• Missing aggregates

0304-1659294 15
AL-JUNAID TECH INSTIUTE
• Semantically dirty data
97. Table collapsing technique is applied in case of:
• One-by-one relation or many-to –many relation page 52
• One-to-many relation
• Many-to-many relation
• None of the given option
98. Which of the following is an example of dimension?
• Product
• Region
• Date
• All of the given option page 78
99. Data warehouse stores -------------------:
• Operational data
• Historical data page 24
• Meta data
• Log files data
100. The business process covered by ER diagrams:
• Do not co-exist in time and space page 109
• Co-exist in time and space
• Do not physically exist in real time context
• None of the given options
101. The main goal of normalization is to eliminate -----------:
• Data redundancy page 41
• Data sharing
• Data security
• Data consistency
102. Serious ---- involves decomposing and resembling the data:
• Data cleansing page 168
• Data transformation
• Data loading
• Data extraction
103. In the data warehouse environment the data is ------------
• Subject- oriented page 69
• Time- oriented
• Both subject and time oriented

0304-1659294 16
AL-JUNAID TECH INSTIUTE
• Neither time-oriented nor subject- oriented
104. For large record spaces and large number of records , the run time of the
clustering algorithms:
• Prohibitive page 164
• Static
• Exponential
• Numerical
105. 1------------- can result in costly errors, such as , False frequency distributions and
incorrect aggregates due to double counting:
• Data duplication page 165
• Data reduction
• Data anomaly
• Data transformation

106. The degree to which values are present in the attributes that require them is
known as -
---------------------:
• Completeness page 185
• Uniqueness
• Accessibility
• Consistency
107. Time complexity of Key Creation process in basic Sorted Neighborhood (BSN)
Method is
----------------------:
• O(n log n)
• O(log n)
• O(n) page 171
• O(2n)
108. Which of the following is an example of slowly changing dimensions?
• Inheritance page 124
• Aggregation
• Association
• Asset disposal
109. The ------------ operator proves useful in more complex metrices
applicable to the dimensions and accessibility:
• Max page 188
• Min
• Max and Min
• None of the given

0304-1659294 17
AL-JUNAID TECH INSTIUTE
110. In OLAP , the typical write operation is ------------- :
• Bulk insertion page 75
• Single insertion
• Sequential insertion
• No insertion
111. 1The issue(s) of “ Adding redundant column” includes(s):
• Increase in table size
• Maintenance
• Loss of information
• All of the given option page 65

112. -------------- is applicable in Profitability analysis:


• OLTP
• Data warehouse page 36,37
• Information System(IS)
• Management Information System(MIS)

113. The hardware (CPU) utilization in data warehouse environment is full or ----------
-:
• Fixed
• Partial
• Not at all page 24
• Slow
114. 1Time variant is a characteristics of data warehouse which means:
• Data loaded in data warehouse will be time stamped page 20
• Data can be loaded in data warehouse anytime
• Data can be loaded in data warehouse only at a particular time
• Data cannot be loaded in data warehouse with respect to time

115. 1In which class of aggregates AVERAGE function can be placed:


• Algebraic page 120
• Distributed
• Associative
• Holistic
116. Considered the following Employee table and identify the column which causes
that the table is not in first normal form(1NF): (Emp_ID, Emp_Name ,Emp_skills,
Emp_Designation)

0304-1659294 18
AL-JUNAID TECH INSTIUTE
• Emp_ID
• Emp_Name
• Emp_skills page 43(conceptual)
• Emp_Designation
117. The application of data and information leads to -------------
• Intelligence
• Experience
• Knowledge page 11
• Power
118. 1--------------- segregate data into separate partitions so that queries do not need to
examine all data in a table when WHERE clause filters specify only a subset of the
partitions.
• Pre-joining technique
• Collapsing table technique
• Horizontal splitting technique page 56
• Vertical splitting technique
119. -------------should not be present in a relation, so that it would be in second
normal form (2NF).
• Partial dependency page 44 (conceptual)
• Full functional dependency
• Multivalued dependency
• Transitive dependency
120. 1Records referring to the same entity are represented in different formulas in
the different data sets or are represented erroneously. Thus duplicate records will
appear in the merged database. This problem is known as------------.
• Merge/purge problem page 168
• Duplication problem
• Redundant duplication problem
• Redundant problem
121. The data perspective in OLTP system is operational, while that in data warehouse
is:
• Fully normalized
• Fully de-normalized
• Fully summarized
• Historical and detailed page 30

0304-1659294 19
AL-JUNAID TECH INSTIUTE
122. 1Simple scalar transformation is a--------------mapping from one set of values
to another set of values using straightforward rules.
• One-to-one page 144
• One-to-many
• Many-to-many
• Many-to-one
123. ---------------can be created in operational systems to keep tracks of
recently updated records.
• Triggers page 150
• Timestamps
• Partitioning
• ELT
124. 1Development of data warehouse is hard because data sources are usually--------
• Structured and homogeneous
• Unstructured and heterogeneous page 31
• Structured and heterogeneous
• Unstructured and homogeneous
125. In a decision support environment, the decision maker is interested in -------------.
• Only limited organizational data
• Big picture of organizational data page 21
• Only sale related data
• Only customer related data
126. Information can answer question like “what”, “who” and “when”
while knowledge can answer question like----------------.
• Why
• Where
• Which
• How page 11
127. OLTP implementations are fully-------------.
• Normalized page 69
• Denormalized
• Predictive
• Additive
128. Which logical data extraction has significant performance impacts on the data
warehouse server?
• Incremental Extraction page 133
• Online Extraction

0304-1659294 20
AL-JUNAID TECH INSTIUTE
• Offline Extraction
• Legacy Vs OLTP
129. 1Consider the following Student table and identify the column which causes
that the table is not in first normal form(1NF).
Student(Std_ID, Std_Name ,Std_CGPA ,Std_Hobbies)
• Std_ID
• Std_Name
• Std_CGPA
• Std_Hobbies page 43(Conceptual)
130. Analytical processing uses ---------------
• Multi-level aggregates page 74
• Record level aggregates
• Table level aggregates
• All of the given options
131. Which is not a class of anomalies in following?
• Dirty anomalies page 160
• Syntactically dirty data
• Semantically dirty data
• Coverage anomalies

132. 1------------- is a system of activities that assures conformance of


product to pre-established requirements.
• Quality assurance page 183
• Quality improvement
• Quality Maintenance
• Quality Establishment
133. Two interesting examples of quality dimensions that can make use of min
operator are ------
• Believability and appropriate amount of data page 188
• Believability and consistency
• Believability and Redundancy
• Reliability and appropriate amount of data
134. -------------- in database or data warehouse has no actual value; it only has
potential
value.
• Data page 181
• Entity

0304-1659294 21
AL-JUNAID TECH INSTIUTE
• Flat tables
• Data marts
135. 1In OLTP environment the selectivity is ------------ and ---------- in data
warehouse environment.
• High, Low page 22
• Low, High
• High, Fixed
• Fixed, Low
136. Which is not a/an characteristics of data quality?
• Reliability page 186
• Uniqueness
• Accessibility
• Consistency
137. 1If a product meets formally defined “requirement specifications”, yet fails to be
a quality product from the customer’s perspective , this means the requirements were -----
------.
• Defective page 180
• Unclear
• Unrefined
• Undefined

138. The relation R will be in 3rd Normal Form if:


• It is in 2NF each cell contains single value
• It is in 2NF and every non-key column is non-key transitively dependent
upon its primary
key. Page 46
• It is in 1NF and each non key attribute is dependent upon a single column of
composite primary key.
• It is in 2NF and each non key attribute is dependent upon other non-key attribute.
139. Decision support system queries deal with number of columns ------------
• Having numeric values
• In a single table
• In a single view
• Spanning across multiple tables page 21

140. Normalization is used to reduce:


• Reduces redundancy page 41
• Increases redundancy
• Reduces joins

0304-1659294 22
AL-JUNAID TECH INSTIUTE
• Reduces tables
141. 1The end user of data ware house are---------------.
• Programmers
• Database developers
• Data entry operator
• Business executives page 18 + 19

142. Which one are the characteristics of data warehouse queries?


• Use primary key
• High selectivity
• Use multiple tables page 30
• Very low performance
143. 1Referential Integrity (RI) using a naive approach would take ------------------
time.
• O(n) page 160
• O(1)
• O(log n)
• None of the given
144. Web scrapping is a process of applying ------------- techniques to the web
• Screen scrapping page 146
• Data scrapping
• Text scrapping
• Meta scrapping
145. Which is not an issue of ROLAP in the following?
• Standard hierarchy of dimensions page 92
• Non-standard conventions
• Maintenance
• Aggregation
146. One of the fundamental purpose of de-normalization is to --------------------- a
number of physical tables which ultimately reduce the number of joins to answer a
query.
• Delete
• Reduce page 50
• Increase
• Decrease
147. ----------- is not the characteristic of data warehouse.
• Time variant
• Subject-oriented
• Integrated

0304-1659294 23
AL-JUNAID TECH INSTIUTE
• Volatile page 69
148. Which is not a/an step of data cleansing procedure?
• Aggregation page 168
• Elementizing
• Standardizing
• Verifying

149. 1Instance matching between different sources is then achieved by a standard ------
-------
on identifying attribute(s), if you are very, very, very lucky.
• Equi-join page 169
• Inner join
• Outer join
• Fuller join
150. Ad-hoc access of data warehouse means:
• That have predefined database access pattern
• That does not have predefined database access pattern page 18
• That could be accessed by any user
• That could not be accessed by any user
151. In OLTP environment, the size of tables is relatively-------------------
• Large
• Fixed
• Moderate
• Small page 22

152. 1------------- is a/an measure of how current or up to date the data is:
• Timeliness page 185
• Completeness
• Accessibility
• Consistency
153. 1The process of converting entity relationship model in to dimensional model of -
---------
steps:
• Two
• Three
• Four page 109

0304-1659294 24
AL-JUNAID TECH INSTIUTE
• Five
154. A ------------- Is defined by a group of records that have similar characteristics
(“behavior”) for p% of the fields in the data set, where p is a user- defined
value(usually above 90).
• Pattern page 164
• Cluster
• Entity
• Attribute
155. ------------------ is known as state of being only one of its kind or being without an
equal or
parallel.
• Completeness
• Uniqueness page 185
• Accessibility
• Consistency
156. Which of the following is not an example of fact?
• Account no page 74
• Sales quantity
• Per unit sales amount
• Sales amount
157. 1------------------is the degree to which data is accurately reflects the real world
object that the data represents.
• Intrinsic data quality page 181
• Realistic data quality
• Strong data quality
• Weak data quality

158. 1Which one among the following data warehouse stores data containing long
period?
• Telecommunication data warehouse
• Financial data warehouse
• Human resource data warehouse
• Insurance data warehouse page 36

159. A ________ dimension is a collection of random transactional codes,


flags and/text attributes that are unrelated to any particular dimension. The

0304-1659294 25
AL-JUNAID TECH INSTIUTE
______ dimension is simply a structure that provides a convenient place to
store the ______ attributes.
• Junk
• Time
• Parallel
• None of these
160. 1Data Warehouse is about taking / collecting data from different ________
sources.
• Harmonized
• Identical
• Homogeneous NOT CONFIRM
• Heterogeneous
161. 1Taken jointly, the extract programs or naturally evolving systems formed a
spider web, also known as
• Distributed Systems Architecture
• Legacy Systems Architecture
• Online Systems Architecture
• Intranet Systems Architecture
162. It is observed that every year the amount of data recorded in an organization
• Doubles
• Triples
• Quartiles
163. The users of data warehouse are knowledge workers in other words they
are _________ in the organization.
• DWH Analyst
• Decision maker
• Database Administrator
• Manager
164. Node of a B-Tree is stored in memory block and traversing a B-Tree involves
______ page faults.
• O (n lg n)
• O (log n) { O(log n) it’s the real answer}
• O (n)
• O (n2)
165. In _________ system, the contents change with time.
• OLTP
• ATM
• DSS
• OLAP
166. The growth of master files and magnetic tapes exploded around the mid-
_______.
• 1950s.
• 1960s.

0304-1659294 26
AL-JUNAID TECH INSTIUTE
• 1970s.
• 1980s.
167. Relational databases allow you to navigate the data in ____________ that is
appropriate using the primary, foreign key structure within the data model
• Only One Direction
• Any Direction
• Two Direction
• None of these
168. 1Naturally Evolving architecture occurred when an organization had a _______
approach to handling the whole process of hardware and software architecture.
• Relaxed
• Good
• Not Relaxed
• None
169. ________ gives total view of an organization
• OLAP
• OLTP
• Data Warehouse
• Database
170. Suppose the amount of data recorded in an organization is doubled every year.
This increase is
__________ .
• Linear
• Quadratic
• Exponential
• Logarithmic
171. ________is the application and experience to get common goals.
 Wisdom
 Power
 Information
 Education
172. Normally ROLAP is implemented using___________.
 Star Schema
 Pre-defined aggregates
 Hybrid Schema
 All of the given options
173. Identify data warehouse query from the following.
 Number of students studying CS614 course
 Factors which can effect students’ performance
 List of students belong to Lahore city
 Total number of students that have 3.5 CGPA

0304-1659294 27
AL-JUNAID TECH INSTIUTE
174. In _________ SQL generation is vastly simplified for front-end tools when
the data is highly structured.
 Hybrid Schema
 Star Schema
 MOLAP
 Object oriented schema
175. __________ models the Marco relationships among data elements with an
overall deterministic strategy.
 Dimensional model
 Structured model
 Object oriented model
 Entity relationship model
176. The _________ task is typically performed after most other transformation
and cleaning steps have taken place.
 Data extraction
 Data duplicate elimination
 Data transformation
 Data loading
177. __________ is known as state of being only one of its kind or being without
an equal or parallel.
 Completeness
 Consistency
 Accessibility
 Uniqueness
178. Relational modeling techniques are used to develop__________ systems.
 OLTP
 MOLAP
 OLAP
 ROLAP
179. Which of the following is not CUBE operation?
 ANSI SQL
 Drill Down
 Pivoting
 Roll UP
180. The hybrid OLAP(HOLAP) solution is a mix of ___________ architecture
that supports queries against summary and transaction in an integrated.
 ROLAP

0304-1659294 28
AL-JUNAID TECH INSTIUTE
 OLTP
 MOLAP and ROLAP
 MOLAP
181. The response time of OLTP queries are in milliseconds to seconds. While
that of data warehouse queries are in:
 Days of weeks
 Hours to days
 Milliseconds to microseconds
 Minutes to hours
182. __________ allows download of “cube” structures to a desktop platform
without the need for shared relational or cube server.
 ROLAP
 HOLAP
 DOLAP
 MOLAP
183. In case of multiple sources for the same data element. We need to prioritize
the source systems per element bases, the process is called:
 Ranking
 Measurement selection
 Element selection
 Prioritization
184. OLAP is:
 Implementation technique
 Framework
 None of the given options
 Physical database design
185. Which of the following is not a task of Data Transformation?
 Summarization
 Enrichment
 Conversion
 Full data refresh
186. Simple scalar transformation is a ____________ mapping from one set of
values to another set of values using straightforward rules.
 Many-to-one
 Many-to-many
 One-to-one
 One-to-many

0304-1659294 29
AL-JUNAID TECH INSTIUTE
187. Which of the following is not a data quality validation technique?
 Indexes
 Using data quality rules
 Referential integrity
 Data Histograming
188. __________ are created out from the data warehouse to service different
needs of departments in the organization.
 Databases
 Data marts
 MIS
 Tables
189. An OLTP system is always good at___________.
 Maintaining historical data
 Keeping static data
 Tracking past data
 Evolving data
190. _________ incorporates the concept of product quality, process control,
quality improvement.
 Intrinsic data quality management
 Realistic Data Quality Management
 Total Quality Management
 Strong Data Quality Management
191. “Sometimes during data collection complete entities are missed”. This
statement is an example of:
 Missing aggregate
 Missing tuple
 Semantically dirty data
 Missing attribute
192. The last step of Software Development Life Cycle (SDLC) is implementation.
While that of data warehouse is:
 Integration
 Understanding requirements
 Testing
 Analysis
193. In which class of aggregates Max function can be places?
 Algebraic
 Associative

0304-1659294 30
AL-JUNAID TECH INSTIUTE
 Holistic
 Distributive
194. Telecommunication data warehouse is dominated by the__________ volume
of data generated at the call level.
 Incomplete
 Semi-incomplete
 Sheer
 Partial
195. If a table is expected to have six columns but some or all of the records do
not have six columns then it is example of:
 Semantically dirty data
 Syntactically dirty data
 Extraction issue
 Coverage anomaly
196. In the data warehouse environment the date is__________.
 Both subject and time oriented
 Neither time-oriented nor subject-oriented
 Subject-oriented
 time-oriented
197. dirty data means that:
 Data control be aggregated
 Data contain non-additive facts
 Data does not fulfill dimensional modeling rules
 Data does not conform to proper domain definitions
198. One of the main reason for the failure of DWH deployments is________.
 Data anomaly
 Data integrity
 Data quality
 Data duplication
199. The data in the data warehouse is__________.
 Non-structured
 Volatile
 Static
 Non-volatile
200. Which of the following is NOT an example of a typical grain?
 Daily aggregates
 Monthly aggregates

0304-1659294 31
AL-JUNAID TECH INSTIUTE
 Individual transaction
 Normalized attribute
201. Data warehouse stores ___
 Operational data
 Historical data
 Meta data
 Log files data
202. The ___ dimension represents data correctness
 Free-of-error
 Completeness
 Consistency
 Correctness
203. Which of the following is not a Data Quality Validation Technique?
 Referential Integrity
 Using Data Quality Rules
 Data Histograming
 Indexes
204. Which of the following is an example of Non-Additive Facts?
 Quantity sold
 Total Sale in Rs.
 Discount Percentage
 Count of orders in a store
205. Which is the most complex type of transformation in the following?
 Many-to-many element transformation
 One-to-one scalar transformation
 One-to-many element transformation
 All of the given
206. Serious ___ involves decomposing and reassembling the data
 Data cleansing
 Data transformation
 Data loading
 Data Extraction
207. ___ is the degree of utility and value the data has to support the enterprise
processes that enable accomplishing enterprise objectives
 Intrinsic Data Quality
 Realistic Data Quality
 Strong Data Quality
 Weak Data Quality
208. In a decision support system ease of use is achieved by:
 Normalization
 Denormalization
 Drill up

0304-1659294 32
AL-JUNAID TECH INSTIUTE
 Drill down
209. Assume a company with a multi-million row customer table i.e. n rows.
Checking for Referential Integrity (RI), using a smart technique with some kind of tree
data structure would require ___ time
 O(log n)
 O(n)
 O(1)
 None
210. Which of the following is NOT an example of a typical grain?
 Individual Transactions
 Daily aggregates
 Monthly aggregates
 Normalized attributes
211. Most DWH implementations today do not use ___ enforced by the database, but
as TQM methods improve overall data quality and database optimizers
 Consistency Integrity
 Referential Integrity
 Attribute domain
 Using Data Quality Rules
212. Suppose in system A, the possible values of “Gender” attribute were “Male” &
“Female”, however in data warehouse, the values stored were “M” for male and “F” for
female. The above scenario is an example of:
 One-to-one scalar transformation
 One-to-many element transformation
 Many-to-one element transformation
 Many-to-many element transformation
213. Development of data warehouse is hard because data sources are usually ___
 Structured and homogeneous
 Unstructured and heterogeneous
 Structured and heterogeneous
 Unstructured and homogeneous
214. A/an ___ is a collection of random transactional codes, flags and/or text attributes
that are unrelated to any particular dimension
 Junk dimension
 Slowly changing dimension
 Multi-valued dimension
 Simple dimensions
215. ROLAP provides access to information via a relational database using
 ANSI standard SQL
 Proprietary file format
 Comma Separated Values
 All of the given

0304-1659294 33
AL-JUNAID TECH INSTIUTE
216. The typical availability of OLTP system is 24/7, while that of data warehouse is
___
 6/12
 7/12
 1/24
 Twice a week
217. In ___ nested-loop join of quadratic time complexity does not hurt the
performance
 Typical OLTP environments
 Data warehouse
 DSS
 OLAP
218. If actual data structure does not conform to documented formats then it is called:
 Syntactically dirty data
 Semantically dirty data
 Coverage anomaly
 Extraction issue
219. Which of the following is not a CUBE operation?
 ANSI SQL
 Roll UP
 Drill down
 pivoting
220. The data has to be checked, cleansed and transformed into a ___ format to allow
easy and fast access
 unified
 predicted
 qualified
 proactive
221. Which is not a/an characteristics of data quality?
 Reliability
 Uniqueness
 Accessibility
 Consistency
222. The extent to which data is in appropriate languages, symbols, and units, and the
definitions are clear is known as ___
 Interpretability
 Uniqueness
 Accessibility
 Consistency
223. In case of multiple sources for the same data element, we need to prioritize the
source systems per element bases, the process is called
 Ranking
 Prioritization

0304-1659294 34
AL-JUNAID TECH INSTIUTE
 Element Selection
 Measurement Selection
224. In OLTP environments, the size of tables is relatively ___
 Large
 Fixed
 Moderate
 Small
225. Change Data Capture (CDC) can be challenging task because
 Aggregates don’t change in real time
 Transformation of extracted data is difficult
 Identifying the recently modified data may be difficult
 Source systems may not support extraction of changed aggregates
226. ___ is the extent to which data is regarded as true and credible
 Believability
 Completeness
 Accessibility
 Consistency
227. The relation R will be in 2nd Normal Form if
 It is in 1NF and each cell contains single value
 It is in 1NF and each non key attribute is dependent upon entire primary key
 It is in 1NF and each non key attribute is dependent upon a single column of composite
primary key
 It is in 1NF and Primary key is composite
228. ___ is the degree to which data accurately reflects the real-world object that the
data represents
 Intrinsic Data Quality
 Realistic Data Quality
 Strong Data Quality
 Weak Data Quality
229. Web scrapping is a process of applying ___ techniques to the web
 Screen scrapping
 Data scrapping
 Text scrapping
 Meta scrapping
230. In which class of aggregates AVERAGE function can be placed?
 Algebraic
 Distributive
 Associative
 Holistic
231. Which of the following is not an “Orr’s Law of Data Quality”?
 “Data that is not used cannot be correct!”
 “Data quality is a function of its use, not its collection”
 “Data will be no better than its most stringent use!”

0304-1659294 35
AL-JUNAID TECH INSTIUTE
 “Data duplication can be harmful for the organization!”
232. The ___ operator proves useful in more complex metrics applicable to the
dimensions of timeliness and accessibility
 Max
 Min
 Min or Max
 None
233. Which is not a/an Data Quality Validation Technique?
 Consistency Integrity
 Referential Integrity
 Attribute Domain
 Using Data Quality Rules
234. Assume a company with a multi-million row customer table i.e. n rows.
Checking for Referential Integrity (RI) using a naïve approach would take ___ time.
 O(n)
 O(1)
 O(log n)
 None
235. ___ breaks a table into multiple tables based upon common column values

 Horizontal splitting
 Vertical splitting
 Both
 None of these
236. Companies collect and record their own operational data, but at the same time
they also use reference data obtained from ___ sources such as codes, prices etc.
 Operational
 None
 Internal
 External
237. Ad-hoc access means to run such queries which are known already
 True
 False
238. Relational databases allow you to navigate the data in ____ that is appropriate
using the primary, foreign key structure within the data model.
 Only One direction
 Any direction
 Two direction
 None
239. DSS queries do not involve a primary key
 True
 False

0304-1659294 36
AL-JUNAID TECH INSTIUTE
240. The need to synchronize data upon update is called
 Data Manipulation
 Data Replication
 Data Coherency
 Data imitation
241. Taken jointly, the extract programs or naturally evolving systems formed a spider
web, also known as
 Distributed Systems Architecture
 Legacy Systems Architecture
 Online Systems Architecture
 Intranet Systems Architecture
242. Node of a B-Tree is stored in memory block and traversing a B-tree involves ___
page faults
 O(n)
 O(n^2)
 O(n log n)
 O(log n)
243. Which statement is true for De-Normalization?
 Redundant data is a performance liability at query time, but is a performance benefit at
update time
 Redundant data is a performance liability at both query time and update time
 Redundant data is a performance benefit at both query time and update time
 Redundant data is a performance benefit at query time, but is a performance liability at
update time
244. De-normalization normally speeds up
 Data Retrieval
 Data modification
 Development cycle
 Data replication
245. In horizontal splitting, we split a relation into multiple tables on the basis of
 Common column values
 Common row values
 Different index values
 Value resulted by ad-hoc query
246. For good decision making, data should be integrated across the organization to
cross the LoB (Line of Business). This is to give the total view of organization from:
 Owner’s perspective
 Customer’s perspective
 Decision Maker’s perspective
 Employee’s Perspective
247. A data warehouse may include
 Legacy systems
 Only internal data sources

0304-1659294 37
AL-JUNAID TECH INSTIUTE
 Privacy restrictions
 Small data mart
248. Multidimensional databases typically use proprietary ___ format to store pre-
summarized cube structures
 File
 Application
 Aggregate
 Database
249. All data is ___ of something real
 I An abstraction
 II A representation
250. Which of the following option is true?
 I only
 II only
 Both I and II
 None

0304-1659294 38
CS614- Data Warehousing
Solved MCQ(S) Oct
From Midterm Papers (1 TO 22 Lectures) 26,2017

BY Arslan
V-U For Updated Files Visit Our Site : Www.pakeducation4u.com Updated.

Midterm Papers Solved MCQ(S) (1 TO 22 Lectures)


1. If there are N records in a table, then the selectivity of primary key column is

 1/N PG # 22
 N/2
 lg(N)

2. _________ is one class of decision support environment.

 OLAP PG # 30
 OLTP
 Data Cleansing
 ETL

3. The users of data warehouse are ________

 Decision makers
 Knowledge workers
 Both Knowledge workers and Decision makers PG # 18

The users of data warehouse are knowledge workers in other words they are decision makers in the organization.
4. The users of data warehouse are knowledge workers in other words they are _________ in the organization.

 Decision maker PG # 18

 Manager

 Database Administrator

 DWH Analyst

5. Effects of de-normalization on database performance are

 Unpredictable PG # 62

 Predictable

 Conventional

 Unsurprising

6. OLAP is used for analytical process. For analytical processing we need

 Multi-level aggregates PG # 74

 Record level access

 Data level access

 Row level access

7. Multidimensional databases typically use proprietary __________ format to store pre- summarized cube

structures.

 File PG # 79

 Application

 Aggregate

 Database
8. Normally Selectivity of query in OLTP system is

 High PG # 30
 Low
 Not measured

9. Normally Selectivity of query in data warehouse is

 High
 Low PG # 30
 Not measured

10. De-Normalization normally speeds up

 Data Retrieval PG # 51
 Data Modification
 Development Cycle
 Data Replication

11. De-Normalization normally slows down

 Data Retrieval
 Data Modification PG # 51
 Development Cycle
 Data Replication

De-normalization usually speeds up data retrieval, but it can slow the data modification processes.
12. Geography is a good example of

 One-dimensional Hierarchy
 Multidimensional Hierarchy PG # 52
 Non-Dimensional
 Linear Hierarchy

13. Partition elimination is not possible with

 Round-Robin PG # 66
 De-normalization
 Normalization

14. OLAP is Analytical processing instead of Transaction processing, and OLAP is a

 Physical database design


 Implementation technique
 Framework PG # 69

OLAP is Analytical Processing instead of Transaction Processing. It is also NOT a physical database design or
implementation technique, but a framework.

15. ER is a logical design technique that seeks to remove the _____________ in data.

 Redundancy PG # 98
 Normalization
 Anomalies

16. ER is a _______ design technique that seeks to remove the redundancy in data.

 Logical PG # 98
 Physical
 Data Dependent
 Transaction Dependent
17. Pre-computed _______ can solve performance problems

 Aggregates PG # 111
 Facts
 Dimensions

18. Extract, Transform, Load (ETL) process consist of steps which are _________________.

 Independent and interrelated PG # 131


 Independent or interrelated
 Dependent and interrelated
 Dependent or interrelated

19. In full extraction, data is extracted completely from the source system. Therefore there is no need to keep track
of changes to the ________

 Data Source PG # 133


 DWH
 Data Mart

20. Data Extraction from source systems is a difficult task because source systems are ___________

 Homogeneous

 Heterogeneous PG # 140

 Identical

 Standardized

21. Lexical errors fall in which type of class of anomalies

 Syntactically Dirty Data PG # 160

 Semantically Dirty Data

 Coverage Anomalies

 Missing Values Anomalies


22. The degree of similarity between two records, often measured by a numerical value between _______, usually

depends On application characteristics.

 0 and 1 PG # 169

 0 and 10

 0 and 100

 0 and 99

23. As consumers, human beings judge the quality of things during their life-time.

I Consciously

II Subconsciously

III Unconsciously

Which of the following statement is true?

 I Only

 II Only

 III Only

 I & II Only PG # 179

24. The growth of master files and magnetic tapes exploded around the mid- _______.

 1950s.
 1960s. PG # 12
 1970s.
 1980s.
25. Relational databases allow you to navigate the data in ____________ that is appropriate using the primary,
foreign key structure within the data model.

 Only One Direction


 Any Direction PG # 19
 Two Direction
 None of these

26. In ROLAP access to information is provided via relational database using _________ standard SQL.

 ANSI PG # 78
 Microsoft
 Oracle
 SAP

27. Which of the following statement is true? 1 GB is

 230 or 109 bytes PG # 15


30 6
 2 or 10 bytes
32 9
 2 or 10 bytes
 232 or 108 bytes
28. If w is the window size and n is the size of data set, then the complexity of merging phase in BSN method
is___________

 O (n)
 O (w)
 O (w n) PG # 171
 O (w log n)

29. The telecommunications data warehouse is dominated by the sheer volume of data generated at the call level
______ area.

 Subject PG # 35
 Object
 Aggregate
 Detail

30. “Decision making is an iterative process; which must involve the users” is a classic statement of

 OLAP PG # 69

 ATM

 OLTP

 Data Mining

31. Cube is a logical entity containing values of a certain fact at a certain aggregation level at _____________ of a
combination of dimensions.

 An Intersection PG # 88
 A Union
 A Subtraction
 A Subset
32. After implementing Change Data Capture, the advantage we have is that, data is able to be integrated and
transformed __________

 In-flight Click Here For Reference Detail PG # 152


 Off-flight
 Stored Data
 Over-flight

Finally data is able to be integrated and transformed "in-flight”. Once the update/transaction data has been pulled from
the log tape, the DWH is free to re-sequence, reformat, convert, merge, summarize, etc.

33. All data is ______________ of something real.


I An Abstraction
II A Representation
Which of the following option is true?

 I Only PG # 180
 II Only
 Both I & II
 None of I & II

34. _____________ queries deal with number of variables spanning across number of tables (i.e. join operations)

and looking at lots of historical data.

 OLTP

 DBMS

 DSS PG # 21

 ETL

35. In data warehouse, a query results in retrieval of hundreds of records from very large table. the ratio of number

of records retrieved to total number of records present is high and selectivity is:

 Low PG # 22

 High

 Average

 Non computable
36. At small scale e.g. hundreds of rows, difference between performance of complexity algorithms “O(n lg n)”
2
and “O(n ) is not _____________.

 Noticeable

 Small

 Symmetric

 Asymmetric PG # 32

37. Collapsing tables can be done on the ___________ relationship(s)

 Only One-to-One

 Only Many-to-Many

 Only One-to-Many

 Both One-to-One and Many-to-Many PG # 52

38. HOLAP provides a combination of relational database access and cube data structures. The goal is to get the
best of both MOLAP and ROLAP i.e.

 Scalability and high performance PG # 78

 Backward compatibility to RDBMS

 Forward compatibility to new DBMS

 Usability of new DBMS


39. Primary key is repeated in ___________ splitting.

 Horizontal
 Vertical PG # 56
 Cross
 Diagonal

40. “Change Data Capture” is one of the challenging technical issues in __________

 Data Extraction PG # 150


 Data Loading
 Data Transformation
 Data Cleansing

41. Merging information is one of the major types of ________________

 Transformation PG # 152, 153


 Extraction
 Loading
 Updation

Data merging is part of data transformation where multiple values are summarized into single summarized value.
42. Which of the following is NOT an example of a “Dimension”?

 Date

 Region

 Product

 Units sold PG # 74

FACTS: Quantitative values (numbers) or “measures.”


e.g., units sold, sales $, Co, Kg etc.
DIMENSIONS: Descriptive categories.
e.g., time, geography, product etc.
DIM often organized in hierarchies representing levels of detail in the data
(e.g., week, month, quarter, year, decade etc.).

43. Multi-value dependencies are removed in _______________

 1st Normal form

 2nd Normal form

 3rd Normal form


th
 4 Normal form PG # 48

44. Which of the following is not one of the characteristics of a Data warehouse?

 Ad-hoc access

 Complete repository

 Historical data

 Volatile PG # 18

A complete repository of historical corporate data extracted from transaction systems that is available for ad-hoc
access by knowledge workers
45. To handle dimensions that require the aggregation of multiple data quality indicators, which of the following

operations can be applied:

 Minimum or Maximum PG # 188

 Complex Ratio

 Aggregate Average

 Weighted Ratio

46. Full and Incremental extraction techniques are types of ___________

 Logical Extraction PG # 133

 Physical Extraction

 Both Logical Extraction and Physical Extraction

 Semantic Extraction

The two logical data extraction types are full and incremental extraction techniques.

47. “The environment is smart enough to develop or compute higher level aggregates using lower level or more

detailed aggregates”. Which of the following approach is described by the above statement?

 Aggregate awareness PG # 87

 Cube partitioning

 Indexing

 MOLAP cube aggregation


48. Suppose the amount of data recorded in an organization is doubled every year. This increase is

 Linear
 Quadratic
 Logarithmic
 Exponential PG # 15

49. ER Model can be simplified in -------- ways

 One
 Two PG # 103
 Three
 Four

There are actually two ways of “simplifying” the ER model i.e. (i) De -normalization and (ii) Dimensional Modeling.

50. Non recording facts have a disadvantage that it has

 Lack of Information PG # 120


 Redundant Information
 Repeated Information
 Normalized Information

51. Fact-less fact table is a fact table without numeric fact columns. It is used to capture relationship between
__________

 Dimensions PG # 121
 Attributes
 Tables
 Facts
52. A cube is not a data warehouse, it is a _______________

 Data Mart PG # 131


 Data Extraction
 Data Loading
 Query Manager

53. If we remove the modification anomalies the table comes in ________ form

 1NF
 2NF
 3NF PG # 47
 4NF

54. Modification anomalies occur in _________ form

 1NF
 2NF PG # 45
 3NF
 BCNF

55. In MOLAP physically build cubes for direct access, support is not available for ______ SQL.

 ANSI PG # 78
 Microsoft
 Oracle
 SAP

MOLAP physically builds “cubes” for direct access - usually in the proprietary file format of a multi-dimensional
database (MDD) or a user defined data structure. Therefore ANSI SQL is not supported.
56. ______ is the lowest level of detail or the atomic level of data stored in the warehouse.

 Aggregate
 Cube
 Grain PG # 111
 Virtual Cube

57. Redundancy causes _________ anomalies

 Update PG # 43
 Select
 Both Update & Select
 None of these

58. Which of the following statement is true? 1 PB is

 252 or 1013 bytes


50 15
 2 or 10 bytes PG # 15
50 10
 2 or 10 bytes
48 12
 2 or 10 bytes

59. Node of a B-Tree is stored in memory block and traversing a B-Tree involves ______ page faults.

 O (n)
 O (n2)
 O (n lg n)
 O (log n) PG # 22
60. One major goal of horizontal splitting is

 Splitting rows for exploiting parallelism


 Splitting columns for exploiting parallelism
 Splitting schema for exploiting parallelism
 Spreading rows for exploiting parallelism. PG # 46

61. Fact-less fact table is a fact table without numeric fact columns. It is used to capture relationship between
__________

 Dimensions PG # 121
 Attributes
 Tables
 Facts
62. ________ gives total view of an organization

 OLTP
 Data warehouse PG # 16
 OLAP
 Data base

63. Experience showed that for a single pass magnetic tape that scanned 100% of the records, only _________ of
the records, sometimes even were actually required.

 5% PG # 12
 30%
 50%
 80%

64. In _________ system, the contents change with time.

 OLTP PG # 20
 DSS
 ATM
 OLAP

65. It is observed that every year the amount of data recorded in an organization

 Doubles PG # 15
 Triples
 Quartiles
 Remains same as previous year
66. Normalized design is likely to perform much faster than de-normalized design for queries that probe

 Master table only PG # 64


 Details tables only
 Both master and detail tables

67. Partition elimination is not possible with

 Round-Robin PG # 66
 De-normalization
 Normalization

68. Which of the following is not a technique of 'De-Normalization'?

 Pre-joining
 Splitting Tables
 Adding Redundant Column
 ER Modeling PG # 52

69. Growth of master files and magnetic tapes exploded around

 mid-1970s
 mid-1980s
 mid-1960s PG # 12
 mid-1950s

70. If each cell of Relation R contains a single value (no repeating values) then it is confirmed that

 Relation R is in 1st Normal Form PG # 43


 Relation P is in 2nd Normal Form
 Relation R is in 3rd Normal Form
 Relation R is in 3rd Normal Form but not in 2nd Normal Form
71. Table collapsing technique is applied in case of:

 One-to-one relation or many-to-many relation PG # 52


 One-to-many relation
 Many -to-one relation
 None of the given options

72. ROLAP provides access to information via a relational database using

 ANSI standard SQL PG # 78


 Proprietary file format
 Comma Separated Values
 All of the given options

73. ____________ involves splitting a table by columns so that a group of columns is placed into the new table and
the remaining columns are placed in another new table.

 Vertical splitting PG # 56
 Horizontal splitting
 Adding redundant column
 None of the given options

74. In ____________ nested-loop join of quadratic time complexity does not hurt the performance

 Typical OLTP environments PG # 22


 Data warehouse
 DSS
 0LAP
75. Taken jointly, the extract programs or naturally evolving systems formed a spider web, also known as

 Distributed Systems Architecture


 Legacy Systems Architecture PG # 14
 Online Systems Architecture
 Intranet Systems Architecture

76. The relation R will be in 2nd Normal Form if

 It is in 1NF and each cell contains single value.


 It is in 1NF and each non key attribute is dependent upon entire primary key. PG # 44
 It is in 1NF and each non key attribute is dependent upon a single column of composite primary key.
 It is in 1NF and Primary key is composite.

Every non-key column is fully dependent on the PK.

Note: Give me a feedback and your Suggestion also If you find any mistake in mcqz plz
inform me Viva Contact us Page on our Site. And tell me your answer with references.
For More Solved Papers By Arslan Visit Our Website :
Www.pakeducation4u.com

You might also like