(WWW - Entrance-Exam - Net) - ICFAI University MBA Data Warehousing and Data Mining (MB3G1IT) Sample Paper 2
(WWW - Entrance-Exam - Net) - ICFAI University MBA Data Warehousing and Data Mining (MB3G1IT) Sample Paper 2
5. NEXT inc., a reputed BPO company is maintaining the database of the customers office & home
phone
numbers. But it is having a problem of wasting space when the customers are using either of them. So
which of
the following processes can help the company in eliminating the redundant data?
(a) Structuring
(b) Randomizing
(c) Analyzing
(d) Normalizing
(e) Actualizing.
6. An accounts database has a table with invoices and each invoice is associated with a particular
supplier. Supplier
details (such as address and name) are kept in a separate table; each supplier is given a 'supplier
number' to
identify them. Each invoice record has an attribute containing the supplier number for that invoice.
Identify the
1
primary key, foreign keys in the tables.
(a) Supplier number in the supplier table is foreign key and InvoiceNumber in the Invoices table is the
primary key
(b) Supplier number in the supplier table is primary key and InvoiceNumber in the Invoices table is the
foreign key
(c) Supplier number in the supplier table is primary key, InvoiceNumber in the Invoices table is the
primary
key and Supplier number in Invoices table is the foreign key
(d) Supplier number in the supplier table is foreign key and InvoiceNumber, Supplier number in
Invoices
table are primary keys
(e) Supplier name in the supplier table is the primary key, InvoiceNumber in the Invoices table is the
primary key and Supplier number in Invoices table is the foreign key.
7. The ADSM backup software package was produced by
(a) HP
(b) Sequent
(c) IBM
(d) Epoch
(e) Legato.
8. A perceptron consists of a simple three-layered network, with output units called
(a) Photo-receptors
(b) Associators
(c) Responders
(d) Acceptors
(e) Rejectors.
9. In an organization, the relation between projects and employees is
(d) 4
(e) 5.
22. Which of the following type of knowledge is the information that can be easily retrieved from
databases using
query tools?
(a) Shallow knowledge
(b) Multi-dimensional knowledge
3
(b) Multi-dimensional knowledge
(c) Hidden knowledge
(d) Deep knowledge
(e) Tacit knowledge.
23. A petabyte equals to
(a) 1024 terabytes
(b) 1024 gigabytes
(c) 1024 megabytes
(d) 1024 kilobytes
(e) 1024 bytes.
24. The non-trivial extraction of implicit, previously unknown and potentially useful knowledge from
data is known
as
(a) Data selection
(b) Data mirroring
(c) Data cleaning
(d) Knowledge discovery in databases
(e) Data design.
25. The theory in which information content of the message is related to the probability that a certain
message will
occur is
(a) Shannons communication theory
(b) Kolmogorov complexity theory
(c) Rissanen theory
(d) Freuds theory
(e) Kohonen theory.
26. An approach to a problem that is not guaranteed to work but performs well in most cases is
(a) Heuristics
(b) Enumeration
(c) Falsification
(d) Naive prediction
(e) Enrichment.
27. Which of the following is not a stage in Knowledge Discovery Process?
(a) Data selection
(b) Cleaning
(c) Enrichment
(d) Reporting
(e) Data encapsulation.
28. Which of the following statements is/are true about Online Analytical Processing (OLAP)?
I. OLAP tools do not learn.
II. OLAP creates no new knowledge.
III. OLAP is more powerful than Data mining.
IV. OLAP cannot search for new solutions.
(a) Only (I) above
(b) Only (III) above
(c) Both (I) and (II) above
(d) Both (II) and (III) above
(e) (I), (II) and (IV) above.
29. In Freuds theory of psychodynamics, the human brain was described as a
(a) Decision tree
(b) Neural network
(c) Learning
(d) Knowledge
(e) Visualization technique.
30. An individual learns how to carry out a certain task by making a transition from a situation in
which the task
cannot be carried out to a situation in which the same task can be carried out under the same
circumstances. The
4
given definition is referred to as
(a) Learning
(b) Knowledge
(c) Machine learning
(d) Learning algorithm
(e) Meta learning.
Section B : Caselets (50 Marks)
This section consists of questions with serial number 1 6.
Answer all questions.
Marks are indicated against each question.
Detailed explanations should form part of your answer.
Do not spend more than 110 - 120 minutes on Section B.
Caselet 1
Read the caselet carefully and answer the following questions:
1. Why KPC insurance company need tools to manage Data warehouse? Explain. ( 5 marks)
2. If you are the manager of the KPC insurance company, what are the issues you will
consider before buying the ETL tool? ( 6 marks)
KPC insurance is implementing new data warehousing and reporting technology
during a crisis.
Many software companies may experience problems as a result of a fire, natural
calamity or massive power outage. If software companies and all of the resources it
has for day-to-day operations are no longer available, it would wreak havoc.
Core mind, Worlds one of the largest department store retailer with more than 500
stores, generated a tremendous amount of data spread across a number of
operational systems. Core mind is implementing a Teradata Warehouse with the
Teradata Database. One sudden day it was affected by the cyclone that hit Kolkata,
but it was able to restore facilities within 60 hours because it had a well-defined
Disaster Recovery Plan (DRP) and procedures.
Core mind is putting together a disaster recovery plan to ensure that its large global
customers continue to get round-the-clock support, even if the subcontinent goes to
war. It wants to set up disaster recovery sites in Singapore and Canada. The plan is
to move employees to these sites and resume operations in the advent of an
emergency.
END OF
CASELET
2
Caselet 3
Read the caselet carefully and answer the following questions:
5. Assume that you are a Tester at Allina, explain how you will test Backup Recovery. ( 12 marks)
6. What other factors (Other than mentioned in the caselet) Allina might have been
considered for successful implementation of data warehousing solution? Explain. ( 5 marks)
Allina Health System Implements Data Warehouse
Minneapolis-based Allina Health System is a non-profit healthcare system serving
one million people living in Minnesota. They vertically integrated healthcare
system includes 13,0000 physicians and 22,000 employees who own and manage
19 hospitals, 57 clinics and seven nursing homes networked across the country.
Allina provides people with a life time of healthcare options and full continuum of
care-from prevention and wellness services such as health screening and
immunizations to high-quality and technologically advanced inpatient and
outpatient services.
6
Allina is pressured from all sides to reduce costs, holding the premiums while
providing its patients with best treatment possible. It must also integrate
information systems and business practices of multiple organizations it has acquired
through various mergers in the recent past. To meet these challenges, management
has decided to develop and market a data warehouse strategy across the country.
The goal is to enable Allina to pull information together and integrate it in ways
never done before; for example extracting cost information from hospital and the
health plans, comparing best practices in treatments and matching cost of level of
service.
Meeting this was a data warehouse challenge. The data modeling effort had to
ensure that each data mart of data warehouse could be tied together logically and
physically. The key elements of the success of the project first a multitier database
was developed so that so that both summary and detailed information is available.