0% found this document useful (0 votes)
32 views

MIT-Data & Information

The document discusses various types of data that are categorized in an enterprise data environment. It describes transaction data, master data, reference data, operational data stores (ODS), metadata, and unstructured data. It provides examples of each type of data. The document also discusses the information value chain and how data moves from transaction systems to data warehouses and data marts. Finally, it covers topics like the value of information, the difference between data, information, and knowledge, and dimensions of data quality.

Uploaded by

Aindrila Nandy
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
32 views

MIT-Data & Information

The document discusses various types of data that are categorized in an enterprise data environment. It describes transaction data, master data, reference data, operational data stores (ODS), metadata, and unstructured data. It provides examples of each type of data. The document also discusses the information value chain and how data moves from transaction systems to data warehouses and data marts. Finally, it covers topics like the value of information, the difference between data, information, and knowledge, and dimensions of data quality.

Uploaded by

Aindrila Nandy
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 46

Managing Innovation and Technology

A holistic model of an Enterprise's Information Infrastructure

Data categorization in an Enterprise Data

Transaction Data

Transactional data
describe an internal
or external event
that takes place as
an organization
conducts its
business.

Master Data
Master data
describes the
people, Product,
Location, Account
those are Assets,
slow changing and
involved in an
organizations
business

Reference Data

ODS

Reference data are


sets of values that
are referenced to by
systems,
applications, data
stores and processes

ODS is designed to
integrate disparate
data from multiple
sources so that
business
operations,
analysis and
reporting can be
carried out.

Metadata
Metadata literally means data about data. Metadata describe, or characterize other data and
make it easier to retrieve, interpret, or use information.

Unstructured Data
3

Data categorization in an Enterprise Data


Transaction Data

Master Data

Reference Data

ODS Data

Examples include
sales orders,
invoices, purchase
orders, shipping
documents,
passport
applications, credit
card payments, and
insurance claims.

Examples include
people (e.g.,
customers, employees,
vendors, suppliers),
places (e.g., locations,
sales territories,
offices), and things
(e.g., accounts,
products, assets,
document sets).

Examples include lists


of valid values, code
lists, status codes, state
abbreviations,
flags, product types,
gender, Post Code, STD
Code, Unit of Measure,
Customer Category,
Material Type

This is the place


where most of the
data used in current
operation is housed
before it's
transferred to the
data warehouse for
longer term storage
or archiving.

Metadata
Technical metadata are metadata used to describe technology and data structures. Examples of technical metadata are field
names, length, type, lineage, and database table layouts.
Business metadata describe the nontechnical aspects of data and their usage. Examples are field definitions, report names,
headings in reports and on Web pages, application screen names.
Audit trail metadata are a specific type of metadata, typically stored in a record and protected from alteration, that capture
how, when, and by whom the data were created, accessed, updated, or deleted. Audit trail metadata are used for security,
compliance, or forensic purposes. Examples include timestamp, creator, create date, and update date.

Unstructured Data
4

The Information Value Chain


Master
Data
System

Transactional
Systems

Operational
Data Stores
(ODS)

Data
Warehouse
(DWH)

Data Marts
(DM)

Master

Master
Transaction

Master
Transaction

Master
Transaction
Analytic

Master
Transaction
Analytic

History

Yes

No

Limited

Yes

Yes

Integration

Yes

No

Limited

Yes

Yes

Real Time

Close to Real
time

Sometimes Daily
Weekly
Monthly

Sometimes
Daily, Weekly,
Monthly

Fully Integrated
Application
Neutral

Analytical,
Derived and
Summarized,
Application
specific

Limited to
Derived Data

Limited to
Derived Data

Data Class

Data
Currency

Real Time

Data Scope

Fully
Integrated
Application
Neutral

Local
Application
Specific

Integrated
(Limited to a
few
applications)

Yes

Yes

No

Data
Creation

Data & Information

Value of Information

Information Value

Value of information is directly linked to how it helps


decision makers achieve their organizations goals.

Operational
Applications

Transaction Query &


Data
Reporting
(OLTP)

OLAP

Descriptive
Modeling

Predictive
Modeling

Optimization

Information Sophistication
OLAP: Online analytical processing

Data, Information, and Knowledge


Data: raw facts
Information:
Is data in context. Without
context, data is meaningless; we
create meaningful information
by interpreting the context
around data. The resulting
information then guides our
decisions.

Knowledge:
is understanding, awareness and
recognition of a situation and
familiarity with its complexity. It
may also include assumptions
and theories about causes.

Knowledge
+ Patterns & Trends
+ Relationships
+ Assumptions

Information
+ Definition
+ Format
+ Timeframe
+ Relevance

Data

Important Events

General ledger
Budgeting
Human Resource Management and Payroll
Customer Relationship Management
Forecasting, Materials Management, Production Planning
Supply Chain Management
Order Processing, Purchase and Inventory Management
Logistics, Distribution, Fulfillment
Training

The Characteristics of Valuable Information

10

The Characteristics of Valuable Information (continued)

11

How do we define Data Quality


Measurable Dimensions for Data Quality

Dimension

Description

Accuracy

Does the data accurately represent reality or a verifiable source?

Integrity

Do broken links exist between data that should be related?

Consistency

Is there a single representation of data?

Completeness

Is any key information missing?

Uniqueness

Is the data value unique? i.e. no duplicate values or records

Accessibility

Is the data easily accessible, understandable, and used consistently?

Precision

Is data stored with the precision required by the business?

Timeliness

Is the information update frequency adequate to meet the business


requirements?

Data Quality Management involves more than just addressing historical data quality issues through
data profiling and re-engineering. It involves preventing these issues from occurring in the first place.

12

How do we define Data Quality


Intangible Dimensions
Dimension

Description

Relevance

Every piece of information stored is important in order to get a true business


representation

Usability

The stored information is usable by the organization with ease

Usefulness

The stored information is applicable for organization

Believability

The level to which the data is regarded as true and credible

Unambiguous

Each piece of data has a unique meaning, and can be easily comprehended

Objectivity

Data is objective, unbiased and impartial i.e., it does not depend on the
judgment, interpretation, or evaluation of people

Data Quality Management involves more than just addressing historical data quality issues through
data profiling and re-engineering. It involves preventing these issues from occurring in the first place.

13

Information Architecture Components

Web Services (SOA)

Data Movement

Data Exchange

Data Distribution

Data Governance

Enterprise Data Strategy

A framework providing a structured description of an enterprises information assets and the


relationship of those assets to business processes, business management, and IT systems.

Shared Data Repository


Data Model

Business Performance Management


.

Reports
Enterprise Data Warehouse /
Data Marts
Transaction Data Repository

Data Quality Management

Data Staging Area

Semantic Metadata Repository


Enterprise Data Inventory

14

Traditional Organizational Structure

A hierarchical structure
Major department heads report to a president or top-level manager
Managerial pyramid shows the hierarchy of decision making and authority

15

The Role of Information Systems in Business Today


There is a growing interdependence between a firms information systems
and its business capabilities. Changes in strategy, rules, and business
processes increasingly require changes in hardware, software, databases, and
telecommunications. Often, what the organization would like to do depends
on what its systems will permit it to do.

16

Organizing Data in a Traditional File Environment


The use of a traditional approach to file processing encourages each
functional area in a corporation to develop specialized applications and files.
Each application requires a unique data file that is likely to be a subset of the
master file. These subsets of the master file lead to data redundancy and
inconsistency, processing inflexibility, and wasted storage resources.

17

Types of Business Information Systems


Systems from a functional perspective
Sales and marketing systems
Manufacturing and production systems
Finance and accounting systems
Human resources systems
Systems from a constituency perspective
Transaction processing systems
Management information systems and decision-support systems

Executive support systems


Relationship of systems to one another
18

Information, Decisions and Management


Strategic
group of executives develop overall organizational goals, strategies, policies, and objectives as
part of a strategic planning process
Tactical
managers and business professionals in self-directed teams develop short- and medium-range
plans, schedules and budgets and specify the policies, procedures and business objectives for
their subunits
Operational
managers or members of self-directed teams develop short-range plans such as weekly
production schedules

19

Strategic Management

The People
Board of Directors
Chief Executive Officer
President

Decisions
Develop Overall Goals
Long-term Planning
Determine Direction
Political
Economic
Competitive

20

Tactical Management

People
o Business Unit Managers
o Vice-President to MiddleManager

Decisions
o
o
o
o
o
o

short-medium range planning


schedules
budgets
policies
procedures
resource allocation

21

Operational Management

People
o Middle-Managers to
o Supervisors
o Self-directed teams

Decisions
o
o
o
o
o
o

short-range planning
production schedules
day-to-day decisions
use of resources
enforce polices
follow procedures

22

Types of Business Information Systems


The various types of systems in the organization have interdependencies. TPS
are major producers of information that is required by many other systems in
the firm, which, in turn, produce information for other systems. These
different types of systems are loosely coupled in most business firms, but
increasingly firms are using new technologies to integrate information that
resides in many different systems.

23

Management Information Systems (MIS)


Definition:
An information system that produces information products that
support many of the day-to-day decision-making needs of
managers and business professionals.

MIS provides information about the performance of an


organization
Think of entire company (the firm) as a system.
An MIS provides management with feedback

24

Typical MIS Reporting


Periodic Scheduled Reports
Example: Monthly Financial Statements
Exception Reports
Example: List of items out of stock
These reports contain information but they might not directly help you
determine the best decision to make.

Demand Reports and Responses


Available whenever a manager needs them, updated in real-time.
Push Reporting
Information is pushed to a managers computer
Example: Report is pushed every time a supplier is late with a shipment
MIS Reporting is all about giving managers feedback and doesnt necessarily
help directly with decision making.

25

Decision Support Systems (DSS)


Definition:
Computer-based information systems that provide interactive
information support to managers and business professionals
during the decision-making process using the following to make
semi structured business decisions

Analytical models
Specialized databases
A decision makers own insights and judgments
An interactive, computer-based modeling process

26

MIS vs. DSS: Big Differences

In a DSS, a manager generates the report through an


interactive interface
Flexible & Adaptable reports
DSS Reporting is produced through analytical modeling, not
just computing an average, or plotting a graph.
Business Models are programmed into a DSS

27

Interrelationships Among Systems

28

DSS - Perceived benefits

o
o
o
o
o
o

Decision Quality
Improved Communication
Cost Reduction
Increased Productivity
Time Savings
Improved Customer And Employee Satisfaction

29

TPS, MIS, DSS & EIS:

TPS

DSS
Operational
Management
Decisions

MIS

Tactical
Management
Decisions

Strategic
Management
Decisions

EIS

30

Information Characteristics

Decision Structure

Unstructured

Semi-structured

Strategic
Management

Ad Hoc
Unscheduled
Summarized
Infrequent
Forward Looking
External
Wide Scope

Tactical Management

Structured
Operational Management

Pre-specified
Scheduled
Detailed
Frequent
Historical
Internal
Narrow Focus

31

Enterprise Data Principles

32

1 - Manage data as a valuable Enterprise Asset


Definition

Data needs to be managed as an Enterprise Asset, rather than as an asset of any one group or
department
Rationale
Data is a valuable asset providing reference and operational information in support of making
business decisions, transactions between departments, divisions, as well as with trading
partners and customers.
Effective management and operation of the organization and successful provision of goods
and services depend on accurate, timely and secure data.
Data usually originates in one business process and/or department, but it may be used by
other business processes and departments, as well as beyond the organizational boundary
into the extended organization, including trading partners.
Lack of a common, Enterprise wide approach to Data results in separate islands of data,
created in different departments and managed according to varying and disparate policies and
principles.
33

2 - Maximize data sharing


Definition

Data collected and stored must be readily available to all (i.e. shared with) authorized users,
whenever and wherever needed

Rationale
Historically, data has been seen to belong within specific application and within organizational
boundaries. This created an artificial notion of our data - data in fact transcends the
applications and the operational, and must be treated as an Enterprise Asset rather than
belonging to a specific application or user group.
Creating mechanisms which enable easy and consistent sharing of data between applications
and user groups will reduce application integration complexity.

34

3 Ensure single Reference Source of Data


Definition

There will be a single, clearly identified, data storage location for each managed Enterprise
Data element.

Rationale
A single data source is critical to achieving comprehensive data integrity.

A single source reduces confusion, complexity, and therefore data management costs.
A single data source enables a more effective data ownership practice

35

4 - Replicate data
Definition

The organization will provide replicated data to enhance access performance and/or
specialized analysis needs. However every data item must have one and only one source of
reference, which is singularly used as the source for all replications. The replicated data should
not be persisted beyond the original usage time-window.

Rationale
Business value, access performance demands, or specialized analysis needs may require data
replications.
System response performance impacts system usability and therefore complex reporting and
data analyses should be performed on replicated data and not on the source data.
As the organization opens up its data and information repositories to a broader audience via
new channels (e.g., the Internet) the organization must ensure its original source data is
protected from accidental corruption.

36

5 - Validate data at the source


Definition

Data must be validated at the source (i.e., at data entry or collection), prior to being recorded
in the reference source repository

Rationale
Validation Rules prevent entry and propagation of invalid data, which in turn can lead to
adverse business decisions, or erroneous business transactions.
Validating data at the source is critical to improve and maintain data integrity.

37

6 - Establish Data Ownership


Definition

Each data item must have a clearly defined Owner who is accountable and responsible for the
quality of the data, including its definition.

Rationale
Clearly defined Ownership provides clarity in data definition, reducing unnecessary
complexity.
Data Ownership ensures responsibility and accountability for the maintenance of data integrity
and protection of strategic Enterprise Assets.
Clear data ownership ensures accountability for the quality, accuracy and timeliness of data for
the source.

Data ownership identifies the single point of authority for the specification of the quality
requirements for each data element.

38

7 - Automate data entry


Definition

Automated data entry techniques and processes will be used to minimize data entry errors
where economically and technically viable.

Rationale
Manual data entry is inherently prone to error.

Retail industry innovations such as bar coding, Radio Frequency Identification (RFID), Point Of
Sale (POS) scanning facilitates automated data entry.
Data accuracy is critical to support decision making in the retail industry.
Costs of manual data entry increase as the costs of human resources increase.

39

8 - Design data storage facilities for scalability


Definition

Data storage will be designed to facilitate incremental capacity increases in response to


increased data storage demand

Rationale
Changing business strategies, business processes, service delivery and data access channels
are predicted to lead to significant increases in data storage capacity

Early identification of changing data storage requirements facilitates capacity meeting demand
in a planned and cost effective manner

40

9 - Encrypt data before transmission over unsecured infra.

Definition

Data classified as sensitive must be encrypted, before it is transmitted over unsecured


infrastructure in accordance with the Data and Information security policy

Rationale
Breaches of data security increase exposure to potential financial loss.

Secure transmission of sensitive personal and corporate information is consistent with


legislative, regulatory and Corporate policies

41

10 - Govern access to data


Definition
Access to Data must be a subject to Data Management and security policies. Only authorized users
and entities shall be granted access to the Data

Rationale
Data must be managed as a critical strategic asset and access to data must be managed according to
the Corporate Data Management Policies, as defined by the Information Architecture.
Regular breaches to the Data Management Policies, regarding data access, result in unsecured
sharing of data or excessive control of access to data. Both of these extremes are undesirable.
The organization serves a variety of user groups, with differing requirements. Accordingly, the
Corporate Data Management Policies, must be flexible enough to provide access to corporate
information whilst also securing it.
Sensitive and/or classified data often needs to be protected and access to it restricted, according to
the sensitivity.

Different data elements have different levels of sensitivity and therefore access is allowed at
different levels.

42

11 - Track data errors back to the source


Definition
Data quality will be traced to the source of the error and permanent solutions will be sought to
prevent reoccurrence

Rationale
This is necessary to minimize sources of data error
Error-tracking-back-to-the-source approach is critical for a successful data integrity program.
The approach will assist to minimize the cost of the data maintenance over time.

43

12 - Manage all data sourced externally


Definition
All data must be managed across the end-to-end value chain, including data sourced externally

Rationale
Integration must be accomplished across the complete data value chain which includes all trading
partners.
Externally sourced, competitive information has value to the organization and therefore should also
be managed

44

13 - Enter the data only once


Definition
A data element instance will be entered only once in a computer system

Rationale
Repeated data entry is costly.
Event driven, single data creation enables optimal reuse of data and information.
Repeated data entry causes data integrity problems

45

14 - Manage the data life cycle


Definition
Data Owners should manage data items and data subject areas for their complete life cycle (i.e. from
creation through to disposal), according to the value of data items and data subject area to the
organization. Suitable processes should be implemented to manage each data item throughout its
life cycle.

Rationale
The life cycle of data item/subject area varies in time and therefore the organization must be
cognizant of the varying data item/subject area life cycles and the current life cycle stage each data
item/subject area is in.

46

You might also like