Chapter 3 - Data Modelling Concepts
Chapter 3 - Data Modelling Concepts
Chapter 3
Objectives:
Understand the concept of data modeling and why data models are important
Know the basic building blocks of data modeling
Understand the concept of business rules and how does it affect to the design of a
database
Know the different types of data modeling used in database design
Know the levels of data abstraction
A data model pertains to the organization data elements and standardizes how the data
elements relate to one another. Data models describe structured data for storage in data
management systems.
A very important aspect about a data model is that when it is implementation – ready, it should
contain at least the following components:
A description of the data structure that will store the end – user data.
A set of enforceable rules to guarantee the integrity of the data.
A data manipulation methodology that supports the real – world data transformations.
Data models can facilitate interaction among the designer, the applications programmer, and
the end user. This means that a well – developed data model can even foster improved
understanding of the organization for which the database design is developed.
A comprehensive and optimized data model helps create a simplified, logical database that
eliminates redundancy, reduces storage requirements, and enables efficient retrieval. It also
equips all systems with a ‘single source of truth’ – which is essential for effective operations and
provable compliance with regulations and regulatory requirements.
Entity. It represents a particular type of object in the real world. Because an entity
represents a particular type of object, entities are “distinguishable”—that is, each entity
occurrence is unique and distinct. (e.g. STUDENT )
To properly identify entities, attributes, relationships, and constraints in the database system
you are developing, the first step would be always identifying clearly its business rules.
Examples:
But why business rules are essential to database design? The following reasons may
explain the said query:
The following data models basically represent the “schools of thought” as to what a database
is, and what it should do:
Network Model. Unlike the hierarchical model, the network model allows a record to
have more than one parent. It was created based from the hierarchical model due to
the following reasons:
While the network database model is generally not used today, the definitions of
standard database concepts that emerged with the network model are still used by
modern data models. And these are:
Each row in a relation is called a tuple. Each column represents an attribute. The
relational model also describes a precise set of data manipulation constructs based
on advanced mathematical concepts.
The rise to dominance of the relational data model is due in part to its powerful and
flexible query language such as the Structured Query Language or SQL. Structured
Query Language (SQL) allows the user to specify what must be done without
specifying how it must be done.
Object – Oriented Model. Object-oriented data model (OODM) is the basis for the
object-oriented database management system (OODBMS). OODM is said to be a
semantic data model in which it describes the meaning of its instances.
Object described by its factual content like the relational model’s entity. It includes
information about relationships between facts within object, and relationships with
other objects unlike the relational model’s entity. Object becomes a basic building
block for autonomous structures.
In the early 1970s, the American National Standards Institute (ANSI) Standards Planning
and Requirements Committee (SPARC) defined a framework for data modeling based on degrees
of data abstraction. The ANSI/SPARC architecture (as it is often referred to) defines three levels
of data abstraction: external, conceptual, and internal.
External Model. The external model is the end users’ view of the data environment.
The term end users refer to people who use the application programs to manipulate
the data and generate information.
Each business unit is subject to specific constraints and requirements, and each
one uses a data subset of the overall data in the organization. Therefore, end users
working within those business units view their data subsets as separate from or
external to other units within the organization. Because data are being modeled, ER
diagrams will be used to represent the external views.
Conceptual Model. The conceptual model represents a global view of the entire
database as viewed by the entire organization. That is, the conceptual model
integrates all external views such as entities, relationships, constraints, and processes
into a single global view of the data in the enterprise. Having identified the external
views, a conceptual model is used, graphically represented by an entity – relationship
diagram (ERD), to integrate all external views into a single view.
Also known as a conceptual schema, it is the basis for the identification and high-
level description of the main data objects, avoiding any database model –specific
details.
The most widely used conceptual model is the ER model. Remember that the ER
model is illustrated with the help of the ERD, which is, in effect, the basic database
blueprint. The ERD is used to graphically represent the conceptual schema.
Internal Model. Once a specific DBMS has been selected, the internal model maps the
conceptual model to the DBMS. The internal model (also known as the logical
model) is the representation of the database as “seen” by the DBMS. In other words,
the internal model requires the designer to match the conceptual model’s
characteristics and constraints to those of the selected implementation model.
Physical Model. The physical model operates at the lowest level of abstraction,
describing the way data are saved on storage media such as disks or tapes. The
physical model requires the definition of both the physical storage devices and the
(physical) access methods required to reach the data within those storage devices,
making it both software and hardware dependent.
The storage structures used are dependent on the software such as the DBMS
and the operating system and on the type of storage devices that the computer can
handle. The precision required in the physical model’s definition demands that
database designers who work at this level have a detailed knowledge of the hardware
and software used to implement the database design.