0% found this document useful (0 votes)
70 views

4a - Database Systems

Here are some examples of data mining that were discussed in the document: - Market basket analysis to identify new product bundles - Finding root causes of quality or manufacturing problems - Preventing customer attrition and acquiring new customers - Cross-selling to existing customers - Profiling customers with more accuracy The document explained that data mining involves analyzing large amounts of data stored in data warehouses to reveal hidden patterns and trends. It uses techniques like pattern recognition, statistics, and mathematics to extract strategic business insights from vast amounts of business data. Some common uses are those listed above like improving customer profiles, increasing sales, and solving operational issues.
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
70 views

4a - Database Systems

Here are some examples of data mining that were discussed in the document: - Market basket analysis to identify new product bundles - Finding root causes of quality or manufacturing problems - Preventing customer attrition and acquiring new customers - Cross-selling to existing customers - Profiling customers with more accuracy The document explained that data mining involves analyzing large amounts of data stored in data warehouses to reveal hidden patterns and trends. It uses techniques like pattern recognition, statistics, and mathematics to extract strategic business insights from vast amounts of business data. Some common uses are those listed above like improving customer profiles, increasing sales, and solving operational issues.
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 35

Problems in Traditional File System

- Data Redundancy
- Data Inconsistency
- Data Isolation
- Data Security
- Data Integrity
- Application Dependency
Fundamental Data Concepts

- Character: most basic logical data element


- Field: data item
- Record: collection of attributes that describe a
single instance of an entity
- File (Table): group of related records
- Database: integrated collection of logically related
data elements
Fundamental Data Concepts
Database Structures

Database management system (DBMS)


packages are designed to use a specific data
structure to provide end users with quick, easy
access to information stored in databases.
Five fundamental database structures are the
hierarchical, network, relational, object-oriented,
and multidimensional models.
Hierarchical
Structure
Hierarchical Structure

Early mainframe DBMS packages used the


hierarchical structure, in which the relationships
between records form a hierarchy or treelike
structure. In the traditional hierarchical model, all
records are dependent and arranged in multilevel
structures, consisting of one root record and any
number of subordinate levels. Thus, all of the
relationships among records are one-to-many
because each data element is related to only one
element above it
Network
Structure
Network Structure

The network structure can represent more complex


logical relationships and is still used by some
mainframe DBMS packages. It allows
many-to-many relationships among records; that is,
the network model can access a data element by
following one of several paths because any data
element or record can be related to any number of
other data elements.
Relational Structure
Relational Structure

The relational model is the most widely used of the


three database structures. It is used by most
microcomputer DBMS packages, as well as by
most midrange and mainframe systems. In the
relational model, all data elements within the
database are viewed as being stored in the form of
simple two-dimensional tables, sometimes referred
to as relations..
Relational Structure

The tables in a relational database are flat files that


have rows and columns. Each row represents a single
record in the file, and each column represents a field.
The major difference between a flat file and a database
is that a flat file can only have data attributes specified
for one file. In contrast, a database can specify data
attributes for multiple files simultaneously and can
relate the various data elements in one file to those in
one or more other files
Multidimensional
Structure
Multidimensional Structure

It is a variation of the relational model


that uses multidimensional structures
to organize data and express the
relationships between data. You can
visualize multidimensional structures as
cubes of data and cubes.
Object-Oriented
Structure
Object-Oriented Structure

The object-oriented model is considered one of the key


technologies of a new generation of multimedia
Web-based applications. An object consists of data
values describing the attributes of an entity, plus the
operations that can be performed upon the data. This
encapsulation capability allows the object-oriented
model to handle complex types of data (graphics,
pictures, voice, and text) more easily than other
database structures.
Object-Oriented Structure

The object-oriented model also supports


inheritance; that is, new objects can be
automatically created by replicating some or
all of the characteristics of one or more
parent objects.
Database Development

Database development involves data


planning and database design activities.
Data models that support business
processes are used to develop databases
that meet the information needs of users.
Database Development

The development of databases can be easily accomplished


using microcomputer database management packages for
small end-user applications.
However, the development of large corporate databases
requires a top-down data planning effort that may involve
developing enterprise and entity relationship models, subject
area databases, and data models that reflect the logical data
elements and relationships needed to support the operation and
management of the basic business processes of the
organization.
Process of Database Development

Put the figure of database development here.


Types of databases

- Operational Databases
- Distributed Databases
- External Databases
- Hypermedia Databases
Operational Databases

Operational databases store detailed data needed


to support the business processes and operations
of a company. They are also called subject area
databases (SADB), transaction databases, and
production databases. Examples are a customer
database, human resource database, inventory
database, and other databases containing data
generated by business operations.
Distributed Databases

Many organizations replicate and distribute copies


or parts of databases to network servers at a
variety of sites. These distributed databases can
reside on network servers on the Internet, on
corporate intranets or extranets, or on other
company networks.
Distributed Databases

Replication and distribution of databases improve


database performance at end-user worksites.
Ensuring that the data in an organization’s
distributed databases are consistently and
concurrently updated is a major challenge of
distributed database management.
Advantages of Distributed Databases

- By having databases distributed in multiple


locations, the negative impact of an
catastrophic can be minimized.
- Often, a large database system may be
distributed into smaller databases based on
some logical relationship between the data and
the location. Thus it requires less space in
storage at a single data storage facility.
Disadvantages of Distributed Databases

- Data Accuracy
- Data Replication or Data Duplication is used to
get around issue of data accuracy, both of
which are resource expensive.
External Databases

Whenever you use a search engine like


Google or Yahoo to look up something on
the Internet, you are using an external
database.
Hypermedia Databases

Hypermedia databases on the World Wide


Web and on corporate intranets and
extranets store hyperlinked multimedia
pages on a Web site. Web server software
can manage such databases for quick
access and maintenance of the Web
database.
What is data warehouse?

A data warehouse stores data that have been extracted


from the various operational, external, and other
databases of an organization.
It is a central source of the data that have been cleaned,
transformed, and cataloged so that they can be used by
managers and other business professionals for data
mining, online analytical processing, and other forms of
business analysis, market research, and decision support.
What is data warehouse?

Data warehouses may be subdivided into data marts, which hold


subsets of data from the warehouse that focus on specific
aspects of a company, such as a department or a business
process.
One important characteristic about the data in a data warehouse
is that, unlike a typical database in which changes can occur
constantly, data in a data warehouse are which means that once
the data are gathered up, formatted for storage, and stored in
the data warehouse, they will never change
What is data mining?

The development of databases can be easily accomplished


using microcomputer database management packages for
small end-user applications.
However, the development of large corporate databases
requires a top-down data planning effort that may involve
developing enterprise and entity relationship models, subject
area databases, and data models that reflect the logical data
elements and relationships needed to support the operation and
management of the basic business processes of the
organization.
What is data mining?

Data mining is a major use of data warehouse


databases and the static data they contain. In data
mining, the data in a data warehouse are analyzed to
reveal hidden patterns and trends in historical business
activity. This analysis can be used to help managers
make decisions about strategic changes in business
operations to gain competitive advantages in the
marketplace.
What is data mining?

Data mining can discover new correlations, patterns,


and trends in vast amounts of business data (frequently
several terabytes of data) stored in data warehouses.
Data mining software uses advanced pattern
recognition algorithms, as well as a variety of
mathematical and statistical techniques, to sift through
mountains of data to extract previously unknown
strategic business information.
Data Mining Process
Some uses of Data Mining

- Perform market-basket analysis to identify new


product bundles.
- Find root causes of quality or manufacturing
problems.
- Prevent customer attrition and acquire new customers.
- Cross-sell to existing customers.
- Profile customers with more accuracy.
What are the
examples of data
mining that we
have discussed
so far?

You might also like