Dw Midterms Notes
Dw Midterms Notes
Oracle
- the industry-leading database. If offers a wide range of
choice of data warehouse solutions for both on-
premises and in the cloud
Amazon Redshift
- a simple and cost-effective tool to analyze all types of
data using standard SQL and existing BI tools
INTRODUCTION TO
DATABASE
RGRAFIA
DATA AND ITS MANAGEMENT
• Basics of Data
• Database Systems
• Database Architecture
• Data Management
INTRODUCTION TO DATABASE: ITS CONCEPT
• When data is stored in Database Systems, it can be stored in any format. Data can be
presented in either a structured or unstructured format. The complex combination of
structured and unstructured data sets is known as Big Data.
• Due to the 3V’s (Volume, Velocity, Variety) of Big Data, traditional technologies and
methods can’t be used to analyze them.
• Database Systems have been developed to address the issues of Big Data
WHAT IS DATABASE SYSTEMS OR DBMS?
• Generally, Databases have a table to store data, they use Structured Query
Language (SQL) to access the data from these tables.
• Databases and Database Systems play a vital role in processing hard, fast
and diverse datasets. Without a Database Management System, businesses
won’t receive valuable insights and deep analytics.
• In the Database environment, data is accessed, modified, controlled, and
then presented into a well-organized form, allowing the business
corporations to execute multiple data-processing operations.
WHAT IS DATABASE SYSTEMS OR DBMS?
• The data is usually organized in the form of rows and columns to minimize
the workload pressure and achieve accurate results instantly.
• Different types of data that can be stored, processed, or retrieved in
Database Management System include numerical, time series, textual and
binary data.
LANGUAGES SUPPORTED BY DATABASE SYSTEMS
• Database Systems comprise of specific languages that are used by operators,
programmers and end-users to interact with Database queries and updates.
• There are generally 4 types of Database Languages:
• Data Definition Language (DDL)
• Data Control Language (DCL)
• Data Manipulation Language (DML)
• Transaction Control Language (TCL)
• Data Definition Language (DDL)
• It is also called Data Description Language and is used to describe data
structures, create and modify data. SQL commands and statements like
Create, Alter, Drop, Truncate, Rename, and Comment are used to form
the pattern of the Database.
• Data Control Language (DCL)
• DCL commands include Revoke and Grant used to retrieve previously
stored and saved data. The syntax of DCL commands is similar to
programming languages. These statements play an essential role to
describe the ‘‘Rights & Permissions’’ across the Database system.
• Data Manipulation Language (DML)
• DML commands include Select, Insert, Update, Delete, Merge and Call.
These are used to access and manipulate data in the Database. These
statements are commonly meant for handling user requests.
• Transactional Control Language (TCL)
• TCL is used to handle all the transactions within Database Systems. TCL
commands include Commit, Rollback and SavePoint.
DATABASE SYSTEMS LANGUAGE EXAMPLES
• SQL: SQL unifies data definition, data manipulation, and querying in a single language. It
was one of the earliest commercial languages for the relational paradigm, albeit it differs
in some ways from Codd’s description (for example, rows and columns in a table can be
sorted).
• OQL: It is an object model language standard (developed by the Object Data Management
Group). It inspired the design of various subsequent query languages, such as JDOQL and
EJB QL.
• XQuery: XQuery is a standard XML query language that is supported by XML database
systems like MarkLogic and eXist, relational databases with XML capabilities like Oracle
and Db2, and in-memory XML processors like Saxon.
TYPES OF DATABASE SYSTEMS
• Do a logical or conceptual design of the database. This is where ER model plays a role. It is
the most used graphical representation of the conceptual design of a database.
• Physical Database Design (Like indexing) and external design (like views)
WHY USE ER DIAGRAMS IN DBMS?
ER Model is used to model the logical view of the system from a data
perspective which consists of these symbols:
• Rectangles: Rectangles represent Entities in the ER Model.
• Ellipses: Ellipses represent Attributes in the ER Model.
• Diamond: Diamonds represent Relationships among Entities.
• Lines: Lines represent attributes to entities and entity sets with other
relationship types.
• Double Ellipse: Double Ellipses represent Multi-Valued Attributes.
• Double Rectangle: Double Rectangle represents a Weak Entity.
SYMBOLS USED IN ER MODEL
COMPONENTS OF ER DIAGRAM
• ER Model consists of Entities, Attributes, and Relationships among Entities in a
Database System.
WHAT IS ENTITY?
PRIVATE CLOUD !
! PRODUCTS !
! SOLUTIONS !
! ABOUT US !
! BLOG
We have collected some of the most popular database management systems (DBMS) nowadays. Let’s start by defining what a database
management system is.
Table of contents
2.1 MySQL
2.2 MariaDB
2.5 PostgreSQL
2.6 MongoDB
2.7 Redis
2.9 Elasticsearch
2.10 SQLite
MySQL
MySQL is a free, open source relational database management system (RDBMS). It was initially owned by MySQL AB, before being acquired by
Sun Microsystems (part of Oracle Corporation since 2010). MySQL was originally developed by Ulf Michael Widenius, Swedes David Axmark and
Allan Larsson, founders of MySQL AB.
Many database-driven web applications, such as WordPress, Joomla and phpBB, as well as many popular websites like MediaWiki, Twitter and
Facebook, use MySQL.
MariaDB
MariaDB is a community-developed, free and open source relational database management system. It is a fork of MySQL. MariaDB was originally
developed by Ulf Michael Widenius, Swedes David Axmark and Allan Larsson, founders of MySQL AB and the MariaDB Foundation. Ulf Michael
Widenius is the current lead developer and CTO of MariaDB.
MariaDB is also included in numerous Linux distributions, such as CentOS, Debian and RHEL. Besides, it is used by many organizations such as
Wikipedia, Google or Tumblr.
Microsoft SQL Server is a commercial relational database management system. It is available in multiple editions, divided into three main
categories: mainstream, specialized and discontinued editions.
Developer: Microsoft.
Oracle DBMS
Oracle DBMS is a commercial, multi-model database management system. It is also known as Oracle Database or just Oracle. It is commonly used
for running: online transaction processing (OLTP) and data warehousing (DW).
PostgreSQL
PostgreSQL is a free, open source relational database management system (RDBMS). It was initially developed as a successor of the Ingres
database, developed at the University of California, Berkeley.
MongoDB
MongoDB is an open source, NoSQL, document-oriented database management system. MongoDB Inc. offers an integrated suite of cloud
database services, as well as commercial support. This document-oriented database software is commonly used for high-volume data storage.
Redis
Redis, short for “Remote Dictionary Server”, is an open source, NoSQL, key-value database management system.
Developer: Redis.
IBM DB2
IBM DB2 is a database management product developed by IBM, formerly known as DB2 for Linux, UNIX and Windows.
Developer: IBM.
Elasticsearch
Elasticsearch is a distributed, RESTful search and analytics engine. It is based on the Lucene library. Elasticsearch is the successor to a previous
search engine called Compass, also designed by Shay Banon.
Elasticsearch license: dual-licensed Elastic license and Server Side Public License.
SQLite
SQLite is a public domain database engine that belongs to the embedded, relational database management systems family. It has bindings to
many programming languages.
Bash, C, C++,
MariaDB RDBMS Linux, MacOS and Windows GNU GPL v2
and Perl
Microsoft
RDBMS Linux and Windows Proprietary C and C++
SQL Server
Assembly
Oracle Multi-model database AIX, BS2000, HP-UX, Linux, MacOS
Proprietary language, C and
DBMS management system and Windows
C++
Assembly, C,
IBM DB2 RDBMS Linux, Unix-like and Windows Proprietary
C++ and Java
1. Oracle
2. MySQL
3. Microsoft SQL Server
4. PostgreSQL
5. MongoDB
6. Redis
7. IBM DB2
8. Elasticsearch
9. SQLite
10. Microsoft Access
Managed services
System administration and IT outsourcing adapted to the needs of
each project.
DISCOVER MORE
Related articles
Protecting and Controlling The Linux Kernel surpasses 40 Million Take Your WordPress Blog to the Next
Information: It’s More Than Just lines of code: A historic nilestone in Level: How to Handle 100,000 Daily
Backups Open-Source software Visitors Without Breaking a Sweat
Products About us
Do you need help boosting your IT?
Solutions Case studies
Data centers Blog
Contact us