0% found this document useful (0 votes)

4 views

DBMS NOTES

Unit 1 covers the fundamentals of databases and DBMS, including definitions, user roles, characteristics, and architecture. It also introduces data models, SQL commands, and the Entity-Relationship approach for database design. Unit 2 focuses on the relational model, constraints, relational algebra, and calculus, emphasizing the importance of data integrity and query languages.

Uploaded by

itisarpitkumar13082005

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views

DBMS NOTES

Uploaded by

itisarpitkumar13082005

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 24

Unit 1: Database & DBMS Fundamentals

1. Basic Concepts: Database & Database Users

● What is a Database? A database is an organized collection of data, typically stored and accessed
electronically from a computer system. Think of it like a super-organized digital filing cabinet where data is
stored in tables, and you can easily retrieve or update it.
○ Example: A university database might store student info (name, ID, grades), course details, and
faculty records.
● Database Users:
○ End Users: People who interact with the database via applications (e.g., students checking grades
online).
○ Database Administrators (DBAs): Manage the database, ensuring security, backups, and
performance.
○ Application Developers: Write programs that interact with the database.
○ Database Designers: Create the structure (schema) of the database.
● Key Point for Exam: Know the roles of different users and how they interact with the database system.

2. Characteristics of Database Systems

Database systems have unique features that make them powerful:

● Data Integrity: Ensures data is accurate and consistent (e.g., no duplicate student IDs).
● Data Independence: Changes to the database structure (e.g., adding a new table) don’t affect the
applications using it.
● Concurrency Control: Allows multiple users to access the database simultaneously without conflicts.
● Data Security: Restricts unauthorized access (e.g., only faculty can update grades).
● Query Processing: Efficiently retrieves data using languages like SQL.
● Backup and Recovery: Protects data from failures (e.g., restoring data after a crash).
● Exam Tip: Be ready to list 4–5 characteristics with a brief explanation or example for each.

3. Concepts and Architecture

● DBMS Architecture:
○ Three-Schema Architecture:
1. External Schema (View Level): How users see the data (e.g., a student sees only their
grades).
2. Conceptual Schema (Logical Level): The overall structure of the database (e.g., tables for
students, courses).
3. Internal Schema (Physical Level): How data is stored on disk (e.g., file formats, indexes).
○ Purpose: Separates user views from physical storage for flexibility and security.
● Data Independence:
○ Logical Data Independence: Changes to the conceptual schema (e.g., adding a new table) don’t
affect external views.
○ Physical Data Independence: Changes to storage (e.g., switching to a new hard drive) don’t affect
the conceptual schema.
● Exam Tip: You might get a question asking you to explain the three-schema architecture with a diagram or
example. Practice sketching it!
4. Data Models, Schemas & Instances

● Data Models: Define how data is structured and related.

○ Relational Model: Data stored in tables (e.g., a "Students" table with columns for ID, Name, GPA).
○ Hierarchical Model: Data organized in a tree-like structure (e.g., parent-child relationships).
○ Network Model: Data organized in a graph structure with complex relationships.
○ Object-Oriented Model: Combines data and behavior (like objects in programming).
● Schema: The structure or blueprint of the database (e.g., the "Students" table has columns ID, Name, GPA).
● Instance: The actual data in the database at a specific time (e.g., the "Students" table contains a row for
John, ID: 101, GPA: 3.5).
● Exam Tip: Be able to differentiate between schema (structure) and instance (data). Example: Schema is like
a blank form; instance is the filled-in form.

5. DBMS Architecture & Data Independence (Revisited)

● We covered this under Concepts and Architecture, but it’s worth noting:
○ DBMS architecture ensures modularity and abstraction.
○ Data independence is critical for maintaining flexibility in large systems.
○ Example: If a university adds a new column to the "Students" table (e.g., "Email"), the application
students use to check grades doesn’t break (logical independence).

6. Database Languages & Interfaces

● Database Languages:
○ DDL (Data Definition Language): Defines the database structure (e.g., CREATE TABLE Students).
○ DML (Data Manipulation Language): Manipulates data (e.g., INSERT INTO Students VALUES (101,
'John', 3.5)).
○ DCL (Data Control Language): Defines access permissions (e.g., GRANT SELECT ON Students TO
faculty).
○ TCL (Transaction Control Language): Manages transactions (e.g., COMMIT, ROLLBACK).
● Interfaces:
○ Query Interfaces: Tools like SQL command-line or GUI (e.g., MySQL Workbench).
○ Application Interfaces: APIs like JDBC/ODBC for developers.
○ User Interfaces: Web or mobile apps for end users.
● Exam Tip: Know the difference between DDL, DML, DCL, and TCL with examples of commands.

7. Data Modelling Using the Entity-Relationship (ER) Approach

● ER Model: A visual way to design databases using entities, attributes, and relationships.
○ Entities: Objects like Student, Course.
○ Attributes: Properties like Student ID, Course Name.
○ Relationships: Connections like "Student enrolls in Course."
○ Example ER Diagram:
■ Entity: Student (Attributes: ID, Name, GPA).
■ Entity: Course (Attributes: Code, Title).
■ Relationship: Enrolls (Student takes Course).
● Key Symbols:
○ Rectangle: Entity.
○ Oval: Attribute.
○ Diamond: Relationship.
● Exam Tip: Practice drawing an ER diagram for a simple scenario (e.g., library system with Books, Borrowers,
and Loans).

8. Enhanced ER Concepts

● Specialization/Generalization:
○ Generalization: Combining similar entities into a general entity (e.g., Student and Faculty generalized
into Person).
○ Specialization: Breaking an entity into specific subtypes (e.g., Person specialized into Undergraduate
and Graduate).
○ Represented by an ISA relationship in ER diagrams.
● Aggregation:
○ Treating a relationship as an entity to simplify complex relationships.
○ Example: If Students work on Projects under Advisors, the "Works On" relationship can be
aggregated into an entity to capture additional attributes.
● Mapping ER Model to Relational Model:
○ Entities become tables (e.g., Student entity → Students table).
○ Attributes become columns.
○ Relationships:
■ One-to-One: Merge tables or use a foreign key.
■ One-to-Many: Add a foreign key in the "many" side (e.g., Student table has a CourseID
column).
■ Many-to-Many: Create a junction table (e.g., Student_Course table with StudentID and
CourseID).
● Exam Tip: Be able to convert a simple ER diagram into relational tables. Practice with 1:1, 1:N, and N:N
relationships.

9. SQL – DDL, DCL, DML, Views, and Indexes

● SQL Basics:
○ DDL (Data Definition Language):
■ CREATE TABLE Students (ID INT, Name VARCHAR(50), GPA FLOAT);
■ ALTER TABLE Students ADD Email VARCHAR(100);
■ DROP TABLE Students;
○ DML (Data Manipulation Language):
■ INSERT INTO Students VALUES (101, 'John', 3.5);
■ UPDATE Students SET GPA = 3.7 WHERE ID = 101;
■ DELETE FROM Students WHERE ID = 101;
■ SELECT * FROM Students WHERE GPA > 3.0;
○ DCL (Data Control Language):
■ GRANT SELECT ON Students TO faculty;
■ REVOKE INSERT ON Students FROM public;
● Views:
○ Virtual tables based on a query.
○ Example: CREATE VIEW HighGPA AS SELECT Name, GPA FROM Students WHERE GPA > 3.5;
○ Use: Simplifies queries, enhances security.
● Indexes:
○ Improve query performance by creating a lookup structure.
○ Example: CREATE INDEX idx_student_id ON Students(ID);
○ Trade-off: Faster searches, but slower inserts/updates.
● Exam Tip: Memorize at least one example for each SQL command type (CREATE, INSERT, GRANT, etc.).

10. Defining Constraints

Constraints ensure data integrity:

● Primary Key: Uniquely identifies each row (e.g., ID in Students table).

○ Example: CREATE TABLE Students (ID INT PRIMARY KEY, Name VARCHAR(50));
● Foreign Key: Links tables (e.g., CourseID in Students references Courses table).
○ Example: CREATE TABLE Enrollments (StudentID INT, CourseID INT, FOREIGN KEY (CourseID)
REFERENCES Courses(CourseID));
● Unique: Ensures no duplicates (e.g., email addresses).
○ Example: Email VARCHAR(100) UNIQUE
● Not Null: Ensures a column must have a value.
○ Example: Name VARCHAR(50) NOT NULL
● Check: Enforces a condition (e.g., GPA between 0 and 4.0).
○ Example: GPA FLOAT CHECK (GPA >= 0 AND GPA <= 4.0)
● IN Operator: Used in queries to check if a value is in a list.
○ Example: SELECT * FROM Students WHERE Grade IN ('A', 'B');
● Exam Tip: Practice writing CREATE TABLE statements with multiple constraints.

Study Strategies for Your Exam

1. Practice ER Diagrams: Draw ER diagrams for scenarios like a hospital, library, or e-commerce system.
Convert them to relational tables.
2. SQL Queries: Write SQL commands for DDL, DML, DCL, and constraints. Use a tool like MySQL or SQLite
to test them.
3. Flashcards: Create flashcards for terms like schema, instance, data independence, and SQL commands.
4. Past Papers: If you have access to previous exams, practice those questions to get a feel for the format.
5. Focus on Weak Areas: If ER modeling or SQL constraints feel tricky, let me know, and we can dive deeper
with examples or problems.

Sample Practice Question

Question: Design an ER diagram for a library system with Books, Members, and Loans. Convert it to a relational
model and write SQL to create the tables with appropriate constraints.

Answer:

● ER Diagram:
○ Entity: Book (Attributes: ISBN, Title, Author)
○ Entity: Member (Attributes: MemberID, Name, Email)
○ Entity: Loan (Attributes: LoanID, LoanDate, ReturnDate)
○ Relationship: Borrows (Member borrows Book)
● Relational Model:
○ Books (ISBN, Title, Author)
○ Members (MemberID, Name, Email)
○ Loans (LoanID, MemberID, ISBN, LoanDate, ReturnDate)
■ Foreign Keys: MemberID references Members(MemberID), ISBN references Books(ISBN)
● SQL:

sql
Copy
CREATE TABLE Books (
ISBN VARCHAR(13) PRIMARY KEY,
Title VARCHAR(100) NOT NULL,
Author VARCHAR(50)
);

CREATE TABLE Members (

MemberID INT PRIMARY KEY,
Name VARCHAR(50) NOT NULL,
Email VARCHAR(100) UNIQUE
);

CREATE TABLE Loans (

LoanID INT PRIMARY KEY,
MemberID INT,
ISBN VARCHAR(13),
LoanDate DATE NOT NULL,
ReturnDate DATE,
FOREIGN KEY (MemberID) REFERENCES Members(MemberID),
FOREIGN KEY (ISBN) REFERENCES Books(ISBN)
);
Unit 2: Relational Model, Algebra, Calculus, and Advanced SQL

1. Relational Model Concepts

The Relational Model is the foundation of most modern DBMSs, organizing data into tables (relations) with rows
(tuples) and columns (attributes).

● Key Concepts:
○ Relation: A table (e.g., Students with columns ID, Name, GPA).
○ Tuple: A row in the table (e.g., (101, 'John', 3.5)).
○ Attribute: A column in the table (e.g., GPA).
○ Domain: The set of allowed values for an attribute (e.g., GPA is a float between 0 and 4.0).
○ Primary Key: Uniquely identifies each tuple (e.g., ID).
○ Foreign Key: Links one table to another (e.g., CourseID in Enrollments references Courses).

Example:

Students Table:
ID | Name | GPA
----+-------+-----
101 | John | 3.5

● 102 | Alice | 3.8

● Exam Tip: Be ready to define terms like tuple, attribute, and domain, and explain how they fit into the
relational model.

2. Relational Model Constraints

Constraints ensure data integrity in the relational model:

● Domain Constraints: Attributes must follow their defined domain (e.g., GPA must be a float).
● Key Constraints: Every table must have a unique primary key (e.g., ID in Students).
● Entity Integrity: Primary key values cannot be null.
● Referential Integrity: Foreign key values must match a primary key in the referenced table or be null (e.g.,
CourseID in Enrollments must exist in Courses).
● Check Constraints: Custom rules (e.g., GPA >= 0 AND GPA <= 4.0).

Example:

CREATE TABLE Enrollments (
StudentID INT,
CourseID INT,
PRIMARY KEY (StudentID, CourseID),
FOREIGN KEY (StudentID) REFERENCES Students(ID),
FOREIGN KEY (CourseID) REFERENCES Courses(CourseID)

);
● Exam Tip: Practice writing CREATE TABLE statements with constraints and identifying violations (e.g.,
inserting a null primary key).

3. Relational Algebra

Relational Algebra is a theoretical query language used to manipulate and retrieve data from relational databases.
It uses operators to build queries.

● Basic Operators:
○ Select (σ): Filters rows based on a condition.
■ Example: σ_{GPA > 3.5}(Students) returns students with GPA > 3.5.
○ Project (π): Selects specific columns.
■ Example: π_{Name, GPA}(Students) returns only the Name and GPA columns.
○ Union (∪): Combines rows from two relations (removes duplicates).
■ Example: Students ∪ Faculty combines their rows if compatible.
○ Difference (−): Returns rows in one relation but not another.
■ Example: Students − Graduates returns non-graduate students.
○ Cartesian Product (×): Combines all rows from two relations.
■ Example: Students × Courses pairs every student with every course.
○ Join (⨝): Combines rows based on a condition.
■ Example: Students ⨝_{Students.ID = Enrollments.StudentID} Enrollments.
● Additional Operators:
○ Natural Join: Joins tables on common attributes.
○ Division: Finds values in one relation that match all values in another.
○ Rename (ρ): Renames a relation or attribute for clarity.
● Example: To find names of students with GPA > 3.5 enrolled in a course:

π_{Name}(σ_{GPA > 3.5}(Students) ⨝_{Students.ID = Enrollments.StudentID} Enrollments)

● Exam Tip: Practice writing relational algebra expressions for queries like “Find students enrolled in a specific
course.” Know the symbols and their purposes.

4. Relational Calculus

Relational Calculus is a non-procedural query language that describes what data to retrieve, not how to retrieve it.
It’s more declarative than relational algebra.

● Types:
○ Tuple Relational Calculus (TRC):
■ Uses tuples and conditions to define queries.
■ Example: { t | t ∈ Students ∧ t.GPA > 3.5 } returns tuples t from Students where GPA > 3.5.
○ Domain Relational Calculus (DRC):
■ Uses attribute domains instead of tuples.
■ Example: { <Name, GPA> | <ID, Name, GPA> ∈ Students ∧ GPA > 3.5 }.
● Key Difference:
○ Relational Algebra: Procedural (step-by-step operations).
○ Relational Calculus: Declarative (describes the result).
● Exam Tip: Understand the difference between algebra and calculus. TRC questions are more common, so
practice writing TRC queries for simple conditions.
5. SQL – Functions (Aggregate Functions)

Aggregate Functions perform calculations on a set of values and return a single value.

● Common Aggregate Functions:

○ COUNT(*): Counts rows.
■ Example: SELECT COUNT(*) FROM Students; (total students).
○ SUM(column): Sums values.
■ Example: SELECT SUM(GPA) FROM Students;.
○ AVG(column): Computes average.
■ Example: SELECT AVG(GPA) FROM Students;.
○ MIN(column): Finds minimum.
■ Example: SELECT MIN(GPA) FROM Students;.
○ MAX(column): Finds maximum.
■ Example: SELECT MAX(GPA) FROM Students;.

Example:

SELECT AVG(GPA), MAX(GPA)
FROM Students
WHERE Major = 'CS';

● Exam Tip: Combine aggregate functions with GROUP BY (covered later) for questions like “Find average
GPA per department.”

6. Built-in Functions

SQL provides built-in functions for numeric, date, and string operations.

● Numeric Functions:
○ ROUND(number, decimals): Rounds a number.
■ Example: SELECT ROUND(GPA, 1) FROM Students; (e.g., 3.56 → 3.6).
○ ABS(number): Returns absolute value.
○ CEIL(number), FLOOR(number): Rounds up or down.
● Date Functions:
○ CURRENT_DATE: Returns today’s date.
■ Example: SELECT CURRENT_DATE;.
○ DATEDIFF(date1, date2): Days between two dates.
■ Example: SELECT DATEDIFF(ReturnDate, LoanDate) FROM Loans;.
○ EXTRACT(unit FROM date): Extracts part of a date (e.g., year).
■ Example: SELECT EXTRACT(YEAR FROM LoanDate) FROM Loans;.
● String Functions:
○ CONCAT(str1, str2): Concatenates strings.
■ Example: SELECT CONCAT(FirstName, ' ', LastName) AS FullName FROM Students;.
○ UPPER(str), LOWER(str): Changes case.
■ Example: SELECT UPPER(Name) FROM Students;.
○ LENGTH(str): Returns string length.
■ Example: SELECT LENGTH(Name) FROM Students;.
● Exam Tip: Memorize 2–3 examples per function type. Be ready to use them in SELECT queries.

7. Set Operations

SQL supports set operations to combine query results:

● UNION: Combines rows from two queries, removing duplicates.

○ Example: SELECT Name FROM Students UNION SELECT Name FROM Faculty;.
● UNION ALL: Like UNION but keeps duplicates.
● INTERSECT: Returns rows common to both queries.
○ Example: SELECT Name FROM Students INTERSECT SELECT Name FROM Faculty;.
● EXCEPT (or MINUS): Returns rows in the first query but not the second.
○ Example: SELECT Name FROM Students EXCEPT SELECT Name FROM Graduates;.
● Exam Tip: Ensure the queries have the same number of columns and compatible data types. Practice writing
UNION and INTERSECT queries.

8. Subqueries and Correlated Subqueries

● Subqueries: A query nested inside another query.

Example: Find students with above-average GPA:

SELECT Name, GPA
FROM Students
WHERE GPA > (SELECT AVG(GPA) FROM Students);

● Correlated Subqueries: A subquery that references the outer query’s table.

Example: Find students enrolled in at least one course:

SELECT Name
FROM Students S
WHERE EXISTS (
SELECT * FROM Enrollments E
WHERE E.StudentID = S.ID
);

● Exam Tip: Practice identifying correlated vs. non-correlated subqueries. Correlated subqueries are slower
but powerful for complex conditions.

9. GROUP BY, HAVING, ORDER BY

● GROUP BY: Groups rows with the same values into summary rows (used with aggregate functions).
Example: Find average GPA by major:

SELECT Major, AVG(GPA)
FROM Students

○ GROUP BY Major;

● HAVING: Filters groups based on a condition (like WHERE for GROUP BY).

Example: Find majors with average GPA > 3.5:

SELECT Major, AVG(GPA)
FROM Students
GROUP BY Major
HAVING AVG(GPA) > 3.5;

● ORDER BY: Sorts results (ASC for ascending, DESC for descending).

Example: Sort students by GPA:

SELECT Name, GPA
FROM Students
ORDER BY GPA DESC;

● Exam Tip: Remember: WHERE filters rows before grouping, HAVING filters groups after grouping.

10. Joins and Their Types

Joins combine rows from two or more tables based on a condition.

● Types of Joins:
○ INNER JOIN: Returns rows where the condition matches in both tables.
■ Example: SELECT S.Name, E.CourseID FROM Students S INNER JOIN Enrollments E ON
S.ID = E.StudentID;.
○ LEFT OUTER JOIN: Returns all rows from the left table, with matching rows from the right (nulls if no
match).

Example: List all students, including those not enrolled:

SELECT S.Name, E.CourseID
FROM Students S LEFT JOIN Enrollments E ON S.ID = E.StudentID;

○ RIGHT OUTER JOIN: Returns all rows from the right table, with matching rows from the left.
○ FULL OUTER JOIN: Returns all rows from both tables, with nulls where there’s no match.
○ NATURAL JOIN: Joins on common column names (less common).
● Exam Tip: Practice writing INNER and LEFT JOIN queries. Be able to explain the difference with a Venn
diagram or example.
11. EXISTS, ANY, ALL

● EXISTS: Checks if a subquery returns any rows.

Example: Find students enrolled in any course:

SELECT Name
FROM Students S
WHERE EXISTS (
SELECT * FROM Enrollments E WHERE E.StudentID = S.ID
);

● ANY: Compares a value to any value in a subquery.

Example: Find students with GPA greater than at least one CS major:

SELECT Name
FROM Students
WHERE GPA > ANY (SELECT GPA FROM Students WHERE Major = 'CS');

● ALL: Compares a value to all values in a subquery.

Example: Find students with GPA greater than all CS majors:

SELECT Name
FROM Students
WHERE GPA > ALL (SELECT GPA FROM Students WHERE Major = 'CS');

● Exam Tip: Practice rewriting ANY and ALL queries using MIN or MAX for clarity.

12. Views and Their Types

● Views: Virtual tables based on a query, used for simplicity or security.

Example: Create a view for high-GPA students:

CREATE VIEW HighGPAStudents AS
SELECT Name, GPA
FROM Students
WHERE GPA > 3.5;

● Types of Views:
○ Simple Views: Based on a single table, updatable.
○ Complex Views: Involve joins, aggregates, or subqueries, usually not updatable.
○ Materialized Views: Physically store data for performance (not always updatable).
● Exam Tip: Know how to create and query a view. Be ready to explain when a view is updatable.

13. Transaction Control Commands

● Transactions: A sequence of operations treated as a single unit.

● Commands:
○ COMMIT: Saves all changes in the transaction.
■ Example: COMMIT;.
○ ROLLBACK: Undoes all changes in the transaction.
■ Example: ROLLBACK;.
○ SAVEPOINT: Sets a point to roll back to.

Example:

SAVEPOINT save1;
INSERT INTO Students VALUES (103, 'Bob', 3.2);
ROLLBACK TO save1;

Example:

BEGIN TRANSACTION;
INSERT INTO Students VALUES (104, 'Eve', 3.9);
SAVEPOINT save1;
UPDATE Students SET GPA = 4.0 WHERE ID = 104;
ROLLBACK TO save1; -- Undoes the UPDATE
COMMIT; -- Saves the INSERT

● Exam Tip: Understand the flow of a transaction and how SAVEPOINT works.

Sample Practice Question

Question: Write a SQL query to find the names of students who have a GPA greater than the average GPA of all
Computer Science majors. Also, express this query in relational algebra.

Answer:

SQL:

SELECT Name
FROM Students
WHERE GPA > (SELECT AVG(GPA) FROM Students WHERE Major = 'CS');

Relational Algebra:

π_{Name}(σ_{GPA > avg_gpa}(Students))
where avg_gpa = π_{AVG(GPA)}(σ_{Major = 'CS'}(Students))
Unit 3:Relational Database Design

Functional Dependencies & Normalization

Functional Dependencies (FDs)

A functional dependency is a constraint where one set of attributes (A) uniquely determines another set (B), written
as A → B. For every value of A, there’s exactly one corresponding value of B. FDs are the foundation of
normalization and help identify keys and dependencies in a relation.

● Types of FDs:
○ Trivial FD: B is a subset of A (e.g., {StudentID, Name} → Name). Always true.
○ Non-trivial FD: B is not a subset of A (e.g., StudentID → Name).
○ Partial Dependency: A non-prime attribute depends on part of a candidate key.
○ Transitive Dependency: A non-prime attribute depends on another non-prime attribute via a third
attribute.
● Closure of Attributes: To find all attributes determined by a set A (denoted A⁺), use Armstrong’s Axioms:
○ Reflexivity: If B ⊆ A, then A → B.
○ Augmentation: If A → B, then A ∪ C → B ∪ C.
○ Transitivity: If A → B and B → C, then A → C.
○ Derived rules: Union (A → B, A → C implies A → BC), Decomposition (A → BC implies A → B, A →
C).

Example: Given a relation R(A, B, C, D) with FDs {A → B, B → C}, compute {A}⁺:

1. Start with {A}⁺ = {A}.

2. Apply A → B: Add B, so {A}⁺ = {A, B}.
3. Apply B → C: Add C, so {A}⁺ = {A, B, C}.
4. No more FDs apply. Final {A}⁺ = {A, B, C}.

Exam Tip: Practice computing closures to identify candidate keys (attributes whose closure includes all attributes in
the relation).

Normalization

Normalization decomposes a relation into smaller relations to eliminate redundancy and anomalies (insertion,
deletion, update) while preserving data and dependencies. Each normal form builds on the previous one.

1NF (First Normal Form)

● Requirement: All attributes are atomic (no multi-valued or composite attributes).

Example: A table {StudentID, Name, Courses} where Courses contains {“DBMS, OS”} violates 1NF. Fix by splitting
into rows:

Before: | StudentID | Name | Courses|
| 101 | Alice | DBMS,OS|
After: | StudentID | Name | Course |
| 101 | Alice | DBMS |
| 101 | Alice | OS |
2NF (Second Normal Form)

● Requirement: In 1NF, and no partial dependency (non-prime attributes depend on the entire candidate key,
not part of it).
● Example: Table {StudentID, CourseID, Instructor, Dept} with FDs {StudentID, CourseID → Instructor,
CourseID → Dept}. Candidate key: {StudentID, CourseID}. CourseID → Dept is a partial dependency (Dept
depends only on CourseID). Decompose:
○ R1: {CourseID, Dept}
○ R2: {StudentID, CourseID, Instructor}

3NF (Third Normal Form)

● Requirement: In 2NF, and no transitive dependency (non-prime attributes don’t depend on other non-prime
attributes).
● Example: Table {StudentID, Dept, DeptHead} with FDs {StudentID → Dept, Dept → DeptHead}. Dept →
DeptHead is transitive. Decompose:
○ R1: {Dept, DeptHead}
○ R2: {StudentID, Dept}

BCNF (Boyce-Codd Normal Form)

● Requirement: For every FD A → B, A is a superkey (stricter than 3NF).

● Example: Table {Student, Course, Instructor} with FDs {Student, Course → Instructor, Instructor → Course}.
Candidate key: {Student, Course}. But Instructor → Course violates BCNF (Instructor isn’t a superkey).
Decompose:
○ R1: {Instructor, Course}
○ R2: {Student, Instructor}

Lossless Join Decomposition

Ensures the join of decomposed tables reconstructs the original relation. A decomposition is lossless if at least one
decomposed table contains a key of the original relation or satisfies the chase algorithm.

Example: For R(A, B, C) with FD {A → B}, decompose into R1(A, B) and R2(A, C). Since R1 ∩ R2 = {A} and A → B,
the join on A preserves all tuples.

Dependency Preserving Decomposition

Ensures all FDs are enforceable in the decomposed tables. Check if the closure of FDs in the decomposed tables
covers the original FDs.

Example: For R(A, B, C) with FDs {A → B, B → C}, decompose into R1(A, B) and R2(B, C). FDs are preserved
because {A → B} is in R1, {B → C} is in R2, and together they cover all FDs.

Practice Problem: Normalize the table R(A, B, C, D) with FDs {A → B, B → C, C → D} to BCNF, ensuring lossless
join and dependency preservation.

● Step 1: Check 1NF (assume atomic attributes).

● Step 2: Candidate key: {A} (since {A}⁺ = {A, B, C, D}).
● Step 3: Check BCNF: B → C violates BCNF (B isn’t a superkey). Decompose:
○ R1(B, C)
○ R2(A, B, D)
● Step 4: Check R2 for BCNF: A → B, D is okay (A is a superkey in R2). C → D no longer applies.
● Step 5: Verify:
○ Lossless: R1 ∩ R2 = {B}, and B → C ensures lossless join.
○ Dependency Preserving: {A → B, B → C} preserved; C → D implied via transitivity.

Exam Tip: Practice normalizing step-by-step and checking properties. Expect questions like “Normalize this table” or
“Is this decomposition lossless?”

2. Normal Forms Based on Multivalued & Join Dependencies

4NF (Fourth Normal Form)

● Requirement: In BCNF, and no non-trivial multi-valued dependencies (MVDs). An MVD A ↠ B holds if for a
value of A, the set of B values is independent of other attributes.

Example: Table {Student, Course, Hobby} with data:

○ R1: {Student, Course}

○ R2: {Student, Hobby}

5NF (Fifth Normal Form)

● Requirement: No non-trivial join dependencies (JDs). A JD exists if a relation can be decomposed into
smaller relations and joined back losslessly. 5NF is rare and ensures no anomalies from complex joins.
● Example: A table {Supplier, Part, Project} where a supplier supplies a part to a project only if they supply
both independently. Decompose into projections to eliminate JDs.

DKNF (Domain-Key Normal Form)

● Requirement: Every constraint is a result of domain constraints or key constraints. Theoretical and hard to
achieve but ensures no anomalies.
● Example: A table with complex constraints (e.g., “salary must be positive unless employee is intern”) may
violate DKNF unless simplified.

Practice Problem: For {Student, Course, Book} with MVD Student ↠ Course, decompose to 4NF.

● Decompose into {Student, Course} and {Student, Book}.

Exam Tip: Focus on 4NF with MVDs. 5NF/DKNF are less common but understand their purpose.

3. Properties of Transactions
A transaction is a sequence of operations (e.g., read, write) treated as a single unit. Transactions ensure database
reliability via ACID properties:

● Atomicity: All operations complete, or none do. Example: A bank transfer debits one account and credits
another; if it fails, both are rolled back.
● Consistency: Database remains in a valid state (e.g., foreign key constraints hold).
● Isolation: Transactions don’t interfere (partial changes aren’t visible).
● Durability: Committed changes persist, even after a crash.

Transaction States

● Active: Executing operations.

● Partially Committed: Operations done, awaiting commit.
● Committed: Successfully completed.
● Failed: Cannot proceed (e.g., error or deadlock).
● Aborted: Rolled back, changes undone.

Example: A transaction transferring $100:

1. Active: Reads balance, updates accounts.

2. Partially Committed: Updates written to buffer.
3. Committed: Changes saved to disk.
4. If it fails (e.g., insufficient funds), it’s aborted.

Practice Problem: Draw the transaction state diagram and explain how a failed transaction is handled.

Exam Tip: Memorize ACID and states. Be ready to explain with real-world examples (e.g., online shopping).

4. Transaction Schedules & Serializability

A schedule is a sequence of operations from multiple transactions. Schedules must ensure correctness, equivalent
to a serial schedule (one transaction at a time).

● Serial Schedule: No interleaving (e.g., T1 then T2).

● Serializable Schedule: Produces the same result as a serial schedule.
○ Conflict Serializability: Operations conflict if they access the same data and at least one is a write
(e.g., R(A)-W(A), W(A)-W(B)). Use a precedence graph:
■ Nodes: Transactions.
■ Edges: Conflicts (e.g., T1 → T2 if T1’s operation precedes and conflicts with T2’s).
■ Acyclic graph = conflict serializable.
○ View Serializability: Stricter, rarely tested.

Example: Schedule: T1: R(A), T2: W(A), T2: W(B), T1: W(B).

● Conflicts: T1’s R(A) → T2’s W(A), T1’s W(B) → T2’s W(B).

● Precedence graph: T1 → T2 (no cycle, serializable).

Practice Problem: Is T1: R(A), T2: W(A), T1: W(B), T2: R(B) conflict serializable?

● Conflicts: T1 → T2 (R(A)-W(A)), T2 → T1 (W(B)-R(B)).

● Graph: Cycle (T1 ↔ T2). Not conflict serializable.
Exam Tip: Practice precedence graphs and identifying conflicts.

5. Concurrency Control Techniques

Concurrency control ensures serializability in interleaved schedules.

Locking Techniques

● Shared Lock (S): Allows reading, blocks writing.

● Exclusive Lock (X): Allows reading/writing, blocks others.
● Two-Phase Locking (2PL):
○ Growing Phase: Acquire locks.
○ Shrinking Phase: Release locks (no new locks after releasing).
○ Ensures conflict serializability but may cause deadlocks.

Example: T1: S(A), R(A), X(B), W(B); T2: X(A), W(A). T1 and T2 follow 2PL, ensuring serializability.

Timestamp Ordering

● Assigns each transaction a unique timestamp.

● Operations processed in timestamp order:
○ Read: Allowed if no later write occurred.
○ Write: Allowed if no later read/write occurred.
● Avoids deadlocks but may abort transactions.

Example: T1 (TS=10): R(A); T2 (TS=20): W(A). T1 reads A if no write with TS > 10 occurred.

Granularity

Locking can apply to:

● Database, table, page, or tuple.

● Finer granularity (e.g., tuple) increases concurrency but overhead.

Recoverable Schedules

● T2 can commit only after T1 commits if T2 reads T1’s uncommitted data.

● Cascadeless Schedules: Avoid dirty reads (T2 doesn’t read T1’s uncommitted data).

Practice Problem: In 2PL, show how T1: R(A), W(B) and T2: W(A), R(B) avoid conflicts.

Exam Tip: Compare 2PL vs. timestamp ordering. Practice identifying recoverable schedules.

6. Deadlock Detection and Recovery

● Deadlock: Transactions wait for each other’s locks in a cycle.
● Detection: Use a wait-for graph (nodes: transactions, edges: T1 waits for T2). A cycle indicates deadlock.
● Recovery:
○ Select a victim (e.g., youngest transaction).
○ Rollback and restart.

Example: T1: X(A), wants B; T2: X(B), wants A. Wait-for graph: T1 → T2, T2 → T1 (cycle). Abort T2.

Practice Problem: Draw a wait-for graph for T1: X(A), T2: X(B), T1: wants B, T2: wants A.

Exam Tip: Practice detecting and resolving deadlocks.

7. Recovery Techniques
Recovery ensures consistency after failures.

● Log-Based Recovery:
○ Logs store before/after images.
○ Undo: Rollback uncommitted changes.
○ Redo: Reapply committed changes.
● Checkpoints: Periodic snapshots to reduce recovery time.
● Backup: Full/incremental backups for catastrophic failures.
● Recovery from Catastrophic Failures: Restore backup, apply logs.

Example: Log: <T1, A, 100, 200>. If T1 fails, undo by restoring A=100. If T1 commits, redo A=200.

Practice Problem: Given a log, show undo/redo steps after a crash.

Exam Tip: Understand log-based recovery and checkpoints.

8. Database Programming
● Control Structures: PL-SQL supports:
○ IF-THEN-ELSE: IF salary < 50000 THEN bonus := 1000; END IF;
○ Loops: FOR i IN 1..10 LOOP ... END LOOP;
● Exception Handling:
○ BEGIN ... EXCEPTION WHEN NO_DATA_FOUND THEN ... END;

Stored Procedures:

CREATE PROCEDURE UpdateSalary(empID IN NUMBER, amount IN NUMBER) AS
BEGIN
UPDATE Employees SET salary = salary + amount WHERE ID = empID;
COMMIT;
EXCEPTION
WHEN OTHERS THEN
ROLLBACK;
END;
Triggers:

CREATE TRIGGER LogSalaryChange
AFTER UPDATE OF salary ON Employees
FOR EACH ROW
BEGIN
INSERT INTO SalaryLog(empID, oldSalary, newSalary)
VALUES (:OLD.ID, :OLD.salary, :NEW.salary);
END;

Practice Problem: Write a trigger to log deletions from a table.

Exam Tip: Practice writing PL-SQL code and handling exceptions.

Unit 4: File Structures and Indexing

1. File Structures and Indexing: Secondary Storage Devices

Secondary Storage Devices

Secondary storage devices (e.g., hard disk drives (HDDs), solid-state drives (SSDs)) store database data
persistently. Unlike main memory (RAM), secondary storage is non-volatile but slower, so efficient organization and
access methods are critical.

● Key Characteristics:
○ Blocks: Data is stored in fixed-size blocks (e.g., 4KB or 8KB). A block is the unit of transfer between
disk and memory.
○ Access Time: Includes seek time (moving disk head to track), rotational latency (waiting for sector
to rotate under head), and transfer time (reading/writing data). SSDs are faster due to no mechanical
parts.
○ I/O Cost: Measured in block accesses, as these dominate query performance.
● Example: A database table with 1 million records, each 200 bytes, stored on a disk with 4KB blocks. Each
block holds 4096 / 200 ≈ 20 records. To read 100 records, you need 100 / 20 = 5 block accesses (assuming
records are contiguous).

Exam Tip: Understand block-based storage and how to calculate block accesses for queries. Be ready to explain
why SSDs are faster than HDDs.

2. Operations on Files
Files store database records, and operations include insertion, deletion, modification, and retrieval. The efficiency
of these operations depends on the file organization.

● Common Operations:
○ Insert: Add a new record. May require shifting records or appending.
○ Delete: Remove a record. May mark as deleted (logical deletion) or reorganize file (physical deletion).
○ Modify: Update a record’s fields.
○ Retrieve: Fetch records by key (exact match) or range (e.g., all records with age > 30).
● Challenges:
○ Insertion/deletion in sorted files is costly due to shifting records.
○ Retrieval is slow without indexes.

Example: In a file with records {ID: 1, Name: Alice}, {ID: 2, Name: Bob}, inserting {ID: 3, Name: Charlie} is simple
(append), but deleting {ID: 1} may require marking or reorganizing.

Exam Tip: Know the cost of operations in different file organizations (heap, sorted, hashed). Expect questions like
“How many block accesses to insert a record?”

3. File Organizations
Heap Files

● Structure: Records stored in no particular order (appended as inserted).

● Pros: Fast insertion (append to end).
● Cons: Slow retrieval (requires linear scan for searches) and deletion (must search for record).
● Use Case: Temporary files or when order doesn’t matter.
● Example: A log file where records are appended as events occur. To find a record with ID = 5, scan all blocks
(O(n) block accesses).

Sorted Files

● Structure: Records sorted by a key (e.g., ID).

● Pros: Efficient for range queries (e.g., ID BETWEEN 10 AND 20) and binary search (O(log n) for exact
match).
● Cons: Insertion/deletion is costly (requires shifting records, O(n) in worst case).
● Use Case: Static data with frequent range queries.
● Example: A table sorted by EmployeeID. To find EmployeeID = 100, use binary search on disk blocks.

Hashed Files

● Structure: Records stored based on a hash function applied to a key (e.g., hash(ID) = block number).
● Pros: Fast exact-match retrieval (O(1) block accesses on average).
● Cons: Poor for range queries (hash scatters records). Collisions require resolution (e.g., chaining or open
addressing).
● Use Case: High-speed lookups (e.g., primary key access).
● Example: A table with StudentID hashed to blocks. hash(101) = block 3 directly retrieves the record.

Practice Problem: Compare the number of block accesses to retrieve a record with ID = 50 in a heap file vs. a
hashed file (assume 1000 records, 10 records per block).

● Heap File: Linear scan, up to 1000 / 10 = 100 block accesses.

● Hashed File: 1 block access (hash directs to correct block, assuming no collisions).

Exam Tip: Be ready to compare file organizations for specific operations (e.g., “Which is best for range queries?”).
Practice calculating I/O costs.

4. Indexing
Indexes are data structures that speed up retrieval by providing quick access to records based on key values. They
store key-pointer pairs, where the pointer references a disk block or record.

Single-Level Indexes

● Primary Index: Built on the primary key of a sorted file. Each entry maps a key to a block.
○ Structure: Sparse (one entry per block, not per record).
○ Example: A sorted file with 1000 records, 10 records per block (100 blocks). The primary index has
100 entries, each pointing to a block’s first key.
○ Search: Binary search on index (O(log m), where m is number of blocks), then 1 block access.
● Clustering Index: Built on a non-key attribute that orders the file (e.g., DepartmentID if records are sorted by
department).
● Secondary Index: Built on a non-ordering attribute (e.g., Name in a file sorted by ID). Dense (one entry per
record).
○ Example: A secondary index on Name maps each name to a record’s disk address. Requires more
storage but supports queries on non-key attributes.

Multi-Level Indexes

● Structure: A hierarchy of indexes (index on an index) to reduce search time for large indexes.
● Example: A primary index with 10,000 entries is too large to fit in memory. Create a second-level index on
the first-level index, reducing search to O(log log n).
● Use Case: Large databases where single-level indexes are too big.

B-Tree and B+ Tree Indexes

● B-Tree:
○ A balanced, multi-way search tree where each node holds multiple keys and pointers.
○ Properties:
■ All leaves at the same level.
■ A node with k keys has k+1 pointers.
■ Minimum and maximum keys per node (e.g., order m means max m-1 keys).
○ Operations:
■ Search: O(log n) disk accesses.
■ Insert/Delete: Split or merge nodes to maintain balance.
○ Example: A B-tree of order 3 (max 2 keys per node). To find key 50, traverse root to leaf, accessing
2–3 disk blocks.
● B+ Tree:
○ A variant of B-tree optimized for databases.
○ Differences:
■ Only leaf nodes store data pointers; internal nodes store keys for navigation.
■ Leaf nodes linked sequentially for efficient range queries.
○ Advantages:
■ Better for range queries (sequential leaf access).
■ Less storage for internal nodes (no data pointers).
○ Example: A B+ tree with keys {10, 20, 30} in leaves, linked for range query 10 ≤ key ≤ 25.

Practice Problem: In a B+ tree of order 4 (max 3 keys per node), insert keys {10, 20, 30, 15, 25}. Show the tree
structure.

● Step 1: Insert 10: Root = [10].

● Step 2: Insert 20: Root = [10, 20].
● Step 3: Insert 30: Root = [10, 20, 30].
● Step 4: Insert 15: Root splits (order exceeded). New root = [20], leaves = [10, 15] → [20, 30].
● Step 5: Insert 25: Update leaf to [20, 25, 30].

Exam Tip: Understand the structure of B+ trees and how they support range queries. Practice inserting keys and
calculating search costs (e.g., “How many disk accesses to find a key?”).

5. Concepts of Object-Oriented Database Management Systems

(OODBMS)
An OODBMS combines object-oriented programming (OOP) principles with database capabilities, storing objects
directly rather than mapping them to relational tables.

● Key Features:
○ Objects: Data and methods (behavior) stored together (e.g., a Student object with attributes name, ID
and method calculateGPA()).
○ Classes and Inheritance: Objects belong to classes, which can inherit properties (e.g., GradStudent
inherits from Student).
○ Encapsulation: Data and methods bundled, with access control (e.g., private attributes).
○ Polymorphism: Methods can behave differently based on object type.
○ Complex Data Types: Support for nested objects, arrays, and references (e.g., a Course object with
a list of Student objects).
○ Persistence: Objects stored permanently, with query capabilities.

Example: In an OODBMS, a Student class:

Class Student {

String name;

Int ID;

List<Course> enrolledCourses;

Float calculateGPA() { ... }

}
Query: “Find all students with GPA > 3.5” directly accesses the calculateGPA method.

● Advantages:
○ Natural modeling for complex data (e.g., multimedia, CAD).
○ Faster for object-based applications (no impedance mismatch with OOP languages).
● Disadvantages:
○ Complex querying compared to SQL.
○ Less mature than relational DBMS.

Practice Problem: Design an OODBMS schema for a library system with Book, Author, and Borrower classes,
including inheritance (e.g., TextBook inherits from Book).

Exam Tip: Compare OODBMS vs. RDBMS (e.g., “Why use OODBMS for a CAD system?”). Be ready to define OOP
concepts in a database context.

6. Concepts of Distributed Database Management Systems (DDBMS)

A DDBMS manages a database distributed across multiple sites (nodes), connected via a network, to provide
transparent access to data.

● Key Features:
○ Data Distribution: Data split across sites via:
■ Fragmentation: Divide tables (e.g., horizontal: rows split by condition; vertical: columns split).
■ Replication: Copies of data at multiple sites for availability.
○ Transparency: Users see a single logical database (hides distribution details).
■ Location Transparency: No need to know where data is stored.
■ Replication Transparency: No need to know about copies.
○ Distributed Query Processing: Queries optimized across sites, minimizing data transfer.
○ Distributed Transactions: Ensure ACID properties across sites using protocols like Two-Phase
Commit (2PC):
■ Prepare Phase: All sites agree to commit.
■ Commit Phase: All sites commit or abort.
● Example: A bank with branches in New York and London. Customer data is fragmented:
○ New York: Customers with ID < 1000.
○ London: Customers with ID ≥ 1000.
○ A query for a customer’s balance is routed to the appropriate site.
● Advantages:
○ Localized access reduces latency.
○ Scalability and fault tolerance via replication.
● Disadvantages:
○ Complex query optimization and transaction management.
○ Network latency and reliability issues.
● Key Challenges:
○ Concurrency Control: Use distributed locking or timestamp ordering.
○ Deadlock Detection: Global wait-for graphs across sites.
○ Recovery: Ensure consistency after site failures (e.g., via logs and 2PC).

Practice Problem: For a DDBMS with two sites, explain how a query “SELECT * FROM Customers WHERE
balance > 1000” is processed if data is horizontally fragmented by region.

Exam Tip: Understand fragmentation, replication, and 2PC. Be ready to explain transparency types or compare
DDBMS vs. centralized DBMS.

Data Structures and Algorithm Analysis in C++, Third Edition
From Everand
Data Structures and Algorithm Analysis in C++, Third Edition
Clifford A. Shaffer
4.5/5 (5)
MS SQL Server Interview Questions
100% (4)
MS SQL Server Interview Questions
17 pages
DBMS madam impt list
No ratings yet
DBMS madam impt list
38 pages
Dbms q-a
No ratings yet
Dbms q-a
4 pages
UNIT1-5
No ratings yet
UNIT1-5
17 pages
DBMS NOTES
No ratings yet
DBMS NOTES
17 pages
Database Engineering Detailed Answers With Examples
No ratings yet
Database Engineering Detailed Answers With Examples
4 pages
Database Engineering Detailed Answers Fixed
No ratings yet
Database Engineering Detailed Answers Fixed
4 pages
DBMS Assignment Answers
No ratings yet
DBMS Assignment Answers
14 pages
UG - B.Sc. - Computer Science - 130 52 - BSc-Computer Science - Semester V - RDBMS - 8693
100% (1)
UG - B.Sc. - Computer Science - 130 52 - BSc-Computer Science - Semester V - RDBMS - 8693
364 pages
Notes 1
No ratings yet
Notes 1
8 pages
DBMS
No ratings yet
DBMS
53 pages
Database Management Systems - Course Outline
No ratings yet
Database Management Systems - Course Outline
5 pages
DBMS
No ratings yet
DBMS
25 pages
3410
No ratings yet
3410
5 pages
Dbms Intro
No ratings yet
Dbms Intro
34 pages
DBMS for BCA
No ratings yet
DBMS for BCA
33 pages
Dbms Question Bank Full Solution
No ratings yet
Dbms Question Bank Full Solution
41 pages
dbms ppt
No ratings yet
dbms ppt
28 pages
DBMS_Combined_Notes
No ratings yet
DBMS_Combined_Notes
5 pages
Midsem Exam Dbe
No ratings yet
Midsem Exam Dbe
29 pages
DBMS_Notes_Unit1_Unit2
No ratings yet
DBMS_Notes_Unit1_Unit2
4 pages
Fundamentals of database outline
No ratings yet
Fundamentals of database outline
8 pages
Database Management System (DBMS) : Convenient Efficient
No ratings yet
Database Management System (DBMS) : Convenient Efficient
11 pages
IT2306-Database Systems
No ratings yet
IT2306-Database Systems
5 pages
DBMS Unit1
No ratings yet
DBMS Unit1
30 pages
Btcse 501 Introduction To Database System
No ratings yet
Btcse 501 Introduction To Database System
12 pages
COM 228 (Ques. & Ans)
No ratings yet
COM 228 (Ques. & Ans)
9 pages
Unit 1 PART A
No ratings yet
Unit 1 PART A
59 pages
Fundamentals of Databasec Ourseoutline
No ratings yet
Fundamentals of Databasec Ourseoutline
4 pages
4 Database Management Basics
No ratings yet
4 Database Management Basics
49 pages
Chapter 1k
No ratings yet
Chapter 1k
34 pages
soft_DeepAI
No ratings yet
soft_DeepAI
23 pages
DBMS notes
No ratings yet
DBMS notes
15 pages
0 Introduction PDF
No ratings yet
0 Introduction PDF
35 pages
Database Split Schedule 16.2.20
No ratings yet
Database Split Schedule 16.2.20
8 pages
DBMS NOTES v23
No ratings yet
DBMS NOTES v23
99 pages
DBMS
No ratings yet
DBMS
11 pages
DBMS Hints Final
No ratings yet
DBMS Hints Final
59 pages
EEE207 Database Concepts Lecture 1 v2
No ratings yet
EEE207 Database Concepts Lecture 1 v2
26 pages
Databases Note
No ratings yet
Databases Note
6 pages
Lecture 1 Database Management System
No ratings yet
Lecture 1 Database Management System
24 pages
Database Engineering Complete Answers Fixed
No ratings yet
Database Engineering Complete Answers Fixed
4 pages
DBMS
No ratings yet
DBMS
24 pages
DBMS-1
No ratings yet
DBMS-1
15 pages
Dbmsconcepts Intro
No ratings yet
Dbmsconcepts Intro
50 pages
Introduction to Rdbms
No ratings yet
Introduction to Rdbms
5 pages
dbms pyq
No ratings yet
dbms pyq
14 pages
Ism Lab File
No ratings yet
Ism Lab File
52 pages
Chapter 1 DBMS DJSCE
No ratings yet
Chapter 1 DBMS DJSCE
27 pages
Course Out Lin
No ratings yet
Course Out Lin
6 pages
x0hwSXFvlScNokRLZfwJsKRqudtgq7uOQX00EWAj
No ratings yet
x0hwSXFvlScNokRLZfwJsKRqudtgq7uOQX00EWAj
4 pages
Data Base Management Systems Notes
No ratings yet
Data Base Management Systems Notes
93 pages
DBMS Notes
No ratings yet
DBMS Notes
14 pages
نظري-1
No ratings yet
نظري-1
10 pages
DBMS (1)
No ratings yet
DBMS (1)
16 pages
cb3401-unit-2
No ratings yet
cb3401-unit-2
24 pages
R19 DBMS Material
No ratings yet
R19 DBMS Material
207 pages
SQL Interview Success From Beginner To Pro
From Everand
SQL Interview Success From Beginner To Pro
Shana
No ratings yet
Data Structures in C / C ++: Exercises and Solved Problems
From Everand
Data Structures in C / C ++: Exercises and Solved Problems
Fulbia Torres
No ratings yet
SQL Query Basics
From Everand
SQL Query Basics
Isabella Ramirez
No ratings yet
The Internals of PostgreSQL - Chapter 1 Database Cluster, Databases, and Tables
No ratings yet
The Internals of PostgreSQL - Chapter 1 Database Cluster, Databases, and Tables
10 pages
File Structures: Data Representation in Memory
No ratings yet
File Structures: Data Representation in Memory
107 pages
Various Data Structure
No ratings yet
Various Data Structure
56 pages
B Trees
No ratings yet
B Trees
51 pages
Designing Data-Intensive Apps - Ch 3
No ratings yet
Designing Data-Intensive Apps - Ch 3
7 pages
DBMS Module 3.3 PDF
No ratings yet
DBMS Module 3.3 PDF
16 pages
6 Syllabus ISE
No ratings yet
6 Syllabus ISE
22 pages
Infix and Postfix Expressions
No ratings yet
Infix and Postfix Expressions
32 pages
Oracle Database 12c Available For Download
No ratings yet
Oracle Database 12c Available For Download
27 pages
Fs Mini Project: Employee Management System
No ratings yet
Fs Mini Project: Employee Management System
21 pages
MCS 207
No ratings yet
MCS 207
10 pages
II Year - II Semester L T P C 4 0 0 3
No ratings yet
II Year - II Semester L T P C 4 0 0 3
3 pages
Modern B Tree Techniques 1
No ratings yet
Modern B Tree Techniques 1
88 pages
Dbms Unit III Notes
No ratings yet
Dbms Unit III Notes
27 pages
Data Structure & Algorithms - CSC 221: Instructor: Dr. Muhammad Asfand-E-Yar
No ratings yet
Data Structure & Algorithms - CSC 221: Instructor: Dr. Muhammad Asfand-E-Yar
73 pages
Binary Trees-Unit 4
No ratings yet
Binary Trees-Unit 4
33 pages
Implementation Techniques - Unit 4
No ratings yet
Implementation Techniques - Unit 4
29 pages
Oracle Vs Sybase PDF
No ratings yet
Oracle Vs Sybase PDF
31 pages
Qestion Bank
No ratings yet
Qestion Bank
260 pages
Chapter 5-Record Storage and Primary File Organization
100% (1)
Chapter 5-Record Storage and Primary File Organization
64 pages
B Tree
No ratings yet
B Tree
16 pages
ADBMS Chapter No. 6
No ratings yet
ADBMS Chapter No. 6
24 pages
Data Structures
No ratings yet
Data Structures
49 pages
3 Solutions Clrs 18
No ratings yet
3 Solutions Clrs 18
4 pages
Data Structures and Algorithm: Avl Tree
No ratings yet
Data Structures and Algorithm: Avl Tree
42 pages
Data Structures and Algorithms
No ratings yet
Data Structures and Algorithms
41 pages
BCA (R) Session 2012-13 - 15 - 16 (1) - 23 - 7 - 18
No ratings yet
BCA (R) Session 2012-13 - 15 - 16 (1) - 23 - 7 - 18
27 pages
21._June_2020 MCS-021 IGNOUAssignmentGuru.com
No ratings yet
21._June_2020 MCS-021 IGNOUAssignmentGuru.com
4 pages

DBMS NOTES

Uploaded by

DBMS NOTES

Uploaded by

Unit 1: Database & DBMS Fundamentals

1. Basic Concepts: Database & Database Users

2. Characteristics of Database Systems

Database systems have unique features that make them powerful:

3. Concepts and Architecture

●​ Data Models: Define how data is structured and related.

5. DBMS Architecture & Data Independence (Revisited)

6. Database Languages & Interfaces

7. Data Modelling Using the Entity-Relationship (ER) Approach

9. SQL – DDL, DCL, DML, Views, and Indexes

10. Defining Constraints

Constraints ensure data integrity:

●​ Primary Key: Uniquely identifies each row (e.g., ID in Students table).

Study Strategies for Your Exam

Sample Practice Question

CREATE TABLE Members (

CREATE TABLE Loans (

1. Relational Model Concepts

●​ 102 | Alice | 3.8​

2. Relational Model Constraints

Constraints ensure data integrity in the relational model:

●​ Common Aggregate Functions:

SQL supports set operations to combine query results:

●​ UNION: Combines rows from two queries, removing duplicates.

8. Subqueries and Correlated Subqueries

●​ Subqueries: A query nested inside another query.

Example: Find students with above-average GPA:​

●​ Correlated Subqueries: A subquery that references the outer query’s table.

Example: Find students enrolled in at least one course:​

9. GROUP BY, HAVING, ORDER BY

Example: Find majors with average GPA > 3.5:​

Example: Sort students by GPA:​

10. Joins and Their Types

Joins combine rows from two or more tables based on a condition.

Example: List all students, including those not enrolled:​

●​ EXISTS: Checks if a subquery returns any rows.

Example: Find students enrolled in any course:​

●​ ANY: Compares a value to any value in a subquery.

●​ ALL: Compares a value to all values in a subquery.

Example: Find students with GPA greater than all CS majors:​

12. Views and Their Types

●​ Views: Virtual tables based on a query, used for simplicity or security.

Example: Create a view for high-GPA students:​

13. Transaction Control Commands

●​ Transactions: A sequence of operations treated as a single unit.

Sample Practice Question

Functional Dependencies & Normalization

Example: Given a relation R(A, B, C, D) with FDs {A → B, B → C}, compute {A}⁺:

1.​ Start with {A}⁺ = {A}.

1NF (First Normal Form)

●​ Requirement: All attributes are atomic (no multi-valued or composite attributes).

3NF (Third Normal Form)

BCNF (Boyce-Codd Normal Form)

●​ Requirement: For every FD A → B, A is a superkey (stricter than 3NF).

Lossless Join Decomposition

Dependency Preserving Decomposition

●​ Step 1: Check 1NF (assume atomic attributes).

2. Normal Forms Based on Multivalued & Join Dependencies

Example: Table {Student, Course, Hobby} with data:​

○​ R1: {Student, Course}

5NF (Fifth Normal Form)

DKNF (Domain-Key Normal Form)

●​ Decompose into {Student, Course} and {Student, Book}.

●​ Active: Executing operations.

Example: A transaction transferring $100:

1.​ Active: Reads balance, updates accounts.

4. Transaction Schedules & Serializability

●​ Serial Schedule: No interleaving (e.g., T1 then T2).

●​ Conflicts: T1’s R(A) → T2’s W(A), T1’s W(B) → T2’s W(B).

●​ Conflicts: T1 → T2 (R(A)-W(A)), T2 → T1 (W(B)-R(B)).

5. Concurrency Control Techniques

●​ Shared Lock (S): Allows reading, blocks writing.

●​ Assigns each transaction a unique timestamp.

Locking can apply to:

●​ Database, table, page, or tuple.

●​ T2 can commit only after T1 commits if T2 reads T1’s uncommitted data.

● Data Models: Define how data is structured and related.

● Primary Key: Uniquely identifies each row (e.g., ID in Students table).

● 102 | Alice | 3.8

● Common Aggregate Functions:

● UNION: Combines rows from two queries, removing duplicates.

● Subqueries: A query nested inside another query.

Example: Find students with above-average GPA:

● Correlated Subqueries: A subquery that references the outer query’s table.

Example: Find students enrolled in at least one course:

Example: Find majors with average GPA > 3.5:

Example: Sort students by GPA:

Example: List all students, including those not enrolled:

● EXISTS: Checks if a subquery returns any rows.

Example: Find students enrolled in any course:

● ANY: Compares a value to any value in a subquery.

● ALL: Compares a value to all values in a subquery.

Example: Find students with GPA greater than all CS majors:

● Views: Virtual tables based on a query, used for simplicity or security.

Example: Create a view for high-GPA students:

● Transactions: A sequence of operations treated as a single unit.

1. Start with {A}⁺ = {A}.

● Requirement: All attributes are atomic (no multi-valued or composite attributes).

● Requirement: For every FD A → B, A is a superkey (stricter than 3NF).

● Step 1: Check 1NF (assume atomic attributes).

Example: Table {Student, Course, Hobby} with data:

○ R1: {Student, Course}

● Decompose into {Student, Course} and {Student, Book}.

● Active: Executing operations.

1. Active: Reads balance, updates accounts.

● Serial Schedule: No interleaving (e.g., T1 then T2).

● Conflicts: T1’s R(A) → T2’s W(A), T1’s W(B) → T2’s W(B).

● Conflicts: T1 → T2 (R(A)-W(A)), T2 → T1 (W(B)-R(B)).

● Shared Lock (S): Allows reading, blocks writing.

● Assigns each transaction a unique timestamp.

● Database, table, page, or tuple.

● T2 can commit only after T1 commits if T2 reads T1’s uncommitted data.

● Structure: Records stored in no particular order (appended as inserted).

● Structure: Records sorted by a key (e.g., ID).

● Heap File: Linear scan, up to 1000 / 10 = 100 block accesses.

● Step 1: Insert 10: Root = [10].

Example: In an OODBMS, a Student class: