Previous Year Solved Question Paper
PART-A
1. Illustrate with an example, the difference between the conceptual data models
and the physical data models.
2. How is weak entity type different from a strong entity type? Give an example.
A strong entity type has a primary key of its own and can be uniquely identified by its own attributes. A weak entity type, in contrast, does not have a primary key of its own; it is identified through its identifying relationship with an owner (strong) entity type, and its partial key is combined with the primary key of its owner entity to form a unique identifier.
Let's consider an example of a "Bank Account" entity in a banking system.
The "Bank Account" entity has attributes like "AccountNumber," "AccountType," and
"Balance." However, a bank account cannot be uniquely identified on its own. It
requires the existence of an owner entity, such as "Customer," to establish uniqueness.
In this example, the "Bank Account" entity is a weak entity type, and the
"Customer" entity is its owner entity. The combination of the "AccountNumber"
attribute and the "CustomerID" (primary key of the "Customer" entity) forms a unique
identifier for each bank account.
In summary, the key difference between a weak entity type and a strong entity
type is that a strong entity type can be uniquely identified independently, while a
weak entity type relies on an owner entity for identification. The weak entity type's
unique identifier includes attributes from both itself and its owner entity.
3. What is an entity integrity constraint?
Entity integrity constraint is a rule or condition that ensures the uniqueness and
non-nullness of the primary key attribute in a database table. It guarantees that each
instance or row of a table has a unique and non-null value for its primary key attribute.
In other words, it ensures that there are no duplicate or missing primary key values
within a table.
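For example (a minimal sketch with an assumed EMPLOYEE table), declaring a column as the PRIMARY KEY makes the DBMS enforce entity integrity automatically:

CREATE TABLE EMPLOYEE (
    EmpId INT PRIMARY KEY,   -- must be unique and cannot be NULL
    Name  VARCHAR(50)
);
INSERT INTO EMPLOYEE VALUES (1, 'Anu');      -- accepted
INSERT INTO EMPLOYEE VALUES (1, 'Binu');     -- rejected: duplicate primary key value
INSERT INTO EMPLOYEE VALUES (NULL, 'Cini');  -- rejected: primary key cannot be NULL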
4. Using the following ER diagram, create a relation database. Give your assumptions.
PART-B
5. a) With the help of an example, compare DML and DDL.
b) What are logical data independence and physical data independence? What is the
difference between them? Which of these is harder to realize? Why?
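For part (a), the difference can be illustrated with a short SQL sketch (the STUDENT table and its columns are assumed purely for illustration): DDL statements define or change the schema, whereas DML statements operate on the data stored in that schema.

-- DDL: defines / alters the structure of the database
CREATE TABLE STUDENT (
    RollNo INT PRIMARY KEY,
    Name   VARCHAR(50)
);
ALTER TABLE STUDENT ADD Branch VARCHAR(30);

-- DML: manipulates the data held in that structure
INSERT INTO STUDENT (RollNo, Name, Branch) VALUES (1, 'Asha', 'CSE');
UPDATE STUDENT SET Branch = 'ECE' WHERE RollNo = 1;
SELECT Name FROM STUDENT WHERE Branch = 'ECE';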
6. Design an ER diagram to represent the following scenario: A company has many employees working on a project. An employee can be part of one or more projects. Each employee works on a project for a certain amount of time. Assume suitable attributes for entities and relations. Mark the primary key(s) and the cardinality ratio of the relations.
Entities:
1. Employee
2. Project
Relationships:
1. Works On (between Employee and Project)
2. Duration (recording how long an employee works on a project)
Sample Diagram (not reproduced here)
Attributes:
1. Employee:
o Employee ID (Primary Key)
o Employee Name
o Employee Role
o Other employee attributes as needed
2. Project:
o Project ID (Primary Key)
o Project Name
o Project Description
o Other project attributes as needed
3. Works On:
o Employee ID (Foreign Key referencing Employee)
o Project ID (Foreign Key referencing Project)
o Start Date
o End Date (if applicable)
4. Duration:
o Employee ID (Foreign Key referencing Employee)
o Project ID (Foreign Key referencing Project)
o Duration (in days)
Cardinality Ratios:
An employee can be part of one or more projects (Many-to-Many). The cardinality ratio between
Employee and Works On will be (0,N) on the Employee side and (0,N) on the Works On side.
A project can have one or more employees working on it (Many-to-Many). The cardinality ratio
between Project and Works On will be (0,N) on the Project side and (0,N) on the Works On side.
Each employee works on a project for a certain amount of time (One-to-One). The cardinality ratio
between Works On and Duration will be (0,1) on both sides.
7. Consider the following relations for a database that keeps track of business trips of salespersons in a sales office:
SALESPERSON(Ssn, Name, StartYear, DeptNo)
TRIP(Ssn, FromCity, ToCity, DepartureDate, ReturnDate, TripId)
EXPENSE(TripId, AccountNo, Amount)
*We assume that SALESPERSON relation's Ssn attribute is the primary key.
*We assume that TRIP relation's TripId attribute is the primary key.
*We assume that EXPENSE relation's combination of TripId and AccountNo is the primary key.
b) Write a relational algebra expression to get the details of salespersons who have travelled between Mumbai and Delhi and whose travel expense is greater than Rs. 50000.
σ FromCity = 'Mumbai' ∧ ToCity = 'Delhi' ∧ Amount > 50000 (SALESPERSON ⨝ TRIP ⨝ EXPENSE)
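The equivalent SQL query (assuming the EXPENSE relation has an Amount attribute, as used in the expression above) would be:

SELECT S.*
FROM SALESPERSON S
JOIN TRIP T    ON S.Ssn = T.Ssn
JOIN EXPENSE E ON T.TripId = E.TripId
WHERE T.FromCity = 'Mumbai'
  AND T.ToCity   = 'Delhi'
  AND E.Amount   > 50000;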
c) Write a relational algebra expression to get the details of the salesperson who incurred the greatest travel expense among all trips made.
Using the aggregate function operator ℑ to compute the maximum expense first:
MAXEXP(MaxAmount) ← ℑ MAX Amount (EXPENSE)
RESULT ← σ Amount = MaxAmount (SALESPERSON ⨝ TRIP ⨝ EXPENSE × MAXEXP)
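The same joins combined with a MAX subquery give a SQL equivalent of part (c):

SELECT S.*
FROM SALESPERSON S
JOIN TRIP T    ON S.Ssn = T.Ssn
JOIN EXPENSE E ON T.TripId = E.TripId
WHERE E.Amount = (SELECT MAX(Amount) FROM EXPENSE);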
PART-C
8. With the help of an example, illustrate the use of SQL TRIGGER.
Benefits of Triggers
*Generating some derived column values automatically
*Enforcing referential integrity
*Event logging and storing information on table access
*Auditing
*Synchronous replication of tables
*Imposing security authorizations
*Preventing invalid transactions
General syntax of a trigger:
CREATE [OR REPLACE] TRIGGER trigger_name
{BEFORE | AFTER}
{INSERT [OR] | UPDATE [OR] | DELETE}
[OF column_name]
ON table_name
[FOR EACH ROW]
WHEN (condition)
BEGIN
   ... trigger body statements ...
END;
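As a concrete illustration of the syntax above (a sketch in Oracle-style PL/SQL; the EMPLOYEE and EMP_AUDIT tables and their columns are assumed for the example), the following row-level trigger logs every salary change into an audit table:

CREATE OR REPLACE TRIGGER trg_salary_audit
AFTER UPDATE OF Salary ON EMPLOYEE
FOR EACH ROW
WHEN (NEW.Salary <> OLD.Salary)
BEGIN
    -- record which employee was changed, the old and new salary, and when
    INSERT INTO EMP_AUDIT (EmpId, OldSalary, NewSalary, ChangedOn)
    VALUES (:OLD.EmpId, :OLD.Salary, :NEW.Salary, SYSDATE);
END;

Whenever an UPDATE changes the Salary column of an EMPLOYEE row, the trigger fires once per affected row and inserts an audit record automatically.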
9. List the basic data types available for defining attributes in SQL?
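The basic SQL data types include numeric types (INTEGER, SMALLINT, NUMERIC(p, s) / DECIMAL(p, s), FLOAT, REAL), character-string types (CHAR(n), VARCHAR(n)), bit-string types, date and time types (DATE, TIME, TIMESTAMP), and BOOLEAN. A small sketch using several of them (the BOOK table is assumed for illustration; exact type support varies between DBMSs):

CREATE TABLE BOOK (
    ISBN        CHAR(13)      PRIMARY KEY,  -- fixed-length character string
    Title       VARCHAR(100),               -- variable-length character string
    Price       DECIMAL(8, 2),              -- exact numeric with 2 decimal places
    PageCount   INTEGER,
    PublishedOn DATE,
    InStock     BOOLEAN                     -- not supported by every DBMS
);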
To find the closure of attribute A, we need to determine all the attributes that
can be functionally determined by A through the given set of functional dependencies
F.
Starting with A, let's calculate the closure step by step:
1. A → BC (Given)
The closure now includes A, B, and C.
2. C → BD (Given)
Since A determines C, and C determines B and D, we can add B and D to the closure.
The closure now includes A, B, C, and D.
3. BF → E (Given)
Since A determines B, and B in combination with F determines E, we can
add E to the closure.
The closure now includes A, B, C, D, and E.
4. F → D (Given)
Since A determines F, and F determines D, we can add D to the closure.
The closure now includes A, B, C, D, E, and F.
At this point, the closure includes all attributes of the relation R={A, B, C, D, E, F}.
Therefore, the closure of A is {A, B, C, D, E, F}.
To determine if A is a candidate key, we need to check if it is a superkey and if it is
minimal.
A superkey is a set of attributes that can uniquely identify each tuple in a
relation. Since the closure of A includes all attributes of the relation R, A is a
superkey.
To check if A is minimal, we can check if removing any attribute from A would
still be able to uniquely identify each tuple. In this case, removing any attribute from A
would result in the closure not including all attributes of R, and therefore it would not
be able to uniquely identify each tuple. Therefore, A is minimal.
Therefore, A is a candidate key for the relation R.
11. What are fully functional dependencies and partial functional dependencies? Give an
example to distinguish between these?
A functional dependency X → Y is a fully functional dependency if Y depends on the whole of X, that is, removing any attribute from X destroys the dependency. It is a partial functional dependency if Y is determined by only a proper subset of X, which typically happens when a non-key attribute depends on only part of a composite primary key.
For example, let's consider a relation called Employees with attributes (EmployeeID, FirstName, LastName, Address). Here, the primary key is EmployeeID. If we have the functional dependency EmployeeID → FirstName, it means that for each unique EmployeeID there is a unique FirstName associated with it. This is a fully functional dependency because the entire determinant (EmployeeID) is needed to determine the FirstName attribute.
Continuing with the Employees example, suppose instead that the primary key were the composite (EmployeeID, Address) and that the dependency (EmployeeID, Address) → FirstName holds. Since EmployeeID alone already determines FirstName, the attribute Address is extraneous: FirstName depends on only part of the composite key. This is a partial functional dependency.
PART-D
12. a) Consider the following table MARKS. Why is the table not in 1NF? Reconstruct it so that it is in 1NF. (The MARKS table itself is not reproduced here.)
The given table "MARKS" is not in the first normal form (1NF) because it
violates the rule that each cell in a table should contain a single atomic value. The
table has repeating groups of attributes (Marks, Subject Code, and Subject Name) for
each student, leading to redundancy and difficulty in interpreting the data.
To reconstruct the table into 1NF, we need to separate the repeating groups into
separate tables and establish appropriate relationships between them. Here's the
modified table structure:
Table 1: STUDENTS
- Roll No. (Primary Key)
- Name
Table 2: SUBJECTS
- Subject Code (Primary Key)
- Subject Name
Table 3: MARKS
- Roll No. (Foreign Key referencing STUDENTS.Roll No.)
- Subject Code (Foreign Key referencing SUBJECTS.Subject Code)
- Marks
The reconstructed tables eliminate the repeating groups and follow 1NF guidelines by
ensuring that each table contains atomic values in each cell. The STUDENTS table
stores information about the students, the SUBJECTS table contains subject
information, and the MARKS table stores the marks obtained by each student in each
subject.
By separating the data into multiple tables and establishing appropriate relationships,
we achieve a normalized structure that adheres to the 1NF requirements.
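A possible SQL definition of the reconstructed schema (data types are assumed for illustration):

CREATE TABLE STUDENTS (
    RollNo INT PRIMARY KEY,
    Name   VARCHAR(50)
);

CREATE TABLE SUBJECTS (
    SubjectCode VARCHAR(10) PRIMARY KEY,
    SubjectName VARCHAR(50)
);

CREATE TABLE MARKS (
    RollNo      INT,
    SubjectCode VARCHAR(10),
    Marks       INT,
    PRIMARY KEY (RollNo, SubjectCode),
    FOREIGN KEY (RollNo)      REFERENCES STUDENTS (RollNo),
    FOREIGN KEY (SubjectCode) REFERENCES SUBJECTS (SubjectCode)
);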
b) When is a relational schema said to be in 3NF? How is BCNF different from 3NF?
A relational schema is said to be in the third normal form (3NF) when it satisfies the
following conditions:
1. It is in the second normal form (2NF).
2. There are no transitive dependencies between non-key attributes.
To understand the difference between 3NF and Boyce-Codd Normal Form (BCNF), let's first define BCNF. A relational schema is in BCNF if, for every non-trivial functional dependency X → Y that holds on it, X is a superkey. Now, the key differences between 3NF and BCNF are as follows:
1. Dependency Consideration:
- In 3NF, there should be no transitive dependencies between non-key
attributes, meaning that the attributes should not depend on each other through
other attributes.
- In BCNF, every determinant must be a superkey. That is, every non-trivial functional dependency must have a superkey on the left-hand side (LHS).
2. Preservation of Dependencies:
- 3NF relaxes the BCNF condition slightly: a dependency X → A is permitted even when X is not a superkey, provided A is a prime attribute (part of some candidate key). This relaxation makes it possible to preserve all functional dependencies in a 3NF decomposition.
- BCNF, on the other hand, eliminates all non-trivial functional dependencies whose determinants are not superkeys. This removes more redundancy, but a dependency-preserving BCNF decomposition is not always possible.
i.) COUNT() – Returns the number of rows that match a specified criterion.
Syntax: SELECT COUNT(column_name) FROM table_name WHERE condition;
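A minimal usage sketch (the EMPLOYEE table and its columns are assumed for illustration):

-- number of employees in department 10
SELECT COUNT(*) FROM EMPLOYEE WHERE DeptNo = 10;

-- COUNT(column_name) ignores NULLs in that column
SELECT COUNT(ManagerId) FROM EMPLOYEE;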
b) Given a relation R(A, B, C), find the minimal cover of the given set of functional dependencies F = {A→BC, B→C, A→B, AB→C}.
To find the minimal cover of a set of functional dependencies, we simplify the set so that every right-hand side is a single attribute, no left-hand side contains an extraneous attribute, and no dependency is redundant. Here's the step-by-step process:
1. Make every right-hand side a single attribute:
- A→BC is split into A→B and A→C.
- The set becomes F = {A→B, A→C, B→C, AB→C}.
2. Eliminate extraneous attributes from the left-hand sides:
- Only AB→C has a composite LHS, so check whether A or B is extraneous in it.
- B+ under F is {B, C}, which already contains C, so A is extraneous in AB→C; the dependency reduces to B→C, which is already in the set.
- The set becomes F = {A→B, A→C, B→C}.
3. Eliminate redundant dependencies:
- A→C is redundant, because A→B and B→C together imply it (A+ computed without A→C is {A, B, C}).
- Neither A→B nor B→C can be removed without losing information.
The minimal cover of the given set of functional dependencies is therefore:
Fmin = {A→B, B→C}.
To determine the key for relation R, we need to find a minimal set of attributes whose closure contains every attribute of R; such a set is a candidate key for R.
2NF Decomposition:
1. Identify functional dependencies that violate the 2NF requirement, which is to
remove partial dependencies.
2. A partial dependency occurs when a non-key attribute is functionally dependent on
only part of the key.
R1 (B, F)
R2 (A, B, C, D, E, G, H)
3NF Decomposition:
1. Identify functional dependencies that violate the 3NF requirement, which is to
remove transitive dependencies.
2. A transitive dependency occurs when a non-key attribute is functionally dependent
on another non-key attribute.
These relations are in 2NF and 3NF, respectively, following the decomposition
process.
PART-E
15. a) Suppose that we have an ordered file with 400,000 records stored on a disk with
block size 4,096 bytes. File records are of fixed size and are unspanned, with record
length 200 bytes. How many blocks are needed for the file? Approximately, how
many block accesses are required for a binary search in this file? On an average,
how many block accesses are required for a linear search, if the file is nonordered?
To determine the number of blocks needed, we first compute the blocking factor, i.e., how many whole records fit in one block (records are unspanned, so a record cannot cross a block boundary):
Blocking factor bfr = floor(Block size / Record length) = floor(4,096 / 200) = 20 records per block
Number of blocks b = ceiling(Number of records / bfr) = ceiling(400,000 / 20) = 20,000 blocks
For a binary search on the ordered file, each step reads one block and halves the remaining search space, so in the worst case the number of block accesses is approximately log2 of the number of blocks:
Number of block accesses for binary search ≈ ceiling(log2(20,000)) = 15 block accesses
On average, for a linear search in a non-ordered file, we need to scan through half of the blocks, assuming the desired record is equally likely to be anywhere in the file:
Number of block accesses for linear search ≈ b / 2 = 20,000 / 2 = 10,000 block accesses
Keep in mind that these calculations provide estimates based on the assumptions
mentioned. The actual performance may vary depending on factors such as disk
seek time, caching, and file organization.
Example:
Suppose we have an index on the "ID" attribute of the file. The index allows
us to quickly locate the block address where a specific ID is stored.
If we want to search for a record with ID = 500, instead of performing a linear
search through the file, we can use the index to directly find the block address
associated with ID = 500. This significantly reduces the number of block accesses
required for the search.
Let's say the index lookup for ID = 500 gives us the block address as Block 7.
With the index, we only need to access Block 7 to retrieve the desired record,
resulting in just one block access.
In this example, the use of indexing reduces the search time from potentially
thousands of block accesses in a linear search to just one block access. This illustrates
the significant improvement in search efficiency that indexing provides.
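In SQL terms, the same idea can be sketched as follows (the table and index names are illustrative; whether the index is actually used is decided by the query optimizer):

-- build an index on the ID attribute of the file
CREATE INDEX idx_emp_id ON EMPLOYEE_FILE (ID);

-- this lookup can now be answered via the index
-- instead of a full (linear) scan of the file
SELECT * FROM EMPLOYEE_FILE WHERE ID = 500;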
16. a) Explain the structure of an internal node and a leaf node in a B+-tree.
In a B+-tree, both internal nodes and leaf nodes play essential roles in
organizing and accessing data efficiently. Let's discuss the structure of each node type:
1. Internal Node:
- An internal node in a B+-tree contains an ordered list of keys and child pointers; it does not store data records itself.
- The keys act as separators for the ranges of key values covered by the child subtrees: values in the subtree to the left of a key are less than or equal to that key, and values in the subtree to its right are greater than it (the exact convention may vary).
- The child pointers point to the corresponding child nodes.
- An internal node of a B+-tree of order p holds at most p child pointers and p − 1 keys.
- Internal nodes facilitate efficient navigation through the tree structure by guiding the
search path towards the appropriate leaf node.
2. Leaf Node:
- The leaf nodes in a B+-tree store the actual data records or key-value pairs of the
indexed data.
- Each leaf node contains an ordered list of key-value pairs.
- The key-value pairs in the leaf nodes are sorted based on the keys.
- The leaf nodes are connected in a linked list structure, allowing sequential access
to the records.
- Each leaf node also has a pointer to the next leaf node in the linked list.
- The leaf nodes store the complete data records or key-value pairs and are
responsible for storing and retrieving the actual data.
The structure of a B+-tree is optimized for efficient range searches and sequential
access. The internal nodes provide a hierarchy and guide the search path, while the
leaf nodes store the actual data and support efficient range queries and ordered
traversal. This separation of internal and leaf nodes allows for a balanced tree
structure with improved performance characteristics.
b) Illustrate with an example how searching for a record with search key field value is
done using a B+-Tree.
Let's consider an example to illustrate how searching for a record with a search key
field value is done using a B+-tree.
Suppose we have a B+-tree that stores student records, where each record has a search
key field value representing the student ID. The B+-tree has the following structure:
(The B+-tree diagram is not reproduced here.) In the diagram, the root internal node holds the keys [7, 14, 23, 35], one of its children is the internal node [9, 12], and one of the leaf nodes is [9, 10]; key values are shown in square brackets and the arrows represent child pointers.
Now, let's assume we want to search for a student record with the student ID 10.
1. Starting at the root node, we compare the search key value (10) with the key
values in the internal node [7, 14, 23, 35].
2. Since 10 lies between 7 and 14, we follow the child pointer between those two keys and move to the next level.
3. Now, we compare the search key value (10) with the key values in the internal
node [9, 12].
4. Since 10 lies between 9 and 12, we follow the corresponding child pointer.
5. We reach the leaf node [9, 10], which contains the search key value (10).
6. We have found the desired record with the search key field value 10.
In this example, the B+-tree allowed us to efficiently search for the record with the
search key field value 10. We started at the root node and made comparisons to
determine the appropriate child node to follow. By narrowing down the search path
based on the key values in the internal nodes, we reached the leaf node that contained
the desired record.
The structure and organization of the B+-tree, along with its search algorithm, enable
efficient searching and retrieval of records based on their search key field values.
17. Why is concurrency control needed? What are the different types of problems we
may encounter when two transactions run concurrently? Illustrate each problem with
suitable examples.
Concurrency control is needed so that simultaneously executing transactions do not interfere with each other and leave the database in an inconsistent state. The main problems that can arise when two transactions run concurrently without control are the lost update problem, the dirty read (uncommitted dependency) problem, and the incorrect/unrepeatable read problem.
Example (Lost Update Problem):
Suppose there are two transactions, T1 and T2, both updating the same account balance concurrently:
T1: Read balance = $500
Deduct $100
Write balance = $400
T2: Read balance = $500
Deduct $50
Write balance = $450
If both transactions read the balance before either one writes, T2's write of $450 overwrites T1's write of $400, so the $100 deducted by T1 is lost; the correct final balance should have been $350.
Example (Dirty Read Problem):
Consider two transactions, T1 and T2, where T1 updates a customer's phone
number, and T2 reads the updated phone number:
T1: Update customer's phone number to 1234567890 (not yet committed)
T2: Reads the customer's phone number (dirty read)
If T1 rolls back before committing the update, T2 will have read an invalid or
incorrect phone number, leading to data inconsistency.
Example (Unrepeatable Read Problem):
Suppose there are two transactions, T1 and T2, where T1 reads a customer's account balance twice during its execution:
T1: Read account balance (first read)
Perform some calculations
Read account balance again (second read)
If T2 modifies the account balance between the two read operations of T1, T1 will see two inconsistent values of the balance, leading to incorrect calculations or decisions.
Concurrency control is therefore essential for maintaining data integrity and consistency in a multi-user database system. Various
concurrency control mechanisms, such as locking, timestamps, and serializability
techniques, are employed to prevent these problems and ensure correct and reliable
execution of concurrent transactions.
1. Atomicity:
Atomicity ensures that a transaction is treated as a single, indivisible unit of
work. It guarantees that either all the operations within a transaction are successfully
completed and permanently saved to the database, or none of them are performed at
all. If any part of the transaction fails, all changes made by the transaction are rolled back, and the database remains unchanged (see the sketch after this list of properties).
2. Consistency:
Consistency ensures that a transaction brings the database from one consistent
state to another consistent state. The database must satisfy a set of predefined integrity
constraints and rules before and after the execution of a transaction. In other words, a
transaction should preserve the consistency of the data and not violate any integrity
constraints.
3. Isolation:
Isolation ensures that concurrent execution of multiple transactions produces
the same results as if the transactions were executed sequentially, one after another.
Each transaction must operate independently of other concurrently executing
transactions. Isolation prevents interference, such as dirty reads, non-repeatable reads,
and phantom reads, between concurrent transactions.
4. Durability:
Durability guarantees that once a transaction is committed and changes are
saved to the database, they persist even in the event of system failures, such as power
outages or crashes. The changes made by a committed transaction are considered
permanent and should not be lost or undone.
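Atomicity and durability together are what make the classic funds-transfer pattern safe; a minimal SQL sketch (the ACCOUNTS table, account numbers, and amount are illustrative, and the exact transaction syntax varies by DBMS):

BEGIN;                                                          -- start the transaction
UPDATE ACCOUNTS SET Balance = Balance - 100 WHERE AccNo = 101;
UPDATE ACCOUNTS SET Balance = Balance + 100 WHERE AccNo = 202;
COMMIT;  -- both updates become permanent together; a failure or ROLLBACK before this point undoes both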
b) “If every transaction in a schedule follows the two-phase locking protocol, the
schedule is guaranteed to be serializable”, justify the statement.
The statement "If every transaction in a schedule follows the two-phase
locking protocol, the schedule is guaranteed to be serializable" is true. The
two-phase locking protocol is a concurrency control mechanism that ensures
serializability of transactions in a schedule. Let's justify this statement:
The two-phase locking (2PL) protocol consists of two phases: the growing phase and
the shrinking phase.
1. Growing Phase:
- During the growing phase, a transaction acquires locks on the resources (e.g., database records or tables) it needs before performing its reads and updates.
- In this phase the transaction may acquire new locks but may not release any lock it already holds.
- This ensures that the transaction does not release any lock until it has acquired all the locks it requires, preventing other transactions from modifying the locked resources in the meantime.
2. Shrinking Phase:
- During the shrinking phase, the transaction releases the locks it holds as it completes its updates and reads.
- Once the transaction has released any lock, it may not acquire any new lock, and a released lock cannot be reacquired.
- Releasing the locks allows other transactions to acquire them and proceed with their operations.
Now, let's see how the two-phase locking protocol guarantees serializability:
1. Conflict Serializability:
- The two-phase locking protocol prevents conflicts between transactions by ensuring
that no two transactions acquire conflicting locks simultaneously.
- Conflicts arise when two transactions try to access or modify the same resource
concurrently, leading to potential data inconsistencies.
- By following the two-phase locking protocol, every transaction has a lock point: the moment at which it has acquired all of its locks. Transactions can be ordered by their lock points, and every pair of conflicting operations in the schedule occurs in the same order as the lock points of the transactions involved.
- The schedule is therefore conflict-equivalent to the serial schedule in which the transactions execute in lock-point order, i.e., it is conflict serializable.
c) What are the different types of locks that are commonly used in concurrency control?
In concurrency control, various types of locks are used to manage concurrent access to shared resources and ensure data integrity. The commonly used types of locks are:
1. Shared (Read) Lock:
- A shared lock allows a transaction to read a resource; several transactions may hold shared locks on the same resource at the same time.
2. Exclusive (Write) Lock:
- An exclusive lock allows a transaction to both read and write a resource; while it is held, no other transaction may hold any lock on that resource.
3. Intent Lock:
- Intent locks are used to indicate the intention of a transaction to acquire locks at
different granularities.
- Intent locks are acquired at higher levels in the lock hierarchy to prevent conflicts
and coordinate lock acquisition at lower levels.
- For example, an intent lock at the table level indicates that the transaction intends
to acquire locks on individual rows or columns within the table.
5. Conversion:
- Conversion locks refer to the process of changing one type of lock into another
without releasing the lock completely.
- For example, a transaction holding a shared lock may request a conversion to an
exclusive lock if it needs to modify the resource.
These lock types are used by concurrency control mechanisms, such as the
two-phase locking (2PL) protocol, to control concurrent access to shared resources.
By acquiring and releasing locks based on specific rules and protocols, these locks
help prevent conflicts and maintain data consistency in multi-user database systems.
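Many SQL systems expose shared and exclusive locking directly; for example (PostgreSQL/Oracle-style syntax, shown as an illustration only):

-- acquire a shared (read) lock on the whole table
LOCK TABLE ACCOUNTS IN SHARE MODE;

-- acquire exclusive row-level locks on the selected rows before updating them
SELECT Balance FROM ACCOUNTS WHERE AccNo = 101 FOR UPDATE;
UPDATE ACCOUNTS SET Balance = Balance - 100 WHERE AccNo = 101;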
To optimize the given query, we can apply heuristic optimization rules to rearrange and simplify the query tree. (The initial query tree and the optimized query tree are not reproduced here.)
Explanation:
1. In the initial query tree, the tables INSTRUCTOR and COURSE are joined
first, followed by the TEACHES table. This order can be rearranged to improve
performance.
2. The INSTRUCTOR table has a filter condition on the DEPT column with the value 'MATHS'. Following the heuristic of applying selections as early as possible, this selection is pushed down the tree so that it is applied to INSTRUCTOR before INSTRUCTOR is joined with TEACHES on the ID column. This helps reduce the intermediate result set early in the query execution.
3. The JOIN between the INSTRUCTOR and COURSE tables remains the same, as
there are no further optimization opportunities within those tables.
4. The JOIN between the COURSE and TEACHES tables remains the same, as the
query condition requires matching the COURSE-ID column in both tables.
By optimizing the query tree using these heuristics, we aim to reduce the
intermediate result set early in the execution process, thereby improving the query
performance.
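The original query and its trees are not reproduced above; a plausible SQL form consistent with the explanation (table and column names are taken from the explanation itself, and the projected column is assumed) would be:

SELECT C.Course_Name
FROM INSTRUCTOR I
JOIN TEACHES T ON I.ID = T.ID
JOIN COURSE  C ON T.Course_ID = C.Course_ID
WHERE I.DEPT = 'MATHS';

-- Heuristic optimization pushes the selection DEPT = 'MATHS' below the joins,
-- so it is applied to INSTRUCTOR first and only matching instructors are joined
-- with TEACHES and COURSE.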
Big Data refers to vast and complex sets of data that are too large and diverse
to be easily managed and analyzed using traditional methods. It is characterized by
the three V's: Volume (massive amounts of data), Velocity (generated at high speed),
and Variety (diverse data formats). Big Data is valuable for organizations as it enables
them to gain insights, make data-driven decisions, and improve processes. Advanced
technologies like cloud computing and machine learning are used to handle Big Data.
However, privacy, security, and ethical concerns are also important considerations.
Overall, Big Data has the potential to drive innovation and growth across industries.
Semantic web technology enhances the World Wide Web by adding explicit
semantics to web content, allowing machines to understand and process information
more effectively. It includes standards like RDF and ontologies to represent
knowledge and relationships. Semantic web technology improves search, data
integration, knowledge representation, analytics, and decision-making. Its relevance
lies in enabling more precise search results, seamless data integration, interoperability
across diverse sources, automated reasoning, and domain-specific applications.
***********************************************************************************