0% found this document useful (0 votes)

425 views

Numerical Based On Indexing: Problem 1.2

The document describes several problems involving calculating storage requirements for files on disk drives using different indexing schemes. Problem 1 calculates storage needs for a file on a given disk configuration and analyzes read times for sequential vs random access. Problem 2 calculates the average record length and number of blocks needed for a variable-length student record file. Problem 3 calculates storage values like record size, blocking factor, and number of blocks for an indexed employee file.

Uploaded by

rani

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

425 views

Numerical Based On Indexing: Problem 1.2

Uploaded by

rani

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Numerical Based on Indexing

Problem 1.1
Consider a disk with a sector size of 512 bytes, 1,000 tracks per surface, 100 sectors per track, 5
double-sided platters and a block size of 2,048 bytes. Suppose that the average seek time is 5 msec,
the average rotational delay is 5 msec, and the transfer rate is 100 MB per second. Suppose that a file
containing 1,000,000 records of 100 bytes each is to be stored on such a disk and that no record is
allowed to span two blocks.
a) How many records fit onto a block?
2048/100 = 20. We can have at most 20 records in a block.

b) How many blocks are required to store the entire file? If the file is arranged sequentially
(according to the ‘next block concept’) on disk, how many cylinders are needed?
1,000,000/20=50,000 blocks are required to store the entire file.
A track has 25 blocks, a cylinder has 25*10=250 blocks. Therefore, we need 50,000/250=200
cylinders to store the file sequentially.

c) How many records of 100 bytes each can be stored using this disk?
The disk has 1000 cylinders with 250 blocks each, i.e. it has 250,000 blocks. A block contains
20 records. Thus, the disk can store 5,000,000 records.

d) If blocks of the file are stored on disk according to the next block concept, with the first block
on block 1 of track 1, what is the number of the block stored on block 1 of track 1 on the next
disk surface?
There are 25 blocks in each track. It is block 26 on block 1 of track 1 on the next disk surface.

e) What is the time required to read the file sequentially?

We need to read 200 cylinders. In order to read one cylinder, we have 1 seek, no rotational
delays and the transfer of 250 blocks = 500KB. Therefore, the time to read one cylinder is
5msec + 500K/100M sec = 5msec + 5 msec = 10 msec. The time to read the entire file is
200*10msec =2sec.

f) What is the time required to read the file in random order? Note that in order to read a record,
the block containing the record has to be fetched from disk.
In random access, the read of every block requires an average seek time of 5 msec, an average
rotational delay of 5 msec and a transfer time of 2K/100M sec = 0.02 msec, i.e. 10.02 msec.
Therefore, the time to read the entire file is 50,000*10.02msec ~ 500sec.

Problem 1.2

Suppose that a file has r = 100,000 STUDENT records with the following fields:
NAME (30 bytes), SSN (9 bytes), ADDRESS (40 bytes), PHONE (9 bytes), BIRTHDATE (8 bytes),
SEX (1 byte), MAJORDEPTCODE (4 bytes), MINORDEPTCODE (4 bytes), CLASSCODE (4 bytes,
integer), and DEGREEPROGRAM (3 bytes).

The fields are of fixed-length.

Suppose only 75% of the STUDENT records have a value for PHONE, 80% for MAJORDEPTCODE,
15% for MINORDEPTCODE, and 95% for DEGREEPROGRAM, and 100% of the STUDENT records
have a value for the other fields. We use a variable-length record format. Each record has a 2-byte field
type for each field occurring in the record, plus the 1-byte deletion marker and a 1-byte end-of-record
marker. Suppose we use a spanned record organization, where each block has a 5-byte pointer to the next
block (this space is not used for record storage). Each block contains 1,024 bytes.

a) Calculate the average record length R in bytes.

Assuming that every field has a 2-byte field type, and that the fields not mentioned above (NAME,
SSN, ADDRESS, BIRTHDATE, SEX, CLASSCODE) have values in every record, we need the
following number of bytes for these fields in each record, plus 1 byte for the deletion marker, and 1
byte for the end-of-record marker:
R fixed = (30+2) + (9+2) + (40+2) + (8+2) + (1+2) + (4+2) +1+1 = 106 bytes

For the other fields (PHONE, MAJORDEPTCODE, MINORDEPTCODE DEGREEPROGRAM),

the average number of bytes per record is:
R variable = ((9+2)*0.75)+((4+2)*0.80)+((4+2)*0.15)+((3+2)*0.95) = 8.25+4.8+0.9+4.75 = 18.7
bytes

The average record size R = R fixed + R variable = 106 + 18.7 = 124.7 bytes
The total number of bytes needed for the whole file is r * R = 100,000 * 124.7 = 12,470,000 bytes.

b) Calculate the number of blocks needed for the file.

Using a spanned record organization with a 5-byte pointer at the end of each block, the number of
bytes available in each block is B = 1024 - 5 = 1019 bytes.

The number of blocks b needed for the file is:

b = ceiling((r * R) / B) = ceiling(12470000 / 1019) = 12,238 blocks

Problem 1.3

Consider a disk with block size B = 512 bytes. A block pointer is P = 6 bytes long, and a record
pointer is PR = 7 bytes long. A file has r = 30,000 EMPLOYEE records of fixed length. Each record
has the following fields: Name (30 bytes),Ssn (9 bytes), Department_code (9 bytes), Address (40
bytes), Phone (10 bytes), Birth_date (8 bytes), Sex (1 byte), Job_code (4 bytes), and Salary (4 bytes,
real number). An additional byte is used as a deletion marker.

a. Calculate the record size R in bytes.

Record length R = (30 + 9 + 9 + 40 + 9 + 8 + 1 + 4 + 4) + 1 = 115 bytes

b. Calculate the blocking factor bfr and the number of file blocks b, assuming an unspanned
organization.
Blocking factor bfr = floor (B/R) = floor (512/115) = 4 records per block
Number of blocks needed for file = ceiling(r/bfr) = ceiling (30000/4) = 7500
c. Suppose that the file is ordered by the key field Ssn and we want to construct a primary
index on Ssn. Calculate
(i) The index blocking factor bfri (which is also the index fan-out fo)
Index record size R i = (V SSN + P) = (9 + 6) = 15 bytes
Index blocking factor bfr i = fo = floor (B/R i) = floor (512/15) = 34

(ii) The number of first-level index entries and the number of first-level index blocks
Number of first-level index entries r1 = number of file blocks b = 7500 entries
Number of first-level index blocks b1 = ceiling (r1 / bfr i) = ceiling (7500/34) = 221 blocks

(iii) The number of levels needed if we make it into a multilevel index

Number of second-level index entries r2 = number of first-level blocks b 1= 221 entries
Number of second-level index blocks b2= ceiling (r2 /bfr i) = ceiling (221/34) = 7 blocks
Number of third-level index entries r3 = number of second-level index blocks b2 = 7 entries
Number of third-level index blocks b3 = ceiling (r3 /bfr i) = ceiling (7/34) = 1
Since the third level has only one block, it is the top index level. Hence, the index has x = 3
levels

(iv) The total number of blocks required by the multilevel index

Total number of blocks for the index bi = b 1 + b 2 + b 3 = 221 + 7 + 1 = 229 blocks

(v) The number of block accesses needed to search for and retrieve a record from the file—
given its Ssn value—using the primary index
Number of block accesses to search for a record = x + 1 = 3 + 1 = 4

d. Suppose that the file is not ordered by the key field Ssn and we want to construct a
secondary index on Ssn. Repeat the previous exercise (part c) for the secondary index and
compare with the primary index.
(Same as C )

e. Suppose that the file is ordered by the nonkey field Department_code and we want to
construct a clustering index on Department_code that uses block anchors (every new value of
Department_code starts at the beginning of a new block). Assume there are 1,000 distinct
values of Department_code and that the EMPLOYEE records are evenly distributed among
these values. Calculate
(i) the index blocking factor bfri (which is also the index fan-out fo)
Index record size Ri = (V DEPARTMENTCODE + P) = (9 + 6) = 15 bytes
Index blocking factor bfr i = (fan-out) fo = floor (B/Ri) = floor (512/15)= 34 index records per
block

(ii) The number of first-level index entries and the number of first-level index blocks
No. of first-level index entries r1 = no. of distinct DEPARTMENTCODE values= 1000 entries
Number of first-level index blocks b1 = ceiling(r 1 /bfr i) = ceiling (1000/34) = 30 blocks

(iii) The number of levels needed if we make it into a multilevel index

We can calculate the number of levels as follows:
Number of second-level index entries r 2 = number of first-level index blocks b1= 30 entries
Number of second-level index blocks b 2 = ceiling(r 2 /bfr i) = ceiling (30/34) = 1
Since the second level has one block, it is the top index level.
Hence, the index has x = 2 levels

(iv) The total number of blocks required by the multilevel index

Total number of blocks for the index b i = b 1 + b 2 = 30 + 1 = 31 blocks

(v) The number of block accesses needed to search for and retrieve all records in the file that
have a specific Department_code value, using the clustering index (assume that multiple blocks
in a cluster are contiguous).
Number of block accesses to search for the first block in the cluster of blocks = x + 1 = 2 + 1 = 3
The 30 records are clustered in ceiling (30/bfr) = ceiling (30/4) = 8 blocks.
Hence, total block accesses needed on average to retrieve all the records with a given
DEPARTMENTCODE = x + 8 = 2 + 8 = 10 block accesses.

Apache Cassandra Administrator Associate - Exam Practice Tests
From Everand
Apache Cassandra Administrator Associate - Exam Practice Tests
Cristian Scutaru
No ratings yet
Conolly James Lake Mark - Geographic Information Systems in Archaeology - 2006 PDF
100% (2)
Conolly James Lake Mark - Geographic Information Systems in Archaeology - 2006 PDF
356 pages
Relational Algebra, TRC, DRC Solutions
No ratings yet
Relational Algebra, TRC, DRC Solutions
9 pages
DS Assignment 3rd Sem IPU
No ratings yet
DS Assignment 3rd Sem IPU
6 pages
Module II
No ratings yet
Module II
22 pages
Lab Manual Bca 3 Sem Data Structures-I
No ratings yet
Lab Manual Bca 3 Sem Data Structures-I
16 pages
Anna University OOPS Question Bank Unit 2
No ratings yet
Anna University OOPS Question Bank Unit 2
6 pages
Jug Problem Python Code DFS Implementation
No ratings yet
Jug Problem Python Code DFS Implementation
7 pages
Object Oriented Programming in C++
No ratings yet
Object Oriented Programming in C++
4 pages
Compiler-Design Notes
No ratings yet
Compiler-Design Notes
5 pages
Unit-V: Elementary UDP Sockets
No ratings yet
Unit-V: Elementary UDP Sockets
9 pages
Object Oriented Software Engineering Notes BCA Degree 1st Year
No ratings yet
Object Oriented Software Engineering Notes BCA Degree 1st Year
31 pages
Balaguruswamy
50% (2)
Balaguruswamy
34 pages
VB Net Unit I
100% (1)
VB Net Unit I
25 pages
BCA 3rd Data Structure 1 20
No ratings yet
BCA 3rd Data Structure 1 20
20 pages
Question Bank - Operating System
No ratings yet
Question Bank - Operating System
4 pages
Object Oriented Programming Assignment 1
No ratings yet
Object Oriented Programming Assignment 1
3 pages
Python Notes 3rd Mca
No ratings yet
Python Notes 3rd Mca
99 pages
File Handling in R Programming: Eg: File - Create ("GFG - TXT")
No ratings yet
File Handling in R Programming: Eg: File - Create ("GFG - TXT")
2 pages
PYTHON Programming
No ratings yet
PYTHON Programming
17 pages
OS Practical File
75% (4)
OS Practical File
15 pages
Lab 2
No ratings yet
Lab 2
6 pages
UNIT-2: Classes and Object, Dynamic Constructor & Destructor BCA-2 Sem
No ratings yet
UNIT-2: Classes and Object, Dynamic Constructor & Destructor BCA-2 Sem
40 pages
OOP - I GTU Study Material Presentations Unit-1 07022022102854PM
No ratings yet
OOP - I GTU Study Material Presentations Unit-1 07022022102854PM
59 pages
Methods For Handling Deadlocks
No ratings yet
Methods For Handling Deadlocks
8 pages
Chapter 1 Preliminaries
No ratings yet
Chapter 1 Preliminaries
7 pages
Lab-Iv Unix and Shell Programming Laboratory (CSE-224: Prerequisites
No ratings yet
Lab-Iv Unix and Shell Programming Laboratory (CSE-224: Prerequisites
2 pages
Java Script Questions
No ratings yet
Java Script Questions
4 pages
UNIT4
No ratings yet
UNIT4
7 pages
Active Server Pages PDF
No ratings yet
Active Server Pages PDF
34 pages
Madhuri Gupta 7th Sem AI Lab Manual1
No ratings yet
Madhuri Gupta 7th Sem AI Lab Manual1
17 pages
DSA Lab 10
No ratings yet
DSA Lab 10
15 pages
13 (A) Explain The Banker's Algorithm For Deadlock Avoidance With An Illustration. - Bituh
100% (1)
13 (A) Explain The Banker's Algorithm For Deadlock Avoidance With An Illustration. - Bituh
6 pages
Unit III - Preprocessor & Files
No ratings yet
Unit III - Preprocessor & Files
28 pages
Practical No. 7 - String Functions and Array: 1) Write A PHP Program To Demonstrate Different String Functions. Program
No ratings yet
Practical No. 7 - String Functions and Array: 1) Write A PHP Program To Demonstrate Different String Functions. Program
2 pages
Assignment 1 - Basics of Python
No ratings yet
Assignment 1 - Basics of Python
2 pages
Case Study (Analysis of Algorithm
No ratings yet
Case Study (Analysis of Algorithm
14 pages
Event Management System Synopsis
No ratings yet
Event Management System Synopsis
22 pages
Random Access Files in C
100% (1)
Random Access Files in C
4 pages
Practical 5: Introduction To Weka For Classfication
100% (1)
Practical 5: Introduction To Weka For Classfication
4 pages
Data Structure Using C Lab (KCS351) : Programming Language/Tool Used: C and Mapple
50% (2)
Data Structure Using C Lab (KCS351) : Programming Language/Tool Used: C and Mapple
1 page
Experiment No.:-1: Aim:-Introduction To HTML
100% (1)
Experiment No.:-1: Aim:-Introduction To HTML
5 pages
PHP Variables
No ratings yet
PHP Variables
4 pages
List of Programs Subject Code: PCS-307 Subject: OOP Using C++ Programming Lab
No ratings yet
List of Programs Subject Code: PCS-307 Subject: OOP Using C++ Programming Lab
4 pages
Ai Lab
No ratings yet
Ai Lab
48 pages
Dbms Lab Exam
0% (2)
Dbms Lab Exam
13 pages
Unit - V: Advanced Topics
No ratings yet
Unit - V: Advanced Topics
92 pages
Matlab File - Deepak - Yadav - Bca - 4TH - Sem - A50504819015
No ratings yet
Matlab File - Deepak - Yadav - Bca - 4TH - Sem - A50504819015
59 pages
Python Quetion and Answers
No ratings yet
Python Quetion and Answers
5 pages
Gujarat University Practical Examination December 2017 B.C.A. Semester - I Subject: CC-107 PC Software Set No: 6 Univ. Seat No
No ratings yet
Gujarat University Practical Examination December 2017 B.C.A. Semester - I Subject: CC-107 PC Software Set No: 6 Univ. Seat No
20 pages
Sumita Arora Classes and Objects Long Answer Questions
75% (4)
Sumita Arora Classes and Objects Long Answer Questions
10 pages
PYTHON Notes Unit1&Unit2
No ratings yet
PYTHON Notes Unit1&Unit2
38 pages
DBMS Practical File
No ratings yet
DBMS Practical File
39 pages
CPDS Lab Manual
No ratings yet
CPDS Lab Manual
107 pages
101 Onwards On Python Pandas and Pyplot
No ratings yet
101 Onwards On Python Pandas and Pyplot
33 pages
Lab Manual
No ratings yet
Lab Manual
20 pages
Operating System Kcs-401. Question Bank À Unit-Iii: Cpu Scheduling and Deadlocks
No ratings yet
Operating System Kcs-401. Question Bank À Unit-Iii: Cpu Scheduling and Deadlocks
4 pages
Unit 4 - Software Engineering - WWW - Rgpvnotes.in
No ratings yet
Unit 4 - Software Engineering - WWW - Rgpvnotes.in
12 pages
Revision of Class IX Syllabus
No ratings yet
Revision of Class IX Syllabus
17 pages
DBMS Unit 4 Notes PDF
No ratings yet
DBMS Unit 4 Notes PDF
61 pages
Placement Portal Management System
No ratings yet
Placement Portal Management System
29 pages
C & Data Structures
From Everand
C & Data Structures
Prof. P. Padmanabham
No ratings yet
CH 14
No ratings yet
CH 14
36 pages
Chapter 5: Advanced SQL: Database System Concepts, 6 Ed
No ratings yet
Chapter 5: Advanced SQL: Database System Concepts, 6 Ed
77 pages
Chapter 6: Formal Relational Query Languages: Database System Concepts, 6 Ed
No ratings yet
Chapter 6: Formal Relational Query Languages: Database System Concepts, 6 Ed
27 pages
Reversenum
No ratings yet
Reversenum
1 page
CH 2
No ratings yet
CH 2
22 pages
Midterm Exam Key: CMPT 354
No ratings yet
Midterm Exam Key: CMPT 354
7 pages
Midterm Exam Key: CMPT 354
No ratings yet
Midterm Exam Key: CMPT 354
7 pages
Multi
No ratings yet
Multi
1 page
Bubble
No ratings yet
Bubble
1 page
Fibo
No ratings yet
Fibo
1 page
Graph Unit 6 & 7
No ratings yet
Graph Unit 6 & 7
19 pages
Apa Thesis Table of Contents Format
100% (2)
Apa Thesis Table of Contents Format
8 pages
HB9BLA Wireless - RUTX14 Zerotier
No ratings yet
HB9BLA Wireless - RUTX14 Zerotier
4 pages
Interconnexion Réseaux - Networking Lab One - Introduction
No ratings yet
Interconnexion Réseaux - Networking Lab One - Introduction
19 pages
Systems Manual Simatic ET200S PDF
No ratings yet
Systems Manual Simatic ET200S PDF
666 pages
Using The Wago 750-352 Ethernet Coupler As Remote Io With A Compactlogix™ PLC
No ratings yet
Using The Wago 750-352 Ethernet Coupler As Remote Io With A Compactlogix™ PLC
22 pages
iSecureNet Products-DS25012020
100% (1)
iSecureNet Products-DS25012020
4 pages
Case Study 5
No ratings yet
Case Study 5
3 pages
Performance Tuning For Content Manager Sg246949
No ratings yet
Performance Tuning For Content Manager Sg246949
490 pages
QT Answers
No ratings yet
QT Answers
23 pages
E14153 First Edition / May 2018
No ratings yet
E14153 First Edition / May 2018
96 pages
TuyaGo Product Brochure
No ratings yet
TuyaGo Product Brochure
21 pages
Jazan University: Kingdom of Saudi Arabia Ministry of Higher Education
No ratings yet
Jazan University: Kingdom of Saudi Arabia Ministry of Higher Education
8 pages
Compiler Design BTCS3602 Question Bank 1
No ratings yet
Compiler Design BTCS3602 Question Bank 1
4 pages
Endterm Paper
No ratings yet
Endterm Paper
18 pages
IT24 - Advanced DBMS Model Answer Paper
No ratings yet
IT24 - Advanced DBMS Model Answer Paper
10 pages
Algebra Mock Exam: x+4 X 4 x+4 X 4
No ratings yet
Algebra Mock Exam: x+4 X 4 x+4 X 4
5 pages
G1 Neo User Manual20240725
No ratings yet
G1 Neo User Manual20240725
60 pages
Full Report - 20 Cap Confirmed
No ratings yet
Full Report - 20 Cap Confirmed
13 pages
How To Setup TP-LINK Switch With TP-LINK AP multiSSID
No ratings yet
How To Setup TP-LINK Switch With TP-LINK AP multiSSID
5 pages
Drawing The Graphs of Functions
No ratings yet
Drawing The Graphs of Functions
6 pages
User Manual - Payroll Service PDF
No ratings yet
User Manual - Payroll Service PDF
4 pages
Se Notes
No ratings yet
Se Notes
80 pages
UNIT-5 Cloud Computing
No ratings yet
UNIT-5 Cloud Computing
40 pages
Marketing Automation PDF
100% (1)
Marketing Automation PDF
24 pages
Dalmas Ogembo
No ratings yet
Dalmas Ogembo
1 page
74HC240 74HCT240: 1. General Description
No ratings yet
74HC240 74HCT240: 1. General Description
14 pages
Chapter 5. Java Database Connectivity
No ratings yet
Chapter 5. Java Database Connectivity
34 pages
Python Code: Mysql - Connector Time Datetime
No ratings yet
Python Code: Mysql - Connector Time Datetime
5 pages
EVM-1702 Evaluation Module: Features Description
No ratings yet
EVM-1702 Evaluation Module: Features Description
13 pages

Numerical Based On Indexing: Problem 1.2

Uploaded by

Numerical Based On Indexing: Problem 1.2

Uploaded by

Numerical Based on Indexing

e) What is the time required to read the file sequentially?

The fields are of fixed-length.

a) Calculate the average record length R in bytes.

For the other fields (PHONE, MAJORDEPTCODE, MINORDEPTCODE DEGREEPROGRAM),

b) Calculate the number of blocks needed for the file.

The number of blocks b needed for the file is:

a. Calculate the record size R in bytes.

(iii) The number of levels needed if we make it into a multilevel index

(iv) The total number of blocks required by the multilevel index

(iii) The number of levels needed if we make it into a multilevel index

(iv) The total number of blocks required by the multilevel index

You might also like