0% found this document useful (0 votes)

2K views

Dsa Mock Insem Question Bank

The document explains file concepts in C++, including file opening modes and inverted files, which map content to locations for efficient searching. It also covers external sorting methods for large data sets, factors affecting file organization, and various indexing techniques. Additionally, it details B+ trees, their structure, and differences from B trees, along with step-by-step construction examples for B+ trees of different orders.

Uploaded by

narutouzumaki782183

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2K views

Dsa Mock Insem Question Bank

Uploaded by

narutouzumaki782183

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 18

Q1 What is file? List different file opening modes in C++.

Explain concept
of inverted files.

What is a File?
In computing, a file is a collection of related data stored on a non-volatile
storage device like a hard drive or SSD. In C++, files are used for permanent
storage of data — unlike variables which lose their values when a program
ends.
In C++, files are handled using file streams, provided by the <fstream>
header.

File Opening Modes in C++

In C++, you open files using objects like ifstream, ofstream, or fstream,
along with file modes to specify how the file should be accessed.

Common File Modes:

Mode Description

ios::in Open file for reading

ios::out Open file for writing (overwrites existing content)

ios::app Append to the end of the file

ios::ate Move to the end of file after opening

ios::trunc Truncate file (delete contents if it exists)

ios::binary Open file in binary mode

🛠 Example:
cpp
CopyEdit
fstream file;
file.open("data.txt", ios::in | ios::out);

Inverted Files

What is an Inverted File?

An inverted file (or inverted index) is a data structure used to map
content (like words) to their locations in a file or database.
It is widely used in search engines, databases, and information retrieval
systems.

Key Concept:
Instead of storing documents with their contents, it stores a dictionary of
words and for each word, a list of documents (or positions) where it
appears.

Example:
Documents:
vbnet
CopyEdit
Doc1: "apple banana mango"
Doc2: "banana apple"
Doc3: "mango fruit"
Inverted File:

Word Document(s)

apple Doc1, Doc2

banana Doc1, Doc2

mango Doc1, Doc3

fruit Doc3
Q2 Define external sort with eaxmple.

External sorting is a class of algorithms used to sort data that is too large
to fit into main memory (RAM) and must reside in external storage (like a
hard drive).
It is typically used when dealing with huge files, like in database systems or
big data processing.

Why Not Use Normal Sorting?

Traditional algorithms like QuickSort or MergeSort work well in-memory. But
when the dataset is larger than RAM, we need to sort it in chunks, using disk
I/O efficiently, which is where external sorting comes in.

Example: External Merge Sort

Let’s say we need to sort 1 GB of data, but we only have 100 MB of RAM.
Step-by-Step:
1. Divide into Chunks:
Break the 1 GB file into 10 chunks of 100 MB each.
2. Sort Each Chunk:
Load each chunk into RAM one at a time, sort it using MergeSort or
QuickSort, and save the sorted chunks back to disk as temporary
files.
3. Merge Sorted Chunks:
Use a k-way merge algorithm to merge the 10 sorted chunks into a
single sorted file.
o You read a small buffer from each chunk into memory
o Repeatedly pick the smallest element from the buffers and write it
to the output file
Example (Simple Version)
Original File (on disk):
F, A, Z, B, D, M, E, C
Step 1: Split into 2 chunks (say RAM can hold 4 items)
• Chunk 1: F, A, Z, B → sort → A, B, F, Z
• Chunk 2: D, M, E, C → sort → C, D, E, M
Write both to disk as temp files.
Step 2: Merge Chunks
Merge A, B, F, Z and C, D, E, M →
Final sorted file: A, B, C, D, E, F, M, Z

Applications of External Sort:

• Database management systems
• Sorting logs, big text files
• Data warehousing
• External memory algorithms in Big Data
Q3 Write short notes on: i) Factors affecting the file organization ii) Indexed
sequential files iii) Indexing technique
i) Factors Affecting the File Organization
File organization refers to the way data is stored and accessed in a file. The
choice depends on several factors:
1. Access Method:
o Sequential or random access?
o Fast read/write needed?
2. File Size:
o Larger files may need indexing or hashing for performance.
3. Update Frequency:
o How often is data added, deleted, or modified?
4. Redundancy and Duplication:
o Some organizations reduce duplication better than others.
5. Search Efficiency:
o Is fast searching a requirement?
6. Data Volatility:
o How often the structure of data changes?
7. Storage Medium:
o Hard drive, SSD, or cloud? Each affects access speed.

ii) Indexed Sequential Files

This is a hybrid file organization method that combines:
• Sequential access (like in a sorted file)
• Indexed access (like in a database)
Structure:
• A main data file sorted by key
• An index file with key-pointer pairs
Example:
If you're storing student records sorted by roll number, the index might look
like:
yaml
CopyEdit
Index:
1001 → block 1
1050 → block 2
1100 → block 3

Main File:
[1001, 1002, ..., 1049], [1050, ..., 1099], ...
Advantages:
• Faster than pure sequential search
• Easier to update than fully indexed structures

iii) Indexing Technique

Indexing is a technique used to improve searching speed in files or
databases by maintaining a separate index structure.
Types of Indexing:
1. Primary Index
o Built on a unique field (like student ID)
2. Secondary Index
o Built on non-unique fields (like city)
3. Dense Index
o One index entry per record
4. Sparse Index
o One index entry per block/page
5. Multilevel Index
o Index on top of another index (like a hierarchy)
Example:
mathematica
CopyEdit
Index:
"apple" → Line 10
"banana" → Line 50
"grape" → Line 90
You can quickly jump to data without scanning the whole file.
Q4 Define sequential file organization. Give it’s advantages and
disadvantages.

Sequential File Organization

In Sequential File Organization, records are stored one after another in a
specific order, usually based on a key field (like Roll Number, ID, etc.).
• New records are always added at the end of the file.
• To access a specific record, the system must read from the beginning
and search in sequence.

Example:
Let's say you are storing student records sorted by Roll Number:
CopyEdit
101 John
102 Alice
103 Bob
104 Zara
If you want to find Roll No. 103, the system reads:
• 101 → 102 → 103 (stop here)
Advantages of Sequential File Organization

Advantage Description

Simple Design Easy to understand and implement

Efficient for Full Read Fast when accessing all records in order

Minimal Storage
No indexing or extra data structures needed
Overhead

Good for Batch Ideal for tasks like payroll, billing, report
Processing generation

Disadvantages of Sequential File Organization

Disadvantage Description

Slow Random
Must search from start → slow for large files
Access

Difficult to Update Inserting in order requires rewriting the file

No Flexibility Cannot handle dynamic data well

Deletion is Requires shifting or marking deleted records

Inefficient manually

Use Cases:
• Payroll systems
• Monthly billing systems
• Bank transaction logs
• Historical data processing
Q5 What is B+ tree? Give structure of it’s internal note. What is the difference
between B and B+ tree.

What is a B+ Tree?
A B+ Tree is a balanced search tree used for storing large amounts of
sorted data and allows efficient searching, insertion, and deletion.
It is widely used in:
• Databases
• File systems
• Indexing structures

Key Properties of B+ Tree:

• All values (data records) are stored only at the leaf level.
• Internal nodes store keys only, used for navigation.
• Leaves are linked together for fast range queries.
• Tree remains balanced (all leaves are at the same level).

Structure of an Internal Node (for order m):

An internal node of a B+ Tree contains:
• Up to m – 1 keys
• Up to m pointers to child nodes
Example (Order 4 B+ Tree internal node):
yaml
CopyEdit
| K1 | K2 | K3 |
/ | | \
P1 P2 P3 P4
Where:
• K1 < K2 < K3
• P1 → subtree with keys < K1
• P2 → subtree with keys between K1 and K2
• etc.

Difference Between B Tree and B+ Tree

Feature B Tree B+ Tree

Stored in internal + leaf

Data Storage Stored only in leaf nodes
nodes

Search
Slower range search Faster due to linked leaves
Efficiency

Leaves are linked for

Leaf Link Leaves are not linked
traversal

Must go to leaf for actual

Traversal Can stop at internal node
data

Mostly for in-memory Databases and file

Used In
structures systems

Example:
Suppose we insert keys: 10, 20, 30, 40, 50
B Tree (Order 3):
css
CopyEdit
[30]
/ \
[10,20] [40,50]
B+ Tree (Order 3):
css
CopyEdit
[30]
/ \
[10,20] [30,40,50]
↔
• All data is in leaf nodes
• Leaves are linked
Q6 Build B+ tree of order 3 for the following data: F, S, Q, K, C, L, H, T, V, W, M,
R

Given Data:
F, S, Q, K, C, L, H, T, V, W, M, R

B+ Tree Order 3: What it means?

• Maximum 2 keys per internal node (order − 1)
• Maximum 3 children per internal node
• Leaf nodes can hold 2 keys
• All data is stored in the leaf nodes
• Leaf nodes are linked
We'll insert in alphabetical order for clarity:
Sorted Input:
C, F, H, K, L, M, Q, R, S, T, V, W
Step-by-Step Insertion

Step 1: Insert C, F
makefile
CopyEdit
Leaf: [C, F]

Step 2: Add H → [C, F, H] → Exceeds leaf capacity → Split

• [C] and [F, H]
• Promote middle key F to parent
css
CopyEdit
[F]
/ \
[C] [F,H]

Step 3: Add K → into right leaf → [F, H, K] → Split

• Split [F, H, K] → [F] and [H, K]
• Promote middle H to root (already has F)
Now root has 2 keys: [F, H]
less
CopyEdit
[F, H]
/ | \
[C] [F] [H, K]
Step 4: Insert L → into [H, K] → becomes [H, K, L] → Split
• [H] and [K, L] → promote K
Now root has: [F, H, K] → Exceeds capacity → Split root
Promote H, new root becomes [H]
css
CopyEdit
[H]
/ \
[F] [K]
/ \ / \
[C] [F] [H] [K, L]

Step 5: Insert M → goes to [K, L] → becomes [K, L, M] → Split

• [K], [L, M] → promote L to [K]
Update internal node [K] to [K, L]
less
CopyEdit
[H]
/ \
[F] [K, L]
/ \ / | \
[C] [F] [H] [K] [L, M]

Continue Inserting: Q, R, S
• Q goes to [L, M] → [L, M, Q] → Split → [L], [M, Q] → promote M
• Insert R → goes to [M, Q] → becomes [M, Q, R] → Split → [M], [Q, R] →
promote Q
• Insert S → goes to [Q, R] → becomes [Q, R, S] → Split → [Q], [R, S] →
promote R
Update internal nodes accordingly.

Final Inserts: T, V, W
• Continue inserting with same logic into rightmost leaf
• Manage splits and promotion as needed

Final B+ Tree (Structure Only)

Final Leaf Nodes (in order, linked):

css
CopyEdit
[C] ↔ [F] ↔ [H] ↔ [K] ↔ [L] ↔ [M] ↔ [Q] ↔ [R, S] ↔ [T, V] ↔ [W]
Q6 Construct the B+ Tree of order 4 for the following data: 1, 4, 7, 10, 17, 21,
31, 25, 19, 20, 28, 42.
Let’s construct a B+ Tree of order 4 using the following keys:

Input:
1, 4, 7, 10, 17, 21, 31, 25, 19, 20, 28, 42

B+ Tree of Order 4:
• Max 3 keys in an internal node
• Max 4 children per internal node
• Max 3 keys per leaf node
• All actual data is stored in leaves
• Leaf nodes are linked

Step-by-Step Construction
We'll insert the keys in the given order and split as needed.

Step 1: Insert 1, 4, 7
Leaf node: [1, 4, 7] — no split needed

Step 2: Insert 10
Leaf: [1, 4, 7, 10] → 4 keys → Split
• Split into: [1, 4] and [7, 10]
• Promote 7 to parent
css
CopyEdit
[7]
/ \
[1, 4] [7, 10]

Step 3: Insert 17
Goes to [7, 10] → becomes [7, 10, 17] → OK

Step 4: Insert 21
Leaf: [7, 10, 17, 21] → split into [7, 10] and [17, 21]
Promote 17
Update root to: [7, 17]
less
CopyEdit
[7, 17]
/ | \
[1, 4] [7,10] [17, 21]

Step 5: Insert 31
Goes to [17, 21] → becomes [17, 21, 31] → OK

Step 6: Insert 25
Leaf [17, 21, 31] → becomes [17, 21, 25, 31] → split
→ [17, 21], [25, 31] → promote 25 to parent
Now parent [7, 17] becomes [7, 17, 25]
less
CopyEdit
[7, 17, 25]
/ | | \
[1,4] [7,10] [17,21] [25,31]

Step 7: Insert 19
Goes to [17, 21] → becomes [17, 19, 21] — OK

Step 8: Insert 20
Leaf [17, 19, 21] → becomes [17, 19, 20, 21] → split
→ [17, 19], [20, 21] → promote 20
Now parent [7, 17, 25] is full → needs split
Split [7, 17, 25] → [7], [20] → promote 17 to new root
css
CopyEdit
[17]
/ \
[7] [20]
/ \ / \
[1,4][7,10] [17,19][20,21][25,31]

Step 9: Insert 28
Goes to [25, 31] → becomes [25, 28, 31] → OK

Step 10: Insert 42

Leaf: [25, 28, 31] → becomes [25, 28, 31, 42] → split
→ [25, 28], [31, 42] → promote 31
Now [20] becomes [20, 31]

Final B+ Tree Structure:

less
CopyEdit
[17]
/ \
[7] [20, 31]
/ \ / | \
[1,4][7,10] [17,19][20,21][25,28][31,42]
• All data is stored in the leaf nodes
• Leaf nodes are linked like:
css
CopyEdit
[1,4] ↔ [7,10] ↔ [17,19] ↔ [20,21] ↔ [25,28] ↔ [31,42]

Top 25 Penetration Testing Tools (2023) PDF
50% (2)
Top 25 Penetration Testing Tools (2023) PDF
4 pages
Iqan-Md3 Uk Instructionbook
100% (1)
Iqan-Md3 Uk Instructionbook
41 pages
A Vontade de Sentido - Viktor E. Frankl
No ratings yet
A Vontade de Sentido - Viktor E. Frankl
216 pages
BCA501 NDBMSUnit 3,4
No ratings yet
BCA501 NDBMSUnit 3,4
65 pages
Assignment (DS)
No ratings yet
Assignment (DS)
8 pages
Dbms. 5 Unit Part-B
No ratings yet
Dbms. 5 Unit Part-B
8 pages
2 - Indexing Structures - Ch14
No ratings yet
2 - Indexing Structures - Ch14
50 pages
FS Mod3
No ratings yet
FS Mod3
46 pages
DBMS-Unit5
No ratings yet
DBMS-Unit5
25 pages
B - Trees
No ratings yet
B - Trees
19 pages
IT3020 L06 Indexing
No ratings yet
IT3020 L06 Indexing
41 pages
Lesson 1 Introduction To File Management
No ratings yet
Lesson 1 Introduction To File Management
82 pages
FAQ's Unit-5
No ratings yet
FAQ's Unit-5
6 pages
Btrees Animated
No ratings yet
Btrees Animated
77 pages
IT3031-L06-Indexing
No ratings yet
IT3031-L06-Indexing
45 pages
Unit 5 Dbms
No ratings yet
Unit 5 Dbms
12 pages
Tree-Structured Indexes: R & G Chapter 9
No ratings yet
Tree-Structured Indexes: R & G Chapter 9
34 pages
B+ Tree & B Tree
No ratings yet
B+ Tree & B Tree
38 pages
abhisheks008-hashn...
No ratings yet
abhisheks008-hashn...
57 pages
n3 BTrees
No ratings yet
n3 BTrees
14 pages
FS Lecture
No ratings yet
FS Lecture
17 pages
DSA & DAA Fast Study
No ratings yet
DSA & DAA Fast Study
6 pages
Java Merged
No ratings yet
Java Merged
291 pages
Dsa.cpp
No ratings yet
Dsa.cpp
11 pages
Data Structutes Using C'
No ratings yet
Data Structutes Using C'
7 pages
Unit V
No ratings yet
Unit V
55 pages
DBMS Unit 3
No ratings yet
DBMS Unit 3
81 pages
Indexing
No ratings yet
Indexing
77 pages
Find All Students With Gpa 3.0'': Can Do Binary Search On (Smaller) Index File!
No ratings yet
Find All Students With Gpa 3.0'': Can Do Binary Search On (Smaller) Index File!
42 pages
File Organization
No ratings yet
File Organization
11 pages
unit-5-indexing-2024
No ratings yet
unit-5-indexing-2024
50 pages
B Tree Application
100% (2)
B Tree Application
6 pages
DBMS - File Organization, Indexing and Hashing Notes
No ratings yet
DBMS - File Organization, Indexing and Hashing Notes
19 pages
5 unit. pdf
No ratings yet
5 unit. pdf
2 pages
Lecture 5 Trees
No ratings yet
Lecture 5 Trees
47 pages
Software Design Using C++: An Online Book
No ratings yet
Software Design Using C++: An Online Book
11 pages
Internal File Structure: Methods and Design Paradigm
No ratings yet
Internal File Structure: Methods and Design Paradigm
6 pages
Lesson 8 Cs450 - Indexing
No ratings yet
Lesson 8 Cs450 - Indexing
31 pages
Data Structure Final PART II
No ratings yet
Data Structure Final PART II
50 pages
Storage and Indexing
No ratings yet
Storage and Indexing
41 pages
UNIT-6 Important Questions & Answers
No ratings yet
UNIT-6 Important Questions & Answers
20 pages
Multilevel Indexing and B+ Trees
No ratings yet
Multilevel Indexing and B+ Trees
33 pages
OSY Notes Vol 2 (6th Chapter) - Ur Engineering Friend
No ratings yet
OSY Notes Vol 2 (6th Chapter) - Ur Engineering Friend
23 pages
Farre BCA4
No ratings yet
Farre BCA4
1 page
n04-B Trees
No ratings yet
n04-B Trees
19 pages
CS2606: Data Structures and Object-Oriented Development Chapter 10: Indexing
No ratings yet
CS2606: Data Structures and Object-Oriented Development Chapter 10: Indexing
25 pages
CPS216: Data-Intensive Computing Systems
No ratings yet
CPS216: Data-Intensive Computing Systems
70 pages
DS_TM_Study_Material_Presentations_Unit-4_1TM
No ratings yet
DS_TM_Study_Material_Presentations_Unit-4_1TM
22 pages
Tree-Structured Indexes: Comp 521 - Files and Databases Fall 2010 1
No ratings yet
Tree-Structured Indexes: Comp 521 - Files and Databases Fall 2010 1
27 pages
Divide and Conquer
No ratings yet
Divide and Conquer
10 pages
DBMS PPT
No ratings yet
DBMS PPT
17 pages
Internal Sorting:Definition: When The Entire Dataset Can Fit
No ratings yet
Internal Sorting:Definition: When The Entire Dataset Can Fit
12 pages
Advanced Pascal
No ratings yet
Advanced Pascal
38 pages
DS Importent
No ratings yet
DS Importent
17 pages
Dbms 5
No ratings yet
Dbms 5
26 pages
Unit-V Storage Management
No ratings yet
Unit-V Storage Management
98 pages
Mcoo68 Smu Mca Sem2 2011
No ratings yet
Mcoo68 Smu Mca Sem2 2011
23 pages
Lectures 1-10 (7 Files Merged)
No ratings yet
Lectures 1-10 (7 Files Merged)
386 pages
Absolute Beginner S Guide To Algorithms
No ratings yet
Absolute Beginner S Guide To Algorithms
563 pages
Indexing
No ratings yet
Indexing
141 pages
File Structure
No ratings yet
File Structure
18 pages
C++ File Handling Step by Step: A Practical Guide with Examples
From Everand
C++ File Handling Step by Step: A Practical Guide with Examples
William E. Clark
No ratings yet
Managing Multimedia and Unstructured Data in the Oracle Database
From Everand
Managing Multimedia and Unstructured Data in the Oracle Database
Marcelle Kratochvil
No ratings yet
FMD-3200/FMD-3200-BB/FMD-3300: Operator's Guide
100% (1)
FMD-3200/FMD-3200-BB/FMD-3300: Operator's Guide
12 pages
Advantages and Dis Adv of Relation Model
No ratings yet
Advantages and Dis Adv of Relation Model
2 pages
Tweequ
No ratings yet
Tweequ
20 pages
CS-T240 User Manual
No ratings yet
CS-T240 User Manual
218 pages
TESPBEATIBRIDK
No ratings yet
TESPBEATIBRIDK
3 pages
Setting Up of A Reliable Internet Connection To Inuman Elementary School
No ratings yet
Setting Up of A Reliable Internet Connection To Inuman Elementary School
4 pages
30 Different Types of Files and How to Use Them
No ratings yet
30 Different Types of Files and How to Use Them
2 pages
Lenovo IdeaPad Z470 Quanta KL6A DIS 45W Schematic
No ratings yet
Lenovo IdeaPad Z470 Quanta KL6A DIS 45W Schematic
47 pages
download instruction manual
No ratings yet
download instruction manual
2 pages
CosmozIntro and Concepts PF (Auto-Saved)
No ratings yet
CosmozIntro and Concepts PF (Auto-Saved)
14 pages
Output Log
No ratings yet
Output Log
70 pages
Current Calculation of Spare Parts
No ratings yet
Current Calculation of Spare Parts
3 pages
Instructions: Motoman-Mh50 Ii
No ratings yet
Instructions: Motoman-Mh50 Ii
82 pages
IP Project 12th 2024-25
No ratings yet
IP Project 12th 2024-25
22 pages
UTNet A Hybrid Transformer Architecture For Medical Image Segmentation PDF
No ratings yet
UTNet A Hybrid Transformer Architecture For Medical Image Segmentation PDF
11 pages
Informacion DVD 350 PDF
No ratings yet
Informacion DVD 350 PDF
1 page
Soft
No ratings yet
Soft
57 pages
Iskratel Lumia T14 Datasheet EN
No ratings yet
Iskratel Lumia T14 Datasheet EN
2 pages
15 Hackett Recommended Metrics To Benchmark Your O2C Processes
No ratings yet
15 Hackett Recommended Metrics To Benchmark Your O2C Processes
14 pages
StochasticApproximation Borkar
100% (1)
StochasticApproximation Borkar
172 pages
CPL lab interview questions
No ratings yet
CPL lab interview questions
53 pages
Seek Avenger: The Worldwide Standard in Biometric Identity Solutions
No ratings yet
Seek Avenger: The Worldwide Standard in Biometric Identity Solutions
2 pages
Install Firefox in Redhat 8
No ratings yet
Install Firefox in Redhat 8
3 pages
IS - Report Temp
No ratings yet
IS - Report Temp
7 pages
8-Char Array
No ratings yet
8-Char Array
6 pages
Intro To Data Analytics Activity Templates - Marquina Alberto
No ratings yet
Intro To Data Analytics Activity Templates - Marquina Alberto
12 pages
Presentation 1
No ratings yet
Presentation 1
8 pages

Dsa Mock Insem Question Bank

Uploaded by

Dsa Mock Insem Question Bank

Uploaded by

Q1 What is file? List different file opening modes in C++.

File Opening Modes in C++

Common File Modes:

ios::in Open file for reading

ios::out Open file for writing (overwrites existing content)

ios::app Append to the end of the file

ios::ate Move to the end of file after opening

ios::trunc Truncate file (delete contents if it exists)

ios::binary Open file in binary mode

What is an Inverted File?

apple Doc1, Doc2

banana Doc1, Doc2

mango Doc1, Doc3

Why Not Use Normal Sorting?

Example: External Merge Sort

Applications of External Sort:

ii) Indexed Sequential Files

iii) Indexing Technique

Sequential File Organization

Simple Design Easy to understand and implement

Disadvantages of Sequential File Organization

Difficult to Update Inserting in order requires rewriting the file

No Flexibility Cannot handle dynamic data well

Deletion is Requires shifting or marking deleted records

Key Properties of B+ Tree:

Structure of an Internal Node (for order m):

Difference Between B Tree and B+ Tree

Feature B Tree B+ Tree

Stored in internal + leaf

Leaves are linked for

Must go to leaf for actual

Mostly for in-memory Databases and file

B+ Tree Order 3: What it means?

Step 2: Add H → [C, F, H] → Exceeds leaf capacity → Split

Step 3: Add K → into right leaf → [F, H, K] → Split

Step 5: Insert M → goes to [K, L] → becomes [K, L, M] → Split

Final B+ Tree (Structure Only)

Final Leaf Nodes (in order, linked):

Step 10: Insert 42

Final B+ Tree Structure:

You might also like