How Tables and Indexes Are Stored On Disk

1. Tables and indexes are stored on disk in pages with multiple rows per page to reduce I/O operations. 2. An index stores pointers to rows in the heap to allow quickly looking up specific rows without scanning the entire heap. 3. When querying on an indexed column, the database first looks up the row location in the index, then performs I/O to fetch the row data from the referenced heap page. This is more efficient than scanning the entire

Uploaded by

bimo

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

46 views

How Tables and Indexes Are Stored On Disk

Uploaded by

bimo

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 14

How tables and indexes are

stored on disk
And how they are queried
Storage concepts

● Table
● Row_id
● Page
● IO
● Heap data structure
● Index data structure b-tree
● Example of a query
Logical Table
column

emp_id emp_name emp_dob emp_salary

2000 Hussein 1/2/1988 $100,000

3000 Adam 3/2/1977 $200,000

row
4000 Ali 5/2/1982 $300,000
Row_ID
● Internal and system maintained
● In certain databases (mysql -innoDB) it is the same as the primary key but other
databases like Postgres have a system column row_id (tuple_id)

row_id emp_id emp_name emp_dob emp_salary

1 2000 Hussein 1/2/1988 $100,000

2 3000 Adam 3/2/1977 $200,000

3 4000 Ali 5/2/1982 $300,000

Page 0
Page 1,10,Hussein,1/2/1
● Depending on the storage model (row vs column store), the rows are 988,$100,000|2,
stored and read in logical pages. 20,Adam,3/2/1977|
● The database doesn’t read a single row, it reads a page or more in a 3,30,Ali,5/2/1982,$
single IO and we get a lot of rows in that IO. 300,000
● Each page has a size (e.g. 8KB in postgres, 16KB in MySQL) Page 1
● Assume each page holds 3 rows in this example, with 1001 rows
you will have 1001/3 = 333~ pages ( Rows 4,5,6 ) …...
Page 2
row_id emp_id emp_name emp_dob emp_salary
( Rows 7,8,9 ) …...
1 10 Hussein 1/2/1988 $100,000
…….
2 20 Adam 3/2/1977 $200,000
Page 333
3 30 Ali 5/2/1982 $300,000
More
rows….1000,10000
... .. ... …. ….
,Eddard,1/27/1999,
$250,000
1000 10000 Eddard 1/27/1999 $250,000
Page 0

IO 1,10,Hussein,1/2/1
988,$100,000|2,
20,Adam,3/2/1977|
● IO operation (input/output) is a read request to the disk 3,30,Ali,5/2/1982,$
300,000
● We try to minimize this as much as possible
Page 1
● An IO can fetch 1 page or more depending on the disk partitions and
( Rows 4,5,6 ) …...
other factors
Page 2
● An IO cannot read a single row, its a page with many rows in them,
( Rows 7,8,9 ) …...
you get them for free.
…….
● You want to minimize the number of IOs as they are expensive.
Page 333
● Some IOs in operating systems goes to the operating system cache
More
and not disk
rows….1000,10000
,Eddard,1/27/1999,
$250,000
Heap Page 0

Heap 1,10,Hussein,1/2/1
988,$100,000|2,
20,Adam,3/2/1977|
● The Heap is data structure where the table is stored with all its 3,30,Ali,5/2/1982,$
300,000
pages one after another.
Page 1
● This is where the actual data is stored including everything
( Rows 4,5,6 ) …...
● Traversing the heap is expensive as we need to read so may data
Page 2
to find what we want
( Rows 7,8,9 ) …...
● That is why we need indexes that help tell us exactly what part of
…….
the heap we need to read. What page(s) of the heap we need to
Page 333
pull
More
rows….1000,10000
,Eddard,1/27/1999,
$250,000
Index
● An index is another data structure separate from the heap that has “pointers” to the
heap
● It has part of the data and used to quickly search for something
● You can index on one column or more.
● Once you find a value of the index, you go to the heap to fetch more information
where everything is there
● Index tells you EXACTLY which page to fetch in the heap instead of taking the hit to
scan every page in the heap
● The index is also stored as pages and cost IO to pull the entries of the index.
● The smaller the index, the more it can fit in memory the faster the search
● Popular data structure for index is b-trees, learn more on that in the b-tree section
Page 0 Heap Page 0
Index on
EMP_ID
1,10,Hussein,1/2/1
10 (1,0) | 20 (2,0) | 30 (3,0) 988,$100,000|2,
40 (4,1) | 50 (5,1) | 60 (6,1) 20,Adam,3/2/1977|
70 (7,2) | 80 (8,2) | 90 (9,2) 3,30,Ali,5/2/1982,$
IO2 on
the heap 300,000
Page 1 to pull
Page 1
exactly
IO1 on the ( Rows 4,5,6 ) …...
the index 100 (10,3) | 110 (11,3) | 120 (12,3)
130 (13,4) | 140 (14,4) | 150 (15,4) page(s)
to find the Page 2
160 (16,5) | 170 (17,5) | 180 (18,5) we found
page/row in the ( Rows 7,8,9 ) …...
index
….. …….

Page N Page 333

More
9920 (992,331) | 9930 (993,331) | 9940 (994,331)
9950 (995,332) | 9960 (996,332) | 9970 (997,332)
rows….1000,10000
9980 (998,333) | 9990 (999,333) | 10000 (1000,333) ,Eddard,1/27/1999,
$250,000
Heap Page 0

1,10,Hussein,1/2/1
988,$100,000|2,
20,Adam,3/2/1977|
3,30,Ali,5/2/1982,$
300,000
No Index - Page 1

SELECT * FROM EMP ( Rows 4,5,6 ) …...

Page 2
WHERE EMP_ID =
( Rows 7,8,9 ) …...
10000; …….

Page 333

More
rows….1000,10000
,Eddard,1/27/1999,
$250,000
Index on Page 0
EMP_ID
10 (1,0) | 20 (2,0) | 30 (3,0)
40 (4,1) | 50 (5,1) | 60 (6,1)
70 (7,2) | 80 (8,2) | 90 (9,2)

With Index - Page 1

SELECT * FROM EMP 100 (10,3) | 110 (11,3) | 120 (12,3)

130 (13,4) | 140 (14,4) | 150 (15,4)
WHERE EMP_ID = 160 (16,5) | 170 (17,5) | 180 (18,5)

10000; …..
Page N

9920 (992,331) | 9930 (993,331) | 9940 (994,331)

10000 (1000,333) 9950 (995,332) | 9960 (996,332) | 9970 (997,332)

9980 (998,333) | 9990 (999,333) | 10000 (1000,333)
Heap Page 0
10000 (1000,333)
Fetch page 333, and pull row 1,10,Hussein,1/2/1
988,$100,000|2,
10000 20,Adam,3/2/1977|
3,30,Ali,5/2/1982,$
300,000
With Index - Page 1

SELECT * FROM EMP ( Rows 4,5,6 ) …...

Page 2
WHERE EMP_ID =
( Rows 7,8,9 ) …...
10000; …….

Page 333

More
rows….1000,10000
,Eddard,1/27/1999,
$250,000
Notes
● Sometimes the heap table can be organized around a single index. This is
called a clustered index or an Index Organized Table.
● Primary key is usually a clustered index unless otherwise specified.
● MySQL InnoDB always have a primary key (clustered index) other indexes
point to the primary key “value”
● Postgres only have secondary indexes and all indexes point directly to the
row_id which lives in the heap.
Storage concepts - Summary

● Table
● Row_id
● Page
● IO
● Heap data structure
● Index data structure b-tree
● Example of a query

Lesson 9 Lecture9
No ratings yet
Lesson 9 Lecture9
45 pages
Lecture9 PDF
No ratings yet
Lecture9 PDF
45 pages
Unit-6 Storage Strategies
No ratings yet
Unit-6 Storage Strategies
43 pages
INDEXING BASCIS - Unknown
No ratings yet
INDEXING BASCIS - Unknown
59 pages
CS DBMS 8
No ratings yet
CS DBMS 8
5 pages
V_Unit[1]
No ratings yet
V_Unit[1]
36 pages
V Unit
No ratings yet
V Unit
15 pages
DBMS Storage and Indexing
No ratings yet
DBMS Storage and Indexing
80 pages
File Organization
No ratings yet
File Organization
41 pages
Lecture12(CNC 312)
No ratings yet
Lecture12(CNC 312)
36 pages
File Organizations and Indexing: R&G Chapter 8
No ratings yet
File Organizations and Indexing: R&G Chapter 8
40 pages
Indexing
No ratings yet
Indexing
62 pages
093.Indexes Part3
No ratings yet
093.Indexes Part3
27 pages
index1 (5)
No ratings yet
index1 (5)
25 pages
Overview - Explain - Measuring Performance - Disk Architectures - Indexes - Join Algorithms (CTD.)
No ratings yet
Overview - Explain - Measuring Performance - Disk Architectures - Indexes - Join Algorithms (CTD.)
69 pages
How SQL Server Indexes Work: Sharon F. Dooley
No ratings yet
How SQL Server Indexes Work: Sharon F. Dooley
42 pages
Indexing - II
No ratings yet
Indexing - II
57 pages
12 Database SQL Index Interview Questions and Answers For 2 To 5 Years Experienced
No ratings yet
12 Database SQL Index Interview Questions and Answers For 2 To 5 Years Experienced
5 pages
Indexing
No ratings yet
Indexing
141 pages
Lecture 5 Trees
No ratings yet
Lecture 5 Trees
47 pages
Understanding Indexes: User Login
No ratings yet
Understanding Indexes: User Login
10 pages
G-03 Presentation
No ratings yet
G-03 Presentation
19 pages
Indexing in Database
No ratings yet
Indexing in Database
33 pages
Memoryhierarchy Indexing
No ratings yet
Memoryhierarchy Indexing
9 pages
PPT-203105251-3
No ratings yet
PPT-203105251-3
35 pages
1 Indexing Techniques
No ratings yet
1 Indexing Techniques
30 pages
Index: Presented By-VISHAKHA CHANDRA (10030141082)
No ratings yet
Index: Presented By-VISHAKHA CHANDRA (10030141082)
29 pages
Module 12 - Managing Indexes
No ratings yet
Module 12 - Managing Indexes
19 pages
Lecture3 File Orgn
No ratings yet
Lecture3 File Orgn
13 pages
Indexing
No ratings yet
Indexing
6 pages
Indexing in DBMS
No ratings yet
Indexing in DBMS
12 pages
File Organization and Indexing (1)
No ratings yet
File Organization and Indexing (1)
38 pages
Unit 6 notes DBMS final
No ratings yet
Unit 6 notes DBMS final
14 pages
File Organizations and Indexing: R&G Chapter 8
No ratings yet
File Organizations and Indexing: R&G Chapter 8
26 pages
Indexing Lecture Nov 2023 Summary
No ratings yet
Indexing Lecture Nov 2023 Summary
41 pages
Chap. 2 File Organization and Indexing: Abel J.P. Gomes
No ratings yet
Chap. 2 File Organization and Indexing: Abel J.P. Gomes
20 pages
Index and Hashing 2017 Combined
No ratings yet
Index and Hashing 2017 Combined
60 pages
02 - Indices
No ratings yet
02 - Indices
208 pages
Lec6 QP Indexing
No ratings yet
Lec6 QP Indexing
40 pages
Chapter 8 Indexing NEW
No ratings yet
Chapter 8 Indexing NEW
43 pages
DBMS Unit-4
No ratings yet
DBMS Unit-4
9 pages
DBMS Internals: How Does It All Work?
No ratings yet
DBMS Internals: How Does It All Work?
94 pages
Unit 4 Notes
No ratings yet
Unit 4 Notes
15 pages
Index Architecture: Febriliyan Samopa
No ratings yet
Index Architecture: Febriliyan Samopa
110 pages
File Organizations and Indexing: R&G Chapter 8
No ratings yet
File Organizations and Indexing: R&G Chapter 8
40 pages
File Organizations and Indexing: R&G Chapter 8
No ratings yet
File Organizations and Indexing: R&G Chapter 8
40 pages
SQL Query Optimization
No ratings yet
SQL Query Optimization
49 pages
Indexing in DBMS
No ratings yet
Indexing in DBMS
4 pages
Les 04 Optioper
No ratings yet
Les 04 Optioper
67 pages
Inls 623 - Database Systems Ii - File Structures, Indexing, and Hashing
No ratings yet
Inls 623 - Database Systems Ii - File Structures, Indexing, and Hashing
41 pages
DBMS-U5 Notes
No ratings yet
DBMS-U5 Notes
16 pages
Indexing - DBMS
No ratings yet
Indexing - DBMS
20 pages
Indexing
No ratings yet
Indexing
6 pages
SQL Server Index Basics
No ratings yet
SQL Server Index Basics
5 pages
CS 345: Topics in Data Warehousing: Thursday, October 21, 2004
No ratings yet
CS 345: Topics in Data Warehousing: Thursday, October 21, 2004
29 pages
How Does Database Indexing Work
No ratings yet
How Does Database Indexing Work
4 pages
File Storage and Indexing: Lesson 13 Cs 3200 Kathleen Durant PHD
No ratings yet
File Storage and Indexing: Lesson 13 Cs 3200 Kathleen Durant PHD
46 pages
What Is Indexing?: Indexing Is A Data Structure Technique Which Allows You To Quickly Retrieve
100% (1)
What Is Indexing?: Indexing Is A Data Structure Technique Which Allows You To Quickly Retrieve
7 pages
How Disk Drives Work
From Everand
How Disk Drives Work
Robert Stetson
1/5 (1)
2015 Blank Weekly Calendar: Sunday Monday Tuesday Wednesday Thursday Friday Saturday
No ratings yet
2015 Blank Weekly Calendar: Sunday Monday Tuesday Wednesday Thursday Friday Saturday
7 pages
Inertial Explorer New Features
No ratings yet
Inertial Explorer New Features
26 pages
Pt. Geopranata Cipta: Surveying, Mapping, & Consulting Enginering
No ratings yet
Pt. Geopranata Cipta: Surveying, Mapping, & Consulting Enginering
1 page
Lion Air ETicket (MJKHIV) - Muhammad
No ratings yet
Lion Air ETicket (MJKHIV) - Muhammad
2 pages
Chords For Piano: Compiled by Simon Creedy (Graphic Design in Sydney) Please Distribute Freely
No ratings yet
Chords For Piano: Compiled by Simon Creedy (Graphic Design in Sydney) Please Distribute Freely
14 pages
Athletes! Aphrodite: Subscribe Share Past Issues RSS Translate
No ratings yet
Athletes! Aphrodite: Subscribe Share Past Issues RSS Translate
6 pages
Math Unit 2 Grade 3 Lesson 45 47
No ratings yet
Math Unit 2 Grade 3 Lesson 45 47
130 pages
Ebooks File (Ebook PDF) Responsive Web Design With HTML 5 & CSS 9th Edition All Chapters
100% (4)
Ebooks File (Ebook PDF) Responsive Web Design With HTML 5 & CSS 9th Edition All Chapters
49 pages
3 Islam
No ratings yet
3 Islam
41 pages
Teacher's Book: Susan Bolland
No ratings yet
Teacher's Book: Susan Bolland
7 pages
Instructional Supervision Form 2
No ratings yet
Instructional Supervision Form 2
2 pages
Untitled
No ratings yet
Untitled
6 pages
Julius Caesar Conflicting Perspectives Thesis
100% (3)
Julius Caesar Conflicting Perspectives Thesis
7 pages
KH CH Mua T TH NG 8 Năm 2015
No ratings yet
KH CH Mua T TH NG 8 Năm 2015
27 pages
EDUREKHA Data Science and ML Internship Program V2 - Program Brochure
No ratings yet
EDUREKHA Data Science and ML Internship Program V2 - Program Brochure
60 pages
3ms 1st Term Exam
No ratings yet
3ms 1st Term Exam
2 pages
A History of the English Bible as Literature 2000 A History of the Bible as Literature David Norton download pdf
100% (5)
A History of the English Bible as Literature 2000 A History of the Bible as Literature David Norton download pdf
63 pages
With The Photographer
No ratings yet
With The Photographer
3 pages
Job Skills Thesis
No ratings yet
Job Skills Thesis
5 pages
Handling Unclear Requirements
No ratings yet
Handling Unclear Requirements
44 pages
Welcome To The Best Way of Life!
No ratings yet
Welcome To The Best Way of Life!
5 pages
CVC word families - Literacy Skills - KG Grade 1
No ratings yet
CVC word families - Literacy Skills - KG Grade 1
4 pages
Art by Marc Chagall
No ratings yet
Art by Marc Chagall
4 pages
多叔逻辑口语，中国雅思口语第一品牌公共微信： ddielts 新浪微博@雅思钱多多
No ratings yet
多叔逻辑口语，中国雅思口语第一品牌公共微信： ddielts 新浪微博@雅思钱多多
5 pages
cbse_4
No ratings yet
cbse_4
17 pages
List of French Speaking Countries
No ratings yet
List of French Speaking Countries
4 pages
A3 Flyers 2022 Brunei
No ratings yet
A3 Flyers 2022 Brunei
2 pages
Proposal Karangturi Bersholawat 2021 (Durung Fix)
No ratings yet
Proposal Karangturi Bersholawat 2021 (Durung Fix)
10 pages
DWGB100 Filettature
No ratings yet
DWGB100 Filettature
32 pages
I Have a Self to Recover - Sylvia Plath and the Literary Success of the
No ratings yet
I Have a Self to Recover - Sylvia Plath and the Literary Success of the
13 pages
Discussion Questions On Tsoti
No ratings yet
Discussion Questions On Tsoti
2 pages
Words From Port.
No ratings yet
Words From Port.
4 pages
Adjectives PDF
0% (1)
Adjectives PDF
107 pages
Binary Branching
No ratings yet
Binary Branching
43 pages
Kuwadzana 1 High School
No ratings yet
Kuwadzana 1 High School
18 pages
Programming Fundamentals 3
No ratings yet
Programming Fundamentals 3
10 pages