0% found this document useful (0 votes)

716 views10 pages

01 Become A PostgreSQL DBA Understanding The Architecture

The document discusses the architecture of PostgreSQL, including its components like shared memory, background processes, and database structure. It covers topics like shared buffer and WAL buffer in shared memory, the roles of postmaster, backend, and other processes, and how databases, tables, and tablespaces are organized in the file system. Examples are provided to illustrate the creation of databases, tables, and tablespaces and how they relate to each other physically.

Uploaded by

Stephen Efange

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

716 views10 pages

01 Become A PostgreSQL DBA Understanding The Architecture

Uploaded by

Stephen Efange

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 10

Become a PostgreSQL DBA: Understanding the Architecture

PostgreSQL is probably the most advanced database in the open source relational database market. It was first
released in 1989, and since then, there have been a lot of enhancements. According to db-engines, it is the fourth
most used database at the time of writing.

In this blog, we will discuss PostgreSQL internals, its architecture, and how the various components of PostgreSQL
interact with one another. This will serve as a starting point and building block for the remainder of our Become a
PostgreSQL DBA blog series.

PostgreSQL Architecture
The physical structure of PostgreSQL is very simple. It consists of shared memory and a few background processes
and data files. (See Figure 1-1)

Figure 1-1. PostgreSQL structure

Shared Memory
Shared Memory refers to the memory reserved for database caching and transaction log caching. The most important
elements in shared memory are Shared Buffer and WAL buffers

Shared Buffer

The purpose of Shared Buffer is to minimize DISK IO. For this purpose, the following principles must be met
 You need to access very large (tens, hundreds of gigabytes) buffers quickly.
 You should minimize contention when many users access it at the same time.
 Frequently used blocks must be in the buffer for as long as possible

WAL Buffer

The WAL buffer is a buffer that temporarily stores changes to the database. The contents stored in the WAL buffer
are written to the WAL file at a predetermined point in time. From a backup and recovery point of view, WAL
buffers and WAL files are very important.

PostgreSQL Process Types

PostgreSQL has four process types.

1. Postmaster (Daemon) Process

2. Background Process
3. Backend Process
4. Client Process

Postmaster Process

The Postmaster process is the first process started when you start PostgreSQL. At startup, performs recovery,
initialize shared memory, and run background processes. It also creates a backend process when there is a
connection request from the client process. (See Figure 1-2)

Figure 1-2. Process relationship diagram

If you check the relationships between processes with the pstree command, you can see that the Postmaster process
is the parent process of all processes. (For clarity, I added the process name and argument after the process ID)
Background Process

The list of background processes required for PostgreSQL operation are as follows. (See Table 1-1)

Process Role
logger Write the error message to the log file.
checkpointer When a checkpoint occurs, the dirty buffer is written to the file.
writer Periodically writes the dirty buffer to a file.
wal writer Write the WAL buffer to the WAL file.
Autovacuum Fork autovacuum worker when autovacuum is enabled.It is the responsibility of the autovacuum
launcher daemon to carry vacuum operations on bloated tables on demand
archiver When in Archive.log mode, copy the WAL file to the specified directory.
DBMS usage statistics such as session execution information ( pg_stat_activity ) and table usage
stats collector
statistical information ( pg_stat_all_tables ) are collected.

Backend Process

The maximum number of backend processes is set by the max_connections parameter, and the default value is 100.
The backend process performs the query request of the user process and then transmits the result. Some memory
structures are required for query execution, which is called local memory. The main parameters associated with
local memory are:

1. work_mem Space used for sorting, bitmap operations, hash joins, and merge joins. The default setting is 4
MB.
2. Maintenance_work_mem Space used for Vacuum and CREATE INDEX . The default setting is 64 MB.
3. Temp_buffers Space used for temporary tables. The default setting is 8 MB.

Client Process

Client Process refers to the background process that is assigned for every backend user connection.Usually the
postmaster process will fork a child process that is dedicated to serve a user connection.
Database Structure
Here are some things that are important to know when attempting to understand the database structure of
PostgreSQL.

Items related to the database

1. PostgreSQL consists of several databases. This is called a database cluster.

2. When initdb () is executed, template0 , template1 , and postgres databases are created.
3. The template0 and template1 databases are template databases for user database creation and contain the
system catalog tables.
4. The list of tables in the template0 and template1 databases is the same immediately after initdb ().
However, the template1 database can create objects that the user needs.
5. The user database is created by cloning the template1 database.

Items related to the tablespace

1. The pg_default and pg_global tablespaces are created immediately after initdb().
2. If you do not specify a tablespace at the time of table creation, it is stored in the pg_dafault tablespace.
3. Tables managed at the database cluster level are stored in the pg_global tablespace.
4. The physical location of the pg_default tablespace is $PGDATA\base.
5. The physical location of the pg_global tablespace is $PGDATA\global.
6. One tablespace can be used by multiple databases. At this time, a database-specific subdirectory is created
in the table space directory.
7. Creating a user tablespace creates a symbolic link to the user tablespace in the $PGDATA\tblspc directory.

Items related to the table

1. There are three files per table.

2. One is a file for storing table data. The file name is the OID of the table.
3. One is a file to manage table free space. The file name is OID_fsm .
4. One is a file for managing the visibility of the table block. The file name is OID_vm .
5. The index does not have a _vm file. That is, OID and OID_fsm are composed of two files.

Other Things to Remember...

The file name at the time of table and index creation is OID, and OID and pg_class.relfilenode are the same at this
point. However, when a rewrite operation ( Truncate , CLUSTER , Vacuum Full , REINDEX , etc.) is performed,
the relfilenode value of the affected object is changed, and the file name is also changed to the relfilenode value.
You can easily check the file location and name by using pg_relation_filepath ('< object name >'). template0,
template1, postgres database

Running Tests
If you query the pg_database view after initdb() , you can see that the template0 , template1 , and postgres databases
have been created.

 Through the datistemplate column, you can see that the template0 and template1 databases are database for
template for user database creation.
 The datlowconn column indicates whether the database can be accessed. Since the template0 database can’t
be accessed, the contents of the database can’t be changed either.
 The reason for providing two databases for the templateis that the template0 database is the initial state
template and the template1 database is the template added by the user.
 The postgres database is the default database created using the template1 database. If you do not specify a
database at connection time, you will be connected to the postgres database.
 The database is located under the $PGDATA/base directory. The directory name is the database OID
number.

Create User Database

The user database is created by cloningthe template1 database. To verify this, create a user table T1 in the template1
database. After creating the mydb01 database, check that the T1 table exists. (See Figure 1-3.)
Figure 1-3. Relationship between Template Database and User Database

pg_default tablespace

If you query pg_tablespace after initdb (), you can see that the pg_default and pg_global tablespaces have been
created.

The location of the pg_default tablespace is $PGDATA\base. There is a subdirectory by database OID in this
directory. (See Figure 1-4)
Figure 1-4. Pg_default tablespace and database relationships from a physical configuration perspective

pg_global tablespace

The pg_global tablespace is a tablespace for storing data to be managed at the 'database cluster' level.

 For example, tables of the same type as the pg_database table provide the same information whether they
are accessed from any database. (See Figure 1-5)
 The location of the pg_global tablespace is $PGDATA\global.

Figure 1-5. Relationship between pg_global tablespace and database

Create User Tablespace

1postgres=# create tablespace myts01 location '/data01';

The pg_tablespace shows that the myts01 tablespace has been created.
Symbolic links in the $PGDATA/pg_tblspc directory point to tablespace directories.

Connect to the postgres and mydb01 databases and create the table.

If you look up the /data01 directory after creating the table, you will see that the OID directory for the postgres and
mydb01 databases has been created and that there is a file in each directory that has the same OID as the T1 table.
How to Change Tablespace Location

PostgreSQL specifies a directory when creating tablespace. Therefore, if the file system where the directory is
located is full, the data can no longer be stored. To solve this problem, you can use the volume manager. However,
if you can’t use the volume manager, you can consider changing the tablespace location. The order of operation is as
follows.
Note: Tablespaces are also very useful in environments that use partition tables. Because you can use different
tablespaces for each partition table, you can more flexibly cope with file system capacity problems.

What is Vacuum?
Vacuum does the following:

1. Gathering table and index statistics

2. Reorganize the table
3. Clean up tables and index dead blocks
4. Frozen by record XID to prevent XID Wraparound

#1 and #2 are generally required for DBMS management. But #3 and #4 are necessary because of the PostgreSQL
MVCC feature

Postgres DBA Interview Questions
100% (2)
Postgres DBA Interview Questions
13 pages
PostgreSQL Database Administration Vol 1
100% (3)
PostgreSQL Database Administration Vol 1
124 pages
Patroni
100% (1)
Patroni
137 pages
Oracle Database Administration Interview Questions You'll Most Likely Be Asked: Job Interview Questions Series
From Everand
Oracle Database Administration Interview Questions You'll Most Likely Be Asked: Job Interview Questions Series
Vibrant Publishers
5/5 (1)
Starting Database Administration: Oracle DBA
From Everand
Starting Database Administration: Oracle DBA
anuragbaruah84
3/5 (2)
PostgreSQL DBA Contents
No ratings yet
PostgreSQL DBA Contents
2 pages
Postgresql InterviewQuestion
100% (1)
Postgresql InterviewQuestion
5 pages
205 Oracle To Postgres Migration
100% (2)
205 Oracle To Postgres Migration
58 pages
PostgreSQL Administration
No ratings yet
PostgreSQL Administration
66 pages
EnterpriseDB PostgreSQL Exercises
No ratings yet
EnterpriseDB PostgreSQL Exercises
29 pages
Ora2postgres DF
No ratings yet
Ora2postgres DF
72 pages
Navigating The Linux File System: (Edwin Achimbi)
100% (1)
Navigating The Linux File System: (Edwin Achimbi)
4 pages
Solutions To Written Assignment 3
67% (3)
Solutions To Written Assignment 3
4 pages
Postgresql DBA Architecture
100% (1)
Postgresql DBA Architecture
60 pages
Tuning PostgreSQL With Pgbench
No ratings yet
Tuning PostgreSQL With Pgbench
11 pages
Tuning Your PostgreSQL Server
No ratings yet
Tuning Your PostgreSQL Server
12 pages
Instant PostgreSQL Backup and Restore How-to
From Everand
Instant PostgreSQL Backup and Restore How-to
Shaun Thomas
No ratings yet
The Internals of PostgreSQL - Chapter 1 Database Cluster, Databases, and Tables
No ratings yet
The Internals of PostgreSQL - Chapter 1 Database Cluster, Databases, and Tables
10 pages
Postgres Topic
No ratings yet
Postgres Topic
116 pages
PostgreSQL Proficiency For Python People
No ratings yet
PostgreSQL Proficiency For Python People
215 pages
PostgreSQL 9 Administration Cookbook LITE: Configuration, Monitoring and Maintenance
From Everand
PostgreSQL 9 Administration Cookbook LITE: Configuration, Monitoring and Maintenance
Simon Riggs
3/5 (1)
PostgreSQL Performance Tuning
100% (9)
PostgreSQL Performance Tuning
63 pages
Inside PostgreSQL Shared Memory
100% (3)
Inside PostgreSQL Shared Memory
25 pages
15 Advanced PostgreSQL Commands
No ratings yet
15 Advanced PostgreSQL Commands
11 pages
Troubleshooting PostgreSQL - Sample Chapter
100% (1)
Troubleshooting PostgreSQL - Sample Chapter
15 pages
Administration PGSQL
No ratings yet
Administration PGSQL
109 pages
Performance Tuning PostgreSQL
No ratings yet
Performance Tuning PostgreSQL
25 pages
Administration PostgreSQL
50% (2)
Administration PostgreSQL
109 pages
9) Locking in Mysql
100% (1)
9) Locking in Mysql
15 pages
Administrating A MySQL Server
No ratings yet
Administrating A MySQL Server
6 pages
Introduction To Innodb Monitoring System
No ratings yet
Introduction To Innodb Monitoring System
31 pages
Internals of PostgreSQL Wal
100% (1)
Internals of PostgreSQL Wal
51 pages
Mysql Monitoring
100% (2)
Mysql Monitoring
20 pages
Monitoring Postgresql
No ratings yet
Monitoring Postgresql
38 pages
Mysql Dba Qa
No ratings yet
Mysql Dba Qa
4 pages
MySQL DBA Interview Questions
100% (2)
MySQL DBA Interview Questions
4 pages
Pgpool-II For Beginners
No ratings yet
Pgpool-II For Beginners
12 pages
Pganalyze - Best Practices For Optimizing Postgres Query Performance
100% (1)
Pganalyze - Best Practices For Optimizing Postgres Query Performance
26 pages
Database Partitioning With MySQL
No ratings yet
Database Partitioning With MySQL
6 pages
PostgreSQL Replication - Second Edition - Sample Chapter
No ratings yet
PostgreSQL Replication - Second Edition - Sample Chapter
27 pages
PostgreSQL Notes For Professionals+
100% (1)
PostgreSQL Notes For Professionals+
72 pages
Korn Shell (KSH) Programming
100% (1)
Korn Shell (KSH) Programming
34 pages
Postgres Admin
No ratings yet
Postgres Admin
109 pages
How To Set Up PostgreSQL For High Availability and Replication With Hot Standby
No ratings yet
How To Set Up PostgreSQL For High Availability and Replication With Hot Standby
11 pages
PGSQL CheatSheet Mysql2psql
No ratings yet
PGSQL CheatSheet Mysql2psql
7 pages
MySQL DBA
100% (6)
MySQL DBA
6 pages
DB2 9.7 for Linux, UNIX, and Windows Database Administration: Certification Study Notes
From Everand
DB2 9.7 for Linux, UNIX, and Windows Database Administration: Certification Study Notes
Roger E. Sanders
5/5 (1)
Percona XtraDB Cluster 5.7
No ratings yet
Percona XtraDB Cluster 5.7
92 pages
Advanced MySQL Administration and Programming
No ratings yet
Advanced MySQL Administration and Programming
35 pages
MySQL Advance
No ratings yet
MySQL Advance
27 pages
Oracle Dba Scripts
50% (2)
Oracle Dba Scripts
153 pages
Packt postgreSQL 9 6 High Performance 1784392979
100% (4)
Packt postgreSQL 9 6 High Performance 1784392979
495 pages
PostgreSQL Internals Through Pictures
100% (3)
PostgreSQL Internals Through Pictures
72 pages
Mysql Performance Tuning
No ratings yet
Mysql Performance Tuning
17 pages
MongoDB Manual Master
No ratings yet
MongoDB Manual Master
1,117 pages
MySQL Commands1 PDF
No ratings yet
MySQL Commands1 PDF
3 pages
PostgreSQL Quick Start
100% (1)
PostgreSQL Quick Start
57 pages
PostgreSQL CHEAT SHEET
No ratings yet
PostgreSQL CHEAT SHEET
8 pages
EDB High Availability Scalability v1.0
No ratings yet
EDB High Availability Scalability v1.0
23 pages
PostgreSQL 9.0 High Performance
From Everand
PostgreSQL 9.0 High Performance
Gregory Smith
4/5 (1)
Mastering MariaDB
From Everand
Mastering MariaDB
Razzoli Federico
No ratings yet
Oracle Database Mastery: Comprehensive Techniques for Advanced Application
From Everand
Oracle Database Mastery: Comprehensive Techniques for Advanced Application
Adam Jones
No ratings yet
Ndole Recipe
No ratings yet
Ndole Recipe
2 pages
Acquiring and Managing Software: A-Debian GNU/Linux
No ratings yet
Acquiring and Managing Software: A-Debian GNU/Linux
4 pages
Chapter 1 - Mysql Cookbook Installing and Upgrading Mysql
No ratings yet
Chapter 1 - Mysql Cookbook Installing and Upgrading Mysql
6 pages
Install Solaris
No ratings yet
Install Solaris
3 pages
VTC MySQL Architechture
No ratings yet
VTC MySQL Architechture
25 pages
VTC MySQL Configuration
No ratings yet
VTC MySQL Configuration
23 pages
CBT MySQL Architecture 1
No ratings yet
CBT MySQL Architecture 1
20 pages
70-450 SQL Server Instance Security
No ratings yet
70-450 SQL Server Instance Security
3 pages
Upgrade
No ratings yet
Upgrade
11 pages
#3. Installing Mysql Using Linux Generic Binaries
No ratings yet
#3. Installing Mysql Using Linux Generic Binaries
2 pages
Create Schemas Script
No ratings yet
Create Schemas Script
13 pages
CBT MySQL Architecture 2
No ratings yet
CBT MySQL Architecture 2
18 pages
Booking Confirmation
No ratings yet
Booking Confirmation
56 pages
ChatLog Linux - MySQL Classs 2021-01-29 11 - 45
No ratings yet
ChatLog Linux - MySQL Classs 2021-01-29 11 - 45
1 page
Pluralsight
No ratings yet
Pluralsight
2 pages
An Introduction To MariaDB's Data at Rest Encryption (DARE) - Part 1
No ratings yet
An Introduction To MariaDB's Data at Rest Encryption (DARE) - Part 1
2 pages
How To Fix A Corrupt User Profile in Windows - How-To - PC Advisor
No ratings yet
How To Fix A Corrupt User Profile in Windows - How-To - PC Advisor
3 pages
Install The Anbox Snap: Install DKMS Package From PPA
No ratings yet
Install The Anbox Snap: Install DKMS Package From PPA
2 pages
Be - Information Technology Engineering - Semester 5 - 2024 - May - Operating Systems Os Pattern 2019
No ratings yet
Be - Information Technology Engineering - Semester 5 - 2024 - May - Operating Systems Os Pattern 2019
2 pages
1Z0 822
No ratings yet
1Z0 822
4 pages
Vr23 Oopj Unit 5 Qbank
No ratings yet
Vr23 Oopj Unit 5 Qbank
2 pages
Oracle 1Z0-821 Exams - Free VCE Examcollection - Us PDF
No ratings yet
Oracle 1Z0-821 Exams - Free VCE Examcollection - Us PDF
18 pages
RPi USB Audio Gadget HowTo Oct 23
No ratings yet
RPi USB Audio Gadget HowTo Oct 23
21 pages
NMCNTT-03-Operating Systems
No ratings yet
NMCNTT-03-Operating Systems
57 pages
Docker For Local Web Development, Part 3-A Three-Tier Architecture With Frameworks
No ratings yet
Docker For Local Web Development, Part 3-A Three-Tier Architecture With Frameworks
42 pages
Building Oscam
No ratings yet
Building Oscam
5 pages
Fesetup Installation
No ratings yet
Fesetup Installation
14 pages
Sti Trace
No ratings yet
Sti Trace
2 pages
Unpacking For Dummies Compressed
No ratings yet
Unpacking For Dummies Compressed
78 pages
CPU scheduling
No ratings yet
CPU scheduling
36 pages
Assignment 03 Sol (1)
No ratings yet
Assignment 03 Sol (1)
23 pages
Setup Act
No ratings yet
Setup Act
10 pages
Dr.G.R.Damodaran College of Science
No ratings yet
Dr.G.R.Damodaran College of Science
35 pages
Ibm Z 1
No ratings yet
Ibm Z 1
546 pages
Crash 20241018
No ratings yet
Crash 20241018
2 pages
CS Final Sample Paper
No ratings yet
CS Final Sample Paper
6 pages
Olevel 1 Ittnb b3 03april20 AKT
No ratings yet
Olevel 1 Ittnb b3 03april20 AKT
3 pages
Disabling Dr. Watson Debugger
No ratings yet
Disabling Dr. Watson Debugger
3 pages
The Ultimate Docker Cheat Sheet
No ratings yet
The Ultimate Docker Cheat Sheet
12 pages
Mod 3 Solutions
No ratings yet
Mod 3 Solutions
14 pages
DBMS Chapter9 Exercise Answers
No ratings yet
DBMS Chapter9 Exercise Answers
3 pages
Parallel Programming Models: Sathish Vadhiyar
No ratings yet
Parallel Programming Models: Sathish Vadhiyar
32 pages
Solaris 10 Disk Layout
No ratings yet
Solaris 10 Disk Layout
3 pages
Log
No ratings yet
Log
12 pages
Hbase Apache Org Book HTML
No ratings yet
Hbase Apache Org Book HTML
482 pages

01 Become A PostgreSQL DBA Understanding The Architecture

Uploaded by

01 Become A PostgreSQL DBA Understanding The Architecture

Uploaded by

Become a PostgreSQL DBA: Understanding the Architecture

Figure 1-1. PostgreSQL structure

PostgreSQL Process Types

1. Postmaster (Daemon) Process

Figure 1-2. Process relationship diagram

Items related to the database

1. PostgreSQL consists of several databases. This is called a database cluster.

Items related to the tablespace

Items related to the table

1. There are three files per table.

Other Things to Remember...

Create User Database

Figure 1-5. Relationship between pg_global tablespace and database

Create User Tablespace

1postgres=# create tablespace myts01 location '/data01';

1. Gathering table and index statistics

You might also like