Scaling Indexes for very large and very busy databases

Saibabu Devabhaktuni

January 2015

Who Am I

1) Director of DB engineering at eBay Inc (PayPal)
2) Managing Oracle databases since 1998
3) Blog at http://sai-oracle.blogspot.com
4) Can be contacted at [email protected]

Scope

1) Only B-tree indexes are covered; the material also applies to IOTs
2) Domain indexes are not covered
3) Applicable to 11g/12c database versions
4) Applicable to web-scale OLTP and bulk workloads
5) Some of the observations here may be incorrect or may change in future versions

Agenda

1) B-tree index concepts
2) Scalability challenges
3) Index contention
4) Scalability options
5) Online index build

B-tree Index Concepts

1) Each index block holds at least one key entry
2) Conceptual analogy: the tree gains a new level once the number of entries reaches power(entries per block, next level)
3) See http://sai-oracle.blogspot.com/2006/03/how-to-predict-index-blevel-and-when.html for more information
4) The index is balanced at all levels of the tree
5) When a non-unique index is created, uniqueness is enforced by using the rowid
6) Leaf blocks can be traversed in either direction at the leaf level
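
The power() analogy in item 2 can be sketched in a few lines of Python. This is only a conceptual estimate, assuming every block holds the same number of entries; `entries_per_block` is a hypothetical figure chosen for illustration, and real indexes vary with key length, free space, and block size.

```python
def estimated_levels(total_entries, entries_per_block):
    """Smallest number of levels h with entries_per_block ** h >= total_entries.

    Conceptual model only: assumes a uniform number of entries per block.
    """
    if total_entries < 1 or entries_per_block < 2:
        raise ValueError("need at least one entry and two entries per block")
    levels = 1
    capacity = entries_per_block
    # Each extra level multiplies the number of addressable entries.
    while capacity < total_entries:
        capacity *= entries_per_block
        levels += 1
    return levels
```

With an assumed 200 entries per block, the estimate moves from two levels to three once the index passes 200 * 200 = 40,000 entries.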

Oracle’s Implementation of B-tree Indexes

1) The root block address always stays the same [1]
2) The key value of the last leaf block pointer in the far-right branch block is defined as the max value
3) Leaf blocks are split with 50/50 key entries unless the max value is reached
4) A new leaf block is created with one key value entry when the max value is reached [2]
5) The least possible unique key values are stored in branch blocks for leaf block pointers
6) ITL entry markers are always cloned during a leaf block split (optimized in 12c after ER 16075761)
7) Optimistic space management, i.e. freed leaf blocks are not unlinked from the tree right away
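
The two split behaviours in items 3 and 4 can be illustrated with a toy model. The sketch below is not Oracle's code: it keeps leaves as a flat list of sorted Python lists, ignores branch blocks, ITLs, and space management, and uses a hypothetical `capacity` of entries per leaf.

```python
import bisect

def insert_key(leaves, key, capacity):
    """Insert key into a list of sorted leaf blocks (toy model, no branch blocks)."""
    # Pick the last leaf whose first key is <= key (leaf 0 if none qualifies).
    idx = 0
    for i, leaf in enumerate(leaves):
        if leaf and leaf[0] <= key:
            idx = i
    leaf = leaves[idx]
    if len(leaf) < capacity:
        bisect.insort(leaf, key)
        return
    is_last = idx == len(leaves) - 1
    if is_last and key > leaf[-1]:
        # Insert beyond the current max value: new leaf with a single entry.
        leaves.insert(idx + 1, [key])
    else:
        # Otherwise split the full leaf 50/50 and place the key in the right half.
        mid = len(leaf) // 2
        right = leaf[mid:]
        del leaf[mid:]
        leaves.insert(idx + 1, right)
        bisect.insort(right if key >= right[0] else leaf, key)

ascending = [[]]
for k in range(1, 11):
    insert_key(ascending, k, capacity=4)

middle = [[]]
for k in (10, 20, 30, 40, 25):
    insert_key(middle, k, capacity=4)
```

In this model, monotonically increasing keys leave full blocks behind but send every insert to the right-most leaf, while an insert in the middle of a full leaf leaves two partly filled blocks.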

Index Design Criteria

1) A non-partitioned simple B-tree index is the optimal index to support all flavors of SQL queries at the least possible cost per execution, especially if the optimizer relies on the index to avoid a sort
2) Such an index is also typically the most space efficient and incurs less physical I/O, but it is the least scalable
3) The most scalable index, designed for high DML rates, can restrict the types of SQL queries supported or increase execution cost
4) That index pattern is typically less space efficient and incurs higher physical I/O
5) Information life cycle management (ILM) poses another challenge in balancing scaling for DML against optimizing for reads
6) The best possible optimization for primary key based queries leads to an IOT; use it very sparingly, as it makes secondary index based queries much more expensive

Index Scalability Upfront Design Challenges

1) Unknown peak DML rate at design time
2) DML rate or access pattern evolving over time
3) Evolving inter-column relationships among key value combinations
4) Possible burst of DML after a slowdown further up the transaction or application call stack
5) Possible ineffective partition pruning for any top SQL
6) Possible conflict with ILM, i.e. forcing local indexes
7) Ability to identify hot indexes proactively
8) Cost of rebuilding indexes in the future, i.e. it may force a conservative and restrictive design upfront

Types of Index Contention

1) TX: index contention due to block splits (the DML limit can vary from 500 to 3000 inserts per second on a monotonically increasing column, depending on hardware and system load)
2) Buffer busy waits higher up the call stack (high DML)
3) Row lock contention for unique indexes (duplicate data inserts)
4) ITL contention (only 169 ITL slots available in an 8k block)
5) Recursive space management operations after mass delete and insert (http://sai-oracle.blogspot.com/2009/04/beware-of-index-contention-after-mass.html; fixed in 12c, bug 8446989)
6) High water mark enqueue contention (flashback can aggravate it)

Monitoring and Detection of Index Contention

1) The goal should be to detect index contention at an early stage, before it grows into an outage
2) Real-time monitoring of active sessions for index contention, grouped by current_row_obj#
3) Real-time monitoring of ASH samples, grouped by current_obj# to find the top indexes
4) Monitor segment statistics (v$segstat) for ITL contention
5) In the AWR report, look for top wait events and top segments
6) Check the rate of sequence gets for any indexes on sequence based columns
7) Profile increases in top DML executions to identify any indexes prone to contention
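
The grouping described in item 3 amounts to counting contention samples per object. The sketch below shows that idea on made-up data; the sample rows and object numbers are hypothetical, and in practice the rows would come from a query against ASH rather than a Python list.

```python
from collections import Counter

# Hypothetical ASH-like samples: (wait event, current_obj#). Made-up data.
samples = [
    ("enq: TX - index contention", 90211),
    ("buffer busy waits", 90211),
    ("enq: TX - index contention", 90211),
    ("db file sequential read", 90305),
    ("enq: TX - index contention", 90417),
]

# Wait events treated as signs of index contention in this sketch.
CONTENTION_EVENTS = {"enq: TX - index contention", "buffer busy waits"}

def top_contended_objects(rows, limit=5):
    """Count contention samples per object and return the busiest objects first."""
    counts = Counter(obj for event, obj in rows if event in CONTENTION_EVENTS)
    return counts.most_common(limit)

top = top_contended_objects(samples)
```

Running the same count at a fixed interval and alerting on a rising trend is one way to catch a hot index before it becomes an outage.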

Index Scalability Principles

1) Carefully choose the order of index key columns to reduce inserts at the max value
2) Randomize the arrival order of index key column data at the application level if possible, e.g. by combining sequence, machine id, process id, etc.
3) Try to partition index key column data at the application level, i.e. shard key column values by using another column and adding it to the index key column list, e.g. (mod(key1, n), key1)
4) Convert ordered sequences to unordered when creating indexes on those columns
5) Target fewer DML operations on index key columns (i.e. updates)
6) Keep the combined length of the index key values as low as possible
7) Avoid global indexes if possible, i.e. so that partition operations can be used to enforce ILM
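
A minimal sketch of the (mod(key1, n), key1) idea from item 3, using a sorted Python list as a stand-in for the index. The function names and the choice of n = 4 are assumptions made for illustration only.

```python
import bisect

def shard_key(key1, n):
    """Composite index key (mod(key1, n), key1)."""
    return (key1 % n, key1)

def build_index(keys, n):
    """Stand-in for the index: composite keys kept in sorted order."""
    return sorted(shard_key(k, n) for k in keys)

def range_query(index, low, high, n):
    """Emulate a range query as the union of n per-bucket range scans."""
    found = []
    for bucket in range(n):
        start = bisect.bisect_left(index, (bucket, low))
        stop = bisect.bisect_right(index, (bucket, high))
        found.extend(key for _, key in index[start:stop])
    return sorted(found)

index = build_index(range(1, 21), 4)
buckets_hit = {shard_key(k, 4)[0] for k in (17, 18, 19, 20)}
```

Consecutive key values land in n different regions of the index, which spreads the insert point, while a range query has to probe every bucket and merge the results.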

Index Scalability Options at DB level

1) Reverse key index
    1) Pro: Up to 100 insertion points at any point in time for sequence based column values
    2) Con: Increased physical I/O due to data distribution, and queries need exact predicate values
2) Global hash partitioned index
    1) Pro: Better data affinity than reverse key, and supports range queries
    2) Con: Inefficient partition operations (ILM), and key column length can’t be changed online
3) Function based index, i.e. using (mod(key column, x), key column) or virtual columns
    1) Pro: Better data affinity and efficient ILM partition operations
    2) Con: Requires application query changes and UNION ALL based range queries
4) Key column data arrival randomization at application level
    1) Pro: No changes at DB level; application queries are based on the data arrival pattern
    2) Con: Organizational challenges, and less data affinity leading to more physical I/O
5) Indexes on columns based on non-ordered (NOORDER) sequences in RAC
    1) Pro: Scales well in RAC
    2) Con: Application dependency on sequence order, and some queries can become expensive
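
The effect of a reverse key index can be illustrated by reversing the bytes of each key before sorting. The sketch assumes a plain 4-byte big-endian integer encoding, which is a simplification: Oracle stores NUMBER values in its own internal format, so the exact spread of insertion points differs from what this model shows.

```python
def reverse_key(value, width=4):
    """Reverse the bytes of a fixed-width big-endian encoding (simplified model)."""
    return value.to_bytes(width, "big")[::-1]

# Consecutive sequence values differ in their leading byte once reversed,
# so they sort into different regions of the index.
leading_bytes = [reverse_key(k)[0] for k in range(1000, 1004)]

# Reversal does not preserve key order, which is why range predicates
# cannot be answered by a range scan on such an index.
order_is_lost = reverse_key(256) < reverse_key(1)
```

Because neighbouring values are scattered, inserts no longer pile up on the right-most leaf block, at the cost of losing ordered access.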

Index Scalability Options (new in 12c)

1) Standard_hash function based index for large key columns
    1) Pro: Key value length is fixed at 128 bytes (for 64 bit); no application code change is required
    2) Con: Only useful when the column value length is much higher than 128 bytes
2) Deferred global index maintenance, leading to better adoption (implemented with ER 8677124)
    1) Pro: ILM partition operations are much faster (set event 43820 for space management bug)
    2) Con: Higher redo due to offline index entry deletes, and higher physical I/O due to a larger index
3) Partial indexes for partitioned tables
    1) Pro: Faster data loads, and enables a read optimized index for less active partitions
    2) Con: All query patterns need to be understood
4) Partitioned sequences (undocumented), i.e. using them for an index key column
    1) Pro: Meant for scaling indexes in a RAC environment
    2) Con: Needs application compatibility with out of order sequences, and ORDER BY queries are slower
5) Online drop index operation, in case any index type or structure change is needed
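
The idea behind item 1, indexing a fixed-length hash instead of a long column value, can be sketched with Python's hashlib. This uses SHA-1 from hashlib as a stand-in and is not Oracle's STANDARD_HASH; the digest size shown is that of the Python call and says nothing about the stored key length quoted above.

```python
import hashlib

def hashed_key(value):
    """Fixed-length digest of a string value, usable as an equality lookup key."""
    return hashlib.sha1(value.encode("utf-8")).digest()

short_key = hashed_key("a")
long_key = hashed_key("a" * 4000)
```

Both keys have the same length regardless of the input size, which keeps index entries small for long values. A hash preserves no ordering, so a key of this kind only supports equality lookups.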

Online Index Build Scalability

1) Best designed online feature among all Oracle database HA features
2) See http://www.nocoug.org/download/2012-02/Internals%20of%20online%20index%20build.pdf for the internals of online index build
3) No DML contention during the prepare or merge phase
4) Speed of the index build is less influenced by the DML rate than in the older implementation
5) The journal IOT table is automatically cleaned up after an online index build is aborted

ERs for Further Scaling of Indexes

1) ER 12979221: Reducing index contention during branch/leaf block splits
2) ER 16075761: Initiate leaf block split when ITL limit reached
3) ER 9912950: Contention on the underlying journal IOT table when a scalable index is being built (i.e. reverse key or global hash partitioned index)
4) Bug 10038517: dbms_repair.online_index_clean waits for exclusive table lock
5) ER 8759587: Ability to create a composite partitioned local index while the table is only partitioned at the top level (i.e. table partitioned by range and local index composite partitioned by range and hash)
6) Bug 18715233: ORA-600 with online index build when an MSSM tablespace is used

Summary

1) Consider designing indexes upfront for future scalability needs
2) Prefer index scaling through application changes
3) Prefer enforcing ILM policies through optimal index design
4) All operations of index scaling options at DB level are online as of 12c
5) Proactively monitor ASH data and AWR reports for any signs of index contention
6) Don’t shy away from using the online index build feature to fix index scaling
7) When designed properly, Oracle indexes are capable of handling web-scale workloads.
