How to compute the complexity parameter α?: Study Notes of CART

The document discusses how to compute the complexity parameter (α) for CART decision tree models. It presents a formula to calculate α as the ratio of the difference between the risk of a node (R(t)) and its subtree (R(Tt)) to the number of terminal nodes in the subtree minus one. It proves this formula works by showing that increasing α increases the risk of subtrees faster than individual nodes, until their risks are equal. An example calculation on a sample dataset with 5 terminal nodes demonstrates applying the formula to compute α for each node.

Uploaded by

Anthony Castro Adrianzén

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

70 views3 pages

How to compute the complexity parameter α?: Study Notes of CART

Uploaded by

Anthony Castro Adrianzén

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Study Notes of CART (2):

How to compute the complexity parameter α?

by Yin Zhao
School of Mathematical Sciences
USM, Penang, Malaysia
December 2013

Proposition: The complexity parameter α (i.e. cp in R “rpart” package) is

R(t) − R(Tt )
α=
|T̃ | − 1

Proof:
Recall that the definition: Rα (T ) = R(T ) + α|T̃ |, and Tt is a branch including node
t. For any single node t ∈ T , we have

Rα (t) = R(t) + α (1)

since there is only one terminal node at a single node.

Similarly, for any branch Tt ∈ T , we have

Rα (Tt ) = R(Tt ) + α|T̃t | (2)

When α = 0, R0 (t) = R(t) > R(Tt ) = R0 (Tt ). This inequality is guaranteed

because the first step of pruning is to prune off all of the terminal nodes which
satisfy R(t) = R(tL ) + R(tR ). That is, the remaining nodes must be satisfied
R(t) > R(tL ) + R(tR ) (the details can be found in the previous notes). Further-
more, the inequality holds for sufficient small α.
Then if we gradually increase α, Rα (Tt ) increases faster than Rα (t) since the coeffi-
cients |T̃t | > 1. In other words, at a certain α we will have Rα (Tt ) = Rα (t). Solve the
equations (1) and (2), we have

R(t) + α = R(Tt ) + α|T̃t |

1
⇒ (T̃t − 1) · α = R(t) − R(Tt )
R(t) − R(Tt )
⇒α=
|T̃ | − 1
as desired.
Example:
This example will simply show how to calculate the complexity parameter α (see
Figure 1 below). The data set has 2 classes say A, B, and 200 samples in all. T1 is a
subtree of the whole tree T , there are 5 terminal nodes in T1 , say t5 , t6 , t7 , t8 , and t9 .

Figure 1: Subtree T1 obtains 5 leaves

According to the formula, we have

R(t1 ) − R(Tt1 ) 100/200 − 0

α(T1 (t1 )) = = = 1/8
5−1 4
R(t2 ) − R(Tt2 ) 10/200 − 0
α(T1 (t2 )) = = = 1/40
3−1 2
R(t3 ) − R(Tt3 ) 60/200 − 0
α(T1 (t3 )) = = = 3/10
2−1 1

2
R(t4 ) − R(Tt4 ) 2/200 − 0
α(T1 (t4 )) = = = 1/100
2−1 1
α(T1 (t4 )) is the first value of α since it obtains the lowest value. That is, we prune
the tree below the node t4 . After this a new iteration should be used as before and
the tree will be pruned once again.

Data Structures U1
No ratings yet
Data Structures U1
88 pages
Final Material
No ratings yet
Final Material
176 pages
CS214-lec-3-4 Complexity
No ratings yet
CS214-lec-3-4 Complexity
65 pages
Chapter 2-Analysis of Algorithms: 2021 Prepared By: Beimnet G
No ratings yet
Chapter 2-Analysis of Algorithms: 2021 Prepared By: Beimnet G
51 pages
Time Complexity
No ratings yet
Time Complexity
63 pages
Dm-Classtrees-2-2018 PDF
No ratings yet
Dm-Classtrees-2-2018 PDF
46 pages
Algorithms_2022-2023
No ratings yet
Algorithms_2022-2023
258 pages
Lecture 1 or 2 Notes
No ratings yet
Lecture 1 or 2 Notes
64 pages
AST Day 3 Slides
No ratings yet
AST Day 3 Slides
79 pages
Complexity Lec#3
No ratings yet
Complexity Lec#3
25 pages
Introduction To Algorithms: K. Sudarshana
No ratings yet
Introduction To Algorithms: K. Sudarshana
84 pages
Design Analysis and Algorithm Organiser
No ratings yet
Design Analysis and Algorithm Organiser
138 pages
Lecture4 RecursiveAlgo Vs DP
No ratings yet
Lecture4 RecursiveAlgo Vs DP
28 pages
DSA Unit1 2
No ratings yet
DSA Unit1 2
93 pages
Algo PPT
No ratings yet
Algo PPT
146 pages
CS214 DS2024 Lec 2 Complexity
No ratings yet
CS214 DS2024 Lec 2 Complexity
29 pages
Design and Analysis-Unit 1
No ratings yet
Design and Analysis-Unit 1
74 pages
Algorithm Analysis Module 1 Important Topics
No ratings yet
Algorithm Analysis Module 1 Important Topics
30 pages
DSA Module1
No ratings yet
DSA Module1
67 pages
DAA_unit1
No ratings yet
DAA_unit1
145 pages
01-Slides
No ratings yet
01-Slides
109 pages
Josh Coding Questions
No ratings yet
Josh Coding Questions
7 pages
Slide 2
No ratings yet
Slide 2
22 pages
2 Program Complexities
No ratings yet
2 Program Complexities
37 pages
DAA - Ch. 1
No ratings yet
DAA - Ch. 1
64 pages
Lec 1 Week 1
No ratings yet
Lec 1 Week 1
32 pages
Notes
No ratings yet
Notes
60 pages
Analysis of Algorithm
No ratings yet
Analysis of Algorithm
23 pages
Complexity
No ratings yet
Complexity
13 pages
DAA-Unit-I
No ratings yet
DAA-Unit-I
69 pages
INTRODUCTION
No ratings yet
INTRODUCTION
22 pages
ADSA_IA1_solution
No ratings yet
ADSA_IA1_solution
9 pages
CH1 Part1
No ratings yet
CH1 Part1
40 pages
Chapter 1 - Analysis of Algorithms 2
No ratings yet
Chapter 1 - Analysis of Algorithms 2
44 pages
Desgin Analysis And Algorithms Full notes
No ratings yet
Desgin Analysis And Algorithms Full notes
29 pages
ads unit 1
No ratings yet
ads unit 1
24 pages
Introduction Algorithm
No ratings yet
Introduction Algorithm
53 pages
1 Theme: Comparison of The Implementation of The CART Algorithm Under Tanagra and R (Rpart Package)
No ratings yet
1 Theme: Comparison of The Implementation of The CART Algorithm Under Tanagra and R (Rpart Package)
15 pages
Daa Unit 1 Lk5kzl
No ratings yet
Daa Unit 1 Lk5kzl
7 pages
Ads Unit 1
No ratings yet
Ads Unit 1
15 pages
QP4
No ratings yet
QP4
28 pages
ADSA Unit-I (1)
No ratings yet
ADSA Unit-I (1)
21 pages
algorithm_analysis
No ratings yet
algorithm_analysis
5 pages
Document 9
No ratings yet
Document 9
15 pages
Week 3
No ratings yet
Week 3
38 pages
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
No ratings yet
DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING
7 pages
Quantum Design and Analysis of Algorithms Full PDF
100% (1)
Quantum Design and Analysis of Algorithms Full PDF
196 pages
UNIT 1_DAA_NOTES_2023
No ratings yet
UNIT 1_DAA_NOTES_2023
30 pages
ada mqp solved
No ratings yet
ada mqp solved
27 pages
Cheatsheet On Algorithmic Concepts! ?
No ratings yet
Cheatsheet On Algorithmic Concepts! ?
7 pages
Algorithm and Complexity
No ratings yet
Algorithm and Complexity
42 pages
Apple T Notes
No ratings yet
Apple T Notes
29 pages
Algorithms Unit-1
No ratings yet
Algorithms Unit-1
7 pages
DAA IA1 Updated
No ratings yet
DAA IA1 Updated
14 pages
2marks DAA
No ratings yet
2marks DAA
10 pages
unit1
No ratings yet
unit1
3 pages
05 DSA PPT Algorithmic Anaysis-II
No ratings yet
05 DSA PPT Algorithmic Anaysis-II
19 pages
Cambridge - IGCSE - ComputerScience - Chapter (7&8) Notes
100% (2)
Cambridge - IGCSE - ComputerScience - Chapter (7&8) Notes
55 pages
Tafl pyqs
No ratings yet
Tafl pyqs
19 pages
Artificial Intelligence: "PR Teemuenr
No ratings yet
Artificial Intelligence: "PR Teemuenr
472 pages
Problem Set 2: Instructions
No ratings yet
Problem Set 2: Instructions
4 pages
05 Brute Force
No ratings yet
05 Brute Force
56 pages
cs188-su24-lec03
No ratings yet
cs188-su24-lec03
90 pages
Rice Theorem
No ratings yet
Rice Theorem
2 pages
Data Structures & Algorithms: Lecture 4: Linked Lists
No ratings yet
Data Structures & Algorithms: Lecture 4: Linked Lists
54 pages
01 DS and Algorithm Session 01
100% (4)
01 DS and Algorithm Session 01
28 pages
Introduction to the Design and Analysis of Algorithms 3rd Edition Levitin Solutions Manualpdf download
100% (4)
Introduction to the Design and Analysis of Algorithms 3rd Edition Levitin Solutions Manualpdf download
54 pages
DRL - AI309 - A - Assignment - 1 - F24 - GIKI
No ratings yet
DRL - AI309 - A - Assignment - 1 - F24 - GIKI
3 pages
dsa roadmap
No ratings yet
dsa roadmap
10 pages
hw3-sol-1
No ratings yet
hw3-sol-1
1 page
Construction of Some New Quantum BCH Codes
No ratings yet
Construction of Some New Quantum BCH Codes
10 pages
Chapter 2 Itterative Algorithms
No ratings yet
Chapter 2 Itterative Algorithms
8 pages
Chapter 9 (Data Structure - I)
No ratings yet
Chapter 9 (Data Structure - I)
4 pages
Sample Exam Solutions: CENG 351
No ratings yet
Sample Exam Solutions: CENG 351
14 pages
Flat Apr 2023
No ratings yet
Flat Apr 2023
2 pages
Single Linked List - Deletion
No ratings yet
Single Linked List - Deletion
18 pages
Model Viva Questions For "Name of The Lab: Data Structure of Lab"
No ratings yet
Model Viva Questions For "Name of The Lab: Data Structure of Lab"
14 pages
Searching: in This
No ratings yet
Searching: in This
16 pages
Rounding Numbers
No ratings yet
Rounding Numbers
4 pages
Computer Science & Engineering
No ratings yet
Computer Science & Engineering
22 pages
2017 Networks Practice Sac Pwe
No ratings yet
2017 Networks Practice Sac Pwe
9 pages
Class A (2. Int I 3. Void Display (4. System - Out.println (I) 5.) 6.) 7. Class B Extends A (8. Int J
No ratings yet
Class A (2. Int I 3. Void Display (4. System - Out.println (I) 5.) 6.) 7. Class B Extends A (8. Int J
3 pages
Algorithms - Data Structures
No ratings yet
Algorithms - Data Structures
3 pages
Jacobi Method: Description Algorithm Convergence Example
No ratings yet
Jacobi Method: Description Algorithm Convergence Example
6 pages
CM1020 Revised
No ratings yet
CM1020 Revised
7 pages
Cube Root Three Exists
No ratings yet
Cube Root Three Exists
1 page
Standard Problems: PCS104: Advanced Algorithms
No ratings yet
Standard Problems: PCS104: Advanced Algorithms
15 pages
Laplace Transforms Essentials
From Everand
Laplace Transforms Essentials
Morteza Shafii-Mousavi
3.5/5 (3)
Topics on Tournaments in Graph Theory
From Everand
Topics on Tournaments in Graph Theory
John W. Moon
No ratings yet
Lectures on Integral Equations
From Everand
Lectures on Integral Equations
Harold Widom
3.5/5 (1)

How to compute the complexity parameter α?: Study Notes of CART

Uploaded by

How to compute the complexity parameter α?: Study Notes of CART

Uploaded by

Study Notes of CART (2):

How to compute the complexity parameter α?

Proposition: The complexity parameter α (i.e. cp in R “rpart” package) is

Rα (t) = R(t) + α (1)

since there is only one terminal node at a single node.

Rα (Tt ) = R(Tt ) + α|T̃t | (2)

When α = 0, R0 (t) = R(t) > R(Tt ) = R0 (Tt ). This inequality is guaranteed

R(t) + α = R(Tt ) + α|T̃t |

Figure 1: Subtree T1 obtains 5 leaves

According to the formula, we have

R(t1 ) − R(Tt1 ) 100/200 − 0

You might also like