Brief Contents

Preface xv
Prologue: A machine learning sampler 1
1 The ingredients of machine learning 13
2 Binary classification and related tasks 49
3 Beyond binary classification 81
4 Concept learning 104
5 Tree models 129
6 Rule models 157
7 Linear models 194
8 Distance-based models 231
9 Probabilistic models 262
10 Features 298
11 Model ensembles 330
12 Machine learning experiments 343
Epilogue: Where to go from here 360
Important points to remember 363
References 367
Index 383

Contents

Preface xv

Prologue: A machine learning sampler 1

1 The ingredients of machine learning 13


1.1 Tasks: the problems that can be solved with machine learning . . . . . . . 14
Looking for structure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 16
Evaluating performance on a task . . . . . . . . . . . . . . . . . . . . . . . . 18
1.2 Models: the output of machine learning . . . . . . . . . . . . . . . . . . . . 20
Geometric models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
Probabilistic models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
Logical models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
Grouping and grading . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
1.3 Features: the workhorses of machine learning . . . . . . . . . . . . . . . . 38
Two uses of features . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
Feature construction and transformation . . . . . . . . . . . . . . . . . . . 41
Interaction between features . . . . . . . . . . . . . . . . . . . . . . . . . . 44
1.4 Summary and outlook . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
What you’ll find in the rest of the book . . . . . . . . . . . . . . . . . . . . . 48

2 Binary classification and related tasks 49


2.1 Classification . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
Assessing classification performance . . . . . . . . . . . . . . . . . . . . . . 53
Visualising classification performance . . . . . . . . . . . . . . . . . . . . . 58
2.2 Scoring and ranking . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 61
Assessing and visualising ranking performance . . . . . . . . . . . . . . . . 63
Turning rankers into classifiers . . . . . . . . . . . . . . . . . . . . . . . . . 69
2.3 Class probability estimation . . . . . . . . . . . . . . . . . . . . . . . . . . . 72
Assessing class probability estimates . . . . . . . . . . . . . . . . . . . . . . 73
Turning rankers into class probability estimators . . . . . . . . . . . . . . . 76
2.4 Binary classification and related tasks: Summary and further reading . . 79

3 Beyond binary classification 81


3.1 Handling more than two classes . . . . . . . . . . . . . . . . . . . . . . . . . 81
Multi-class classification . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 82
Multi-class scores and probabilities . . . . . . . . . . . . . . . . . . . . . . 86
3.2 Regression . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 91
3.3 Unsupervised and descriptive learning . . . . . . . . . . . . . . . . . . . . 95
Predictive and descriptive clustering . . . . . . . . . . . . . . . . . . . . . . 96
Other descriptive models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 100
3.4 Beyond binary classification: Summary and further reading . . . . . . . . 102

4 Concept learning 104


4.1 The hypothesis space . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 106
Least general generalisation . . . . . . . . . . . . . . . . . . . . . . . . . . . 108
Internal disjunction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110
4.2 Paths through the hypothesis space . . . . . . . . . . . . . . . . . . . . . . 112
Most general consistent hypotheses . . . . . . . . . . . . . . . . . . . . . . 116
Closed concepts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116
4.3 Beyond conjunctive concepts . . . . . . . . . . . . . . . . . . . . . . . . . . 119
Using first-order logic . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 122
4.4 Learnability . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 124
4.5 Concept learning: Summary and further reading . . . . . . . . . . . . . . . 127

5 Tree models 129


5.1 Decision trees . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 133
5.2 Ranking and probability estimation trees . . . . . . . . . . . . . . . . . . . 138
Sensitivity to skewed class distributions . . . . . . . . . . . . . . . . . . . . 143
5.3 Tree learning as variance reduction . . . . . . . . . . . . . . . . . . . . . . . 148
Regression trees . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 148
Clustering trees . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 152
5.4 Tree models: Summary and further reading . . . . . . . . . . . . . . . . . . 155

6 Rule models 157


6.1 Learning ordered rule lists . . . . . . . . . . . . . . . . . . . . . . . . . . . . 158
Rule lists for ranking and probability estimation . . . . . . . . . . . . . . . 164
6.2 Learning unordered rule sets . . . . . . . . . . . . . . . . . . . . . . . . . . 167
Rule sets for ranking and probability estimation . . . . . . . . . . . . . . . 173
A closer look at rule overlap . . . . . . . . . . . . . . . . . . . . . . . . . . . 174
6.3 Descriptive rule learning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 176
Rule learning for subgroup discovery . . . . . . . . . . . . . . . . . . . . . . 178
Association rule mining . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 182
6.4 First-order rule learning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 189
6.5 Rule models: Summary and further reading . . . . . . . . . . . . . . . . . . 192

7 Linear models 194


7.1 The least-squares method . . . . . . . . . . . . . . . . . . . . . . . . . . . . 196
Multivariate linear regression . . . . . . . . . . . . . . . . . . . . . . . . . . 201
Regularised regression . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 204
Using least-squares regression for classification . . . . . . . . . . . . . . . 205
7.2 The perceptron . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 207
7.3 Support vector machines . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 211
Soft margin SVM . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 216
7.4 Obtaining probabilities from linear classifiers . . . . . . . . . . . . . . . . 219
7.5 Going beyond linearity with kernel methods . . . . . . . . . . . . . . . . . 224
7.6 Linear models: Summary and further reading . . . . . . . . . . . . . . . . 228

8 Distance-based models 231


8.1 So many roads. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 231
8.2 Neighbours and exemplars . . . . . . . . . . . . . . . . . . . . . . . . . . . . 237
8.3 Nearest-neighbour classification . . . . . . . . . . . . . . . . . . . . . . . . 242
8.4 Distance-based clustering . . . . . . . . . . . . . . . . . . . . . . . . . . . . 245
K-means algorithm . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 247
Clustering around medoids . . . . . . . . . . . . . . . . . . . . . . . . . . . 250
Silhouettes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 252
8.5 Hierarchical clustering . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 253
8.6 From kernels to distances . . . . . . . . . . . . . . . . . . . . . . . . . . . . 258
8.7 Distance-based models: Summary and further reading . . . . . . . . . . . 260

9 Probabilistic models 262


9.1 The normal distribution and its geometric interpretations . . . . . . . . . 266
9.2 Probabilistic models for categorical data . . . . . . . . . . . . . . . . . . . . 273
Using a naive Bayes model for classification . . . . . . . . . . . . . . . . . . 275
Training a naive Bayes model . . . . . . . . . . . . . . . . . . . . . . . . . . 279
9.3 Discriminative learning by optimising conditional likelihood . . . . . . . 282
9.4 Probabilistic models with hidden variables . . . . . . . . . . . . . . . . . . 286
Expectation-Maximisation . . . . . . . . . . . . . . . . . . . . . . . . . . . . 288
Gaussian mixture models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 289
9.5 Compression-based models . . . . . . . . . . . . . . . . . . . . . . . . . . . 292
9.6 Probabilistic models: Summary and further reading . . . . . . . . . . . . . 295

10 Features 298
10.1 Kinds of feature . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 299
Calculations on features . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 299
Categorical, ordinal and quantitative features . . . . . . . . . . . . . . . . 304
Structured features . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 305
10.2 Feature transformations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 307
Thresholding and discretisation . . . . . . . . . . . . . . . . . . . . . . . . . 308
Normalisation and calibration . . . . . . . . . . . . . . . . . . . . . . . . . . 314
Incomplete features . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 321
10.3 Feature construction and selection . . . . . . . . . . . . . . . . . . . . . . . 322
Matrix transformations and decompositions . . . . . . . . . . . . . . . . . 324
10.4 Features: Summary and further reading . . . . . . . . . . . . . . . . . . . . 327

11 Model ensembles 330


11.1 Bagging and random forests . . . . . . . . . . . . . . . . . . . . . . . . . . . 331
11.2 Boosting . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 334
Boosted rule learning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 337
11.3 Mapping the ensemble landscape . . . . . . . . . . . . . . . . . . . . . . . 338
Bias, variance and margins . . . . . . . . . . . . . . . . . . . . . . . . . . . . 338
Other ensemble methods . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 339
Meta-learning . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 340
11.4 Model ensembles: Summary and further reading . . . . . . . . . . . . . . 341

12 Machine learning experiments 343


12.1 What to measure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 344
12.2 How to measure it . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 348
12.3 How to interpret it . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 351
Interpretation of results over multiple data sets . . . . . . . . . . . . . . . . 354
12.4 Machine learning experiments: Summary and further reading . . . . . . . 357

Epilogue: Where to go from here 360

Important points to remember 363

References 367

Index 383
