Springer Texts in Statistics is a series of advanced textbooks aimed at undergraduate and graduate courses in statistics, edited by Genevera I. Allen, Richard D. De Veaux, and Rebecca Nugent. The document highlights the second edition of 'An Introduction to Statistical Learning with Applications in R,' which expands on key statistical learning topics and includes hands-on labs using R software. This edition is designed for advanced undergraduates and master's students, providing a less technical approach to statistical learning applications.

Springer Texts in Statistics

Series Editors
G. Allen, Department of Statistics, Houston, TX, USA
R. De Veaux, Department of Mathematics and Statistics, Williams College,
Williamstown, MA, USA
R. Nugent, Department of Statistics, Carnegie Mellon University, Pittsburgh, PA,
USA
Springer Texts in Statistics (STS) includes advanced textbooks from 3rd- to 4th-year
undergraduate courses to 1st- to 2nd-year graduate courses. Exercise sets should be
included. The series editors are currently Genevera I. Allen, Richard D. De Veaux,
and Rebecca Nugent. Stephen Fienberg, George Casella, and Ingram Olkin were
editors of the series for many years.

More information about this series at http://www.springer.com/series/417

Gareth James • Daniela Witten • Trevor Hastie • Robert Tibshirani

An Introduction to Statistical
Learning
with Applications in R

Second Edition

Gareth James
Department of Data Science and Operations
University of Southern California
Los Angeles, CA, USA

Daniela Witten
Department of Statistics
University of Washington
Seattle, WA, USA

Trevor Hastie
Department of Statistics
Stanford University
Stanford, CA, USA

Robert Tibshirani
Department of Statistics
Stanford University
Stanford, CA, USA

ISSN 1431-875X ISSN 2197-4136 (electronic)

Springer Texts in Statistics
ISBN 978-1-0716-1417-4 ISBN 978-1-0716-1418-1 (eBook)
https://doi.org/10.1007/978-1-0716-1418-1
1st edition: © Springer Science+Business Media New York 2013 (Corrected at 8th printing 2017)
2nd edition: © Springer Science+Business Media, LLC, part of Springer Nature 2021
This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part
of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations,
recitation, broadcasting, reproduction on microfilms or in any other physical way, and transmission
or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar
methodology now known or hereafter developed.
The use of general descriptive names, registered names, trademarks, service marks, etc. in this
publication does not imply, even in the absence of a specific statement, that such names are exempt from
the relevant protective laws and regulations and therefore free for general use.
The publisher, the authors and the editors are safe to assume that the advice and information in this
book are believed to be true and accurate at the date of publication. Neither the publisher nor the
authors or the editors give a warranty, express or implied, with respect to the material contained herein or
for any errors or omissions that may have been made.

This Springer imprint is published by the registered company Springer Science+Business Media, LLC
part of Springer Nature.
The registered company address is: 1 New York Plaza, New York, NY 10004, U.S.A.
To our parents:

Alison and Michael James

Chiara Nappi and Edward Witten

Valerie and Patrick Hastie

Vera and Sami Tibshirani

and to our families:

Michael, Daniel, and Catherine

Tessa, Theo, Otto, and Ari

Samantha, Timothy, and Lynda

Charlie, Ryan, Julie, and Cheryl

Preface

Statistical learning refers to a set of tools for making sense of complex datasets. In recent years, we have seen a staggering increase in the scale and scope of data collection across virtually all areas of science and industry. As a result, statistical learning has become a critical toolkit for anyone who wishes to understand data, and as more and more of today’s jobs involve data, this means that statistical learning is fast becoming a critical toolkit for everyone.

One of the first books on statistical learning, The Elements of Statistical Learning (ESL, by Hastie, Tibshirani, and Friedman), was published in 2001, with a second edition in 2009. ESL has become a popular text not only in statistics but also in related fields. One of the reasons for ESL’s popularity is its relatively accessible style. But ESL is best-suited for individuals with advanced training in the mathematical sciences.

An Introduction to Statistical Learning (ISL) arose from the clear need for a broader and less technical treatment of the key topics in statistical learning. The intention behind ISL is to concentrate more on the applications of the methods and less on the mathematical details. Beginning with Chapter 2, each chapter in ISL contains a lab illustrating how to implement the statistical learning methods seen in that chapter using the popular statistical software package R. These labs provide the reader with valuable hands-on experience.

ISL is appropriate for advanced undergraduates or master’s students in Statistics or related quantitative fields, or for individuals in other disciplines who wish to use statistical learning tools to analyze their data. It can be used as a textbook for a course spanning two semesters.


The first edition of ISL covered a number of important topics, including sparse methods for classification and regression, decision trees, boosting, support vector machines, and clustering. Since it was published in 2013, it has become a mainstay of undergraduate and graduate classrooms across the United States and worldwide, as well as a key reference book for data scientists.

In this second edition of ISL, we have greatly expanded the set of topics covered. In particular, the second edition includes new chapters on deep learning (Chapter 10), survival analysis (Chapter 11), and multiple testing (Chapter 13). We have also substantially expanded some chapters that were part of the first edition: among other updates, we now include treatments of naive Bayes and generalized linear models in Chapter 4, Bayesian additive regression trees in Chapter 8, and matrix completion in Chapter 12. Furthermore, we have updated the R code throughout the labs to ensure that the results that they produce agree with recent R releases.

We are grateful to these readers for providing valuable comments on the first edition of this book: Pallavi Basu, Alexandra Chouldechova, Patrick Danaher, Will Fithian, Luella Fu, Sam Gross, Max Grazier G’Sell, Courtney Paulson, Xinghao Qiao, Elisa Sheng, Noah Simon, Kean Ming Tan, Xin Lu Tan. We thank these readers for helpful input on the second edition of this book: Alan Agresti, Iain Carmichael, Yiqun Chen, Erin Craig, Daisy Ding, Lucy Gao, Ismael Lemhadri, Bryan Martin, Anna Neufeld, Geoff Tims, Carsten Voelkmann, Steve Yadlowsky, and James Zou. We also thank Anna Neufeld for her assistance in reformatting the R code throughout this book. We are immensely grateful to Balasubramanian “Naras” Narasimhan for his assistance on both editions of this textbook.

It has been an honor and a privilege for us to see the considerable impact that the first edition of ISL has had on the way in which statistical learning is practiced, both in and out of the academic setting. We hope that this new edition will continue to give today’s and tomorrow’s applied statisticians and data scientists the tools they need for success in a data-driven world.

It’s tough to make predictions, especially about the future.

-Yogi Berra
Contents

Preface

1 Introduction

2 Statistical Learning
2.1 What Is Statistical Learning?
2.1.1 Why Estimate f?
2.1.2 How Do We Estimate f?
2.1.3 The Trade-Off Between Prediction Accuracy and Model Interpretability
2.1.4 Supervised Versus Unsupervised Learning
2.1.5 Regression Versus Classification Problems
2.2 Assessing Model Accuracy
2.2.1 Measuring the Quality of Fit
2.2.2 The Bias-Variance Trade-Off
2.2.3 The Classification Setting
2.3 Lab: Introduction to R
2.3.1 Basic Commands
2.3.2 Graphics
2.3.3 Indexing Data
2.3.4 Loading Data
2.3.5 Additional Graphical and Numerical Summaries
2.4 Exercises

3 Linear Regression
3.1 Simple Linear Regression
3.1.1 Estimating the Coefficients
3.1.2 Assessing the Accuracy of the Coefficient Estimates
3.1.3 Assessing the Accuracy of the Model
3.2 Multiple Linear Regression
3.2.1 Estimating the Regression Coefficients
3.2.2 Some Important Questions
3.3 Other Considerations in the Regression Model
3.3.1 Qualitative Predictors
3.3.2 Extensions of the Linear Model
3.3.3 Potential Problems
3.4 The Marketing Plan
3.5 Comparison of Linear Regression with K-Nearest Neighbors
3.6 Lab: Linear Regression
3.6.1 Libraries
3.6.2 Simple Linear Regression
3.6.3 Multiple Linear Regression
3.6.4 Interaction Terms
3.6.5 Non-linear Transformations of the Predictors
3.6.6 Qualitative Predictors
3.6.7 Writing Functions
3.7 Exercises

4 Classification
4.1 An Overview of Classification
4.2 Why Not Linear Regression?
4.3 Logistic Regression
4.3.1 The Logistic Model
4.3.2 Estimating the Regression Coefficients
4.3.3 Making Predictions
4.3.4 Multiple Logistic Regression
4.3.5 Multinomial Logistic Regression
4.4 Generative Models for Classification
4.4.1 Linear Discriminant Analysis for p = 1
4.4.2 Linear Discriminant Analysis for p > 1
4.4.3 Quadratic Discriminant Analysis
4.4.4 Naive Bayes
4.5 A Comparison of Classification Methods
4.5.1 An Analytical Comparison
4.5.2 An Empirical Comparison
4.6 Generalized Linear Models
4.6.1 Linear Regression on the Bikeshare Data
4.6.2 Poisson Regression on the Bikeshare Data
4.6.3 Generalized Linear Models in Greater Generality
4.7 Lab: Classification Methods
4.7.1 The Stock Market Data
4.7.2 Logistic Regression
4.7.3 Linear Discriminant Analysis
4.7.4 Quadratic Discriminant Analysis
4.7.5 Naive Bayes
4.7.6 K-Nearest Neighbors
4.7.7 Poisson Regression
4.8 Exercises

5 Resampling Methods
5.1 Cross-Validation
5.1.1 The Validation Set Approach
5.1.2 Leave-One-Out Cross-Validation
5.1.3 k-Fold Cross-Validation
5.1.4 Bias-Variance Trade-Off for k-Fold Cross-Validation
5.1.5 Cross-Validation on Classification Problems
5.2 The Bootstrap
5.3 Lab: Cross-Validation and the Bootstrap
5.3.1 The Validation Set Approach
5.3.2 Leave-One-Out Cross-Validation
5.3.3 k-Fold Cross-Validation
5.3.4 The Bootstrap
5.4 Exercises

6 Linear Model Selection and Regularization
6.1 Subset Selection
6.1.1 Best Subset Selection
6.1.2 Stepwise Selection
6.1.3 Choosing the Optimal Model
6.2 Shrinkage Methods
6.2.1 Ridge Regression
6.2.2 The Lasso
6.2.3 Selecting the Tuning Parameter
6.3 Dimension Reduction Methods
6.3.1 Principal Components Regression
6.3.2 Partial Least Squares
6.4 Considerations in High Dimensions
6.4.1 High-Dimensional Data
6.4.2 What Goes Wrong in High Dimensions?
6.4.3 Regression in High Dimensions
6.4.4 Interpreting Results in High Dimensions
6.5 Lab: Linear Models and Regularization Methods
6.5.1 Subset Selection Methods
6.5.2 Ridge Regression and the Lasso
6.5.3 PCR and PLS Regression
6.6 Exercises

7 Moving Beyond Linearity
7.1 Polynomial Regression
7.2 Step Functions
7.3 Basis Functions
7.4 Regression Splines
7.4.1 Piecewise Polynomials
7.4.2 Constraints and Splines
7.4.3 The Spline Basis Representation
7.4.4 Choosing the Number and Locations of the Knots
7.4.5 Comparison to Polynomial Regression
7.5 Smoothing Splines
7.5.1 An Overview of Smoothing Splines
7.5.2 Choosing the Smoothing Parameter λ
7.6 Local Regression
7.7 Generalized Additive Models
7.7.1 GAMs for Regression Problems
7.7.2 GAMs for Classification Problems
7.8 Lab: Non-linear Modeling
7.8.1 Polynomial Regression and Step Functions
7.8.2 Splines
7.8.3 GAMs
7.9 Exercises

8 Tree-Based Methods
8.1 The Basics of Decision Trees
8.1.1 Regression Trees
8.1.2 Classification Trees
8.1.3 Trees Versus Linear Models
8.1.4 Advantages and Disadvantages of Trees
8.2 Bagging, Random Forests, Boosting, and Bayesian Additive Regression Trees
8.2.1 Bagging
8.2.2 Random Forests
8.2.3 Boosting
8.2.4 Bayesian Additive Regression Trees
8.2.5 Summary of Tree Ensemble Methods
8.3 Lab: Decision Trees
8.3.1 Fitting Classification Trees
8.3.2 Fitting Regression Trees
8.3.3 Bagging and Random Forests
8.3.4 Boosting
8.3.5 Bayesian Additive Regression Trees
8.4 Exercises

9 Support Vector Machines
9.1 Maximal Margin Classifier
9.1.1 What Is a Hyperplane?
9.1.2 Classification Using a Separating Hyperplane
9.1.3 The Maximal Margin Classifier
9.1.4 Construction of the Maximal Margin Classifier
9.1.5 The Non-separable Case
9.2 Support Vector Classifiers
9.2.1 Overview of the Support Vector Classifier
9.2.2 Details of the Support Vector Classifier
9.3 Support Vector Machines
9.3.1 Classification with Non-Linear Decision Boundaries
9.3.2 The Support Vector Machine
9.3.3 An Application to the Heart Disease Data
9.4 SVMs with More than Two Classes
9.4.1 One-Versus-One Classification
9.4.2 One-Versus-All Classification
9.5 Relationship to Logistic Regression
9.6 Lab: Support Vector Machines
9.6.1 Support Vector Classifier
9.6.2 Support Vector Machine
9.6.3 ROC Curves
9.6.4 SVM with Multiple Classes
9.6.5 Application to Gene Expression Data
9.7 Exercises

10 Deep Learning
10.1 Single Layer Neural Networks
10.2 Multilayer Neural Networks
10.3 Convolutional Neural Networks
10.3.1 Convolution Layers
10.3.2 Pooling Layers
10.3.3 Architecture of a Convolutional Neural Network
10.3.4 Data Augmentation
10.3.5 Results Using a Pretrained Classifier
10.4 Document Classification
10.5 Recurrent Neural Networks
10.5.1 Sequential Models for Document Classification
10.5.2 Time Series Forecasting
10.5.3 Summary of RNNs
10.6 When to Use Deep Learning
10.7 Fitting a Neural Network
10.7.1 Backpropagation
10.7.2 Regularization and Stochastic Gradient Descent
10.7.3 Dropout Learning
10.7.4 Network Tuning
10.8 Interpolation and Double Descent
10.9 Lab: Deep Learning
10.9.1 A Single Layer Network on the Hitters Data
10.9.2 A Multilayer Network on the MNIST Digit Data
10.9.3 Convolutional Neural Networks
10.9.4 Using Pretrained CNN Models
10.9.5 IMDb Document Classification
10.9.6 Recurrent Neural Networks
10.10 Exercises

11 Survival Analysis and Censored Data
11.1 Survival and Censoring Times
11.2 A Closer Look at Censoring
11.3 The Kaplan-Meier Survival Curve
11.4 The Log-Rank Test
11.5 Regression Models With a Survival Response
11.5.1 The Hazard Function
11.5.2 Proportional Hazards
11.5.3 Example: Brain Cancer Data
11.5.4 Example: Publication Data
11.6 Shrinkage for the Cox Model
11.7 Additional Topics
11.7.1 Area Under the Curve for Survival Analysis
11.7.2 Choice of Time Scale
11.7.3 Time-Dependent Covariates
11.7.4 Checking the Proportional Hazards Assumption
11.7.5 Survival Trees
11.8 Lab: Survival Analysis
11.8.1 Brain Cancer Data
11.8.2 Publication Data
11.8.3 Call Center Data
11.9 Exercises

12 Unsupervised Learning
12.1 The Challenge of Unsupervised Learning
12.2 Principal Components Analysis
12.2.1 What Are Principal Components?
12.2.2 Another Interpretation of Principal Components
12.2.3 The Proportion of Variance Explained
12.2.4 More on PCA
12.2.5 Other Uses for Principal Components
12.3 Missing Values and Matrix Completion
12.4 Clustering Methods
12.4.1 K-Means Clustering
12.4.2 Hierarchical Clustering
12.4.3 Practical Issues in Clustering
12.5 Lab: Unsupervised Learning
12.5.1 Principal Components Analysis
12.5.2 Matrix Completion
12.5.3 Clustering
12.5.4 NCI60 Data Example
12.6 Exercises

13 Multiple Testing
13.1 A Quick Review of Hypothesis Testing
13.1.1 Testing a Hypothesis
13.1.2 Type I and Type II Errors
13.2 The Challenge of Multiple Testing
13.3 The Family-Wise Error Rate
13.3.1 What is the Family-Wise Error Rate?
13.3.2 Approaches to Control the Family-Wise Error Rate
13.3.3 Trade-Off Between the FWER and Power
13.4 The False Discovery Rate
13.4.1 Intuition for the False Discovery Rate
13.4.2 The Benjamini-Hochberg Procedure
13.5 A Re-Sampling Approach to p-Values and False Discovery Rates
13.5.1 A Re-Sampling Approach to the p-Value
13.5.2 A Re-Sampling Approach to the False Discovery Rate
13.5.3 When Are Re-Sampling Approaches Useful?
13.6 Lab: Multiple Testing
13.6.1 Review of Hypothesis Tests
13.6.2 The Family-Wise Error Rate
13.6.3 The False Discovery Rate
13.6.4 A Re-Sampling Approach
13.7 Exercises

Index

No ratings yet
Booklet Stats v8
309 pages
Preview of Density estimation for statistics and data analysis
No ratings yet
Preview of Density estimation for statistics and data analysis
19 pages
MATH1208AnnotatedBook Imp
No ratings yet
MATH1208AnnotatedBook Imp
145 pages
Linear Models and The Relevant Distributions and Matrix Algebra
No ratings yet
Linear Models and The Relevant Distributions and Matrix Algebra
539 pages
Introduction to Statistics and Data Analysis: With Exercises, Solutions and Applications in R, 2nd Edition Christian Heumann All Chapters Instant Download
100% (3)
Introduction to Statistics and Data Analysis: With Exercises, Solutions and Applications in R, 2nd Edition Christian Heumann All Chapters Instant Download
50 pages
An Introduction To Nonparametric Statistics-CRC Press (2020)
No ratings yet
An Introduction To Nonparametric Statistics-CRC Press (2020)
225 pages
121 Stochastic Processes An Introduction Peter W. Jones Peter Smith Edisi 3 2018
100% (1)
121 Stochastic Processes An Introduction Peter W. Jones Peter Smith Edisi 3 2018
271 pages
Mathematical Statistics basic ideas and selected topics Volume I Second Edition Bickel Peter J. all chapter instant download
100% (1)
Mathematical Statistics basic ideas and selected topics Volume I Second Edition Bickel Peter J. all chapter instant download
77 pages
Linear Mixed Models For Longitudinal Data
100% (1)
Linear Mixed Models For Longitudinal Data
579 pages
Intro Stat
No ratings yet
Intro Stat
324 pages
An Introduction to Statistics with Python With Applications in the Life Sciences 2nd 2nd Edition Thomas Haslwanter - Quickly download the ebook to never miss important content
100% (2)
An Introduction to Statistics with Python With Applications in the Life Sciences 2nd 2nd Edition Thomas Haslwanter - Quickly download the ebook to never miss important content
67 pages
SASprimer
No ratings yet
SASprimer
125 pages
124 Stochastic Processes From Applications To Theory Pierre Del Moral Spiridon Penev Edisi 1 2016
100% (1)
124 Stochastic Processes From Applications To Theory Pierre Del Moral Spiridon Penev Edisi 1 2016
916 pages
Statistics Super Review, 2nd Ed.
From Everand
Statistics Super Review, 2nd Ed.
The Editors of REA
5/5 (3)
Statistics and Data Analysis Essentials
From Everand
Statistics and Data Analysis Essentials
Jayant Ramaswamy
No ratings yet
Analyzing Quantitative Data: An Introduction for Social Researchers
From Everand
Analyzing Quantitative Data: An Introduction for Social Researchers
Debra Wetcher-Hendricks
No ratings yet
Core Concepts in Statistical Learning
From Everand
Core Concepts in Statistical Learning
Tushar Gulati
No ratings yet
Statement by His Excellency Joseph Nyuma Boakai
No ratings yet
Statement by His Excellency Joseph Nyuma Boakai
4 pages
sgbv-prosecutionhandbook-v1 (1)
No ratings yet
sgbv-prosecutionhandbook-v1 (1)
285 pages
Criminal_procedure_law
No ratings yet
Criminal_procedure_law
115 pages
Liberia Legislation Handbook
No ratings yet
Liberia Legislation Handbook
154 pages
The Executive Law of 1972 Chapter 25
No ratings yet
The Executive Law of 1972 Chapter 25
402 pages
LBR 199332
No ratings yet
LBR 199332
172 pages
B.Tech - Minor Program - Course structure-JNTUH
No ratings yet
B.Tech - Minor Program - Course structure-JNTUH
14 pages
[Ebooks PDF] download grokking Machine Learning MEAP v07 Luis G Serrano full chapters
100% (2)
[Ebooks PDF] download grokking Machine Learning MEAP v07 Luis G Serrano full chapters
39 pages
Detection of Online Employment Scam Through Fake Jobs Using Random Forest Classifier
No ratings yet
Detection of Online Employment Scam Through Fake Jobs Using Random Forest Classifier
8 pages
Classification and Regression Trees First Issued In Hardback Edition Breiman pdf download
100% (2)
Classification and Regression Trees First Issued In Hardback Edition Breiman pdf download
65 pages
Projects
No ratings yet
Projects
35 pages
1 s2.0 S016740482300456X Main
No ratings yet
1 s2.0 S016740482300456X Main
13 pages
1822-b.e-cse-batchno-150
No ratings yet
1822-b.e-cse-batchno-150
64 pages
Imm 5781
No ratings yet
Imm 5781
67 pages
21CS54 Aiml Module3 PPT
No ratings yet
21CS54 Aiml Module3 PPT
102 pages
Aplikasi Citra Drone Untuk Klasifikasi Vegetasi Di Cagar Alam Curah Manis Sempolan 1 Menggunakan Metode Manual, Object Base Image
No ratings yet
Aplikasi Citra Drone Untuk Klasifikasi Vegetasi Di Cagar Alam Curah Manis Sempolan 1 Menggunakan Metode Manual, Object Base Image
13 pages
Augilera Et Al. (2010) - Hybrid Bayesian Network Classifiers - Application To Species Distribution Models
No ratings yet
Augilera Et Al. (2010) - Hybrid Bayesian Network Classifiers - Application To Species Distribution Models
10 pages
Support Vector Machine
0% (1)
Support Vector Machine
7 pages
Mod 4 - CLustering
No ratings yet
Mod 4 - CLustering
55 pages
Question Bank FDS
No ratings yet
Question Bank FDS
4 pages
Course Outline CSC 588 Data Warehousing and Data Mining1
No ratings yet
Course Outline CSC 588 Data Warehousing and Data Mining1
5 pages
Object-Oriented and Multi-Scale Image Analysis in Semantic Networks
0% (1)
Object-Oriented and Multi-Scale Image Analysis in Semantic Networks
7 pages
CS614 FinalTerm Solved Papers
No ratings yet
CS614 FinalTerm Solved Papers
24 pages
Amazon-Fine-Food-Review - K-Means, Agglomerative & DBSCAN Clustering
No ratings yet
Amazon-Fine-Food-Review - K-Means, Agglomerative & DBSCAN Clustering
79 pages
Markov Chains Application To The Financial-Economic Time Series Prediction
No ratings yet
Markov Chains Application To The Financial-Economic Time Series Prediction
26 pages
Performance Evaluation of Various Data Mining Algorithms On Road Traf Fic Accident Dataset
No ratings yet
Performance Evaluation of Various Data Mining Algorithms On Road Traf Fic Accident Dataset
12 pages
Implementation of real time activity sensing
No ratings yet
Implementation of real time activity sensing
9 pages
Denoisng of Images
No ratings yet
Denoisng of Images
59 pages
Emotion Recognition From Formal Text (Poetry)
No ratings yet
Emotion Recognition From Formal Text (Poetry)
3 pages
Full ml-2
No ratings yet
Full ml-2
1 page
Solutions for Problems from Neural Networks and Learning Machines, 3rd Edition by Simon Haykin
No ratings yet
Solutions for Problems from Neural Networks and Learning Machines, 3rd Edition by Simon Haykin
5 pages
Capstone Project - Credit Card Fraud Prediction - Alexandre Daltro
No ratings yet
Capstone Project - Credit Card Fraud Prediction - Alexandre Daltro
15 pages
MLF Lec01
No ratings yet
MLF Lec01
23 pages
Deep Learning Autoencoders
No ratings yet
Deep Learning Autoencoders
31 pages
