
CSE291D Lecture 7

Mixture models (revisited)

1
Announcements
• Project proposals are due today!!

• Please submit your proposal to me by email


– It’s due today, but I’ll give you a small extension: until noon tomorrow

• I am happy to discuss your project plans

2
Log-sum-exp trick
• When computing probabilities, you will probably quickly hit
numerical underflow issues (e.g. in your homework!)

• Solution: work with unnormalized probabilities in log space.

• Normalize only when you need to, in a numerically stable way.
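The identity behind the trick, stated here for completeness (a standard identity, not copied from the slide): for log-space values $a_1, \ldots, a_K$ and $m = \max_k a_k$,

$$\log \sum_{k=1}^{K} e^{a_k} = m + \log \sum_{k=1}^{K} e^{a_k - m}$$

Subtracting the max makes the largest exponent exactly 0, so the sum can neither overflow nor underflow to zero.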

3
Log-sum-exp

• Log-sum-exp in 2 dimensions:

• Monotonic, convex
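The plotted function is presumably the two-dimensional case (an assumption based on the slide title):

$$\mathrm{lse}(x_1, x_2) = \log\left(e^{x_1} + e^{x_2}\right)$$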

Figure from https://inst.eecs.berkeley.edu/~ee127a/book/login/def_lse_fcn.html

4


Log-sum-exp trick

MATLAB code:
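The code image did not survive extraction. A minimal sketch of the usual implementation (the function name logsumexp and the vector-argument signature are assumptions, not the slide's):

function s = logsumexp(x)
% LOGSUMEXP  Numerically stable log(sum(exp(x))) for a vector x.
% Subtract the max so the largest exponentiated term is exp(0) = 1,
% which avoids both overflow and underflow to zero.
m = max(x);
s = m + log(sum(exp(x - m)));
end

For example, logsumexp([-1000, -1001]) returns about -999.69, whereas the naive log(sum(exp(x))) would underflow to -Inf.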

5
Latent variable models

[Graphical model: parameters Φ and latent variables Z generate observed data X; plate over data points]

Dimensionality(X) >> dimensionality(Z)


Z is a bottleneck, which finds a compressed, low-dimensional representation of X.

6
Mixture models

[Graphical model: parameters Φ and discrete latent variables Z (cluster assignments) generate observed data X; plate over data points]

7
Learning outcomes
By the end of the lesson, you should be able to:

• Train mixture models in a variety of ways:
  – EM
  – Gibbs sampling
  – Collapsed Gibbs sampling

• Apply mixture models to data analysis tasks

8
Mixture models

Convex combination of distributions

Marginalizing over a latent variable z_i
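Both phrases describe the same standard mixture density, reconstructed here with mixture weights $\pi_k$ and component parameters $\theta_k$:

$$p(x_i \mid \pi, \theta) = \sum_{k=1}^{K} \pi_k \, p(x_i \mid \theta_k) = \sum_{k=1}^{K} p(z_i = k \mid \pi)\, p(x_i \mid z_i = k, \theta)$$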

11
Mixture of Gaussians
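The density on this slide is presumably the standard Gaussian mixture:

$$p(x_i \mid \pi, \mu, \Sigma) = \sum_{k=1}^{K} \pi_k \, \mathcal{N}(x_i \mid \mu_k, \Sigma_k)$$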

12
Uses of mixture models: Clustering

[Figure: clustered data points with cluster centers]

13
Uses of mixture models:
Density estimation

14
Mixture models
for classification (e.g. naïve Bayes)

Cluster assignments Y correspond to observed class labels.

[Graphical model: parameters Φ and observed labels Y generate data X; plate over data points]

15
Semi-supervised classification
[Two graphical models side by side: observed labels Y → X, and latent assignments Z → X, each with parameters Φ and a plate over data points]

Naïve Bayes with missing labels? Mixture models with some observed labels?

16
Mixtures of experts

[Scatter plot: regression output Y against input feature X]

17
Mixtures of experts

[Scatter plot: regression output Y against input feature X, with input-dependent cluster assignments given by the gating function]

18
Mixtures of experts

Gating function:

Predicted y:
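The formulas did not survive extraction; standard forms for a mixture of linear experts (the softmax gating and linear experts are assumptions about the specific parameterization):

Gating function: $g_k(x) = p(z = k \mid x) = \dfrac{\exp(v_k^\top x)}{\sum_{j=1}^{K} \exp(v_j^\top x)}$

Predicted y: $\mathbb{E}[y \mid x] = \sum_{k=1}^{K} g_k(x)\, w_k^\top x$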

19
Topic models

Mimno, D. (2012). Computational historiography: Data mining in a century of classics journals. ACM Journal on Computing and Cultural Heritage, Vol. 5, No. 1, Article 3.

20
Computing the MLE
• Suppose the complete data log-likelihood is in the
exponential family

• Is this convex in θ?

• What about the log-likelihood when z is unobserved (the observed data LL)?

21
Actually, D is strictly speaking the right answer: the complete data log-likelihood is concave.

Of course, the negative complete data log-likelihood is convex, so the difference doesn’t matter in practice.

[Answer choices: convex / non-convex]

22
Computing the MLE
• Suppose the complete data log-likelihood is in the exponential family
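A reconstruction of the missing formula (assuming natural parameters $\theta_k$, sufficient statistics $\phi(x)$, and log-partition function $A$, with the base-measure term omitted):

$$\log p(x, z \mid \pi, \theta) = \sum_{i=1}^{N} \sum_{k=1}^{K} z_{ik} \left[ \log \pi_k + \theta_k^\top \phi(x_i) - A(\theta_k) \right]$$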

Here $A(\theta_k)$ is a log-sum-exp (convex) and $\theta_k^\top \phi(x_i)$ is linear, so each bracketed term, and hence the whole complete data log-likelihood, is concave in the parameters.

23
Computing the MLE
• Observed data LL:
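A reconstruction under the same assumptions:

$$\log p(x \mid \pi, \theta) = \sum_{i=1}^{N} \log \sum_{k=1}^{K} \exp\left( \log \pi_k + \theta_k^\top \phi(x_i) - A(\theta_k) \right)$$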

Now there are two log-sum-exps: the outer sum over clusters (convex in its argument) and the convex $A(\theta_k)$ entering with a minus sign. The result is a difference of convex functions (D.C.), which is non-convex and has local optima.

24
Mixtures of exponential families
• 1-of-k notation:
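The notation itself (missing from the extraction) is the standard one-hot encoding:

$$z_i \in \{0,1\}^K, \qquad \sum_{k=1}^{K} z_{ik} = 1,$$

so $z_{ik} = 1$ exactly when point $i$ is assigned to cluster $k$.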

25
Mixtures of exponential families
• The complete data log-likelihood is in the exponential
family:

• The non-convexity result applies

• Sufficient statistics are counts of cluster assignments, and counts of component sufficient statistics per cluster

26
EM for exponential family mixtures
• E-step: Compute lower bound, expected complete data log-likelihood:
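A reconstruction of the bound, writing $r_{ik} = \mathbb{E}[z_{ik}]$ for the responsibilities:

$$Q(\pi, \theta) = \sum_{i=1}^{N} \sum_{k=1}^{K} r_{ik} \left[ \log \pi_k + \theta_k^\top \phi(x_i) - A(\theta_k) \right]$$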

The bound is linear in the responsibilities $r_{ik}$ and concave in $(\pi, \theta)$ for the same reason as the complete data log-likelihood, so its negative is convex and the M-step is a tractable convex problem.

27
EM for exponential family mixtures

• E-step responsibilities:
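The standard update, reconstructed:

$$r_{ik} = p(z_i = k \mid x_i, \pi, \theta) = \frac{\pi_k \, p(x_i \mid \theta_k)}{\sum_{j=1}^{K} \pi_j \, p(x_i \mid \theta_j)}$$

The denominator is exactly where the log-sum-exp trick from earlier applies when working in log space.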

28
M-step

• Lagrange multiplier, take derivative, set to 0

• Compute the MLE for each component, with expected sufficient statistics plugged in for the sufficient statistics

• It’s as if fractional data points were assigned to each cluster, weighted by their responsibilities; the resulting updates are sketched below
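A sketch of the standard updates (a reconstruction, not copied from the slide):

$$\pi_k = \frac{1}{N} \sum_{i=1}^{N} r_{ik}, \qquad \bar{\phi}_k = \frac{\sum_{i} r_{ik} \, \phi(x_i)}{\sum_{i} r_{ik}},$$

where each component's MLE is obtained by matching its mean parameter $\nabla A(\theta_k)$ to the weighted average $\bar{\phi}_k$.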
29
Gibbs sampling
• We can use MCMC to infer the full posterior over latent variables and parameters, instead of just a point estimate

• Gibbs updates for each z_i, π, and θ_k in turn

• This is in your homework!

30
Collapsed Gibbs sampling
• Marginalize out the parameters
• Perform Gibbs sampling on just the z’s
• Recover parameter estimates based on z at the end
of the algorithm

• Rao-Blackwell Theorem:
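The theorem statement did not survive extraction; the relevant fact, stated informally: conditioning an estimator $f(X)$ on another variable can only reduce variance,

$$\mathrm{Var}\big[\mathbb{E}[f(X) \mid Y]\big] \le \mathrm{Var}[f(X)],$$

which is the usual justification for why collapsed (Rao-Blackwellized) samplers yield lower-variance estimates.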

31
Collapsed Gibbs sampling

[Graphical models: before collapsing vs. after collapsing]

32
Marginalize out mixture parameters
• Assume a Dirichlet prior on mixture weights

• Polya urn model! (Same as posterior predictive for Dirichlet-multinomial)
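Reconstructed, with Dirichlet parameters $\alpha_k$ and cluster counts $N_k$ over the first $i - 1$ draws:

$$p(z_i = k \mid z_{1:i-1}, \alpha) = \frac{N_k + \alpha_k}{i - 1 + \sum_{j} \alpha_j}$$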

33
Collapsed conditional probabilities

• Probability of drawing the last ball from the urn, given all the others
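Reconstructed, with $N_k^{-i}$ counting the points other than $i$ assigned to cluster $k$:

$$p(z_i = k \mid z_{-i}, \alpha) = \frac{N_k^{-i} + \alpha_k}{N - 1 + \sum_{j} \alpha_j}$$

By exchangeability, any z_i can be treated as "the last ball".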

34
Collapsed Gibbs sampler
for mixture model
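The algorithm box did not survive extraction; a sketch of the standard collapsed update, combining the urn term with the component posterior predictive:

$$p(z_i = k \mid z_{-i}, x) \propto \left( N_k^{-i} + \alpha_k \right) \, p\big(x_i \mid \{x_j : z_j = k,\ j \ne i\}\big)$$

Each sweep removes point $i$ from its cluster's statistics, samples $z_i$ from this conditional, and adds the point back before moving to the next point.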

35
Performance of collapsed sampler

37
Mixing advantages of
collapsed sampler
• Stochasticity: updating z updates the counts
immediately, so the information is propagated
sooner

• Removes the dependency between the old parameters and z
– When you update a z variable, theta is “out of date”
and “wants” to keep the z’s in their old location. So
there is a battle between old parameters and the
data, which slows down mixing

38
Think-pair-share: Triage

• You are a data scientist working for a hospital. They need help designing an automatic triage system which clusters patients according to several levels of the urgency of care needed.

• Design a mixture modeling approach for automatically grouping clusters into triage categories, so that patients can get the right level of attention from the hospital staff.

39
