Machine Learning 10-401, Spring 2018: Introduction, Admin, Course Overview
Machine Learning 10-401, Spring 2018: Introduction, Admin, Course Overview
Document Categorization
2
Machine Learning is Changing the World
“Machine learning is the hot new thing”
(John Hennessy, President, Stanford)
• Applications.
What is Machine Learning?
Examples of important machine learning paradigms.
Supervised Classification
Goal: use emails seen so far to produce good prediction rule for
future data.
Supervised Classification. Example: Spam Detection
Represent each message by features. (e.g., keywords, spelling, etc.)
example label
Reasonable RULES:
+ + -
+ -
+
Predict SPAM if unknown AND (money OR pills)
-
- -
Predict SPAM if 2money + 3pills –5 known > 0 -
Linearly separable
Supervised Classification. Example: Image classification
• Medicine:
– diagnose a disease
• input: from symptoms, lab measurements, test results, DNA tests, …
• output: one of set of possible diseases, or “none of the above”
• examples: audiology, thyroid cancer, diabetes, …
– or: response to chemo drug X
– or: will patient be re-admitted soon?
• Computational Economics:
– predict if a stock will rise or fall
– predict if a user will click on an ad or not
• in order to decide which ad to show
Regression. Predicting a numeric value
Stock market
Weather prediction
Temperature
72° F
Dimensionality Reduction
• Homeworks 1 to 4
– Theory/math handouts
– Programming exercises; applying/evaluating existing learners
– Late assignments:
• Up to 50% credit if it’s less than 48 hrs late
• You can drop your lowest assignment grade
• Projects: conduct a small experiment or read a couple of papers and present the main
ideas or work on a small theoretical question.
• Project presentations: April 23 and April 25
Collaboration policy (see syllabus)
• Discussion of anything is ok…
• …but the goal should be to understand better, not
save work.
• So:
– no notes of the discussion are allowed…the only
thing you can take away is whatever’s in your
brain.
– you should acknowledge who you got help
from/did help in your homework
Brief Overview
• Meeting Time: Mon, Wed, NSH 3002, 10:30 – 11:50
• Course Staff
• Instructors:
–- Mohamad Mahdi Kassir
Maria Florina (Nina)( [email protected]).
Balcan ([email protected])
• TAs:
– Kenneth Marino ([email protected])
– Colin White ([email protected])
– Nupur Chatterji ([email protected])
Maria-Florina Balcan: Nina
• Foundations for Modern Machine Learning
• E.g., interactive, semi-supervised, distributed, life-long learning
𝑥5 𝑓 = 𝑁𝑜 𝑓 = 𝑌𝑒𝑠 𝑥5
1 0 1 0
𝑓 = 𝑌𝑒𝑠 𝑓 = 𝑁𝑜 𝑓 = 𝑌𝑒𝑠 𝑓 = 𝑁𝑜
ID3: Natural greedy approach to growing a decision tree top-down (from the
root to the leaves by repeatedly replacing an existing leaf with an internal node.).
No Yes No Yes