Lectures1 2
Lectures1 2
Learning
Richa Singh
Slides are prepared from several information sources on the web and books
About the instructors
• Richa Singh:
– Professor and HoD - CSE, IIT Jodhpur
– Faculty, IIIT Delhi (2009 – 2019)
– home.iitj.ac.in/~richa
• Reference Books:
– Tom Mitchell, Machine Learning
– C. Bishop, Pattern Recognition and Machine Learning, Springer
– K. Murphy, Machine Learning: a Probabilistic Perspective, MIT
Press
– Shalev-Shwartz,S., Ben-David,S., (2014), Understanding Machine
Learning: From Theory to Algorithms, Cambridge University
Press
Evaluation Components
• Grading
– Assignments (labs, programming and written): 30%
– Exams: 30%
– Project: 20%
– Quiz : 10%
– Kaggle Challenges: 10%
• Project team size: 2 students
– Predefined project topics: you have to select one
• Assignments: individually
Collaboration Policy
• Discussion with friends and colleagues is
good… but
– the objective should be to improve understanding
and learning
– Not getting answers
• If you have discussed with anyone, you should
acknowledge who helped you – from the class
or outside the class
Plagiarism Policy
• Cheating in assignments/quizzes/projects/
– First offence: Zero in the evaluation component
– Second offense: Grade reduction/F grade
• Lectures: 3 days
• Labs: 2 groups
– Time will be decided this week
Any questions regarding
administrative guidelines?
What is Pattern Recognition and
Machine Learning?
Machine Learning
• What do we understand by learning?
– Learning is any process by which a system improves
performance from experience.”
– Herbert Simon (1950)
• Machine Learning is the study of algorithms that
– improve their performance P
– at some task T
– with experience E.
A well-defined learning task is given by <P, T, E>.
Weather forecast
What are the facial expressions?
Database
(Input, Feature Learnt Model
Preprocessing Learning
Class labels) selection
Training
Machine Learning Pipeline
Validation
Evaluate the
Select the
Learning performance
Learning on
best model validation dataset
Database
(Input, Feature Learnt Model
Preprocessing Learning
Class labels) selection
Training
Machine Learning Pipeline
Testing
Database
(Input, Feature Learnt Model
Preprocessing Learning
Class labels) selection
Training
Machine Learning Pipeline
Testing
Database
(Input, Feature Learnt Model
Preprocessing Learning
Class labels) selection
Training
Typical Classification Algorithm
Typical Classification Algorithm
Performance Measure:
Accuracy (or fraction of correct answers) on test data