0% found this document useful (0 votes)
103 views

Machine Learning PDF

The document outlines a machine learning course consisting of 6 units covering topics such as linear regression, classification algorithms, clustering methods, and information retrieval. It includes 15 programming exercises to demonstrate machine learning algorithms like linear regression, logistic regression, k-nearest neighbors, decision trees, naive Bayes, k-means clustering, and support vector machines using Python. Students will also complete a mini-project in groups of 4 applying machine learning techniques to different datasets.

Uploaded by

Khushi Jain
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
103 views

Machine Learning PDF

The document outlines a machine learning course consisting of 6 units covering topics such as linear regression, classification algorithms, clustering methods, and information retrieval. It includes 15 programming exercises to demonstrate machine learning algorithms like linear regression, logistic regression, k-nearest neighbors, decision trees, naive Bayes, k-means clustering, and support vector machines using Python. Students will also complete a mini-project in groups of 4 applying machine learning techniques to different datasets.

Uploaded by

Khushi Jain
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

Machine Learning LTPC–3024

Unit 1: Introduction to Machine Learning .


The Origins of Machine Learning, Uses and Abuses of Machine Learning, How do Machines Learn? - Abstraction and Knowledge
Representation, Generalization, Assessing the Success of Learning, Steps to Apply Machine Learning to Data, Choosing a Machine
Learning Algorithm - Thinking about the Input Data, Thinking about Types of Machine Learning Algorithms, Matching Data to an
Appropriate Algorithm

Unit 2: Simple Linear Regression .


Introduction to Simple Linear Regression, Simple Linear Regression Model Building, Estimation of Parameters Using Ordinary Least
Squares, Interpretation of Simple Linear Regression Coefficients, Validation of Simple Linear Regression Model, Coefficient of
Determination (R-squared) and Adjusted R-Squared, Spurious Regression, Hypothesis Test for Regression Coefficients (t-Test),
Test for Overall Model: Analysis of Variance (F-Test), Residual Analysis

Unit 3: Multiple Regression and Model Building .


Introduction, Ordinary Least Squares Estimation for Multiple Linear Regression, Multiple Linear Regression Model Building, Partial
Correlation and Regression Model Building, Interpretation of Multiple Linear Regression Coefficients - Partial Regression Coefficients,
Standardized Regression Coefficient, Regression Models with Categorical (i.e., Qualitative) Variables - Interpretation of Regression
Coefficients of Categorical Variables, Interaction Variables in Regression Models, Validation of Multiple Regression Model,
Coefficient of Multiple Determination (R-Squared), Adjusted R-Squared, Statistical Significance of Individual Variables in Multiple
Linear Regression: t-Test, Validation of Overall Regression Model: F-Test, Validation of Portions of a Multiple Linear Regression
Model - Partial F-Test, Residual Analysis in Multiple Linear Regression

Unit 4: Introduction to Classification & Classification Algorithms .


What is Classification? General Approach to Classification, k-Nearest Neighbor Algorithm, Logistic Regression, Decision Trees, Naive
Bayesian Classifier, Ensemble Methods: Bagging, Boosting and AdaBoost and XBoost, Random Forests, Advanced Classification
Methods: Backpropagation in Multilayer Feed-Forward Neural Networks, Support Vector Machines, Rough Set and Fuzzy Set
Approaches, Classification Model Evaluation and Selection: Sensitivity, Specificity, Positive Predictive Value, Negative Predictive
Value, Lift Curves and Gain Curves, ROC Curves, Misclassification Cost Adjustment to Reflect Real-World Concerns, Decision
Cost/Benefit Analysis

Unit 5: Introduction to Cluster Analysis & Clustering Methods .


The Clustering Task and the Requirements for Cluster Analysis , Overview of Some Basic Clustering Methods, Hierarchical Methods:
Agglomerate versus Divisive Hierarchical Clustering, Distance Measures, Probabilistic Hierarchical Clustering, Multiphase Hierarchical
Clustering Using Clustering Feature Trees, Partitioning Methods: k-Means Clustering, k-Medoids Clustering, Density-Based
Clustering: DBSCAN - Density-Based Clustering Based on Connected Regions with High Density, Measuring Clustering Goodness

Unit 6: Information Retrieval .


Information Retrieval: Representation, Similarity, Evaluation, Modification, Stoplist and Word Classes, Phrases, Lemmatizing and
Stemming, Related Words, Statistically Related Words, Metadata. Text Clustering: Clustering versus Categorization, Clustering
Algorithms for Text - Partitioning Algorithms and Hierarchical Algorithms, Text Representation - Group of Documents, Projection and
Feature Selection, Applications of Document Clustering, Evaluation of Text Clustering: Internal and External Measures.

Text Book - Machine Learning (IBM ICE Publications)

Lab Tools – Python

Lab Exercises –
1. Program to demonstrate Simple Linear Regression
2. Program to demonstrate Logistic Regression using SCIKIT learn
3. Program to demonstrate Logistic Regression
4. Program to demonstrate Multiple Linear Regression
5. Program to demonstrate k-Nearest Neighbor flowers classification
6. Program to demonstrate Decision Tree – ID3 Algorithm
7. Program to demonstrate Naïve- Bayes Classifier
8. Program to demonstrate Back-Propagation Algorithm
9. Program to demonstrate k-means clustering algorithm
10. Program to demonstrate K-Means Clustering Algorithm on Handwritten Dataset
11. Program to demonstrate K-Medoid clustering algorithm
12. Program to demonstrate DBSCAN clustering algorithm
13. Program to demonstrate SVM based classification
14. Program to demonstrate PCA on face recognition
15. Program to demonstrate PCA and LDA on Iris dataset
16. Mini Project works shall be given with a batch of four students considering different datasets such as digit
dataset, face dataset, flower dataset and micro-array dataset.

You might also like