Seminar Report
Seminar Report
Seminar Report
On
MACHINE LEARNING
Submitted in partial fulfilment
For the award of the degree of
Bachelor of Technology
In
ELECTRONICS AND COMMUNICATION ENGINEERING
(Rajasthan Technical University ,Kota)
This project was developed as part of an internship program at YBI Foundation, showcasing
the integration of theoretical learning and hands-on experience in software development
and data science.
ACKNOWLEDGEMENT
I would like to express my gratitude for the people who were part of my report, directly or
indirectly people who gave unending support right from the stage the idea was conceived. It
gives me a great pleasure to have an opportunity to acknowledge and to express gratitude
to those who were associated with me during my Internship at YBI Foundation.
I take this opportunity to thank the industrial training coordinator, H.O.D of Computer
Science and Engineering department. I am highly indebted to my project guide Dr. Alok
Yadav (Training Instructor) for his guidance and words of wisdom. He always showed me
the right direction during the course of this report project work. I am duly thankful to him
for teaching and referring me to various blocks, providing work, and for permitting me to
have training of duration of 4 weeks.
Movie Recommendation System Using
Python
Contents
1 Objective 3
2 Internship Experience 4
4 Dataset Details 6
4.3 Statistics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
5 Steps Undertaken 7
1
6 Recommendation Algorithm 8
7 Model Evaluation 8
7.2 Performance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
10 References 10
2
1 Objective
3
2 Internship Experience
The YBI Foundation offers short-term internships focusing on building foundational and
advanced skills in Python programming and data science. Over the 15-day internship,
participants were introduced to a range of technologies essential for data-driven projects.
• Data Analysis: Understanding the dataset, cleaning the data, and preparing it
for modeling.
Technical Skills:
Professional Development:
4
3 Technologies and Tools Used
5
4 Dataset Details
The dataset used in this project is the MovieLens dataset, a popular benchmark in the
recommendation systems domain. It contains user ratings for movies spanning multiple
genres.
4.3 Statistics
• Total Movies: Over 10,000
• Total Users: 100,000+
• Number of Ratings: 1,000,000+
6
5 Steps Undertaken
The first step was importing the data into the Python environment using Pandas. The
datasets were loaded using pd.read csv() and explored for missing values and duplicates.
Example:
import pandas as pd
movies = pd.read_csv(’movies.csv’)
ratings = pd.read_csv(’ratings.csv’)
7
6 Recommendation Algorithm
The Surprise library’s SVD (Singular Value Decomposition) was used to build the rec-
ommendation model.
Steps Taken:
7 Model Evaluation
7.2 Performance
The RMSE of the model was 0.87, indicating good prediction accuracy.
8
8 Prediction and Results
Strengths:
• Highly personalized.
• Efficient for large datasets.
Limitations:
9
9 Conclusion and Future Scope
10 References
• MovieLens Dataset:
https://grouplens.org/datasets/movielens/
• Surprise Library Documentation:
https://surprise.readthedocs.io/en/stable/
• Scikit-learn Official Guide: https://scikit-learn.org/
10