0% found this document useful (0 votes)

3 views

Seminar Report

Uploaded by

anuj27092004

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views

Seminar Report

Uploaded by

anuj27092004

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

A

Seminar Report
On

MACHINE LEARNING
Submitted in partial fulfilment
For the award of the degree of
Bachelor of Technology
In
ELECTRONICS AND COMMUNICATION ENGINEERING
(Rajasthan Technical University ,Kota)

SUBMITTED TO : SUBMITTED BY:

KRINA DAYANI Name: LAKSH

SARONJA
(Guest Faculty)
Roll No.: 23/296
URN: 23EUCEC027

DEPARTMENT OF ELECTRONICS ENGINEERING

RAJASTHAN TECHNICAL UNIVERSITY, KOTA
DECEMBER 2024
Abstract
The Movie Recommendation System is a Python-based project designed to enhance the
movie-watching experience by providing personalized recommendations. Utilizing
powerful data manipulation libraries like Pandas and NumPy, this system analyzes user
preferences, historical data, and movie features to deliver accurate and tailored suggestions.

The core objective of this project is to implement recommendation algorithms, such as

content-based filtering and collaborative filtering, which enable the system to predict
movies a user might enjoy. The dataset used includes movie metadata, ratings, and user
interactions, enabling the system to build robust relationships between movies and viewer
preferences.

By leveraging Python's flexibility and efficient computational tools, the project

demonstrates the practical application of machine learning concepts in real-world
scenarios. This system has the potential to be expanded for commercial applications,
integrating additional data sources, and improving recommendation accuracy through deep
learning techniques.

This project was developed as part of an internship program at YBI Foundation, showcasing
the integration of theoretical learning and hands-on experience in software development
and data science.
ACKNOWLEDGEMENT
I would like to express my gratitude for the people who were part of my report, directly or
indirectly people who gave unending support right from the stage the idea was conceived. It
gives me a great pleasure to have an opportunity to acknowledge and to express gratitude
to those who were associated with me during my Internship at YBI Foundation.

I take this opportunity to thank the industrial training coordinator, H.O.D of Computer
Science and Engineering department. I am highly indebted to my project guide Dr. Alok
Yadav (Training Instructor) for his guidance and words of wisdom. He always showed me
the right direction during the course of this report project work. I am duly thankful to him
for teaching and referring me to various blocks, providing work, and for permitting me to
have training of duration of 4 weeks.
Movie Recommendation System Using
Python

Contents

1 Objective 3

2 Internship Experience 4

2.1 YBI Foundation Overview . . . . . . . . . . . . . . . . . . . . . . . . . . 4

2.2 Role in the Internship . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4

2.3 Skills Acquired . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4

3 Technologies and Tools Used 5

3.1 Python Programming . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5

3.2 Libraries Used . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5

4 Dataset Details 6

4.1 Source of the Dataset . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6

4.2 Structure of the Dataset . . . . . . . . . . . . . . . . . . . . . . . . . . . 6

4.3 Statistics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6

5 Steps Undertaken 7

5.1 Data Import and Exploration . . . . . . . . . . . . . . . . . . . . . . . . 7

5.2 Data Cleaning and Preprocessing . . . . . . . . . . . . . . . . . . . . . . 7

5.3 Data Visualization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7

1
6 Recommendation Algorithm 8

6.1 Why Collaborative Filtering? . . . . . . . . . . . . . . . . . . . . . . . . 8

6.2 SVD Implementation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

7 Model Evaluation 8

7.1 Metrics Used . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

7.2 Performance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

8 Prediction and Results 9

8.1 Top Recommendations for User X: ........................................................................ 9

8.2 Strengths and Limitations ...................................................................................... 9

9 Conclusion and Future Scope 10

9.1 Future Improvements ........................................................................................... 10

10 References 10

2
1 Objective

The objective of this industrial training was to develop a project on

a Movie Recommendation System capable of suggesting
personalized movie recommendations to users. With the growing use
of online streaming platforms, personalized content
recommendations have become a core component of enhancing user
engagement. By analyzing user ratings and leveraging advanced
recommendation algorithms, this system attempts to predict and
recommend movies that align with the users’ tastes.
This project was completed as part of a 15-day internship at YBI
Foundation, where the focus was on learning Python
programming, mastering data manipulation techniques, and
implementing real-world machine learning models.

3
2 Internship Experience

2.1 YBI Foundation Overview

The YBI Foundation offers short-term internships focusing on building foundational and
advanced skills in Python programming and data science. Over the 15-day internship,
participants were introduced to a range of technologies essential for data-driven projects.

2.2 Role in the Internship

During my internship, I was tasked with implementing a real-world recommendation

system. My primary responsibilities included:

• Data Analysis: Understanding the dataset, cleaning the data, and preparing it
for modeling.

• Model Building: Applying collaborative filtering algorithms for recommenda-

tions.

• Visualization: Using data visualization tools to identify patterns and trends.

• Evaluation: Testing the model’s accuracy using statistical metrics like RMSE and
MAE.

2.3 Skills Acquired

Technical Skills:

• Proficiency in Python libraries such as Pandas, NumPy, and Matplotlib.

• Familiarity with advanced algorithms like Singular Value Decomposition (SVD).

Professional Development:

• Improved problem-solving skills.

• Gained experience in documenting and presenting technical work.

4
3 Technologies and Tools Used

3.1 Python Programming

Python is a versatile programming language widely used in data science and

machine learning. It provides robust libraries for data manipulation,
statistical analysis, and machine learning.

3.2 Libraries Used

1. Pandas: A powerful library for data manipulation. Used for importing
datasets, cleaning missing data, and reshaping data structures.
2. NumPy: Efficient for numerical computations, enabling operations on
large arrays and matrices.
3. Matplotlib and Seaborn: Visualization libraries used to create
plots, graphs, and heatmaps. Helped understand rating distributions
and user preferences.
4. Surprise: A Python library specialized in recommendation systems.
Used to implement collaborative filtering techniques like SVD.
5. Scikit-learn: Essential for train-test splitting and model evaluation.

5
4 Dataset Details

4.1 Source of the Dataset

The dataset used in this project is the MovieLens dataset, a popular benchmark in the
recommendation systems domain. It contains user ratings for movies spanning multiple
genres.

4.2 Structure of the Dataset

• Movies.csv: Columns: Movie ID, Title, Genre.
Example: Movie ID 1 → Toy Story (1995) → Genre: Animation, Children, Adven-
ture.

• Ratings.csv: Columns: User ID, Movie ID, Rating, Timestamp.

Example: User 5 rated Movie ID 1 with 4.5 stars.

• Links.csv (Optional): Provides metadata such as IMDb links.

4.3 Statistics
• Total Movies: Over 10,000
• Total Users: 100,000+
• Number of Ratings: 1,000,000+

6
5 Steps Undertaken

5.1 Data Import and Exploration

The first step was importing the data into the Python environment using Pandas. The
datasets were loaded using pd.read csv() and explored for missing values and duplicates.
Example:

import pandas as pd
movies = pd.read_csv(’movies.csv’)
ratings = pd.read_csv(’ratings.csv’)

5.2 Data Cleaning and Preprocessing

• Missing Data Handling: Removed rows with null values to avoid inconsistencies.
• Feature Engineering: Encoded genres into numerical format for easier analysis.
• Matrix Construction: Created a user-item matrix where rows represent users
and columns represent movies.

5.3 Data Visualization

Visualization was performed to understand patterns.

• Rating Distribution: Plotted a histogram to show the frequency of ratings.

• Most Popular Movies: Identified movies with the highest number of ratings.

import matplotlib.pyplot as plt

ratings[’rating’].hist(bins=5)
plt.title(’Rating Distribution’)
plt.show()

7
6 Recommendation Algorithm

6.1 Why Collaborative Filtering?

This technique predicts user preferences by identifying patterns in user behavior.

6.2 SVD Implementation

The Surprise library’s SVD (Singular Value Decomposition) was used to build the rec-
ommendation model.

Steps Taken:

1. Data Preparation: Loaded data into the Surprise library format.

2. Apply Algorithm: Trained the model using SVD.

3. Generate Predictions: Predicted user ratings for unseen movies.

7 Model Evaluation

7.1 Metrics Used

• RMSE (Root Mean Squared Error): Measures prediction accuracy.
• MAE (Mean Absolute Error): Evaluates average prediction error.

7.2 Performance

The RMSE of the model was 0.87, indicating good prediction accuracy.

8
8 Prediction and Results

8.1 Top Recommendations for User X:

• Inception (2010): 4.8
• The Dark Knight (2008): 4.7
• Interstellar (2014): 4.6

8.2 Strengths and Limitations

Strengths:

• Highly personalized.
• Efficient for large datasets.

Limitations:

• Struggles with new users or movies (cold start

problem).
• Biased toward popular items.

9
9 Conclusion and Future Scope

The Movie Recommendation System successfully provided

personalized suggestions.

9.1 Future Improvements

• Incorporate additional features like timestamps or user
demographics.
• Explore hybrid approaches combining collaborative and
content-based filtering.

10 References
• MovieLens Dataset:
https://grouplens.org/datasets/movielens/
• Surprise Library Documentation:
https://surprise.readthedocs.io/en/stable/
• Scikit-learn Official Guide: https://scikit-learn.org/

Book Recommendation System Proposal Report
100% (1)
Book Recommendation System Proposal Report
20 pages
Power BI - Final Project
No ratings yet
Power BI - Final Project
212 pages
Assignment 1 17bcs2733
No ratings yet
Assignment 1 17bcs2733
22 pages
Database Assignment
100% (6)
Database Assignment
7 pages
FRESH_FINDS___Documentation (1)
No ratings yet
FRESH_FINDS___Documentation (1)
32 pages
Report
No ratings yet
Report
59 pages
Share CapstoneFinal
No ratings yet
Share CapstoneFinal
69 pages
dsbda_mini_2
No ratings yet
dsbda_mini_2
23 pages
Project Report
No ratings yet
Project Report
49 pages
Deep Learning Based Recommendation Systems
No ratings yet
Deep Learning Based Recommendation Systems
47 pages
dsbda_mini_2__1_
No ratings yet
dsbda_mini_2__1_
23 pages
Anupam
No ratings yet
Anupam
41 pages
Image Stenography
No ratings yet
Image Stenography
22 pages
Analysis of Wi-Fi Performance Data
No ratings yet
Analysis of Wi-Fi Performance Data
50 pages
Format Report
No ratings yet
Format Report
41 pages
f1-2
No ratings yet
f1-2
53 pages
Performance Analysis and Design of An IoT-Friendly DAG-based Distributed Ledger System
No ratings yet
Performance Analysis and Design of An IoT-Friendly DAG-based Distributed Ledger System
119 pages
Report of Dimensions Measurement of An Object in 2D Image Using Image Processing in Python
No ratings yet
Report of Dimensions Measurement of An Object in 2D Image Using Image Processing in Python
70 pages
Proposal BE Project
No ratings yet
Proposal BE Project
22 pages
KECReport
No ratings yet
KECReport
23 pages
Morris 18 PH D
No ratings yet
Morris 18 PH D
181 pages
Cyberspace Monitoring Using AI and Graph Theoretic Tools
No ratings yet
Cyberspace Monitoring Using AI and Graph Theoretic Tools
38 pages
Swaraj Project 12march
No ratings yet
Swaraj Project 12march
31 pages
Software Defect
No ratings yet
Software Defect
46 pages
Thesis RajaKumar 19MCA005
No ratings yet
Thesis RajaKumar 19MCA005
49 pages
Visual Quality Assessment by Machine Learning: Long Xu Weisi Lin C.-C. Jay Kuo
No ratings yet
Visual Quality Assessment by Machine Learning: Long Xu Weisi Lin C.-C. Jay Kuo
142 pages
21ESKCA031 Baldeep Report (1)
No ratings yet
21ESKCA031 Baldeep Report (1)
34 pages
Sentimental Analysis of Movie Review
100% (1)
Sentimental Analysis of Movie Review
58 pages
D4.2.1 First Version of The Data Analytics Benchmark
No ratings yet
D4.2.1 First Version of The Data Analytics Benchmark
18 pages
Joseph_Stanton_2014
No ratings yet
Joseph_Stanton_2014
85 pages
Master Ahmed Hussnain 2014 PDF
No ratings yet
Master Ahmed Hussnain 2014 PDF
85 pages
Team6 Project Report
No ratings yet
Team6 Project Report
29 pages
Project Report Minor Project (1)
No ratings yet
Project Report Minor Project (1)
15 pages
Anjana Tiha Masters Project
No ratings yet
Anjana Tiha Masters Project
26 pages
Rushil Dave Thesis
No ratings yet
Rushil Dave Thesis
107 pages
Rpkdtech PDF
No ratings yet
Rpkdtech PDF
56 pages
Bachelor of Engineering (Computer Engineering)
No ratings yet
Bachelor of Engineering (Computer Engineering)
47 pages
Computer Network Analysis by Visualization
No ratings yet
Computer Network Analysis by Visualization
43 pages
Blockchain Personal Sem 1
0% (1)
Blockchain Personal Sem 1
41 pages
Report
No ratings yet
Report
18 pages
Design, Development and Performance Evaluation of Multiprocessor Systems On Fpga
No ratings yet
Design, Development and Performance Evaluation of Multiprocessor Systems On Fpga
161 pages
Thesis Karttunen Jarkko
No ratings yet
Thesis Karttunen Jarkko
62 pages
E Notice Report
No ratings yet
E Notice Report
82 pages
Data Analysis Project by Manika
No ratings yet
Data Analysis Project by Manika
59 pages
Bhagya Report Final
No ratings yet
Bhagya Report Final
73 pages
First Project
No ratings yet
First Project
34 pages
FINALLY (1)
No ratings yet
FINALLY (1)
51 pages
J.Janardhan M.SC - Thesis
100% (1)
J.Janardhan M.SC - Thesis
150 pages
aaaaa
No ratings yet
aaaaa
60 pages
Video Object Segmentation Tasks Dataset and Methods 3rd Edition Ning Xu instant download
100% (1)
Video Object Segmentation Tasks Dataset and Methods 3rd Edition Ning Xu instant download
64 pages
Mit PDF
No ratings yet
Mit PDF
45 pages
D4.5 Iterative Quality Enhancement Tools Initial Version
No ratings yet
D4.5 Iterative Quality Enhancement Tools Initial Version
45 pages
Mini Project
No ratings yet
Mini Project
21 pages
Video Classification Using Deep Learning For Video Providers Project Report
No ratings yet
Video Classification Using Deep Learning For Video Providers Project Report
36 pages
Report 83
No ratings yet
Report 83
50 pages
Dbprojreport
No ratings yet
Dbprojreport
35 pages
IoT Based Illness Prediction System Using Machine Learning
No ratings yet
IoT Based Illness Prediction System Using Machine Learning
25 pages
Parallel Processing of Images For Feature Extraction
No ratings yet
Parallel Processing of Images For Feature Extraction
40 pages
Dishank Jain 22eskca031 Itr Report 3CS Ai G1
No ratings yet
Dishank Jain 22eskca031 Itr Report 3CS Ai G1
21 pages
LAN Security Manager PDF
No ratings yet
LAN Security Manager PDF
47 pages
Mohak-RR
No ratings yet
Mohak-RR
57 pages
report12
No ratings yet
report12
40 pages
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: NAIVE BAYES, NEAREST NEIGHBORS and NEURAL NETWORKS: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. CLASSIFICATION PREDICTIVE TECHNIQUES: NAIVE BAYES, NEAREST NEIGHBORS and NEURAL NETWORKS: Examples with MATLAB
César Pérez López
No ratings yet
CO4703 Assignment Brief 2020-2021
No ratings yet
CO4703 Assignment Brief 2020-2021
6 pages
School Pictograph: School Number of Students
No ratings yet
School Pictograph: School Number of Students
2 pages
Ass DB
No ratings yet
Ass DB
8 pages
Bigdata-Bigdata (Set 1)
No ratings yet
Bigdata-Bigdata (Set 1)
11 pages
Database Denormalization Example-05032020-061547am
No ratings yet
Database Denormalization Example-05032020-061547am
4 pages
Foundations of Business Intelligence (BI) From Concept To Implementation
No ratings yet
Foundations of Business Intelligence (BI) From Concept To Implementation
75 pages
MySQL DBA TRAINING 2020
No ratings yet
MySQL DBA TRAINING 2020
8 pages
Virtual Storage Access Method (VSAM)
No ratings yet
Virtual Storage Access Method (VSAM)
49 pages
Business Benefits of SAP S4HANA
100% (1)
Business Benefits of SAP S4HANA
9 pages
Difference Between DBMS and RDBMS
No ratings yet
Difference Between DBMS and RDBMS
16 pages
Chapter 2 Complete
No ratings yet
Chapter 2 Complete
1 page
33 Mgg03 Ku1102 Computationalthinking
No ratings yet
33 Mgg03 Ku1102 Computationalthinking
32 pages
07 Handout 1 PDF
No ratings yet
07 Handout 1 PDF
6 pages
Assignment No 2 Advanced Database Programming
No ratings yet
Assignment No 2 Advanced Database Programming
2 pages
Certified List of Candidates: Cavite - City of General Trias Cavite - City of General Trias
No ratings yet
Certified List of Candidates: Cavite - City of General Trias Cavite - City of General Trias
2 pages
Arquitectura WhatsApp
100% (1)
Arquitectura WhatsApp
3 pages
SYNOPSIS
No ratings yet
SYNOPSIS
10 pages
Uxd Question Bank
No ratings yet
Uxd Question Bank
3 pages
Mining 2720209
No ratings yet
Mining 2720209
3 pages
Role of Computers in Library Automation: An Introduction
No ratings yet
Role of Computers in Library Automation: An Introduction
2 pages
Hoffer Mdm11e PP Ch11-JSF
No ratings yet
Hoffer Mdm11e PP Ch11-JSF
33 pages
CH 11
No ratings yet
CH 11
31 pages
Search Engines
No ratings yet
Search Engines
15 pages
Bibliography
No ratings yet
Bibliography
2 pages
Tuning Rac
No ratings yet
Tuning Rac
6 pages
Human Computer Interaction Presentation
No ratings yet
Human Computer Interaction Presentation
14 pages
Postgraduate Pg Master Computer Applications Mca Semester 3 2023 November Data Warehousing and Data Mining 2020 Pattern
No ratings yet
Postgraduate Pg Master Computer Applications Mca Semester 3 2023 November Data Warehousing and Data Mining 2020 Pattern
3 pages

Seminar Report

Uploaded by

Seminar Report

Uploaded by

A

SUBMITTED TO : SUBMITTED BY:

KRINA DAYANI Name: LAKSH

DEPARTMENT OF ELECTRONICS ENGINEERING

The core objective of this project is to implement recommendation algorithms, such as

By leveraging Python's flexibility and efficient computational tools, the project

2.1 YBI Foundation Overview . . . . . . . . . . . . . . . . . . . . . . . . . . 4

2.2 Role in the Internship . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4

2.3 Skills Acquired . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4

3 Technologies and Tools Used 5

3.1 Python Programming . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5

3.2 Libraries Used . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5

4.1 Source of the Dataset . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6

4.2 Structure of the Dataset . . . . . . . . . . . . . . . . . . . . . . . . . . . 6

5.1 Data Import and Exploration . . . . . . . . . . . . . . . . . . . . . . . . 7

5.2 Data Cleaning and Preprocessing . . . . . . . . . . . . . . . . . . . . . . 7

5.3 Data Visualization . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7

6.1 Why Collaborative Filtering? . . . . . . . . . . . . . . . . . . . . . . . . 8

6.2 SVD Implementation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

7.1 Metrics Used . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8

8 Prediction and Results 9

8.1 Top Recommendations for User X: ........................................................................ 9

8.2 Strengths and Limitations ...................................................................................... 9

9 Conclusion and Future Scope 10

9.1 Future Improvements ........................................................................................... 10

The objective of this industrial training was to develop a project on

2.1 YBI Foundation Overview

2.2 Role in the Internship

During my internship, I was tasked with implementing a real-world recommendation

• Model Building: Applying collaborative filtering algorithms for recommenda-

• Visualization: Using data visualization tools to identify patterns and trends.

2.3 Skills Acquired

• Proficiency in Python libraries such as Pandas, NumPy, and Matplotlib.

• Improved problem-solving skills.

3.1 Python Programming

Python is a versatile programming language widely used in data science and

3.2 Libraries Used

4.1 Source of the Dataset

4.2 Structure of the Dataset

• Ratings.csv: Columns: User ID, Movie ID, Rating, Timestamp.

• Links.csv (Optional): Provides metadata such as IMDb links.

5.1 Data Import and Exploration

5.2 Data Cleaning and Preprocessing

5.3 Data Visualization

Visualization was performed to understand patterns.

• Rating Distribution: Plotted a histogram to show the frequency of ratings.

import matplotlib.pyplot as plt

6.1 Why Collaborative Filtering?

This technique predicts user preferences by identifying patterns in user behavior.

6.2 SVD Implementation

1. Data Preparation: Loaded data into the Surprise library format.

2. Apply Algorithm: Trained the model using SVD.

3. Generate Predictions: Predicted user ratings for unseen movies.

7.1 Metrics Used

8.1 Top Recommendations for User X:

8.2 Strengths and Limitations

• Struggles with new users or movies (cold start

The Movie Recommendation System successfully provided

9.1 Future Improvements

You might also like