Project Movielense Solution
Project Movielense Solution
Problem Objective:
Here, we ask you to perform the analysis using the Exploratory Data Analysis
technique. You need to find features affecting the ratings of any particular movie
and build a model to predict the movie ratings.
Domain: Entertainment
1. Find out all the unique genres (Hint: split the data in column genre making
a list and then process the data to find out only the unique categories of
genres)
2. Create a separate column for each genre category with a one-hot encoding
( 1 and 0) whether or not the movie belongs to that genre.
3. Determine the features affecting the ratings of any particular movie.
4. Develop an appropriate model to predict the movie ratings
Dataset Description:
Field Description
UserID Unique identification for each user
MovieID Unique identification for each movie
Rating User rating for each movie
Timestamp Timestamp generated while adding user review
Users.dat
Format - UserID::Gender::Age::Occupation::Zip-code
Field Description
UserID Unique identification for each user
Genere Category of each movie
Age User’s age
Occupation User’s Occupation
Zip-code Zip Code for the user’s location
Value Description
1 "Under 18"
18 "18-24"
25 "25-34"
35 "35-44"
45 "45-49"
50 "50-55"
56 "56+"
Occupation is chosen from the following choices:
Value Description
0 "other" or not specified
1 "academic/educator"
2 "artist”
3 "clerical/admin"
4 "college/grad student"
5 "customer service"
6 "doctor/health care"
7 "executive/managerial"
8 "farmer"
9 "homemaker"
10 "K-12 student"
11 "lawyer"
12 "programmer"
13 "retired"
14 "sales/marketing"
15 "scientist"
16 "self-employed"
17 "technician/engineer"
18 "tradesman/craftsman"
19 "unemployed"
20 "writer”
Movies.dat
Format - MovieID::Title::Genres
Field Description
Unique identification for
MovieID
each movie
Title A title for each movie
Genres Category of each movie
Genres are pipe-separated and are selected from the following genres:
1. Action
2. Adventure
3. Animation
4. Children's
5. Comedy
6. Crime
7. Documentary
8. Drama
9. Fantasy
10. Film-Noir
11. Horror
12. Musical
13. Mystery
14. Romance
15. Sci-Fi
16. Thriller
17. War
18. Western
Some MovieIDs do not correspond to a movie due to accidental duplicate
entries and/or test entries
Movies are mostly entered by hand, so errors and inconsistencies may
exist
Movielens Case
Study.ipynb