0% found this document useful (0 votes)

6 views

Lab manual on recommender system

The document is a manual on recommender systems, detailing their importance in managing information overload by providing personalized content and services. It covers various types of recommender systems, including content-based, collaborative, and hybrid filtering techniques, along with their advantages and challenges. The manual also discusses the phases of recommender systems, installation requirements, and basic programming concepts necessary for implementation.

Uploaded by

arsathahmed06

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views

Lab manual on recommender system

Uploaded by

arsathahmed06

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 59

National Institute of Technical

Teachers Training and Research

(NITTTR, CHANDIGARH)

MANNUAL ON
RECOMMENDER SYSTEM
CONTENTS
1. INTRODUCTION ……………………………………………. 2
2. WHAT IS RECOMMENDER SYSTEM? …………………….4
3. WHY RECOMMENDARER SYSTEM? ……………………..6
4. TYPES OF RECOMMENDERSYSTEM? ……………………7
5. CONTENT BASED FILTERING …………………..................7
6. COLLABORATIVE BASED FILTERING ……………………9
7. HYBRI BASED FILTERING …………………………………12
8. PHASES OF RECOMMENDER SYSTEM …………………..14
9. REQUIREMENTS ……………………………………………16
10. INSTALLATION OF ANACONDA………………………….17
11. BASICS ON PYTHON ……………………………………….31
12. BASIC ON NUMPY ………………………………………….37
13. BASIC ON PANDAS ………………………………………....39
14. BASIC ON MATPLOTLIB …………………………………..40
15. DIFFERENT STEPS TO CONSTRUCT BASIC RECOMMENDER
SYSTEM ……………………………………………………….42
16.BENEFITS OF RECOMMENDER SYSSTEM………………..56
17. CONCLUSION…………………………………………………57
18. ASSIGNMENT…………………………………………………58

1
INTRODUCTION
An abundant amount of information is created and delivered over
electronic media. Users risk becoming overwhelmed by the flow of
information, and they lack adequate tools to help them manage the
situation. Information filtering (IF) is one of the methods that is rapidly
evolving to manage large information flows. The aim of IF is to expose
users to only information that is relevant to them. Many IF systems have
been developed in recent years for various application domains. Some
examples of filtering applications are: filters for search results on the
internet that are employed in the Internet software, personal e-mail filters
based on personal profiles, list servers or newsgroups filters for groups
or individuals, browser filters that block non-valuable information, filters
designed to give children access them only to suitable pages, filters for e-
commerce applications that address products and promotions to potential
customers only, and many more. It deals with the delivering the
information which the user is going to like or they feel useful. The
information filtering system assist the user and provide the relevant
information from the data source.
In the past, people used to shop in a physical store, in which the items
available are limited. For instance, the number of movies that can be
placed in a Blockbuster store depends on the size of that store. By
contrast, nowadays, the Internet allows people to access abundant
resources online. On the Internet, where the number of choices is
overwhelming, there is need to filter, prioritize and efficiently deliver
relevant information in order to alleviate the problem of information
overload, which has created a potential problem to many Internet users.
Recommender systems solve this problem by searching through large
volume of dynamically generated information to provide users with

2
personalized content and services .Netflix, for example, has an enormous
collection of movies. Although the amount of available information
increased, a new problem arose as people had a hard time selecting the
items they actually want to see. This is where the recommender system
comes in.
History
Before internet, there are already several methods of filtering
information; for instance, governments may control and restrict the flow
of information in a given country by means of formal or informal
censorship. Let’s talk about information filters if we refer to newspaper
editors and journalists when they provide a service that selects the most
valuable information for their clients, readers of books, magazines,
newspapers, radio listeners and television viewers. This filtering
operation is also present in schools and universities where there is a
selection of information to provide assistance based on academic criteria
to customers of this service, the students. With the advent of the Internet
it is possible that anyone can publish anything he wishes at a low-cost. In
this way, it increases considerably the less useful information and
consequently the quality information is disseminated. With this problem,
it began to devise new filtering with which we can get the information
required for each specific topic to easily and efficiently.

3
What are recommender systems?
The sudden explosion in the amount of digital information and the
number of user of Internet have created a potential challenge of
information overload which hinders timely access to items of interest.
Information retrieval systems, such as Google, DevilFinder and AltaVista
have partially solved this problem but prioritization and personalization
of information were absent. This has increased the demand for
recommender systems.
Recommender systems aim to predict users’ interests and recommend
product items that quite likely are interesting for them. Data required for
recommender systems stems from explicit user ratings after watching a
movie or listening to a song, from implicit search engine queries and
purchase histories, or from other knowledge about the users/items
themselves.
Recommender system is defined as a decision making strategy for users
under complex information environments. From the perspective of E-
commerce as a tool that helps users search through records of knowledge
which is related to users’ interest and preference. It can also be defined
as a means of assisting and augmenting the social process of using
recommendations of others to make choices when there is no sufficient
personal knowledge or experience of the alternatives. Handle the problem
of information overload that users normally encounter by providing them
with personalized, exclusive content and service recommendations.

4
AREAS WHERE RECOMMENDER SYSTEM
USED.

Online stores Product

discovery

Recommender Loyality
Searches system program
s

Bills and Email

emailers campaigns

Fig -1

5
Why do we need recommender
systems?
1. Companies using recommender systems focus on increasing sales as a
result of very personalized offers and an enhanced customer experience.
2. Recommendations typically speed up searches and make it easier for
users to access content they’re interested in.
3. The user starts to feel known and understood and is more likely to buy
additional products or consume more content. By knowing what a user
wants, the company gains competitive advantage and the threat of losing
a customer to a competitor decreases.
4. Recommender systems are information filtering systems that deal with
the problem of information overload.
5. It has the ability to predict whether a particular user would prefer an
item or not based on the user’s profile.
6. Recommender systems are beneficial to both service providers and
users. They reduce transaction costs of finding and selecting items in an
online shopping environment.
7. Recommendation systems have also proved to improve decision making
process and quality.
8. In e-commerce setting, recommender systems enhance revenues, for the
fact that they are effective means of selling more products.
9. In scientific libraries, recommender systems support users by allowing
them to move beyond catalog searches.

6
TYPES OF RECOMMENDER
SYSTEM

Fig-2

Content Based Filtering:-

These systems make recommendations using a user’s item and profile
features. They hypothesize that if a user was interested in an item in the
past, they will once again be interested in it in the future. Similar items
are usually grouped based on their features. User profiles are constructed
using historical interactions or by explicitly asking users about their

7
interests. There are other systems, not considered purely content-based,
which utilize user personal and social data.
Content-based technique is a domain-dependent algorithm and it
emphasizes more on the analysis of the attributes of items in order to
generate predictions. When documents such as web pages, publications
and news are to be recommended, content-based filtering technique is the
most successful. In content-based filtering technique, recommendation is
made based on the user profiles using features extracted from the content
of the items the user has evaluated in the past.
In order to generate meaningful recommendations we use Vector Space
Model such as Term Frequency Inverse Document Frequency (TF/IDF)
or Probabilistic models such as Naïve Bayes Classifier, Decision Trees
or Neural Networks to model the relationship between different
documents within a corpus. These techniques make recommendations by
learning the underlying model with either statistical analysis or machine
learning techniques.

Pros and Cons of Content-Based Filtering Techniques

A common problem is that new users lack a defined profile unless they
are explicitly asked for information. Nevertheless, it is relatively simple
to add new items to the system. We just need to ensure that we assign
them a group according to their features. They have the ability to
recommend new items even if there are no ratings provided by users. So
even if the database does not contain user preferences, recommendation
accuracy is not affected. Also, if the user preferences change, it has the
capacity to adjust its recommendations in a short span of time. They can
manage situations where different users do not share the same items, but
only identical items according to their intrinsic features. Users can get
recommendations without sharing their profile, and this ensures privacy.

8
Content based filtering techniques are dependent on items’ metadata.
That is, they require rich description of items and very well organized
user profile before recommendation can be made to users. This is called
limited content analysis. So, the effectiveness of CBF depends on the
availability of descriptive data.

Collaborative filtering:-
Collaborative filtering technique is the most mature and the most
commonly implemented. Collaborative filtering recommends items by
identifying other users with similar taste; it uses their opinion to
recommend items to the active user. Collaborative recommender systems
have been implemented in different application areas. GroupLens is a
news-based architecture which employed collaborative methods in
assisting users to locate articles from massive news database.
Collaborative filtering is currently one of the most frequently used
approaches and usually provides better results than content-based
recommendations. Some examples of this are found in the
recommendation systems of YouTube, Netflix, and Spotify.
Collaborative filtering is a domain-independent prediction technique for
content that cannot easily and adequately be described by metadata such
as movies and music. Collaborative filtering technique works by building
a database (user-item matrix) of preferences for items by users. It then
matches users with relevant interest and preferences by calculating
similarities between their profiles to make recommendations. Such users
build a group called neighborhood. A user gets recommendations to those
items that he has not rated before but that were already positively rated
by users in his neighborhood.
The system uses collaborative filtering method to overcome scalability
issue by generating a table of similar items offline through the use of
item-to-item matrix. The system then recommends other products which
are similar online according to the users’ purchase history.
9
There are two types of methods to achieve this goal: memory-based and
model-based.

Memory-based:-
There are two approaches: the first one identifies clusters of users and
utilizes the interactions of one specific user to predict the interactions of
other similar users. The second approach identifies clusters of items that
have been rated by user A and utilizes them to predict the interaction of
user A with a different but similar item B. These methods usually
encounter major problems with large sparse matrices, since the number
of user-item interactions can be too low for generating high quality
clusters.

Model-based:-
These methods are based on machine learning and data mining
techniques. The goal is to train models to be able to make predictions.
For example, we could use existing user-item interactions to train a model
to predict the top-5 items that a user might like the most. One advantage
of these methods is that they are able to recommend a larger number of
items to a larger number of users, compared to other methods like
memory-based.

Pros and Cons of Collaborative Filtering Techniques

Collaborative Filtering has some major advantages over CBF in that it
can perform in domains where there is not much content associated with
items and where content is difficult for a computer system to analyze
(such as opinions and ideal). Also, CF technique has the ability to provide
serendipitous recommendations, which means that it can recommend
items that are relevant to the user even without the content being in the
user’s profile.

10
Issues with collaborative filtering systems are as defined:-

Cold-start problem

This refers to a situation where a recommender does not have adequate

information about a user or an item in order to make relevant predictions
.This is one of the major problems that reduce the performance of
recommendation system. The profile of such new user or item will be
empty since he has not rated any item; hence, his taste is not known to
the system.
Data sparsity problem

This is the problem that occurs as a result of lack of enough information,

that is, when only a few of the total number of items available in a
database are rated by users. This always leads to a sparse user-item
matrix, inability to locate successful neighbors and finally, the generation
of weak recommendations.
Scalability

This is another problem associated with recommendation algorithms

because computation normally grows linearly with the number of users
and items .A recommendation technique that is efficient when the
number of dataset is limited may be unable to generate satisfactory
number of recommendations when the volume of dataset is increased.
Thus, it is crucial to apply recommendation techniques which are capable
of scaling up in a successful manner as the number of dataset in a
database increases.
Synonymy

Synonymy is the tendency of very similar items to have different names

or entries. Most recommender systems find it difficult to make distinction
between closely related items such as the difference between e.g. baby
11
wear and baby cloth. Collaborative Filtering systems usually find no
match between the two terms to be able to compute their similarity.

Hybrid Filtering:-
Hybrid filtering technique combines different recommendation
techniques in order to gain better system optimization to avoid some
limitations and problems of pure recommendation systems. The idea
behind hybrid techniques is that a combination of algorithms will provide
more accurate and effective recommendations than a single algorithm as
the disadvantages of one algorithm can be overcome by another
algorithm. The combination of approaches can be done in any of the
following ways: separate implementation of algorithms and combining
the result, utilizing some content-based filtering in collaborative
approach, utilizing some collaborative filtering in content-based
approach, creating a unified recommendation system that brings together
both approaches.
Different types hybrid filtering are
Weighted hybridization
Weighted hybridization combines the results of different recommenders
to generate a recommendation list or prediction by integrating the scores
from each of the techniques in use by a linear formula. They are given
equal weights at first, but weights are adjusted as predictions are
confirmed or otherwise. The benefit of a weighted hybrid is that all the
recommender system’s strengths are utilized during the recommendation
process in a straightforward way.
Switching hybridization
The system swaps to one of the recommendation techniques according to
a heuristic reflecting the recommender ability to produce a good rating.
12
The switching hybrid has the ability to avoid problems specific to one
method e.g. the new user problem of content-based recommender, by
switching to a collaborative recommendation system.
Cascade hybridization

The cascade hybridization technique applies an iterative refinement

process in constructing an order of preference among different items. The
recommendations of one technique are refined by another
recommendation technique. The first recommendation technique outputs
a coarse list of recommendations which is in turn refined by the next
recommendation technique. The hybridization technique is very efficient
and tolerant to noise due to the coarse-to-finer nature of the iteration.
Mixed hybridization
Mixed hybrids combine recommendation results of different
recommendation techniques at the same time instead of having just one
recommendation per item. Each item has multiple recommendations
associated with it from different recommendation techniques. In mixed
hybridization, the individual performances do not always affect the
general performance of a local region.
Feature-combination
The features produced by a specific recommendation technique are fed
into another recommendation technique. For example, the rating of
similar users which is a feature of collaborative filtering is used in a case-
based reasoning recommendation technique as one of the features to
determine the similarity between items.

Feature-augmentation
The technique makes use of the ratings and other information produced
by the previous recommender and it also requires additional functionality
from the recommender systems. Feature-augmentation hybrids are
13
superior to feature-combination methods in that they add a small number
of features to the primary recommender.

Meta-level
The internal model generated by one recommendation technique is used
as input for another. The model generated is always richer in
information when compared to a single rating. Meta-level hybrids are
able to solve the sparsity problem of collaborative filtering techniques
by using the entire model learned by the first technique as input for the
second technique.

Phases of Recommender System

Information Collection Phase:-

This collects relevant information of users to generate a user profile or
model for the prediction tasks including user’s attribute, behaviors or
content of the resources the user accesses. A recommendation agent
cannot function accurately until the user profile/model has been well
constructed. In E-learning platform, a user profile is a collection of
personal information associated with a specific user. This information

14
includes cognitive skills, intellectual abilities, learning styles, interest,
preferences and interaction with the system. The user profile is normally
used to retrieve the needed information to build up a model of the user.

Explicit Feedback:-
Explicit feedback requires more effort from user, it is still seen as
providing more reliable data, since it does not involve extracting
preferences from actions, and it also provides transparency into the
recommendation process that results in a slightly higher perceived
recommendation quality and more confidence in the recommendations.

Implicit feedback:-
Implicit feedback reduces the burden on users by inferring their user’s
preferences from their behavior with the system. The method though does
not require effort from the user, but it is less accurate.

Learning phase
It applies a learning algorithm to filter and exploit the user’s features from
the feedback gathered in information collection phase.

Prediction/recommendation phase
It recommends or predicts what kind of items the user may prefer. This
can be made either directly based on the dataset collected in information
collection phase which could be memory based or model based or
through the system’s observed activities of the user.

15
What Are The Prerequisites For
Building A Recommender
System?
Data is the single most important asset. Essentially, you need to know
some details about your users and items. If metadata is all you have
available, you can start with content-based approaches. If you have a
large number of user interactions, you can experiment with more
powerful collaborative filtering. The larger the data set in your
possession, the better your systems will work.

What is metadata?
Metadata is "data that provides information about other data". In other
words, it is "data about data." Many distinct types of metadata exist,
including descriptive metadata, structural metadata, administrative
metadata reference metadata and statistical metadata. Descriptive
metadata is descriptive information about a resource. It is used for
discovery and identification. It includes elements such as title, abstract,
author, and keywords.
 Structural metadata is metadata about containers of data and
indicates how compound objects are put together, for example,
how pages are ordered to form chapters. It describes the types,
versions, relationships and other characteristics of digital materials.
 Administrative metadata is information to help manage a resource,
like resource type, permissions, and when and how it was created.
 Reference metadata is information about the contents and quality
of statistical data.
 Statistical metadata, also called process data, may describe
processes that collect, process, or produce statistical data
16
STEP 1: - INSTALLATION OF
ANACONDA
1. To download the installer: -
https://www.anaconda.com/distribution/ CLICK
We get the following: -

Fig-3
a. We now need to download according to your operating system
windows 64/32bits or version 2 or 3 but recommended version 3.

17
Fig - 4
b. On clicking on the download we get an .exe file. We click on the
.exe file we get the following: -

Fig - 5

18
Click here (click on
the next)

Fig - 6

c. Now click on next and then on “I agree” followed by just me option

a next option. Choosing different position. Click on next.

click

Fig - 7
d. Clicking on next we get the following: -

19
Fig – 8
Note: - It may take some time.

Fig - 9
Then click on next, again next and the finish
e. The go to windows start menu. Click on the anaconda prompt.

20
click

Fig -10
f. Then at Anaconda prompt do the following-
g. Write the following commands: - > python Press
Enter

21
Fig - 11
h. Type >import this. It will import all the function required.

Fig -12

22
Then type “exit ()”
Step 2: - Installing Jupyter On Windows
1. Installing Jupyter on Windows using the
Anaconda Prompt
a. To install Jupyter on Windows, open the Anaconda
Prompt and type:
>conda install Jupyter
Type ‘y’ for yes when prompted. Once Jupyter is installed, type the
command below into the Anaconda Prompt to open the Jupyter
notebook file browser and start using Jupyter notebooks.

Fig – 13
Or you can directly type “jupyter notebook “ in the anaconda command
prompt.as shown below. (As some system may go directly)

23
b. Next we get the following: -

Fig -14

24
c. Now we get the following home page: -

Fig-15

25
d. Click on “NEW” we get the following: -

Fig – 16
e. After clicking on new click on python 3 as shown:

Click on it

Fig – 17

26
f. Then we get the following:-

Fig-18
g. As shown in the following: -

Fig 19

27
h. To run the program, click on the following: -

Fig -20
2. Steps to rename Jupyter: -
First go to file on Jupyter Notebook as shown: -
1) Click there and we get the following figure.

Click on
file

Fig-21

28
2) Click on the Rename.

Click on it

Fig-22

29
3) We get the following:-

Fig-23
Now we can rename as we want to save.

30
Basic On Python
Introduction on PYTHON: -
 Python is one of the most popular programming language created
by Guido van Rossum.
 It is a general-purpose interpreted, interactive, object-oriented, and
high-level programming language.
 Python language is being used by almost all tech-giant companies
like – Google, Amazon, Facebook, Instagram, Dropbox, Uber…
etc.
 Characteristics of Python
Following are important characteristics of Python
Programming −
 It supports functional and structured programming methods as well

as OOP.
 It can be used as a scripting language or can be compiled to byte-

code for building large applications.

 It provides very high-level dynamic data types and supports

dynamic type checking.

 It supports automatic garbage collection.

 It can be easily integrated with C, C++, COM, ActiveX, CORBA,

and Java.
Why Python?
 Python is one of the easiest language, which is readable and
understandable.
 Here the codes written is nearer to the English language.
 There is no such restriction in the language, so it is highly popular
among the developers.

31
Basic:
a. For initializing variables in python: -
i. <variables name> = <value>

ii. For taking input from the user: -

<variable name> = input (“enter the name:”)
Examples “Hello Jupyter Programmers!!!”

Fig – 24
The program for taking an input from the user and print “hello”

Fig - 25
32
b. Blok of Indentation: -
In python indentation is used to define the loop and the control
structure. Here the user has to pay the attention to the whitespaces.
Like other language to define the starting of the block in the function
they use curly braces “{}” here in python it uses colons “:”.
Examples:
def symbol (): #user defined function and operation
a=a+1
return a
print (a) Comment line

c. Tables of key words in Python: -

true False none and

Or Asser Brea continu
t k e
Class Def If elif
Else Del Try except
Raise For Whil pass
e
Import Finall Fro as
y m
Lambd Retur With in
a n
Yield As Is global
Nonloc
al

Table -1

33
d. Decision making
i. if conditional statement: -

Fig - 26
ii. nested if: -

Fig -27

34
iii. if elif else ladder

Fig – 28
e. Loops in Python
1. while loop:

35
Fig -29
2. for loop:
for iterator_var in sequence:
statements (s)
Fig -30
Functions in python: -
A function is an organized reusable code which performs and defines
certain form of task.
Syntax: -
def functionname(parameters):
“function expression task and operation”
return [expression]

36
Python NumPy
NumPy is a general-purpose array-processing package. It provides a
high-performance multidimensional array object, and tools for working
with these arrays. It is the fundamental package for scientific computing
with Python. Python we have lists that serve the purpose of arrays, but
they are slow to process. NumPy aims to provide an array object that is
up to 50x faster than traditional Python lists. NumPy arrays are stored at
one continuous place in memory unlike lists, so processes can access and
manipulate them very efficiently. This behavior is called locality of
reference in computer science. This is the main reason why NumPy is
much faster than lists. Also it is optimized to work with latest CPU
architectures. Some python distribution already have NumPy installed
like, Anaconda, and Spyder etc.

Arrays in NumPy
Array in Numpy is a table of elements (usually numbers), all of the same
type, indexed by a tuple of positive integers. In Numpy, number of
dimensions of the array is called rank of the array. A tuple of integers
giving the size of the array along each dimension is known as shape of
the array. An array class in Numpy is called as ndarray. Arrays in
Numpy can be created by multiple ways, with various number of Ranks,
defining the size of the Array. Arrays can also be created with the use of
various data types such as lists, tuples, etc.

37
Import NumPy

Fig 31
Simple Array program :-

Fig- 32

38
Pandas Tutorial
Pandas is an open-source library that is built on top of NumPy library. It
is a Python package that offers various data structures and operations for
manipulating numerical data and time series. It is mainly popular for
importing and analyzing data much easier. Pandas is an open-source
library that is made mainly for working with relational or labeled data
both easily and intuitively. It provides various data structures and
operations for manipulating numerical data and time series. It is a high-
level data manipulation tool developed by Wes McKinney. It is built on
the Numpy package and its key data structure is called the Data Frame.
Data Frames allow you to store and manipulate tabular data in rows of
observations and columns of variables.

Key Features of Pandas

1. Fast and efficient DataFrame object with default and customized
indexing.
2. Tools for loading data into in-memory data objects from different
file formats.
3. Data alignment and integrated handling of missing data.
4. Reshaping and pivoting of date sets.
5. Label-based slicing, indexing and sub-setting of large data sets.
6. Columns from a data structure can be deleted or inserted.
7. Group by data for aggregation and transformations.
8. High performance merging and joining of data.
9. Time Series functionality.
Note:-Anaconda Python package, Pandas will be installed by default.

39
We import pandas in anaconda as:-

Fig-33

Matplotlib
Matplotlib is a plotting library for the Python programming language
and its numerical mathematics extension NumPy. It provides an object-
oriented API for embedding plots into applications using general-purpose
GUI toolkits like Tkinter, wxPython, Qt, or GTK+. There is also a
procedural "pylab" interface based on a state machine (like OpenGL),
designed to closely resemble that of MATLAB, though its use is
discouraged
matplotlib.pyplot is a collection of functions that make matplotlib work
like MATLAB. Each pyplot function makes some change to a figure: e.g.,
creates a figure, creates a plotting area in a figure, plots some lines in a
plotting area, decorates the plot with labels, etc. Matplotlib was originally
written by John D. Hunter.

40
Example on matplotlib:-

Fig-33

41
BASIC CONSTRUCTION OF
RECOMMENDER SYSTEM
Here we create a hybrid based filtering basic recommender system using
python and its library. Here we collect the dataset and normalized the
data.

STEP 1:- Collection of The Dataset.

Download the data set from the following link as shown:-
https://www.kaggle.com/tmdb/tmdb-movie-metadata

Fig 34

42
Now we can see that the dataset has been downloaded as csv format.

Fig- 35

Step 2:- Now Open Anaconda Command Prompt For

Opening Jupyter Notebook.

43
Fig-36
Now write the following command to open jupyter notebook.
“jupyter notebook” and press enter.

Fig-37
Now we can see that notebook has started:-

Fig-38
44
Step 3:- start python program.
1. Click on “NEW”.

CLICK ON
IT

Fig-39
2. After clicking on new click on python 3 as shown:

CLICK ON IT

Fig- 40

45
3. Then we get the following:-

Fig-41

STEP 4: IMPORT PANDAS AND NUMPY.

As we import the NumPy and Pandas we also link the dataset.

Fig- 42

46
Step 5: Output of Movie Dataset and Credit Dataset.

Fig-43 (credit set)

47
Fig-44(movie dataset)

Step 6: Shape of The Dataset (Describe Nos of

Columns And Rows)

Fig- 45

Step 7: Merging of Both the Table.

As the both table contains same contents so we will merge them.

Fig-46(input)

48
Fig 47(output)

49
Step 8: We Will Remove Unwanted Data From The
Merged Table.

Fig- 48

Step 9: Check The Relevant Data Present or Not.

50
Fig -49

Step10: Weighted Hybrid Based Filtering Construction.

Fig-50 (formula)

Step 11: Calculate Mean of Voting Average And

Percentage Of Vote Count.

Fig -51
51
Step 12: Calculate Weight.

Fig -52

Step 13: Output of The Movie Dataset.

52
Fig -53

Step 13: Arrange in Ascending Order.

Fig- 54

Step 14: Use of Matplotlib to Get Graphical Data.

53
Fig -55

Graph -1

54
Step 15: Using Sklearn and MaxMinscaler We will
Normalized Data to Reduce the Gap Between Them.

Fig- 56

55
Benefits in Recommender System
1. Benefits of recommender systems are:
2. Revenue — past years, many researchers have studied and generate
many algorithms to learn increasing rate for an online customer like
Amazon site. Also, These algorithms study the difference between
shopping online sites with others using recommender systems for items
to increase revenue by increasing the number of sales.
3. Client Satisfaction — many times customers tend to expect to see near
similar product recommendation from their last browsing search on the
site. Mainly because they believe they will get more serious chances
for better products. When they leave the situation and get back
afterward; it would assist if their browsing data from the previous
shopping or viewing product list. This could further facilitate and guide
their e-Commerce activities, similar two experienced assistants. This
case of client satisfaction contributes to client retention.
4. Personalization — we often get recommendations from our friends.
They recognize what we like better than anyone else. This is the only
reason they are adept at recommending things and is what
recommendation systems try to model. You can utilize the data
collected indirectly to improve your website’s overall services and
assure that they are suitable according to a user’s preference.
5. Discovery — people need to be recommended items they would like or
prefer, and when they find a web page for shopping or movie, songs,
etc. meet their hopes they bound to visit this site again.
6. Provide Reports — is an integral piece of a personalization scheme.
Making the client accurate and up to the minute, reporting allows him
to make strong conclusions about his site and the management of a
movement. Founded on these reports clients can get offers for slow-
moving products in order to make a drive in sales.

56
CONCLUSION
A recommender system has been a hot topic for a long time. They are
simple algorithms which aim to provide the most relevant and accurate
items to the user by filtering useful stuff from of a huge pool of
information base. Recommender engines discovers data patterns in the
data set by learning consumer’s choices and produces the outcomes that
co-relates to their needs and interests.

57
Assignment Questions
1. What is data science?
2.What is data mining?
3. Define machine learning?
4. Define information filtering?
5. Types of information filtering?
6. Define recommender system.
7.Why recommender system?
8. Benefits of recommender system.
9. Difference between data mining and data filtering?
10. Give differences between data science and machine learning?
11. What are the problems faced in recommender system?
12. Different way of creating recommender system?
13. Different algorithm used in creating recommender system?
14. Give a summary on few algorithm?
15. What are the security problem faced by recommender system?
16. State different types of learning?
17. State different types of feedback?
18. Why matplotlib is used?
19. Create a graphical representation showing annual growth in
blockbuster movies using python.
20. Create a collaborative model using python using any data set.
21. Using any language of your choice create a basic recommender system.

Recommendation System Final
No ratings yet
Recommendation System Final
16 pages
Project Report On Recommendation System
100% (4)
Project Report On Recommendation System
26 pages
KEND Maintain Realignments
No ratings yet
KEND Maintain Realignments
32 pages
Repair ABS Unit Passat
100% (3)
Repair ABS Unit Passat
13 pages
Recommendation Systems: Department of Computer Science Engineering University School of Information and Technology
No ratings yet
Recommendation Systems: Department of Computer Science Engineering University School of Information and Technology
6 pages
Welcome 1
No ratings yet
Welcome 1
9 pages
Module 5
No ratings yet
Module 5
50 pages
Culbert
No ratings yet
Culbert
19 pages
Study of Recommender Systems Techniques: Shreya Gangan, Khyati Pawde, Niharika Purbey, Prof. Sindhu Nair
No ratings yet
Study of Recommender Systems Techniques: Shreya Gangan, Khyati Pawde, Niharika Purbey, Prof. Sindhu Nair
4 pages
Paper 3 RecommendationSystemsTechniquesChallengesApplicationsandEvaluations
No ratings yet
Paper 3 RecommendationSystemsTechniquesChallengesApplicationsandEvaluations
15 pages
UNIT 1
No ratings yet
UNIT 1
9 pages
Recommender Systems
No ratings yet
Recommender Systems
23 pages
Bda - M 5
No ratings yet
Bda - M 5
14 pages
Recommender Systems Asanov
No ratings yet
Recommender Systems Asanov
7 pages
Ai Document
No ratings yet
Ai Document
11 pages
What Is A Recommender System
No ratings yet
What Is A Recommender System
3 pages
Machine Learning Paradigms - Applications in Recommender Systems
No ratings yet
Machine Learning Paradigms - Applications in Recommender Systems
135 pages
SMA_CH-5
No ratings yet
SMA_CH-5
35 pages
Movie Recommendation System Using Simple Recommender-Based Approach
No ratings yet
Movie Recommendation System Using Simple Recommender-Based Approach
4 pages
Automated Online Course Recommendation System Using Collaborative Filtering
No ratings yet
Automated Online Course Recommendation System Using Collaborative Filtering
10 pages
Final Report 18.7.24
No ratings yet
Final Report 18.7.24
26 pages
A Seminar Report (Updated)
No ratings yet
A Seminar Report (Updated)
23 pages
Recommendation System
No ratings yet
Recommendation System
5 pages
1670-Article Text-6378-1-10-20220701
No ratings yet
1670-Article Text-6378-1-10-20220701
8 pages
Module 1
No ratings yet
Module 1
105 pages
Recommendation System
No ratings yet
Recommendation System
19 pages
Internship Report
No ratings yet
Internship Report
26 pages
Sinha-Dhanalakshmi2019 Article EvolutionOfRecommenderSystemOv PDF
No ratings yet
Sinha-Dhanalakshmi2019 Article EvolutionOfRecommenderSystemOv PDF
20 pages
Recommender System in Ai
No ratings yet
Recommender System in Ai
7 pages
RS Unit - I
No ratings yet
RS Unit - I
47 pages
Recommender System Using Collaborative Filtering and Demographic Characteristics of Users
No ratings yet
Recommender System Using Collaborative Filtering and Demographic Characteristics of Users
7 pages
Getting Information Off The Internet Is Like Taking A Drink From A Fire Hydrant!
No ratings yet
Getting Information Off The Internet Is Like Taking A Drink From A Fire Hydrant!
22 pages
Movie Recommender System: Shekhar 20BCS9911 Sanya Pawar 20BCS9879 Tushar Mishra 20BCS9962
No ratings yet
Movie Recommender System: Shekhar 20BCS9911 Sanya Pawar 20BCS9879 Tushar Mishra 20BCS9962
27 pages
Recommendation in Social Media: Recommender System
No ratings yet
Recommendation in Social Media: Recommender System
29 pages
Social Information Filtering_Unit V
No ratings yet
Social Information Filtering_Unit V
78 pages
Topic:-Product Recommendation System Using Machine Learning
No ratings yet
Topic:-Product Recommendation System Using Machine Learning
26 pages
Jannach Et Al. - 2016 - Recommender Systems - Beyond Matrix Completion
No ratings yet
Jannach Et Al. - 2016 - Recommender Systems - Beyond Matrix Completion
9 pages
1.2
No ratings yet
1.2
2 pages
GROUP 7 Recommendation System
No ratings yet
GROUP 7 Recommendation System
15 pages
Comparative_analysis_of_recommendation_system
No ratings yet
Comparative_analysis_of_recommendation_system
6 pages
Recommended Systems
No ratings yet
Recommended Systems
14 pages
Emotion News Recommendation System
No ratings yet
Emotion News Recommendation System
5 pages
Recommender Systems: A Project Report Submitted in Partial Fulfillment of Requirement For The Award in The Degree of
No ratings yet
Recommender Systems: A Project Report Submitted in Partial Fulfillment of Requirement For The Award in The Degree of
33 pages
1.+Overview_+Recommender+Systems
No ratings yet
1.+Overview_+Recommender+Systems
9 pages
HST-0621-577
No ratings yet
HST-0621-577
6 pages
Unit v Chapter II
No ratings yet
Unit v Chapter II
22 pages
Cloud Computing Report
No ratings yet
Cloud Computing Report
38 pages
RecommenderSystemsAnIntroduction PDF
100% (3)
RecommenderSystemsAnIntroduction PDF
353 pages
Final Year Project (Product Recommendation)
No ratings yet
Final Year Project (Product Recommendation)
33 pages
Review of Clustering-Based Recommender Systems
No ratings yet
Review of Clustering-Based Recommender Systems
22 pages
TECHNICAL+NOTE Recommender+Systems+v.27
No ratings yet
TECHNICAL+NOTE Recommender+Systems+v.27
16 pages
RS Module1
No ratings yet
RS Module1
24 pages
419623731-Project-Report-on-Recommendation-System (1)
No ratings yet
419623731-Project-Report-on-Recommendation-System (1)
26 pages
Application of Data
No ratings yet
Application of Data
1 page
Data Analytics. Fast Overview.
From Everand
Data Analytics. Fast Overview.
George Letton
2.5/5 (18)
M21DGS323 - 2610 - 02
No ratings yet
M21DGS323 - 2610 - 02
77 pages
Paper 23-An Automated Recommender System For Course Selection
No ratings yet
Paper 23-An Automated Recommender System For Course Selection
10 pages
Report Movie Recommendation
No ratings yet
Report Movie Recommendation
49 pages
A Survey on the Generation of Recommender
No ratings yet
A Survey on the Generation of Recommender
10 pages
Ijaret: International Journal of Advanced Research in Engineering and Technology (Ijaret)
No ratings yet
Ijaret: International Journal of Advanced Research in Engineering and Technology (Ijaret)
8 pages
Machine Learning for the Web
From Everand
Machine Learning for the Web
Andrea Isoni
No ratings yet
"Big Data Science" Basic Concepts and Applications
From Everand
"Big Data Science" Basic Concepts and Applications
Sukanta Bhattacharya
No ratings yet
On The Cover: June 2002, Volume 8 Number 6
No ratings yet
On The Cover: June 2002, Volume 8 Number 6
55 pages
CakePHP Aggressive Security
No ratings yet
CakePHP Aggressive Security
10 pages
Gym Management System
No ratings yet
Gym Management System
67 pages
MG2.1 U1 Revision T
No ratings yet
MG2.1 U1 Revision T
1 page
Algorithmic and Advanced Programming in Python - Syllabus in Computer Science, Decision Making & Data - Masterclass 2
No ratings yet
Algorithmic and Advanced Programming in Python - Syllabus in Computer Science, Decision Making & Data - Masterclass 2
43 pages
Thesis On Hostel Management System
100% (3)
Thesis On Hostel Management System
7 pages
SLA Document
No ratings yet
SLA Document
14 pages
MA400: Financial Mathematics: Introductory Course
100% (1)
MA400: Financial Mathematics: Introductory Course
24 pages
Business Requirements Document Template V1.0
No ratings yet
Business Requirements Document Template V1.0
5 pages
Capstone Proposal
No ratings yet
Capstone Proposal
16 pages
Instant Download Expert Scripting and Automation for SQL Server DBAs 1st Edition Peter A. Carter (Auth.) PDF All Chapters
100% (2)
Instant Download Expert Scripting and Automation for SQL Server DBAs 1st Edition Peter A. Carter (Auth.) PDF All Chapters
50 pages
Waste Water Discharge Permit (Wwdp) Emb Region 8
No ratings yet
Waste Water Discharge Permit (Wwdp) Emb Region 8
1 page
SWM0109 Secure Integration of SCADA Third Party Equipment With G500 V100 R0
No ratings yet
SWM0109 Secure Integration of SCADA Third Party Equipment With G500 V100 R0
41 pages
HTML Aastha File
No ratings yet
HTML Aastha File
54 pages
19CS050 Shegar Dipti Sunil DSA Journal
No ratings yet
19CS050 Shegar Dipti Sunil DSA Journal
131 pages
Altium Designer-Essentials Course 2014
No ratings yet
Altium Designer-Essentials Course 2014
4 pages
Solidity Programming Essentials: A guide to building smart contracts and tokens using the widely used Solidity language 2nd Edition Ritesh Modi instant download
No ratings yet
Solidity Programming Essentials: A guide to building smart contracts and tokens using the widely used Solidity language 2nd Edition Ritesh Modi instant download
49 pages
26 Binoculars
No ratings yet
26 Binoculars
18 pages
CTS Informatica Interview Question Answers
100% (2)
CTS Informatica Interview Question Answers
2 pages
ArubaOS 6.3.1.15 Release Notes
No ratings yet
ArubaOS 6.3.1.15 Release Notes
203 pages
FS - MM-En-023 - Service PO For Transporters - Contractors and Other Overheads Pricing - v0.1
No ratings yet
FS - MM-En-023 - Service PO For Transporters - Contractors and Other Overheads Pricing - v0.1
8 pages
9780137
No ratings yet
9780137
89 pages
Prelims in Assessment
No ratings yet
Prelims in Assessment
6 pages
CSRF
No ratings yet
CSRF
11 pages
Activity 2.1
No ratings yet
Activity 2.1
1 page
Auto TR TO Configuration (Warehouse 206) Quick Guide
No ratings yet
Auto TR TO Configuration (Warehouse 206) Quick Guide
20 pages
Integration of SCADA System With Existing PLC Setup To Control Induction Motor Speed
No ratings yet
Integration of SCADA System With Existing PLC Setup To Control Induction Motor Speed
27 pages

Lab manual on recommender system

Uploaded by

Lab manual on recommender system

Uploaded by

National Institute of Technical

Teachers Training and Research

Online stores Product

Bills and Email

Content Based Filtering:-

Pros and Cons of Content-Based Filtering Techniques

Pros and Cons of Collaborative Filtering Techniques

This refers to a situation where a recommender does not have adequate

This is the problem that occurs as a result of lack of enough information,

This is another problem associated with recommendation algorithms

Synonymy is the tendency of very similar items to have different names

The cascade hybridization technique applies an iterative refinement

Phases of Recommender System

Information Collection Phase:-

c. Now click on next and then on “I agree” followed by just me option

code for building large applications.

dynamic type checking.

 It can be easily integrated with C, C++, COM, ActiveX, CORBA,

ii. For taking input from the user: -

c. Tables of key words in Python: -

true False none and

Key Features of Pandas

STEP 1:- Collection of The Dataset.

Step 2:- Now Open Anaconda Command Prompt For

STEP 4: IMPORT PANDAS AND NUMPY.

Fig-43 (credit set)

Step 6: Shape of The Dataset (Describe Nos of

Step 7: Merging of Both the Table.

Step 9: Check The Relevant Data Present or Not.

Step10: Weighted Hybrid Based Filtering Construction.

Step 11: Calculate Mean of Voting Average And

Step 13: Output of The Movie Dataset.

Step 13: Arrange in Ascending Order.

Step 14: Use of Matplotlib to Get Graphical Data.

You might also like