0% found this document useful (0 votes)

8 views17 pages

1 s2.0 S0022000014001706 Main

Uploaded by

miratechos68

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views17 pages

1 s2.0 S0022000014001706 Main

Uploaded by

miratechos68

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 17

Journal of Computer and System Sciences 81 (2015) 717–733

Contents lists available at ScienceDirect

Journal of Computer and System Sciences

www.elsevier.com/locate/jcss

SVD-based incremental approaches for recommender systems

Xun Zhou a,d , Jing He b , Guangyan Huang c , Yanchun Zhang b,d,∗
a
UCAS-VU Joint Lab for Social Computing and E-Health Research, University of Chinese Academy of Sciences, Beijing, China
b
Centre for Applied Informatics, College of Engineering and Science, Victoria University, Australia
c
School of Information Technology, Deakin University, Australia
d
School of Computer Science, Fudan University, Shanghai, China

a r t i c l e i n f o a b s t r a c t

Article history: Due to the serious information overload problem on the Internet, recommender systems
Received 16 July 2014 have emerged as an important tool for recommending more useful information to users
Received in revised form 30 October 2014 by providing personalized services for individual users. However, in the “big data”
Accepted 6 November 2014
era, recommender systems face signiﬁcant challenges, such as how to process massive
Available online 16 December 2014
data eﬃciently and accurately. In this paper we propose an incremental algorithm
Keywords: based on singular value decomposition (SVD) with good scalability, which combines the
Singular value decomposition Incremental SVD algorithm with the Approximating the Singular Value Decomposition
Incremental algorithm (ApproSVD) algorithm, called the Incremental ApproSVD. Furthermore, strict error analysis
Recommender system demonstrates the effectiveness of the performance of our Incremental ApproSVD algorithm.
Experimental evaluation We then present an empirical study to compare the prediction accuracy and running time
between our Incremental ApproSVD algorithm and the Incremental SVD algorithm on the
MovieLens dataset and Flixster dataset. The experimental results demonstrate that our
proposed method outperforms its counterparts.
© 2014 Elsevier Inc. All rights reserved.

1. Introduction

1.1. Background

With the popularity of the Internet and advances in information technology, information from websites tends to be
too general and people require more personalized information. In order to meet users’ demand for personalized services,
personalized recommender systems are a powerful tool to solve the information overload problem. Collaborative filtering is
one of the most important techniques used in recommender systems. Its principle is to recommend likely new information
to an active user by considering other similar users’ interests. It is based on the assumption that if two users have similar
interests then the two users will probably share the same information. The advantages of collaborative filtering are as
follows: first, it is independent of the contents of recommended items; second, it can be closely integrated with social
networks; third, it has good accuracy in terms of recommendations.
The common challenge of collaborative filtering and other types of recommender systems is how to deal with massive
data to make accurate recommendations. There are three difficulties [1]: (1) the huge amount of data, which requires the
algorithm to respond quickly; (2) the sparsity of data, the ratings provided by the users or information which can be used to
indicate interests are actually very sparse, compared with the large number of users and items in a recommender system;
(3) the dynamic nature of data, which requires the algorithm to update quickly and accurately. The recommender system

* Corresponding author at: Centre for Applied Informatics, College of Engineering and Science, Victoria University, Australia.
E-mail address: [email protected] (Y. Zhang).

http://dx.doi.org/10.1016/j.jcss.2014.11.016
0022-0000/© 2014 Elsevier Inc. All rights reserved.
718 X. Zhou et al. / Journal of Computer and System Sciences 81 (2015) 717–733

constantly acquires new data, and the active user’s interests and concerns are constantly changing. In other words, when
the user is active, such as browsing, clicking, rating, the training data is growing significantly. Therefore, these changes must
be considered in the recommender system.
To solve the first two difficulties, many clustering or data dimensionality reduction methods have been proposed, in-
cluding the clustering-based model, such as K-means clustering [2], minimum spanning tree [3], partition around medoids
(PAM) [4], and so on; the matrix factorization-based model, such as singular value decomposition (SVD) [5], non-negative
matrix factorization (NMF) [6], and so on; and the co-clustering method [1,7–9], which can cluster multiple dimensions si-
multaneously. These methods can effectively reduce the sparsity of data and reduce the consumption of online computation
while improving accuracy, compared with the traditional method which calculates the similarity between users or items
directly. However, the above methods have a disadvantage that they are offline computations, which cannot handle online
and dynamic problems efficiently, so they are not adequate solutions to the aforementioned third difficulty.
To demonstrate the importance of online and dynamic computations for a recommender system, we give an example
to illustrate this. For example, when a user enters a movie rating recommender system, which includes some ratings of
movies rated by users, then users, movies and ratings are static in a recommender system. However, there are many new
movies released every day that are constantly put into the recommender system, so we would like to predict how a user
would rate a new movie in order to recommend the movies he/she might prefer. If a user can often obtain some good
movie recommendations from a recommender system, he/she will trust this recommender system and will be happy to
interact with the recommender system frequently. In this case, valid ratings from that active user in recommender system
are growing, so that recommender systems can give more accurate predictions to a user. The above is a positive circle.
In this paper, we apply an incremental approach for the following reasons. If we simply replace the offline process with
an online process, we should compute the singular value decomposition of some original unchanged matrix repeatedly each
time. With the incremental approach, we only need to compute the singular value decomposition of the incremental part
based on the singular value decomposition of the previous matrix, which can solve the problem of computational efficiency.
Actually, the incremental approach is a good solution to resolve the aforementioned first and third difficulty.

1.2. Contributions

In the face of constantly updated information and changing users’ interests, this paper proposes an incremental algorithm
based on SVD called the Incremental ApproSVD, which integrates the new Incremental SVD algorithm with our previous
ApproSVD algorithm [10].
How to conduct the Incremental ApproSVD algorithm for a recommender system is detailed below. The original static
user-movie-rating data can be converted into a user-movie rating matrix B 1 , and the newly entered user-movie-rating data
can be converted into an added user-movie rating matrix B 2 . Therefore, the practical problem is transformed into a matrix
updating problem. First, the B 1 , B 2 are considered as input in the Incremental ApproSVD algorithm. For B 1 , we extract a
constant number of columns from B 1 , and scale the columns appropriately to form a relatively smaller matrix C 1 . For B 2 ,
the process of forming a relatively smaller matrix C 2 is similar to that of C 1 . Second, we consider two matrices C 1 and C 2 as
input and implement the Incremental SVD algorithm, which computes the SVD of [C 1 , C 2 ] based on the SVD of C 1 . Finally,
we obtain the left singular vectors of [C 1 , C 2 ], which are good approximations to the left singular vectors of [ B 1 , B 2 ]. To
predict a user rating of a particular movie, we multiply the user preference vector by the movie feature vector, and obtain
a predicted rating to indicate how much the user likes that movie. If the predicted rating is in the upper half of the rating
range, the recommender system will recommend this particular movie to the user. If the predicted rating is in the lower half
of the rating range, the recommender system will not recommend this particular movie to the user. Moreover, Section 4.3
conducts the error analysis and gives the upper bound. It is crucial that an appropriate number of columns of B 1 and B 2
are selected for sampling so as to reduce the upper bound.
Fig. 1 shows an example of the recommendation procedure based on the Incremental ApproSVD algorithm. The movie
“Avatar” is a new movie entering a user-movie-rating recommender system, and we would like to predict whether Tom
likes “Avatar”. After implementing the Incremental ApproSVD algorithm, we can see that the predicted rating from Tom
on “Avatar” is 2.4, which is in the lower half of the rating range [1, 5]. Therefore, the recommender system would not
recommend “Avatar” to Tom.
The reason why the Incremental ApproSVD algorithm adopts column sampling to reduce the column number is that, after
sampling some columns of the original rating matrix based on an appropriate sampling probability, the most representative
columns are kept in the original rating matrix, which can effectively reduce the size of the original matrix. However, the
user-movie-rating recommender system is growing fast, and many new movies enter the recommender system every day,
so we should consider how to predict the new movies to a target user in order to keep the continuity of recommendation,
and not just predict existing movies.
The contributions of this paper include the following three aspects: (1) it proposes an incremental algorithm, Incremental
ApproSVD, which can predict unknown ratings when new items are dynamically entering a recommender system; (2) one
of the most important features of our algorithm is that it can be easily realized, because it is a suboptimal approximation
with lower running time compared with the Incremental SVD; (3) it gives the upper bound of error between the actual
ratings and the predicted ratings generated by Incremental ApproSVD. Experiments show the advantages of our algorithm
under different parameters on the MovieLens dataset and Flixster dataset.
X. Zhou et al. / Journal of Computer and System Sciences 81 (2015) 717–733 719

Fig. 1. Instance chart of recommendation procedure based on Incremental ApproSVD algorithm.

1.3. Organization

The remainder of this paper is organized as follows. First, Section 2 provides an overview of work related to the develop-
ment of the incremental SVD and ApproSVD algorithms. Some preliminaries are introduced in Section 3. Section 4 presents
the construction of two incremental algorithms, and gives the error analysis for the Incremental ApproSVD algorithm. Ex-
perimental evaluation results are reported in Section 5. Section 6 concludes the paper.

2. Related work

SVD is a basic mathematical method in data mining. SVD is usually calculated by batch, and the time complexity is
O (m2 n + n3 ) [11] (m, n are the row size and column size of a matrix, respectively), meaning that all data must be processed
immediately. Therefore, it is not feasible for very large dataset. Lanczos proposed an SVD method whose time complexity
is O (mnr ) [11], where r is the rank of SVD. However, the Lanczos method requires the value of r in advance, and it is not
accurate for small singular values [12,13].
In the last three decades, many scholars undertook research on how to update SVD by adding rows or columns
[12,14–19]. SVD updating methods are mostly based on the Lanczos method, using Eq. (1) below. All uppercase letters
in (1) denote general matrices and I denote the identity matrix. Zha and Simon [16] also apply Eq. (1), but the update is
approximated and requires a dense SVD. Chandrasekaran et al. [20] use a similar method, however, their update is limited to
a single vector and is vulnerable to a loss of orthogonality. Levy and Lindenbaum [21] exploit the relationship between the
QR decomposition and the SVD to incrementally compute the left singular vectors in O (mnr 2 ) time complexity. However,
this is also vulnerable to a loss of orthogonality and results have only been reported for matrices having a few hundred
columns
T
diag(s) L V 0
[U , J ]
0 K 0 I
T
diag(s) U T C V 0
= U , I − U U T C /K
0 K 0 I
T

= U diag(s) V , C = [ M , C ]. (1)
720 X. Zhou et al. / Journal of Computer and System Sciences 81 (2015) 717–733

In previous work [10], we put forward an approximating SVD algorithm called ApproSVD, which combines the dimension
reduction technique of SVD with the approximate method. The trick behind the ApproSVD algorithm is to sample some rows
of a user-item matrix, rescale each row by an appropriate factor to form a relatively smaller matrix, and then reduce the
dimensionality of the smaller matrix. However, ApproSVD cannot dynamically process growing massive data. Therefore, we
will propose an incremental algorithm in Section 4.2 to solve this problem.

3. Preliminaries

3.1. Singular value decomposition (SVD)

SVD is a matrix factorization technique commonly used for producing low-rank approximations. Given a matrix A ∈ Rm×n
with rank( A ) = r, the Singular Value Decomposition of A is deﬁned as the following:

A = U SV T , (2)
where U ∈ Rm×m , V ∈ Rn×n and S ∈ Rm×n . The matrices U , V are orthogonal, with their columns being the eigenvectors
of A A T and A T A, respectively. The middle matrix S is a diagonal matrix with r nonzero elements, which are the singular
values of A. Therefore, the effective dimensions of these three matrices U , S and V are m × r, r × r and n × r, respectively.
The initial diagonal r elements (σ1 , σ2 , . . . , σr ) of S have the property that σ1 ≥ σ2 ≥ . . . ≥ σr > 0.
An important property of SVD, which is particularly useful in recommender system, is that it can provide the optimal
approximation to the original matrix A using three smaller matrices multiplication. By keeping the ﬁrst k largest singular
values in S and the remaining smaller ones set to zero, we denote this reduced matrix by S k . Then by deleting the corre-
sponding columns of U and V , which are the last r − k columns of U and V , we denote these two reduced matrices by U k
and V k , respectively. The truncated SVD is represented as

Ak = U k S k V k T , (3)
which is the closest rank-k approximation to the original matrix A for any unitarily invariant norm [22,23].

3.2. The Eckart–Young theorem

Deﬁnition 1. Let A = (ai , j ) be an n × n square matrix. The trace of the matrix A is deﬁned to be the sum of the main
diagonal of A:

n
tr( A ) = aii . (4)
i =1

Note that the trace is a linear transformation from the space of square matrices to the real numbers. In other words, for
square matrices A and B, it is true that tr( A + B ) = tr( A ) + tr( B ).

Deﬁnition 2. The Frobenius norm · F of A ∈ Rm×n is deﬁned as the following:

A 2F = |ai j |2 = tr A A T = tr A T A . (5)
i, j

Note that the Frobenius norm satisﬁes the property A 2F = σ12 + . . . + σr2 , where σ1 , . . . , σr are the nonzero singular values
of A.

k
Theorem 1. (See Eckart and Young [24].) Let the SVD of A be given by Eq. (2). If k < r = rank( A ) and A k = i =1 σi u i v iT , then

min A − B 2F = A − A k 2F = σk2+1 + . . . + σr2 , (6)

rank( B )=k

where σi is the i-th singular value, and u i and v i are the i-th columns of U and V , respectively.

4. Incremental algorithms

According to the existing rating D of user U for item I , we suppose that:

D
R User ×Item : U × I → D . (7)
This represents a two-dimensional prediction mapping function which can be converted into a user-item matrix with rating
elements. Therefore, to calculate a prediction for any unknown rating, we can implement the matrix factorization and
dimensionality reduction techniques.
X. Zhou et al. / Journal of Computer and System Sciences 81 (2015) 717–733 721

Fig. 2. Flow chart of oﬄine and online stage of updating SVD.

In general, the entire algorithm works in two independent steps in a recommender system. The first step is the offline
process and the second step is the online execution process. The user-user similarity and item-item similarity computation
can be seen as the offline process of a collaborative filtering-based recommender system. Moreover, the actual prediction
production can be seen as the online process. Usually, it is very time-consuming to do offline computations. Compared with
the online process, the offline process is computed relatively infrequently. For example, a movie recommender system may
calculate the user-user similarity or item-item similarity only once a day or even once a week. If the user-item ratings
database is not dynamic and the user’s taste does not change greatly over a short period of time, the similarity computation
method may work well. Some scholars have demonstrated that the SVD-based dimensionality reduction algorithms can
make the similarity formation process highly scalable while producing better results in most cases [25–27]. However, the
SVD decomposition in the offline step has expensive computational requirements, which requires a running time of O (m3 )
for an m × n user-item matrix [14,28].
In order to overcome the expensive computation drawback of SVD decomposition, we introduce some incremental algo-
rithms based on SVD with reasonable prediction quality, which can reduce the running time when making predictions. The
key point of the incremental algorithms is to ensure highly scalable overall performance, which is very significant for the
growing user-item matrix.
The incremental algorithm based on SVD shown in Fig. 2 is divided into an offline procedure and an online procedure.
The offline stage is computationally intensive, and is performed only once. Finally, it can obtain three matrices U 1 , S 1 and
V 1 after performing SVD algorithm on A 1 . The online stage is performed once the new matrix A 2 enters, and also produces
three matrices U 2 , S 2 and V 2 after performing the incremental algorithm on the updated matrix [ A 1 , A 2 ], using the results
of the offline part.

4.1. Incremental SVD algorithm

Consider a rating matrix A and the SVD decomposition A = U S V T , where U ∈ Rm×m , S ∈ Rm×n and V ∈ Rn×n . Let a
matrix A 1 ∈ Rm× p be some column vectors to add to the matrix A. Therefore, we can write

A := [ A , A 1 ], (8)

where A ∈ Rm×(n+ p ) is the updated matrix based on the original matrix A. The trick behind the incremental SVD algorithm
is to take advantage of our knowledge of the SVD of A to compute the SVD of A . Then Eq. (8) can be rewritten as follows:

A = [ A, A1] = U S V T , A1

VT 0
= U S , U T A1
0 I
T

V 0
=UF
0 I

VT 0
= U U F S F V FT
0 I
T
V 0
= (U U F ) S F VF , (9)
0 I

where F := [ S , U T A 1 ], and SVD( F ) = U F S F V FT . Finally, the SVD of A can be calculated as

A = U A S A V AT , (10)
722 X. Zhou et al. / Journal of Computer and System Sciences 81 (2015) 717–733

Algorithm 1 Incremental SVD algorithm.

Input: matrices A 1 ∈ Rm×n1 , A 2 ∈ Rm×n2 ; parameter k ∈ Z + satisﬁes 1 ≤ k ≤ min{m, n1 + n2 }.
Output: matrices U k ∈ Rm×k , S k ∈ Rk×k , V k ∈ R(n1 +n2 )×k .
1: Using SVD on matrix A 1 , we get three matrices U 1 , S 1 , V 1 ;
2: F ← [ S 1 , U 1T A 2 ], using SVD to decompose F and keep the rank-k approximation, we get three matrices U F , S F , V F ;
3: b ← size ( S 1 , 2);
4: if b < n1 then
5: V k ← [ V 1 , zeros(n1 , n2 ); zeros(n2 , b), eye(n2 )] · V F ;
6: else
7: V k ← [ V 1 , zeros(n1 , n2 ); zeros(n2 , n1 ), eye(n2 )] · V F ;
8: end if
9: U k ← U 1 · U F ;
10: S k ← S F ;
√ √
11: Return user eigenvector of [ A 1 , A 2 ], i.e. U k S k (i ) and item eigenvector of [ A 1 , A 2 ], i.e. S k V kT ( j ).

V 0
where U A := U U F , S A := S F and V A := V F . The matrices U A , V A are orthogonal, which is a very important property
0 I
so that we can keep on updating the matrix A through a similar process.
Let a matrix A 2 ∈ Rm×q be some column vectors to add to the matrix A . Therefore, we can write

A := [ A , A 1 , A 2 ] = A , A 2 , (11)

where A ∈ Rm×(n+ p +q) is the updated matrix based on the matrix A . The trick behind the incremental SVD algorithm is
to take advantage of our knowledge of the SVD of A to compute the SVD of A . Then Eq. (11) can be rewritten as follows:

A = A , A 2 = U A S A V AT , A 2

V T 0
= U A S A , U TA A 2 A
0 I

V AT 0
= U A G
0 I

V T 0
= U A U G S G V GT A
0 I
T
V A 0
= (U A U G ) S G VG , (12)
0 I

where G := [ S A , U TA A 2 ], and SVD(G ) = U G S G V GT . Finally, the SVD of A can be calculated as

A = U A S A V AT , (13)
V 0
where U A := U A U G , S A := S G and V A := A V G . Notice that, the matrices U A , V A are orthogonal too.
0 I
Berry et al. [29] detail the process of the SVD-updating method and give the SVD-updating example for a term-document
matrix. Similarly, we can view a term-document matrix as a user-item matrix. SVD-updating incorporates new user or item
information into an existing recommender system using three factor matrices. That is, SVD-updating exploits the previous
singular values and singular vectors of the original user-item matrix A as an alternative to recomputing the SVD of the
updated matrix A . The update procedure of the Incremental SVD algorithm is shown in Algorithm 1.

4.2. Incremental ApproSVD algorithm

Based on the ApproSVD algorithm proposed in our previous paper [10], we combine it with the Incremental SVD and
propose the Incremental ApproSVD algorithm, which could solve the scalability problem in a recommender system. In the
Incremental ApproSVD algorithm, the most important aspect is how to choose the column sampling probabilities { p i }i =1 1
n

n2
and { p j } j =1 . In Section 5.2, when undertaking performance comparisons between two algorithms, for B 1 , we choose the
column sampling probabilities as p i = nnz( B 1 )/nnz( B 1 ), for i = 1, . . . , n1 , where nnz( B 1 ) denotes the number of nonzero
(i ) (i )

elements in the i-th column of matrix B 1 and nnz( B 1 ) denotes the number of nonzero elements in matrix B 1 ; for B 2 , we
choose the column sampling probabilities as p j = nnz( B 2 )/nnz( B 2 ), for j = 1, . . . , n2 , where nnz( B 2 ) denotes the number
( j) ( j)

of nonzero elements in the j-th column of matrix B 2 and nnz( B 2 ) denotes the number of nonzero elements in matrix B 2 .
The process of Incremental ApproSVD algorithm is described in Algorithm 2.
The ﬂow chart of the Incremental ApproSVD algorithm is shown in Fig. 3, where the marks ①, . . . , ⑩ correspond to the
step numbers in Algorithm 2.
X. Zhou et al. / Journal of Computer and System Sciences 81 (2015) 717–733 723

Algorithm 2 Incremental ApproSVD algorithm.

Input: matrices B 1 ∈ Rm×n1 , B 2 ∈ Rm×n2 ; parameters c 1 , c 2 ∈ Z + satisfy c 1 ≤ n1 , c 2 ≤ n2 ; parameter k ∈ Z + satisﬁes 1 ≤ k ≤ min{m, c 1 + c 2 }; for B 1 ,
column sampling probabilities { p i }i =1 1 satisfy p i ≥ 0 and p = 1; for B 2 , column sampling probabilities { p j } j = satisfy p j ≥ 0 and p = 1.
n n1 n2 n2
i =1 i 1 j =1 j
Output: H k ∈ Rm×k .
1: for t = 1 → c 1 do
2: Pick it ∈ 1, . . . , n1 under sampling probabilities p α , α = 1, . . . , n1 (it denotes the column index of B 1 );
C 1 ← B 1 t / c 1 p i (a column vector B 1 denotes the i-th column of B 1 );
(t ) (i ) (i )
3:
t

4: end for
5: for t = 1 → c 2 do
6: Pick jt ∈ 1, . . . , n2 under sampling probabilities p α , α = 1, . . . , n2 ( jt denotes the column index of B 2 );
C 2 ← B 2 t / c 2 p j (a column vector B 2 denotes the j-th column of B 2 );
(t ) (j ) ( j)
7:
t

8: end for
9: C 1 and C 2 can replace A 1 and A 2 as the Input of Algorithm 1. Run Algorithm 1 and skip Steps 3–8, we can get U k ;
10: Hk ← U k ;
11: Return H k ∈ Rm×k .

Fig. 3. Flow chart of Incremental ApproSVD.

4.3. Error analysis

As stated above, approximating [ B 1 , B 2 ] by [ B 1 , B 2 ]k generates an error equal to [ B 1 , B 2 ] − [ B 1 , B 2 ]k 2F =

r
t =k+1σt2 ([ B 1 , B 2 ]), where r = rank([ B 1 , B 2 ]) and σt ([ B 1 , B 2 ]) are the singular values of [ B 1 , B 2 ]. Therefore, [ B 1 , B 2 ]k
is the optimal rank-k approximation to [ B 1 , B 2 ]. We will show that, except for this error, the matrix H k H kT [ B 1 , B 2 ] has an
additional error that mainly depends on [ B 1 , B 2 ][ B 1 , B 2 ] T − [C 1 , C 2 ][C 1 , C 2 ] T F .

Theorem 2. Suppose B 1 ∈ Rm×n1 , B 2 ∈ Rm×n2 and let C 1 , C 2 , H k be constructed from the Incremental ApproSVD algorithm. Then

2 2 √
[ B 1 , B 2 ] − H k H kT [ B 1 , B 2 ] F
≤ [ B 1 , B 2 ] − [ B 1 , B 2 ]k F
+ 2 k [ B 1 , B 2 ][ B 1 , B 2 ]T − [C 1 , C 2 ][C 1 , C 2 ]T F
. (14)

Proof. Before starting the proof, let us recall some basic properties of a matrix in the following: X 2F = tr( X X T ) = tr( X T X ),
for any matrix X ; tr( X + Y ) = tr( X ) + tr(Y ), for square matrices X and Y . From the Incremental ApproSVD algorithm, we
also have H kT H k = I k .
724 X. Zhou et al. / Journal of Computer and System Sciences 81 (2015) 717–733

Therefore, [ B 1 , B 2 ] − H k H kT [ B 1 , B 2 ]2F can be expressed as

2
[ B 1 , B 2 ] − H k H kT [ B 1 , B 2 ] F
T
= tr [ B 1 , B 2 ] − H k H kT [ B 1 , B 2 ] [ B 1 , B 2 ] − H k H kT [ B 1 , B 2 ]

= tr [ B 1 , B 2 ]T [ B 1 , B 2 ] − 2[ B 1 , B 2 ]T H k H kT [ B 1 , B 2 ] + [ B 1 , B 2 ]T H k H kT H k H kT [ B 1 , B 2 ]

= tr [ B 1 , B 2 ]T [ B 1 , B 2 ] − tr [ B 1 , B 2 ]T H k H kT [ B 1 , B 2 ]
2 2
= [B1, B2] F
− [ B 1 , B 2 ]T H k F
. (15)
k
The error between [ B 1 , B 2 ] T H k 2F and t =1 σt2 ([C 1 , C 2 ]) can be produced by the following:

k

2
[ B 1 , B 2 ]T H k − 2
σt [C 1 , C 2 ]
F
t =1
12
√
k

≤ k [ B 1 , B 2 ]T ht 2 − σ 2 [C 1 , C 2 ] 2
t
t =1
12
√
k

= k [ B 1 , B 2 ]T ht 2 − [C 1 , C 2 ]T ht 2 2
t =1
12
√
k
t 2
tT T T
= k h [ B 1 , B 2 ][ B 1 , B 2 ] − [C 1 , C 2 ][C 1 , C 2 ] h
t =1
√
≤ k [ B 1 , B 2 ][ B 1 , B 2 ] T − [C 1 , C 2 ][C 1 , C 2 ] T F
. (16)
k k
t ([C 1 , C 2 ]) t ([ B 1 , B 2 ])
2 2
The error between σ and
t =1 t =1 σ can also be produced by the following:
k
2
k

2
σt [C 1 , C 2 ] − σt [ B 1 , B 2 ]

t =1 t =1
12
√
k
2
≤ k σ [C 1 , C 2 ] − σt2 [ B 1 , B 2 ]
t
2

t =1
12
√
k
2
T T
= k σt [C 1 , C 2 ][C 1 , C 2 ] − σt [ B 1 , B 2 ][ B 1 , B 2 ]
t =1
12
√
m

T T 2
≤ k σt [C 1 , C 2 ][C 1 , C 2 ] − σt [ B 1 , B 2 ][ B 1 , B 2 ]
t =1
√
≤ k [C 1 , C 2 ][C 1 , C 2 ] T − [ B 1 , B 2 ][ B 1 , B 2 ] T F
. (17)
k
Combining the results of inequality (16) with (17), the error between [ B 1 , B 2 ] T H k 2F and t =1 σt2 ([ B 1 , B 2 ]) can be pro-
duced by the following:

k
√
2
[ B 1 , B 2 ]T H k − σ [ B 1 , B 2 ] ≤ 2 k [ B 1 , B 2 ][ B 1 , B 2 ]T − [C 1 , C 2 ][C 1 , C 2 ]T
2
. (18)
F t
F
t =1
According to (18), formula (15) can be rewritten and yields formula (14) by the following:
2
[ B 1 , B 2 ] − H k H kT [ B 1 , B 2 ] F
2 2
= [ B 1 , B 2 ] F − [ B 1 , B 2 ]T H k F
k
k

2 2 2
≤ [B1, B2] F − σt [ B 1 , B 2 ] + σt2 [ B 1 , B 2 ] − [ B 1 , B 2 ]T H k F

t =1 t =1
2 √
≤ [ B 1 , B 2 ] − [ B 1 , B 2 ]k F + 2 k [ B 1 , B 2 ][ B 1 , B 2 ]T − [C 1 , C 2 ][C 1 , C 2 ]T F
. (19)
Now, we complete the proof. 2
X. Zhou et al. / Journal of Computer and System Sciences 81 (2015) 717–733 725

Fig. 4. Five-fold cross validation.

From Theorem 2, the error analysis shows that when parameter k grows bigger, the ﬁrst part [ B 1 , B 2 ] − [ B 1 , B 2 ]k 2F
√
will reduce and the second part 2 k[ B 1 , B 2 ][ B 1 , B 2 ] T − [C 1 , C 2 ][C 1 , C 2 ] T F will increase which are on the right side
of inequality (14). As mentioned above, [ B 1 , B 2 ]k is the optimal rank-k approximation to [ B 1 , B 2 ]. So, the error between
[ B 1 , B 2 ] and H k H kT [ B 1 , B 2 ] is mainly determined by the second part of the right side of (14).
In the Incremental ApproSVD algorithm, the most important aspect is how to choose the column sampling probabilities
{ p i }ni =1 1 and { p j }nj =
2
1
, which is described in previous Section 4.2. The column sampling probabilities can be used to choose
columns to be sampled in one pass and O (c 1 + c 2 ) additional space and time.

5. Experimental evaluations

This section describes the experimental validation of our personalized recommendation algorithm – the Incremental Ap-
proSVD algorithm. We ﬁrst propose our experimental platform – the datasets, the evaluation metric, and the computational
environment. Then, we introduce our experimental process, followed by the results and discussions.

5.1. Experimental platform

Datasets. We use two real datasets to conduct the experiment: MovieLens and Flixster.
The first dataset we used is provided by the MovieLens group [30]. MovieLens is a web-based research recommender
system which commenced in autumn 1997. Each week, hundreds of users visit MovieLens to rate and receive recommen-
dations for movies. The dataset can be converted into a user-item matrix that has 943 rows (users) and 1682 columns
(movies), in which approximately 6.3% of entries are filled. The matrix has 100,000 ratings and all unrated items have a
value of zero. The ratings range from 1 to 5, where 1 represents dislike and 5 represents a strong preference.
The second dataset we used is now publicly available at http://www.sfu.ca/~sja25/datasets/. Flixster is a dataset of movie
ratings from the Flixster commercial website [31], which contains ratings expressed by users during the period from Novem-
ber 2005 to November 2009. The dataset has 786,936 users, 48,794 items and 8,196,077 ratings. We select a small portion
of original Flixster dataset which satisfies the following two conditions at the same time: (1) keep a user who rated at
least 250 items; (2) keep an item that has at least 30 ratings. The dataset could be converted into a user-item matrix that
has 8465 rows (users), 9602 columns (movies) and 5,326,788 nonzero elements (ratings), in which approximately 6.55% of
entries are filled. Possible rating values in Flixster are 10 discrete numbers in the range [0.5, 5] with step size 0.5, where
the minimum number represents dislike and the maximum number represents strong preference.
Available rating entries in each dataset are randomly divided into five partitions for five-fold cross validation, which is
shown in Fig. 4. In other words, we randomly divide all ratings evenly into five disjoint folds and apply four folds together
to train our algorithm, and use the remaining fold as a test set to evaluate the performance. We repeat this process five
times for each dataset so that each fold is used as a test set once. Algorithms are then performed on training cases to make
predictions for test cases.
Evaluation metric. The primary purpose of recommender systems is to predict users’ underlying preferences and inter-
ests and recommend the right items to users to find their likes from a growing number of items. There are a lot of metrics
to measure various aspects of recommendation performance. Two popular metrics, Mean Absolute Error (MAE) and Root
Mean Squared Error (RMSE) are used to measure the closeness of predicted ratings to the true ratings. If r ui denotes the
true rating on item i by user u, and r̂ ui denotes the predicted rating on item i by user u, MAE and RMSE of N corresponding
rating-prediction pairs are defined as:
N
i =1 |r ui − r̂ui |
MAE = , (20)
N

N
i =1 (r ui − r̂ui )2
RMSE = . (21)
N
The smaller MAE and RMSE values correspond to the higher accuracy of the recommender system. As the error is squared
before being summed in the calculation expression of RMSE, it tends to penalize large errors more heavily.
726 X. Zhou et al. / Journal of Computer and System Sciences 81 (2015) 717–733

Table 1
Performance comparisons for the MovieLens dataset under the same k.

Incremental ApproSVD Incremental SVD

n1 n2 c1 c2 k RMSE MAE Time (sec.) n1 n2 k RMSE MAE Time (sec.)
900 100 500 50 10 0.9673 0.7686 748.37 500 50 10 0.9511 0.7530 810.88
900 100 600 50 10 0.9643 0.7658 750.51 600 50 10 0.9502 0.7522 1016.88
900 100 700 50 10 0.9612 0.7630 752.11 700 50 10 0.9500 0.7530 1038.56
900 100 800 50 10 0.9580 0.7604 756.47 800 50 10 0.9507 0.7537 1043.09

Table 2
Performance comparisons for the MovieLens dataset under different k.

Incremental ApproSVD Incremental SVD

n1 n2 c1 c2 k RMSE MAE Time (sec.) n1 n2 k RMSE MAE Time (sec.)
900 100 800 50 10 0.9580 0.7604 756.47 800 50 10 0.9507 0.7537 1043.09
900 100 800 50 100 0.9815 0.7799 765.95 800 50 100 0.9779 0.7759 1057.94
900 100 800 50 400 1.0013 0.7991 768.37 800 50 400 0.9963 0.7949 1310.70
900 100 800 50 600 1.0024 0.8003 772.22 800 50 600 0.9971 0.7956 1333.10

Environment. All codes were written in MATLAB R2013b. All our experiments were executed on a computer with a
2.90 GHz Intel(R) Core(TM) i5-4570S CPU processor and 8 GB of RAM, running on 64-bit Microsoft Windows 7 Enterprise
Edition.

5.2. Evaluation results and discussions

As described above, the most important aspect of the Incremental ApproSVD algorithm is the column sampling proce-
dure. From Theorem 1, we find that it holds no matter how the column sampling probabilities { p i }i =1 1 and { p j } j =
n n2
1
are
chosen.
In order to obtain a lower value of MAE and RMSE with less running time, we choose the column sampling probabilities
as p i = nnz( B 1 (i ) )/nnz( B 1 ), i = 1, . . . , n1 and p j = nnz( B 2 ( j ) )/nnz( B 2 ), j = 1, . . . , n2 . Then, we scale the columns prior to
including them in C 1 and C 2 . Furthermore, we compute the SVD of [C 1 , C 2 ] based on the SVD of C 1 by implementing the
Incremental SVD algorithm. Finally, we obtain the matrix H k whose columns are the left singular vectors of [C 1 , C 2 ]. So
we can predict the rating of user u on movie i by H k H kT [ B 1 , B 2 ](u , i ). We substitute 1 for the elements less than 1 and 5
for the elements more than 5 in matrix H k H kT [ B 1 , B 2 ] when conducting experiments on the MovieLens dataset, since the
ratings range from 1 to 5 in the MovieLens dataset. Moreover, we substitute 0.5 for the elements less than 0.5 and 5 for
the elements more than 5 in matrix H k H kT [ B 1 , B 2 ] when conducting experiments on the Flixster dataset, since the ratings
range from 0.5 to 5 in the Flixster dataset.
Following are the RMSE, MAE and running time for the MovieLens 100 K dataset, using five-fold cross validation, chang-
ing parameters c 1 , k for our Incremental ApproSVD and parameters n1 , k for Incremental SVD. The performance comparisons
between the two algorithms are shown in Tables 1–2.
In Table 1, the first column is the column size of original matrix B 1 , and we select the same value n1 = 900. The second
column is the column size of added matrix B 2 , and we select the same value n2 = 100. The third column is the number
of columns picked for matrix C 1 from B 1 , and we select four values c 1 = 500, 600, 700 and 800. The fourth column is the
number of columns picked for matrix C 2 from B 2 , and we select the same value c 2 = 50. The fifth column is the dimension
of eigenvector, and we select the same value k = 10. The right side of three adjacent columns are the prediction accuracy
values and running time of the Incremental ApproSVD algorithm. The ninth column is the column size of original matrix A 1 ,
and we select four values n1 = 500, 600, 700 and 800, which is the same size as that of matrix C 1 in the Incremental
ApproSVD algorithm. The tenth column is the column size of added matrix A 2 , and we select the same value n2 = 50,
which is the same size as that of matrix C 2 in the Incremental ApproSVD algorithm. The eleventh column is the dimension
of eigenvector, and we select the same value k = 10. The rightmost three columns are the prediction accuracy values and
running time of the Incremental SVD algorithm. From Table 1, by observing the performance of Incremental ApproSVD, we
can see that RMSE and MAE values decrease with the increase of c 1 . Moreover, the running time is increasingly modest. By
observing the performance of Incremental SVD, we can see that the trends of RMSE and MAE values are not significant with
the increase of n1 and the running time is increasingly sharp.
The two curves shown in Fig. 5 demonstrate the different trend of RMSE values according to constant value k = 10,
gradual increasing in c 1 for Incremental ApproSVD and n1 for Incremental SVD for the MovieLens dataset. Two parameters
c 1 and n1 denote the column size of the original matrix before performing the incremental algorithms. Therefore, in order
to make Fig. 5 clearer, the horizontal axis is labeled as “original column size” which represents c 1 for Incremental ApproSVD
and n1 for Incremental SVD.
The two curves shown in Fig. 6 demonstrate the different trend of MAE values according to constant value k = 10,
gradual increasing in c 1 for Incremental ApproSVD and n1 for Incremental SVD for the MovieLens dataset. Two parameters
X. Zhou et al. / Journal of Computer and System Sciences 81 (2015) 717–733 727

Fig. 5. RMSE values for MovieLens dataset with the change of original column size.

Fig. 6. MAE values for the MovieLens dataset with the change of original column size.

c 1 and n1 denote the column size of the original matrix before performing the incremental algorithms. Therefore, in order
to make Fig. 6 clearer, the horizontal axis is labeled as “original column size” which represents c 1 for Incremental ApproSVD
and n1 for Incremental SVD.
The two curves shown in Fig. 7 demonstrate the different trend of running time (seconds) according to constant value
k = 10, gradual increasing in c 1 for Incremental ApproSVD and n1 for Incremental SVD for the MovieLens dataset. Two
parameters c 1 and n1 denote the column size of the original matrix before performing the incremental algorithms. Therefore,
in order to make Fig. 7 clearer, the horizontal axis is labeled as “original column size” which represents c 1 for Incremental
ApproSVD and n1 for Incremental SVD.
In Table 2, the ﬁrst, the second, the fourth and the tenth columns are the same as that in the above Table 1. The
third column is the number of columns picked for matrix C 1 from B 1 , and we select the same value c 1 = 800. The ﬁfth
column is the dimension of eigenvector, and we select four values k = 10, 100, 400 and 600. The right side of the three
adjacent columns are the prediction accuracy values and running time of the Incremental ApproSVD algorithm. The ninth
column is the column size of original matrix A 1 , and we select the same value n1 = 800, which is the same size as that of
matrix C 1 in the Incremental ApproSVD algorithm. The eleventh column is the dimension of eigenvector, and we select four
values k = 10, 100, 400 and 600. The rightmost three columns are the prediction accuracy values and running time of the
Incremental SVD algorithm. From Table 2, we can see that RMSE and MAE values increase with the increase of k for both
Incremental ApproSVD and Incremental SVD. Moreover, the running time of Incremental ApproSVD is increasingly modest.
However, the running time of Incremental SVD is increasingly sharp.
728 X. Zhou et al. / Journal of Computer and System Sciences 81 (2015) 717–733

Fig. 7. Running time for the MovieLens dataset with the change of original column size.

Fig. 8. RMSE values for the MovieLens dataset with the change of k.

The two curves shown in Fig. 8 demonstrate the different trend of RMSE values according to a gradual increase in k for
the MovieLens dataset.
The two curves shown in Fig. 9 demonstrate the different trend of MAE values according to a gradual increase in k for
the MovieLens dataset.
The two curves shown in Fig. 10 demonstrate the different trend of running time according to a gradual increase in k
for the MovieLens dataset.
In short, for the same sized matrices and the same reduced dimension, although RMSE and MAE values of Incremental
ApproSVD are a little higher than that of Incremental SVD, the running time of Incremental ApproSVD is much shorter than
that of Incremental SVD. Therefore, the overall performance of Incremental ApproSVD outperforms that of Incremental SVD
for the MovieLens dataset.
Following are the RMSE, MAE and running time for the Flixster dataset, using ﬁve-fold cross validation, changing param-
eters c 1 , k for our Incremental ApproSVD and parameters n1 , k for Incremental SVD. The performance comparisons between
the two algorithms are shown in Tables 3–4.
In Table 3, all columns represent the same meanings as described by Table 1. From Table 3, by observing the performance
of Incremental ApproSVD, we can see that RMSE and MAE values decrease with the increase of c 1 . Furthermore, little change
has occurred for the running time. By observing the performance of Incremental SVD, we can see that the trends of RMSE
and MAE values are not signiﬁcant with the increase of n1 and the running time is increasingly sharp.
The two curves shown in Fig. 11 demonstrate the different trend of RMSE values according to constant value k = 10,
gradual increasing in c 1 for Incremental ApproSVD and n1 for Incremental SVD for the Flixster dataset. Two parameters c 1
X. Zhou et al. / Journal of Computer and System Sciences 81 (2015) 717–733 729

Fig. 9. MAE values for the MovieLens dataset with the change of k.

Fig. 10. Running time for the MovieLens dataset with the change of k.

Table 3
Performance comparisons for the Flixster dataset under the same k.

Incremental ApproSVD Incremental SVD

n1 n2 c1 c2 k RMSE MAE Time (sec.) n1 n2 k RMSE MAE Time (sec.)
900 100 500 50 10 0.9321 0.7212 8.03 500 50 10 0.9209 0.7091 39.52
900 100 600 50 10 0.9281 0.7173 8.18 600 50 10 0.9140 0.7038 55.82
900 100 700 50 10 0.9213 0.7111 8.30 700 50 10 0.9144 0.7041 88.36
900 100 800 50 10 0.9183 0.7084 8.54 800 50 10 0.9143 0.7040 111.89

Table 4
Performance comparisons for the Flixster dataset under different k.

Incremental ApproSVD Incremental SVD

n1 n2 c1 c2 k RMSE MAE Time (sec.) n1 n2 k RMSE MAE Time (sec.)
900 100 800 50 10 0.9183 0.7084 8.54 800 50 10 0.9143 0.7040 111.89
900 100 800 50 100 0.9651 0.7467 8.66 800 50 100 0.9799 0.7589 112.16
900 100 800 50 400 1.0055 0.7830 8.81 800 50 400 1.0306 0.8066 115.62
900 100 800 50 600 1.0089 0.7864 8.96 800 50 600 1.0333 0.8095 124.76
730 X. Zhou et al. / Journal of Computer and System Sciences 81 (2015) 717–733

Fig. 11. RMSE values for the Flixster dataset with the change of original column size.

Fig. 12. MAE values for the Flixster dataset with the change of original column size.

and n1 denote the column size of the original matrix before performing the incremental algorithms. Therefore, in order to
make Fig. 11 clearer, the horizontal axis is labeled as “original column size” which represents c 1 for Incremental ApproSVD
and n1 for Incremental SVD.
The two curves shown in Fig. 12 demonstrate the different trend of MAE values according to constant value k = 10,
gradual increasing in c 1 for Incremental ApproSVD and n1 for Incremental SVD for the Flixster dataset. Two parameters c 1
and n1 denote the column size of the original matrix before performing the incremental algorithms. Therefore, in order to
make Fig. 12 clearer, the horizontal axis is labeled as “original column size” which represents c 1 for Incremental ApproSVD
and n1 for Incremental SVD.
The two curves shown in Fig. 13 demonstrate the different trend of running time (seconds) according to constant value
k = 10, gradual increasing in c 1 for Incremental ApproSVD and n1 for Incremental SVD for the Flixster dataset. Two param-
eters c 1 and n1 denote the column size of the original matrix before performing the incremental algorithms. Therefore, in
order to make Fig. 13 clearer, the horizontal axis is labeled as “original column size” which represents c 1 for Incremental
ApproSVD and n1 for Incremental SVD.
In Table 4, all columns represent the same meanings as described by Table 2. From Table 4, we can see that RMSE and
MAE values increase with the increase of k for both Incremental ApproSVD and Incremental SVD. Moreover, the running
time of Incremental ApproSVD has almost no change. However, the running time of Incremental SVD is increasingly sharp.
The two curves shown in Fig. 14 demonstrate the different trend of RMSE values according to a gradual increase in k for
the Flixster dataset.
X. Zhou et al. / Journal of Computer and System Sciences 81 (2015) 717–733 731

Fig. 13. Running time for the Flixster dataset with the change of original column size.

Fig. 14. RMSE values for the Flixster dataset with the change of original column size.

The two curves shown in Fig. 15 demonstrate the different trend of MAE values according to a gradual increase in k for
the Flixster dataset.
The two curves shown in Fig. 16 demonstrate the different trend of running time according to a gradual increase in k
for the Flixster dataset.
In short, for the same sized matrices and the same reduced dimension, although RMSE and MAE values of Incremental
ApproSVD are a little bit higher than that of Incremental SVD in Table 3, and a little lower than that of Incremental SVD
except for the case k = 10 in Table 4, the running time of Incremental ApproSVD is much shorter than that of Incremental
SVD. Therefore, the overall performance of Incremental ApproSVD outperforms that of Incremental SVD for the Flixster
dataset. When the sizes of matrices are growing bigger, the superiority of Incremental ApproSVD is much more obvious.

6. Conclusions

In this paper, we ﬁrstly describe an Incremental SVD algorithm which can exploit the previous singular values and
singular vectors of the original matrix as an alternative to recomputing the SVD of the updated matrix. Secondly, we propose
an incremental algorithm called Incremental ApproSVD, which is produced by combining the ApproSVD algorithm with the
Incremental SVD algorithm. Thirdly, we give the mathematical analysis of the error between the actual ratings and the
predicted ratings produced by the Incremental ApproSVD algorithm. Lastly, the evaluation results on the MovieLens 100 K
dataset and Flixster dataset have demonstrated that the Incremental ApproSVD algorithm outperforms the Incremental SVD
algorithm in terms of integrating the prediction accuracy and running time.
732 X. Zhou et al. / Journal of Computer and System Sciences 81 (2015) 717–733

Fig. 15. MAE values for the Flixster dataset with the change of original column size.

Fig. 16. Running time for the Flixster dataset with the change of original column size.

Acknowledgment

This work is partially supported by the National Natural Science Foundation of China (Grant Nos. 61272480 and
71072172).

References

[1] B. Marlin, Collaborative filtering: a machine learning perspective, Ph.D. thesis, University of Toronto, 2004.
[2] G.-R. Xue, C. Lin, Q. Yang, W. Xi, H.-J. Zeng, Y. Yu, Z. Chen, Scalable collaborative filtering using cluster-based smoothing, in: Proceedings of the 28th
Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, ACM, 2005, pp. 114–121.
[3] G.N. Demir, A.S. Uyar, S.G. Ögüdücü, Graph-based sequence clustering through multiobjective evolutionary algorithms for web recommender systems,
in: Proceedings of the 9th Annual Conference on Genetic and Evolutionary Computation, ACM, 2007, pp. 1943–1950.
[4] P.S. Chakraborty, A scalable collaborative filtering based recommender system using incremental clustering, in: Advance Computing Conference, IACC
2009, IEEE International, IEEE, 2009, pp. 1526–1529.
[5] S. Funk, Netflix update: try this at home, http://sifter.org/~simon/journal/20061211.html, 2006.
[6] S. Zhang, W. Wang, J. Ford, F. Makedon, Learning from incomplete ratings using non-negative matrix factorization, in: SDM, SIAM, 2006, pp. 549–553.
[7] Y. Cheng, G.M. Church, Biclustering of expression data, in: ISMB, vol. 8, 2000, pp. 93–103.
[8] G. Chen, F. Wang, C. Zhang, Collaborative filtering using orthogonal nonnegative matrix tri-factorization, Inf. Process. Manag. 45 (3) (2009) 368–379.
[9] H. Shan, A. Banerjee, Bayesian co-clustering, in: Eighth IEEE International Conference on Data Mining, ICDM’08, IEEE, 2008, pp. 530–539.
[10] X. Zhou, J. He, G. Huang, Y. Zhang, A personalized recommendation algorithm based on approximating the singular value decomposition (ApproSVD),
in: Proceedings of the 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology, vol. 02, IEEE Computer
Society, 2012, pp. 458–464.
X. Zhou et al. / Journal of Computer and System Sciences 81 (2015) 717–733 733

[11] G.H. Golub, C.F. Van Loan, Matrix Computations, vol. 3, JHU Press, 2012.
[12] M.W. Berry, Large-scale sparse singular value computations, Int. J. Supercomput. Appl. 6 (1) (1992) 13–49.
[13] H. Nakayama, A. Hattori, Incremental learning and forgetting in RBF networks and SVMs with applications to financial problems, in: Knowledge-Based
Intelligent Information and Engineering Systems, Springer, 2003, pp. 1109–1115.
[14] B. Sarwar, G. Karypis, J. Konstan, J. Riedl, Incremental singular value decomposition algorithms for highly scalable recommender systems, in: Fifth
International Conference on Computer and Information Technology, Citeseer, 2002, pp. 27–28.
[15] C.G. Baker, K.A. Gallivan, P. Van Dooren, Low-rank incremental methods for computing dominant singular subspaces, Linear Algebra Appl. 436 (8)
(2012) 2866–2888.
[16] H. Zha, H.D. Simon, On updating problems in latent semantic indexing, SIAM J. Sci. Comput. 21 (2) (1999) 782–791.
[17] C.-X. Ren, D.-Q. Dai, Incremental learning of bidirectional principal components for face recognition, Pattern Recognit. 43 (1) (2010) 318–330.
[18] Q. Zheng, X. Wang, W. Deng, J. Liu, X. Wu, Incremental projection vector machine: a one-stage learning algorithm for high-dimension large-sample
dataset, in: AI 2010: Advances in Artificial Intelligence, Springer, 2011, pp. 132–141.
[19] M. Brand, Incremental singular value decomposition of uncertain data with missing values, in: Computer Vision ECCV 2002, Springer, 2002,
pp. 707–720.
[20] S. Chandrasekaran, B. Manjunath, Y.-F. Wang, J. Winkeler, H. Zhang, An eigenspace update algorithm for image analysis, Graph. Models Image Process.
59 (5) (1997) 321–332.
[21] A. Levey, M. Lindenbaum, Sequential Karhunen–Loeve basis extraction and its application to images, IEEE Trans. Image Process. 9 (8) (2000) 1371–1374.
[22] G.I. Allen, L. Grosenick, J. Taylor, A generalized least-square matrix decomposition, J. Am. Stat. Assoc. 109 (505) (2014) 145–159.
[23] A. Dax, From eigenvalues to singular values: a review, Adv. Pure Math. 3 (2013) 8.
[24] C. Eckart, G. Young, The approximation of one matrix by another of lower rank, Psychometrika 1 (3) (1936) 211–218.
[25] B. Sarwar, G. Karypis, J. Konstan, J. Riedl, Application of dimensionality reduction in recommender system-a case study, in: Proceedings of the ACM
Web KDD workshop on Web Mining for E-Commerce, ACM Press, New York, 2000, pp. 82–90.
[26] H. Polat, W. Du, SVD-based collaborative filtering with privacy, in: Proceedings of the 2005 ACM Symposium on Applied Computing, ACM, 2005,
pp. 791–795.
[27] B. Sarwar, G. Karypis, J. Konstan, J. Riedl, Analysis of recommendation algorithms for e-commerce, in: Proceedings of the 2nd ACM Conference on
Electronic Commerce, ACM, 2000, pp. 158–167.
[28] M.A. Ghazanfar, A. Prügel-Bennett, The advantage of careful imputation sources in sparse data-environment of recommender systems: generating
improved SVD-based recommendations, Informatica (Slov.) 37 (1) (2013) 61–92.
[29] M.W. Berry, S.T. Dumais, G.W. O’Brien, Using linear algebra for intelligent information retrieval, SIAM Rev. 37 (4) (1995) 573–595.
[30] MovieLens, http://www.grouplens.org/node/73.
[31] Flixster dataset, http://www.cs.sfu.ca/~sja25/personal/datasets/.

Big Data Analytics Digital Notes
No ratings yet
Big Data Analytics Digital Notes
119 pages
Artificial Intelligence Interview Questions
100% (1)
Artificial Intelligence Interview Questions
28 pages
R Data Analysis Projects PDF
No ratings yet
R Data Analysis Projects PDF
354 pages
Virtual Fitness Trainer Using Artificial Intelligence
No ratings yet
Virtual Fitness Trainer Using Artificial Intelligence
11 pages
Music Recommendation
100% (1)
Music Recommendation
113 pages
Web Information Systems Engineering - WISE 2018: 19th International Conference, Dubai, United Arab Emirates, November 12-15, 2018, Proceedings, Part II Hakim Hacid Download
100% (1)
Web Information Systems Engineering - WISE 2018: 19th International Conference, Dubai, United Arab Emirates, November 12-15, 2018, Proceedings, Part II Hakim Hacid Download
56 pages
MOD 4 Notes
No ratings yet
MOD 4 Notes
19 pages
Hackathon Problem Statements Technology Track With Link
100% (1)
Hackathon Problem Statements Technology Track With Link
16 pages
Recommendation System Using Collaborative Filtering
No ratings yet
Recommendation System Using Collaborative Filtering
49 pages
Petstorefinalreport 1
No ratings yet
Petstorefinalreport 1
22 pages
Unit..1 Rs
No ratings yet
Unit..1 Rs
16 pages
Literature Review Uwi
100% (1)
Literature Review Uwi
5 pages
Netflix 35%
No ratings yet
Netflix 35%
6 pages
Situating Search: Chirag Shah Emily M. Bender
No ratings yet
Situating Search: Chirag Shah Emily M. Bender
12 pages
13jay Chotaliya
No ratings yet
13jay Chotaliya
119 pages
A Research of Job Recommendation System Based On Collaborative Filtering
No ratings yet
A Research of Job Recommendation System Based On Collaborative Filtering
6 pages
RecSys Updated
No ratings yet
RecSys Updated
37 pages
1symeonidis Panagiotis Zioupos Andreas Matrix and Tensor Fact
No ratings yet
1symeonidis Panagiotis Zioupos Andreas Matrix and Tensor Fact
101 pages
Course Pack - V Sem Machine Learning by DR SantoshKumar5
No ratings yet
Course Pack - V Sem Machine Learning by DR SantoshKumar5
27 pages
Group 15 Report
No ratings yet
Group 15 Report
23 pages
Module 2
No ratings yet
Module 2
53 pages
RecSys PyData2016
No ratings yet
RecSys PyData2016
32 pages
Recommendation System in Python
No ratings yet
Recommendation System in Python
13 pages
Comparison of Collaborative Filtering Algorithms: Limitations of Current Techniques and Proposals For Scalable, High-Performance Recommender Systems
No ratings yet
Comparison of Collaborative Filtering Algorithms: Limitations of Current Techniques and Proposals For Scalable, High-Performance Recommender Systems
33 pages
Big Data Analytics: Challenges and Applications For Text, Audio, Video, and Social Media Data
No ratings yet
Big Data Analytics: Challenges and Applications For Text, Audio, Video, and Social Media Data
11 pages
FULLTEXT01
No ratings yet
FULLTEXT01
44 pages
Personalized E-Learning Recommender System Based On Autoencoders
No ratings yet
Personalized E-Learning Recommender System Based On Autoencoders
20 pages
Chapter 3 AI New
No ratings yet
Chapter 3 AI New
35 pages
Survey On Users Ranking Pattern Based Trust Model Regularization in Product Recommendation
No ratings yet
Survey On Users Ranking Pattern Based Trust Model Regularization in Product Recommendation
13 pages
Matrix Factorization Techniques For Recommender Systems: Collaborative Filtering
No ratings yet
Matrix Factorization Techniques For Recommender Systems: Collaborative Filtering
20 pages
A Recommender System-Using Novel Deep Network Collaborative Filtering
No ratings yet
A Recommender System-Using Novel Deep Network Collaborative Filtering
12 pages
.Trashed-1724941095-Recommender Systems
No ratings yet
.Trashed-1724941095-Recommender Systems
30 pages
Recommendation System
No ratings yet
Recommendation System
17 pages
Application of Dimensionality Reduction in Recommender System - A Case Study
No ratings yet
Application of Dimensionality Reduction in Recommender System - A Case Study
12 pages
Enhancing Collaborative Filtering by User Interest Expansion Via Personalized Ranking
No ratings yet
Enhancing Collaborative Filtering by User Interest Expansion Via Personalized Ranking
16 pages
A Social Network-Based Recommender System SNRS
No ratings yet
A Social Network-Based Recommender System SNRS
32 pages
GCD Detailed Syllabus
No ratings yet
GCD Detailed Syllabus
24 pages
Research Paper
No ratings yet
Research Paper
7 pages
U5 - SVD - 5th Sem - DS
No ratings yet
U5 - SVD - 5th Sem - DS
17 pages
Bayesian Probabilistic Matrix Factorization With Social Relations and Item Contents
No ratings yet
Bayesian Probabilistic Matrix Factorization With Social Relations and Item Contents
14 pages
Good Good Good - Modeling Behavior Sequence For Personalized Fund Recommendation With Graphical Deep Collaborative Filtering
No ratings yet
Good Good Good - Modeling Behavior Sequence For Personalized Fund Recommendation With Graphical Deep Collaborative Filtering
14 pages
IJE - Volume 29 - Issue 6 - Pages 788-796
No ratings yet
IJE - Volume 29 - Issue 6 - Pages 788-796
9 pages
Matrix Factiorization
No ratings yet
Matrix Factiorization
12 pages
Algorithmic Exploration in Reading Behavior Analysis and Recommendations Using Machine Learning
No ratings yet
Algorithmic Exploration in Reading Behavior Analysis and Recommendations Using Machine Learning
23 pages
N2VSCDNNR: A Local Recommender System
No ratings yet
N2VSCDNNR: A Local Recommender System
11 pages
Big Data Cloud-Based Recommendation System Using NLP Techniques With Machine and Deep Learning
No ratings yet
Big Data Cloud-Based Recommendation System Using NLP Techniques With Machine and Deep Learning
8 pages
A Guide To Singular Value Decomposition For Collaborative Filtering
No ratings yet
A Guide To Singular Value Decomposition For Collaborative Filtering
14 pages
1 s2.0 S095070512201334X Main
No ratings yet
1 s2.0 S095070512201334X Main
13 pages
2020-Enhanced Collaborative Filtering-Based Approach For Recommender Systems
No ratings yet
2020-Enhanced Collaborative Filtering-Based Approach For Recommender Systems
8 pages
W. Inglês Paulo
No ratings yet
W. Inglês Paulo
12 pages
A Cosmetic Product Recommendation System Based On Skin Type Using AI:ML
No ratings yet
A Cosmetic Product Recommendation System Based On Skin Type Using AI:ML
7 pages
An Improved Personalized Filtering Recommendation Algorithm:, Sun Mingyang, Liu Xidong
No ratings yet
An Improved Personalized Filtering Recommendation Algorithm:, Sun Mingyang, Liu Xidong
10 pages
Movie Recommendations With LightGCN Exploring Graph Machine Learning
No ratings yet
Movie Recommendations With LightGCN Exploring Graph Machine Learning
12 pages
Ijhs 13222+4527 4537
No ratings yet
Ijhs 13222+4527 4537
11 pages
A Novel Collaborative Filtering Model Based On Combination of Correlation Method With Matrix Completion Technique
No ratings yet
A Novel Collaborative Filtering Model Based On Combination of Correlation Method With Matrix Completion Technique
8 pages
Comparative Study of Recommender System Approaches and Movie Recommendation Using Collaborative Filtering
No ratings yet
Comparative Study of Recommender System Approaches and Movie Recommendation Using Collaborative Filtering
11 pages
12 Recsys 1
No ratings yet
12 Recsys 1
11 pages
Movie Recommendentation Systemedited-B
No ratings yet
Movie Recommendentation Systemedited-B
9 pages
Movie Recommendation System Using SVD Letterboxd
No ratings yet
Movie Recommendation System Using SVD Letterboxd
9 pages
Movie Recommendation KNN
No ratings yet
Movie Recommendation KNN
5 pages
The Ultimate Mini Projects Guide For Developers
No ratings yet
The Ultimate Mini Projects Guide For Developers
7 pages
Movie Recommendation System Using Cosine Similarity and KNN: II. Related Work
No ratings yet
Movie Recommendation System Using Cosine Similarity and KNN: II. Related Work
4 pages
Machine Learning Model For Movie Recomme
No ratings yet
Machine Learning Model For Movie Recomme
6 pages
A Survey For Personalized Item Based Recommendation System
No ratings yet
A Survey For Personalized Item Based Recommendation System
3 pages
Movie Recommendation System Based On SVD Collaborative Filtering
No ratings yet
Movie Recommendation System Based On SVD Collaborative Filtering
7 pages
Research Proposal for Masters in China 攻读硕
No ratings yet
Research Proposal for Masters in China 攻读硕
7 pages
Friendbook: A Semantic-Based Friend Recommendation System For Social Networks
No ratings yet
Friendbook: A Semantic-Based Friend Recommendation System For Social Networks
6 pages
Research Proposal
No ratings yet
Research Proposal
7 pages
E96660695201532
No ratings yet
E96660695201532
5 pages
Peng 2013
No ratings yet
Peng 2013
4 pages
A Review On Reinforcement Learning Based News Recommendation Systems and Its Challenges
No ratings yet
A Review On Reinforcement Learning Based News Recommendation Systems and Its Challenges
6 pages
Paper Joke Recommendation Systems
No ratings yet
Paper Joke Recommendation Systems
6 pages
Recommendation System
No ratings yet
Recommendation System
6 pages
Bha Lse 2021
No ratings yet
Bha Lse 2021
6 pages
Haimlc801 Twsma Syllabus
No ratings yet
Haimlc801 Twsma Syllabus
3 pages
Applied Computational Intelligence and Soft Computing - 2023 - Ahmed - Book Recommendation
No ratings yet
Applied Computational Intelligence and Soft Computing - 2023 - Ahmed - Book Recommendation
12 pages
Mds 1
No ratings yet
Mds 1
4 pages
Building A Movie Recommendation System Using SVD A
No ratings yet
Building A Movie Recommendation System Using SVD A
4 pages
2024-Widyaningtyas T. Et Al.-mf-NCG - Recommendation Algorithm Using Matrix Factorization-Based Normalized Cumulative Genre
No ratings yet
2024-Widyaningtyas T. Et Al.-mf-NCG - Recommendation Algorithm Using Matrix Factorization-Based Normalized Cumulative Genre
10 pages
Improving Regularized Singular Value Decomposition For Collaborative Filtering
No ratings yet
Improving Regularized Singular Value Decomposition For Collaborative Filtering
4 pages
Songs Recommender System Using Machine
No ratings yet
Songs Recommender System Using Machine
3 pages
RS Assignment 1
No ratings yet
RS Assignment 1
3 pages
E-Commerce Intelligent Recommendation System Based On Deep Learning
No ratings yet
E-Commerce Intelligent Recommendation System Based On Deep Learning
4 pages
Practical Data Analysis
From Everand
Practical Data Analysis
Hector Cuesta
4.5/5 (14)
Cloud-Based Multi-Modal Information Analytics
From Everand
Cloud-Based Multi-Modal Information Analytics
Tanushri Kaniyar
No ratings yet
Data and Analytics in Action: Project Ideas and Basic Code Skeleton in Python
From Everand
Data and Analytics in Action: Project Ideas and Basic Code Skeleton in Python
Zemelak Goraga
No ratings yet
Data Mining Models: Techniques and Applications
From Everand
Data Mining Models: Techniques and Applications
Ravi Deshpande
No ratings yet
IT Specialist: Artificial Intelligence Exam Prep - 500 Questions for Certification Success (0225)
From Everand
IT Specialist: Artificial Intelligence Exam Prep - 500 Questions for Certification Success (0225)
Satou Takahiro
No ratings yet
Building and Operating Data Hubs: Using a practical Framework as Toolset
From Everand
Building and Operating Data Hubs: Using a practical Framework as Toolset
Georg Graner
No ratings yet

1 s2.0 S0022000014001706 Main

Uploaded by

1 s2.0 S0022000014001706 Main

Uploaded by

Journal of Computer and System Sciences 81 (2015) 717–733

Contents lists available at ScienceDirect

Journal of Computer and System Sciences

SVD-based incremental approaches for recommender systems

Fig. 1. Instance chart of recommendation procedure based on Incremental ApproSVD algorithm.

3.1. Singular value decomposition (SVD)

3.2. The Eckart–Young theorem

Deﬁnition 2. The Frobenius norm  ·  F of A ∈ Rm×n is deﬁned as the following:

min  A − B 2F =  A − A k 2F = σk2+1 + . . . + σr2 , (6)

According to the existing rating D of user U for item I , we suppose that:

Fig. 2. Flow chart of oﬄine and online stage of updating SVD.

4.1. Incremental SVD algorithm

where F := [ S , U T A 1 ], and SVD( F ) = U F S F V FT . Finally, the SVD of A  can be calculated as

Algorithm 1 Incremental SVD algorithm.

where G := [ S A  , U TA  A 2 ], and SVD(G ) = U G S G V GT . Finally, the SVD of A  can be calculated as

4.2. Incremental ApproSVD algorithm

Algorithm 2 Incremental ApproSVD algorithm.

Fig. 3. Flow chart of Incremental ApproSVD.

4.3. Error analysis

As stated above, approximating [ B 1 , B 2 ] by [ B 1 , B 2 ]k generates an error equal to [ B 1 , B 2 ] − [ B 1 , B 2 ]k 2F =

Therefore, [ B 1 , B 2 ] − H k H kT [ B 1 , B 2 ]2F can be expressed as

Fig. 4. Five-fold cross validation.

5.1. Experimental platform

Incremental ApproSVD Incremental SVD

Incremental ApproSVD Incremental SVD

5.2. Evaluation results and discussions

Incremental ApproSVD Incremental SVD

Incremental ApproSVD Incremental SVD

You might also like

Deﬁnition 2. The Frobenius norm · F of A ∈ Rm×n is deﬁned as the following:

min A − B 2F = A − A k 2F = σk2+1 + . . . + σr2 , (6)

where F := [ S , U T A 1 ], and SVD( F ) = U F S F V FT . Finally, the SVD of A can be calculated as

where G := [ S A , U TA A 2 ], and SVD(G ) = U G S G V GT . Finally, the SVD of A can be calculated as

As stated above, approximating [ B 1 , B 2 ] by [ B 1 , B 2 ]k generates an error equal to [ B 1 , B 2 ] − [ B 1 , B 2 ]k 2F =

Therefore, [ B 1 , B 2 ] − H k H kT [ B 1 , B 2 ]2F can be expressed as