0% found this document useful (0 votes)

32 views

Zero-Shot Recommendation As Language Modeling

Uploaded by

mmmhasy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

32 views

Zero-Shot Recommendation As Language Modeling

Uploaded by

mmmhasy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/359722889

Zero-Shot Recommendation as Language Modeling

Chapter · April 2022

DOI: 10.1007/978-3-030-99739-7_26

CITATIONS READS
24 33

3 authors, including:

Damien Sileo
National Institute for Research in Computer Science and Control
31 PUBLICATIONS 180 CITATIONS

SEE PROFILE

All content following this page was uploaded by Damien Sileo on 24 December 2022.

The user has requested enhancement of the downloaded file.

Zero-Shot Recommendation as Language Modeling

Damien Sileo[0000−0002−3274−291X] , Wout Vossen, and Robbe Raymaekers

KU Leuven, Belgium
[email protected]
arXiv:2112.04184v1 [cs.CL] 8 Dec 2021

Abstract. Recommendation is the task of ranking items (e.g. movies or prod-

ucts) according to individual user needs. Current systems rely on collaborative fil-
tering and content-based techniques, which both require structured training data.
We propose a framework for recommendation with off-the-shelf pretrained lan-
guage models (LM) that only used unstructured text corpora as training data. If
a user u liked Matrix and Inception, we construct a textual prompt, e.g. ”Movies
like Matrix, Inception, <m>” to estimate the affinity between u and m with LM
likelihood. We motivate our idea with a corpus analysis, evaluate several prompt
structures, and we compare LM-based recommendation with standard matrix fac-
torization trained on different data regimes. The code for our experiments is pub-
licly available1 .

1 Introduction
Recommender systems predict an affinity score between users and items. Current rec-
ommender systems are based on content-based filtering (CB), collaborative filtering
techniques (CF), or a combination of both. CF recommender systems rely on (U SER ,
I TEM , I NTERACTION) triplets. CB relies on (I TEM , F EATURES) pairs. Both system
types require a costly structured data collection step. Meanwhile, web users express
themselves about various items in an unstructured way. They share lists of their favorite
items and ask for recommendations on web forums, as in (1)2 which hints at a similarity
between the enumerated movies.
(1) Films like Beyond the Black Rainbow, Lost River, Suspiria, and The Neon Demon.
The web also contains a lot of information about the items themselves, like synopsis or
reviews for movies. Language models such as GPT-2 [14] are trained on large web cor-
pora to generate plausible text. We hypothesize that they can make use of this unstruc-
tured knowledge to make recommendations by estimating the plausibility of items be-
ing grouped together in a prompt. LM can estimate the probability of a word sequence,
P (w1 , ...wn ). Neural language models are trained over a large corpus of documents: to
train a neural network, its parameters Θ are optimized for next word prediction likeli-
hood maximization over k-length sequences sampled from a corpus. The loss writes as
follows: X
LLM = −log P (wi |wi−k ....wi−1 ; Θ) (1)
i
1
https://colab.research.google.com/drive/...?usp=sharing
2
https://www.reddit.com/r/MovieSuggestions/...lost river/
2 D. Sileo et al.

We rely on existing pretrained language models. To make a relevance prediction ,

we build a prompt for each user:
pu,i = Movies like <m1 >, ...<mn >, <mi > (2)

where <mi > is the name of the movie mi and <m1 ...mn > are those of randomly
ordered movies liked by u. We then directly use Rbu,i = PΘ (pu,i ) as a relevance score
to sort items for user u.
Our contributions are as follow (i) we propose a model for recommendation with
standard LM; (ii) we derive prompt structures from a corpus analysis and compare
their impact on recommendation accuracy; (iii) we compare LM-based recommenda-
tion with next sentence prediction (NSP) [12] and a standard supervised matrix factor-
ization method [9,15].

2 Related work

Language models and recommendation Previous work leveraged language modeling

techniques to perform recommendations. However, they do not rely on natural lan-
guage: they use sequences of user/item interactions, and treat these sequences as sen-
tences to leverage the architectures inspired by NLP, such as Word2Vec [7,1,4,11] or
BERT [19].

Zero-shot prediction with language models Neural language models have been used
for zero-shot inference on many NLP tasks [14,2]. For example, they manually con-
struct a prompt structure to translate text, e.g. Translate english to french : ”cheese”
=>, and use the language model completions to find the best translations. Petroni et
al. [13] show that masked language models can act as a knowledge base when we use
part of a triplet as input, e.g. Paris in <mask>. Here, we apply LM-based prompts to
recommendation.

Hybrid and zero-shot recommendation The cold start problem [17], i.e. dealing with
new users or items is a long-standing problem in recommender systems, usually ad-
dressed with hybridization of CF-based and CB-based systems. Previous work [20,10,5,6]
introduced models for zero-shot recommendation, but they use zero-shot prediction
with a different sense than ours. They train on a set of (U SER , I TEM , I NTERACTION )
triplets, and perform zero-shot predictions on new users or items with known attributes.
These methods still require (U SER , I TEM , I NTERACTION ) or (I TEM , F EATURES ) tu-
ples for training. To our knowledge, the only attempt to perform recommendations with-
out such data at all is from Penha et al. [12] who showed that BERT [3] next sentence
prediction (NSP) can be used to predict the most plausible movie after a prompt. NSP
is not available in all language models and requires a specific pretraining. Their work is
designed as a probing of BERT knowledge about common items, and lacks comparison
with a standard recommendation model, which we here address.
Zero-Shot Recommendation as Language Modeling 3

3 Experiments

3.1 Setup
Dataset We use the standard the MovieLens 1M dataset [8] with 1M ratings from
0.5 to 5, 6040 users, and 3090 movies in our experiments. We address the relevance
prediction task3 , so we consider a rating r as positive if r ≥ 4.0, as negative if ≤ 2.5
and we discard the other ratings. We select users with at least 21 positive ratings and
4 negative ratings and thus obtain 2716 users. We randomly select 20% of them as test
users4 . 1 positive and 4 negative ratings are reserved for evaluation for each user, and
the goal is to give the highest relevance score to the positively rated item. We use 5
positive ratings per user unless mentioned otherwise. We remove the years from the
movie titles and reorder the articles (a, the) in the movie titles provided in the dataset
(e.g. Matrix, The (1999) → The Matrix).
Evaluation metric We use the mean average precision at rank 1 (MAP@1) [18] which
is the rate of correct first ranked prediction averaged over test users, because of its
interpretability.
Pretrained language models In our experiments we use the GPT-2 [14] language mod-
els, which are publicly available in several sizes. GPT-2 is trained with LM pretraining
(equation 1) on the WebText corpus [14], which contains 8 million pages covering var-
ious domains. Unless mentioned otherwise, we use the GPT-base model, with 117M
parameters.

3.2 Mining prompts for recommendation

<m >,...,<m >

3-6 gram #Count
<m> and <m> 387 <m >,...,<m >
<m>, <m>, <m> 232
<m >,...,<m >
Movies like <m> 196
<m>, <m>, <m>, <m> 85 <m >,...,<m >
Movies similar to <m> 25
0.00 0.05 0.10 0.15 0.20 0.25 0.30 0.35
MAP@1

Table 1: Occurrence counts of 3-6 grams Fig. 1: Comparison of LM recommen-

that contain movie names in the Reddit dations MAP@1 with different prompt
corpus. <m> denotes a movie name. structures.

We analyze the Reddit comments from May 20155 to find out how web users mention
lists of movies in web text. This analysis will provide prompt candidates for LM-based
3
Item relevance could be mapped to ratings but we do not address rating prediction here.
4
Training users are only used for the matrix factorization baseline.
5
https://www.kaggle.com/reddit/reddit-comments-may-2015
4 D. Sileo et al.

recommendations. We select comments where a movie name of the MovieLens dataset

is present and replace movies with a <m> tag. This filtered dataset of comments has a
size of > 900k words. We then select the most frequent pattern with at least three words,
as shown in table 1. Movie names are frequently used in enumerations. The patterns
Movies like <m> and Movies similar to confirm that users focus on the similarity of
movies.
Figure 1 shows that prompt design is important but not critical for high accuracy.
Our corpus-derived prompts significantly outperform if you like <m1 ...mn >, you will
like <mi > used in [12]. We will use <m1 ...mn >, <mi > in the remaining of the paper
due to its superior results and its simplicity.

3.3 Effect of the number of ratings per test user

We investigate the effect of the number of mentioned movies in prompts. We expect

the accuracy of the models in making recommendations to increase when they get more
info about movies a user likes. We compare the recommendation accuracy on the same
users 0,1,2,3,5,10,15 or 20 movies per prompt.

0.40

0.35
MAP@1

0.30

0.25

0 2 5 7 10 12 15 17 20
#Movies per user
Fig. 2: MAP@1 of LM models with a varying number of movies per user sampled in
the input prompt.

Figure 2 shows that increasing the number of ratings per user has diminishing re-
turns and lead to increasing instability, so specifying n ≈ 5 seems to lead to the best re-
sults with the least user input. After 5 items, adding more items might make the prompt
Zero-Shot Recommendation as Language Modeling 5

less natural, even though the LM seems to adapt when the number of items keeps in-
creasing. It is also interesting to note that when we use an empty prompt, accuracy is
above chance level because the LM captures some information about movie popularity.

3.4 Comparison with matrix factorization and NSP

We now use a matrix factorization as a baseline, with the Bayesian Personalised Rank-
ing algorithm (BPR) [15]. Users and items are mapped to d randomly initialized latent
factors, and their dot product is used as a relevance score trained with ranking loss. We
use [16] implementation with default hyperparameters6 d = 10 and a learning rate of
0.001.
We also compare GPT-2 LM to BERT next sentence prediction [12] which models
affinity scores with R bu,i = BERTNSP (pu , <mi >), where pu is a prompt containing
movies liked by u. BERT was pretrained with contiguous sentence prediction task [3]
and Penha et al. [12] proposed to use it as a way to probe BERT for recommendation
capabilities.

0.50

0.45

0.40
MAP@1

0.35
BPR [15]
0.30
BERT-base NSP [12]
BERT-large NSP [12]
0.25
GPT-2-base
GPT-2-medium
0.20
0 100 200 300 400 500
#Users in training set
Fig. 3: MAP@1 for BPR models with increasing numbers of users compared the zero-
shot language models (with 0 training user). BERT-base and BERT-large respectively
have 110M and 340M parameters. GPT-2-base and GPT-2-medium have 117M and
345M parameters.

6
https://cornac.readthedocs.io/en/latest/models.html#
bayesian-personalized-ranking-bpr, we experimented with other hyperpa-
rameter configurations but did not observe significant changes.
6 D. Sileo et al.

Figure 3 shows that the proposed LM-based recommendation significantly outper-

forms BERTNSP . We explain the difference by the fact that LM are generative and ac-
tually model text likelihood, while next sentence prediction is discriminative and can
be based on simple discursive coherence features. It is also interesting to note that LM-
based models outperform matrix factorization when there are few users, i.e < 50 and
< 100 for BASE and MEDIUM GPT-2, which demonstrates that LM-based recommenda-
tion is viable for cold start regimes. Using models larger than the BASE versions lead to
better results, however when we evaluated with larger versions (we did not perform the
full experiments due to memory limitations), we did not see additional improvement,
which could be explained by overfitting.

3.5 Qualitative analysis of generations

Up until there, we have used LM to score the likelihood of sequences. LM can also be
used directly for text generation, unlike BERT. We here show LM-generated prompt
completions randomly sampled in our dataset, using greedy decoding.

Prompt (P1): Forrest Gump, Blade Runner, Modern Times, Amelie, Lord of the Rings
The Return of the King, Shaun of the Dead, Alexander, Pan’s Labyrinth, Cashback,
Avatar:
Completion (C1): 3, The Hunger Games: Mockingjay Part 2, King Arthur, A Feast for
Crows, The Hunger Games: Catching Fire, Jackass, Jackass 2, King Arthur

Prompt (P2): Independence Day, Winnie the Pooh and the Blustery Day, Raiders of
the Lost Ark, Star Wars Episode VI - Return of the Jedi, Quiet Man, Game, Labyrinth,
Return to Oz, Song of the South, Matrix:
Completion (C2): and many more. The list can be read by clicking on the relevant
section at the left of the image. To access the list of releases

Some prompts, i.e. (P1) generate valid movie names, but others, like (P2), do not.
LM-based recommender do need a post-processing to match movie names in the possi-
ble sampled generations.

4 Conclusion

We showed that standard language models can be used to perform item recommen-
dations without any adaptation and that they are competitive with supervised matrix
factorization when the number of users is very low (less than 100 users). LM can there-
fore be used to kickstart recommender systems if items are frequently discussed in the
training corpora. Further research could explore ways to adjust LM for recommendation
purposes or to combine LM with matrix factorization into hybrid systems. Another way
to use of our findings would be to generate movie recommendation datasets by mining
web data which could feed standard supervised recommendation techniques.
Zero-Shot Recommendation as Language Modeling 7

5 Acknowledgements
This work is part of the CALCULUS project, which is funded by the ERC Advanced
Grant H2020-ERC-2017 ADG 7885067 .

References
1. Barkan, O., Koenigstein, N.: Item2vec: Neural item embedding for collaborative filtering.
In: 2016 IEEE 26th International Workshop on Machine Learning for Signal Processing
(MLSP). pp. 1–6 (2016). https://doi.org/10.1109/MLSP.2016.7738886
2. Brown, T.B., Mann, B., Ryder, N., Subbiah, M., Kaplan, J., Dhariwal, P., Neelakantan, A.,
Shyam, P., Sastry, G., Askell, A., Agarwal, S., Herbert-Voss, A., Krueger, G., Henighan,
T., Child, R., Ramesh, A., Ziegler, D.M., Wu, J., Winter, C., Hesse, C., Chen, M., Sigler, E.,
Litwin, M., Gray, S., Chess, B., Clark, J., Berner, C., McCandlish, S., Radford, A., Sutskever,
I., Amodei, D.: Language models are few-shot learners (2020)
3. Devlin, J., Chang, M.W., Lee, K., Toutanova, K.: BERT: Pre-training of deep bidi-
rectional transformers for language understanding. In: Proceedings of the 2019 Con-
ference of the North American Chapter of the Association for Computational Linguis-
tics: Human Language Technologies, Volume 1 (Long and Short Papers). pp. 4171–
4186. Association for Computational Linguistics, Minneapolis, Minnesota (Jun 2019).
https://doi.org/10.18653/v1/N19-1423, https://aclanthology.org/N19-1423
4. Devooght, R., Bersini, H.: Long and short-term recommendations with recurrent neu-
ral networks. p. 13–21. UMAP ’17, Association for Computing Machinery, New York,
NY, USA (2017). https://doi.org/10.1145/3079628.3079670, https://doi.org/10.
1145/3079628.3079670
5. Ding, H., Ma, Y., Deoras, A., Wang, Y., Wang, H.: Zero-shot recommender systems (2021)
6. Feng, P.J., Pan, P., Zhou, T., Chen, H., Luo, C.: Zero shot on the cold-start prob-
lem: Model-agnostic interest learning for recommender systems. In: Proceedings of
the 30th ACM International Conference on Information & Knowledge Management. p.
474–483. CIKM ’21, Association for Computing Machinery, New York, NY, USA (2021).
https://doi.org/10.1145/3459637.3482312, https://doi.org/10.1145/3459637.
3482312
7. Guàrdia-Sebaoun, E., Guigue, V., Gallinari, P.: Latent trajectory modeling: A light and ef-
ficient way to introduce time in recommender systems. In: Proceedings of the 9th ACM
Conference on Recommender Systems. pp. 281–284 (2015)
8. Harper, F.M., Konstan, J.A.: The movielens datasets: History and context. ACM Trans. Inter-
act. Intell. Syst. 5(4) (Dec 2015). https://doi.org/10.1145/2827872, https://doi.org/
10.1145/2827872
9. Koren, Y., Bell, R., Volinsky, C.: Matrix factorization techniques for recommender systems.
Computer 42(8), 30–37 (2009)
10. Li, J., Jing, M., Lu, K., Zhu, L., Yang, Y., Huang, Z.: From zero-shot learning to cold-
start recommendation. Proceedings of the AAAI Conference on Artificial Intelligence
33(01), 4189–4196 (Jul 2019). https://doi.org/10.1609/aaai.v33i01.33014189, https://
ojs.aaai.org/index.php/AAAI/article/view/4324
11. Li, Z., Zhao, H., Liu, Q., Huang, Z., Mei, T., Chen, E.: Learning from history and present:
Next-item recommendation via discriminatively exploiting user behaviors. In: Proceedings
of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Min-
ing. pp. 1734–1743 (2018)
7
https://calculus-project.eu/
8 D. Sileo et al.

12. Penha, G., Hauff, C.: What does bert know about books, movies and music? probing bert
for conversational recommendation. In: Fourteenth ACM Conference on Recommender
Systems. p. 388–397. RecSys ’20, Association for Computing Machinery, New York,
NY, USA (2020). https://doi.org/10.1145/3383313.3412249, https://doi.org/10.
1145/3383313.3412249
13. Petroni, F., Rockaschel, T., Riedel, S., Lewis, P., Bakhtin, A., Wu, Y., Miller, A.: Language
models as knowledge bases? In: Proceedings of the 2019 Conference on Empirical Meth-
ods in Natural Language Processing and the 9th International Joint Conference on Nat-
ural Language Processing (EMNLP-IJCNLP). pp. 2463–2473. Association for Computa-
tional Linguistics, Hong Kong, China (Nov 2019). https://doi.org/10.18653/v1/D19-1250,
https://aclanthology.org/D19-1250
14. Radford, A., Wu, J., Child, R., Luan, D., Amodei, D., Sutskever, I.: Language
models are unsupervised multitask learners (2019), https://openai.com/blog/
better-language-models/
15. Rendle, S., Freudenthaler, C., Gantner, Z., Schmidt-Thieme, L.: Bpr: Bayesian personalized
ranking from implicit feedback. p. 452–461. UAI ’09, AUAI Press, Arlington, Virginia, USA
(2009)
16. Salah, A., Truong, Q.T., Lauw, H.W.: Cornac: A comparative framework for multimodal
recommender systems. Journal of Machine Learning Research 21(95), 1–5 (2020)
17. Schein, A.I., Popescul, A., Ungar, L.H., Pennock, D.M.: Methods and metrics
for cold-start recommendations. In: Proceedings of the 25th Annual International
ACM SIGIR Conference on Research and Development in Information Retrieval.
p. 253–260. SIGIR ’02, Association for Computing Machinery, New York, NY,
USA (2002). https://doi.org/10.1145/564376.564421, https://doi.org/10.1145/
564376.564421
18. Schröder, G., Thiele, M., Lehner, W.: Setting goals and choosing metrics for recommender
system evaluations. In: UCERSTI2 workshop at the 5th ACM conference on recommender
systems, Chicago, USA. vol. 23, p. 53 (2011)
19. Sun, F., Liu, J., Wu, J., Pei, C., Lin, X., Ou, W., Jiang, P.: Bert4rec: Sequential rec-
ommendation with bidirectional encoder representations from transformer. In: Proceed-
ings of the 28th ACM International Conference on Information and Knowledge Man-
agement. p. 1441–1450. CIKM ’19, Association for Computing Machinery, New York,
NY, USA (2019). https://doi.org/10.1145/3357384.3357895, https://doi.org/10.
1145/3357384.3357895
20. Volkovs, M., Yu, G., Poutanen, T.: Dropoutnet: Addressing cold start in recommender sys-
tems. In: Guyon, I., Luxburg, U.V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S.,
Garnett, R. (eds.) Advances in Neural Information Processing Systems. vol. 30. Curran As-
sociates, Inc. (2017), https://proceedings.neurips.cc/paper/2017/file/
dbd22ba3bd0df8f385bdac3e9f8be207-Paper.pdf

View publication stats

LaBella Et Al v. Apple
No ratings yet
LaBella Et Al v. Apple
51 pages
Chapter 3 - Fundamental PLC Programming Chapter 3 Review Questions and Problems
No ratings yet
Chapter 3 - Fundamental PLC Programming Chapter 3 Review Questions and Problems
7 pages
LLM For Recommandation
No ratings yet
LLM For Recommandation
101 pages
rs-llm
No ratings yet
rs-llm
12 pages
Batch D17
No ratings yet
Batch D17
17 pages
Recommender Systems in the Era of Large Language M
No ratings yet
Recommender Systems in the Era of Large Language M
16 pages
Towards Llm-Recsys Alignment With Textual Id Learning
No ratings yet
Towards Llm-Recsys Alignment With Textual Id Learning
10 pages
Alibaba 202304 Is ChatGPT A Good Recommender A Preliminary Study
No ratings yet
Alibaba 202304 Is ChatGPT A Good Recommender A Preliminary Study
10 pages
2304.10149v3
No ratings yet
2304.10149v3
10 pages
Prompting LLM for RecSys
No ratings yet
Prompting LLM for RecSys
46 pages
Personalized Prompt Learning For Explainable Recommendation: Lei Li Yongfeng Zhang Li Chen
No ratings yet
Personalized Prompt Learning For Explainable Recommendation: Lei Li Yongfeng Zhang Li Chen
26 pages
Text is All You Need
No ratings yet
Text is All You Need
10 pages
llmrecommpaper
No ratings yet
llmrecommpaper
12 pages
International Journal of Research Publication and Reviews: Yeole Madhavi B., Rokade Monika D. Khatal Sunil S
No ratings yet
International Journal of Research Publication and Reviews: Yeole Madhavi B., Rokade Monika D. Khatal Sunil S
15 pages
Movie at
No ratings yet
Movie at
19 pages
2402.18590v3 (2)
No ratings yet
2402.18590v3 (2)
10 pages
Movie Recommendation System Using Machine Learning
No ratings yet
Movie Recommendation System Using Machine Learning
8 pages
(COMP4332) (2021) (S) Final P6a03t 90367
No ratings yet
(COMP4332) (2021) (S) Final P6a03t 90367
14 pages
Movie Recommendation System Report
No ratings yet
Movie Recommendation System Report
5 pages
New Machine Learning Model To Movie Recommender and Sentiment Analysis
No ratings yet
New Machine Learning Model To Movie Recommender and Sentiment Analysis
4 pages
Lilianweng Github Io Posts 2023-03-15 Prompt Engineering
No ratings yet
Lilianweng Github Io Posts 2023-03-15 Prompt Engineering
9 pages
Aligning Large Language Models With Recommendation Knowledge
No ratings yet
Aligning Large Language Models With Recommendation Knowledge
16 pages
Movie_Recommendation_Report
No ratings yet
Movie_Recommendation_Report
27 pages
Large Language Models Are Zero Shot Text Classifiers
No ratings yet
Large Language Models Are Zero Shot Text Classifiers
9 pages
Research Paper
No ratings yet
Research Paper
12 pages
3170724_ML_210490131009_OEP
No ratings yet
3170724_ML_210490131009_OEP
8 pages
MovieRecomendation
No ratings yet
MovieRecomendation
6 pages
Recommendation As Language Processing (RLP) : A Unified Pretrain, Personalized Prompt & Predict Paradigm (P5)
No ratings yet
Recommendation As Language Processing (RLP) : A Unified Pretrain, Personalized Prompt & Predict Paradigm (P5)
19 pages
Exploring The Impact of Large Language Models On Recomender Systems
No ratings yet
Exploring The Impact of Large Language Models On Recomender Systems
10 pages
A First Look at LLM-Powered Generative News Recommendation: Qijiong Liu Nuo Chen
No ratings yet
A First Look at LLM-Powered Generative News Recommendation: Qijiong Liu Nuo Chen
12 pages
Mov Rev Phase II JJJJ
No ratings yet
Mov Rev Phase II JJJJ
20 pages
Movie Recommendation KNN
No ratings yet
Movie Recommendation KNN
5 pages
E96660695201532
No ratings yet
E96660695201532
5 pages
s12559-025-10432-2 (1)
No ratings yet
s12559-025-10432-2 (1)
26 pages
LLM_introduction 2024
No ratings yet
LLM_introduction 2024
77 pages
Appendix
No ratings yet
Appendix
9 pages
Do Llms Understand User Preferences? Evaluating Llms On User Rating Prediction
No ratings yet
Do Llms Understand User Preferences? Evaluating Llms On User Rating Prediction
11 pages
Frad Detection Finfinacial Transaction
No ratings yet
Frad Detection Finfinacial Transaction
8 pages
PPT
No ratings yet
PPT
15 pages
A Sequence-Oblivious Generation Method for Context
No ratings yet
A Sequence-Oblivious Generation Method for Context
11 pages
Google LLM Conversational Recs
No ratings yet
Google LLM Conversational Recs
24 pages
Semantic Textual Similarity
No ratings yet
Semantic Textual Similarity
39 pages
MOvie Recommendation System Project Report
No ratings yet
MOvie Recommendation System Project Report
30 pages
Zero_Shot_Recommendations_with_Pre_Train
No ratings yet
Zero_Shot_Recommendations_with_Pre_Train
11 pages
News Recommendation Systems - Accomplishments Challenges Amp Future Directions
No ratings yet
News Recommendation Systems - Accomplishments Challenges Amp Future Directions
24 pages
2310.04878v1
No ratings yet
2310.04878v1
8 pages
2404.00579
No ratings yet
2404.00579
12 pages
Foundational Models and Architectures S1: Generative AI, #1
From Everand
Foundational Models and Architectures S1: Generative AI, #1
Leaster Startx
No ratings yet
Personalize Movie Recommendation System CS 229 Project Final Writeup
0% (1)
Personalize Movie Recommendation System CS 229 Project Final Writeup
6 pages
fin_irjmets1731397431
No ratings yet
fin_irjmets1731397431
7 pages
Quality Signals in Generated Stories
No ratings yet
Quality Signals in Generated Stories
11 pages
text classification reseach paper
No ratings yet
text classification reseach paper
4 pages
CL Assignments
No ratings yet
CL Assignments
22 pages
Seq2Seq Attention Mechanism
No ratings yet
Seq2Seq Attention Mechanism
19 pages
2302.03269v3
No ratings yet
2302.03269v3
25 pages
Low-Resource Adaptation of Open-Domain Generative Chatbots
No ratings yet
Low-Resource Adaptation of Open-Domain Generative Chatbots
8 pages
Panchbhai 2021
No ratings yet
Panchbhai 2021
6 pages
A Survey On Llms For Recommendation
No ratings yet
A Survey On Llms For Recommendation
10 pages
Building A Movie Recommendation System Using Collaborative Filtering With TF-IDF-IJRASET
No ratings yet
Building A Movie Recommendation System Using Collaborative Filtering With TF-IDF-IJRASET
13 pages
NM (2)_merged
No ratings yet
NM (2)_merged
16 pages
PLACES: Prompting Language Models For Social Conversation Synthesis
No ratings yet
PLACES: Prompting Language Models For Social Conversation Synthesis
25 pages
Statistics with Rust: 50+ Statistical Techniques Put into Action
From Everand
Statistics with Rust: 50+ Statistical Techniques Put into Action
Keiko Nakamura
No ratings yet
Deep Learning For IoT Big Data and Streaming Analytics
No ratings yet
Deep Learning For IoT Big Data and Streaming Analytics
34 pages
Kuka Sunriseos 117 Si en
No ratings yet
Kuka Sunriseos 117 Si en
667 pages
"Versatile Event Management: A Project Report On
No ratings yet
"Versatile Event Management: A Project Report On
60 pages
Case Study CTTS - Milestone 07 Object Analysis Solution
No ratings yet
Case Study CTTS - Milestone 07 Object Analysis Solution
6 pages
ML Mid Question Solve
No ratings yet
ML Mid Question Solve
19 pages
Inverse of A Matrix
No ratings yet
Inverse of A Matrix
10 pages
Web Surfing and Cyber Security-1
No ratings yet
Web Surfing and Cyber Security-1
4 pages
Approaches and Game Elements Used To Tailor Digital Gamification For Learning A Systematic Literature Review
No ratings yet
Approaches and Game Elements Used To Tailor Digital Gamification For Learning A Systematic Literature Review
21 pages
SPA100 200 Provisioning
No ratings yet
SPA100 200 Provisioning
231 pages
Course Title: Project Work-II 15CS67T Study & Student Activity Core
No ratings yet
Course Title: Project Work-II 15CS67T Study & Student Activity Core
13 pages
2020 Stormersyle Mock AMC 10 Solution
No ratings yet
2020 Stormersyle Mock AMC 10 Solution
6 pages
5.-Revised-Tle-As-Css-10-Q3-Installing Antivirus
No ratings yet
5.-Revised-Tle-As-Css-10-Q3-Installing Antivirus
4 pages
CCNA 4 Module 3
No ratings yet
CCNA 4 Module 3
32 pages
Agilent Infinitylab LC Series: Preventive Maintenance Checklist
No ratings yet
Agilent Infinitylab LC Series: Preventive Maintenance Checklist
12 pages
NETGEAR Interactive Catalog
No ratings yet
NETGEAR Interactive Catalog
53 pages
AVR Serial Port Programmer
No ratings yet
AVR Serial Port Programmer
34 pages
TOX TB 10010 en
No ratings yet
TOX TB 10010 en
5 pages
Enlighten Your Polynomial Confusion: Different Strategies in Factoring Polynomials
No ratings yet
Enlighten Your Polynomial Confusion: Different Strategies in Factoring Polynomials
1 page
BK - Achieving Effective Fraud Management - EN - 07 - 23
No ratings yet
BK - Achieving Effective Fraud Management - EN - 07 - 23
22 pages
Film Poster Proposal 1
No ratings yet
Film Poster Proposal 1
3 pages
Unit 4
No ratings yet
Unit 4
20 pages
Test Report
No ratings yet
Test Report
44 pages
Chapter 4 AJP
No ratings yet
Chapter 4 AJP
69 pages
Digi Pay: CSC E-Governance Services India Limited
No ratings yet
Digi Pay: CSC E-Governance Services India Limited
7 pages
CV Template
No ratings yet
CV Template
1 page
Dicom Conformance Statement dicomPACS DX-R 2018-03-01
No ratings yet
Dicom Conformance Statement dicomPACS DX-R 2018-03-01
10 pages
DH-IPC-HFW4431T-S-S4: 4MP WDR IR Mini Bullet Network Camera
No ratings yet
DH-IPC-HFW4431T-S-S4: 4MP WDR IR Mini Bullet Network Camera
3 pages
70r2000D - ABB Zenith MX150 O&M Manual PDF
No ratings yet
70r2000D - ABB Zenith MX150 O&M Manual PDF
28 pages

Zero-Shot Recommendation As Language Modeling

Uploaded by

Zero-Shot Recommendation As Language Modeling

Uploaded by

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

Zero-Shot Recommendation as Language Modeling

Chapter · April 2022

The user has requested enhancement of the downloaded file.

Damien Sileo[0000−0002−3274−291X] , Wout Vossen, and Robbe Raymaekers

Abstract. Recommendation is the task of ranking items (e.g. movies or prod-

We rely on existing pretrained language models. To make a relevance prediction ,

Language models and recommendation Previous work leveraged language modeling

3.2 Mining prompts for recommendation

<m >,...,<m >

Table 1: Occurrence counts of 3-6 grams Fig. 1: Comparison of LM recommen-

recommendations. We select comments where a movie name of the MovieLens dataset

3.3 Effect of the number of ratings per test user

We investigate the effect of the number of mentioned movies in prompts. We expect

3.4 Comparison with matrix factorization and NSP

Figure 3 shows that the proposed LM-based recommendation significantly outper-

3.5 Qualitative analysis of generations

View publication stats

You might also like