Replika: Building An Emotional Conversation With Deep Learning
Fuzzy matching
Classifiers
Parser
Retrieval-based model
Generative model
Retrieval-based dialog model:
Basic architecture
Word embeddings: pre-initialised with 300-dimensional word2vec
Similarity loss
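As a rough illustration of the ranking idea on this slide, the sketch below scores candidate responses by cosine similarity between a context vector and a response vector, and computes a margin-based (triplet) loss. The toy 3-dimensional embeddings, the averaging encoder, the `margin` value and the choice of a triplet loss are all assumptions made for illustration; the slides only mention a similarity loss and 300-dimensional word2vec pre-initialisation.

```python
import math

DIM = 3  # toy size; the slides mention 300-dimensional word2vec vectors

# Hypothetical hand-made embeddings, standing in for pre-trained word2vec.
EMBEDDINGS = {
    "sad": [1.0, 0.0, 0.0],
    "sorry": [0.9, 0.1, 0.0],
    "happy": [0.0, 1.0, 0.0],
    "great": [0.1, 0.9, 0.0],
    "weather": [0.0, 0.0, 1.0],
}


def encode(text):
    """Average the embeddings of known words (stand-in for a learned encoder)."""
    vectors = [EMBEDDINGS[w] for w in text.lower().split() if w in EMBEDDINGS]
    if not vectors:
        return [0.0] * DIM
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def triplet_loss(context, positive, negative, margin=0.2):
    """Zero once the true response beats the negative one by at least `margin`."""
    c = encode(context)
    return max(0.0, margin - cosine(c, encode(positive)) + cosine(c, encode(negative)))


def rank(context, candidates):
    """Order candidate responses from most to least similar to the context."""
    c = encode(context)
    return sorted(candidates, key=lambda r: cosine(c, encode(r)), reverse=True)
```

At inference time only `rank` is needed: candidate response vectors can be computed ahead of time, so each request costs one context encoding plus similarity lookups.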
Fuzzy matching model
• We use the pre-trained context encoder of the
retrieval-based model as the body of a siamese
network
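The siamese idea can be sketched as follows: the same encoder is applied to the user's message and to each scripted trigger phrase, and the two encodings are compared. Here a character-trigram counter stands in for the pre-trained context encoder, and `fuzzy_match`, the template phrases and the `threshold` value are hypothetical names and numbers chosen for illustration, not the production setup.

```python
import math
from collections import Counter


def encode(text):
    """Shared encoder for both branches (placeholder: character-trigram counts)."""
    cleaned = text.lower().strip()
    return Counter(cleaned[i:i + 3] for i in range(len(cleaned) - 2))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm = math.sqrt(sum(c * c for c in a.values())) * math.sqrt(sum(c * c for c in b.values()))
    return dot / norm if norm else 0.0


def fuzzy_match(user_text, templates, threshold=0.5):
    """Return the closest template, or None if nothing is similar enough."""
    if not templates:
        return None
    query = encode(user_text)
    best = max(templates, key=lambda t: cosine(query, encode(t)))
    return best if cosine(query, encode(best)) >= threshold else None
```

Because both inputs pass through one encoder, swapping the trigram placeholder for a trained context encoder would not change the matching logic.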
Basic seq2seq (+ persona-based)
HRED seq2seq
Generative seq2seq dialog model:
Improvements
• HRED (context history): +20% quality as rated by users!
• Training from scratch takes ~1 week (both for seq2seq and ranking models)
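To make the "context history" point concrete, here is a minimal sketch of the hierarchical encoding used in HRED-style models: one recurrence runs over the words of each utterance, and a second recurrence runs over the resulting utterance vectors. The weights are fixed and untrained, the recurrence is elementwise rather than a full matrix RNN, the decoder is omitted, and `word_vector` is a hypothetical placeholder for real embeddings.

```python
import math

DIM = 4  # toy hidden size (hypothetical)


def word_vector(word):
    """Deterministic toy embedding derived from character codes (placeholder)."""
    total = sum(ord(ch) for ch in word)
    return [((total * (i + 1)) % 17) / 17.0 - 0.5 for i in range(DIM)]


def rnn_step(hidden, inputs, w_h=0.5, w_x=1.0):
    """One step of a simplified elementwise RNN: h' = tanh(w_h * h + w_x * x)."""
    return [math.tanh(w_h * h + w_x * x) for h, x in zip(hidden, inputs)]


def encode_utterance(utterance):
    """Lower level: run the recurrence over the words of one utterance."""
    hidden = [0.0] * DIM
    for word in utterance.lower().split():
        hidden = rnn_step(hidden, word_vector(word))
    return hidden


def encode_context(history):
    """Upper level: run a second recurrence over the utterance vectors."""
    state = [0.0] * DIM
    for utterance in history:
        state = rnn_step(state, encode_utterance(utterance))
    return state
```

In a full model the final context state would condition the decoder, so a reply can depend on earlier turns and not only on the last message.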
Inference
• We have around 30M service requests per day and 100 RPS per
model at peak
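A quick back-of-the-envelope check of these load figures: the sketch converts requests per day into an average request rate and uses Little's law (in-flight requests = arrival rate × latency) to estimate concurrency. The function names are made up for this example, and the 50 ms latency in the usage note is a hypothetical value, not a measured one.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def average_rps(requests_per_day):
    """Average requests per second implied by a daily request count."""
    return requests_per_day / SECONDS_PER_DAY


def in_flight_requests(rps, latency_seconds):
    """Little's law: mean number of requests being served at once."""
    return rps * latency_seconds
```

For example, `average_rps(30_000_000)` comes to roughly 347 requests per second across the service, and `in_flight_requests(100, 0.05)` suggests that a model serving 100 RPS at an assumed 50 ms per request handles about five requests concurrently.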
Online
Thanks!