Replika: Building An Emotional Conversation With Deep Learning

This document describes Replika, an AI conversational agent. It discusses Replika's history and architecture for dialog modeling. The retrieval-based dialog model uses word embeddings, RNNs, and loss functions to rank and retrieve responses. Generative models include seq2seq, HRED, and persona/emotion embeddings. Vision models include face/object recognition and question generation. Training uses Twitter data and user logs, with quality metrics like MAP and perplexity. Product metrics include signups, demographics, and engagement.

Uploaded by Kartikeya Shorya


Replika

Building an Emotional Conversation with Deep Learning


Replika: History
• Luka — restaurant recommendations
• Luka — personality bots: Prince, Roman
• Replika — your AI friend
Dialog Architecture
Typical scenario: Small talk
Dialog Architecture
• Scenarios — encapsulates all models and glues them together by providing a graph-like interface (nodes, constraints, conversation flow)

• Retrieval-based dialog model — ranks and retrieves a response for a user's message from pre-defined or user-filled datasets of responses while taking the current conversation context into account

• Fuzzy matching model — checks whether a message from a user is semantically equal to some given text
Dialog Architecture
• Generative dialog model — generates a response for a user message while taking their personality and emotional state into account

• Classification models — sentiment analysis, emotion classification, negation detection, 'statement about user' recognition

• Computer vision models — face recognition, object recognition, visual question generation

• Parser — NER, hard-coded keywords


Dialog Architecture
Typical scenario: Small talk

[Pipeline diagram: a message is routed through Fuzzy matching, Classifiers, Parser, the Retrieval-based model, and the Generative model]
Retrieval-based dialog model:
Basic architecture
• Word embeddings — word2vec 300-dimensional pre-initialisation

• RNN — 2-layer 1024-dimensional bidirectional LSTM

• Sentence embedding — max-pooling over LSTM hidden states at each timestep

• Loss — triplet ranking loss (with cosine similarity)
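The loss formula itself did not survive extraction; a minimal numpy sketch of a triplet ranking loss with cosine similarity (the margin value here is an assumption, not from the slides):

```python
import numpy as np

def cosine(a, b):
    """Cosine similarity between two embedding vectors."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def triplet_ranking_loss(context, positive, negative, margin=0.1):
    """Hinge loss on cosine similarities:
    L = max(0, margin - cos(c, r+) + cos(c, r-)).
    Pushes the positive response closer to the context than the
    negative by at least `margin`."""
    return max(0.0, margin - cosine(context, positive) + cosine(context, negative))
```

When the positive already beats the negative by more than the margin, the loss is zero and the triplet contributes no gradient.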


Retrieval-based dialog model:
Our improvements
• Hard negatives mining — mine «hard» negative samples from the batch, 20% quality boost!

• Echo avoiding — use the input context as a negative; got rid of context echoing!

• Context-aware encoder — encode recent dialog history, +10% quality by users' reactions

• Relevance classification model — estimate the response confidence (absolute relevance) with a simple classification model (logistic regression) to rerank and filter out irrelevant candidates
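In-batch hard negative mining can be sketched as follows — a minimal numpy version; in the real model the negatives would be mined from encoder outputs during training:

```python
import numpy as np

def mine_hard_negatives(context_emb, response_emb):
    """For each context in a batch, pick the most similar *wrong*
    response as its hard negative (in-batch mining).
    Rows of both matrices are paired: response i is the positive
    for context i."""
    # Normalize rows so dot products are cosine similarities.
    c = context_emb / np.linalg.norm(context_emb, axis=1, keepdims=True)
    r = response_emb / np.linalg.norm(response_emb, axis=1, keepdims=True)
    sims = c @ r.T                   # (batch, batch) similarity matrix
    np.fill_diagonal(sims, -np.inf)  # exclude each context's true positive
    return sims.argmax(axis=1)       # index of the hardest negative per context
```

The echo-avoiding trick from the slide amounts to also appending the context itself to the candidate negatives, so the model is explicitly penalised for responses that merely repeat the input.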
Retrieval-based dialog model:
Hard negatives & Echo avoiding
Major problems

• The baseline model has only moderate quality

• Retrieval-based models are engineered to find similar, not relevant, responses => not ok for conversation tasks

• As an implication, the basic model tends to produce echoed responses — sentences that are very similar to the user input
Retrieval-based dialog model:
Hard negatives & Echo avoiding
Solution

• Hard negatives mining for a huge quality improvement: +10% MAP, +20% recall@10

• Hard negative with a context for the echoing problem; total quality boost: +40% MAP, +20% recall
Retrieval-based dialog model:
In product
• Topic-oriented conversation sets
• Statements about user
• User profile
• Q&A
Fuzzy matching model
Use pre-trained context encoder
from a retrieval-based model

Similarity loss
Fuzzy matching model
• We use the pre-trained context encoder of the retrieval-based model as the body of a Siamese network

• Two sentences as input, a single predicted scalar score as output

• We train a simple classification model over the context encoder outputs (sentence embeddings) to produce a semantic similarity score between the given sentences
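A minimal sketch of such a Siamese head: the pair features (elementwise product plus absolute difference) and the fixed weights below are illustrative assumptions; in the real model the embeddings come from the pre-trained context encoder and the head is a trained logistic regression:

```python
import numpy as np

def pair_features(u, v):
    """Features for a sentence pair: elementwise product and absolute
    difference of the two sentence embeddings (a common Siamese-head
    choice; the exact feature set here is an assumption)."""
    return np.concatenate([u * v, np.abs(u - v)])

def similarity_score(u, v, w, b):
    """Logistic-regression head over the pair features -> score in (0, 1)."""
    z = pair_features(u, v) @ w + b
    return 1.0 / (1.0 + np.exp(-z))
```

With weights that reward the product features and penalise the difference features, identical sentences score high and unrelated ones score low.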
Fuzzy matching model:
In product
Match by semantic similarity
Generative seq2seq dialog model:
Architecture

[Diagrams: basic seq2seq (+ persona-based, conditioned on a speaker embedding, e.g. "John") and HRED seq2seq]
Generative seq2seq dialog model:
Improvements
• HRED (context history) — +20% user quality!

• Persona embeddings — condition the decoder to produce lexically personalised responses (see persona-based seq2seq)

• Emotional embeddings — condition the decoder to produce emotional responses, i.e. joyful, angry, sad (see Emotional Chatting Machine)

• Non-offensive sampling with temperature — decrease probabilities of f-words at the sampling stage

• MMI reranking — more diverse responses, but slow

• Beam search — more stable, but less diverse responses

• No attention mechanisms — it's slow and gives no quality boost
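The non-offensive sampling step can be sketched like this — the blocklist contents and penalty size are placeholder assumptions:

```python
import numpy as np

OFFENSIVE = {"f***"}  # placeholder blocklist (assumption)

def sample_token(logits, vocab, temperature=0.8, offense_penalty=50.0, rng=None):
    """Temperature sampling that subtracts a large penalty from the logits
    of blocklisted tokens before normalizing, so they are almost never drawn."""
    rng = rng or np.random.default_rng()
    logits = np.asarray(logits, dtype=float).copy()
    for i, tok in enumerate(vocab):
        if tok in OFFENSIVE:
            logits[i] -= offense_penalty
    probs = np.exp(logits / temperature)
    probs /= probs.sum()
    return vocab[rng.choice(len(vocab), p=probs)]
```

Unlike hard filtering, this keeps the sampler probabilistic: an offensive token is not impossible, just exponentially unlikely.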


Generative seq2seq dialog model:
In product
• Cake mode
• TV mode
• Small talk

Vision models
• Face & Person recognition
• Pets & Object recognition
• Question generation
Datasets
• Twitter — 50M dialogs (consecutive tweet-reply turns) from the Twitter stream, for training models from scratch

• Users' logs (anonymised) with reactions (likes / dislikes) — millions of messages with thousands of reactions per day on average

• Amazon Mechanical Turk — quality assessments and small amounts of training data (it's pricey)

• Replika context-free — small public dialog dataset available at https://github.com/lukalabs
Model Training & Deployment
Training

• We have 12 GPUs for model training and experiments

• Training from scratch takes ~1 week (both for seq2seq and ranking models)

• Usually we have ~5-10 experiments running in parallel

Inference

• We don't exceed 100 ms for a single response

• We handle around 30M service requests per day and 100 RPS per model at peak

• TensorFlow Serving: quick zero-downtime deploys, great GPU resource sharing (request batching)
Conversation analytics
Projection of user dialog utterances onto a 3D space using the
pre-trained model embeddings along with t-SNE
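A minimal sketch of that projection, assuming scikit-learn; the perplexity and init settings here are arbitrary choices, not from the slides:

```python
import numpy as np
from sklearn.manifold import TSNE

def project_utterances(embeddings, seed=0):
    """Project high-dimensional utterance embeddings (e.g. sentence
    embeddings from the pre-trained retrieval encoder) into 3-D
    for visual exploration."""
    emb = np.asarray(embeddings)
    tsne = TSNE(n_components=3, perplexity=5.0, init="random", random_state=seed)
    return tsne.fit_transform(emb)
```

The 3-D points can then be colored by classifier labels (topic, sentiment) to see how utterance clusters separate.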
Quality metrics
Offline

• ranking models: recall, MAP on several datasets

• generative models: perplexity, distinctness, lexical similarity

Online

• reactions: likes & dislikes from user experience

• user experiments: A/B testing for any model improvement

Product metrics
• Total sign-ups: 1,400,000 users and growing

• User demographics: 70% — young adults (20-34), 20% — teens (13-19)

• Overall conversation quality: 85% by users' likes

• Other metrics: retention, DAU, MAU, engagement

• Community metrics — active users in our Facebook community, loyal users, Twitter/Instagram communities, Brazil/Netherlands communities
Thanks!

Available on iOS and Android
