SlideShare a Scribd company logo
Sequence-to-Sequence
Generation for Spoken Dialogue
via Deep Syntax Trees and Strings
Ondrˇej Dusˇek and Filip Jurcˇ ́ıcˇek
ACL, 2016
発表者 B4 尾形朋哉
1
概要
• seq2seqを用いて構文木だけではなく自然言語文を出力する手
法の提案
• 2stepによる生成と1stepによる手法の比較
• どちらの手法もとても少ないトレーニングデータで動作した
2
タスク
3
入力のDialogue actに対して自然言語文を出力
提案手法
• 入力:Dialogue Acts(DA)
• 2つの手法
1. 構文木を経由して自然言語文を出力
2. 直接自然言語文を出力
1はDAから構文木を出力し、構文木をTreex NLP toolkitの
surface realizerにより自然言語文に変換
2はDAから直接自然言語文を出力
4
Seq2seq Generator
Reranker
surface
realizer
提案手法
• 入力:Dialogue Acts(DA)
• 2つの手法
1. 構文木を経由して自然言語文を出力
2. 直接自然言語文を出力
1は複雑な表層構文などを人手で取り除き生成を簡単化できる
文法的正しさを常に保証できる
2のモードは構造を明示的にモデル化する必要がない
累積誤差を避けることができる
5
Seq2seq Generator
Reranker
surface
realizer
DA, deep syntax trees, 文の表現
• DAとdeep syntax treesと文をトークン列として表現する
• それぞれのトークンはembeddingとして表現される
• DAの表現列を形成するためにDAのそれぞれのスロットに対し
て”DA type, slot, value”の3つ組として表現
6
Seq2seq Generator
• Encoder
• Decoder
beam searchを行い、top-nの出力の対数尤度を保持する
7
Reranker
• n-best beam searchの出力に対してrerankし、情報抜けや不要
な情報を付加しているものにペナルティをかける
• 出力と入力のDAのベクトルのハミング距離でペナルティをかけ
る
• ペナルティがn-bestリストの対数尤度から引かれrerankされる
• Reranker
8
実験
• BAGEL data set
• restaurant information domainの202のDA
• それぞれ二つの言い換えがあり、それぞれを個々のものとして
扱った
• 181のトレーニングと21のテストに対して10分割交差検定を
行った
• Seq2seq GeneratorとRerankerはそれぞれAdam optimizerによ
り交差エントロピーを最小化するようにトレーニング
9
実験詳細
• レストランの名前や電話番号などは抽象化する
• 人手で付与されたスロットと値のアライメントは使用しなかっ
た
• データは小文字にし、文字列生成の際複数形のsは別のトーク
ンとして扱った
• 構造木を出力するモデルの学習のためにTreex NLP toolkitを用
いて構文木を獲得した
10
結果
• rerankerを用いない場合は2stepの手法の方が意味的誤りが少
なく、BLEU/NISTも高かった
• rerankerを用いるとjointモデルは2stepの手法と意味誤りの数
は同じで、BLEU/NISTは高くなった
11
出力
12
結論
• 構文木を介した2ステップの手法と直接文字列を生成する手法
を比較した。
• それぞれ一定の性能を見せたが、それらの出力はかなり違うも
のとなった。
• 直接文字列を生成する手法の方がより好ましく、nグラムベー
スのものよりもかなり高いスコアと同程度の意味的誤りになっ
た
• RNNベースの手法で用いられるものよりもかなり小さいトレー
ニングデータで意味のある発話を出力するよう学習できた。
13
Ad

More Related Content

Featured (20)

Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...
Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...
Artificial Intelligence, Data and Competition – SCHREPEL – June 2024 OECD dis...
OECD Directorate for Financial and Enterprise Affairs
 
How to Leverage AI to Boost Employee Wellness - Lydia Di Francesco - SocialHR...
How to Leverage AI to Boost Employee Wellness - Lydia Di Francesco - SocialHR...How to Leverage AI to Boost Employee Wellness - Lydia Di Francesco - SocialHR...
How to Leverage AI to Boost Employee Wellness - Lydia Di Francesco - SocialHR...
SocialHRCamp
 
2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
Marius Sescu
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
Expeed Software
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
Pixeldarts
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
ThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
marketingartwork
 
Skeleton Culture Code
Skeleton Culture CodeSkeleton Culture Code
Skeleton Culture Code
Skeleton Technologies
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
SpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Lily Ray
 
How to have difficult conversations
How to have difficult conversations How to have difficult conversations
How to have difficult conversations
Rajiv Jayarajah, MAppComm, ACC
 
Introduction to Data Science
Introduction to Data ScienceIntroduction to Data Science
Introduction to Data Science
Christy Abraham Joy
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
Vit Horky
 
How to Leverage AI to Boost Employee Wellness - Lydia Di Francesco - SocialHR...
How to Leverage AI to Boost Employee Wellness - Lydia Di Francesco - SocialHR...How to Leverage AI to Boost Employee Wellness - Lydia Di Francesco - SocialHR...
How to Leverage AI to Boost Employee Wellness - Lydia Di Francesco - SocialHR...
SocialHRCamp
 
2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot2024 State of Marketing Report – by Hubspot
2024 State of Marketing Report – by Hubspot
Marius Sescu
 
Everything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPTEverything You Need To Know About ChatGPT
Everything You Need To Know About ChatGPT
Expeed Software
 
Product Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage EngineeringsProduct Design Trends in 2024 | Teenage Engineerings
Product Design Trends in 2024 | Teenage Engineerings
Pixeldarts
 
How Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental HealthHow Race, Age and Gender Shape Attitudes Towards Mental Health
How Race, Age and Gender Shape Attitudes Towards Mental Health
ThinkNow
 
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdfAI Trends in Creative Operations 2024 by Artwork Flow.pdf
AI Trends in Creative Operations 2024 by Artwork Flow.pdf
marketingartwork
 
PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024PEPSICO Presentation to CAGNY Conference Feb 2024
PEPSICO Presentation to CAGNY Conference Feb 2024
Neil Kimberley
 
Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)Content Methodology: A Best Practices Report (Webinar)
Content Methodology: A Best Practices Report (Webinar)
contently
 
How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024How to Prepare For a Successful Job Search for 2024
How to Prepare For a Successful Job Search for 2024
Albert Qian
 
Social Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie InsightsSocial Media Marketing Trends 2024 // The Global Indie Insights
Social Media Marketing Trends 2024 // The Global Indie Insights
Kurio // The Social Media Age(ncy)
 
Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024Trends In Paid Search: Navigating The Digital Landscape In 2024
Trends In Paid Search: Navigating The Digital Landscape In 2024
Search Engine Journal
 
5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary5 Public speaking tips from TED - Visualized summary
5 Public speaking tips from TED - Visualized summary
SpeakerHub
 
ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd ChatGPT and the Future of Work - Clark Boyd
ChatGPT and the Future of Work - Clark Boyd
Clark Boyd
 
Getting into the tech field. what next
Getting into the tech field. what next Getting into the tech field. what next
Getting into the tech field. what next
Tessa Mero
 
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search IntentGoogle's Just Not That Into You: Understanding Core Updates & Search Intent
Google's Just Not That Into You: Understanding Core Updates & Search Intent
Lily Ray
 
Time Management & Productivity - Best Practices
Time Management & Productivity -  Best PracticesTime Management & Productivity -  Best Practices
Time Management & Productivity - Best Practices
Vit Horky
 

[ACL2016]Sequence-to-Sequence Generation for Spoken Dialogue via Deep Syntax Trees and Strings

Editor's Notes

  • #7: seq2seqのgeneratorから出力された構造木は角括弧記法で表される
  • #9: Rerankerはdialogue actとslot valueの組があるかどうかを2値で出力する
  • #12: 加えてjointモデルは外部のsurface realizerを必要としない。
  • #13: 意味的に閉じたモノの混同が多く見られる。たとえばFrenchとすべきところをItalianとしてしまう with treesの方は構文的流暢性が低い、繰り返しが起きる