
Lecture 14:

Question Answering

Wei Xu
(many slides from Greg Durrett)
QA is very broad
‣ Factoid QA: What states border Mississippi? When was Barack Obama born?
‣ Lots of this could be handled by QA from a knowledge base, if we had a
big enough knowledge base
‣ “Question answering” as a term is so broad as to be meaningless
‣ Is P=NP?
‣ What is 4+5?
‣ What is the translation of [sentence] into French? [McCann et al., 2018]
Classical Question Answering
‣ Form a semantic representation via semantic parsing, then execute it against a structured knowledge base
Q: “where was Barack Obama born”

λx. type(x, Location) ∧ born_in(Barack_Obama, x)


(other representations like SQL possible too…)

‣ How to deal with open-domain data/relations? Need data to learn how to ground every predicate, or need to be able to produce predicates in a zero-shot way
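
‣ As a rough illustration, here is a minimal Python sketch of executing such a logical form against a toy knowledge base (the predicate tables and entities are hypothetical stand-ins, not a real KB):

# Minimal sketch: executing a logical form like
#   λx. type(x, Location) ∧ born_in(Barack_Obama, x)
# against a toy knowledge base. Predicates and entities are hypothetical.
TYPE = {("Honolulu", "Location"), ("Barack_Obama", "Person")}
BORN_IN = {("Barack_Obama", "Honolulu")}
ENTITIES = {"Honolulu", "Barack_Obama"}

def answer(person):
    """All x such that type(x, Location) and born_in(person, x)."""
    return [x for x in ENTITIES
            if (x, "Location") in TYPE and (person, x) in BORN_IN]

print(answer("Barack_Obama"))  # ['Honolulu']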
Reading Comprehension
‣ “AI challenge problem”: answer a question given a context
‣ Recognizing Textual Entailment (2006)
‣ MCTest (2013): 500 passages, 4 questions per passage
‣ Two questions per passage explicitly require cross-sentence reasoning

Richardson et al. (2013)
Dataset Explosion
‣ 10+ QA datasets released since 2015
‣ Children’s Book Test, CNN/Daily Mail, SQuAD, TriviaQA are the most well-known (others: SearchQA, MS MARCO, RACE, WikiHop, …)
‣ Question answering: questions are in natural language
‣ Answers: multiple choice or require picking from the passage
‣ Require human annotation
‣ “Cloze” task: a word (often an entity) is removed from a sentence
‣ Answers: multiple choice, pick from passage, or pick from vocabulary
‣ Can be created automatically from things that aren’t questions (sketched below)
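
‣ For instance, a minimal sketch of automatic cloze creation (the entity span is hard-coded here; a real pipeline would detect it with NER):

# Minimal sketch: blank out an entity to create a cloze question.
sentence = "Mary visited England."
answer = "England"                    # hypothetical detected entity
cloze = sentence.replace(answer, "X")
print(cloze, "->", answer)            # "Mary visited X." -> England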
Children’s Book Test


‣ Children’s Book Test: take a section of a children’s story, block out an entity, and predict it (one-document, multi-sentence cloze task)

Hill et al. (2015)
bAbI
‣ Evaluation on 20 tasks proposed as building blocks for building “AI-complete” systems
‣ Various levels of difficulty, exhibiting different linguistic phenomena
‣ Small vocabulary; language isn’t truly “natural”

Weston et al. (2014)


Dataset Properties
‣ Axis 1: QA vs. cloze (Children’s Book Test)

‣ Axis 2: single-sentence vs. passage


‣ Often shallow methods work well because most answers are in a single sentence (SQuAD, MCTest)
‣ Some explicitly require linking between multiple sentences (MCTest)
‣ Axis 3: single-document (datasets in this lecture) vs. multi-document (TriviaQA, WikiHop, HotpotQA, …)
Memory Networks
‣ Memory networks let you reference the input with attention
‣ Encode input items into two vectors: a key and a value
‣ Keys compute attention weights given a query; a weighted sum of values gives the output

Sukhbaatar et al. (2015)
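
‣ A minimal sketch of one such key-value lookup, with illustrative dimensions and random inputs (not the paper’s exact parameterization):

import numpy as np

def memory_hop(query, keys, values):
    """query: (d,); keys, values: (n, d). Returns a (d,) output vector."""
    scores = keys @ query                     # one attention logit per memory item
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()                  # softmax over memory items
    return weights @ values                   # weighted sum of value vectors

rng = np.random.default_rng(0)
q = rng.normal(size=4)
K, V = rng.normal(size=(3, 4)), rng.normal(size=(3, 4))
print(memory_hop(q, K, V))                    # a 4-dimensional output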


Memory Networks
‣ Three layers of memory network, where the query representation is updated additively based on the memories at each step

‣ How to encode the sentences?


‣ Bag of words (average embeddings)
‣ Positional encoding: multiply each word by a vector capturing its position in the sentence

Sukhbaatar et al. (2015)
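
‣ A minimal sketch of the multi-hop update plus a position-weighted bag-of-words encoder (the linear position weights below stand in for the paper’s exact formula):

import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def encode_sentence(word_vecs):
    """word_vecs: (T, d). Scale each word by a position weight, then sum."""
    T = word_vecs.shape[0]
    pos_weights = np.linspace(0.5, 1.5, T)[:, None]  # illustrative weights
    return (pos_weights * word_vecs).sum(axis=0)

def multi_hop(query, keys, values, hops=3):
    """Three attention hops with additive query updates."""
    u = query
    for _ in range(hops):
        o = softmax(keys @ u) @ values        # attend over memories
        u = u + o                             # additive query update
    return u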


Evaluation: bAbI

‣ A 3-hop memory network does pretty well, better than an LSTM at processing these types of examples
Evaluation: Children’s Book Test
‣ Outperforms LSTMs substantially with the right supervision
Memory Network Takeaways
‣ Memory networks provide a way of attending to abstractions over the input
‣ Useful for cloze tasks where far-back context is necessary
‣ What can we do with more basic attention?


CNN/Daily Mail: Attentive Reader
CNN/Daily Mail
‣ Single-document, (usually) single-sentence cloze task
‣ Formed from article summaries, so the information should mostly be present; this makes it easier than the Children’s Book Test
‣ Need to process the question; can’t just use LSTM LMs

Hermann et al. (2015), Chen et al. (2016)


CNN/Daily Mail
‣ LSTM reader: encode the question, encode the passage, predict the entity (a multiclass classification problem over entities in the document)

X visited England ||| Mary visited England  →  Mary

‣ Can also use textual entailment-like models: X visited England vs. Mary visited England

Hermann et al. (2015), Chen et al. (2016)


CNN/Daily Mail
‣ Attentive Reader:
u = encode query
s = encode sentence
r = attention(u -> s)
prediction = f(candidate, u, r)

‣ Uses fixed-size representations for the final prediction; multiclass classification

Hermann et al. (2015)
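
‣ A minimal sketch of this recipe; the weight matrices W_att, W_u, W_r and the candidate entity embeddings are illustrative parameters, not the paper’s exact model:

import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def attentive_read(u, passage_states, candidate_embs, W_att, W_u, W_r):
    """u: (d,) query; passage_states: (T, d); candidate_embs: (C, d)."""
    scores = passage_states @ (W_att @ u)     # attention logits over passage tokens
    r = softmax(scores) @ passage_states      # fixed-size passage summary
    g = np.tanh(W_u @ u + W_r @ r)            # combine query and summary
    return softmax(candidate_embs @ g)        # distribution over candidate entities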


CNN/Daily Mail
‣ Chen et al. (2016): small changes to the Attentive Reader
‣ Additional analysis of the task found that many of the remaining questions were unanswerable or extremely difficult

Stanford Attentive Reader: 76.2 / 76.5 / 79.5 / 78.7 (results table from slide)

Hermann et al. (2015), Chen et al. (2016)


SQuAD: Bidirectional Attention Flow
SQuAD
‣ Single-document, single-sentence question answering task where the answer is always a substring of the passage
‣ Predict start and end indices of the answer in the passage

Rajpurkar et al. (2016)


SQuAD
What was Marie Curie the first female recipient of?

[figure: passage tokens “first female recipient of the Nobel Prize .” with START and END pointers marking the answer span]

‣ Like a tagging problem over the sentence (not multiclass classification), but we need some way of attending to the query

Rajpurkar et al. (2016)
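
‣ A minimal sketch of span prediction as two tagging distributions over passage positions (w_start and w_end are illustrative learned vectors):

import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def predict_span(token_states, w_start, w_end, max_len=15):
    """token_states: (T, d). Returns the highest-scoring (start, end) pair."""
    p_start = softmax(token_states @ w_start)
    p_end = softmax(token_states @ w_end)
    best, span = -1.0, (0, 0)
    for i in range(len(p_start)):
        for j in range(i, min(i + max_len, len(p_end))):
            if p_start[i] * p_end[j] > best:  # best valid pair with start <= end
                best, span = p_start[i] * p_end[j], (i, j)
    return span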


Bidirectional Attention Flow
‣ Passage (context) and query are both encoded with BiLSTMs
‣ Context-to-query attention: compute a softmax over the columns of S, take a weighted sum of u based on the attention weights for each passage word
S_ij = h_i · u_j   (passage states H, query states U)
α_ij = softmax_j(S_ij)   (a distribution over query words)
ũ_i = Σ_j α_ij u_j   (the query “specialized” to the i-th passage word)
Seo et al. (2016)
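
‣ A minimal sketch of these equations; note that BiDAF’s actual similarity is a learned function of [h; u; h∘u], so the plain dot product here is a simplification:

import numpy as np

def context_to_query(H, U):
    """H: (T_passage, d) passage states; U: (T_query, d) query states."""
    S = H @ U.T                                  # (T_passage, T_query) similarities
    A = np.exp(S - S.max(axis=1, keepdims=True))
    A /= A.sum(axis=1, keepdims=True)            # softmax over query words per row
    return A @ U                                 # ũ: query "specialized" per passage word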
Bidirectional Attention Flow

Each passage word now “knows about” the query

Seo et al. (2016)


QA with BERT

What was Marie Curie the first female recipient of ? [SEP] One of the most famous people born in Warsaw was Marie …

‣ Predict start and end positions in the passage
‣ No need for cross-attention mechanisms!

Devlin et al. (2019)
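
‣ A minimal usage sketch with the Hugging Face pipeline API (assumes the transformers package is installed; the checkpoint named below is one common public SQuAD-finetuned model):

from transformers import pipeline

# Load a SQuAD-finetuned extractive QA model.
qa = pipeline("question-answering",
              model="distilbert-base-cased-distilled-squad")
out = qa(question="What was Marie Curie the first female recipient of?",
         context="One of the most famous people born in Warsaw was Marie "
                 "Curie, the first female recipient of the Nobel Prize.")
print(out["answer"])  # expect a span like "the Nobel Prize"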
SQuAD SOTA: 2018
‣ BiDAF: 73 EM / 81 F1

‣ nlnet, QANet, r-net: dueling, super complex systems (much more so than BiDAF…)

‣ BERT: transformer-based approach with pretraining on 3B tokens
SQuAD 2.0 SOTA: Spring 2019
‣ SQuAD 2.0: harder dataset because some questions are unanswerable
‣ Industry contest
SQuAD 2.0 SOTA: Today
‣ Performance is very saturated
‣ Harder QA settings are needed!
TriviaQA
‣ Fully figuring this example out is very challenging
‣ Coref: “the failed campaign” ↔ “movie of the same name”
‣ Lots of surface clues: 1961, campaign, etc.
‣ Systems can do well without really understanding the text

Joshi et al. (2017)
What are these models learning?
‣ “Who…”: knows to look for people
‣ “Which film…”: can identify movies and then spot keywords that are related to the question
‣ Unless questions are made super tricky (targeting closely related entities that are easily confused), they’re usually not so hard to answer
Takeaways
‣ Many flavors of reading comprehension tasks: cloze or actual questions, single- or multi-sentence
‣ Memory networks let you reference the input in an attention-like way, useful for generalizing language models to long-range reasoning
‣ Complex attention schemes can match queries against input texts and identify answers