SlideShare a Scribd company logo
Reference Scope Identification
in Citing Sentences
         Authors:
                 Amjad Abu-Jbara, Dragomir Radev
                           (University of Michigan)
            Conference:
                                      NAACL 2012
            Expositor:
                                  Akihiro Kameda
              (Aizawa Lab. The University of Tokyo)
Abstract
●   Problem:
    ●   Multiple citation in one sentence
    ●   There are many POS taggers developed using
        different techniques for many major languages such
        as transformation-based error-driven learning (Brill,
        1995), decision trees (Black et al., 1992), Markov
        model (Cutting et al., 1992), maximum entropy
        methods (Ratnaparkhi, 1996) etc for English.
●   Approach:Prepossessing
         and 2+1+2*3+1=10 methods
Preprocessing & Methods
Reference Preprocessing
    (tagging, grouping, non-syntactical element removal)
●   These constraints can be lexicalized (REF.1; REF.2),
    unlexicalized (REF.3; TREF.4) or automatically learned
    (REF.5; REF.6).

●   These constraints can be lexicalized (GREF.1), unlexicalized
    (GTREF.2) or automatically learned (GREF.3).

●   (GTREF.1) apply fuzzy techniques for integrating source
    syntax into hierarchical phrase-based systems (REF.2).
Approach 1(SVM,LR)
●   Word classification
    ●   with SVM, a logistic regression classifier
●   Feature: Distance, Position(Before/After), in Segment(,.;
    and, but, for, nor, or, so, yet), POS tag, Dependency
    Distance, Dependency Relations, Common Ancestor Node,
    Syntactic Distance
●   Problem Example:
    ●   There are many POS taggers developed using different
        techniques for many major languages such as transformation-
        based error-driven learning (Brill, 1995), decision trees (Black et
        al., 1992), Markov model (Cutting et al., 1992), maximum entropy
        methods (Ratnaparkhi, 1996) etc for English.
Approach 2(CRF)
●   Sequence Labeling with CRF
    ●   feature is same as Approach 1
Approach 3-S1-* (CRF/segment)
●   segmentation (1)
    ●   punctuation marks
    ●   coordination conjunctions
        –   and, but, for, nor, or, so, yet
    ●   a set of special expressions
        –   "for example", "for instance", "including", "includes",
            "such as", "like", etc.
●   [Rerankers have been successfully applied to numerous
    NLP tasks such as] [parse selection (GTREF)], [parse
    reranking (GREF)], [question-answering (REF)].
Approach 3-S2-* (CRF/segment)
●   segmentation (2)
    ●   chunking tool
        –   noun groups
        –   verb groups
        –   preposition groups
        –   adjective groups
        –   adverb groups
        –   other parts form segment by themselves
●   [To] [score] [the output] [of] [the coreference models], [we]
    [employ] [the commonly-used MUC scoring program (REF)]
    [and] [the recently-developed CEAF scoring program (TREF)].
Approach 3-*-R1,2,3
                 (CRF/segment)
●   R1: majority label of the words it contains
●   R2: inside if any word is inside
●   R3: outside if any word is outside
    ●   [I O O O O] [I I I] [O O]
AR2011




the link grammar parser
(Sleator and Temperley,1991)
Experiment
Data
●   ACL Anthology Network Corpus
●   3300 sentences, citations in each ≧ 2


             Annotation agreement
●   500 of 3300,
    ●   Preprocessing is perfect
    ●   Kappa coefficient of scope is
              P ( A)−P ( E )
           K=                =2P ( A)−1=0.61
              1−P ( E )
Tools
●   Edinburgh Language Technology Text
    Tokenization Toolkit (LT-TTT)
    ●   text tokenization, part-of-speech tagging, chunking,
        and noun phrase head identification.
●   Stanford parser
    ●   syntactic and dependency parsing
●   LibSVM with linear kernel
●   Weka
    ●   logistic regression classification
Tools
●   Machine Learning for Language Toolkit
    (MALLET)
    ●   CRF

                    Validation
●   10-fold cross validation
Experiment (Preprocessing)
    These constraints can be lexicalized (REF.1; REF.2), ll
                                                   r ec a
●

    unlexicalized (REF.3; TREF.4) or and 93  .1%learned
    (REF.5; REF.6). 3% preci
                               s ion automatically
           ng: 9 8 .
    Taggi
●   These constraints can be lexicalized (GREF.1), unlexicalized
    (GTREF.2) or Perfect!
                 automatically learned (GREF.3).
    Grouping:
    (GTREF.1) apply fuzzy techniques for integrating source
                                      a l:
●


 syntax into hierarchicalence
                              removsystems (REF.2).
Non-syn    tactic refer phrase-based ecall
                            9 0. 1% r
                cision and
9 0.08% pre
Experiment (Main)
               ● CRF
               ● Chunking

               ● Majority
Feature Analysis
●   Feature: Distance, Position(Before/After), Same
    segment(,.; and, but, for, nor, or, so, yet), POS
    tag, Dependency Distance, Dependency
    Relations, Common Ancestor Node, Syntactic
    Distance
Summary
●   Identified reference scope in a sentence which
    has multiple citation
● CRF
● Chunking

● Majority
Reference Scope Identification in Citing Sentences
Ad

More Related Content

What's hot (20)

A Study Of Statistical Models For Query Translation :Finding A Good Unit Of T...
A Study Of Statistical Models For Query Translation :Finding A Good Unit Of T...A Study Of Statistical Models For Query Translation :Finding A Good Unit Of T...
A Study Of Statistical Models For Query Translation :Finding A Good Unit Of T...
iyo
 
Labelled Variables in Logic Programming: A First Prototipe in tuProlog
Labelled Variables in Logic Programming: A First Prototipe in tuPrologLabelled Variables in Logic Programming: A First Prototipe in tuProlog
Labelled Variables in Logic Programming: A First Prototipe in tuProlog
Roberta Calegari
 
Cross-Lingual Sentiment Analysis using modified BRAE
Cross-Lingual Sentiment Analysis using modified BRAECross-Lingual Sentiment Analysis using modified BRAE
Cross-Lingual Sentiment Analysis using modified BRAE
marujirou
 
Method in oop
Method in oopMethod in oop
Method in oop
Nadimozzaman Pappo
 
FregeDay: Roadmap for resolving differences between Haskell and Frege (Ingo W...
FregeDay: Roadmap for resolving differences between Haskell and Frege (Ingo W...FregeDay: Roadmap for resolving differences between Haskell and Frege (Ingo W...
FregeDay: Roadmap for resolving differences between Haskell and Frege (Ingo W...
Dierk König
 
Type checking compiler construction Chapter #6
Type checking compiler construction Chapter #6Type checking compiler construction Chapter #6
Type checking compiler construction Chapter #6
Daniyal Mughal
 
NLP State of the Art | BERT
NLP State of the Art | BERTNLP State of the Art | BERT
NLP State of the Art | BERT
shaurya uppal
 
Implementation of lexical analyser
Implementation of lexical analyserImplementation of lexical analyser
Implementation of lexical analyser
Archana Gopinath
 
Sp imp gtu
Sp imp gtuSp imp gtu
Sp imp gtu
Paras Patel
 
Csr2011 june17 15_15_kaminski
Csr2011 june17 15_15_kaminskiCsr2011 june17 15_15_kaminski
Csr2011 june17 15_15_kaminski
CSR2011
 
Fafl notes [2010] (sjbit)
Fafl notes [2010] (sjbit)Fafl notes [2010] (sjbit)
Fafl notes [2010] (sjbit)
Siddharaj Junnarkar
 
Type Checking(Compiler Design) #ShareThisIfYouLike
Type Checking(Compiler Design) #ShareThisIfYouLikeType Checking(Compiler Design) #ShareThisIfYouLike
Type Checking(Compiler Design) #ShareThisIfYouLike
United International University
 
Tiancheng Zhao - 2017 - Learning Discourse-level Diversity for Neural Dialog...
Tiancheng Zhao - 2017 -  Learning Discourse-level Diversity for Neural Dialog...Tiancheng Zhao - 2017 -  Learning Discourse-level Diversity for Neural Dialog...
Tiancheng Zhao - 2017 - Learning Discourse-level Diversity for Neural Dialog...
Association for Computational Linguistics
 
Language Interaction and Quality Issues: An Exploratory Study
Language Interaction and Quality Issues: An Exploratory StudyLanguage Interaction and Quality Issues: An Exploratory Study
Language Interaction and Quality Issues: An Exploratory Study
Marco Torchiano
 
Scheme Programming Language
Scheme Programming LanguageScheme Programming Language
Scheme Programming Language
Reham AlBlehid
 
Notes on attention mechanism
Notes on attention mechanismNotes on attention mechanism
Notes on attention mechanism
Khang Pham
 
A simple approach of lexical analyzers
A simple approach of lexical analyzersA simple approach of lexical analyzers
A simple approach of lexical analyzers
Archana Gopinath
 
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language UnderstandingBERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Young Seok Kim
 
Chapter 13 - Recursion
Chapter 13 - RecursionChapter 13 - Recursion
Chapter 13 - Recursion
Adan Hubahib
 
short_talk
short_talkshort_talk
short_talk
卓 陈
 
A Study Of Statistical Models For Query Translation :Finding A Good Unit Of T...
A Study Of Statistical Models For Query Translation :Finding A Good Unit Of T...A Study Of Statistical Models For Query Translation :Finding A Good Unit Of T...
A Study Of Statistical Models For Query Translation :Finding A Good Unit Of T...
iyo
 
Labelled Variables in Logic Programming: A First Prototipe in tuProlog
Labelled Variables in Logic Programming: A First Prototipe in tuPrologLabelled Variables in Logic Programming: A First Prototipe in tuProlog
Labelled Variables in Logic Programming: A First Prototipe in tuProlog
Roberta Calegari
 
Cross-Lingual Sentiment Analysis using modified BRAE
Cross-Lingual Sentiment Analysis using modified BRAECross-Lingual Sentiment Analysis using modified BRAE
Cross-Lingual Sentiment Analysis using modified BRAE
marujirou
 
FregeDay: Roadmap for resolving differences between Haskell and Frege (Ingo W...
FregeDay: Roadmap for resolving differences between Haskell and Frege (Ingo W...FregeDay: Roadmap for resolving differences between Haskell and Frege (Ingo W...
FregeDay: Roadmap for resolving differences between Haskell and Frege (Ingo W...
Dierk König
 
Type checking compiler construction Chapter #6
Type checking compiler construction Chapter #6Type checking compiler construction Chapter #6
Type checking compiler construction Chapter #6
Daniyal Mughal
 
NLP State of the Art | BERT
NLP State of the Art | BERTNLP State of the Art | BERT
NLP State of the Art | BERT
shaurya uppal
 
Implementation of lexical analyser
Implementation of lexical analyserImplementation of lexical analyser
Implementation of lexical analyser
Archana Gopinath
 
Csr2011 june17 15_15_kaminski
Csr2011 june17 15_15_kaminskiCsr2011 june17 15_15_kaminski
Csr2011 june17 15_15_kaminski
CSR2011
 
Tiancheng Zhao - 2017 - Learning Discourse-level Diversity for Neural Dialog...
Tiancheng Zhao - 2017 -  Learning Discourse-level Diversity for Neural Dialog...Tiancheng Zhao - 2017 -  Learning Discourse-level Diversity for Neural Dialog...
Tiancheng Zhao - 2017 - Learning Discourse-level Diversity for Neural Dialog...
Association for Computational Linguistics
 
Language Interaction and Quality Issues: An Exploratory Study
Language Interaction and Quality Issues: An Exploratory StudyLanguage Interaction and Quality Issues: An Exploratory Study
Language Interaction and Quality Issues: An Exploratory Study
Marco Torchiano
 
Scheme Programming Language
Scheme Programming LanguageScheme Programming Language
Scheme Programming Language
Reham AlBlehid
 
Notes on attention mechanism
Notes on attention mechanismNotes on attention mechanism
Notes on attention mechanism
Khang Pham
 
A simple approach of lexical analyzers
A simple approach of lexical analyzersA simple approach of lexical analyzers
A simple approach of lexical analyzers
Archana Gopinath
 
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language UnderstandingBERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Young Seok Kim
 
Chapter 13 - Recursion
Chapter 13 - RecursionChapter 13 - Recursion
Chapter 13 - Recursion
Adan Hubahib
 
short_talk
short_talkshort_talk
short_talk
卓 陈
 

Viewers also liked (20)

Chemrev4
Chemrev4Chemrev4
Chemrev4
kitcoffeen
 
Mlearn 2011 conference keynote
Mlearn 2011 conference keynoteMlearn 2011 conference keynote
Mlearn 2011 conference keynote
Mike Sharples
 
B2B marketing analytics-report
B2B marketing analytics-reportB2B marketing analytics-report
B2B marketing analytics-report
Progress
 
3 Ways to Simplify your Mobile Apps
3 Ways to Simplify your Mobile Apps3 Ways to Simplify your Mobile Apps
3 Ways to Simplify your Mobile Apps
Progress
 
Chembond
ChembondChembond
Chembond
kitcoffeen
 
Chembond
ChembondChembond
Chembond
kitcoffeen
 
UCS Senior Market Mobile 2010
UCS Senior Market Mobile 2010UCS Senior Market Mobile 2010
UCS Senior Market Mobile 2010
Ian Hay
 
Binayak
BinayakBinayak
Binayak
Vidyadhar Gadgil
 
Amakusa Rick
Amakusa RickAmakusa Rick
Amakusa Rick
guest49db35
 
Effective Pedagogy at Scale – Social Learning and Citizen Inquiry
Effective Pedagogy at Scale –  Social Learning and Citizen InquiryEffective Pedagogy at Scale –  Social Learning and Citizen Inquiry
Effective Pedagogy at Scale – Social Learning and Citizen Inquiry
Mike Sharples
 
Shift Happens
Shift HappensShift Happens
Shift Happens
rspro007
 
Ingl Sfacil
Ingl SfacilIngl Sfacil
Ingl Sfacil
ireperez
 
Econfig Pres
Econfig PresEconfig Pres
Econfig Pres
kitcoffeen
 
Sense-it - mLearn 2015 presentation
Sense-it - mLearn 2015 presentationSense-it - mLearn 2015 presentation
Sense-it - mLearn 2015 presentation
Mike Sharples
 
M learn 2014 slideshare
M learn 2014   slideshareM learn 2014   slideshare
M learn 2014 slideshare
Mike Sharples
 
NEW MEDIA LECTURE - Swinburne University Radio Students
NEW MEDIA LECTURE -  Swinburne University Radio StudentsNEW MEDIA LECTURE -  Swinburne University Radio Students
NEW MEDIA LECTURE - Swinburne University Radio Students
bryceives
 
期末專題
期末專題期末專題
期末專題
GRX
 
Webstock 2011
Webstock 2011Webstock 2011
Webstock 2011
Simon Gianoutsos
 
Why Should You Join The Mobile Revolution?
Why Should You Join The Mobile Revolution?Why Should You Join The Mobile Revolution?
Why Should You Join The Mobile Revolution?
Progress
 
Mlearn 2011 conference keynote
Mlearn 2011 conference keynoteMlearn 2011 conference keynote
Mlearn 2011 conference keynote
Mike Sharples
 
B2B marketing analytics-report
B2B marketing analytics-reportB2B marketing analytics-report
B2B marketing analytics-report
Progress
 
3 Ways to Simplify your Mobile Apps
3 Ways to Simplify your Mobile Apps3 Ways to Simplify your Mobile Apps
3 Ways to Simplify your Mobile Apps
Progress
 
UCS Senior Market Mobile 2010
UCS Senior Market Mobile 2010UCS Senior Market Mobile 2010
UCS Senior Market Mobile 2010
Ian Hay
 
Effective Pedagogy at Scale – Social Learning and Citizen Inquiry
Effective Pedagogy at Scale –  Social Learning and Citizen InquiryEffective Pedagogy at Scale –  Social Learning and Citizen Inquiry
Effective Pedagogy at Scale – Social Learning and Citizen Inquiry
Mike Sharples
 
Shift Happens
Shift HappensShift Happens
Shift Happens
rspro007
 
Ingl Sfacil
Ingl SfacilIngl Sfacil
Ingl Sfacil
ireperez
 
Sense-it - mLearn 2015 presentation
Sense-it - mLearn 2015 presentationSense-it - mLearn 2015 presentation
Sense-it - mLearn 2015 presentation
Mike Sharples
 
M learn 2014 slideshare
M learn 2014   slideshareM learn 2014   slideshare
M learn 2014 slideshare
Mike Sharples
 
NEW MEDIA LECTURE - Swinburne University Radio Students
NEW MEDIA LECTURE -  Swinburne University Radio StudentsNEW MEDIA LECTURE -  Swinburne University Radio Students
NEW MEDIA LECTURE - Swinburne University Radio Students
bryceives
 
期末專題
期末專題期末專題
期末專題
GRX
 
Why Should You Join The Mobile Revolution?
Why Should You Join The Mobile Revolution?Why Should You Join The Mobile Revolution?
Why Should You Join The Mobile Revolution?
Progress
 
Ad

Similar to Reference Scope Identification in Citing Sentences (20)

CUHK intern PPT. Machine Translation Evaluation: Methods and Tools
CUHK intern PPT. Machine Translation Evaluation: Methods and Tools CUHK intern PPT. Machine Translation Evaluation: Methods and Tools
CUHK intern PPT. Machine Translation Evaluation: Methods and Tools
Lifeng (Aaron) Han
 
Named Entity Recognition using Hidden Markov Model (HMM)
Named Entity Recognition using Hidden Markov Model (HMM)Named Entity Recognition using Hidden Markov Model (HMM)
Named Entity Recognition using Hidden Markov Model (HMM)
kevig
 
Named Entity Recognition using Hidden Markov Model (HMM)
Named Entity Recognition using Hidden Markov Model (HMM)Named Entity Recognition using Hidden Markov Model (HMM)
Named Entity Recognition using Hidden Markov Model (HMM)
kevig
 
Named Entity Recognition using Hidden Markov Model (HMM)
Named Entity Recognition using Hidden Markov Model (HMM)Named Entity Recognition using Hidden Markov Model (HMM)
Named Entity Recognition using Hidden Markov Model (HMM)
kevig
 
Semantics2018 Zhang,Petrak,Maynard: Adapted TextRank for Term Extraction: A G...
Semantics2018 Zhang,Petrak,Maynard: Adapted TextRank for Term Extraction: A G...Semantics2018 Zhang,Petrak,Maynard: Adapted TextRank for Term Extraction: A G...
Semantics2018 Zhang,Petrak,Maynard: Adapted TextRank for Term Extraction: A G...
Johann Petrak
 
PL Lecture 01 - preliminaries
PL Lecture 01 - preliminariesPL Lecture 01 - preliminaries
PL Lecture 01 - preliminaries
Schwannden Kuo
 
End-to-end sequence labeling via bi-directional LSTM-CNNs-CRF
End-to-end sequence labeling via bi-directional LSTM-CNNs-CRFEnd-to-end sequence labeling via bi-directional LSTM-CNNs-CRF
End-to-end sequence labeling via bi-directional LSTM-CNNs-CRF
Jayavardhan Reddy Peddamail
 
LiCord: Language Independent Content Word Finder
LiCord: Language Independent Content Word FinderLiCord: Language Independent Content Word Finder
LiCord: Language Independent Content Word Finder
National Inistitute of Informatics (NII), Tokyo, Japann
 
GENETIC APPROACH FOR ARABIC PART OF SPEECH TAGGING
GENETIC APPROACH FOR ARABIC PART OF SPEECH TAGGINGGENETIC APPROACH FOR ARABIC PART OF SPEECH TAGGING
GENETIC APPROACH FOR ARABIC PART OF SPEECH TAGGING
ijnlc
 
Genetic Approach For Arabic Part Of Speech Tagging
Genetic Approach For Arabic Part Of Speech TaggingGenetic Approach For Arabic Part Of Speech Tagging
Genetic Approach For Arabic Part Of Speech Tagging
kevig
 
Genetic Approach For Arabic Part Of Speech Tagging
Genetic Approach For Arabic Part Of Speech TaggingGenetic Approach For Arabic Part Of Speech Tagging
Genetic Approach For Arabic Part Of Speech Tagging
kevig
 
Elena Bolshakova and Natalia Efremova - A Heuristic Strategy for Extracting T...
Elena Bolshakova and Natalia Efremova - A Heuristic Strategy for Extracting T...Elena Bolshakova and Natalia Efremova - A Heuristic Strategy for Extracting T...
Elena Bolshakova and Natalia Efremova - A Heuristic Strategy for Extracting T...
AIST
 
Unit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmm
Unit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmmUnit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmm
Unit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmm
DhruvKushwaha12
 
MT SUMMIT PPT: Language-independent Model for Machine Translation Evaluation ...
MT SUMMIT PPT: Language-independent Model for Machine Translation Evaluation ...MT SUMMIT PPT: Language-independent Model for Machine Translation Evaluation ...
MT SUMMIT PPT: Language-independent Model for Machine Translation Evaluation ...
Lifeng (Aaron) Han
 
ICSE20_Tao_slides.pptx
ICSE20_Tao_slides.pptxICSE20_Tao_slides.pptx
ICSE20_Tao_slides.pptx
DeepaGupta205807
 
neural based_context_representation_learning_for_dialog_act_classification
neural based_context_representation_learning_for_dialog_act_classificationneural based_context_representation_learning_for_dialog_act_classification
neural based_context_representation_learning_for_dialog_act_classification
JEE HYUN PARK
 
Philippe Martin and Jérémy Bénard | Importing, Translating and Exporting Know...
Philippe Martin and Jérémy Bénard | Importing, Translating and Exporting Know...Philippe Martin and Jérémy Bénard | Importing, Translating and Exporting Know...
Philippe Martin and Jérémy Bénard | Importing, Translating and Exporting Know...
semanticsconference
 
LogMap: Large-scale, Logic-based and Interactive Ontology Matching
LogMap: Large-scale, Logic-based and Interactive Ontology MatchingLogMap: Large-scale, Logic-based and Interactive Ontology Matching
LogMap: Large-scale, Logic-based and Interactive Ontology Matching
Ernesto Jimenez Ruiz
 
Pptphrase tagset mapping for french and english treebanks and its application...
Pptphrase tagset mapping for french and english treebanks and its application...Pptphrase tagset mapping for french and english treebanks and its application...
Pptphrase tagset mapping for french and english treebanks and its application...
Lifeng (Aaron) Han
 
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
kevig
 
CUHK intern PPT. Machine Translation Evaluation: Methods and Tools
CUHK intern PPT. Machine Translation Evaluation: Methods and Tools CUHK intern PPT. Machine Translation Evaluation: Methods and Tools
CUHK intern PPT. Machine Translation Evaluation: Methods and Tools
Lifeng (Aaron) Han
 
Named Entity Recognition using Hidden Markov Model (HMM)
Named Entity Recognition using Hidden Markov Model (HMM)Named Entity Recognition using Hidden Markov Model (HMM)
Named Entity Recognition using Hidden Markov Model (HMM)
kevig
 
Named Entity Recognition using Hidden Markov Model (HMM)
Named Entity Recognition using Hidden Markov Model (HMM)Named Entity Recognition using Hidden Markov Model (HMM)
Named Entity Recognition using Hidden Markov Model (HMM)
kevig
 
Named Entity Recognition using Hidden Markov Model (HMM)
Named Entity Recognition using Hidden Markov Model (HMM)Named Entity Recognition using Hidden Markov Model (HMM)
Named Entity Recognition using Hidden Markov Model (HMM)
kevig
 
Semantics2018 Zhang,Petrak,Maynard: Adapted TextRank for Term Extraction: A G...
Semantics2018 Zhang,Petrak,Maynard: Adapted TextRank for Term Extraction: A G...Semantics2018 Zhang,Petrak,Maynard: Adapted TextRank for Term Extraction: A G...
Semantics2018 Zhang,Petrak,Maynard: Adapted TextRank for Term Extraction: A G...
Johann Petrak
 
PL Lecture 01 - preliminaries
PL Lecture 01 - preliminariesPL Lecture 01 - preliminaries
PL Lecture 01 - preliminaries
Schwannden Kuo
 
End-to-end sequence labeling via bi-directional LSTM-CNNs-CRF
End-to-end sequence labeling via bi-directional LSTM-CNNs-CRFEnd-to-end sequence labeling via bi-directional LSTM-CNNs-CRF
End-to-end sequence labeling via bi-directional LSTM-CNNs-CRF
Jayavardhan Reddy Peddamail
 
GENETIC APPROACH FOR ARABIC PART OF SPEECH TAGGING
GENETIC APPROACH FOR ARABIC PART OF SPEECH TAGGINGGENETIC APPROACH FOR ARABIC PART OF SPEECH TAGGING
GENETIC APPROACH FOR ARABIC PART OF SPEECH TAGGING
ijnlc
 
Genetic Approach For Arabic Part Of Speech Tagging
Genetic Approach For Arabic Part Of Speech TaggingGenetic Approach For Arabic Part Of Speech Tagging
Genetic Approach For Arabic Part Of Speech Tagging
kevig
 
Genetic Approach For Arabic Part Of Speech Tagging
Genetic Approach For Arabic Part Of Speech TaggingGenetic Approach For Arabic Part Of Speech Tagging
Genetic Approach For Arabic Part Of Speech Tagging
kevig
 
Elena Bolshakova and Natalia Efremova - A Heuristic Strategy for Extracting T...
Elena Bolshakova and Natalia Efremova - A Heuristic Strategy for Extracting T...Elena Bolshakova and Natalia Efremova - A Heuristic Strategy for Extracting T...
Elena Bolshakova and Natalia Efremova - A Heuristic Strategy for Extracting T...
AIST
 
Unit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmm
Unit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmmUnit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmm
Unit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmm
DhruvKushwaha12
 
MT SUMMIT PPT: Language-independent Model for Machine Translation Evaluation ...
MT SUMMIT PPT: Language-independent Model for Machine Translation Evaluation ...MT SUMMIT PPT: Language-independent Model for Machine Translation Evaluation ...
MT SUMMIT PPT: Language-independent Model for Machine Translation Evaluation ...
Lifeng (Aaron) Han
 
neural based_context_representation_learning_for_dialog_act_classification
neural based_context_representation_learning_for_dialog_act_classificationneural based_context_representation_learning_for_dialog_act_classification
neural based_context_representation_learning_for_dialog_act_classification
JEE HYUN PARK
 
Philippe Martin and Jérémy Bénard | Importing, Translating and Exporting Know...
Philippe Martin and Jérémy Bénard | Importing, Translating and Exporting Know...Philippe Martin and Jérémy Bénard | Importing, Translating and Exporting Know...
Philippe Martin and Jérémy Bénard | Importing, Translating and Exporting Know...
semanticsconference
 
LogMap: Large-scale, Logic-based and Interactive Ontology Matching
LogMap: Large-scale, Logic-based and Interactive Ontology MatchingLogMap: Large-scale, Logic-based and Interactive Ontology Matching
LogMap: Large-scale, Logic-based and Interactive Ontology Matching
Ernesto Jimenez Ruiz
 
Pptphrase tagset mapping for french and english treebanks and its application...
Pptphrase tagset mapping for french and english treebanks and its application...Pptphrase tagset mapping for french and english treebanks and its application...
Pptphrase tagset mapping for french and english treebanks and its application...
Lifeng (Aaron) Han
 
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
BIDIRECTIONAL LONG SHORT-TERM MEMORY (BILSTM)WITH CONDITIONAL RANDOM FIELDS (...
kevig
 
Ad

More from Akihiro Kameda (7)

iPRES Day 3 instruction
iPRES Day 3 instructioniPRES Day 3 instruction
iPRES Day 3 instruction
Akihiro Kameda
 
Evaluating Visual Representations for Topic Understanding and Their Effect...
Evaluating Visual Representations for Topic Understanding and Their Effect...Evaluating Visual Representations for Topic Understanding and Their Effect...
Evaluating Visual Representations for Topic Understanding and Their Effect...
Akihiro Kameda
 
ISWC2016 1-slide-survey
ISWC2016 1-slide-surveyISWC2016 1-slide-survey
ISWC2016 1-slide-survey
Akihiro Kameda
 
いかにしてデータを手に入れるか
いかにしてデータを手に入れるかいかにしてデータを手に入れるか
いかにしてデータを手に入れるか
Akihiro Kameda
 
"Joint Extraction of Events and Entities within a Document Context"の解説
"Joint Extraction of Events and Entities within a Document Context"の解説"Joint Extraction of Events and Entities within a Document Context"の解説
"Joint Extraction of Events and Entities within a Document Context"の解説
Akihiro Kameda
 
Iodd2015osaka kameda-slideshare
Iodd2015osaka kameda-slideshareIodd2015osaka kameda-slideshare
Iodd2015osaka kameda-slideshare
Akihiro Kameda
 
PATTY: A Taxonomy of Relational Patterns with Semantic Types
PATTY: A Taxonomy of Relational Patterns with Semantic TypesPATTY: A Taxonomy of Relational Patterns with Semantic Types
PATTY: A Taxonomy of Relational Patterns with Semantic Types
Akihiro Kameda
 
iPRES Day 3 instruction
iPRES Day 3 instructioniPRES Day 3 instruction
iPRES Day 3 instruction
Akihiro Kameda
 
Evaluating Visual Representations for Topic Understanding and Their Effect...
Evaluating Visual Representations for Topic Understanding and Their Effect...Evaluating Visual Representations for Topic Understanding and Their Effect...
Evaluating Visual Representations for Topic Understanding and Their Effect...
Akihiro Kameda
 
ISWC2016 1-slide-survey
ISWC2016 1-slide-surveyISWC2016 1-slide-survey
ISWC2016 1-slide-survey
Akihiro Kameda
 
いかにしてデータを手に入れるか
いかにしてデータを手に入れるかいかにしてデータを手に入れるか
いかにしてデータを手に入れるか
Akihiro Kameda
 
"Joint Extraction of Events and Entities within a Document Context"の解説
"Joint Extraction of Events and Entities within a Document Context"の解説"Joint Extraction of Events and Entities within a Document Context"の解説
"Joint Extraction of Events and Entities within a Document Context"の解説
Akihiro Kameda
 
Iodd2015osaka kameda-slideshare
Iodd2015osaka kameda-slideshareIodd2015osaka kameda-slideshare
Iodd2015osaka kameda-slideshare
Akihiro Kameda
 
PATTY: A Taxonomy of Relational Patterns with Semantic Types
PATTY: A Taxonomy of Relational Patterns with Semantic TypesPATTY: A Taxonomy of Relational Patterns with Semantic Types
PATTY: A Taxonomy of Relational Patterns with Semantic Types
Akihiro Kameda
 

Recently uploaded (20)

AI and Data Privacy in 2025: Global Trends
AI and Data Privacy in 2025: Global TrendsAI and Data Privacy in 2025: Global Trends
AI and Data Privacy in 2025: Global Trends
InData Labs
 
How Can I use the AI Hype in my Business Context?
How Can I use the AI Hype in my Business Context?How Can I use the AI Hype in my Business Context?
How Can I use the AI Hype in my Business Context?
Daniel Lehner
 
SAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdf
SAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdfSAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdf
SAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdf
Precisely
 
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
SOFTTECHHUB
 
Technology Trends in 2025: AI and Big Data Analytics
Technology Trends in 2025: AI and Big Data AnalyticsTechnology Trends in 2025: AI and Big Data Analytics
Technology Trends in 2025: AI and Big Data Analytics
InData Labs
 
Andrew Marnell: Transforming Business Strategy Through Data-Driven Insights
Andrew Marnell: Transforming Business Strategy Through Data-Driven InsightsAndrew Marnell: Transforming Business Strategy Through Data-Driven Insights
Andrew Marnell: Transforming Business Strategy Through Data-Driven Insights
Andrew Marnell
 
2025-05-Q4-2024-Investor-Presentation.pptx
2025-05-Q4-2024-Investor-Presentation.pptx2025-05-Q4-2024-Investor-Presentation.pptx
2025-05-Q4-2024-Investor-Presentation.pptx
Samuele Fogagnolo
 
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In France
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In FranceManifest Pre-Seed Update | A Humanoid OEM Deeptech In France
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In France
chb3
 
Role of Data Annotation Services in AI-Powered Manufacturing
Role of Data Annotation Services in AI-Powered ManufacturingRole of Data Annotation Services in AI-Powered Manufacturing
Role of Data Annotation Services in AI-Powered Manufacturing
Andrew Leo
 
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdfThe Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
Abi john
 
ThousandEyes Partner Innovation Updates for May 2025
ThousandEyes Partner Innovation Updates for May 2025ThousandEyes Partner Innovation Updates for May 2025
ThousandEyes Partner Innovation Updates for May 2025
ThousandEyes
 
Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...
Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...
Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...
Noah Loul
 
tecnologias de las primeras civilizaciones.pdf
tecnologias de las primeras civilizaciones.pdftecnologias de las primeras civilizaciones.pdf
tecnologias de las primeras civilizaciones.pdf
fjgm517
 
Mobile App Development Company in Saudi Arabia
Mobile App Development Company in Saudi ArabiaMobile App Development Company in Saudi Arabia
Mobile App Development Company in Saudi Arabia
Steve Jonas
 
Cybersecurity Identity and Access Solutions using Azure AD
Cybersecurity Identity and Access Solutions using Azure ADCybersecurity Identity and Access Solutions using Azure AD
Cybersecurity Identity and Access Solutions using Azure AD
VICTOR MAESTRE RAMIREZ
 
Splunk Security Update | Public Sector Summit Germany 2025
Splunk Security Update | Public Sector Summit Germany 2025Splunk Security Update | Public Sector Summit Germany 2025
Splunk Security Update | Public Sector Summit Germany 2025
Splunk
 
TrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business ConsultingTrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business Consulting
Trs Labs
 
Procurement Insights Cost To Value Guide.pptx
Procurement Insights Cost To Value Guide.pptxProcurement Insights Cost To Value Guide.pptx
Procurement Insights Cost To Value Guide.pptx
Jon Hansen
 
Electronic_Mail_Attacks-1-35.pdf by xploit
Electronic_Mail_Attacks-1-35.pdf by xploitElectronic_Mail_Attacks-1-35.pdf by xploit
Electronic_Mail_Attacks-1-35.pdf by xploit
niftliyevhuseyn
 
Linux Support for SMARC: How Toradex Empowers Embedded Developers
Linux Support for SMARC: How Toradex Empowers Embedded DevelopersLinux Support for SMARC: How Toradex Empowers Embedded Developers
Linux Support for SMARC: How Toradex Empowers Embedded Developers
Toradex
 
AI and Data Privacy in 2025: Global Trends
AI and Data Privacy in 2025: Global TrendsAI and Data Privacy in 2025: Global Trends
AI and Data Privacy in 2025: Global Trends
InData Labs
 
How Can I use the AI Hype in my Business Context?
How Can I use the AI Hype in my Business Context?How Can I use the AI Hype in my Business Context?
How Can I use the AI Hype in my Business Context?
Daniel Lehner
 
SAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdf
SAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdfSAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdf
SAP Modernization: Maximizing the Value of Your SAP S/4HANA Migration.pdf
Precisely
 
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
AI EngineHost Review: Revolutionary USA Datacenter-Based Hosting with NVIDIA ...
SOFTTECHHUB
 
Technology Trends in 2025: AI and Big Data Analytics
Technology Trends in 2025: AI and Big Data AnalyticsTechnology Trends in 2025: AI and Big Data Analytics
Technology Trends in 2025: AI and Big Data Analytics
InData Labs
 
Andrew Marnell: Transforming Business Strategy Through Data-Driven Insights
Andrew Marnell: Transforming Business Strategy Through Data-Driven InsightsAndrew Marnell: Transforming Business Strategy Through Data-Driven Insights
Andrew Marnell: Transforming Business Strategy Through Data-Driven Insights
Andrew Marnell
 
2025-05-Q4-2024-Investor-Presentation.pptx
2025-05-Q4-2024-Investor-Presentation.pptx2025-05-Q4-2024-Investor-Presentation.pptx
2025-05-Q4-2024-Investor-Presentation.pptx
Samuele Fogagnolo
 
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In France
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In FranceManifest Pre-Seed Update | A Humanoid OEM Deeptech In France
Manifest Pre-Seed Update | A Humanoid OEM Deeptech In France
chb3
 
Role of Data Annotation Services in AI-Powered Manufacturing
Role of Data Annotation Services in AI-Powered ManufacturingRole of Data Annotation Services in AI-Powered Manufacturing
Role of Data Annotation Services in AI-Powered Manufacturing
Andrew Leo
 
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdfThe Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
The Evolution of Meme Coins A New Era for Digital Currency ppt.pdf
Abi john
 
ThousandEyes Partner Innovation Updates for May 2025
ThousandEyes Partner Innovation Updates for May 2025ThousandEyes Partner Innovation Updates for May 2025
ThousandEyes Partner Innovation Updates for May 2025
ThousandEyes
 
Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...
Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...
Noah Loul Shares 5 Steps to Implement AI Agents for Maximum Business Efficien...
Noah Loul
 
tecnologias de las primeras civilizaciones.pdf
tecnologias de las primeras civilizaciones.pdftecnologias de las primeras civilizaciones.pdf
tecnologias de las primeras civilizaciones.pdf
fjgm517
 
Mobile App Development Company in Saudi Arabia
Mobile App Development Company in Saudi ArabiaMobile App Development Company in Saudi Arabia
Mobile App Development Company in Saudi Arabia
Steve Jonas
 
Cybersecurity Identity and Access Solutions using Azure AD
Cybersecurity Identity and Access Solutions using Azure ADCybersecurity Identity and Access Solutions using Azure AD
Cybersecurity Identity and Access Solutions using Azure AD
VICTOR MAESTRE RAMIREZ
 
Splunk Security Update | Public Sector Summit Germany 2025
Splunk Security Update | Public Sector Summit Germany 2025Splunk Security Update | Public Sector Summit Germany 2025
Splunk Security Update | Public Sector Summit Germany 2025
Splunk
 
TrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business ConsultingTrsLabs - Fintech Product & Business Consulting
TrsLabs - Fintech Product & Business Consulting
Trs Labs
 
Procurement Insights Cost To Value Guide.pptx
Procurement Insights Cost To Value Guide.pptxProcurement Insights Cost To Value Guide.pptx
Procurement Insights Cost To Value Guide.pptx
Jon Hansen
 
Electronic_Mail_Attacks-1-35.pdf by xploit
Electronic_Mail_Attacks-1-35.pdf by xploitElectronic_Mail_Attacks-1-35.pdf by xploit
Electronic_Mail_Attacks-1-35.pdf by xploit
niftliyevhuseyn
 
Linux Support for SMARC: How Toradex Empowers Embedded Developers
Linux Support for SMARC: How Toradex Empowers Embedded DevelopersLinux Support for SMARC: How Toradex Empowers Embedded Developers
Linux Support for SMARC: How Toradex Empowers Embedded Developers
Toradex
 

Reference Scope Identification in Citing Sentences

  • 1. Reference Scope Identification in Citing Sentences          Authors: Amjad Abu-Jbara, Dragomir Radev (University of Michigan)             Conference: NAACL 2012             Expositor: Akihiro Kameda (Aizawa Lab. The University of Tokyo)
  • 2. Abstract ● Problem: ● Multiple citation in one sentence ● There are many POS taggers developed using different techniques for many major languages such as transformation-based error-driven learning (Brill, 1995), decision trees (Black et al., 1992), Markov model (Cutting et al., 1992), maximum entropy methods (Ratnaparkhi, 1996) etc for English. ● Approach:Prepossessing      and 2+1+2*3+1=10 methods
  • 4. Reference Preprocessing (tagging, grouping, non-syntactical element removal) ● These constraints can be lexicalized (REF.1; REF.2), unlexicalized (REF.3; TREF.4) or automatically learned (REF.5; REF.6). ● These constraints can be lexicalized (GREF.1), unlexicalized (GTREF.2) or automatically learned (GREF.3). ● (GTREF.1) apply fuzzy techniques for integrating source syntax into hierarchical phrase-based systems (REF.2).
  • 5. Approach 1(SVM,LR) ● Word classification ● with SVM, a logistic regression classifier ● Feature: Distance, Position(Before/After), in Segment(,.; and, but, for, nor, or, so, yet), POS tag, Dependency Distance, Dependency Relations, Common Ancestor Node, Syntactic Distance ● Problem Example: ● There are many POS taggers developed using different techniques for many major languages such as transformation- based error-driven learning (Brill, 1995), decision trees (Black et al., 1992), Markov model (Cutting et al., 1992), maximum entropy methods (Ratnaparkhi, 1996) etc for English.
  • 6. Approach 2(CRF) ● Sequence Labeling with CRF ● feature is same as Approach 1
  • 7. Approach 3-S1-* (CRF/segment) ● segmentation (1) ● punctuation marks ● coordination conjunctions – and, but, for, nor, or, so, yet ● a set of special expressions – "for example", "for instance", "including", "includes", "such as", "like", etc. ● [Rerankers have been successfully applied to numerous NLP tasks such as] [parse selection (GTREF)], [parse reranking (GREF)], [question-answering (REF)].
  • 8. Approach 3-S2-* (CRF/segment) ● segmentation (2) ● chunking tool – noun groups – verb groups – preposition groups – adjective groups – adverb groups – other parts form segment by themselves ● [To] [score] [the output] [of] [the coreference models], [we] [employ] [the commonly-used MUC scoring program (REF)] [and] [the recently-developed CEAF scoring program (TREF)].
  • 9. Approach 3-*-R1,2,3 (CRF/segment) ● R1: majority label of the words it contains ● R2: inside if any word is inside ● R3: outside if any word is outside ● [I O O O O] [I I I] [O O]
  • 10. AR2011 the link grammar parser (Sleator and Temperley,1991)
  • 12. Data ● ACL Anthology Network Corpus ● 3300 sentences, citations in each ≧ 2 Annotation agreement ● 500 of 3300, ● Preprocessing is perfect ● Kappa coefficient of scope is P ( A)−P ( E ) K= =2P ( A)−1=0.61 1−P ( E )
  • 13. Tools ● Edinburgh Language Technology Text Tokenization Toolkit (LT-TTT) ● text tokenization, part-of-speech tagging, chunking, and noun phrase head identification. ● Stanford parser ● syntactic and dependency parsing ● LibSVM with linear kernel ● Weka ● logistic regression classification
  • 14. Tools ● Machine Learning for Language Toolkit (MALLET) ● CRF Validation ● 10-fold cross validation
  • 15. Experiment (Preprocessing) These constraints can be lexicalized (REF.1; REF.2), ll r ec a ● unlexicalized (REF.3; TREF.4) or and 93 .1%learned (REF.5; REF.6). 3% preci s ion automatically ng: 9 8 . Taggi ● These constraints can be lexicalized (GREF.1), unlexicalized (GTREF.2) or Perfect! automatically learned (GREF.3). Grouping: (GTREF.1) apply fuzzy techniques for integrating source a l: ● syntax into hierarchicalence removsystems (REF.2). Non-syn tactic refer phrase-based ecall 9 0. 1% r cision and 9 0.08% pre
  • 16. Experiment (Main) ● CRF ● Chunking ● Majority
  • 17. Feature Analysis ● Feature: Distance, Position(Before/After), Same segment(,.; and, but, for, nor, or, so, yet), POS tag, Dependency Distance, Dependency Relations, Common Ancestor Node, Syntactic Distance
  • 18. Summary ● Identified reference scope in a sentence which has multiple citation ● CRF ● Chunking ● Majority

Editor's Notes

  • #3: 難波先生や自身らがある引用が文をまたがって説明されている場合のスコープの同定を扱っていることが関連研究に示されている。 応用は要約など。
  • #13: Annotator 2人なのでたまたま被る確率P(E)は1/2 P(A)は8割ちょい