A GRAPH-BASED CROSS-LINGUAL
      PROJECTION APPROACH FOR
WEAKLY SUPERVISED RELATION EXTRACTION
    The 50th Annual Meeting of the Association for Computational Linguistics
                                  (ACL 2012)
                             July 11th, 2012, Jeju

       Seokhwan Kim (Institute for Infocomm Research)
                 Gary Geunbae Lee (POSTECH)
Contents
• Introduction
• Methods
    Cross-lingual Annotation Projection for Relation Extraction
    Graph-based Projection Approach
• Evaluation
• Conclusions




Problem Definition
• Relation Extraction
    To identify semantic relations between a pair of entities

                         Birthplace

       Barack Obama was born in Honolulu           ,   Hawaii    .
              PER                          LOC           LOC



    Considered as a classification problem
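
    A minimal sketch of the classification view, assuming every pair of entity mentions in a sentence becomes one instance to be labeled with a relation type or "None". The helper candidate_pairs and the gold dictionary are hypothetical names introduced only for illustration.

```python
from itertools import combinations


def candidate_pairs(entities):
    """Return every unordered pair of entity mentions as one candidate instance."""
    return list(combinations(entities, 2))


# Entity mentions (text, type) from the example sentence on this slide.
entities = [("Barack Obama", "PER"), ("Honolulu", "LOC"), ("Hawaii", "LOC")]

# Hypothetical gold labels: only the pair marked on the slide is Birthplace.
gold = {("Barack Obama", "Honolulu"): "Birthplace"}

# Each instance is (first entity, second entity, relation label).
instances = [
    (a[0], b[0], gold.get((a[0], b[0]), "None"))
    for a, b in candidate_pairs(entities)
]

for inst in instances:
    print(inst)
```

    A classifier would then be trained to predict the third field from features of the pair and its context.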




Related Work (1)
• Supervised Learning
    Many supervised machine learning approaches have been
     successfully applied
      • (Kambhatla, 2004; Zhou et al., 2005; Zelenko et al., 2003; Culotta and
        Sorensen, 2004; Bunescu and Mooney, 2005; Zhang et al., 2006)

• Semi-supervised Learning
    To obtain the annotations of unlabeled instances from the seed
     information
      • (Brin, 1999; Riloff and Jones, 1999; Agichtein and Gravano, 2000;
        Sudo et al., 2003; Yangarber, 2003; Stevenson and Greenwood, 2006;
        Zhang, 2004; Chen et al., 2006; Zhou et al., 2009)



Motivation
• Resources for Relation Extraction
    Supervised/Semi-supervised Approaches
      • Labeled corpora for supervised learning
      • Seed instances for semi-supervised learning
      • Available for only a few languages
           ACE 2003 Multilingual Training Dataset
              • English (252 articles)
              • Chinese (221 articles)
              • Arabic (206 articles)
      • No resources for other languages
           Korean




Related Work (2)
• Self-supervised Learning
    To obtain the annotated dataset without any human effort
    Using the information obtained from external resources
       • Heuristic-based Method (Banko et al., 2007; Banko et al., 2008)
       • Wikipedia-based Methods (Wu and Weld, 2010)

• Cross-lingual Annotation Projection
    To leverage parallel corpora to project the relation annotations from
     the resource-rich source language onto the resource-poor target
     language (Kim et al., 2010; Kim et al., 2011)




Overall Architecture
• Input: a parallel corpus with sentences in the source language Ls and the target language Lt
• Annotation (source side, Ls)
    Sentences in Ls → Preprocessing (POS Tagging, Parsing) → NER → Relation Extraction → Annotated Sentences in Ls
• Projection (target side, Lt)
    Sentences in Lt → Preprocessing (POS Tagging, Parsing) → Word Alignment → Projection → Annotated Sentences in Lt
Direct Projection (Kim et al., 2010)
• Annotation
    fE(<Barack Obama, Honolulu>) = 1
    Barack Obama was born in Honolulu, Hawaii.
• Projection
    버락 오바마 는 하와이 의 호놀룰루 에서 태어났다
    (beo-rak-o-ba-ma) (neun) (ha-wa-i) (ui) (ho-nol-rul-ru) (e-seo) (tae-eo-nat-da)
    fK(<버락 오바마, 호놀룰루>) = 1
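
    The projection step on this slide can be sketched as follows. This is an illustrative simplification, not the authors' implementation: the function project_relation and the entity-level alignment dictionary are assumptions, standing in for alignments that would come from a word aligner.

```python
def project_relation(src_pair, alignment):
    """Map a source entity pair to the target side through the alignment.

    Returns the aligned target pair, or None when either entity is unaligned.
    """
    a, b = src_pair
    if a in alignment and b in alignment:
        return (alignment[a], alignment[b])
    return None


# Source-side annotation from the slide: fE(<Barack Obama, Honolulu>) = 1
f_E = {("Barack Obama", "Honolulu"): 1}

# Hypothetical entity-level alignment for the example sentence pair.
alignment = {
    "Barack Obama": "버락 오바마",
    "Honolulu": "호놀룰루",
    "Hawaii": "하와이",
}

# Copy each source label onto the aligned target pair.
f_K = {}
for pair, label in f_E.items():
    target_pair = project_relation(pair, alignment)
    if target_pair is not None:
        f_K[target_pair] = label

print(f_K)
```

    Because the label is copied in one pass, any error in the alignment or in the source annotation carries over directly to the target side.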
Limitations of Direct Projection
• The direct projection approach is vulnerable to erroneous
  inputs generated by the preprocessors
• Main causes of this limitation
    It considers only the alignment between entity candidates, without any
     contextual information
    It is performed as a single-pass process




Graph-based Learning
• Semi-supervised learning algorithm
• Defining a graph
    The nodes represent labeled and unlabeled examples in a dataset
    The edges reflect the similarity of examples
• Learning a labeling function in an iterative manner
    It should be close to the given labels on the similar labeled nodes
    It should be smooth on the whole graph
• Related Work
    Graph-based Learning for Relation Extraction (Chen et al., 2006)
    Bilingual projection of POS tagging (Das and Petrov, 2011)


Graph Construction
• Graph Nodes
   Instance Nodes
      • Defined for all pairs of entity candidates in both languages
      • Each instance node has a soft label vector Y = [y+ y-]
   Context Nodes
      • For identifying the relation descriptors of the positive instances
      • Defined for each trigram located between the two entities of a
        semantically related pair
      • Each context node has a soft label vector Y = [y+ y-]


   Example: <ARG1> was born in <ARG2>
      • <ARG1> was born
      • was born in
      • born in <ARG2>
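
   A minimal sketch of how the trigram context nodes in the example could be enumerated. The function name context_trigrams is hypothetical, and treating the <ARG1> and <ARG2> placeholders as ordinary tokens is an assumption read off the example above.

```python
def context_trigrams(between):
    """List the trigrams over <ARG1>, the tokens between the entities, and <ARG2>."""
    seq = ["<ARG1>"] + list(between) + ["<ARG2>"]
    return [" ".join(seq[k:k + 3]) for k in range(len(seq) - 2)]


trigrams = context_trigrams(["was", "born", "in"])
for t in trigrams:
    print(t)
```

   For the tokens "was born in" this yields the same three context nodes listed in the example.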
Graph Construction
• Edge Weights
   Between an instance node and a context node in the same language
       $w(v_{i,j}, u^k) = \begin{cases} 1 & \text{if } v_{i,j} \text{ has } u^k \text{ as a contextual subsequence,} \\ 0 & \text{otherwise.} \end{cases}$
   Between context nodes in a language
       $w(u^k, u^l) = J(u^k, u^l) = \frac{|u^k \cap u^l|}{|u^k \cup u^l|}.$
   Between context nodes in source and target languages
       $w(u_s^k, u_t^l) = \frac{\mathrm{count}(u_s^k, u_t^l)}{\sum_{u_t^m} \mathrm{count}(u_s^k, u_t^m)}.$
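
   The two context-to-context weights can be sketched as below. Two assumptions are made for illustration: each context node is treated as the set of its tokens when computing the Jaccard coefficient, and count is taken to be the number of times a source trigram and a target trigram are observed aligned. The function names and the sample aligned pairs are hypothetical.

```python
from collections import Counter


def jaccard(u_k, u_l):
    """Jaccard coefficient between two context nodes, viewed as token sets."""
    a, b = set(u_k.split()), set(u_l.split())
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cross_lingual_weights(aligned_pairs):
    """Normalize co-occurrence counts of (source, target) context pairs
    by the total count of the source context."""
    counts = Counter(aligned_pairs)
    totals = Counter()
    for (u_s, _), c in counts.items():
        totals[u_s] += c
    return {(u_s, u_t): c / totals[u_s] for (u_s, u_t), c in counts.items()}


# Same-language weight between two trigrams from the earlier example.
w_mono = jaccard("<ARG1> was born", "was born in")

# Hypothetical aligned context pairs observed in a parallel corpus.
pairs = [
    ("was born in", "<ARG2> 에서 태어났다"),
    ("was born in", "<ARG2> 에서 태어났다"),
    ("was born in", "의 <ARG2> 에서"),
]
w_cross = cross_lingual_weights(pairs)

print(w_mono)
print(w_cross)
```

   With this normalization the outgoing cross-lingual weights of each source context sum to 1.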
Graph Construction
• Example
    [Figure: example graph]
Label Propagation
• Algorithm
    Input
      • A transition matrix T
      • An initial label matrix Y0
    Output
      • The updated label matrix Yt
    Steps
      • Initialize T
      • Normalize T
      • Initialize Y
      • Update Y
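
    The four steps can be sketched in plain Python as follows. This is a basic iterative label propagation under stated assumptions, not the Junto toolkit used in the talk: T is obtained by row-normalizing the edge weights, Y is updated as Y ← T·Y for a fixed number of iterations, and labeled nodes are clamped back to their initial labels after each update. The toy graph and all names are hypothetical.

```python
def normalize_rows(W):
    """Row-normalize a weight matrix so each non-zero row sums to 1."""
    T = []
    for row in W:
        s = sum(row)
        T.append([w / s if s else 0.0 for w in row])
    return T


def propagate(T, Y0, labeled, iterations=50):
    """Repeat Y <- T Y, resetting the labeled rows to their initial labels."""
    Y = [row[:] for row in Y0]
    n, m = len(Y), len(Y[0])
    for _ in range(iterations):
        Y_next = [
            [sum(T[i][k] * Y[k][c] for k in range(n)) for c in range(m)]
            for i in range(n)
        ]
        for i in labeled:  # clamping step (an assumption of this sketch)
            Y_next[i] = Y0[i][:]
        Y = Y_next
    return Y


# Toy chain graph: node 0 is labeled positive, node 2 negative, node 1 unlabeled.
W = [[0.0, 3.0, 0.0],
     [3.0, 0.0, 1.0],
     [0.0, 1.0, 0.0]]
Y0 = [[1.0, 0.0],   # [y+, y-]
      [0.0, 0.0],
      [0.0, 1.0]]

Y = propagate(normalize_rows(W), Y0, labeled={0, 2})
print(Y[1])
```

    In this toy graph the unlabeled node settles at [0.75, 0.25], leaning toward the positive neighbor it is more strongly connected to.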
Label Propagation
• Executed in three phases
    [Figure: 1st phase, 2nd phase, 3rd phase]
Implementation
• Dataset
    English-Korean parallel corpus
       • 266,982 bi-sentence pairs in English and Korean
       • Aligned by GIZA++
• Annotation
    ReVerb (Fader et al., 2011)
       • English Open IE system
• Label Propagation
    Junto Label Propagation Toolkit
• Learning
    Tree kernel-based SVM classifier
       • Shortest path dependency kernel (Bunescu and Mooney, 2005)
       • SVM-Light (Joachims, 1998)

Evaluation
• Dataset
    Manually annotated Korean dataset
      • Collected from the Web, following Bunescu and Mooney (2007)
      • 500 sentences with manual annotations for four relation types
             Acquisition
             Birthplace
             Inventor Of
             Won Prize

• Evaluation Metrics
    Precision/Recall/F-measure




Experimental Results
• Direct Projection vs. Graph-based Projection


                  Direct Projection       Graph-based Projection
   Type           P       R       F       P       R       F
   Acquisition   51.6    87.7    64.9    55.3    91.2    68.9
   Birthplace    69.8    84.5    76.4    73.8    87.3    80.0
   Inventor Of   62.4    85.3    72.1    66.3    89.7    76.3
   Won Prize     73.3    80.5    76.7    76.4    82.9    79.5
   Total         63.9    84.2    72.7    67.7    87.4    76.3
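
   As a quick check on how the F column relates to P and R, the sketch below assumes F is the balanced F-measure (the harmonic mean of precision and recall); the slides do not state the weighting, so this is an assumption. Applied to the rounded Total values it gives the reported F-scores; other rows may differ in the last digit because P and R are themselves rounded.

```python
def f_measure(precision, recall):
    """Balanced F-measure: harmonic mean of precision and recall."""
    total = precision + recall
    return 2 * precision * recall / total if total else 0.0


direct_total = round(f_measure(63.9, 84.2), 1)
graph_total = round(f_measure(67.7, 87.4), 1)

print(direct_total)
print(graph_total)
```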




Experimental Results
• Comparisons to other self-supervised approaches
    Heuristic-based Approach (Banko et al., 2007; Banko et al., 2008)
       • Korean Treebank and Syntactic Heuristics
    Wikipedia-based Approach (Wu and Weld, 2010)
       • Korean Wikipedia articles and Infoboxes


       Approach             P        R        F
       Heuristic-based     92.31    17.27    29.09
       Wikipedia-based     66.67    66.91    66.79
       Projection-based    67.69    87.41    76.30

Conclusion
• Summary
    A graph-based projection approach for relation extraction
       • Label propagation algorithm
       • On a graph that represents the instance and context features of both
         the source and target languages
    Experimental results show that our approach improves relation
     extraction performance over the other approaches compared
• Future work
    To reduce the high computational complexity of the approach
    To explore richer graph structures to further improve
     extraction performance


Q&A
Priyanka Aash
 
A Constitutional Quagmire - Ethical Minefields of AI, Cyber, and Privacy.pdf
A Constitutional Quagmire - Ethical Minefields of AI, Cyber, and Privacy.pdf
Priyanka Aash
 
Mastering AI Workflows with FME by Mark Döring
Mastering AI Workflows with FME by Mark Döring
Safe Software
 
The Future of Product Management in AI ERA.pdf
The Future of Product Management in AI ERA.pdf
Alyona Owens
 
From Manual to Auto Searching- FME in the Driver's Seat
From Manual to Auto Searching- FME in the Driver's Seat
Safe Software
 
Curietech AI in action - Accelerate MuleSoft development
Curietech AI in action - Accelerate MuleSoft development
shyamraj55
 
Using the SQLExecutor for Data Quality Management: aka One man's love for the...
Using the SQLExecutor for Data Quality Management: aka One man's love for the...
Safe Software
 
ReSTIR [DI]: Spatiotemporal reservoir resampling for real-time ray tracing ...
ReSTIR [DI]: Spatiotemporal reservoir resampling for real-time ray tracing ...
revolcs10
 

A Graph-based Cross-lingual Projection Approach for Weakly Supervised Relation Extraction

  • 1. A GRAPH-BASED CROSS-LINGUAL PROJECTION APPROACH FOR WEAKLY SUPERVISED RELATION EXTRACTION
      The 50th Annual Meeting of the Association for Computational Linguistics (ACL 2012)
      July 11th, 2012, Jeju
      Seokhwan Kim (Institute for Infocomm Research)
      Gary Geunbae Lee (POSTECH)
  • 2. Contents
      • Introduction
      • Methods
          - Cross-lingual Annotation Projection for Relation Extraction
          - Graph-based Projection Approach
      • Evaluation
      • Conclusions
  • 4. Problem Definition
      • Relation Extraction
          - To identify semantic relations between a pair of entities
              Example: "Barack Obama was born in Honolulu, Hawaii."
              Entities: Barack Obama (PER), Honolulu (LOC), Hawaii (LOC)
              Relation: Birthplace
          - Considered as a classification problem
  • 5. Related Work (1)
      • Supervised Learning
          - Many supervised machine learning approaches have been successfully applied
              • (Kambhatla, 2004; Zhou et al., 2005; Zelenko et al., 2003; Culotta and Sorensen, 2004; Bunescu and Mooney, 2005; Zhang et al., 2006)
      • Semi-supervised Learning
          - To obtain the annotations of unlabeled instances from the seed information
              • (Brin, 1999; Riloff and Jones, 1999; Agichtein and Gravano, 2000; Sudo et al., 2003; Yangarber, 2003; Stevenson and Greenwood, 2006; Zhang, 2004; Chen et al., 2006; Zhou et al., 2009)
  • 6. Motivation
      • Resources for Relation Extraction
          - Supervised/Semi-supervised Approaches
              • Labeled corpora for supervised learning
              • Seed instances for semi-supervised learning
              • Available for only a few languages
          - ACE 2003 Multilingual Training Dataset
              • English (252 articles)
              • Chinese (221 articles)
              • Arabic (206 articles)
      • No resources for other languages
          - Korean
  • 7. Related Work (2)
      • Self-supervised Learning
          - To obtain the annotated dataset without any human effort
          - Using the information obtained from external resources
              • Heuristic-based Method (Banko et al., 2007; Banko et al., 2008)
              • Wikipedia-based Methods (Wu and Weld, 2010)
      • Cross-lingual Annotation Projection
          - To leverage parallel corpora to project the relation annotations from the resource-rich source language onto the resource-poor target language (Kim et al., 2010; Kim et al., 2011)
  • 8. Contents
      • Introduction
      • Methods
          - Cross-lingual Annotation Projection for Relation Extraction
          - Graph-based Projection Approach
      • Implementation
      • Evaluation
      • Conclusions
  • 9. Overall Architecture
      [Figure: pipeline over a parallel corpus, in two stages]
      • Parallel Corpus: Sentences in Ls, Sentences in Lt
      • Preprocessing (POS Tagging, Parsing) on both sides
      • Annotation: NER, Relation Extraction → Annotated Sentences in Ls
      • Projection: Word Alignment, Projection → Annotated Sentences in Lt
  • 10. Direct Projection (Kim et al., 2010)
      • Annotation
          fE (<Barack Obama, Honolulu>) = 1
          Barack Obama was born in Honolulu , Hawaii .
      • Projection
          버락 오바마 는 하와이 의 호놀룰루 에서 태어났다
          (beo-rak-o-ba-ma) (neun) (ha-wa-i) (ui) (ho-nol-rul-ru) (e-seo) (tae-eo-nat-da)
          fK (<버락 오바마, 호놀룰루>) = 1
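
The direct projection step on this slide can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names, the inclusive token-span representation, and the hand-written alignment links are all assumptions made for the example (in the deck, alignments come from GIZA++).

```python
from typing import Optional, Set, Tuple

Span = Tuple[int, int]  # inclusive token indices (hypothetical representation)


def project_span(span: Span, alignment: Set[Tuple[int, int]]) -> Optional[Span]:
    """Map a source token span to the target span covered by its aligned tokens."""
    targets = sorted(t for s, t in alignment if span[0] <= s <= span[1])
    if not targets:
        return None  # no aligned target token, so the span cannot be projected
    return (targets[0], targets[-1])


def project_instance(arg1: Span, arg2: Span, label: int,
                     alignment: Set[Tuple[int, int]]):
    """Copy the source-side relation label onto the aligned target entity pair."""
    t1 = project_span(arg1, alignment)
    t2 = project_span(arg2, alignment)
    if t1 is None or t2 is None:
        return None
    return (t1, t2, label)


english = ["Barack", "Obama", "was", "born", "in", "Honolulu", ",", "Hawaii", "."]
korean = ["버락", "오바마", "는", "하와이", "의", "호놀룰루", "에서", "태어났다"]
# Hand-written (source index, target index) links, assumed for illustration only.
alignment = {(0, 0), (1, 1), (3, 7), (4, 6), (5, 5), (7, 3)}

# fE(<Barack Obama, Honolulu>) = 1 is projected onto the Korean side.
projected = project_instance((0, 1), (5, 5), 1, alignment)
print(projected)
```

Because the label is copied in a single pass along the alignment links, any error in the alignment or in the source annotation carries over directly to the target side.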
  • 11. Limitations of Direct Projection
      • The direct projection approach is still vulnerable to erroneous inputs generated by the preprocessors
      • Main causes of this limitation
          - It considers the alignment between entity candidates only, without any contextual information
          - It is performed as a single-pass process
  • 12. Graph-based Learning
      • Semi-supervised learning algorithm
      • Defining a graph
          - The nodes represent labeled and unlabeled examples in a dataset
          - The edges reflect the similarity of examples
      • Learning a labeling function in an iterative manner
          - It should be close to the given labels on the labeled nodes
          - It should be smooth on the whole graph
      • Related Work
          - Graph-based Learning for Relation Extraction (Chen et al., 2006)
          - Bilingual projection of POS tagging (Das and Petrov, 2011)
  • 13. Graph Construction
      • Graph Nodes
          - Instance Nodes
              • Defined for all pairs of entity candidates in both languages
              • Each instance node has a soft label vector Y = [y+ y-]
          - Context Nodes
              • For identifying the relation descriptors of the positive instances
              • Defined for each trigram located between a given pair of semantically related entities
              • Each context node has a soft label vector Y = [y+ y-]
      • Example: <ARG1> was born in <ARG2>
          - <ARG1> was born
          - was born in
          - born in <ARG2>
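
The trigram context nodes in the example above can be derived with a short helper. This is a sketch under stated assumptions: the function name and the inclusive span convention are hypothetical, and it assumes the first argument precedes the second in the sentence.

```python
def context_trigrams(tokens, arg1_span, arg2_span):
    """Return trigrams over the words between two entities, with the entities
    replaced by <ARG1>/<ARG2> placeholders. Spans are inclusive token indices
    and arg1 is assumed to come before arg2."""
    between = tokens[arg1_span[1] + 1:arg2_span[0]]
    sequence = ["<ARG1>"] + between + ["<ARG2>"]
    return [tuple(sequence[i:i + 3]) for i in range(len(sequence) - 2)]


tokens = ["Barack", "Obama", "was", "born", "in", "Honolulu", ",", "Hawaii", "."]
trigrams = context_trigrams(tokens, (0, 1), (5, 5))
for trigram in trigrams:
    print(" ".join(trigram))
```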
  • 14. Graph Construction
      • Edge Weights
          - Between instance node and context node in the same language
              \[ w(v^{i,j}, u^{k}) = \begin{cases} 1 & \text{if } v^{i,j} \text{ has } u^{k} \text{ as a contextual subsequence,} \\ 0 & \text{otherwise.} \end{cases} \]
          - Between context nodes in a language
              \[ w(u^{k}, u^{l}) = J(u^{k}, u^{l}) = \frac{|u^{k} \cap u^{l}|}{|u^{k} \cup u^{l}|}. \]
          - Between context nodes in source and target languages
              \[ w(u_{s}^{k}, u_{t}^{l}) = \frac{count(u_{s}^{k}, u_{t}^{l})}{\sum_{u_{t}^{m}} count(u_{s}^{k}, u_{t}^{m})} \]
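
The two context-to-context weights above can be sketched in a few lines. The function names are hypothetical, treating each trigram as a set of tokens for the Jaccard coefficient is an assumption, and the co-occurrence counts are made-up toy values rather than data from the deck.

```python
from collections import Counter, defaultdict


def jaccard(u_k, u_l):
    """Jaccard coefficient J(u_k, u_l) between two context trigrams,
    each treated as a set of tokens (an assumption of this sketch)."""
    a, b = set(u_k), set(u_l)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def cross_lingual_weights(cooccurrence):
    """Normalize count(u_s, u_t) by the total count of u_s over all
    target-language contexts it co-occurs with."""
    totals = defaultdict(int)
    for (u_s, _), count in cooccurrence.items():
        totals[u_s] += count
    return {pair: count / totals[pair[0]] for pair, count in cooccurrence.items()}


same_language = jaccard(("<ARG1>", "was", "born"), ("was", "born", "in"))

# Toy co-occurrence counts between source and target context nodes.
counts = Counter({("src_a", "tgt_x"): 3, ("src_a", "tgt_y"): 1, ("src_b", "tgt_x"): 2})
weights = cross_lingual_weights(counts)
print(same_language, weights)
```

With this normalization, the weights leaving any one source context sum to 1.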
  • 16. Label Propagation
      • Algorithm
          - Input
              • A transition matrix T
              • An initial label matrix Y0
          - Output
              • The updated label matrix Yt
      • Steps (from the flow diagram): Initialize T → Normalize T → Initialize Y → Update Y
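
The four steps above can be illustrated with a basic iterative label propagation loop. The slide does not give the update rule, so this sketch assumes the common scheme of row-normalizing the weights, setting Y to T·Y, and re-clamping the labeled rows after each update. The function name, the `labeled` argument, and the fixed iteration count are illustrative choices, and the variant actually run through the Junto toolkit may differ.

```python
def label_propagation(weights, y0, labeled, iterations=50):
    """Propagate soft labels over a graph.

    weights: square list of lists with edge weights (used to initialize T)
    y0:      initial label matrix, one [y+, y-] row per node
    labeled: indices of nodes whose labels are clamped to y0 after each update
    """
    n = len(weights)
    num_classes = len(y0[0])
    # Initialize T from the edge weights, then normalize each row to sum to 1.
    transition = []
    for row in weights:
        total = sum(row)
        transition.append([w / total if total else 0.0 for w in row])
    # Initialize Y with the given labels.
    y = [list(row) for row in y0]
    # Update Y: each node takes the weighted average of its neighbors' labels.
    for _ in range(iterations):
        updated = [
            [sum(transition[i][j] * y[j][c] for j in range(n))
             for c in range(num_classes)]
            for i in range(n)
        ]
        for i in labeled:
            updated[i] = list(y0[i])  # keep the given labels fixed
        y = updated
    return y


# Chain graph: node 0 (positive) - node 1 (unlabeled) - node 2 (negative).
chain = [[0, 1, 0], [1, 0, 1], [0, 1, 0]]
initial = [[1.0, 0.0], [0.0, 0.0], [0.0, 1.0]]
result = label_propagation(chain, initial, labeled={0, 2})
print(result[1])
```

In the toy chain, the unlabeled middle node sits between one positive and one negative neighbor with equal weights, so its soft label settles halfway between them.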
  • 17. Label Propagation
      • Executed in three phases
          [Figure: graph states for the 1st phase, 2nd phase, and 3rd phase]
  • 19. Implementation
      • Dataset
          - English-Korean parallel corpus
              • 266,982 bi-sentence pairs in English and Korean
              • Aligned by GIZA++
      • Annotation
          - ReVerb (Fader et al., 2011)
              • English Open IE system
      • Label Propagation
          - Junto Label Propagation Toolkit
      • Learning
          - Tree kernel-based SVM classifier
              • Shortest path dependency kernel (Bunescu and Mooney, 2005)
              • SVM-Light (Joachims, 1998)
  • 20. Evaluation
      • Dataset
          - Manually annotated Korean dataset
              • Obtained from the Web, following the work of Bunescu and Mooney (2007)
              • 500 sentences with manual annotations for four relation types
                  - Acquisition
                  - Birthplace
                  - Inventor Of
                  - Won Prize
      • Evaluation Metrics
          - Precision/Recall/F-measure
  • 21. Experimental Results
      • Direct Projection vs. Graph-based Projection

                         Direct Projection       Graph-based Projection
        Type             P      R      F         P      R      F
        Acquisition      51.6   87.7   64.9      55.3   91.2   68.9
        Birthplace       69.8   84.5   76.4      73.8   87.3   80.0
        Inventor Of      62.4   85.3   72.1      66.3   89.7   76.3
        Won Prize        73.3   80.5   76.7      76.4   82.9   79.5
        Total            63.9   84.2   72.7      67.7   87.4   76.3
  • 22. Experimental Results
      • Comparisons to other self-supervised approaches
          - Heuristic-based Approach (Banko et al., 2007; Banko et al., 2008)
              • Korean Treebank and Syntactic Heuristics
          - Wikipedia-based Approach (Wu and Weld, 2010)
              • Korean Wikipedia articles and Infoboxes

        Approach            P       R       F
        Heuristic-based     92.31   17.27   29.09
        Wikipedia-based     66.67   66.91   66.79
        Projection-based    67.69   87.41   76.30
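
The F column in the results appears consistent with the balanced F-measure, the harmonic mean of precision and recall. The deck does not state the formula, so this is an assumption, and the function name is hypothetical. A small sketch for recomputing F from the reported P and R values:

```python
def f_measure(precision, recall):
    """Balanced F-measure (harmonic mean of precision and recall)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Reported P and R for two of the compared approaches.
projection_f = f_measure(67.69, 87.41)
wikipedia_f = f_measure(66.67, 66.91)
print(round(projection_f, 2), round(wikipedia_f, 2))
```

Values recomputed this way can differ from the slides in the last digit, since the reported P and R are themselves rounded.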
  • 24. Conclusion
      • Summary
          - A graph-based projection approach for relation extraction
              • Label propagation algorithm
              • On a graph that represents the instance and context features of both the source and target languages
          - Experimental results show that our approach improves relation extraction performance compared to other approaches
      • Future work
          - To reduce the high computational complexity of the approach
          - To handle a more expanded graph structure to improve extraction performance
  • 25. Q&A