Dissertation > Excellent graduate degree dissertation topics show

Research on Chinese Event Extraction and Filling of Missing Event Argument

Author: HouLiBin
Tutor: QianPeiDe; LiPeiFeng
School: Suzhou University
Course: Applied Computer Technology
Keywords: Event Extraction Filling of Event Argument Cross-eventInference Conditional Random Fields (CRF) Semantic Role Labeling (SRL)
CLC: TP391.1
Type: Master's thesis
Year: 2012
Downloads: 49
Quote: 0
Read: Download Dissertation

Abstract


Event extraction, a research field of Information Extraction, focuses on how to extract event mention of specific type and its arguments. Nowadays, most of the researches are based on English corpus while Chinese event extraction is still at an elementary stage.In this dissertation, we propose a new method and more effective features for Chinese event extraction based on the existing Chinese event extraction system. Otherwise, we find the full information of an event is often distributed in various parts of the document through the analysis of event extrction results. However, the sentence-level event extraction approaches always ignore these arguments which are out of the sentence, so that a large number of event arguments were missing in our experiment. Therefore, we also propose a theory of filling the missing event arguments based on cross-event inference. The study can be concluded as follow.1. According to the nature of Chinese, this dissertation adopts CRF(Conditional Random Fields) model in trigger detection to solve the problem of the inconsistency between Chinese word segmentation and trigger word boundary. Otherwise, it frist uses cross-event inference in the stage of event type recognition, which expands the feature set form sentence-level information to discourse-level one. Experimental results on ACE2005Chinese Corpus show that our two methods can promote both the accuracy of trigger detection and performance of event type recognition. Compared with the state-of-the-art system, the Fl-measure of our approach can be improved by5.5%and2.5%respectively.2. This dissertation explores the CRF-based event argument extraction approach and summarizes all features into five categories:lexical, semantic, dependency, syntactic and relative-position features. By exploring various features and their combination, we find out that semantic role feature play a important role in our features. Experimental results also show that CRF model has better performance and semantic role is a good indicator for event argument extraction. Compared with the state-of-the-art system, Fl-measure of our event argument extraction approach can be improved by5.1%. 3. To evaluate our argument filling approach, we annotated these missing arguments in ACE2005Chinese corpus firlstly. And then this dissertation proposes a machine learning-base method to fill the missing argument based on the statistic and analysis on the annotated corpus. Our method contains two parts:missing argument identification and classification. The first stage decides whether a missing event argument can be filled while second one decides which argument in other event mention in this document can be used to fill the missing event argument. The experimental results show that the Fl-measure of our method reaches72.97and74.68respectively.

Related Dissertations

  1. Research on Domain Entity Attribute and Event Extraction Technology,TP391.1
  2. Research on Key Technologies of Automatic Domain Ontology Construction,TP391.1
  3. Study of Chinese Event Information Extraction Based on Hownet Semantic Relation,TP391.1
  4. Research on Extraction and Tracking of People’s Opinion,TP391.1
  5. Research on Key Techniques for Subjectivity Detection of Microblogs,TP391.1
  6. Research and Application on Chinese Topic Event Extraction,TP391.1
  7. Research on Related Technologies of Domain Information Extraction,TP391.1
  8. Research on Key Technology of Event Extraction Based on Frame,TP391.1
  9. Web event information extraction,TP393.092
  10. Research on Typical Event Extraction Technology in the Field of Music,TP391.1
  11. Research on Sentence Level Chinese Event Extraction,TP391.1
  12. Technique Research of Web Chinese Event Automatic Detection,TP393.09
  13. The Reasreach and Implementation of Semantic-Based Event Extraction Method for Chinese Text,TP391.1
  14. Research for Event Extraction Method in Specific Domain Based on Tree Conditional Random Field,TP391.1
  15. Research on Chinese Event Extraction,TP391.1
  16. Research on Chinese Event Extraction Technology,TP391.1
  17. Causal Relation Recognition Between Sentence-Based Events,TP391.1
  18. Conditional Random Fields Based Location Name Recognition in Ancient Chinese,TP391.3
  19. Research on Basic Algorithms of Digital Image Processing and Implementation with FPGA,TP391.41
  20. Research on Facial Feature Extraction and Matching Algorithms for Image Retrieval,TP391.41
  21. Research of High Speed Image Pre-processing System Based on FPGA,TP391.41
  22. Research on Algorithms of 2D Face Template Protection,TP391.41

CLC: > Industrial Technology > Automation technology,computer technology > Computing technology,computer technology > Computer applications > Information processing (information processing) > Text Processing
© 2012 www.DissertationTopic.Net  Mobile