Dissertation > Excellent graduate degree dissertation topics show

Research on Key Technology and Algorithms in Unstructured Information Extraction Based on Cognition

Author: MuYiFu
Tutor: QianXu
School: Beijing
Course: Applied Computer Technology
Keywords: information extraction conditional random fields named entity recognition entityrelationship recognition cognitive science
CLC: TP391.1
Type: PhD thesis
Year: 2013
Downloads: 148
Quote: 0
Read: Download Dissertation

Abstract


With the rapid development of the computer technology, the information extraction technologyhas become one of the hot topics of natural language processing field in recent years. Furthermore,machine learning, text mining and graph algorithms have already been applied to informationextraction. However, the performance of information extraction algorithms can not be satisfied andthere are many challenging problems for further research. In this paper, by analyzing the drawbacksof the existed document representation model, we apply graph model, conditional random fields’theory, machine learning-relative knowledge to implement information extraction algorithms. Inorder to improve information extraction’s performance, some information extraction algorithms areproposed in this paper, such as named entity recognition algorithm based on rules, an improvedperson’s name recognition algorithm based on rules, named entity recognition algorithm based onrules and conditional random fields, Chinese organization’ abbreviation name generation andrecognition algorithm based on rules, person’s relationship recognition algorithm based on textclassifiction. Furthermore, the effectiveness and efficiency of these algorithms are all validated byexperiments. These proposed algorithms in this thesis broad the prospect for the development ofinformation extraction technology.

Related Dissertations

  1. Research on Domain Entity Attribute and Event Extraction Technology,TP391.1
  2. Research on Temporal Information Recognition and Normalization,TP391.1
  3. Study on Growth Monitoring Technique Based on Pixel Un-Mixing Method and HJ Remote Sensing Images in Paddy Rice,S511
  4. Land Desertification in Qinghai Lake Landscape Pattern Change,X171
  5. Active faults based radar image information extraction method applied research and demonstration,P542.3
  6. Based on high-resolution remote sensing data mining houses information extraction,TP751
  7. Scholar Resume Automatic Generation Based on Text Mining,TP391.1
  8. Research on Opinion Target Extraction,TP391.1
  9. Chinese study nested entity recognition method named,TP391.1
  10. Applications of Bibliometrics and Text Mining in the Life Science,TP391.1
  11. Research and Implement of Chinese Word Segment Techniques Based on the Conditional Random Field,TP391.1
  12. Integration of Spatial Information Bag of Feature in Image Annotation,TP391.41
  13. Chinese Automatic identification function block,TP391.1
  14. Detecting Hedges and Their Linguistic Scope in Biomedical Literatures,TP391.1
  15. Research of Information Extraction and Declaration Analysis in Program Comprehension,TP311.11
  16. Study on Chinese Name Entity Recognition and Some Related Issues,TP391.41
  17. Commercial Social Network Creation Based on Information Extraction Technology,TP391.1
  18. Semi-supervised BLOG Information Extraction Techniques Based on Document Structure,TP393.092
  19. The Research of Conditional Random Fields Based Chinese Named Entity Recognition,TP391.4
  20. Research of Chinese Phrase Identification Based on Conditional Random Fields,TP391.1
  21. The Automatic Recognition Research on Chinese Modality’s Usage Based on Rules and Statistical,TP391.1

CLC: > Industrial Technology > Automation technology,computer technology > Computing technology,computer technology > Computer applications > Information processing (information processing) > Text Processing
© 2012 www.DissertationTopic.Net  Mobile