Dissertation > Excellent graduate degree dissertation topics show

Research on Chinese Named Entity Recognition for Information Extraction

Author: CuiXiangYang
Tutor: WangXiaoYu
School: Harbin University of Science and Technology
Course: Applied Computer Technology
Keywords: information extraction chinese named entity recognition hiarchicaltemperal memory model hidden markov model
CLC: TP391.1
Type: Master's thesis
Year: 2012
Downloads: 4
Quote: 0
Read: Download Dissertation

Abstract


Information Extraction’s purpose is to extract information in texts according tospecial needs. Named Entity Recognition is a foundamental and importanttechnology in both information extraction and natural language processing fields.Named Entity is the basic unit in natural language texts. It’s impossible to understandtexts properly without correct recognition of named entity. It plays an important rolein kinds of fields such as Information Extraction, Text Classification, InformationRetrieval, Question Answering Track and so on. In Chinese the process of NamedEntity Recognition is usually divided into two parts: Chinese segmentation andnamed entity tagging. In recent years, this technology has been researched andapplied widely and has shown more and more potential value.In this paper, recognition on personal name, place name and institude name hasbeen researched. The main contents include:1.Chinese Named Entity Recognition has been introduced. The difficulties inpersonal, place and institude name recognition have been exactly analyzed.2.Through Hidden Markov Model and Hiarchical Temperal Memory Model,take experiments on Named Entity Recognition, especially on the recognition ofpersonal name, place name and institude name. Hidden Markov Model has been amature model after years of research and application, and has shown brilliantperformance in efficiency and effects. Hiarchical Temperal Memory Model is anmachine learning model come up recently which imitated the structure and functionsof cortex.3. Based on the results of each experiment, compare and analyze theperformance of Hidden Markov Model and Hiarchical Temperal Memory Model.Traditional Hidden Markov Model doesn’t take full advantage of contecxtinformation in the text, compared with Hiarchical Temperal Memory Model. TheHiarchical Temperal Memory Model this paper introduced has shown proper performance and efficiency. Experimental results show that the Hiarchical TemperalMemory Model can be applied to Chinese Named Entity Recognition field suitably.

Related Dissertations

  1. Packet Loss Recovering Technology for Speech Transmission over Network,TN912.3
  2. Research on Domain Entity Attribute and Event Extraction Technology,TP391.1
  3. Study on Growth Monitoring Technique Based on Pixel Un-Mixing Method and HJ Remote Sensing Images in Paddy Rice,S511
  4. Land Desertification in Qinghai Lake Landscape Pattern Change,X171
  5. Based on high-resolution remote sensing data mining houses information extraction,TP751
  6. Scholar Resume Automatic Generation Based on Text Mining,TP391.1
  7. Research and Application of Information Extraction Based on Query Expansion,TP391.1
  8. Object-Based Automatic Extraction of Change Information Based on High-Resolution Remote Sensing Image Research,P237
  9. Applications of Bibliometrics and Text Mining in the Life Science,TP391.1
  10. Research on Application to Video Scene Analysis of Hidden Markov Model,TP391.41
  11. Voice recognition in the access control application,TN912.34
  12. Rule-based Web text information extraction technology research,TP391.1
  13. The Research for Named Entity Recognition and Relation Extraction in Text,TP391.1
  14. Research and Application of Semantic-Based Information Extraction,TP393.09
  15. Home Academic Information Extraction System,TP393.092
  16. Dynamic learning framework based on structured automatic web data extraction method,TP393.092
  17. Multi-threaded fusion soccer video semantic analysis and event detection,TP391.41
  18. Research and Implement of Function-level Program Understanding and Visualization Based on C/C++,TP311.11
  19. Research on Information Extraction and Visualization in Program Comprehension,TP311.1
  20. Research of Information Extraction and Declaration Analysis in Program Comprehension,TP311.11
  21. Research on Domain Ontology-Based Web Information Extraction,TP393.09

CLC: > Industrial Technology > Automation technology,computer technology > Computing technology,computer technology > Computer applications > Information processing (information processing) > Text Processing
© 2012 www.DissertationTopic.Net  Mobile