Dissertation > Excellent graduate degree dissertation topics show

Named Entity Recognition Based on Machine Learning Approach

Author: RenDengJun
Tutor: ZhangLi
School: Northeastern University
Course: Computer Software and Theory
Keywords: named entity recognition machine learning method Maximum entropy Boosting
CLC: TP391.43
Type: Master's thesis
Year: 2005
Downloads: 319
Quote: 3
Read: Download Dissertation

Abstract


Named Entity Recognition techniques have been the focus of recent Natural Language Processing research. NER is a form of information extraction in which we seek to classify every word in a document as being a person-name, location, organization, date, time, number or none of above. NER as a subtask of Information Extraction has been applied on many compute linguistics tasks, such as machine translation.In this thesis, two machine learning methods are applied on named entity recognition. One is maximum entropy method and another is boosting algorithm. The machine learning methods are both robust and portable compared with rule based approach. The system based on machine learning approach can be ported a new domain or language with minimal expense. At first a character-based model and a word-based model are constructed which only some basic features are employed. We compare the performance of two models on the task. In order to use the advantages of the two models, we decode the word segmentation information into the character-based model. Moreover, some complex linguistics knowledge features are employed in the model. The result of experiment shows that the performance of the model is better than the others mentioned above. Meanwhile, we compare the performance of the classifiers under the same conditions.Finite State Machine is employed to recognize date, time and number and extract the candidates of foreign person-name in a document. As a result, we concentrate on three types in machine learning framework, which can decrease the complexity of these algorithms. Finally global information is utilized to increase the performance of NER system.

Related Dissertations

  1. The Research for Named Entity Recognition and Relation Extraction in Text,TP391.1
  2. Shanghai Study on the Effects about Government Boost in the Development of Agricultural Insurance,F842.6
  3. Ontology-based medicine named entity recognition technology research,TP391.1
  4. CRF -based named joint extraction of entities and relationships,TP391.4
  5. Multi-target tracking algorithm,TN953
  6. Click data and search results based on fragments excavated named entities,TP391.3
  7. Hash based on structured sparse spectrum image indexing algorithm,TP391.41
  8. Mobile robot based on 3D laser rangefinder object detection,TP242
  9. Chinese named entity recognition and disambiguation of,TP391.1
  10. Effects of Qi-Boosting Toxin-Resolving Formula on the Structure of Cellular Adhesion System in the Implanted Tumors with Human NPC Cells Among Nude Mice,R739.63
  11. The Clinical Study on the Effect of Boosting Qi and Nourishing Yin, Transforming Stasis and Freeing the Collaterals to DML of Patients with DPN,R259
  12. Study on Chinese Name Entity Recognition and Some Related Issues,TP391.41
  13. Design of Analog-to-Digital Converter for the RFID Location System Based on AOA,TN792
  14. Examples of classification based on neighbor selection algorithm,TP181
  15. The Research of Conditional Random Fields Based Chinese Named Entity Recognition,TP391.4
  16. Chinese Named Entity Recognition Based on Conditional Random Fields,TP391.43
  17. The Study of POI Abbreviations Dictionary in the Filed of Location Search,TP391.3
  18. Research on Ensemble Technique for Multiple Classifiers,TP311.13
  19. Synonym Recognition Based on User Behaviors in E-commerce,TP391.1
  20. Research on Product Named Entity Recognition and Normalization,TP391.1
  21. English two-way time numbers and quantifiers identification and translation technology,TP391.2

CLC: > Industrial Technology > Automation technology,computer technology > Computing technology,computer technology > Computer applications > Information processing (information processing) > Pattern Recognition and devices > Character recognition devices
© 2012 www.DissertationTopic.Net  Mobile