Dissertation > Excellent graduate degree dissertation topics show

A Study of Chinese Word Sense Disambiguation Based on Hownet

Author: SunJiMing
Tutor: LiZhouJun
School: National University of Defense Science and Technology
Course: Computer technology
Keywords: word sense disambiguation How-Net Natural language process Dependency grammar analysis
CLC: TP391.1
Type: Master's thesis
Year: 2007
Downloads: 122
Quote: 1
Read: Download Dissertation

Abstract


Word sense disambiguation (WSD) is all along an important and difficult problem in nature language processing. It is widely used in many natural language processing application systems, such as information retrieval, machine translation, text classification, text summarization and so on. At present, only some representative ambiguous words are selected as disambiguated objects in many WSD researches, which have great limitations in real application. This thesis address this problem in real text application.In this thesis, we firstly introduce the definition of WSD, the history of evolution and the trend of development. Secondly, we analyze the varieties of ambiguity problems in WSD and the corresponding disambiguation algorithms. Finally, based on experience and understanding of WSD, We propose a new method of WSD and analyze several aspects of it, including why to bring forward this strategy, its benefits, and how to take it into application, etc.The method can be summarized as below: After keywords are extracted from the preprocessed text. The ambiguous keywords are disambiguated according to their parts of speech and context. In the disambiguation process, the concepts of keywords are firstly divided into sememes according to their definitions in How-Net. Then, in order to find which words restrict the word sense, the fully dependency grammar analysis is adopted to find dominant and dominated relation among words from inner structure of a sentence. Finally, based on entity relationship of How-Net system and the atomic term of correlative words, the weight of atomic term in ambiguous words is computed, and then the word sense of the ambiguous word can be determined according to the weight.Experiments show that the proposed method has higher accuracy and on average corpuses and requires less calculating time.In a word, although plenty of efforts have been taken in research of this field, the accuracy of WSD still stays in a relative low level because of the fuzzy characteristics of word sense itself. Thus the question how to improve the effect of WSD will be the motivation and objective of our further research in this field.

Related Dissertations

  1. Word Sense Disambiguation Corpus Automatic Acquisition,TP391.1
  2. Research on Multi-Robot Cooperative Pursuit Problem,TP242
  3. Design and Implementation of the teaching file management system,TP311.52
  4. Force Online Examination System Design and Implementation,TP311.52
  5. Study on the Development of Folk Sports Non-Profit Organization,D632.9
  6. The Design of Embedded Image Transmission Terminal Based on the TCP/IP Protocol,TP368.1
  7. Grass-roots forces the day-to-day management of information systems design and implementation,TP311.52
  8. Tibet border combat training information management system design and realization of research,TP311.52
  9. Research Onlandscape Ecological Net Rack System Construction of Nanjing,TU986
  10. Design and Development of the Early-Warning System for Soil Pollution Based on .NET and ArcGIS Engine,X833
  11. Research on Biological Productivity and Characteristics of Soil Nutrients of Main Tree Species for Shelterbelt in Haitan Island,S727.2
  12. Murine Peritoneal Macrophages Transcriptional Responses Following in Vivo Infection with Streptococcus Suis Type 2,S858.91
  13. Some Pharmacology and Toxicity of Three Prescriptions from Chinese Veterinary Pharmacopoeia,S859.5
  14. The Problems and Solutions of University’s Ideological and Political Education under Network Environment,G641
  15. Website design and construction of grass-roots units,TP311.52
  16. Forces housing management system development and implementation of,TP311.52
  17. The Design and Implementation of Student Information Management System Based on Workflow,TP311.52
  18. Effect of Chlorophyll Deficient and High Nitrogen on Soybean Photosynthesis Capacity,S565.1
  19. Ecological Function of Cassava-Peanut Intercropping in Small Red Soil Watershed,S533
  20. Based on Modbus Protocol pressure medical gas distribution monitoring system development,R197.39
  21. Risk Assessment of FPSO in Off-loading Operation,U698

CLC: > Industrial Technology > Automation technology,computer technology > Computing technology,computer technology > Computer applications > Information processing (information processing) > Text Processing
© 2012 www.DissertationTopic.Net  Mobile