
Research on Sentence Similarity Computation Methods Based on Hownet

Author: ZhangYuJuan
Advisor: DuanHongMei
School: China University of Geosciences (Beijing)
Major: Computer Application Technology
Keywords: Chinese information processing; sentence similarity computation; Hownet; question answering
CLC: TP311.13
Type: Master's thesis
Year: 2006
Downloads: 798
Quote: 14

Abstract


Text similarity is a systematic research subject, and similarity computation at the different levels of text is closely interrelated. This thesis studies similarity computation at several levels, with an emphasis on sentence similarity. First, different methods of word similarity computation are analyzed, and the Hownet-based word similarity method, one of the best-performing methods at present, is implemented. Second, two new sentence similarity computation methods are presented: one based on the co-occurrence of similar word pairs, and one based on a semantic expression model. Finally, the new methods are applied in a bank-domain automatic Chinese question answering system to test their feasibility and validity. More specifically, the main work and results of this thesis are as follows:

(1) Methods for Chinese word similarity computation are analyzed, and the Hownet-based method is implemented. At present, mainstream Chinese word similarity computation is based on semantic dictionaries, especially Hownet. This approach is better than methods based on shared characters because it computes similarity from the concepts that the words represent, and it also avoids the effects of data noise and data sparseness found in statistics-based methods. The method was implemented so that it can be used in sentence similarity computation.

(2) A new sentence similarity computation method based on the co-occurrence of similar word pairs is presented. Because sentence structure analysis is difficult, the main approach to sentence similarity computation uses the similar words shared between sentences. The words in a sentence are related to each other through syntax and semantics, so the co-occurrence of similar words between two sentences makes a mutually reinforcing contribution to their similarity. A formula for computing this mutual reinforcement contribution is given, and sentence similarity is computed on that basis.

(3) An application example of sentence similarity computation in a question answering system is given. A question answering system provides a natural-language human-machine interface. Compared with a traditional keyword-based search engine, a question answering system is more accurate, simpler, and more efficient. The similarity computation is used to search for similar questions in the FAQ base of the QA system, which shows how it is realized in practice. Similar-question search experiments verify the feasibility and validity of the new methods.
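
The abstract does not state the thesis's formulas, so the following is only a minimal sketch of the general pipeline it describes: word similarity feeds sentence similarity, which in turn drives similar-question search in an FAQ base. Everything in it is an assumption made for illustration. `TOY_WORD_SIM` is a hypothetical lookup table standing in for a Hownet-based word similarity, sentences are assumed to be already segmented into word lists (English tokens are used here for readability), and the sentence score is a plain best-match average in both directions, not the thesis's similar-word-pair co-occurrence formula.

```python
from typing import Dict, FrozenSet, List, Optional, Tuple

# Hypothetical word-pair scores standing in for a Hownet-based word similarity.
TOY_WORD_SIM: Dict[FrozenSet[str], float] = {
    frozenset(("open", "create")): 0.8,
    frozenset(("account", "card")): 0.5,
}


def word_sim(a: str, b: str) -> float:
    """Return 1.0 for identical words, the table score if listed, else 0.0."""
    if a == b:
        return 1.0
    return TOY_WORD_SIM.get(frozenset((a, b)), 0.0)


def directed_sim(src: List[str], dst: List[str]) -> float:
    """Average, over the words in src, of each word's best match in dst."""
    if not src or not dst:
        return 0.0
    return sum(max(word_sim(w, v) for v in dst) for w in src) / len(src)


def sentence_sim(s1: List[str], s2: List[str]) -> float:
    """Symmetric score: mean of the two directed best-match averages."""
    return (directed_sim(s1, s2) + directed_sim(s2, s1)) / 2.0


def best_faq_match(
    query: List[str],
    faq: List[Tuple[List[str], str]],
    threshold: float = 0.5,
) -> Optional[Tuple[str, float]]:
    """Return (answer, score) for the most similar stored question,
    or None if no stored question reaches the threshold."""
    best: Optional[Tuple[str, float]] = None
    for question, answer in faq:
        score = sentence_sim(query, question)
        if score >= threshold and (best is None or score > best[1]):
            best = (answer, score)
    return best


# Toy FAQ base: (segmented question, answer) pairs.
faq_base = [
    (["create", "account"], "A1"),
    (["report", "lost", "card"], "A2"),
]
match = best_faq_match(["open", "account"], faq_base)
```

In this toy setup the query is matched to the first stored question because "open" and "create" are listed as similar and "account" matches exactly. A co-occurrence method of the kind described in point (2) would presumably add a further term that rewards several similar word pairs appearing together, but its exact form is not given in the abstract.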

Related Dissertations

  1. Research of Question Answering System Based on the Analysis of Lexical and Semantic Meanings,TP391.1
  2. Research of Text Clustering on Food Complaint Documents Based on Ontology,TP391.1
  3. Research on Web Content Filtering Based on Concepts of Collection,TP393.092
  4. Design and Implementation of Network Based Computer Assisted Instruction System for Liaoning Forestry Vocational Technical College,TP311.52
  5. Study on Syntactic and Semantic Relation in "V+N" Structure for Chinese Information Processing,H146
  6. The Design and Implementation of Hownet-Based Semantic Retrieval Model,TP391.3
  7. Text on the extraction of the same event in the Chinese-English bilingual web resources,H08
  8. Research on Chinese Phrase Structure Ambiguities Based on Semantic Analysis and Its Implementation,TP391.1
  9. Study of Modern Distance Education System Based on Web Service and Multi-Agent,TP393.09
  10. Research on Open-Domain Question Answering System,TP18
  11. Automatic identification of the longest noun phrase containing,H146.3
  12. Based on the research and development of a remote web - Answering System,G434
  13. The Semantic Relation Pattern of "V[Double Syllables]+V[Double Syllables]" & Automatic Recognition,H13
  14. Substituting Zuheci for Liheci,H13
  15. The Research on Conducting Chemical Domain Text Classifier Based on Hownet,TP391.1
  16. Studies on and Implementation of Selected Topics in Chinese Information Processing,TP391.1
  17. A Study of Constructing Rules of Phrases in Contemporary Chinese for Chinese Information Processing,H146
  18. A Research on the Construction of Chinese FrameNet,H13
  19. Answer Exaction of Question Answering Based on Web,TP311.52
  20. Study of Chinese Event Information Extraction Based on Hownet Semantic Relation,TP391.1
  21. Research on Domain Ontology and the Application in Mobile Question Answering,TP391.6

CLC: > Industrial Technology > Automation technology,computer technology > Computing technology,computer technology > Computer software > Program design,software engineering > Programming > Database theory and systems