Dissertation > Excellent graduate degree dissertation topics show

The Research on English-Chinese Name Entity Translation

Author: ZhaoMingMing
Tutor: YaoJianMin
School: Suzhou University
Course: Applied Computer Technology
Keywords: Machine Transliteration Model Machine Learning Web Mining Statistical Machine Translation Word Alignment
CLC: TP391.2
Type: Master's thesis
Year: 2011
Downloads: 81
Quote: 0
Read: Download Dissertation

Abstract


Named Entity (NE) translation is an important sub-task in multilingual language processing, such as Machine Translation and Cross-lingual Information Extraction. Especially in a Statistical Machine Translation system, NE translation is an important factor reinforcing the system performance. Different types of NE have different translation characteristics. Person Name and Location Name Translation is mainly implemented by transliteration, The Combination of translation and transliteration is employed to translate Organization Names.This thesis concentrates on English-Chinese Person Name Transliteration modeling methods and Web-based Name Entity Translation, The contributions of this works is summarized as follows:Statistical Machine Translation-based and Machine Learning-basedEnglish-Chinese Person Name Transliteration modeling methods Name transliteration problem is transformed into a general sentence translation problem by The Statistical Machine Translation-based Transliteration model. Two machine translation approaches: the phrase-based model and the N-Gram model are applied to transliteration modeling problem. In the Machine Learning-based transliteration model, transliteration problem is transformed into a sequence-labeling problem. We test two Machine Learning methods: Maximum Entropy model and Conditional Random Fields. We compared the performance of five modeling methods. Machine Learning-based model proves to give a better performance and the Conditional Random Fields get the best accuracy.Transliteration and Web-based Name Entity Translation MethodWe propose a Person Name translation mining method, which makes use of transliteration model results as heuristic query expansion to improve the quality of the snippets. High quality snippets enhance the Name Entity translation inclusion rate. We compare the performance of Model-based transliteration method and Transliteration and Web-based method. Experiment show that the second method gives a better performance. The second transliteration method fixes incorrect Chinese character in transliteration results of model-based method.Web-based Organization Name translation mining method We propose a method which can extract the Chinese translation for a English Organization Name from bilingual webpage. The words of Organization Name is aligned by a method which named alignment-anchor expansion-based .and then a greedy algorithm is used to extract phrase and word translation pairs to build bilingual dictionary from aligned ON pairs. ON translation is extracted from webpage using the extracted bilingual phrase and word dictionary. We compare the performance of machine translation-based method and Web-based method. Experiment show that the second method gives a better performance.

Related Dissertations

  1. Research and Implementation of Mining Implicit User Interest,TP311.13
  2. The Research of Decoding Algorithm for Statistical Machine Tranlation,TP391.2
  3. Based on Data Distribution Characteristics of Text Classification,TP391.1
  4. Prediction of Binding Affinity of Human Transporter Associated with Antigen Processing,R392.1
  5. Distortion effects on image quality evaluation and classification,TP391.41
  6. Home Academic Information Extraction System,TP393.092
  7. Based on self-learning social relation extraction research,TP391.1
  8. Based on rough sets and SVM national defense Comprehensive Quality Assessment Methods,E075
  9. Machine learning based on sparse coding and image content recognition algorithm,TP391.41
  10. Template independent web information extraction,TP393.092
  11. The Dynamic Distributed network intrusion patterns,TP393.08
  12. Research on Personalized Recommender System Based on Collaborative Filtering Algorithm,TP393.09
  13. Study on Collaborative Filtering Recommendation Based on Users’ Interest Clustering,TP393.09
  14. Related Studied on Information Extraction and Information Recommendation Based on Web Data Mining,TP393.09
  15. Sort learning based automatic evaluation method of translation,TP391.2
  16. A Improvement on Method for Session Identification in Web Log Mining,TP393.09
  17. Based on support vector machine algorithm to optimize the RBF neural network and applied research,TP18
  18. Research on the Machine Learning Theroy and Its Application in the Vehicle Navigation System,TN966
  19. Research on Cache Coherence Protocol Simulation Verification Method of CC-NUMA System,TP306
  20. Research on Web Mining Based on Social Network Analysis Methods,TP311.13
  21. Comparative Study on Three Preprocessing Methods in Statistical Machine Translation,H085

CLC: > Industrial Technology > Automation technology,computer technology > Computing technology,computer technology > Computer applications > Information processing (information processing) > Translator
© 2012 www.DissertationTopic.Net  Mobile