
Word Segmentation and POS Tagging in Chinese

Author: LiuDongXu
Advisor: YangGuoWei
School: University of Electronic Science and Technology
Major: Computer Software and Theory
Keywords: Chinese natural language processing; Segmentation; Forward maximum matching; Reverse maximum matching; Intersection field; Binding degree; Chinese name recognition; Part-of-speech tagging
CLC: TP391.1
Type: Master's thesis
Year: 2003

Abstract


Word segmentation and part-of-speech (POS) tagging are basic tasks in Chinese natural language processing (NLP). Earlier students in the group have done a great deal of research in this area. This thesis summarizes their work and improves on it, in order to provide better support for follow-up studies.

Previous work on segmentation mainly used the forward maximum matching (MM) method combined with the reverse maximum matching (RMM) method, and compared binding degrees within the maximum intersection field to select a segmentation. However, that approach can handle only some maximum intersection fields. Based on statistics gathered on maximum intersection fields in large-scale real text, this thesis divides maximum intersection fields into three categories and treats each category separately, which greatly improves the ability to handle them.

Chinese name recognition is an important part of segmentation. This thesis surveys, in large-scale real text, the characters commonly used as surnames and as given names, and the characters that commonly appear before and after names. Name recognition is triggered when a surname is encountered during segmentation; its recall and precision both exceed 90%.

POS tagging is a difficult problem in Chinese natural language processing. In English, a change in a word's part of speech is often accompanied by a change in the word's form. In Chinese the word form does not change, which makes Chinese POS tagging harder. In addition to determining parts of speech by conventional methods, this thesis builds a POS determination rule table in which every word has a corresponding object. During tagging, the object for the current word is retrieved from the rule table and used to determine its part of speech.

A further task of this project was to port the system built by earlier students from VC to Java so that it can be published online.
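
The forward and reverse maximum matching mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the thesis's implementation: the toy lexicon, the `max_len` limit, the function names, and the way disagreeing spans are reported are all assumptions made for illustration. The binding-degree comparison and the three-category treatment of maximum intersection fields are not reproduced here, because the abstract does not specify them.

```python
# Toy lexicon, assumed for illustration only; a real system would load a large dictionary.
TOY_LEXICON = {"研究", "研究生", "生命", "的", "起源"}


def forward_max_match(text, lexicon, max_len=4):
    """Scan left to right, taking the longest lexicon word at each position.

    A single character is always accepted as a fallback so the scan cannot stall.
    """
    words, i = [], 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in lexicon:
                words.append(piece)
                i += size
                break
    return words


def reverse_max_match(text, lexicon, max_len=4):
    """Scan right to left, taking the longest lexicon word ending at each position."""
    words, j = [], len(text)
    while j > 0:
        for size in range(min(max_len, j), 0, -1):
            piece = text[j - size:j]
            if size == 1 or piece in lexicon:
                words.append(piece)
                j -= size
                break
    words.reverse()
    return words


def _cut_points(words):
    """Return the set of character offsets at which a segmentation places a boundary."""
    cuts, pos = {0}, 0
    for word in words:
        pos += len(word)
        cuts.add(pos)
    return cuts


def disagreeing_spans(text, fwd, rev):
    """Return substrings where the two segmentations place boundaries differently.

    These spans are candidates for intersection ambiguity; how to resolve
    them is left open in this sketch.
    """
    f_cuts, r_cuts = _cut_points(fwd), _cut_points(rev)
    shared = sorted(f_cuts & r_cuts)
    spans = []
    for start, end in zip(shared, shared[1:]):
        inner_f = {c for c in f_cuts if start < c < end}
        inner_r = {c for c in r_cuts if start < c < end}
        if inner_f != inner_r:
            spans.append(text[start:end])
    return spans


sentence = "研究生命的起源"
fwd = forward_max_match(sentence, TOY_LEXICON)
rev = reverse_max_match(sentence, TOY_LEXICON)
print(fwd)  # ['研究生', '命', '的', '起源']
print(rev)  # ['研究', '生命', '的', '起源']
print(disagreeing_spans(sentence, fwd, rev))  # ['研究生命']
```

In this example the two scans agree everywhere except on the first four characters, so only that span would need further disambiguation.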

