Research on Feedback Learning in Chinese Text Categorization

Author: ZhangZhiGuo
Tutor: LiuHuaiLiang
School: Xi'an University of Electronic Science and Technology
Course: Information Science
Keywords: Support Vector Machine; K Nearest Neighbor; Text Classification; Feedback Learning
CLC: TP391.1
Type: Master's thesis
Year: 2009
Downloads: 174
Citations: 7

Abstract


With the rapid expansion of information on the Internet, online information resources are growing exponentially, and people face the problem of discovering and mining the information they need from this vast volume. This calls for effective methods of automatic text classification by computer that improve the efficiency and accuracy of classification. However, a training corpus of limited size can hardly cover every category, and the original classifier becomes outdated over time as categories acquire many new features. If the original classifier is still used on the texts to be classified, classification errors and omissions may result. Feedback learning is an effective way to adjust and improve the classification model dynamically as information changes. Dynamically improving the classification model based on user feedback is therefore a problem that currently needs to be solved.
Building on a broad survey of the state of text classification research, this thesis summarizes the key technologies of text classification: word segmentation, text representation, feature selection and feature weight calculation, classification algorithms (in particular the support vector machine classifier and the K nearest neighbor classifier), and classification performance evaluation. Using text sets of different sizes, it compares the effect on classification performance of five feature selection methods: information gain, mutual information, expected cross-entropy, the χ² statistic, and weight of evidence for text. It also analyzes experimentally how the feature selection algorithm and the choice of kernel function affect the performance of the support vector machine classifier, and how the feature vector dimension and the value of K affect the performance of the K nearest neighbor classifier.
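
As an illustration of one of the five feature selection methods named above, the χ² statistic can be sketched as follows. This is a minimal sketch and not the thesis's implementation: it assumes documents are already segmented into sets of tokens, uses the standard 2×2 contingency form χ²(t, c) = N(AD − CB)² / ((A + C)(B + D)(A + B)(C + D)), and ranks each term by its maximum score over categories. The function names `chi_square` and `select_features` and the toy data are hypothetical.

```python
def chi_square(docs, labels, term, category):
    """Chi-square statistic between a term and a category.

    docs   : list of token sets (one set per document)
    labels : list of category names, parallel to docs
    """
    a = b = c = d = 0
    for tokens, label in zip(docs, labels):
        has_term = term in tokens
        in_cat = label == category
        if has_term and in_cat:
            a += 1          # in category, contains term
        elif has_term:
            b += 1          # other category, contains term
        elif in_cat:
            c += 1          # in category, lacks term
        else:
            d += 1          # other category, lacks term
    n = a + b + c + d
    denom = (a + c) * (b + d) * (a + b) * (c + d)
    if denom == 0:
        return 0.0          # term or category is degenerate
    return n * (a * d - c * b) ** 2 / denom


def select_features(docs, labels, k):
    """Keep the k terms with the highest max-over-categories score."""
    vocab = set().union(*docs)
    cats = set(labels)
    scores = {t: max(chi_square(docs, labels, t, c) for c in cats)
              for t in vocab}
    # Sort by score (descending), then alphabetically for stable output.
    return sorted(scores, key=lambda t: (-scores[t], t))[:k]


# Toy corpus (assumed data, for illustration only).
docs = [{"goal", "match"}, {"goal", "team"},
        {"stock", "market"}, {"stock", "team"}]
labels = ["sports", "sports", "finance", "finance"]
top_terms = select_features(docs, labels, 2)
```

On this toy corpus, "team" occurs equally often in both categories and scores zero, while "goal" and "stock" each occur in only one category and are ranked highest.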
Building on this in-depth study of Chinese text classification, the thesis introduces relevance feedback, analyzes in detail the basic idea of feedback learning for text classification, discusses the feedback learning process and the feedback learning algorithm in depth, constructs a Chinese text classification model based on feedback learning, and describes the framework and functional modules of a feedback learning system for Chinese text categorization. Finally, experiments on training-set and non-training-set data show that feedback learning plays a significant role in improving classification performance, that the quality of the training samples is important to the classification performance that is learned, and that user feedback brings uncertainty to classification. The training, classification, and feedback cycle of Chinese text classification refines the classification model: the classifier moves gradually from an insufficiently trained stage toward a fully trained stage, and classification performance gradually stabilizes. Research on feedback learning for Chinese text classification therefore has strong theoretical and practical significance.
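
The training, classification, and feedback cycle described above can be sketched with a small K nearest neighbor classifier. The abstract does not specify the thesis's feedback learning algorithm, so everything here is an assumption made for illustration: the class name `FeedbackKNN`, the cosine-weighted vote, and the rule that a user-corrected document is simply added to the training samples.

```python
import math
from collections import Counter


def cosine(u, v):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(u[t] * v[t] for t in u if t in v)
    nu = math.sqrt(sum(x * x for x in u.values()))
    nv = math.sqrt(sum(x * x for x in v.values()))
    return dot / (nu * nv) if nu and nv else 0.0


class FeedbackKNN:
    """KNN text classifier that grows its training set from feedback."""

    def __init__(self, k=3):
        self.k = k
        self.samples = []  # list of (term-frequency Counter, label)

    def train(self, tokens, label):
        self.samples.append((Counter(tokens), label))

    def classify(self, tokens):
        vec = Counter(tokens)
        nearest = sorted(self.samples,
                         key=lambda s: cosine(vec, s[0]),
                         reverse=True)[:self.k]
        votes = Counter()
        for sample_vec, label in nearest:
            votes[label] += cosine(vec, sample_vec)  # similarity-weighted
        return votes.most_common(1)[0][0] if votes else None

    def feedback(self, tokens, predicted, correct):
        """Assumed feedback step: store the user-confirmed label.

        Returns True when the feedback corrected a wrong prediction.
        """
        self.train(tokens, correct)
        return predicted != correct


clf = FeedbackKNN(k=3)
clf.train(["goal", "match", "team"], "sports")
clf.train(["team", "score", "goal"], "sports")
clf.train(["stock", "market", "price"], "finance")
clf.train(["bank", "price", "stock"], "finance")

doc_a = ["transfer", "fee", "club", "price"]
doc_b = ["club", "transfer", "price"]

before = clf.classify(doc_b)              # only "price" overlaps: finance
predicted = clf.classify(doc_a)           # also finance
clf.feedback(doc_a, predicted, "sports")  # user corrects the label
after = clf.classify(doc_b)               # now sports
```

In this toy run the classifier first assigns both documents to "finance" because "price" is their only known term; once the user's correction for the first document is stored, the second is classified as "sports". Because every piece of feedback is trusted here, a wrong user label would degrade the model, which is one way the uncertainty mentioned above could arise.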

CLC: Industrial Technology > Automation Technology, Computer Technology > Computing Technology, Computer Technology > Computer Applications > Information Processing > Text Processing