Dissertation > Excellent graduate degree dissertation topics show

The SVM Algorithm and Its Application Based Data Preprocessing in the Kernel Space for Unbalanced Data

Author: HaoSiZuo
Tutor: TaoXinMin
School: Harbin Engineering University
Course: Communication and Information System
Keywords: The problem of unbalanced classification Kernel cluster under-sample AdaBoost Sample properties Fault diagnosis
CLC: TP18
Type: Master's thesis
Year: 2013
Downloads: 16
Quote: 0
Read: Download Dissertation

Abstract


The problem of unbalanced data exists in various fields, such as medical field, the fieldof fault diagnosis and the field of fraud detection. Therefore studying an effective algorithm tosolve the unbalanced classification is great scientific significance and application value. Butwhen the classical classification algorithm applied to the unbalanced data, the classificationperformance of the algorithm is far from ideal. In addition, in the field of fault diagnosis,because support vector machine classification algorithm has faster convergence rate, strongstability and generalization ability, it has replaced the neural network algorithm widely used.The paper bases on support vector machine classification algorithm and focuses on how makethe SVM classification interface bias toward the majority instances appropriately.First, research the knowledge of unbalanced algorithms and machine fault diagnosis,then analyze the basis of knowledge and research. In order to select the majority instanceswith information and representative information of the spatial structure of the majority class,we present a novel under-sampling algorithm based on kernel cluster. Majority instances areclustered using fuzzy C-Means clustering algorithm (KFCM) in kernel space for randomlysampling representative samples with cluster information. The selected majority instances aswell as minority instances are used to learn classifier. Substantially the AdaBoost ensemble isused to integrate the proposed unbalanced classification component based on KernelCluster-based under-sampling, so the SVM classification performance under unbalanceddataset is improved. In the experiments, the proposed approach is compared with otherdata-preprocess methods for unbalanced dataset classification, the experimental resultsdemonstrate that the proposed method can not only improve classification performance ofSVM but also algorithm complexity.Secondly, we present the other support vector machine algorithm for unbalanced databased on sample properties under-sampling. We use Euclidean distance in the kernel space toselect the majority instances, then according to the sample’s density some representativemajority instances located near the classification interface are selected. In the experiments, theproposed approach is compared with other data-preprocessing methods for unbalanced datasetclassification, the experimental results demonstrate that the proposed method can improveclassification performance of SVM in the minority class data, the overall classification performance and robustness.Finally, applying the SVM classifier for unbalanced data based on kernel cluster-basedunder-sampling ensemble approaches to the field of fault diagnosis, it can achieve goodresults by experimental the method.

Related Dissertations

  1. Research on the Classification Based on the Reconstruction of Solder Joint,TP391.41
  2. Tongue Feature Extraction and Research of Fusion Classification,TP391.41
  3. The Fatigue State Recognition of the Driver Based on Eye Detection,TP391.41
  4. Feature Extraction, Selection and Combination in Lipreading,TP391.41
  5. The Research on Paper Currency Classification Method Based on Harr-Like Feature and Minimal Ball Including Samples,TP391.41
  6. Research on the Key Technology of Waterborne Transport Security System,U698
  7. Research on Predicting Intrinsic Disorder Protein Structure Based on Supervision Manifold Learning Algorithm,Q51
  8. The overall design and system development RTAD-CMDMDES of,U279.3
  9. Research of Support Vector Machine Based Fault Diagnosis System,TH165.3
  10. Fault Diagnosis Method Study of Hydraulic System of Concrete Pump,TU646
  11. Steam Turbine Thermal Analysis and Modeling Principles and Operation Fault Diagnosis,U664.113
  12. Research of Fault Diagnosis Method of Analog Circuit Based on Improved Support Vector Machines,TN710
  13. The Method of Transient Modeling and Parameter Identification and Its Application in Rotating Machine Fault Diagnosis,TH165.3
  14. GPRS-based transformer fault diagnosis system,TM407
  15. Study on Decentralized Fault Diagnosis Based on Multiblock Kernel Methods,TH165.3
  16. Study on Intelligent Diagnosis of Measuring Radar,TN957
  17. Research of Fault Diagnosis Algorithm Based on Multivariate Statistics Analysis,TP277
  18. Research on Intelligent Fault Diagnosis and Fault-tolerant Control for Affine Nonlinear Systems,TP13
  19. Development of Fault Diagnosis System for Turbine Shaft Based on Neural Network,TK267
  20. Rail vehicle bearing fault diagnosis and research,U279.3
  21. Life greased ball bearing noise test technique,TH133.3

CLC: > Industrial Technology > Automation technology,computer technology > Automated basic theory > Artificial intelligence theory
© 2012 www.DissertationTopic.Net  Mobile