Dissertation > Excellent graduate degree dissertation topics show

Research and Improvement on K-Means Clustering Algorithm

Author: OuChenWei
Tutor: ChenZuo
School: Changsha University of Science and Technology
Course: Applied Computer Technology
Keywords: Clustering algorithms K-means algorithm Differential evolution algorithm
CLC: TP311.13
Type: Master's thesis
Year: 2011
Downloads: 124
Quote: 0
Read: Download Dissertation

Abstract


With the rapid development of computer technology, people face all kinds of data, such as text data, image data, audio data, video data and so on. The quantity of these kinds of data is very large. How to quickly and effectively gain implicit and valuable information from these mass data has been a problem that has got much attention and should been solved urgently. Data mining (DM) has appeared in this situation. It has provided lots of efficient methods and tools on solving that problem for people. The Clustering analysis is one important method of them. It is an important part of data mining. With the gradually intensive research on clustering analysis these years, its importance has been recognized by people more and more. Clustering analysis technology has gained plentiful and substantial achievements in both theory and practice during recent years. At present, clustering analysis has been widely applied in machine learning, pattern recognition, image processing, text classification, marketing, statistical science and lots of others fields.According to the difference of data type, clustering purpose and application, we can divide existing clustering algorithms into partition algorithm, hierarchical algorithm, grid-based algorithm, density-based algorithm and model-based algorithm. One of the most mature and classical clustering algorithms is k-means clustering algorithm. It is a partition algorithm. This paper presents deeply research and analysis on merits and defects of k-means clustering algorithm. This paper has provided a improvement on k-means clustering algorithm according to the feature that the results of k-means clustering algorithm liable to be effected by initial centers. Following are the main works have been done:1. According to the defect that K-means clustering algorithm is dependent on the initial clustering centers selection, this paper put forward a new initial clustering centers selection method of k-means algorithm. The experiments showed that this method has effectively solved the problem that the clustering result is always unstable due to the initial clustering centers overly close to each other and has improved effectiveness and stability of the clustering result.2. Aiming to the disadvantages of k-means clustering algorithm that it is sensitive to the initial centers selection and easily falls into local optimal solution, differential evolution algorithm whose global optimization ability is strong was introduced into clustering in this paper. This paper put forward an improved differential evolution algorithm and made it combined with k-means clustering algorithm at the same time. This method has solved initial centers optimization problem of k-means clustering algorithm well. The experiments showed that the method has effectively improved clustering quality and convergence speed.

Related Dissertations

  1. Research on Scheduling of Whole-set Orders in JSP Based on Differential Evolution Algorithm,F273
  2. Research on K-means Optimization Clustering Algorithm,TP311.13
  3. Research on Fuzzy C-Mean Clustering Algorithm Based on Particle Swarm Optimization and Shuffled Frog Leaping Algorithm,TP18
  4. Evolutionary Clustering Algorithm and Its Application,TP311.13
  5. Web Usage Mining and the Research of Personalized Recommendation,TP311.13
  6. The Modified Harmony Search Algorithm with Control Parameters Co-evolution and Its Application,TP391.3
  7. Library management system of personalized service Design and Implementation,TP311.52
  8. Model-based rapid test method equipment,TJ06
  9. Subway construction project risk evaluation methods and criteria for research,U231.3
  10. Intelligent mobile robot map description and navigation methods,TP242.6
  11. Research and Implementation based the WebService execution management system,TP311.52
  12. Research on an Improved Clustering Algorithm of k_means,TP311.13
  13. Markov random field DS evidence theory of the human brain image segmentation,TP391.41
  14. Multi-Agent Differential Evolution Algorithm and Its Applications in Optimization of Fermentation,TP18
  15. Research and Development of Customer-Oriented Quick Quote System for Low-Voltage Products,F426.63
  16. Research on Dynamic Decoupling for Multi-Axis Sensor,TP212
  17. Study of Spatial and Temporal Dimensions Dynamical System Modeling Based on Multi-polymerization Process Neural Networks,TP391.9
  18. Research of Parametric Method on Electrical Impedance Endotomography,R318.0
  19. Study on Attribute Reduction Method Based on Evolutionary Algorithm,TP18
  20. Application in Campus Network of Intrusion on Detection System Based on Data Mining,TP393.08

CLC: > Industrial Technology > Automation technology,computer technology > Computing technology,computer technology > Computer software > Program design,software engineering > Programming > Database theory and systems
© 2012 www.DissertationTopic.Net  Mobile