Dissertation > Excellent graduate degree dissertation topics show

Data Stream Clustering Algorithm Based on Density and Fractal Dimension

Author: JinJianYe
Tutor: NiZhiWei
School: Hefei University of Technology
Course: Management Science and Engineering
Keywords: Data Stream cluster fractal dimension grid
CLC: TP311.13
Type: Master's thesis
Year: 2012
Downloads: 18
Quote: 0
Read: Download Dissertation


In recent years, with the rapid development and the wide application ofinformation technology, various applications generate a large number of streamingdata. Such data is a kind of continuous, ordered, changing fast and massive data.Clustering is an important data mining method. However, the traditional clusteringalgorithms cannot be applied to data stream directly. The scholar has done a lot ofresearch work on data stream clustering; however, there are many problems need tobe researched and resolved.Fractal Geometry is developing fast in recent years, it also has been widelyused in some areas, such as geography, transportation, meteorology, and so on.Fractal data mining uses the fractal characteristic to mining the data set, fractalcharacteristic refers to the similarity of structure or feature between the part andwhole. Fractal dimension is an important indicator of the fractal characteristic ofdata set, it can describe the data set effectively. It indicates some characteristics ofdata set have changed when the fractal dimension changed, such as trend,distribution, and so on.In the thesis, some classical algorithms for clustering data stream and FractalTheory have been systematically studied and comprehensively summarized, at thesame time, considering deficiencies of some popular data stream clusteringalgorithms. On the basic of previous research´╝îa data stream clustering algorithmbased on density and fractal dimension is presented. It consists of two phases ofonline and offline processing, combined with the advantages of density clusteringand fractal clustering, thus the deficiency of the traditional clustering algorithm isovercome. In the algorithm, a density decaying strategy to reflect the timelines ofdata stream is adopted. Experiments show that the algorithm improves theefficiency and accuracy of data stream clustering, and can find arbitrary shapes andnon-neighboring clusters.

Related Dissertations

  1. Design and Study on Movable Packing Machine in Acrylic Fiber Production Lines,TH248
  2. Study on the Heat Transfer Characteristics of Particle Cluster in Circulating Fluidized Bed,TK124
  3. Grid-Side Converter Control and Wind Turbine Emulator in Direct Drive Wind Power System,TM46
  4. BioLab a Bioinformatics Oriented Grid Portal,TP399-C8
  5. Multiple Pairwise Keys Management Protocol of Function Node-Based for Wireless Sensor Networks,TP212.9
  6. Design and Realize of Family Cleaning Robot Path-Coverage System,TP242
  7. Development of EST-SSR Primers and Application in Analysis of Genetic Realtionships in Tree Peony,S685.11
  8. Pre-hypertension syndrome characteristics,R259
  9. Comprehensive Quality Assessment of College Students,G645.5
  10. Weaving in the children's clothing industry cluster development,F426.86
  11. Studieson Effects of Soybean Species on Yuba and Initial Establishment of Quality Evalution System for Yuba,TS214.2
  12. Micro- grid with distributed power control strategy research,TM61
  13. The Grid-Connected Wind-solar Hybrid Generation System and Maximum Power Point Tracking,TM61
  14. ISSR Analysis of Genetic Diversity on 21 Lotus(Nelumbo Nucifera) Cultivars,S682.32
  15. Research of Scheduling Algorithm Based on Hybrid Adaptive Genetic Algorithm in Computing Grid,TP393.09
  16. Characterization of an Atrazine-Degrading Strain, Cloning of Key Degrading Related Genes and Construction of a Gene Cluster,X172
  17. Research on the Soil Environmental Function Zoning,X321
  18. Research on Intelligent Aerial Bomb for Forest Fire Fighting,S762
  19. Comparison of Gene Expression Data Cluster Methods and Gene Network Construction for Phytophthora Sojae Genes,S435.651
  20. Study on Heterosis and Genetic Basis of Soybean,S565.1
  21. Evaluation on Forage Quality and Biomass Energy Characters of Inbred Vegetative Lines of Napier Grass,S543.9

CLC: > Industrial Technology > Automation technology,computer technology > Computing technology,computer technology > Computer software > Program design,software engineering > Programming > Database theory and systems
© 2012 www.DissertationTopic.Net  Mobile