Dissertation > Excellent graduate degree dissertation topics show
An Algorithm on Clustering and Anomaly Detection for Multiple Data Streams
Author: JiangZuo
Tutor: YangJing
School: Harbin Engineering University
Course: Computer System Architecture
Keywords: Multiple Data Streams Clustering Discrete Wavelet Transform Anomaly Detection Local Outlier Factor
CLC: TP311.13
Type: Master's thesis
Year: 2011
Downloads: 27
Quote: 0
Read: Download Dissertation
Abstract
As a new data model,data stream plays an important role in many applications,such as network traffic management,financial montoring,traffic control as well as e-business and so on.The processing and mining technologies over multiple data streams have been widely studied.The infinite and high speed characters of multiple data streams and the requirement of fast on;ine response for these applicationgs break many assumptions in traditional databases. On the other hand, multiple data streams processing technology requires not only focus on change of one data stream, also on relevance analysis between a lot of data streams.Though the reserach of multiple data streams clustering and anomaly detection has been widly studied,but there are still many questions need to be solved.In this thsis,we study the problem of based on clustering multiple data streams anomaly detection,and realize an improved algorithm on clustering and anomaly detection for multiple data streams which gethers clustering algorithm and outlier mining algorithm.It can gain arbitrary clusters,as well as find local outlier,and detect anomaly.First,we analyses and study the related theory of data stream mining.Combined the characteristics of multiple data streams,we reviewed for multiple data streams’research direction and the existing method of anomaly detection,and existing rub and challenges of multiple data streams. On the basis of discrete wavelet transform for multiple data streams compress,we propose an improved algorithm on clustering and anomaly detection for multiple data streams.This algorithm fisrt, preprocesses multiple data streams which gain the compressed data streams,according to multiple data streams’correlation and discrete wavelet transform.lt can reduce the requirements of system’s memory storage,quickening computer’s deal time.Then we improve similarity matrix which provides completed data and improves the accuration of clustering results.Then we compute the local reach density of each data point and mark core point,and cluster data streams which can find arbitrary clustering shapes.At last,we compute local outlier factor of the nosie,output the set of local outlier factor,detect anomaly by setting the LOF value.In conclusion, experiment shows that the algorithm can cluster and find anomaly of LOF, when computing multiple data streams,and the time of clustering has less then DBSCAN.
|
Related Dissertations
- Research and Implementation of Mining Implicit User Interest,TP311.13
- Establishment and Update of Similar Users’ Cluster in Personalized Information Retrieval,TP391.3
- Research on Removal Algorithm of Shadows in Image Segmentation,TP391.41
- The Research of the Text Extraction Method Based on Spectral Cut,TP391.41
- Gao Zhong-ying academic thought and experience and use of Bufei Decoction treatment of common diseases of the respiratory system drug law,R249.2
- Research and Improvement on K-Means Clustering Algorithm,TP311.13
- Research on Peer-to-Peer Traffic Identification Algorithm Based on Cluster Analysis,TP393.02
- Research of Scheduling Algorithm Based on Hybrid Adaptive Genetic Algorithm in Computing Grid,TP393.09
- Evaluation of Photosynthetic Efficiancy of Seedlings of the Hybrid Progenies (F1) in Peach,S662.1
- The Load Research and Comprehensive Evaluation on the Agricultural Non-Point Source Pollution in Nantong,X592
- BF-FCM Clustering Algorithm and Its Application in the Image Segmentation,TP391.41
- The Application of Ant Colony Algorithm in Meteorological Satellite Cloud Pictures Segmentation,TP391.41
- Research on Clustering Algorithm Based on Mutation Particle Swarm Optimization,TP18
- Research on K-means Optimization Clustering Algorithm,TP311.13
- Research on Fuzzy C-Mean Clustering Algorithm Based on Particle Swarm Optimization and Shuffled Frog Leaping Algorithm,TP18
- Research on Clustering Algorithm Based on Genetic Algorithm and Rough Set Theory,TP18
- Study on Photosynthetic Characteristics of Peach Based on Heterosis of Assimilation Capacity,S662.1
- The Research on Routing Protocol of Agricultural Environmental Monitoring System Based on Wir Eless Sensor Networks,TN915.04
- Multilayer structure based WSN routing protocol for heterogeneous clusters,TP212.9
- Evolutionary Clustering Algorithm and Its Application,TP311.13
- Vehicle detection based on machine vision and vehicle distance measuring method,TP274
CLC: > Industrial Technology > Automation technology,computer technology > Computing technology,computer technology > Computer software > Program design,software engineering > Programming > Database theory and systems
© 2012 www.DissertationTopic.Net Mobile
|