Research on Micro-blogging Privacy Detection Based on Bayesian

Author: JiangZhiShuang
Tutor: LiuJie
School: Harbin Engineering University
Course: Applied Computer Technology
Keywords: micro-blog, privacy detection, word segmentation, Naïve Bayesian classifier
CLC: TP393.092
Type: Master's thesis
Year: 2013

Abstract


In recent years, micro-blogging has become increasingly popular around the world thanks to its integration, openness, ease of use, rapid spread, and wide coverage, while the accompanying problem of privacy disclosure on micro-blogs has drawn growing concern. Research on micro-blogging privacy detection is still at an early stage and is attracting more and more attention.

After surveying the current state of research worldwide and the related technologies, this thesis proposes a micro-blogging privacy detection system that detects micro-blogs involving privacy disclosure. The system mainly consists of modules for pre-processing, Chinese word segmentation with result optimization, stop-word removal, and a double-level Naïve Bayesian classifier.

First, because the traditional RMM+TSD segmentation method performs too many invalid term lookups and cannot handle segmentation ambiguity or new word recognition, an I-RMM+I-SD segmentation method is proposed to solve these problems. The method improves segmentation speed without adding much dictionary storage overhead, and it handles the common two-word overlapping ambiguity and the new word recognition problem, thereby improving both the efficiency and the accuracy of segmentation.

Second, a double-level Naïve Bayesian classifier is proposed to classify micro-blogs after segmentation. With this approach, both the micro-blog classification and the privacy classification are obtained from a single labeling step that marks the micro-blog and its privacy category.

By combining I-RMM+I-SD segmentation with the double-level Bayesian classifier, the micro-blogging privacy detection system achieves good detection results and meets the efficiency and accuracy requirements of micro-blogging privacy detection. Finally, the proposed algorithms are verified experimentally; a comparative analysis of the results shows the superiority of the algorithms, and directions for further improvement are discussed.
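The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the thesis's implementation: the segmenter below is plain reverse maximum matching (assuming RMM has its usual meaning), without the I-RMM+I-SD improvements, and the two-level classifier is one possible reading of "double level", in which level one decides whether a post involves privacy and level two assigns a privacy category, both trained from a single label per post. All names (`rmm_segment`, `NaiveBayes`, `TwoLevelNB`), the toy dictionary, and the category labels are hypothetical.

```python
import math
from collections import Counter, defaultdict


def rmm_segment(text, dictionary, max_len=4):
    """Baseline reverse maximum matching: scan right to left, always
    taking the longest dictionary word that ends at the current position."""
    tokens = []
    end = len(text)
    while end > 0:
        for size in range(min(max_len, end), 0, -1):
            piece = text[end - size:end]
            # Fall back to a single character when nothing longer matches.
            if size == 1 or piece in dictionary:
                tokens.append(piece)
                end -= size
                break
    tokens.reverse()
    return tokens


class NaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, docs, labels):
        self.doc_counts = Counter(labels)
        self.word_counts = defaultdict(Counter)
        for tokens, label in zip(docs, labels):
            self.word_counts[label].update(tokens)
        self.vocab = {w for c in self.word_counts.values() for w in c}
        self.total_docs = len(labels)
        return self

    def predict(self, tokens):
        best_label, best_score = None, -math.inf
        for label, n_docs in self.doc_counts.items():
            counts = self.word_counts[label]
            denom = sum(counts.values()) + len(self.vocab)
            score = math.log(n_docs / self.total_docs)  # log prior
            for tok in tokens:
                score += math.log((counts[tok] + 1) / denom)  # log likelihood
            if score > best_score:
                best_label, best_score = label, score
        return best_label


class TwoLevelNB:
    """Assumed two-level layout: privacy yes/no, then privacy category."""

    def fit(self, docs, categories):
        # One label per post: a category name, or None for "no privacy".
        flags = [c is not None for c in categories]
        self.level1 = NaiveBayes().fit(docs, flags)
        private = [(d, c) for d, c in zip(docs, categories) if c is not None]
        self.level2 = NaiveBayes().fit([d for d, _ in private],
                                       [c for _, c in private])
        return self

    def predict(self, tokens):
        if not self.level1.predict(tokens):
            return None
        return self.level2.predict(tokens)


# Toy dictionary and labeled posts, invented for illustration only.
WORDS = {"手机", "号码", "电话", "住在", "地址", "今天", "天气", "电影", "好看"}
POSTS = [("我的手机号码是", "contact"), ("给我打电话", "contact"),
         ("我家住在中山路", "location"), ("我的地址是", "location"),
         ("今天天气很好", None), ("这部电影很好看", None)]

model = TwoLevelNB().fit([rmm_segment(t, WORDS) for t, _ in POSTS],
                         [c for _, c in POSTS])
print(rmm_segment("我的手机号码是", WORDS))
print(model.predict(rmm_segment("我的电话号码", WORDS)))
```

Because each post carries one label, the level-one yes/no targets are derived from it rather than annotated separately, which is one way to read the abstract's "only one step of marking". A real system would also apply stop-word removal before classification, which this sketch omits.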

