Dissertation > Excellent graduate degree dissertation topics show

Research on Drawing Characteristic Bunch Technology for Copying Plagiarism in Program Code

Author: HouMin
Tutor: LiuDongSheng
School: Inner Mongolia Normal
Course: Applied Computer Technology
Keywords: Code copy detection Similarity Eigenvalues Tree structure string
CLC: TP311.11
Type: Master's thesis
Year: 2009
Downloads: 118
Quote: 2
Read: Download Dissertation

Abstract


Copy detection technology in the information age , a very wide range of applications, particularly in computer programming . Copy Detection into two categories : one is the formal language text (such as: computer program code , etc. ) copy detection , and the other is a natural language text copy detection . Copy the code detection is to determine whether a program code plagiarism or copy one or more programs , the core code similarity . Similarity calculation program , you first need to extract the eigenvalues ??of the block , that is able to represent the basic language units of the content and structure of the program . And then compares the extracted program eigenvalue to be compared , the similarity is calculated according to the degree of similarity between the result of the comparison judgment procedure , i.e. . In this process , the feature value extraction is essential, characteristic values ??directly affects the accuracy of the comparison result . In this paper, the characteristic value extraction technology research . This paper first introduces the program code similarity detection technology , including the code similarity definition and measurement technology classification, the status quo of domestic and foreign research and development as well as existing program code similarity discrimination system introduced . Then the program code similarity detection process feature value extraction techniques are introduced . In the existing program code similarity discrimination system , using methods based on string comparison , comparison procedures to be sub-word conversion feature string , then the string similarity comparison . This string contains the program structure information , it will affect the result of the comparison accuracy . This paper studies how the program is converted to string structure contains more structural information , to provide a better basis for comparison for the next step . The design of the study the formation and structure of string is done in three steps : the first step in the development of language lexical rules and grammar rules , provide the basis for follow-up analysis of the source code conversion ; generate lexer and analyzer ; the third step, the lexical analyzer and parser source code analysis, to generate the corresponding tree structure string . Finally, by way of example test , this study achieved a string of the program source code into a tree structure , to achieve the desired purpose .

Related Dissertations

  1. Syntactic Features Based Pronoun Resolution,TP391.1
  2. Research of Multiple Emails Automatic Summarization,TP391.1
  3. Research of IRC Botnet Detection Based on Behavior,TP393.08
  4. Research on Auto-Evaluation Method of Programming Based on Similarity,TP312.1
  5. Comprehensive Evaluation of Flue-cured Tobacco Quality in Pingdingshan and Comparative Analysis with American Tobacco,S572
  6. The Impact of Tourism on Typical Vegetation in Luya Mountain Nature Reserve, Shanxi Province,S759.9
  7. Ontology -based Semantic Web service matching and composition method,TP393.09
  8. WordNet and the \,G254
  9. Research of Text Clustering on Food Complaint Documents Based on Ontology,TP391.1
  10. Yuan Zhen and Bai Juyi’s Similar Research,I207.22
  11. Research on Relationship Extraction Based on Semantic Pattern Matching in Web Environment,TP391.1
  12. 16th Men of the World Basketball Championships China Basketball Man’s Team Point Guard Attack or Defense Ability to Analyze,G841
  13. Sentence Similarity Computing Research and Application of Intelligent Question Answering System,TP391.1
  14. Design and Implementation of the Character Classification System Used in Search Engine,TP391.3
  15. Research on Image Super-resolution Reconstruction Based on Non-local Similarity,TP391.41
  16. Research on Streaming Media Detection Methods Against DoS\DDoS Attack Based on Analysis of Self-similarity,TP393.08
  17. Study on Data Mining in Water Dispatching Decision Support System,TV697.11
  18. Synthesis and Design of Microwave Filters,TN713
  19. Finding Web Services Based on Clustering Probabilistic Semantic Approach,TP393.09
  20. The Research of Image Spam Detecting Based on Similarity Assessment,TP393.098
  21. CBR-based discrete simulation model reusability study,TP301.6

CLC: > Industrial Technology > Automation technology,computer technology > Computing technology,computer technology > Computer software > Program design,software engineering > Programming > Programming method
© 2012 www.DissertationTopic.Net  Mobile