Dissertation > Excellent graduate degree dissertation topics show

Crawler and Incremental Update Strategy Research in Deep Web

Author: GuoMei
Tutor: LiHui
School: Beijing University of Chemical Technology
Course: Applied Computer Technology
Keywords: Deep Web Vertical search Text Categorization Information extraction Incremental update
CLC: TP391.3
Type: Master's thesis
Year: 2010
Downloads: 188
Quote: 1
Read: Download Dissertation


The 21st century based on a network -based , high-tech as the core of the knowledge-based economy , the network is increasingly important to our lives , more and more people start your research online , and now the user is becoming increasingly dependent on search engine , the search results \Deep Web compared with ordinary web pages , more informative , more specific theme , the data structured information better , higher quality of the information , and can effectively search the Deep Web resources , to provide users with more valuable information . Deep Web search needs to break through the limitations of traditional search technology , automatic identification searchable database from the network , submit a search request and return the query results were analyzed by the search interface , remove the required data processing and then returned in some form to the user. The spirit of the search for deeper , more professional information purposes , the text discusses the depth gateway key research to achieve the depth of the net vertical search system based on cloud computing . Internet mass disorder information structured to provide users with the monograph , specific , in-depth information retrieval services , the use of a simple model , text vector characteristics classification page accurately classified . The experimental results show that the method has the efficiency of web crawling and page classification accuracy . In addition, according to the different categories of URLs with different update algorithm to achieve incremental updates of the data . The experimental data show that this incremental updating algorithm is feasible, and dynamically updated as the Web , the system will be automatically updated each time automatically adjust the update frequency , the updated range automatically adjust .

Related Dissertations

  1. Research on Domain Entity Attribute and Event Extraction Technology,TP391.1
  2. Research on Temporal Information Recognition and Normalization,TP391.1
  3. Study on Growth Monitoring Technique Based on Pixel Un-Mixing Method and HJ Remote Sensing Images in Paddy Rice,S511
  4. Land Desertification in Qinghai Lake Landscape Pattern Change,X171
  5. Active faults based radar image information extraction method applied research and demonstration,P542.3
  6. Based on high-resolution remote sensing data mining houses information extraction,TP751
  7. Web Page Attribute Extraction Method Research,TP391.1
  8. The Research for Named Entity Recognition and Relation Extraction in Text,TP391.1
  9. The key component vertical search engine technology research,TP391.3
  10. Reptiles theme for Education News Design and Implementation,TP391.3
  11. GPU-based image search Chinese Research on key technologies of the retrieval,TP391.1
  12. Home Academic Information Extraction System,TP393.092
  13. Engineering News reported information extraction and applied research,G212
  14. Topic search engine key technology research,TP391.3
  15. Hull section robotic welding path planning and offline programming,TP242
  16. Based on semi- structured text transporter protein substrate information extraction system,Q811.4
  17. Dynamic learning framework based on structured automatic web data extraction method,TP393.092
  18. Web-oriented Chinese automatic summarization research generated,TP391.1
  19. Printers based on natural language HCI Research and implementation,TP11
  20. Multi-language support program comprehension understanding and information extraction technology research,TP311.52
  21. Network public opinion analysis to key technology research and,TP393.09

CLC: > Industrial Technology > Automation technology,computer technology > Computing technology,computer technology > Computer applications > Information processing (information processing) > Retrieval machine
© 2012 www.DissertationTopic.Net  Mobile