Dissertation > Excellent graduate degree dissertation topics show

The key component vertical search engine technology research

Author: SuXiaoHui
Tutor: XuLiPing
School: Huazhong University of Science and Technology
Course: Computer Software and Theory
Keywords: Component Description Vertical search Theme crawling Information Extraction Improved indexing
CLC: TP391.3
Type: Master's thesis
Year: 2011
Downloads: 57
Quote: 0
Read: Download Dissertation

Abstract


Component-based software development methods (Component-Based Software Development) are considered to solve the \However, the size of a single component library can not meet the needs of software developers , many heterogeneous interoperability between component library inaccessible , making the \With the rapid development of Internet , Internet there were many available commercial components and open source component component library , in addition to the Internet is also littered with large component library collection could not be a member . The emergence of vertical search engines on the Internet and development of components for the realization of search resources provide a solution ideas and technical assurance . Vertical search engine for a theme, for specific populations , to ensure the right information in a particular area fully included and timely updates, data collected through the depth of the excavation analysis, can provide users with personalized search service. The key component vertical search engine technology, including crawling algorithms , components and structural information indicates that extraction, indexing and retrieval component . Crawling algorithm based on Shark Search algorithm based on the expanded , combined with Nutch of OPIC algorithm approach , the design of a subject-oriented crawling algorithm L-Shark Search, experiments show that L-Shark Search algorithm than the original Shark Search crawling algorithm has better effect . Under the existing component library website can provide information about the design of a component description component description model iUCDL, presents a structured template-based information extraction method for accurate extraction of the component information . For information on the index component in maintaining Lucene full-text search without changing the style by adding XML structure information to support member faceted search and matching model combined with engraved face member , a second sort the search results , improve member information search quality , and finally through the experiment to verify the component vertical search key technology research.

Related Dissertations

  1. Research on Domain Entity Attribute and Event Extraction Technology,TP391.1
  2. Research on Temporal Information Recognition and Normalization,TP391.1
  3. Study on Growth Monitoring Technique Based on Pixel Un-Mixing Method and HJ Remote Sensing Images in Paddy Rice,S511
  4. Land Desertification in Qinghai Lake Landscape Pattern Change,X171
  5. Active faults based radar image information extraction method applied research and demonstration,P542.3
  6. Based on high-resolution remote sensing data mining houses information extraction,TP751
  7. Web Page Attribute Extraction Method Research,TP391.1
  8. The Research for Named Entity Recognition and Relation Extraction in Text,TP391.1
  9. Home Academic Information Extraction System,TP393.092
  10. Engineering News reported information extraction and applied research,G212
  11. Topic search engine key technology research,TP391.3
  12. Hull section robotic welding path planning and offline programming,TP242
  13. Based on semi- structured text transporter protein substrate information extraction system,Q811.4
  14. Dynamic learning framework based on structured automatic web data extraction method,TP393.092
  15. Web-oriented Chinese automatic summarization research generated,TP391.1
  16. Printers based on natural language HCI Research and implementation,TP11
  17. Multi-language support program comprehension understanding and information extraction technology research,TP311.52
  18. Template independent web information extraction,TP393.092
  19. Internet-facing access to diverse information technology research,TP393.09
  20. Study on Extraction of Coniferous Forest Information in Southern China,TP79
  21. Study on Information Extraction and the Dynamic Monitoring of Grassland Coverage in Three River Source Area,S812

CLC: > Industrial Technology > Automation technology,computer technology > Computing technology,computer technology > Computer applications > Information processing (information processing) > Retrieval machine
© 2012 www.DissertationTopic.Net  Mobile