
The Design and Implementation of a Distributed Storage and Retrieval System

Author: CaoZuoFen
Tutor: WangDong;LiYongHang
School: Hunan University
Course: Computer technology
Keywords: Parallel Computing; Distributed File System (DFS); Distributed Information Retrieval (DIR); Mass Data; Mapping Protocol
CLC: TP333
Type: Master's thesis
Year: 2009


In the digital era of information explosion, the storage and retrieval of information have become both a basic means and an end. "Data rich, information poor" is the most significant feature of the information age, so information retrieval technology is constantly being updated and improved. Given the surge in the volume of digital information, low storage prices, the rapid development of networks, and the need to reach useful information, a traditional file system limited to a single device can hardly meet the requirements of storage management. A distributed storage and retrieval system offers high efficiency, stability and scalability, and is the best way to achieve efficient storage and retrieval.

Distributed parallel programming models differ widely in their characteristics. We compare the classic OpenMP and MPI models with the more recently popular MapReduce programming model and find that OpenMP scales poorly and that the MPI programming model is complex. MapReduce is a distributed programming model presented by Google for large-scale mass data processing. Its advantages are good scalability, readability, and better automatic parallelism and fault tolerance.

This thesis analyzes the advantages of a distributed storage and retrieval system in efficiency, stability and scalability, and introduces MapReduce as a simplified distributed programming model. It describes how to establish a MapReduce-based distributed file storage system (DFS) and how to implement a distributed information retrieval (DIR) platform on top of this storage system to achieve full-text search.

Experimental comparison shows that, as the amount of data to be processed increases, the distributed file system is far more efficient than stand-alone processing. In addition, the key to effectively improving the efficiency of a parallel computing system is to increase its concurrency as far as the system hardware permits.
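The abstract gives no implementation details, so the following is only a minimal single-machine sketch of how full-text search might be expressed in the MapReduce style: a map step emits (term, document id) pairs, a shuffle step groups them by term, and a reduce step produces a posting list per term. The function names (`map_phase`, `shuffle`, `reduce_phase`, `build_index`, `search`), the sample documents, and the use of a thread pool to stand in for distributed workers are all assumptions made for illustration, not the thesis's design.

```python
import re
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def map_phase(doc_id, text):
    """Map: emit one (term, doc_id) pair per distinct word in a document."""
    terms = set(re.findall(r"[a-z0-9]+", text.lower()))
    return [(term, doc_id) for term in terms]


def shuffle(pairs):
    """Shuffle: group emitted values by key, as a framework does between phases."""
    grouped = defaultdict(list)
    for term, doc_id in pairs:
        grouped[term].append(doc_id)
    return grouped


def reduce_phase(term, doc_ids):
    """Reduce: collapse one term's values into a sorted posting list."""
    return term, sorted(set(doc_ids))


def build_index(docs, workers=4):
    """Run map tasks concurrently (hypothetical local stand-in for cluster nodes)."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        mapped = pool.map(lambda item: map_phase(*item), docs.items())
        pairs = [pair for chunk in mapped for pair in chunk]
    return dict(reduce_phase(t, ids) for t, ids in shuffle(pairs).items())


def search(index, query):
    """Return ids of documents that contain every query term (AND semantics)."""
    terms = re.findall(r"[a-z0-9]+", query.lower())
    if not terms:
        return []
    result = set(index.get(terms[0], []))
    for term in terms[1:]:
        result &= set(index.get(term, []))
    return sorted(result)


# Illustrative sample corpus, not data from the thesis.
docs = {
    "d1": "distributed file system stores mass data",
    "d2": "MapReduce processes mass data in parallel",
    "d3": "full-text search over a distributed index",
}
index = build_index(docs)
```

Calling `search(index, "mass data")` looks up each term's posting list and intersects them. In a real deployment the map and reduce tasks would run on separate nodes reading from the distributed file system; the thread pool here only mirrors that structure and is not meant to show a speedup.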

CLC: Industrial Technology > Automation Technology, Computer Technology > Computing Technology, Computer Technology > Electronic Digital Computers (discrete-action computers) > Memory