Dissertation > Excellent graduate degree dissertation topics show

The Study and Design of High Availability Monitoring Subsystem for Fault Tolerant Computing Systems

Author: LuoLuMing
Tutor: YangXiaoZong
School: Harbin Institute of Technology
Course: Computer Science and Technology
Keywords: High availability Monitor scheme TMR
CLC: TP311.52
Type: Master's thesis
Year: 2008
Downloads: 112
Quote: 0
Read: Download Dissertation

Abstract


Fault-Tolerant computing systems are very important in the field of information technology. On one hand, the systems have strong ability to deal with key tasks. On the other hand, they have high availability, and can provide high-speed and reliable of information processing services. The information losing and destroying or the exceptional shutting down of Fault-Tolerant computing systems would exert a great influence on those key tasks, so the ability of continuously operating is put forward for these systems, the ability is high availability.This paper is based on blade server systems. The design of high availability monitoring subsystem is presented. The monitoring subsystem can choose any two blades from blade server systems as the Leader layer of high availability. The monitoring subsystem use TMR technology and it make Leader layer become the core of the high availability system.Whether or not the arbitration process succeeds is a main bottleneck influencing the availability of Fault-Tolerant computing systems. When both two leader blades are good, the network services they provide are almost the same as single module system. Only when one sever crushes down, and the arbitration and reconfiguration succeed, the advantage is manifested. If any failure happens during the arbitration process, Leader layer system has nothing advantages compare with single module system.During the analysis of the whole process of arbitration process, a Marcov model is proposed to study the influence of some parameters on the availability of the whole system. Integrated active-standby systems and dual active systems, we can conclude that fault detection and fault diagnose are critical to system availability.This paper presents some research and design as follows: some normal arbitration techniques studied. The conflict between normal techniques and practical requirements is analyzed. A high-availability arbitration scheme is proposed to provide hardware support for blades server systems. The hardware designs for high availability monitoring subsystem of fault tolerant computing systems are presented. Some concrete works are implemented, including TMR, CPLD, USB switching, HotSwap, etc.

Related Dissertations

  1. Mass storage system availability Service Management Design and Implementation,TP333
  2. Feed Mixer Auger Parameter Optimization Study,S817.124
  3. The Load Balance Design and Application of Monternet Business,TN929.5
  4. Onboard High-speed Data Processing Technology,V446.9
  5. Small ranch TMR effects and economic benefits of technology applications,S823
  6. Research and Development the Key Issues of High Availability Database Middleware,TP311.13
  7. Research and Implementation of the Novel Heartbeat Inspecting Technique,TP274.4
  8. Design and Implementation based CBDF filtering high-availability anti-spam system,TP393.098
  9. Analysis and Design of Mswitch Network Management System,TN915.07
  10. The Design and Implementation of the High Availability Technology on KYLIN Operating System,TP316
  11. Research of Distributed Database Resource’s High Availability,TP311.138
  12. The Research and Implementation of Fault-tolerant Metadata Cluster Management for PB-scale Storage System,TP333
  13. The Research on a CompactPCI Platform’s High Availability Application Based on VxWorks Real-Time System,TP311.52
  14. Research and Implementation on Disaster-recovery Oriented Failure Detection Algorithm,TP309
  15. Research and Implementation of High Availability Technology for Distributed Services,TP311.52
  16. The Design and Implementation of the CA Server’s Hot-standby System in the CAPF of Hainan Province,TP393.05
  17. A Design and Implementation of Media Gateway High Availability,TN915.05
  18. Sichuan Unicom remote disaster recovery system planning and construction,TP309.3
  19. Research on Adaptive Routing Algorithm of Hybrid FSO/RF Network,TN929.1
  20. Optimizing on Roughage Formulation of Fermented TMR,S816

CLC: > Industrial Technology > Automation technology,computer technology > Computing technology,computer technology > Computer software > Program design,software engineering > Software Engineering > Software Development
© 2012 www.DissertationTopic.Net  Mobile