Dissertation > Excellent graduate degree dissertation topics show

Optimizations of Memory Subsystem for Chip Multiprocessor Systems

Author: LiJianHua
Tutor: XuZuoLong; XueChun
School: University of Science and Technology of China
Course: Computer Software and Theory
Keywords: Chip Multiprocess On-Chip Networks Cache Coherence STT-RAM Multicast Routing Cache Memory Network Partition Hybrid Cache
CLC: TP332
Type: PhD thesis
Year: 2013
Downloads: 35
Quote: 0
Read: Download Dissertation


Modern chip multiprocessors (CMPs) employ large cache memories to reduce the performance gap between processors and off-chip memory. This thesis states that the particular characteristics of CMP system can be exploited to improve energy and performance in the memory hierarchy. The research presented in this thesis investigates several mechanisms to optimize the performance of CMP memory system. Specifically, we target three problems as our research topic:1) design efficient multicasting algorithm to improve the performance of on-chip network,2) exploit emerging non-volatile memories to design low power cache memory for CMP systems,3) exploit thread progress information to design high performance cache coherence protocols.For the first research topic, we propose an efficient multicast routing mech-anism for on-chip network. For CMP system with increasing core count, on-chip network provides an efficient and scalable interconnection paradigm, wherein one-to-many (multicast) communication is universal for such platforms. Without ef-ficient multicasting support, traditional unicasting on-chip networks will be low efficiency in tackling such multicast communication. In this thesis, we propose dual partitioning multicasting (DPM) which significantly reduces packet laten-cy and on-chip network power dissipation. Specifically, DPM scheme adaptively makes routing decision based on the network load-balance level as well as the link sharing patterns characterized by the distribution of the multicasting destinations.For our second research topic, we propose to exploit emerging non-volatile memory, such as spin-torque transfer RAM (STT-RAM), to design low power cache memories. STT-RAM has fast read access, high storage density and negli-gible leakage power. However, the wide adoption of STT-RAM as cache memories is impeded by its long write latency and high write power. The write performance of STT-RAM can be improved through relaxing the retention time of its cell, magnetic tunnel junction (MTJ). The resultant volatile STT-RAM needs to be periodically refreshed to prevent data loss. When applied as the large last-level cache in CMP systems, the frequent refresh operations could dissipate significant extra energy. In addition, the refreshes could severely conflict with the normal read/write operations to degrade the overall system performance. In this thesis, we propose cache coherence enabled adaptive refresh (CCear) to minimize the number of refresh operations for volatile STT-RAM. CCear can effectively mini-mize the number of refresh operations on volatile STT-RAM through interacting with cache coherence protocols and cache management policy.Finally, we propose an efficient coherence adaption mechanism to improve the performance of cache coherence protocol in CMP systems. One primary ob-jective of CMP system is to boost application execution by exploiting thread-level parallelism. In such systems, threads typically exhibit unbalanced progress stem-ming from unequal cache misses or task assignment. Load imbalance is one of the biggest roadblocks for parallel application performance. Because of the inherent synchronization primitives, such as barriers and locks, cores running fast thread have to waste pervious cycles waiting for slow cores. In this thesis, we propose thread progress aware coherence adaption (TEACA) which utilizes the thread progress information as the hints to adapt hybrid coherence protocols. Specifical-1y, TEACA fuses the memory system statistics to estimate the progress of threads. Based on the estimated thread progress information, TEACA dynamically catego-rizes threads into leader threads and laggard threads. The thread categorization decisions are then leveraged for efficient coherence adaption in hybrid coherence protocols.

Related Dissertations

  1. The Design and Application of Data Acquisition System Based on Industrial Wireless Network,TP274.2
  2. Research on Multicast QoS Routing Algorithms Based on GA in Ad Hoc Networks,TN929.5
  3. Research on Mine Ventilation Simulation and Optimization Transformation of Wangzhuang Mine,TD724
  4. DTN model based on multi- node multicast routing algorithm,TN929.5
  5. Algorithms of Network Partitions,O157.5
  6. Design and implementation of PIM-SM offset Algorithm,TP393.04
  7. The Improvements and Applications of the Glowworm Swarm Algortihm,TP301.6
  8. QoS Multicast Routing Based on Nastural Computation,TP393.09
  9. Based on improved genetic algorithm for QoS multicast routing algorithm,TP393.09
  10. Research on QoS Multicast Routingalgorithm of Network Sensory System,TP393.02
  11. Design and Implementation of Wireless Network Partition Management System of Guangzhou,TP311.52
  12. Multicast Routing Algorithm Based on Wireless Sensor Network Applied in Intelligent Monitoring System,TP212.9;TP277
  13. Research on QoS Based Multicast Routing Algorithms,TP393.09
  14. Research on Algorithms in QoS Multicast Routing,TP393.09
  15. QoS Constrained Multicast Routing Algorithms,TP393.02
  16. Research on QoS-Based Multicast Routing Algorithm,TP393.02
  17. Research of Multi-Constrained QoS Multicast Algrotihm Based on Hybrid Cenetic Algorithm,TP393.02
  18. Research on Multicast Routing Protocols with QoS Guaranteed in Ad Hoc Networks,TN929.5
  19. Research on Some Technologies of Multiple Constrained QoS Multicast Routing Based on Theory of Complex Networks,O157.5
  20. The Application of Genetic Algotithm in QoS Multicast Routing,TP393.02
  21. Ant Colony Optimization and Its Research on QoS Multicast Routing Problem,TP301.6

CLC: > Industrial Technology > Automation technology,computer technology > Computing technology,computer technology > Electronic digital computer (not a continuous role in computer ) > Arithmetic unit and the controller (CPU)
© 2012 www.DissertationTopic.Net  Mobile