Central vs. distributed dynamic thermal management for multi-core processors: Which one is better?
Date of Original Version
In this paper we investigate and contrast two techniques to maximize the performance of multi-core processors under thermal constraints. The first technique is a distributed dynamic thermal management system that maximizes the total performance without exceeding given thermal constraints. In our scheme, each core adjusts its operating parameters, i.e., frequency and voltage, according to its temperature which is measured using integrated thermal sensors. We propose a novel controller that dynamically adapts the system to simultaneously avoid timing errors and thermal violations. For comparison purposes, we implement a second technique based on a runtime centralized, optimal system that uses combinatorial optimization techniques to calculate the optimal frequencies and voltages for the different cores to maximize the total throughput under thermal constraints. To empirically validate our techniques, we put together an extensive tool chain that incorporates thermal and power consumption simulators to characterize the performance of multi-core processors for a number of configurations ranging from 2 cores at 90 nm to 16 cores at 32 nm. Our results show that both investigated techniques are capable of delivering significant improvements (about 40% for 16 cores) over standard frequency and voltage planning techniques. From the results, we outline the main advantages and disadvantages of both techniques. © 2009 ACM.
Proceedings of the ACM Great Lakes Symposium on VLSI, GLSVLSI
Kadin, Michael, Sherief Reda, and Augustus Uht. "Central vs. distributed dynamic thermal management for multi-core processors: Which one is better?." Proceedings of the ACM Great Lakes Symposium on VLSI, GLSVLSI , (2009): 137-140. doi:10.1145/1531542.1531577.