With the continuing scaling of semiconductor technologies, many core systems have become the de facto design for modern high performance computer architectures. It is expected that more and more applications with diverse requirements will run simultaneously on the many core platform. However, this will exert contention on shared resources such as the last level cache, network-on-chip bandwidth and off-chip memory bandwidth, thus affecting the performance and quality of service (QoS) significantly. In this environment, efficient resource sharing and a guarantee of a certain level of performance is highly desirable.
Researchers have proposed different techniques to support QoS, but most existing works focus on only one individual resource. Besides, most of these frameworks allocate shared resources statically at the beginning of application runtime, and do not dynamically track, manage and share shared resources across applications. Joint QoS support and coordinated management of those shared resources remains an open problem. It our work, we address this limitation by proposing architectures for joint resource management for QoS support. We then propose dynamic QoS management policies that monitor the resource usage of applications at runtime, then steals resources from the high priority applications for lower priority ones. The goal is to maintain the targeted level of performance for high priority applications while improving the performance of lower priority applications. To the best of our knowledge, this is the first work that considers all the three critical shared resources (cache, NoC and memory) simultaneously for joint QoS support and management. Our evaluation results show that our dynamic resource management policy can improve performance for lower priority applications significantly while maintaining the performance for high priority application, thus demonstrating the effectiveness of our dynamic QoS resource management policy.
ORION 2.0: A Fast and Accurate NoC Power and Area Model for Early-Stage Design Space Exploration
Polaris: A System-Level Roadmap for On-Chip Interconnection Networks