Managing Contention

"Contention for shared resources significantly impedes the efficient operation of multicore processors" (Fedorova, 2009). The authors of "Managing Contention for Shared Resources on Multicore Processors" (Fedorova, 2009) found that contention for the shared cache, the prefetching hardware, and the memory interconnects was responsible for performance degradation. After applying a "pain" model, built from each application's sensitivity to contention and its memory intensity, to a set of test applications, the authors discovered that applications with high miss rates must be kept apart and not co-scheduled on the same memory domain.
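The pain model can be sketched in a few lines. The text describes it only as combining sensitivity and intensity; multiplying the two is one plausible formulation and an assumption here, as are all the numeric values:

```python
def pain(sensitivity, intensity):
    """Hypothetical 'pain' estimate for co-scheduling application A with B:
    A's sensitivity to contention scaled by B's memory intensity (e.g., its
    LLC miss rate). The multiplicative form is an assumption, not a quote
    from the paper."""
    return sensitivity * intensity

# Two candidate schedules for a sensitive app A and an intensive app B.
# Co-scheduling them yields more combined pain than pairing each with a
# mild neighbor, which is why high-miss-rate apps are kept apart.
pain_together = pain(0.8, 20.0) + pain(0.5, 15.0)  # A with B, B with A
pain_apart = pain(0.8, 2.0) + pain(0.5, 1.0)       # each with a mild app
```

A scheduler built on this idea would evaluate candidate placements and pick the one with the lowest total pain.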
Therefore, managing how applications are placed by the scheduler can mitigate the performance degradation that contention for cache lines inflicts on applications. The authors built a prototype scheduler, called Distributed Intensity Online (DIO), that separates memory-intensive applications (those with high last-level cache (LLC) miss rates) after measuring each application's miss rate online. Across eight test workloads, DIO improved workload performance by 11% (Fedorova, 2009), with some applications showing 60-80% improvement relative to the worst-case schedules.
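The core of the distributed-intensity idea can be illustrated with a short sketch: rank applications by measured LLC miss rate, then deal them out across memory domains so the most intensive applications land apart. This is an illustrative reconstruction, not the authors' implementation; the application names and miss rates are made up:

```python
def distribute_by_intensity(miss_rates, num_domains):
    """Assign each application to a memory domain, keeping the most
    memory-intensive applications on different domains.

    miss_rates: dict mapping app name -> measured LLC miss rate.
    Returns a dict mapping app name -> domain index.
    """
    # Rank applications from most to least memory-intensive.
    ranked = sorted(miss_rates, key=miss_rates.get, reverse=True)
    # Round-robin over domains keeps the heavy hitters apart.
    return {app: i % num_domains for i, app in enumerate(ranked)}

# Illustrative miss rates (misses per 1000 instructions, invented values).
apps = {"mcf": 25.0, "milc": 18.5, "gcc": 2.1, "povray": 0.3}
placement = distribute_by_intensity(apps, num_domains=2)
# The two highest-miss-rate applications end up on different domains.
```

The online part of DIO would re-measure miss rates periodically and re-run a placement like this, since application behavior changes over time.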
Applications that made heavy use of the prefetching hardware showed the most improvement under DIO. DIO also shows potential for ensuring quality of service (QoS) for critical applications by guaranteeing that the worst scheduling assignments are never used. Another scheduler, Power DI (Distributed Intensity), was used to test power consumption. Power DI clusters incoming applications on as few machines as possible, while avoiding machines that already host memory-intensive applications.
The concept is the same on a single machine, except that it clusters the applications onto as few memory domains as possible. Its effectiveness varied with the number of memory-intensive applications: the more memory-intensive applications there were, the more domains, or machines, ended up being used, and so the more power was consumed. Power DI was able to adapt to the properties of the workload to minimize the power used.
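The Power DI placement described above can be sketched as a greedy packing: fill as few domains as possible, but never co-locate two memory-intensive applications. This is a reconstruction under stated assumptions (the capacity, threshold, and application names are all invented), not the paper's algorithm:

```python
def power_di_place(miss_rates, capacity, threshold=10.0):
    """Greedy sketch: pack applications onto as few domains as possible,
    but never co-locate two memory-intensive applications (miss rate at
    or above `threshold`) on the same domain. All values illustrative."""
    domains = []  # each domain: {"apps": [...], "has_intensive": bool}
    # Place the most intensive applications first so they claim domains.
    for app, rate in sorted(miss_rates.items(), key=lambda kv: -kv[1]):
        intensive = rate >= threshold
        for d in domains:
            # Reuse an existing domain only if it has room and adding this
            # app would not put two intensive apps together.
            if len(d["apps"]) < capacity and not (intensive and d["has_intensive"]):
                d["apps"].append(app)
                d["has_intensive"] = d["has_intensive"] or intensive
                break
        else:
            domains.append({"apps": [app], "has_intensive": intensive})
    return domains

# Illustrative workload: two intensive apps, two mild ones, 2 apps/domain.
domains = power_di_place({"mcf": 25.0, "milc": 18.5, "gcc": 2.1, "povray": 0.3},
                         capacity=2)
```

With this workload the two intensive applications each anchor a domain and the mild applications fill the remaining slots, so only two domains stay powered.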
The authors found that the dominant cause of performance degradation is contention for shared resources: the front-side bus, the prefetching hardware, and the memory controller. Applications that issue many cache misses occupy the memory controller and the front-side bus, which hurts both the other applications using that hardware and the offending applications themselves. Cache contention stems from two or more threads running on the same memory domain. The cache consists of lines allocated to hold thread data as the threads issue memory requests.
Threads share the last-level cache (LLC), so when a thread requests a line that is not in the cache (a cache miss), a new line is allocated. When the cache is full, existing data must be evicted to free space for the new data, and this eviction process hurts performance. According to Zhao (2011), the amount of data sharing is primarily a function of the cache line size and application behavior. Invalidations and misses occur when cores compete to access the same data, or different items that happen to fall in the same cache line.
When these events occur frequently, they cause sharing-induced slowdown; false sharing in particular produces high miss rates and therefore performance degradation. According to Arteaga (n.d.), the nature of resource sharing causes contention. The distribution of resources across cores and threads determines core and thread performance. Scheduling across multiple machines can cause cores and threads to compete for the same resources, and applications with similar behavior may compete for the same resources while leaving other resources relatively idle. As a result, workload performance degrades while distributed resources go underutilized.
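The eviction effect described above can be made concrete with a toy LRU model of a shared LLC. Two co-scheduled "threads" whose combined working set exceeds the cache thrash each other: every access becomes a miss, even though each thread's working set would fit on its own. The cache size and access patterns are invented for illustration:

```python
from collections import OrderedDict

class SharedLLC:
    """Toy LRU model of a shared last-level cache (illustrative only)."""
    def __init__(self, num_lines):
        self.lines = OrderedDict()  # insertion/recency order = LRU order
        self.capacity = num_lines
        self.misses = 0

    def access(self, line_addr):
        if line_addr in self.lines:
            self.lines.move_to_end(line_addr)   # hit: refresh LRU position
        else:
            self.misses += 1                    # miss: allocate a new line
            if len(self.lines) >= self.capacity:
                self.lines.popitem(last=False)  # evict least recently used
            self.lines[line_addr] = True

cache = SharedLLC(num_lines=4)
# Two co-scheduled "threads", each cycling over a 3-line working set:
# together they need 6 lines but share only 4, so they evict each other
# and every one of the 60 accesses misses.
for _ in range(10):
    for addr in ("A0", "A1", "A2"):  # thread A's working set
        cache.access(addr)
    for addr in ("B0", "B1", "B2"):  # thread B's working set
        cache.access(addr)
```

Running either loop alone against the same cache would incur only three cold misses, which is exactly the asymmetry that motivates keeping high-miss-rate threads on separate domains.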
To manage contention and optimize performance, accurate modeling of the impact of inter-process cache contention on performance and power consumption is required (Xu, 2010). This calls for a cache-contention-aware assignment algorithm. Applications with high miss rates must be kept apart and not co-scheduled on the same memory domain: a high miss rate predicts performance degradation, while a lower miss rate predicts better performance. Applications that aggressively use the prefetching hardware also have high LLC miss rates, so the miss rate is a good indicator of heavy prefetcher use.
The authors recommend that applications making heavy use of the prefetching hardware be kept to a minimum per domain and co-scheduled with lower-miss-rate applications. The cache, front-side bus, and memory controller are equally important in how the scheduling algorithms optimize performance, and the LLC miss rate is the metric that ties these decisions together.
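The pairing recommendation can be sketched as a simple heuristic: sort applications by miss rate and pair the most intensive with the least intensive. This is an illustrative reconstruction, with invented names and values, not the authors' code:

```python
def pair_high_with_low(miss_rates):
    """Sketch of the co-scheduling heuristic above: pair the highest
    miss-rate application with the lowest, the second highest with the
    second lowest, and so on. Purely illustrative."""
    ranked = sorted(miss_rates, key=miss_rates.get)  # ascending miss rate
    pairs = []
    while len(ranked) >= 2:
        pairs.append((ranked.pop(), ranked.pop(0)))  # (highest, lowest)
    return pairs

pairs = pair_high_with_low({"mcf": 25.0, "milc": 18.5,
                            "gcc": 2.1, "povray": 0.3})
# → [("mcf", "povray"), ("milc", "gcc")]
```

Each pair would then be assigned its own memory domain, so no domain hosts two high-miss-rate applications.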