Dissecting On-node Memory Access Performance: A Semantic ApproachShow others and affiliations
2014 (English)In: SC14: International Conference for High Performance Computing, Networking, Storage and Analysis, Piscataway, NJ, USA: IEEE Press, 2014, p. 166-176Conference paper, Published paper (Refereed)
Abstract [en]
Optimizing memory access is critical for performance and power efficiency. CPU manufacturers have developed sampling-based performance measurement units (PMUs) that report precise costs of memory accesses at specific addresses. However, this data is too low-level to be meaningfully interpreted and contains an excessive amount of irrelevant or uninteresting information.
We have developed a method to gather fine-grained memory access performance data for specific data objects and regions of code with low overhead and attribute semantic information to the sampled memory accesses. This information provides the context necessary to more effectively interpret the data. We have developed a tool that performs this sampling and attribution and used the tool to discover and diagnose performance problems in real-world applications. Our techniques provide useful insight into the memory behavior of applications and allow programmers to understand the performance ramifications of key design decisions: domain decomposition, multi-threading, and data motion within distributed memory systems.
Place, publisher, year, edition, pages
Piscataway, NJ, USA: IEEE Press, 2014. p. 166-176
National Category
Computer Sciences
Research subject
Computer and Information Sciences Computer Science, Computer Science
Identifiers
URN: urn:nbn:se:lnu:diva-42270DOI: 10.1109/SC.2014.19OAI: oai:DiVA.org:lnu-42270DiVA, id: diva2:804998
Conference
SC14: International Conference for High Performance Computing, Networking, Storage and Analysis, 16-21 Nov. 2014, New Orleans, LA
2015-04-142015-04-142018-01-11Bibliographically approved