Informed prefetching for indirect memory accesses
Date of Original Version
Indirect memory accesses have irregular access patterns that limit the performance of conventional software and hardware-based prefetchers. To address this problem, we propose the Array Tracking Prefetcher (ATP), which tracks array-based indirect memory accesses using a novel combination of software and hardware. ATP is first configured by special metadata instructions, which are inserted by programmer or compiler to pass data structure traversal knowledge. It then calculates and issues prefetches based on this information. ATP also employs a novel mechanism for dynamically adjusting prefetching distance to reduce early or late prefetches. ATP yields average speedup of 2.17 as compared to a single-core without prefetching. By contrast, the speedup for conventional software and hardware-based prefetching is 1.84 and 1.32, respectively. For four cores, the average speedup for ATP is 1.85, while the corresponding speedups for software and hardwarebased prefetching are 1.60 and 1.25, respectively.
ACM Transactions on Architecture and Code Optimization
Cavus, Mustafa, Resit Sendag, and Joshua J. Yi. "Informed prefetching for indirect memory accesses." ACM Transactions on Architecture and Code Optimization 17, 1 (2020). doi:10.1145/3374216.