MERE: Hardware-Software Co-Design for Masking Cache Miss Latency in Embedded Processors

You, Dean and Jiang, Jieyu and Wang, Xiaoxuan and Du, Yushu and Tan, Zhihang and Xu, Wenbo and Wang, Hui and Guan, Jiapeng and Wei, Ran and Zhao, Shuai and Jiang, Zhe (2025) MERE: Hardware-Software Co-Design for Masking Cache Miss Latency in Embedded Processors. ACM Transactions on Embedded Computing, 24 (5s). pp. 1-26. ISSN 1539-9087

Full text not available from this repository.

Abstract

Runahead execution is a technique for masking memory latency caused by irregular memory accesses. By pre-executing application code while long-latency operations are outstanding and prefetching the data of anticipated cache misses into the cache hierarchy, runahead effectively masks memory latency for subsequent cache misses and achieves high prefetching accuracy. However, the technique has so far been limited to superscalar out-of-order and superscalar in-order cores. For scalar in-order cores, the challenges of tight area and energy constraints and severe cache contention remain. Here, we build MERE, the first full-stack system featuring runahead, spanning the SoC and a dedicated ISA through to the OS and programming model. Through this deployment, we show that runahead can be enabled in scalar in-order cores with minimal area and power overheads while still achieving high performance. By reconstructing sequential runahead through a hardware/software co-design approach, the system can be implemented on a mature processor and SoC. Building on this, we propose an adaptive runahead mechanism to mitigate the severe cache contention in scalar in-order cores. Together, these contributions provide a comprehensive solution for embedded processors handling irregular workloads. Our evaluation demonstrates that MERE attains 93.5% of a 2-wide out-of-order core’s performance while keeping area and power overheads below 5%, with the adaptive runahead mechanism delivering an additional 20.1% performance gain by mitigating severe cache contention.
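
The general idea of runahead described in the abstract can be illustrated with a toy cycle-count model. This is a minimal sketch and not MERE's design: the function `simulate`, its parameters, and the trace format are hypothetical, and the model assumes an unbounded cache (so no contention), unlimited outstanding prefetches, one access per cycle, and addresses that do not depend on missed data.

```python
def simulate(trace, miss_latency=100, runahead=False):
    """Return total cycles for a toy scalar in-order core.

    `trace` is a list of cache-line ids accessed in program order.
    Assumptions (illustrative only): unbounded cache, unlimited
    outstanding requests, and address-independent accesses.
    """
    ready_at = {}  # line id -> cycle at which the line is in the cache
    cycle = 0
    for i, line in enumerate(trace):
        if line not in ready_at:
            # Demand miss: the request is issued at the current cycle.
            ready_at[line] = cycle + miss_latency
            if runahead:
                # While stalled, pre-execute the following accesses,
                # one per cycle, and prefetch any that would miss.
                window = trace[i + 1 : i + miss_latency]
                for k, future in enumerate(window, start=1):
                    if future not in ready_at:
                        ready_at[future] = cycle + k + miss_latency
        # Stall until the line has arrived (no wait on a hit),
        # then spend one cycle on the access itself.
        cycle = max(cycle, ready_at[line]) + 1
    return cycle


if __name__ == "__main__":
    trace = list(range(8))  # eight distinct lines, all cold misses
    print(simulate(trace, miss_latency=10, runahead=False))
    print(simulate(trace, miss_latency=10, runahead=True))
```

With eight distinct lines and a 10-cycle miss latency, this toy model takes 88 cycles without runahead and 18 with it, because the later misses overlap with the first stall. A bounded cache would expose the contention that the adaptive mechanism in the abstract targets; that effect is deliberately left out of the sketch.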

Item Type:

Journal Article

Journal or Publication Title:

ACM Transactions on Embedded Computing

Uncontrolled Keywords:

Hardware and Architecture (ASJC 1708)

Subjects:

hardware and architecture; software

Departments:

Faculty of Science and Technology > School of Computing & Communications

ID Code:

237690

Deposited By:

ep_importer_pure

Deposited On:

29 May 2026 13:35

Refereed?:

Yes

Published?:

Published

Last Modified:

29 May 2026 21:50

URI:

https://eprints.lancs.ac.uk/id/eprint/237690