RT Journal Article T1 COMPAD: A heterogeneous cache-scratchpad CPU architecture with data layout compaction for embedded loop-dominated applications A1 Marinelli, Tommaso A1 Gómez Pérez, José Ignacio A1 Tenllado Van Der Reijden, Christian Tomás A1 Catthoor, Francky AB The growing trend of pervasive computing has consolidated the everlasting need for power efficient devices. The conventional cache subsystem of general-purpose CPUs, while being able to adapt to many use cases, suffers from energy inefficiencies in some scenarios. It is well-known by now in the academic literature that the utilization of a scratchpad memory (SPM) can help reducing the overall energy consumption of embedded systems. This work proposes a hybrid cache-SPM architecture with support logic for semi-transparent data management and spatial locality improvement. Selected data are transferred and stored in the SPM in a compact form using dynamic layout transformation. As a second major contribution, we introduce a methodology to identify memory access sequences that make an inefficient use of the cache, marking them as candidates to be moved to an SPM of constrained space. The methodology does not require access to the source code of the target applications, relying on binary instrumentation and offline profiling. The resulting mapping policies have been tested on a simulated system, showing a mean memory dynamic energy reduction of 43% and a mean speed gain of 13% with a representative benchmark set. PB Elsevier SN 1383-7621 YR 2023 FD 2023-10-29 LK https://hdl.handle.net/20.500.14352/88544 UL https://hdl.handle.net/20.500.14352/88544 LA eng NO Ministerio de Ciencia e Innovación NO Agencia Estatal de Investigación NO Comunidad de Madrid NO Fondo Europeo de Desarrollo Regional DS Docta Complutense RD 1 sept 2024