Gem5-X: A Many-core Heterogeneous Simulation Platform for Architectural Exploration and Optimization

Loading...
Thumbnail Image
Full text at PDC
Publication date

2021

Authors
Qureshi, Yasir M.
Simon, William A.
Zapater Sancho, Marina
Advisors (or tutors)
Editors
Journal Title
Journal ISSN
Volume Title
Publisher
Association for Computing Machinery
Citations
Google Scholar
Citation
Yasir Mahmood Qureshi, William Andrew Simon, Marina Zapater, Katzalin Olcoz, and David Atienza. 2021. Gem5-X: A Many-core Heterogeneous Simulation Platform for Architectural Exploration and Optimization. ACM Trans. Archit. Code Optim. 18, 4, Article 44 (July 2021), 27 pages. https://doi.org/10.1145/3461662
Abstract
The increasing adoption of smart systems in our daily life has led to the development of new applications with varying performance and energy constraints, and suitable computing architectures need to be developed for these new applications. In this article, we present gem5-X, a system-level simulation framework, based on gem-5, for architectural exploration of heterogeneous many-core systems. To demonstrate the capabilities of gem5-X, real-time video analytics is used as a case-study. It is composed of two kernels, namely, video encoding and image classification using convolutional neural networks (CNNs). First, we explore through gem5-X the benefits of latest 3D high bandwidth memory (HBM2) in different architectural configurations. Then, using a two-step exploration methodology, we develop a new optimized clustered-heterogeneous architecture with HBM2 in gem5-X for video analytics application. In this proposed clustered-heterogeneous architecture, ARMv8 in-order cluster with in-cache computing engine executes the video encoding kernel, giving 20% performance and 54% energy benefits compared to baseline ARM in-order and Out-of-Order systems, respectively. Furthermore, thanks to gem5-X, we conclude that ARM Out-of-Order clusters with HBM2 are the best choice to run visual recognition using CNNs, as they outperform DDR4-based system by up to 30% both in terms of performance and energy savings.
Research Projects
Organizational Units
Journal Issue
Description
Keywords
Collections