Enabling performance portability of data-parallel OpenMP applications on asymmetric multicore processors

dc.conference.date17-20 Aug 2020
dc.conference.placeEdmonton (Canada)
dc.conference.titleInternational Conference on Parallel Processing (ICPP)
dc.contributor.authorSáez Alcaide, Juan Carlos
dc.contributor.authorCastro Rodríguez, Fernando
dc.contributor.authorPrieto Matías, Manuel
dc.date.accessioned2024-01-25T15:46:57Z
dc.date.available2024-01-25T15:46:57Z
dc.date.issued2020
dc.description.abstractAsymmetric multicore processors (AMPs) couple high-performance big cores and low-power small cores with the same instruction-set architecture but different features, such as clock frequency or microarchitecture. Previous work has shown that asymmetric designs may deliver higher energy efficiency than symmetric multicores for diverse workloads. Despite their benefits, AMPs pose significant challenges to runtime systems of parallel programming models. While previous work has mainly explored how to efficiently execute task-based parallel applications on AMPs, via enhancements in the runtime system, improving the performance of unmodified data-parallel applications on these architectures is still a big challenge. In this work we analyze the particular case of loop-based OpenMP applications, which are widely used today in scientific and engineering domains, and constitute the dominant application type in many parallel benchmark suites used for performance evaluation on multicore systems. We observed that conventional loop-scheduling OpenMP approaches are unable to efficiently cope with the load imbalance that naturally stems from the different performance delivered by big and small cores. To address this shortcoming, we propose Asymmetric Iteration Distribution (AID), a set of novel loop-scheduling methods for AMPs that distribute iterations unevenly across worker threads to efficiently deal with performance asymmetry. We implemented AID in libgomp –the GNU OpenMP runtime system–, and evaluated it on two different asymmetric multicore platforms. Our analysis reveals that the AID methods constitute effective replacements of the static and dynamic methods on AMPs, and are capable of improving performance over these conventional strategies by up to 56% and 16.8%, respectively.
dc.description.departmentDepto. de Arquitectura de Computadores y Automática
dc.description.facultyFac. de Informática
dc.description.facultyFac. de Ciencias Físicas
dc.description.refereedTRUE
dc.description.sponsorshipMinisterio de Economía y Competitividad (España)
dc.description.sponsorshipComunidad de Madrid
dc.description.sponsorshipEuropean Commission
dc.description.statuspub
dc.identifier.citationJ.C. Saez, F. Castro, and M. Prieto-Matias. 2020. Enabling performance portability of data-parallel OpenMP applications on asymmetric multicore processors. In Proceedings of the 49th International Conference on Parallel Processing (ICPP '20). Association for Computing Machinery, New York, NY, USA, Article 51, 1–11. https://doi.org/10.1145/3404397.3404441
dc.identifier.doi10.1145/3404397.3404441
dc.identifier.isbn978-1-4503-8816-0
dc.identifier.officialurlhttps://dl.acm.org/doi/10.1145/3404397.3404441
dc.identifier.urihttps://hdl.handle.net/20.500.14352/95528
dc.language.isoeng
dc.page.final51:11
dc.page.initial51:1
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/RTI2018-093684-B-I00/ES/HETEROGENEIDAD Y ESPECIALIZACION EN LA ERA POST-MOORE/
dc.relation.projectIDS2018/TCS-4423
dc.rights.accessRightsrestricted access
dc.subject.keywordOpenMP
dc.subject.keywordAsymmetric multicore processors
dc.subject.keywordLoop scheduling
dc.subject.keywordRuntime system
dc.subject.keywordbig.LITTLE
dc.subject.ucmSoftware
dc.subject.unesco1203.17 Informática
dc.titleEnabling performance portability of data-parallel OpenMP applications on asymmetric multicore processors
dc.typeconference paper
dc.type.hasVersionP
dspace.entity.typePublication
relation.isAuthorOfPublication5a9b186f-f010-47e3-b598-040d7f5dc5ba
relation.isAuthorOfPublication9aac3e41-2993-45aa-b0e1-7bae1dacd982
relation.isAuthorOfPublication5d3f6717-1495-4217-853c-8c9c75d56620
relation.isAuthorOfPublication.latestForDiscovery5a9b186f-f010-47e3-b598-040d7f5dc5ba

Download

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Enabling_performance_portability.pdf
Size:
16.95 MB
Format:
Adobe Portable Document Format

Collections