Aviso: para depositar documentos, por favor, inicia sesión e identifícate con tu cuenta de correo institucional de la UCM con el botón MI CUENTA UCM. No emplees la opción AUTENTICACIÓN CON CONTRASEÑA
 

Experiences with nested parallelism in task-parallel applications using malleable BLAS on multicore processors

dc.contributor.authorCastelló, Adrián
dc.contributor.authorQuintana-Ortí, Enrique S.
dc.contributor.authorRodríguez Sánchez, Rafael
dc.contributor.authorCatalán Pallarés, Sandra
dc.contributor.authorIgual Peña, Francisco Daniel
dc.date.accessioned2025-01-21T12:13:49Z
dc.date.available2025-01-21T12:13:49Z
dc.date.issued2023-03-10
dc.description.abstractMalleability is defined as the ability to vary the degree of parallelism at runtime, and is regarded as a means to improve core occupation on state-of-the-art multicore processors tshat contain tens of computational cores per socket. This property is especially interesting for applications consisting of irregular workloads and/or divergent executions paths. The integration of malleability in high-performance instances of the Basic Linear Algebra Subprograms (BLAS) is currently nonexistent, and, in consequence, applications relying on these computational kernels cannot benefit from this capability. In response to this scenario, in this paper we demonstrate that significant performance benefits can be gathered via the exploitation of malleability in a framework designed to implement portable and high-performance BLAS-like operations. For this purpose, we integrate malleability within the BLIS library, and provide an experimental evaluation of the result on three different practical use cases.
dc.description.departmentDepto. de Arquitectura de Computadores y Automática
dc.description.facultyFac. de Informática
dc.description.refereedTRUE
dc.description.statuspub
dc.identifier.citationRodríguez-Sánchez R, Castelló A, Catalán S, Igual FD, Quintana-Ortí ES. Experiences with nested parallelism in task-parallel applications using malleable BLAS on multicore processors. The International Journal of High Performance Computing Applications. 2024;38(2):55-68. doi:10.1177/10943420231157653
dc.identifier.doi10.1177/10943420231157653
dc.identifier.officialurlhttps://journals.sagepub.com/doi/10.1177/10943420231157653
dc.identifier.urihttps://hdl.handle.net/20.500.14352/115344
dc.issue.number2
dc.journal.titleThe International Journal of High Performance Computing Applications
dc.language.isoeng
dc.page.final68
dc.page.initial55
dc.publisherSAGE
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internationalen
dc.rights.accessRightsopen access
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subject.keywordMalleability
dc.subject.keywordBasic Linear Algebra Subprograms
dc.subject.keywordHigh Performance
dc.subject.keywordMulticore Processors
dc.subject.ucmInformática (Informática)
dc.subject.unesco1203.23 Lenguajes de Programación
dc.titleExperiences with nested parallelism in task-parallel applications using malleable BLAS on multicore processors
dc.typejournal article
dc.type.hasVersionAM
dc.volume.number28
dspace.entity.typePublication
relation.isAuthorOfPublication02e9ebb2-af1f-451a-a819-47cb4e4ce515
relation.isAuthorOfPublication9c042df5-5a71-4088-a155-194f339a226e
relation.isAuthorOfPublicatione1ed9960-37d5-4817-8e5c-4e0e392b4d66
relation.isAuthorOfPublication.latestForDiscovery02e9ebb2-af1f-451a-a819-47cb4e4ce515

Download

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2022_IJHCA_special_issue_HPCMALL (1).pdf
Size:
690.97 KB
Format:
Adobe Portable Document Format

Collections