Automatic Generators for a Family of Matrix Multiplication Routines with Apache TVM

dc.contributor.author: Alaejos, Guillermo
dc.contributor.author: Castelló, Adrián
dc.contributor.author: Alonso-Jordá, Pedro
dc.contributor.author: Martínez, Héctor
dc.contributor.author: Quintana-Ortí, Enrique S.
dc.contributor.author: Igual Peña, Francisco Daniel
dc.date.accessioned: 2025-01-21T12:26:18Z
dc.date.available: 2025-01-21T12:26:18Z
dc.date.issued: 2024-03-16
dc.description.abstract: We explore the use of the open-source Apache TVM framework to automatically generate a family of algorithms that follow the approach taken by popular linear algebra libraries, such as GotoBLAS2, BLIS, and OpenBLAS, to obtain high-performance blocked formulations of the general matrix multiplication (gemm). In addition, we fully automate the generation process by also using Apache TVM to derive a full range of processor-specific micro-kernels for gemm. This contrasts with the convention in high-performance libraries, which hand-code a single micro-kernel per architecture in assembly. Overall, the combination of our TVM-generated blocked algorithms and micro-kernels for gemm (1) improves portability and maintainability and streamlines the software life cycle; (2) makes it easy to tailor and optimize the solution for different data types, processor architectures, and matrix operand shapes, yielding performance on a par with (or, for specific matrix shapes, even superior to) that of hand-tuned libraries; and (3) features a small memory footprint.
dc.description.department: Depto. de Arquitectura de Computadores y Automática
dc.description.faculty: Fac. de Informática
dc.description.refereed: TRUE
dc.description.status: pub
dc.identifier.citation: Guillermo Alaejos, Adrián Castelló, Pedro Alonso-Jordá, Francisco D. Igual, Héctor Martínez, and Enrique S. Quintana-Ortí. 2024. Algorithm 1039: Automatic Generators for a Family of Matrix Multiplication Routines with Apache TVM. ACM Trans. Math. Softw. 50, 1, Article 6 (March 2024), 34 pages. https://doi.org/10.1145/3638532
dc.identifier.doi: 10.1145/3638532
dc.identifier.officialurl: https://dl.acm.org/doi/10.1145/3638532
dc.identifier.relatedurl: https://arxiv.org/abs/2310.20347
dc.identifier.uri: https://hdl.handle.net/20.500.14352/115351
dc.issue.number: 1
dc.journal.title: ACM Transactions on Mathematical Software
dc.language.iso: eng
dc.page.final: 34
dc.page.initial: 1
dc.publisher: Association for Computing Machinery
dc.rights: Attribution-NonCommercial-NoDerivatives 4.0 International
dc.rights.accessRights: open access
dc.rights.uri: http://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subject.ucm: Software
dc.subject.unesco: 1203 Computer Science (Ciencia de los Ordenadores)
dc.title: Automatic Generators for a Family of Matrix Multiplication Routines with Apache TVM
dc.type: journal article
dc.type.hasVersion: AM
dc.volume.number: 50
dspace.entity.type: Publication
relation.isAuthorOfPublication: e1ed9960-37d5-4817-8e5c-4e0e392b4d66
relation.isAuthorOfPublication.latestForDiscovery: e1ed9960-37d5-4817-8e5c-4e0e392b4d66

Download

Original bundle

Name: 2310.20347v1.pdf
Size: 1.03 MB
Format: Adobe Portable Document Format
