Hybrid reward-driven reinforcement learning for efficient quantum circuit synthesis

dc.contributor.authorGiordano, Sara
dc.contributor.authorSen, Kornikar
dc.contributor.authorMartín-Delgado Alcántara, Miguel Ángel
dc.date.accessioned2026-03-02T18:05:38Z
dc.date.available2026-03-02T18:05:38Z
dc.date.issued2026-02-03
dc.description© The Author(s) 2026. Next Generation EU PRTR-C17. W911NF-14-1-0103.
dc.description.abstractA reinforcement learning (RL) framework is introduced for the efficient synthesis of quantum circuits that generate specified target quantum states from a fixed initial state, addressing a central challenge in both the Noisy Intermediate-Scale Quantum (NISQ) era and future fault-tolerant quantum computing. The approach utilizes tabular Q-learning, based on action sequences, within a discretized quantum state space, to effectively manage the exponential growth of the space dimension. The framework introduces a hybrid reward mechanism, combining a static, domain-informed reward that guides the agent toward the target state with customizable dynamic penalties that discourage inefficient circuit structures such as gate congestion and redundant state revisits. This is a circuit-aware reward, in contrast to the current trend of works on this topic, which are primarily fidelity-based. By leveraging sparse matrix representations and state-space discretization, the method enables practical navigation of high-dimensional environments while minimizing computational overhead. Benchmarking on graph-state preparation tasks for up to seven qubits, we demonstrate that the algorithm consistently discovers minimal-depth circuits with optimized gate counts. Moreover, extending the framework to a universal gate set still yields low depth circuits, highlighting the algorithm’s robustness and adaptability. The results confirm that this RL-driven approach, with our completely circuit-aware method, efficiently explores the complex quantum state space and synthesizes near-optimal quantum circuits, providing a resource-efficient foundation for quantum circuit optimization.
dc.description.departmentDepto. de Física Teórica
dc.description.facultyFac. de Ciencias Físicas
dc.description.refereedTRUE
dc.description.sponsorshipMinisterio de Ciencia e Innovación (España)
dc.description.sponsorshipAgencia Estatal de Investigación
dc.description.sponsorshipEuropean Comission
dc.description.sponsorshipComunidad de Madrid
dc.description.sponsorshipMinisterio de Transformación Digital y de la Función Pública (España)
dc.description.sponsorshipU.S. Army Research Office
dc.description.statuspub
dc.identifier.citationGiordano, Sara, et al. «Hybrid Reward-Driven Reinforcement Learning for Efficient Quantum Circuit Synthesis». Quantum Machine Intelligence, vol. 8, n.o 1, junio de 2026, p. 9. DOI.org (Crossref), https://doi.org/10.1007/s42484-026-00359-8.
dc.identifier.doi10.1007/s42484-026-00359-8
dc.identifier.essn2524-4914
dc.identifier.issn2524-4906
dc.identifier.officialurlhttps://dx.doi.org/10.1007/s42484-026-00359-8
dc.identifier.relatedurlhttps://link-springer-com.bucm.idm.oclc.org/article/10.1007/s42484-026-00359-8
dc.identifier.urihttps://hdl.handle.net/20.500.14352/133698
dc.issue.number1
dc.journal.titleQuantum Machine Intelligence
dc.language.isoeng
dc.page.final9-19
dc.page.initial9-1
dc.publisherSpringer
dc.relation.projectIDinfo:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2021-122547NB-I00/ES/TECNOLOGIAS CLAVE PARA COMPUTACION CUANTICA/
dc.relation.projectIDTEC-2024/COM-84 QUITEMAD-CM
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internationalen
dc.rights.accessRightsopen access
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subject.cdu004.27
dc.subject.cdu004.85
dc.subject.cdu530.145
dc.subject.keywordCircuit depth
dc.subject.keywordCircuit optimization
dc.subject.keywordQuantum circuits
dc.subject.keywordReinforcement learning
dc.subject.ucmInformática (Informática)
dc.subject.ucmTeoría de los quanta
dc.subject.ucmInteligencia artificial (Informática)
dc.subject.unesco1203.04 Inteligencia Artificial
dc.subject.unesco2212.12 Teoría Cuántica de Campos
dc.subject.unesco1203.02 Lenguajes Algorítmicos
dc.titleHybrid reward-driven reinforcement learning for efficient quantum circuit synthesis
dc.typejournal article
dc.type.hasVersionVoR
dc.volume.number8
dspace.entity.typePublication
relation.isAuthorOfPublication1cfed495-7729-410a-b898-8196add14ef6
relation.isAuthorOfPublication.latestForDiscovery1cfed495-7729-410a-b898-8196add14ef6

Download

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Quantum Mach. Intell. 8, 9 (2026).pdf
Size:
4.75 MB
Format:
Adobe Portable Document Format

Collections