Phase transition in the computational complexity of the shortest common superstring and genome assembly
| dc.contributor.author | Fernández Pérez, Luis Antonio | |
| dc.contributor.author | Martín Mayor, Víctor | |
| dc.contributor.author | Yllanes, D. | |
| dc.date.accessioned | 2025-11-05T15:29:44Z | |
| dc.date.available | 2025-11-05T15:29:44Z | |
| dc.date.issued | 2024-01-24 | |
| dc.description | ©2024 American Physical Society | |
| dc.description.abstract | Genome assembly, the process of reconstructing a long genetic sequence by aligning and merging short fragments, or reads, is known to be NP-hard, either as a version of the shortest common superstring problem or in a Hamiltonian-cycle formulation. That is, the computing time is believed to grow exponentially with the problem size in the worst case. Despite this fact, high-throughput technologies and modern algorithms currently allow bioinformaticians to handle datasets of billions of reads. Using methods from statistical mechanics, we address this conundrum by demonstrating the existence of a phase transition in the computational complexity of the problem and showing that practical instances always fall in the “easy” phase (solvable by polynomialtime algorithms). In addition, we propose a Markov-chain Monte Carlo method that outperforms common deterministic algorithms in the hard regime. | |
| dc.description.department | Depto. de Física Teórica | |
| dc.description.faculty | Fac. de Ciencias Físicas | |
| dc.description.refereed | TRUE | |
| dc.description.sponsorship | Ministerio de Ciencia, Innovación y Universidades (España) | |
| dc.description.sponsorship | Agencia Estatal de Investigación (España) | |
| dc.description.sponsorship | European Commission | |
| dc.description.sponsorship | Chan Zuckerberg Biohub | |
| dc.description.status | pub | |
| dc.identifier.citation | Fernandez, L. A., et al. «Phase Transition in the Computational Complexity of the Shortest Common Superstring and Genome Assembly». Physical Review E, vol. 109, n.o 1, enero de 2024, p. 014133. DOI.org (Crossref), https://doi.org/10.1103/PhysRevE.109.014133 | |
| dc.identifier.doi | 10.1103/physreve.109.014133 | |
| dc.identifier.essn | 2470-0053 | |
| dc.identifier.issn | 2470-0045 | |
| dc.identifier.officialurl | https//doi.org/10.1103/physreve.109.014133 | |
| dc.identifier.relatedurl | https://journals.aps.org/pre/abstract/10.1103/PhysRevE.109.014133 | |
| dc.identifier.relatedurl | https://arxiv.org/abs/2210.09986 | |
| dc.identifier.uri | https://hdl.handle.net/20.500.14352/125779 | |
| dc.issue.number | 1 | |
| dc.journal.title | Physical Review E | |
| dc.language.iso | eng | |
| dc.page.final | 014133-8 | |
| dc.page.initial | 014133-1 | |
| dc.publisher | American Physical Society | |
| dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2022-136374NB-C21/ES/COMPLEJIDAD EN FISICA Y MAS ALLA: DE LOS VIDRIOS DE ESPIN A LAS INTERACCIONES SOCIALES/ | |
| dc.rights.accessRights | open access | |
| dc.subject.cdu | 531.1:004 | |
| dc.subject.cdu | 57:004 | |
| dc.subject.keyword | Bioinformatics | |
| dc.subject.keyword | Computational complexity | |
| dc.subject.keyword | NP-hard problems | |
| dc.subject.keyword | Phase transitions | |
| dc.subject.keyword | Sequencing analysis | |
| dc.subject.keyword | Monte Carlo methods | |
| dc.subject.ucm | Física (Física) | |
| dc.subject.ucm | Análisis numérico | |
| dc.subject.ucm | Informática (Informática) | |
| dc.subject.ucm | Biología | |
| dc.subject.unesco | 2212 Física Teórica | |
| dc.subject.unesco | 1203.17 Informática | |
| dc.subject.unesco | 2205.10 Mecánica Estadística | |
| dc.subject.unesco | 1206 Análisis Numérico | |
| dc.title | Phase transition in the computational complexity of the shortest common superstring and genome assembly | |
| dc.type | journal article | |
| dc.type.hasVersion | AM | |
| dc.volume.number | 109 | |
| dspace.entity.type | Publication | |
| relation.isAuthorOfPublication | 146096b1-5825-4230-8ad9-b2dad468673b | |
| relation.isAuthorOfPublication | 061118c0-eadf-4ee3-8897-2c9b65a6df66 | |
| relation.isAuthorOfPublication.latestForDiscovery | 146096b1-5825-4230-8ad9-b2dad468673b |
Download
Original bundle
1 - 1 of 1
Loading...
- Name:
- (2024). Phase transition in the computational complexity ....pdf
- Size:
- 824.54 KB
- Format:
- Adobe Portable Document Format
- Description:
- last revised 11 Mar 2024 (this version, v2) Postprint


