Identification of hyperauthorship in scientific publications: an approach based on outlier detection methods

dc.contributor.authorSánchez Jiménez, Rodrigo
dc.contributor.authorRodríguez González, Juan Tinguaro
dc.contributor.authorGuerrero-Bote, Vicente P.
dc.contributor.authorDe Moya-Anegón, Félix
dc.date.accessioned2026-04-10T11:29:39Z
dc.date.available2026-04-10T11:29:39Z
dc.date.issued2026-04-07
dc.description.abstractThe phenomenon of hyperauthorship—characterized by excessive numbers of co-authors in scientific publications—has emerged as a significant concern in contemporary bibliometric analysis. While the growth of collaborative research is well-documented, systematic methods to identify what constitutes "excessive" authorship remain underdeveloped. This study presents a comprehensive approach to hyperauthorship identification based on outlier detection methods, analyzing over 52 million publications from Scopus (2003–2024) across 310 categories. A preliminary analysis of this data reveals substantial temporal and disciplinary variations in authorship patterns, with single authorship declining from one-third of publications in 2003 to significantly lower levels by 2024, while multi-authored works (>10 authors) increased dramatically. Then, 14 different outlier detection methods were implemented, including classical robust measures, skewness-corrected approaches, sequential methods, clustering techniques, and a parametric model based on the discrete power-law distribution (POW). This last method demonstrated superior performance, producing suitable thresholds in 100% of cases without indications of potential over- or under-identification of outliers. The systematic application of POW reveals hyperauthorship thresholds ranging from 3 authors in early 2000s Social Sciences to over 40 in recent Medicine and Physics publications, with an overall increase in hyperauthorship rates from 0.81% to 1.01% globally between 2003–2024. Medicine showed the most dramatic evolution, nearly doubling its hyperauthorship rate to 1.60% by 2024, while areas like Energy maintained consistently low rates (0.53%). These findings provide evidence-based reference points for editorial policies and research evaluation, demonstrating that hyperauthorship assessment requires discipline-specific and temporally-adjusted approaches rather than universal thresholds.
dc.description.departmentDepto. de Biblioteconomía y Documentación
dc.description.facultyFac. de Ciencias de la Documentación
dc.description.refereedTRUE
dc.description.statuspub
dc.identifier.citationRodrigo Sánchez-Jiménez, J․Tinguaro Rodríguez, Vicente P. Guerrero-Bote, Félix de Moya-Anegón, Identification of hyperauthorship in scientific publications: An approach based on outlier detection methods, Journal of Informetrics, Volume 20, Issue 2, 2026, 101803. https://doi.org/10.1016/j.joi.2026.101803
dc.identifier.doi10.1016/j.joi.2026.101803
dc.identifier.officialurlhttps://doi.org/10.1016/j.joi.2026.101803
dc.identifier.relatedurlhttps://www.sciencedirect.com/science/article/pii/S1751157726000362
dc.identifier.urihttps://hdl.handle.net/20.500.14352/134615
dc.issue.number2
dc.journal.titleJournal of Informetrics
dc.language.isoeng
dc.page.final15
dc.page.initial1
dc.publisherElsevier
dc.rightsAttribution 4.0 Internationalen
dc.rights.accessRightsopen access
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subject.keywordHyperauthorshipBibliometricsOutlier detectionCo-authorshipScientific collaborationResearch evaluationScopusPower-law distributionMulti-authorshipBibliometric indicators
dc.subject.ucmBibliometría
dc.subject.unesco5701.06 Documentación
dc.titleIdentification of hyperauthorship in scientific publications: an approach based on outlier detection methods
dc.typejournal article
dc.type.hasVersionVoR
dc.volume.number20
dspace.entity.typePublication
relation.isAuthorOfPublication92066822-7c77-4f71-8c47-ae3682d275c8
relation.isAuthorOfPublicationddad170a-793c-4bdc-b983-98d313c81b03
relation.isAuthorOfPublication.latestForDiscovery92066822-7c77-4f71-8c47-ae3682d275c8

Download

Original bundle

Now showing 1 - 3 of 3
Loading...
Thumbnail Image
Name:
1-s2.0-S1751157726000362-main.pdf
Size:
3.12 MB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
1-s2.0-S1751157726000362-mmc1_Appendix_A.docx
Size:
25.79 KB
Format:
Microsoft Word XML
Loading...
Thumbnail Image
Name:
1-s2.0-S1751157726000362-mmc2_Appendix_B.docx
Size:
62.66 KB
Format:
Microsoft Word XML

Collections