¿Tienen GPT-3.5 y GPT-4 un estilo de escritura diferente del estilo humano? : un estudio exploratorio para el español

Alonso Simón, Lara; Fernández-Pampillón Cesteros, Ana María; Fernández Trinidad, Marianela; Márquez Cruz, Manuel

doi:10.58859/rael.v23i1.666

¿Tienen GPT-3.5 y GPT-4 un estilo de escritura diferente del estilo humano? : un estudio exploratorio para el español

dc.contributor.author	Alonso Simón, Lara
dc.contributor.author	Fernández-Pampillón Cesteros, Ana María
dc.contributor.author	Fernández Trinidad, Marianela
dc.contributor.author	Márquez Cruz, Manuel
dc.date.accessioned	2026-01-15T19:38:26Z
dc.date.available	2026-01-15T19:38:26Z
dc.date.issued	2024
dc.description	Esta publicación es parte del proyecto de I+D+i Proyecto ROBOT-TALK PID2022-140897OB-I00 financiado por MCIN/AEI/10.13039/501100011033/ y FEDER/UE.
dc.description.abstract	RESUMEN: La cuestión que se aborda en este trabajo de investigación es la comprobación, mediante técnicas estadísticas, de que los modelos generativos de lenguaje GPT-3.5 (versión gratuita) y GPT-4 (versión de pago) de ChatGPT tienen un estilo de escritura distinto al de los humanos, y que pueden diferenciarse, al menos, por tres tipos de rasgos: léxicos, signos de puntuación y estructura sintáctica de las oraciones. Determinar si los grandes modelos de lenguaje tienen un estilo propio es relevante de cara a poder detectar la autoría automática de los textos. En trabajos anteriores se construyó un corpus comparable de textos humanos y automáticos en español y, mediante un estudio cualitativo, se localizó un conjunto de rasgos lingüísticos y estilísticos propios de cada autor. En este trabajo se ha podido comprobar cuantitativamente que 17 variables lingüísticas presentan diferencias estadísticamente significativas entre autores humanos y los modelos GPT-3.5 y GPT-4.
dc.description.abstract	ABSTRACT: The aim of this research is to verify, using statistical methods, that the generative language models GPT-3.5 (free version) and GPT-4 (paid version) of ChatGPT have their own writing style distinct from that of humans and that they can be distinguished by at least three types of features: lexical features, punctuation marks and syntactic sentence structure. Determining whether large language models have their own style is relevant in order to detect automatic authorship of texts. In previous work, a comparable corpus of human and automatic texts in Spanish was constructed and, through a qualitative study, a set of linguistic and stylistic features specific to each author was identified. In this work, it has been quantitatively demonstrated that 17 identified linguistic variables show statistically significant differences between human authors and the GPT-3.5 and GPT-4 models.
dc.description.department	Depto. de Lingüística, Estudios Hebreos, Vascos y de Asia Oriental
dc.description.faculty	Fac. de Filología
dc.description.refereed	TRUE
dc.description.sponsorship	Ministerio de Ciencia e Innovación (España)
dc.description.status	pub
dc.identifier.citation	Alonso Simón, L., Fernández-Pampillón Cesteros, A.M., Fernández Trinidad, M. y Márquez Cruz, M. (2024). «¿Tienen GPT-3.5 y GPT-4 un estilo de escritura diferente del estilo humano? : un estudio exploratorio para el español». RAEL: Revista Electrónica de Lingüística Aplicada, 23, 34-54. https://doi.org/10.58859/rael.v23i1.666
dc.identifier.doi	10.58859/rael.v23i1.666
dc.identifier.essn	1885-9089
dc.identifier.officialurl	https://doi.org/10.58859/rael.v23i1.666
dc.identifier.relatedurl	https://rael.aesla.org.es/index.php/RAEL/article/view/666
dc.identifier.relatedurl	https://matrix.aesla.org.es/RAEL/
dc.identifier.relatedurl	https://www.aesla.org.es/es
dc.identifier.uri	https://hdl.handle.net/20.500.14352/130386
dc.issue.number	1
dc.journal.title	RAEL : Revista Electrónica de Lingüística Aplicada
dc.language.iso	spa
dc.page.final	54
dc.page.initial	34
dc.publisher	Asociación Española de Lingüística Aplicada (AESLA)
dc.relation.projectID	info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2022-140897OB-I00/ES/RECONOCIMIENTO DEL ORIGEN ROBOTICO DE TEXTOS. AUTOMATIZACION DE TAREAS Y CONOCIMIENTO LINGUISTICO/
dc.rights	Attribution-NonCommercial 4.0 International	en
dc.rights.accessRights	open access
dc.rights.uri	http://creativecommons.org/licenses/by-nc/4.0/
dc.subject.cdu	81'322
dc.subject.cdu	004.8
dc.subject.keyword	Estilo de escritura
dc.subject.keyword	Grandes modelos de lenguaje
dc.subject.keyword	GPT-3.5
dc.subject.keyword	GPT-4
dc.subject.keyword	Lingüística de corpus
dc.subject.keyword	Writing style
dc.subject.keyword	Large language models
dc.subject.keyword	GPT-3.5
dc.subject.keyword	GPT-4
dc.subject.keyword	Corpus linguistics
dc.subject.ucm	Lingüística
dc.subject.ucm	Inteligencia artificial (Informática)
dc.subject.unesco	5701.04 Lingüística Informatizada
dc.subject.unesco	1203.04 Inteligencia Artificial
dc.title	¿Tienen GPT-3.5 y GPT-4 un estilo de escritura diferente del estilo humano? : un estudio exploratorio para el español
dc.title	Do GPT-3.5 and GPT-4 Have a Writing Style Different from Human Style? : An Exploratory Study for Spanish
dc.type	journal article
dc.type.hasVersion	VoR
dc.volume.number	23
dspace.entity.type	Publication
relation.isAuthorOfPublication	8896ce00-4613-4c0c-8800-da6676dee16a
relation.isAuthorOfPublication	bf0de562-de49-4049-9676-5e4b43614797
relation.isAuthorOfPublication	e1d1eda1-387b-40d9-a301-ad5b4d64f9cc
relation.isAuthorOfPublication	79c41c92-d178-4444-b1f7-051f72141f64
relation.isAuthorOfPublication.latestForDiscovery	8896ce00-4613-4c0c-8800-da6676dee16a

Download

Original bundle

Now showing 1 - 1 of 1

Name:: Docta_anaibanezmoreno,+rael-23_666-DEF.pdf
Size:: 791.25 KB
Format:: Adobe Portable Document Format

Download

Collections

Artículos