Linguistic injustice in multilingual technologies: the TenTen Corpus Family as a case study

dc.book.titleMultilingual digital humanities
dc.contributor.authorBordonaba Plou, David
dc.contributor.authorJreis-Navarro, Laila M.
dc.contributor.editorViola, Lorella
dc.contributor.editorSpence, Paul
dc.date.accessioned2026-01-13T09:32:25Z
dc.date.available2026-01-13T09:32:25Z
dc.date.issued2023
dc.description.abstractThe aim of this work is twofold. First, to distinguish a phenomenon that produces a new type of linguistic injustice, “the paradox of Anglocentric multilingualism.” This paradox arises when a multilingual philosophy is pursued in constructing complex systems of analysis in the digital environment; however, these systems imply advantages in the study of English over other languages. The injustice derives from a poor level of precision in the output of a technology when analyzing non-English languages. Second, to contend that multilingual DH should deal with the deficiencies of tools’ performance, in addition to those of the language resources, because this disadvantage makes it difficult for any cross-linguistic study to provide reliable empirical data in dis(proving) linguistic intuitions. To illustrate some of the potential problems derived from the paradox, this work will detail the difficulties we have faced in a cross-linguistic study on color terms, when using the Arabic corpus arTenTen and the Spanish corpus esTenTen in Sketch Engine. We will study the different performances of the tool in Arabic and Spanish, compared to English, to signal the weaknesses of this tool in a multilingual arena, enabling its improvement and enriching the critical and inclusive framework of multilingual DH.
dc.description.departmentDepto. de Lógica y Filosofía Teórica
dc.description.facultyFac. de Filosofía
dc.description.refereedTRUE
dc.description.statuspub
dc.identifier.citationBordonaba-Plou, D. y Jreis-Navarro, L.M. (2023) «Linguistic Injustice in Multilingual Technologies: The TenTen Corpus Family as a Case Study», en Multilingual Digital Humanities. Taylor and Francis, pp. 129-144. Disponible en: https://doi.org/10.4324/9781003393696-12.
dc.identifier.doi10.4324/9781003393696-12
dc.identifier.isbn9781003393696
dc.identifier.isbn9781032494173
dc.identifier.isbn9781032491943
dc.identifier.officialurlhttps://doi.org/10.4324/9781003393696
dc.identifier.relatedurlhttps://www.taylorfrancis.com/chapters/edit/10.4324/9781003393696-12
dc.identifier.relatedurlhttps://produccioncientifica.ucm.es/documentos/65a45ffe528f145c72fff46a
dc.identifier.urihttps://hdl.handle.net/20.500.14352/129996
dc.language.isoeng
dc.page.final144
dc.page.initial129
dc.page.total16
dc.publication.placeLondon
dc.publisherRoutledge
dc.relation.ispartofseriesDigital research in the arts and humanities
dc.rights.accessRightsmetadata only access
dc.subject.cdu81:1
dc.subject.cdu81'322
dc.subject.cdu81'246.3
dc.subject.cdu82:001.8
dc.subject.keywordHumanidades digitales
dc.subject.keywordMultilingüismo
dc.subject.ucmFilosofía
dc.subject.ucmFilología
dc.subject.ucmLingüística
dc.subject.unesco72 Filosofía
dc.subject.unesco5505.10 Filología
dc.subject.unesco5505.10-1 Filología Árabe
dc.subject.unesco7202.07 Filosofía del Lenguaje
dc.titleLinguistic injustice in multilingual technologies: the TenTen Corpus Family as a case study
dc.typebook part
dc.type.hasVersionVoR
dspace.entity.typePublication
relation.isAuthorOfPublication5f5cd501-3e2c-47fc-8383-5a867a43724c
relation.isAuthorOfPublication.latestForDiscovery5f5cd501-3e2c-47fc-8383-5a867a43724c

Download