Aviso: para depositar documentos, por favor, inicia sesión e identifícate con tu cuenta de correo institucional de la UCM con el botón MI CUENTA UCM. No emplees la opción AUTENTICACIÓN CON CONTRASEÑA
 

Creation of a high-quality, register-diversified parallel (English- Spanish) corpus for linguistic and computational investigations

dc.contributor.authorLavid López, María Julia
dc.contributor.authorArús Hita, Jorge
dc.contributor.authorHoste, Veronique
dc.contributor.authorDeClerck, Bernard
dc.date.accessioned2024-11-07T17:52:11Z
dc.date.available2024-11-07T17:52:11Z
dc.date.issued2015
dc.descriptionThe MULTINOT project is financed by the Spanish Ministry of Economy and Competitiveness under project grant FFI2012-32201.
dc.description.abstractThis paper outlines current work on the construction of a high-quality, richly-annotated and register-diversified parallel corpus for the English-Spanish language pair, as currently carried out within the framework of the MULTINOT project. The corpus consists of original and translated texts in both directions and is designed as a multifunctional resource to be used in a number of disciplines such as corpus-based contrastive linguistic and translation studies, machine translation, computer-assisted translation, computer-assisted language learning and terminology extraction. The paper describes the structure of the corpus –which includes four subcorpora: English originals (EO) and Spanish originals (SO), English translations (Etrans) and Spanish translations (Strans)-, the registers selected for inclusion in the corpus, and the methodology used to guarantee the quality of the processing steps to enrich the corpus with linguistic information at different levels.
dc.description.departmentDepto. de Estudios Ingleses: Lingüística y Literatura
dc.description.facultyFac. de Filología
dc.description.refereedTRUE
dc.description.sponsorshipMinisterio de Economía y Competitividad (España)
dc.description.statuspub
dc.identifier.citationLavid, Julia, et al. «Creation of a High-quality, Register-diversified Parallel (English-Spanish) Corpus for Linguistic and Computational Investigations». Procedia : Social and Behavioral Sciences, vol. 198, 2015, pp. 249-256. ScienceDirect, https://doi.org/10.1016/j.sbspro.2015.07.443.
dc.identifier.doi10.1016/j.sbspro.2015.07.443
dc.identifier.issn1877-0428
dc.identifier.officialurlhttps://www.sciencedirect.com/science/article/pii/S1877042815044444
dc.identifier.relatedurlhttps://www.sciencedirect.com/journal/procedia-social-and-behavioral-sciences
dc.identifier.relatedurlhttps://www.sciencedirect.com/
dc.identifier.relatedurlhttps://www.elsevier.com/
dc.identifier.relatedurlhttps://doi.org/10.1016/j.sbspro.2015.07.552
dc.identifier.urihttps://hdl.handle.net/20.500.14352/110262
dc.journal.titleProcedia : Social and Behavioral Sciences
dc.language.isoeng
dc.page.final256
dc.page.initial249
dc.publisherElsevier
dc.relation.projectIDinfo:eu-repo/grantAgreement/MINECO//FFI2012-32201/ES/ANOTACION MULTIDIMENSIONAL DE TEXTOS COMPARABLES Y PARALELOS (INGLES-ESPAÑOL) PARA INVESTIGACIONES LINGUISTICAS Y COMPUTACIONALES/
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internationalen
dc.rights.accessRightsopen access
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subject.cdu811.111
dc.subject.cdu811.134.2
dc.subject.cdu81'25
dc.subject.keywordCorpus creation
dc.subject.keywordCorpus annotation
dc.subject.keywordEnglish
dc.subject.keywordSpanish
dc.subject.ucmLingüística
dc.subject.ucmTraducción e interpretación
dc.subject.unesco57 Lingüística
dc.subject.unesco5701.13 Lingüística Aplicada a la Traducción E Interpretación
dc.titleCreation of a high-quality, register-diversified parallel (English- Spanish) corpus for linguistic and computational investigations
dc.typejournal article
dc.type.hasVersionVoR
dc.volume.number198, Current Work in Corpus Linguistics : Working with Traditionally- conceived Corpora and Beyond. Selected Papers from the 7th International Conference on Corpus Linguistics (CILC2015)
dspace.entity.typePublication
relation.isAuthorOfPublicationcd1f6aa5-c457-4bd5-976e-953cd24d472e
relation.isAuthorOfPublication9b7ac5ea-9b1b-49e4-8207-98fd54bc8b48
relation.isAuthorOfPublication.latestForDiscoverycd1f6aa5-c457-4bd5-976e-953cd24d472e

Download

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Docta_1-s2.0-S1877042815044444-main.pdf
Size:
199.78 KB
Format:
Adobe Portable Document Format

Collections