Cross-Dataset Analysis of Language Models for Generalised Multi-Label Review Note Distribution in Animated Productions

dc.contributor.authorGarcés Casao, Diego
dc.contributor.authorSantos Peñas, Matilde
dc.contributor.authorFernández Llorca, David
dc.date.accessioned2025-10-27T16:45:56Z
dc.date.available2025-10-27T16:45:56Z
dc.date.issued2025-04-22
dc.description.abstractDuring the production of an animated film, supervisors and directors hold daily meetings to evaluate in-progress material. Over the course of the several years it takes to complete a film, thousands of text notes outlining required fixes are generated. These notes are manually allocated to various departments for resolution. However, as with any manual process, a significant number of notes are either delayed, miss-assigned or overlooked entirely, which can negatively impact the final quality of the film. This paper investigates the performance of various methods for automating the distribution of review notes across relevant departments using datasets from multiple films produced by an animation studio in Madrid, Spain. Since each note can belong to multiple departments, the task is posed as a multi-label classification problem. The analysis and comparison of the results obtained with datasets from three different films, focusing on generalisation, provides critical insights for any Animation Studio evaluating the use of these methods in their process. The methods leverage Large Language Models (LLMs), including encoder-only models such as BERT and decoder-only models like Llama 2. Fine-tuning with QLoRA and in-context learning techniques were applied and evaluated across all datasets, and a cross-dataset analysis is presented. The fine-tuned encoder-only model achieved an F1-score of 0.98 for notes directed to the Animation department. Training was carried out locally on an RTX-3090 GPU, completing it in less than 30 min.
dc.description.departmentDepto. de Arquitectura de Computadores y Automática
dc.description.facultyInstituto de Tecnología del Conocimiento (ITC)
dc.description.refereedTRUE
dc.description.sponsorshipJoint Research Centre, European Commission
dc.description.statuspub
dc.identifier.citationGarcés, D., Santos, M., & Fernández-Llorca, D. (2025). Cross-Dataset Analysis of Language Models for Generalised Multi-label Review Note Distribution in Animated Productions. International Journal of Computational Intelligence Systems, 18(1), 88.
dc.identifier.doi10.1007/s44196-025-00785-9
dc.identifier.officialurlhttps://link.springer.com/article/10.1007/s44196-025-00785-9
dc.identifier.urihttps://hdl.handle.net/20.500.14352/125433
dc.issue.number88
dc.journal.titleInternational Journal of Computational Intelligence Systems
dc.language.isoeng
dc.page.final19
dc.page.initial1
dc.publisherSpringer Nature
dc.relation.projectIDHUMAINT project
dc.rightsAttribution-NonCommercial-NoDerivatives 4.0 Internationalen
dc.rights.accessRightsopen access
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/
dc.subject.keywordMovie production
dc.subject.keywordText classification
dc.subject.keywordLarge language models (LLMs)
dc.subject.keywordLlama 2
dc.subject.keywordFine-tuning
dc.subject.keywordIn-context learning
dc.subject.ucmInteligencia artificial (Informática)
dc.subject.unesco1203.04 Inteligencia Artificial
dc.titleCross-Dataset Analysis of Language Models for Generalised Multi-Label Review Note Distribution in Animated Productions
dc.typejournal article
dc.volume.number18
dspace.entity.typePublication
relation.isAuthorOfPublication99cac82a-8d31-45a5-bb8d-8248a4d6fe7f
relation.isAuthorOfPublication.latestForDiscovery99cac82a-8d31-45a5-bb8d-8248a4d6fe7f

Download

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Cross-Dataset_Analysisof_Language.pdf
Size:
1.34 MB
Format:
Adobe Portable Document Format

Collections