RT Journal Article
T1 Resource management for power-constrained HEVC transcoding using reinforcement learning
A1 Costero Valero, Luis María
A1 Iranfar, Arman
A1 Zapater, Marina
A1 Atienza, David
A1 Olcoz Herrero, Katzalin
AB The advent of online video streaming applications and services, along with users' demand for high-quality content, requires High Efficiency Video Coding (HEVC), which provides higher video quality and more compression at the cost of increased complexity. On the one hand, HEVC exposes a set of dynamically tunable parameters to provide trade-offs among Quality-of-Service (QoS), performance, and power consumption of multi-core servers in the video providers' data center. On the other hand, resource management of modern multi-core servers is in charge of adapting system-level parameters, such as operating frequency and multithreading, to deal with concurrent applications and their requirements. Therefore, efficient multi-user HEVC streaming necessitates joint adaptation of application- and system-level parameters. Nonetheless, dealing with such a large and dynamic design space is challenging and difficult to address through conventional resource management strategies. Thus, in this work, we develop a multi-agent Reinforcement Learning framework to jointly adjust application- and system-level parameters at runtime to satisfy the QoS of multi-user HEVC streaming in power-constrained servers. In particular, the design space, composed of all design parameters, is split into smaller independent sub-spaces. Each design sub-space is assigned to a particular agent so that it can explore it faster, yet accurately. The benefits of our approach are revealed in terms of adaptability and quality (with up to 4x improvements in terms of QoS when compared to a static resource management scheme), and learning time (6x faster than an equivalent mono-agent implementation). Finally, we show that the power-capping techniques formulated outperform the hardware-based power capping with respect to quality.
PB IEEE Computer Society
SN 1045-9219
YR 2020
FD 2020-12-01
LK https://hdl.handle.net/20.500.14352/6543
UL https://hdl.handle.net/20.500.14352/6543
LA eng
NO ©2020 IEEE Computer Society. This work was supported by the EU (FEDER) and Spanish MINECO (RTI2018-093684-B-I00), MECD (FPU15/02050), CM (S2018/TCS-4423), and UCM (PR65/19-22445), the ERC Consolidator Grant COMPUSAPIEN (GA No. 725657), the H2020 RECIPE project (GA No. 801137), and the H2020 DeepHealth project (GA No. 825111).
NO Unión Europea. Horizonte 2020
NO Ministerio de Economía y Competitividad (MINECO)/FEDER
NO Ministerio de Educación, Cultura y Deporte (MECD)
NO Comunidad de Madrid
NO Universidad Complutense de Madrid
DS Docta Complutense
RD 15 Dec 2025