Wind turbine pitch reinforcement learning control improved by PID regulator and learning observer
dc.contributor.author | Sierra-García, Jesús Enrique | |
dc.contributor.author | Santos Peñas, Matilde | |
dc.contributor.author | Pandit, Ravi | |
dc.date.accessioned | 2024-09-13T13:53:48Z | |
dc.date.available | 2024-09-13T13:53:48Z | |
dc.date.issued | 2022 | |
dc.description.abstract | Wind turbine (WT) pitch control is a challenging issue due to the non-linearities of the wind device and its complex dynamics, the coupling of the variables and the uncertainty of the environment. Reinforcement learning (RL) based control has emerged as a promising technique to address these problems. However, its applicability is still limited by the slowness of the learning process. To help alleviate this drawback, in this work we present a hybrid RL-based control that combines an RL-based controller with a proportional–integral–derivative (PID) regulator and a learning observer. The PID is beneficial during the first training episodes, when the RL-based control does not yet have any experience to learn from. The learning observer oversees the learning process by adjusting the exploration rate and the exploration window in order to reduce oscillations during training and improve convergence. Simulation experiments on a small real WT show that learning improves significantly with this control architecture, speeding up learning convergence by up to 37% and increasing the efficiency of the intelligent control strategy. The best hybrid controller reduces the output power error by around 41% relative to a PID regulator. Moreover, the proposed intelligent hybrid control configuration has proved more efficient than a fuzzy controller and a neuro-control strategy. | |
dc.description.department | Depto. de Arquitectura de Computadores y Automática | |
dc.description.faculty | Instituto de Tecnología del Conocimiento (ITC) | |
dc.description.fundingtype | APC funded by UCM | |
dc.description.refereed | TRUE | |
dc.description.status | pub | |
dc.identifier.citation | Sierra-Garcia JE, Santos M, Pandit R. Wind turbine pitch reinforcement learning control improved by PID regulator and learning observer. Engineering Applications of Artificial Intelligence. 2022 May 1;111:104769. | |
dc.identifier.doi | doi.org/10.1016/j.engappai.2022.104769 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14352/108134 | |
dc.issue.number | 104769 | |
dc.journal.title | Engineering Applications of Artificial Intelligence | |
dc.language.iso | eng | |
dc.publisher | Elsevier | |
dc.relation.projectID | MCI/AEI/FEDER Project number RTI2018-094902-B-C21. | |
dc.rights | Attribution-NonCommercial-NoDerivatives 4.0 International | en |
dc.rights.accessRights | open access | |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | |
dc.subject.keyword | Intelligent control | |
dc.subject.keyword | Reinforcement learning | |
dc.subject.keyword | Learning observer | |
dc.subject.keyword | Pitch control | |
dc.subject.keyword | Wind turbines | |
dc.subject.ucm | Inteligencia artificial (Informática) | |
dc.subject.unesco | 3311.02 Ingeniería de Control | |
dc.title | Wind turbine pitch reinforcement learning control improved by PID regulator and learning observer | |
dc.type | journal article | |
dc.volume.number | 111 | |
dspace.entity.type | Publication | |
relation.isAuthorOfPublication | 99cac82a-8d31-45a5-bb8d-8248a4d6fe7f | |
relation.isAuthorOfPublication.latestForDiscovery | 99cac82a-8d31-45a5-bb8d-8248a4d6fe7f |