%0 Thesis
%A Redondo Antón, Javier
%T Comparativa de modelos de random forest y redes neuronales aplicados al mantenimiento predictivo con valores ausentes y datos desbalanceados
%D 2021
%U https://hdl.handle.net/20.500.14352/5181
%X En este trabajo se describen las tareas seguidas para solucionar un problema de mantenimiento predictivo que consiste en utilizar técnicas de aprendizaje automático para predecir si un componente específico del sistema de aire comprimido de un camión pesado se enfrentará a un fallo inminente. Este problema se modela como un problema de clasificación, ya que el objetivo es determinar si una instancia no observada representa un fallo o no. Se evalúan varios algoritmos de clasificación y se investiga cómo tratar con un conjunto de datos desbalanceado y con gran cantidad de valores ausentes. El enfoque se compone de cuatro pasos: (i) la creación de tres conjuntos de datos distintos aplicando diversas técnicas de tratamiento de datos; (ii) la creación de varios modelos de aprendizaje automático; (iii) el ajuste de sus hiperparámetros y del umbral de probabilidad para las predicciones, y (iv) la comparación de resultados entre los distintos modelos sobre los conjuntos creados para determinar la mejor solución. Los resultados muestran que una buena imputación de los valores ausentes y el ajuste del umbral de probabilidad son factores clave a la hora de mejorar el rendimiento de los clasificadores.
%X This work describes the workflow used to solve a predictive maintenance problem: using machine learning techniques to predict whether a specific component of the Air Pressure System of a heavy truck is facing an imminent failure. The problem is modeled as a classification problem, since the objective is to determine whether or not an unseen instance represents a failure. Several classification algorithms are evaluated, and strategies for handling an imbalanced dataset with a large number of missing values are investigated. The approach consists of four steps: (i) the creation of three different datasets by applying various data processing techniques; (ii) the creation of several machine learning models; (iii) the tuning of their hyperparameters and of the probability threshold used for predictions; and (iv) the comparison of results across the different models on the created datasets to determine the best solution. The results show that appropriate imputation of missing values and tuning of the probability threshold are key factors in improving the performance of the classifiers.
%~