The Node Classification Problem: aplicación al problema de clasificación multiclase en artículos relacionados con el aprendizaje automático

Martínez Tena, Eduardo

The Node Classification Problem: aplicación al problema de clasificación multiclase en artículos relacionados con el aprendizaje automático

dc.contributor.advisor	Gómez González, Daniel
dc.contributor.author	Martínez Tena, Eduardo
dc.date.accessioned	2023-06-22T21:22:51Z
dc.date.available	2023-06-22T21:22:51Z
dc.date.issued	2022-09
dc.description.abstract	La teoría de grafos es un área en constante crecimiento, pues su aplicación se extiende en diferentes áreas de las matemáticas, la ciencia y la tecnología. Se utiliza de manera activa en los campos como la bioquímica, las redes de comunicación y la teoría de la codificación, la informática (algoritmos y computación) y la investigación operativa (programación), así como en otra gran variedad de aplicaciones como la teoría de la codificación, el radar, la astronomía, el diseño de circuitos, el direccionamiento de redes de comunicación y la gestión de bases de datos. Este trabajo refleja el gran potencial de esta herramienta en el campo de la Inteligencia Artificial y, en concreto, del Machine Learning. La capacidad única de los grafos permite capturar las relaciones estructurales entre los datos y, por lo tanto, permite obtener más información en comparación con el análisis de datos aislados. La mayoría de las técnicas de ML no representan o no tienen en cuenta las posibles estructuras o relaciones que pudieran existir entre los objetos, asumiendo por simplicidad que los items son independientes entre sí. Por otro lado, y para todos aquellos escenarios y problemas de ML en que los datos presentan relaciones intrínsecas entre ellos, parece una necesidad tener en cuenta esta información adicional, eliminando la hipótesis de independencia (o no dependencia entre los datos). Sin embargo, a menudo es muy difícil resolver los problemas de aprendizaje en grafos debido a que, principalmente, muchos tipos de datos no están estructurados originalmente como grafos (como las imágenes y los datos de texto) y que, para datos estructurados como grafos, los patrones de conectividad subyacentes suelen ser complejos. Por lo tanto, dado un grafo con etiquetas en algunos nodos, ¿es posible predecir el valor que tendrán los valores no etiquetados? Este problema se conoce como el Node Clas- sification Problem. Aunque se han hecho algunos esfuerzos por incorporar la Teoría de Grafos a los modelos de aprendizaje, muchos de ellos siguen adoleciendo de mecanismos de aprendizaje poco profundos. Los modelos de aprendizaje profundo sobre grafos (por ejem- plo, las Redes Neuronales de Grafos - GNN) han surgido recientemente en el aprendizaje de otras áreas relacionadas, y han demostrado un rendimiento superior en varios problemas. En este trabajo se estudiarán e implementarán las últimas técnicas de aprendizaje sobre redes para problemas de ML donde los objetos están relacionados entre sí por medio de un grafo. En particular, se implementarán y analizarán las PRM (Probabilistic Relational Methods) y GCN (Graph Convolutional Networks) adaptadas al problema del Node Classi- fication Problem, esta última basándonos en el trabajo de Thomas N. Kirf y Max Welling, que forman parte del elenco de pocos autores que aborda este problema para el Node Clas- sification Problem. Es importante notar que las Redes Convolucionales (CNN) han sido usadas con frecuencia para problemas de imágenes donde las relaciones entre los píxeles que se pretende clasifi se modelizan mediante una estructura de grafo específi en rejilla, pero el Node Classifi n Problem permite relaciones entre objetos de cualquier tipo con una estructura de grafo general donde las técnicas clásicas (Redes Convolucionales) para imágenes no pueden aplicarse directamente.
dc.description.abstract	Graph theory is a constantly growing area, as its application extends into different areas of mathematics, science and technology. It is actively used in fields such as bio- chemistry, communication networks and coding theory, computer science (algorithms and computation) and operations research (programming), as well as in a wide variety of other applications such as coding theory, radar, astronomy, circuit design, addressing communi- cation networks and database management. This work reflects the great potential of this tool in the field of Artificial Intelligence and, in particular, Machine Learning. The unique capability of graphs allows capturing structural relationships between data and, therefore, allows obtaining more information compared to the analysis of isolated da- ta. Most ML techniques do not represent or do not take into account the possible structures or relationships that may exist between objects, assuming for simplicity that items are in- dependent of each other. On the other hand, and for all those ML scenarios and problems in which the data present intrinsic relationships between them, it seems a necessity to take into account this additional information, eliminating the hypothesis of independence (or non-dependence between the data). However, it is often very difficult to solve graph learning problems because, primarily, many types of data are not originally structured as graphs (such as images and text data) and that, for data structured as graphs, the underlying connectivity patterns are often complex. Therefore, given a network with labels at some nodes: is it possible to predict the value that unlabeled values will have? This problem is known as the Node Classification Problem. Although some eff rts have been made to incorporate graph theory into learning models, many of them still suffer from shallow learning mechanisms. Deep learning models on graphs (e.g., graph neural networks) have recently emerged in learning in other related areas, and have demonstrated superior performance on several problems. In this paper we will study and implement the latest network learning techniques for ML problems where objects are related to each other by means of a graph. Specifically, we will implement PRM (Probabilistic Relational Methods) and GCN (Graph Convolutional Networks), adapted to the Node Classifi Problem, this last one based on the work of Thomas N. Kirf and Max Welling, who are among the few authors who address this problem for the Node Classifi tion Problem. It is important to note that Convolutional Networks (CNN) have been frequently used for image classifi where the elationships between the the pixels to be classifi are modelled by a specifi graph structure as a grid, but the Node Classifi Problem allows relationships between objects of any type with a general graph structure where classical techniques (Convolutional Networks) for images cannot be directly applied.
dc.description.department	Depto. de Estadística y Ciencia de los Datos
dc.description.faculty	Fac. de Estudios Estadísticos
dc.description.refereed	TRUE
dc.description.status	unpub
dc.eprint.id	https://eprints.ucm.es/id/eprint/75034
dc.identifier.uri	https://hdl.handle.net/20.500.14352/73978
dc.language.iso	spa
dc.master.title	Máster en Minería de Datos e Inteligencia de Negocios. Facultad de Estudios Estadísticos. Curso 2021-2022
dc.rights.accessRights	open access
dc.subject.cdu	519.17
dc.subject.cdu	004.85
dc.subject.ucm	Estadística
dc.subject.unesco	1209 Estadística
dc.title	The Node Classification Problem: aplicación al problema de clasificación multiclase en artículos relacionados con el aprendizaje automático
dc.type	master thesis
dcterms.references	Biggs, N.,Lloyd,E.K.,Wilson,R.J.(1986).GraphTheory,1736-1936.OxfordUniversityPress. Breiman, L.(1996).Baggingpredictors.Machinelearning,24(2),123-140. Breiman, L.(2001).Randomforests.Machinelearning,45(1),5-32. Chen, T., Guestrin, C. (2016, August). Xgboost: A scalable tree boosting system. In Proceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining (pp. 785-794). El-Habil, A. M. (2012). An application on multinomial logistic regression model. Pa- kistan journal of statistics and operation research, 271-291. G.E. Hinton, N. Srivastava, A. Krizhevsky, I. Sutskever, and R.R. Salakhutdinov. Im- proving neural networks by preventing co-adaptation of feature detectors. arXiv preprint arXiv:1207.0580, 2012. Gómez González, D. (n.d.). Maquinas de Vector Soporte (No free lunch theorem). Kipf, T. N., Welling, M. (2016). Semi-supervised classifi with graph convolutio- nal networks. arXiv preprint arXiv:1609.02907. LeCun, Y., Kavukcuoglu, K., Farabet, C. (2010, May). Convolutional networks and applications in vision. In Proceedings of 2010 IEEE international symposium on circuits and systems (pp. 253-256). IEEE. Lizares, M. (2017). Universidad Nacional Mayor de San Marcos. Recuperado el 10 de marzo de 2020 de http://cybertesis.unmsm.edu.pe/bitstream/handle/cybertesis/7122/Lizares. Maier, H. R., Dandy, G. C. (2000). Neural networks for the prediction and forecasting of water resources variables: A review of modelling issues and applications. Environmental Modelling and Software, 15(1), 101–124. https://doi.org/10.1016/S1364-8152(99)00007-9 Marín, A., Martínez-Merino, L. I., Puerto, J., Rodríguez-Chía, A. M. (2022). The soft-margin Support Vector Machine with ordered weighted average. Knowledge-Based Systems, 237, 107705. Neville, J., Jensen, D., Friedland, L., Hay, M. (2003, August). Learning relational probability trees. In Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 625-630). Neville, J., Jensen, D., Gallagher, B. (2003, November). Simple estimators for rela- tional bayesian classifi In Third IEEE International Conference on Data Mining (pp. 609-612). IEEE. Nwankpa, C., Ijomah, W., Gachagan, A., Marshall, S. (2018). Activation functions: Comparison of trends in practice and research for deep learning. arXiv preprint arXiv:1811.03378. Song, Y. Y., Ying, L. U. (2015). Decision tree methods: applications for classifi and prediction. Shanghai archives of psychiatry, 27(2), 130. Tariq, H. I., Sohail, A., Aslam, U., Batcha, N. K. (2019). Loan default prediction mo- del using sample, explore, modify, model, and assess (SEMMA). Journal of Computational and Theoretical Nanoscience, 16(8), 3489-3503. Taskar, B., Segal, E., Koller, D. (2001, August). Probabilistic classifi n and cluste- ring in relational data. In International joint conference on artificial intelligence (Vol. 17, No. 1, pp. 870-878). Lawrence Erlbaum Associates LTD. Xiao, S., Wang, S., Dai, Y., Guo, W. (2022). Graph neural networks in node classifi cation: survey and evaluation. Machine Vision and Applications, 33(1), 1-19. Yekkehkhany, B., Safari, A., Homayouni, S., Hasanlou, M. (2014). A comparison study of different kernel functions for SVM-based classifi of multi-temporal polarimetry SAR data. The International Archives of Photogrammetry, Remote Sensing and Spatial Information Sciences, 40(2), 281. Zhou, J., Cui, G., Hu, S., Zhang, Z., Yang, C., Liu, Z., ... Sun, M. (2020). Graph neural networks: A review of methods and applications. AI Open, 1, 57-81. Ziwei, J., Li, D. y Telgarsky, M. (2021). Early-stopped neural networks are consistent. Advances in Neural Information Processing Systems, 34, 1805-1817.
dspace.entity.type	Publication
relation.isAdvisorOfPublication	4dcf8c54-8545-4232-8acf-c163330fd0fe
relation.isAdvisorOfPublication.latestForDiscovery	4dcf8c54-8545-4232-8acf-c163330fd0fe

Download

Original bundle

Now showing 1 - 1 of 1

Name:: TFM EduardoMartinezTena.pdf
Size:: 5.74 MB
Format:: Adobe Portable Document Format

Download

Collections

Trabajos Fin de Master (TFM)