Estudi de la resolució de trencaclosques matemàtics amb Reinforcement learning (Study of solving mathematical puzzles with Reinforcement Learning)

Authors :: Universitat Politècnica de Catalunya. Departament de Física
Sellarès González, Jordi
Palomo Olivares, Amanda
Publication Year :: 2022
Abstract: This Final Degree Project owes its name to its main objective: to study the incorporation of Reinforcement Learning into the puzzles that make up the content of the project. In this paradigm, programs learn to solve mathematically complex problems from simple stimuli related to their successes. To make the study possible, the Reinforcement Learning technique has been implemented as a basis and adapted to the nature of each of the games considered. Q-Learning is used for those puzzles that are limited enough in size to be tackled directly. In contrast, one game is considered that does not follow this trend because its dynamics are too broad. For this game, the Deep Q-Learning technique is adapted, using a neural network that makes the states of the puzzle manageable. The results obtained in this project show that training a model through stimuli and rewards in games based on procedures and rules is fully achievable. This is evident because the agent not only learns the requirements of the game in question but also provides the optimal solution, in the sense of choosing the shortest solution and therefore minimizing erroneous choices. Finally, the objectives of this study have been satisfactorily achieved and, to verify learning, the behavior of a human is compared with that of the model. Thus, the good fit between the field of games and artificial intelligence is evident, owing to the rules that govern the agent's behavior model.
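As a rough illustration of the tabular Q-Learning idea summarized in the abstract, the following minimal sketch trains an agent on a made-up number puzzle. The puzzle (reach 10 from 1 using "+1" or "*2" moves), the reward values, and every name and hyperparameter here are assumptions chosen for illustration; none of them is taken from the thesis itself.

```python
import random

# Hypothetical toy puzzle (not from the thesis): reach TARGET from START.
START, TARGET = 1, 10
N_ACTIONS = 2  # action 0 is "+1", action 1 is "*2"


def step(state, action):
    """Apply one move and return (next_state, reward, done)."""
    nxt = state + 1 if action == 0 else state * 2
    if nxt > TARGET:
        # Overshooting is an erroneous choice: the move is rejected and penalized.
        return state, -1.0, False
    if nxt == TARGET:
        return nxt, 1.0, True  # success is the only positive stimulus
    return nxt, 0.0, False


def train(episodes=5000, alpha=0.5, gamma=0.9, epsilon=0.3, seed=0):
    """Tabular Q-Learning with an epsilon-greedy behavior policy."""
    rng = random.Random(seed)
    q = {(s, a): 0.0 for s in range(START, TARGET + 1) for a in range(N_ACTIONS)}
    for _episode in range(episodes):
        state = START
        for _move in range(50):  # cap on moves per episode
            if rng.random() < epsilon:
                action = rng.randrange(N_ACTIONS)  # explore
            else:
                action = max(range(N_ACTIONS), key=lambda a: q[(state, a)])  # exploit
            nxt, reward, done = step(state, action)
            best_next = 0.0 if done else max(q[(nxt, a)] for a in range(N_ACTIONS))
            # Standard Q-Learning update toward reward + discounted best next value.
            q[(state, action)] += alpha * (reward + gamma * best_next - q[(state, action)])
            state = nxt
            if done:
                break
    return q


def greedy_path(q, max_steps=20):
    """Follow the learned greedy policy from START and record visited states."""
    path, state = [START], START
    for _move in range(max_steps):
        action = max(range(N_ACTIONS), key=lambda a: q[(state, a)])
        state, _reward, done = step(state, action)
        path.append(state)
        if done:
            break
    return path


q_table = train()
print(greedy_path(q_table))
```

Because future rewards are discounted (gamma below 1), shorter solutions receive higher values, so under these assumptions the greedy policy should settle on the four-move path 1, 2, 4, 5, 10 rather than nine "+1" moves. A Deep Q-Learning variant would replace the table `q` with a neural network that estimates the same values.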

Details

Database :: OAIster
Notes :: application/pdf, Catalan
Publication Type :: Electronic Resource
Accession number :: edsoai.on1348514263
Document Type :: Electronic Resource
