검색 상세

A study on some games with reinforcement learning

초록/요약

Artificial intelligence has increased in popularity as a result of the victory in the match between Lee Sedol and AlphaGo, an artificial intelligence Go program developed by Google DeepMind. In this paper, we apply Q-learning, one of the reinforcement learning algorithms, to tic-tac-toe, which is a simpler game than Go, and renju, which is a type of Gomoku. In addition, the newly proposed mathematical game, the factorization game, is implemented in C-language, and Q-learning is applied. As a result of the analysis, Renju confirmed that the number of cases is too diverse, which makes learning difficult with Q-learning. The factorization game is a fair simulation when playing black and white 10 * 10 Go-board. For instance, when p = 4, place two or three Go-stones on the board. And we analyze the results of Q-learning applied to the factorization game. In the future, we plan to analyze the effect of the deep neural network technology, which shows good performance for a larger number of cases when applied to Renju, and also analyze the factorization game when the board size is increased.

more