Back to Search Start Over

Finite-Time Analysis of Minimax Q-Learning for Two-Player Zero-Sum Markov Games: Switching System Approach

Authors :
Lee, Donghwan
Lee, Donghwan
Publication Year :
2023

Abstract

The objective of this paper is to investigate the finite-time analysis of a Q-learning algorithm applied to two-player zero-sum Markov games. Specifically, we establish a finite-time analysis of both the minimax Q-learning algorithm and the corresponding value iteration method. To enhance the analysis of both value iteration and Q-learning, we employ the switching system model of minimax Q-learning and the associated value iteration. This approach provides further insights into minimax Q-learning and facilitates a more straightforward and insightful convergence analysis. We anticipate that the introduction of these additional insights has the potential to uncover novel connections and foster collaboration between concepts in the fields of control theory and reinforcement learning communities.<br />Comment: arXiv admin note: text overlap with arXiv:2205.05455

Details

Database :
OAIster
Publication Type :
Electronic Resource
Accession number :
edsoai.on1438454811
Document Type :
Electronic Resource