Back to Search Start Over

Stationary Markov Nash equilibria for nonzero-sum constrained ARAT Markov games

Authors :
Dufour, François
Prieto-Rumeau, Tomás
Méthodes avancées d’apprentissage statistique et de contrôle (ASTRAL)
Institut de Mathématiques de Bordeaux (IMB)
Université Bordeaux Segalen - Bordeaux 2-Université Sciences et Technologies - Bordeaux 1 (UB)-Université de Bordeaux (UB)-Institut Polytechnique de Bordeaux (Bordeaux INP)-Centre National de la Recherche Scientifique (CNRS)-Université Bordeaux Segalen - Bordeaux 2-Université Sciences et Technologies - Bordeaux 1 (UB)-Université de Bordeaux (UB)-Institut Polytechnique de Bordeaux (Bordeaux INP)-Centre National de la Recherche Scientifique (CNRS)-Inria Bordeaux - Sud-Ouest
Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)-Naval Group
Université Bordeaux Segalen - Bordeaux 2-Université Sciences et Technologies - Bordeaux 1 (UB)-Université de Bordeaux (UB)-Institut Polytechnique de Bordeaux (Bordeaux INP)-Centre National de la Recherche Scientifique (CNRS)
Quality control and dynamic reliability (CQFD)
Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)
Institut Polytechnique de Bordeaux (Bordeaux INP)
Universidad Estatal a Distancia (UNED)
Université Bordeaux Segalen - Bordeaux 2-Université Sciences et Technologies - Bordeaux 1-Université de Bordeaux (UB)-Institut Polytechnique de Bordeaux (Bordeaux INP)-Centre National de la Recherche Scientifique (CNRS)-Université Bordeaux Segalen - Bordeaux 2-Université Sciences et Technologies - Bordeaux 1-Université de Bordeaux (UB)-Institut Polytechnique de Bordeaux (Bordeaux INP)-Centre National de la Recherche Scientifique (CNRS)-Inria Bordeaux - Sud-Ouest
Université Bordeaux Segalen - Bordeaux 2-Université Sciences et Technologies - Bordeaux 1-Université de Bordeaux (UB)-Institut Polytechnique de Bordeaux (Bordeaux INP)-Centre National de la Recherche Scientifique (CNRS)
Inria Bordeaux - Sud-Ouest
Institut National de Recherche en Informatique et en Automatique (Inria)
Source :
SIAM Journal on Control and Optimization, SIAM Journal on Control and Optimization, 2022, ⟨10.1137/21M144565X⟩
Publication Year :
2022
Publisher :
HAL CCSD, 2022.

Abstract

International audience; We consider a nonzero-sum Markov game on an abstract measurable state space with compact metric action spaces. The goal of each player is to maximize his respective discounted payoff function under the condition that some constraints on a discounted payoff are satisfied. We are interested in the existence of a Nash or noncooperative equilibrium. Under suitable conditions, which include absolute continuity of the transitions with respect to some reference probability measure, additivity of the payoffs and the transition probabilities (ARAT condition), and continuity in action of the payoff functions and the density function of the transitions of the system, we establish the existence of a constrained stationary Markov Nash equilibrium, that is, the existence of stationary Markov strategies for each of the players yielding an optimal profile within the class of all history-dependent profiles.

Details

Language :
English
ISSN :
03630129 and 10957138
Database :
OpenAIRE
Journal :
SIAM Journal on Control and Optimization, SIAM Journal on Control and Optimization, 2022, ⟨10.1137/21M144565X⟩
Accession number :
edsair.doi.dedup.....6f83842c2ffec80c9f2cc7bb798a5608
Full Text :
https://doi.org/10.1137/21M144565X⟩