Noisy mean ﬁeld stochastic games with network applications Hamidou Tembine LSS, CNRS-Sup´ elec-Univ. Paris Sud, France Email: tembine@ieee.org Pedro Vilanova AMCS, KAUST, Saudi Arabia E-mail:pedro.guerra@kaust.edu.sa M´ erouane Debbah Chaire Alcatel-Lucent en radio-ﬂexible, Sup´ elec, France E-mail:merouane.debbah@supelec.fr Abstract We consider a class of stochastic games with ﬁnite number of resource states, individual states and actions per states. At each stage, a random set of players interact. The states and the actions of all the interacting players determine together the instantaneous payoffs and the transitions to the next states. We study the convergence of the stochastic game with variable set of interacting players when the total number of possible players grow without bound. We show that the optimal payoffs, the mean ﬁeld equilibrium payoffs are solution of coupled system of backward-forward equations. The limiting games are equivalent to discrete time anonymous sequential games or to differential population games. Using multidimensional diffusion process, a general mean ﬁeld convergence to stochastic differential equation is given. We illustrate the controlled mean ﬁeld limit in wireless networks. CONTENTS I Introduction 2 II Stochastic game with individual states 4 II-A The setting ............................................. 4 II-B Policies and Strategies ...................................... 5 II-C Different types of payoffs ..................................... 5 II-D Computation of equilibria ..................................... 7 II-E Q-values .............................................. 7 II-F Dynamic team problems and stochastic potential games .................... 8 III The classic mean ﬁeld approach 8 III-A Mean Field Interaction model [1] ................................ 8 III-B Mean Field Asymptotic of Markov Decision Evolutionary Games and Teams model [2] . . . 9 IV Controlled mean ﬁeld interaction 10 V Noisy mean ﬁeld approach 10 VI Convergence to discrete time mean ﬁeld 13 VI-A Big step-size ............................................ 13 VI-A1 Mean ﬁeld coordination games ............................ 14 VI-B Vanishing step-size ........................................ 15 VI-B1 Centralized mean ﬁeld control ............................ 15 VII Application to malware propagation 16 VII-A Homogeneous system ....................................... 18 VII-A1 Uncontrolled behaviour ................................ 18 VII-A2 Controlled behaviour ................................. 18 VII-B Heterogeneous system ....................................... 19