Vol.:(0123456789) The Journal of Supercomputing https://doi.org/10.1007/s11227-020-03293-z 1 3 Improving learning ability of learning automata using chaos theory Bagher Zarei 1 · Mohammad Reza Meybodi 2 © Springer Science+Business Media, LLC, part of Springer Nature 2020 Abstract A learning automaton (LA) can be considered as an abstract system with a fnite set of actions. LA operates by choosing an action from the set of its actions and applying it to the stochastic environment. The environment evaluates the chosen action, and automaton uses the response of the environment to update its decision- making method for selecting the next action. This process is repeated until the opti- mal action is found. The learning algorithm (learning scheme) determines how to use the environment response for updating the decision-making method to select the next action. In this paper, the chaos theory is incorporated with the LA and a new type of LA, namely chaotic LA (cLA), is introduced. In cLA, the chaotic numbers are used instead of the random numbers when choosing the action. The experiment results show that in most cases, the use of chaotic numbers leads to a signifcant improvement in the learning ability of the LA. Among the chaotic maps investigated in this paper, the Tent map has better performance than the other maps. The conver- gence rate/convergence time of the LA will increase/decrease by 91.4%/29.6% to 264.4%/69.1%, on average, by using the Tent map. Furthermore, the chaotic LA has more scalability than the standard LA, and its performance will not decrease signif- cantly by increasing the problem size (number of actions). Keywords Reinforcement learning · Learning automata · Chaos theory · Chaotic map · Chaotic learning automata * Mohammad Reza Meybodi mmeybodi@aut.ac.ir 1 Faculty of Computer and Information Technology Engineering, Qazvin Branch, Islamic Azad University, Qazvin, Iran 2 Department of Computer Engineering and Information Technology, Amirkabir University of Technology, Tehran, Iran