Multi-target evolutionary latent space search of a generative adversarial network for human face generation Benjamín Machín Universidad de la República Uruguay benjamin.machin@fng.edu.uy Sergio Nesmachnow Universidad de la República Uruguay sergion@fng.edu.uy Jamal Toutouh ITIS Software, Universidad de Málaga Spain jamal@lcc.uma.es ABSTRACT This article presents an evolutionary approach for multi-target synthesized human face image generation based on exploring the latent space of generative adversarial networks. The proposed ap- proach seeks to generate diferent human face images those share similarities to two given target images. The optimization applies generative adversarial networks for face generation, facial recogni- tion for similarity evaluation, and an ad-hoc evolutionary algorithm for exploring the search space. The main results show that realis- tic images are generated, properly blending the main features of the two given target images, and deceiving a well-known facial recognition system. CCS CONCEPTS · Computing methodologies Bio-inspired approaches; Neu- ral networks; Continuous space search. KEYWORDS generative adversarial networks, multiobjective optimization; evo- lutionary latent space exploration, human face image generation ACM Reference Format: Benjamín Machín, Sergio Nesmachnow, and Jamal Toutouh. 2022. Multi- target evolutionary latent space search of a generative adversarial network for human face generation. In Genetic and Evolutionary Computation Confer- ence Companion (GECCO ’22 Companion), July 9ś13, 2022, Boston, MA, USA. ACM, New York, NY, USA, 9 pages. https://doi.org/10.1145/3520304.3533992 1 INTRODUCTION Generative machine learning models are a class of statistical mod- els that are able to generate new data instances. Given a set of data instances and a set of labels , generative models try to learn/capture the joint probability (, ) to describe the process of generating new datasets [6]. Generative Adversarial Networks (GANs) are deep learning-, neural network-based generative models [7] GANs consist of two artifcial neural networks (ANNs): a generator and a discriminator that apply adversarial learning to optimize their parameters. The generator learns how to create synthesized samples from input Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for proft or commercial advantage and that copies bear this notice and the full citation on the frst page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specifc permission and/or a fee. Request permissions from permissions@acm.org. GECCO ’22 Companion, July 9ś13, 2022, Boston, MA, USA © 2022 Association for Computing Machinery. ACM ISBN 978-1-4503-9268-6/22/07. . . $15.00 https://doi.org/10.1145/3520304.3533992 vectors drawn from a random latent space , i.e., () = . The goal of the generator is to approximate the true data distribution. Simultaneously, the discriminator learns to distinguish real samples (from a training dataset) from synthesized samples , produced by the generator. The GAN training process works iteratively and converges to a generator that approximates the real data distribu- tion and generate images that deceive the discriminator, which is no longer able to diferentiate real from synthesized samples (it labels real and synthesized samples at random). GANs have positioned as a successful tool for many applications that require the creation of creating synthesized data, especially those concerning multimedia (e.g., images, sound, video), healthcare, and other areas [13, 17ś19]. The synthesized data created by the generator is determined by the input vector drawn from the latent space. The latent space in a GAN is defned by a high dimensional random distribution (e.g., Gaussian distribution) linked to the trained generative model. Properly searching the latent space can lead to determining useful directions for generating images with specifc features, or fnd- ing disentangling sub-spaces or hyperplanes of the search space that control pose and facial characteristics. Given that the search space defned by the latent space of GANs has a large dimension, traditional optimization/search methods are not applicable. Thus, population-based metaheuristic methods, and especially evolution- ary computation [11], are useful to perform the search in reduced computational times. This article presents a proposal for fnding a sub-latent space (i.e., vectors from the latent space) that produces synthesized data samples similar to a set of target images. GANs are used to produce realistic human face images, and the search focuses on fnding latent space vectors to generate human face images with specifc attributes of the input images. This article extends our previous research [9], where an evolu- tionary algorithm (EA) was applied to search the latent space of GANs. The methodology applied in this article considers a multi- target approach, where the evolutionary search seeks to generate images with the features of two target faces. The generated im- ages are realistic, demonstrating that the proposed EA is capable of mixing features of the two input target images. Furthermore, the generated images are not distinguishable from the two target images for FaceNet, a widely used facial recognition systems [14]. Thus, this article studies the capability of generative neural mod- els to produce samples from the whole data distribution for the human face image generation problem and the efectiveness of evolutionary approaches for properly searching the latent space of GANs, when considering multiple target images. The main contri- butions of the research reported in this article are: i) a method based on EAs to generate synthesized human face images that properly 1878