energies Article Adaptive Surrogate Estimation with Spatial Features Using a Deep Convolutional Autoencoder for CO 2 Geological Sequestration Suryeom Jo 1 , Changhyup Park 2, * , Dong-Woo Ryu 1 and Seongin Ahn 1   Citation: Jo, S.; Park, C.; Ryu, D.-W.; Ahn, S. Adaptive Surrogate Estimation with Spatial Features Using a Deep Convolutional Autoencoder for CO 2 Geological Sequestration. Energies 2021, 14, 413. https://doi.org/10.3390/ en14020413 Received: 26 November 2020 Accepted: 10 January 2021 Published: 13 January 2021 Publisher’s Note: MDPI stays neu- tral with regard to jurisdictional clai- ms in published maps and institutio- nal affiliations. Copyright: © 2021 by the authors. Li- censee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and con- ditions of the Creative Commons At- tribution (CC BY) license (https:// creativecommons.org/licenses/by/ 4.0/). 1 Geo-ICT Convergence Research Team, Korea Institute of Geoscience and Mineral Resources, Daejeon 34132, Korea; suryeom@kigam.re.kr (S.J.); dwryu@kigam.re.kr (D.-W.R.); seongin@kigam.re.kr (S.A.) 2 Department of Energy and Resources Engineering, Kangwon National University, Chuncheon 24341, Korea * Correspondence: changhyup@kangwon.ac.kr; Tel.: +82-33-2506259 Abstract: This paper develops a reliable deep-learning framework to extract latent features from spatial properties and investigates adaptive surrogate estimation to sequester CO 2 into heterogeneous deep saline aquifers. Our deep-learning architecture includes a deep convolutional autoencoder (DCAE) and a fully-convolutional network to not only reduce computational costs but also to extract dimensionality-reduced features to conserve spatial characteristics. The workflow integrates two different spatial properties within a single convolutional system, and it also achieves accurate reconstruction performance. This approach significantly reduces the number of parameters to 4.3% of the original number required, e.g., the number of three-dimensional spatial properties needed decreases from 44,460 to 1920. The successful dimensionality reduction is accomplished by the DCAE system regarding all inputs as image channels from the initial stage of learning using the fully- convolutional network instead of fully-connected layers. The DCAE reconstructs spatial parameters such as permeability and porosity while conserving their statistical values, i.e., their mean and standard deviation, achieving R-squared values of over 0.972 with a mean absolute percentage error of their mean values of less than 1.79%. The adaptive surrogate model using the latent features extracted by DCAE, well operations, and modeling parameters is able to accurately estimate CO 2 sequestration performances. The model shows R-squared values of over 0.892 for testing data not used in training and validation. The DCAE-based surrogate estimation exploits the reliable integration of various spatial data within the fully-convolutional network and allows us to evaluate flow behavior occurring in a subsurface domain. Keywords: deep convolutional autoencoder; deep learning; spatial parameter; latent feature; surro- gate model; data integration 1. Introduction Data science has revolutionized engineering analytics in the oil and gas industry. Data- driven analyses are now assisting the decision-making process making it more reliable as well as more efficient [15]. Despite these high-end computer-assisted methods delivering efficient solutions, establishing reliable standard forms has been challenging. Uncertainty quantification requires reliable integration of all available scale-dominant spatiotemporal properties. Rock properties are heterogeneous spatially, which makes fluid-flow uncertain in subsurface domains. A grid in a heterogeneous geo-model contains a lot of scale-variant, non-linearly-connected, and spatiotemporal information that influences flow behavior, e.g., porosity, permeability, capillary pressure, and fluid saturations. These properties are assigned to an individual grid but are non-linearly linked so that the governing equations integrate them closely. Quantifying the non-linearity of grid properties is essential to simu- late flow behavior in a realistic manner. Integrating all available scale-dependent, uncertain, and spatiotemporal properties requires large amounts of computing resources; thereby, the Energies 2021, 14, 413. https://doi.org/10.3390/en14020413 https://www.mdpi.com/journal/energies