  Citation: Li, L.; Shi, Z.; Liang, H.; Liu, J.; Qiao, Z. Machine Learning- Assisted Computational Screening of Metal-Organic Frameworks for Atmospheric Water Harvesting. Nanomaterials 2022, 12, 159. https://doi.org/10.3390/ nano12010159 Academic Editors: Filipe Figueiredo and Jorge Pasán Received: 7 November 2021 Accepted: 31 December 2021 Published: 3 January 2022 Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affil- iations. Copyright: © 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https:// creativecommons.org/licenses/by/ 4.0/). nanomaterials Article Machine Learning-Assisted Computational Screening of Metal-Organic Frameworks for Atmospheric Water Harvesting Lifeng Li 1,† , Zenan Shi 1,† , Hong Liang 1, *, Jie Liu 2, * and Zhiwei Qiao 1, * 1 Guangzhou Key Laboratory for New Energy and Green Catalysis, School of Chemistry and Chemical Engineering, Guangzhou University, Guangzhou 510006, China; lilifeng@e.gzhu.edu.cn (L.L.); zenanshi@126.com (Z.S.) 2 Key Laboratory for Green Chemical Process of Ministry of Education, School of Chemical Engineering and Pharmacy, Wuhan Institute of Technology, Wuhan 430073, China * Correspondence: lhong@gzhu.edu.cn (H.L.); ljie@wit.edu.cn (J.L.); zqiao@gzhu.edu.cn (Z.Q.); Tel.: +86-135-6015-8624 (Z.Q.) These authors contributed equally to this work. Abstract: Atmospheric water harvesting by strong adsorbents is a feasible method of solving the shortage of water resources, especially for arid regions. In this study, a machine learning (ML)- assisted high-throughput computational screening is employed to calculate the capture of H 2 O from N 2 and O 2 for 6013 computation-ready, experimental metal-organic frameworks (CoRE-MOFs) and 137,953 hypothetical MOFs (hMOFs). Through the univariate analysis of MOF structure-performance relationships, Q st is shown to be a key descriptor. Moreover, three ML algorithms (random forest, gradient boosted regression trees, and neighbor component analysis (NCA)) are applied to hunt for the complicated interrelation between six descriptors and performance. After the optimizing strategy of grid search and five-fold cross-validation is performed, three ML can effectively build the predictive model for CoRE-MOFs, and the accuracy R 2 of NCA can reach 0.97. In addition, based on the relative importance of the descriptors by ML, it can be quantitatively concluded that the Q st is dominant in governing the capture of H 2 O. Besides, the NCA model trained by 6013 CoRE-MOFs can predict the selectivity of hMOFs with a R 2 of 0.86, which is more universal than other models. Finally, 10 CoRE-MOFs and 10 hMOFs with high performance are identified. The computational screening and prediction of ML could provide guidance and inspiration for the development of materials for water harvesting in the atmosphere. Keywords: metal-organic frameworks; water harvesting; molecular simulation; algorithm; absorption 1. Induction As we all know, 71% of the earth’s surface is covered by water, and the remaining 29% is land. At first glance, we have a lot of water resources; in fact, the water we use is mainly fresh water, and fresh water only accounts for 2.5% of all water resources on the earth [1]. Among them, approximately 69% of the fresh water is enclosed in the ice layer of Antarctica and Greenland, and the remaining 30% is stored in the ground, so the fresh water (such as river water and fresh water lakes) that humans can directly use accounts for only 0.4% of all water resources [1]. As population growth and living standards improve, water resources are becoming increasingly scarce, especially daily water for residents of arid regions. Currently, one third of the world’s population live in regions with medium and high water shortages. It is estimated that two thirds of the world will face water shortages by 2050 [2]. Therefore, the lack of fresh water has become one of the major crises to be resolved. At present, several technologies are being used to address this issue. Desalination is one of the main ways to develop new fresh water resources, but the construction of this infrastructure requires a lot of money and the production process is highly energy intensive [3]. In addition, since the main arid and water-scarce areas are far inland, there Nanomaterials 2022, 12, 159. https://doi.org/10.3390/nano12010159 https://www.mdpi.com/journal/nanomaterials