ARTICLE Machine learning uncovers cell identity regulator by histone code Bo Xia 1,2,3,4,7 , Dongyu Zhao 1,2,3,4,7 , Guangyu Wang 1,2,3,4,7 , Min Zhang 2,3,4 , Jie Lv 1,2,3,4 , Alin S. Tomoiaga 5 , Yanqiang Li 1,2,3,4 , Xin Wang 1,2,3,4 , Shu Meng 2,3,4 , John P. Cooke 2,3,4 , Qi Cao 6 , Lili Zhang 2,3,4 & Kaifu Chen 1,2,3,4 Conversion between cell types, e.g., by induced expression of master transcription factors, holds great promise for cellular therapy. Our ability to manipulate cell identity is constrained by incomplete information on cell identity genes (CIGs) and their expression regulation. Here, we develop CEFCIG, an articial intelligent framework to uncover CIGs and further dene their master regulators. On the basis of machine learning, CEFCIG reveals unique histone codes for transcriptional regulation of reported CIGs, and utilizes these codes to predict CIGs and their master regulators with high accuracy. Applying CEFCIG to 1,005 epigenetic proles, our analysis uncovers the landscape of regulation network for identity genes in individual cell or tissue types. Together, this work provides insights into cell identity regulation, and delivers a powerful technique to facilitate regenerative medicine. https://doi.org/10.1038/s41467-020-16539-4 OPEN 1 Center for Bioinformatics and Computational Biology, Houston Methodist Research Institute, Houston, TX, USA. 2 Center for Cardiovascular Regeneration, Department of Cardiovascular Sciences, Houston Methodist Research Institute, Houston, TX, USA. 3 Department of Cardiothoracic Surgeries, Weill Cornell Medical College, Cornell University, New York, NY, USA. 4 Institute for Academic Medicine, Houston Methodist Research Institute, Houston, TX, USA. 5 Business Analytics, CIS & Law Department, The OMalley School of Business Accounting, Manhattan College, Riverdale, NY, USA. 6 Department of Urology, Robert H. Lurie Comprehensive Cancer Center, Chicago, IL, USA. 7 These authors contributed equally: Bo Xia, Dongyu Zhao, Guangyu Wang. email: qi.cao@northwestern.edu; lzhang3@houstonmethodist.org; kchen2@houstonmethodist.org NATURE COMMUNICATIONS | (2020)11:2696 | https://doi.org/10.1038/s41467-020-16539-4 | www.nature.com/naturecommunications 1 1234567890():,;