The Research Core Dataset for the German science system: developing standards for an integrated management of research information Sophie Biesenbender 1 Stefan Hornbostel 1 Received: 25 February 2016 Ó Akade´miai Kiado´, Budapest, Hungary 2016 Abstract The paper summarizes the results of the recently completed project to derive a Research Core Dataset (RCD) for the German science system. It describes the basic principles and the architecture of the specification by introducing its main components and elements and by depicting the provisions with regard to aggregate and base data. In this context, the paper also explains the peculiarities of the German science system and the need for standardization given institutional heterogeneity and highly fragmented institu- tional reporting activities. The paper concludes with a short outlook on the potential chances and risks of the RCD to promote data integration and efficiency in reporting by research institutions in Germany. Keywords Comparability Á Data integration Á Data quality Á Standardization Introduction: standardization of research information in the German science system The science system in Germany is historically grown and characterized by institutional heterogeneity. Next to the state-regulated 239 public higher education institutions (of which 125 have the right to award doctoral degrees), 1 there are numerous non-university research institutions that fall under yet different regulation and are often jointly financed by federal and state governments (through the Joint Science Conference). Most of these institutions are part of either one of the four umbrella associations: Max Planck Society & Sophie Biesenbender biesenbender@dzhw.eu 1 German Centre for Higher Education Research and Science Studies (DZHW), Berlin, Germany 1 According to the German Rectors Conference; see http://www.hochschulkompass.de/hochschulen/ download.html; accessed 19 October 2015. 123 Scientometrics DOI 10.1007/s11192-016-1909-2