Journal of Informetrics 8 (2014) 181–196 Contents lists available at ScienceDirect Journal of Informetrics j ourna l h o mepa ge: www.elsevier.com/locate/joi Estimating the accuracies of journal impact factor through bootstrap Kuan-Ming Chen a , Tsung-Hau Jen a, , Margaret Wu b a Science Education Center, National Taiwan Normal University, No. 88, 4th Section, Ting-Chou Road, Wen-Shan District, Taipei City 11677, Taiwan, ROC b Work-based Education Research Centre (WERC), Victoria Institute for Education, Diversity and Lifelong Learning, Victoria University, PO Box 14428, Melbourne, VIC 8001, Australia a r t i c l e i n f o Article history: Received 29 May 2013 Received in revised form 18 November 2013 Accepted 20 November 2013 Keywords: Bootstrap Journal impact factor Error estimation Education Citation index a b s t r a c t The journal impact factor (JIF) reported in journal citation reports has been used to represent the influence and prestige of a journal. Whereas the consideration of the stochastic nature of a statistic is a prerequisite for statistical inference, the estimation of JIF uncertainty is necessary yet unavailable for comparing the impact among journals. Using journals in the Database of Research in Science Education (DoRISE), the current study proposes bootstrap methods to estimate the JIF variability. The paper also provides a comprehensive exposition of the sources of JIF variability. The collections of articles in the year of interest and in the preceding years both contribute to JIF variability. In addition, the variability estimate differs depending on the way a database selects its journals for inclusion. In the bootstrap process, the nested structure of articles in a journal was accounted for to ensure that each bootstrap replication reflects the actual citation characteristics of articles in the journal. In conclusion, the proposed point and interval estimates of the JIF statistic are obtained and more informative inferences on the impact of journals can be drawn. © 2013 Elsevier Ltd. All rights reserved. 1. Introduction The idea that citation frequency measures the impact of a journal dates back nearly 60 years ago (Garfield, 1955). Since 1975, the journal impact factor (JIF) reported in the Science Citation Index (SCI) has been used to represent the influence and prestige of a journal (Garfield, 1999). The 2-year and 5-year JIFs indicate the mean citation rate across articles of a journal in the two and five preceding years, respectively. The JIF obtained in this way is an average citation rate per article and allows comparison among journals regardless of the total number of articles published by a journal. However, from a statistical perspective, any measurement comes with variability or uncertainty, and this variability must be considered to make comparison or draw inferences (Greenwood, 2007; Leydesdorff, 2013; Schneider, 2013). To date, advanced statistical techniques on high-speed computers are readily accessible; therefore, the current study utilized the bootstrap method (Efron & Tibshirani, 1993) to estimate the average JIF with confidence interval (CI) and standard error (SE) for a journal. Furthermore, taking both the JIF definition and the characteristics of database into consideration, the sources underlying JIF variability are separately examined. A number of concerns about JIF has been brought about by researchers, which mainly focused on the merits and disad- vantages of using JIF (e.g., Abramo, D’Angelo, & Di Costa, 2010; Buela-Casal, Perakakis, Taylor, & Checa, 2006; Fassoulaki, Corresponding author. Tel.: +886 2 77346773; fax: +886 2 29327187. E-mail addresses: skmchen@ntnu.edu.tw (K.-M. Chen), tsunghau@ntnu.edu.tw, tsunghau.jen@gmail.com (T.-H. Jen), wu@edmeasurement.com.au (M. Wu). 1751-1577/$ see front matter © 2013 Elsevier Ltd. All rights reserved. http://dx.doi.org/10.1016/j.joi.2013.11.007