214 International Journal of Statistics in Medical Research, 2016, 5, 214-218 E-ISSN: 1929-6029/16 © 2016 Lifescience Global Addressing the Challenge of P-Value and Sample Size when the Significance is Borderline: The Test of Random Duplication of Participants as a New Approach Jose-Gaby Tshikuka 1,2,* , Mgaywa G.M.D. Magafu 1,3 , Mooketsi Molefi 1 , Tiny Masupe 1 , Reginald B. Matchaba-Hove 4 , Bontle Mbongwe 4 and Roy Tapera 4 1 Department of Public Health Medicine, Faculty of Medicine, University of Botswana, Private bag 00713, Gaborone, Botswana 2 Department of Health Sciences, National Pedagogic University, Kinshasa I, DRC, Republic of the Congo 3 Department of Global Health, University of Washington, Seattle, USA 4 School of Public Health, Faculty of Health Sciences, University of Botswana, Private Bag 0022, Gaborone, Botswana Abstract: The issue of borderline p-value seems to divide health scientists into two schools of thought. One school of thought argues that when the p-value is greater than or equal to the statistical significance cut-off level of 0.05, it should not be considered statistically significant and the null hypothesis should be accepted no matter how close the p-value is to the 0.05. The other school of thought believes that by doing so one might be committing a Type 2 error and possibly missing valuable information. In this paper, we discuss an approach to address this issue and suggest the test of random duplication of participants as a way to interpret study outcomes when the statistical significance is borderline. This discussion shows the irrefutability of the concept of borderline statistical significance, however, it is important that one demonstrates whether a borderline statistical significance is truly borderline or not. Since the absence of statistical significance is not necessarily evidence of absence of effect, one needs to double check if a borderline statistical significance is indeed borderline or not. The p-value should not be looked at as a rule of thumb for accepting or rejecting the null hypothesis but rather as a guide for further action or analysis that leads to correct conclusions. Keywords: P-value, Sample Size, Statistical Significance, Borderline Significance, Participant Random Duplication. INTRODUCTION The p-value that indicates that an effect under study is statistically significant is a value that was established arbitrarily and by convention it should be < 0.05 [1,2]. This means that under the null hypothesis of no association, the probability of observing an effect as large as that found in the study population by chance and by chance alone is less than 5% [1-3]. This is the same as saying that chance is an unlikely explanation of the outcome [1-3]. But if the p-value is found to be 0.05, it is said that chance cannot be excluded as the likely explanation for the outcome, in which case the null hypothesis is not rejected and often the conclusion is that there is no effect or real association [4,5]. That is to say that when interpreting results from a study, we only have one of the two alternative conclusions – either the findings show that the explanatory variable under investigation has a statistically significant effect on the outcome (p < 0.05) or the explanatory variable does not have a statistically significant effect on the outcome (p > 0.05) [1-6]. *Address correspondence to this author at the Department of Public Health Medicine, Faculty of Medicine, University of Botswana, Private bag 00713, Gaborone, Botswana; Tel: +2673554603; E-mail: josegaby.tshikuka@mopipi.ub.bw A number of authors staunchly defend this viewpoint [7,8]. They argue that when the p-value is greater than 0.05, the null hypothesis should be simply accepted and that the explanatory variable shows no effect on the outcome and no other logical assertion should be considered. However, recent developments in biostatistics have shown that more experts are increasingly using such terms as “borderline significance”, “approaching significance”, “nearing significance”, and the like for outcomes whose p-values are slightly greater than or equal to 0.05 [9-12]. The criteria used to support p-values which are slightly greater than 0.05 as of borderline statistical significance or as of no statistical significance at all remain unclear to date and therefore the need for debating and clarifying this issue. This paper discusses the challenge of p-value and sample size when the statistical significance is borderline and lays ground work for scientific reasoning to avoid misinterpretation of p-values when they are slightly greater than or equal to the statistical significance cut-off level of 0.05. Correct interpretation will prevent erroneous conclusions and misguided actions that may follow.