Computational Statistics & Data Analysis 51 (2007) 2559 – 2572 www.elsevier.com/locate/csda Quantile estimation in two-phase sampling María del Mar Rueda a , , Antonio Arcos a , Juan Francisco Muñoz a , Sarjinder Singh b a Department of Statistics and O.R., University of Granada, 18071 Granada, Spain b Department of Statistics, St. Could State University, 720 FourthAvenue South, St. Cloud, MN 56301-4498, USA Received 12 January 2005; received in revised form 3 January 2006; accepted 3 January 2006 Available online 24 January 2006 Abstract The estimation of quantiles in two-phase sampling with arbitrary sampling design in each of the two phases is investigated. Several ratio and exponentiation type estimators that provide the optimum estimate of a quantile based on an optimum exponent are proposed. Properties of these estimators are studied under large sample size approximation and the use of double sampling for stratification to estimate quantiles can also be seen. The real performance of these estimators will be evaluated for the three quartiles on the basis of data from two real populations using different sampling designs. The simulation study shows that proposed estimators can be very satisfactory in terms of relative bias and efficiency. © 2006 Elsevier B.V.All rights reserved. Keywords: Auxiliary information; Finite population quantiles; Two-phase sampling; Stratified random sampling 1. Introduction The problem of estimating a population mean in the presence of an auxiliary variable has been widely discussed in the finite population sampling literature. However, for the problem of estimating a population median, the situation is quite different and only recently has this problem been discussed. Rao et al. (1990) proposed ratio and difference estimators for the median using a design-based approach. Kuk and Mak (1989) proposed two estimators for which it was only necessary to know the values of the median of the auxiliary variable for the whole population. More recently, Rueda et al. (1998) and Rueda and Arcos (2001) proposed confidence intervals for quantiles based on ratio and difference estimators of the distribution function. In Rueda et al. (2003, 2004) the population information is used through a quantile of the auxiliary variable with the same or different order as that of the quantile of the main variable considered for estimation using difference type estimators. The above estimators are based on prior knowledge of the median Q x (0.5) of the auxiliary characteristic. In many cases Q x (0.5) may not be known, and it may be seen that taking the sample selection in two phases is an attractive solution. Two-phase sampling is a good compromise for surveys in which no prior knowledge is available about the population. A key to successful two-phase sampling is the creation of a highly informative frame for the part of the population Corresponding author. Departamento de Estadística e I.O., Facultad de Ciencias, Avda. Fuentenueva, Universidad de Granada, 18071, Granada, Spain. Tel.: +34 958240494; fax: +34 958243267. E-mail addresses: mrueda@ugr.es (M. del Mar Rueda), arcos@ugr.es (A. Arcos), jfmunoz@ugr.es (J.F. Muñoz), sarjinder@yahoo.com (S. Singh). 0167-9473/$ - see front matter © 2006 Elsevier B.V. All rights reserved. doi:10.1016/j.csda.2006.01.002