Contents lists available at ScienceDirect Electrical Power and Energy Systems journal homepage: www.elsevier.com/locate/ijepes A new stratied random sample customer selection for load research study in distribution networks Yaser Raeisi-Gahrooei, Amin Khodabakhshian , Rahmat-Allah Hooshmand Department of Electrical Engineering, University of Isfahan, Isfahan, Iran ARTICLE INFO Keywords: Distribution network Energy consumption Load research Sampling ABSTRACT Distribution network decision makers need accurate and reliable information about load characteristics to plan, estimate and control the system properly. Load information is generally extracted from the collected load data of selected sample customers and, therefore, a proper sample customer selection is the pillar of any load research study. In this regard, this paper presents a new stratied sampling technique which includes three stages of sample size determination and customer stratication, sample size assignment to determined strata (subgroups), and sample customer selection. The sample size assignment to determined strata is done by considering the compatibility between load research objectives and sampling design and preparing the way to use the data and information provided by previous load research studies. Furthermore, sample customers are selected by con- sidering the energy consumption (kWh) range of customers, their activity classication, and their locations in distribution feeders. The numerical results from a real data of an electric power distribution system in Esfahan- Iran verify the eciency of the proposed technique when compared to the conventional method. 1. Introduction Load research is the process of measuring, collecting and studying the customerselectric load characteristics in order to provide the re- liable and thorough information for any company related to the pro- duction, distribution and management of electricity [1,2]. One of the major applications of load research analysis is the design and setting of retail taris for electricity supply [3,4]. This information can improve the accuracy of forecasting the future demand [5]. The other applica- tions are the usage of load proles of customers in capacitor placement and the reconguration of distribution networks [6], state estimation [7], and load modelling [8]. In this regard, the proper operation and control of distribution system requires all the information available on customerspatterns. This in turn helps the operator take proper action for both normal and atypical cases [9]. Since a large number of customers in dierent electricity taris are usually connected to a distribution network, the only possible way to study load characteristics of these customers is to use the statistical analysis. The development of the load research study consists of three steps; 1-sampling selection, 2-metering, and 3-analyzing the collected load data to extract information. The reliability and accuracy of load research results depend on how well these steps are taken. Among these steps the metering part has been facilitated by using the automated meter infrastructure (AMI) systems with a high degree of condence. Regarding the third step, after collecting the load data in the rst two steps, well-developed data mining approaches including pattern re- cognition methods and clustering algorithms [10,11] are used to extract the load information from the collected load data. However, few re- searches are done on sampling technique design, and in this regard this paper presents a new method. Customers in distribution networks have a number of distinct traits that can be divided into some homogeneous, independent and no- overlapping subgroups. These are known as stratication variables which are electricity taris, contract power, geographical position and region type. Stratifying customers into subgroups increases the preci- sion of the estimates and reduces the overall required sample size. In this regard, the stratied sampling technique can be used and includes three stages of sample size determination and customer stratication, sample size assignment to determined strata, and sample customer se- lection [12]. Sample size used in load research is limited based on companys budget to spend on metering and collecting load data. In this way, an optimal method to allocate samples to subgroups should be adopted to increase the accuracy and reliability of the collected data. Refs. [1316] give the implementation of the load survey system to identify the load characteristics of customers, by using the stratied sampling concept, to support system planning and operation. In [13], the load proles of low voltage customers are analysed and a clustering algorithm based on billing data is proposed. Ref. [14] investigates the https://doi.org/10.1016/j.ijepes.2017.11.029 Received 27 June 2016; Received in revised form 24 July 2017; Accepted 20 November 2017 Corresponding author. E-mail addresses: yaser_raisee@eng.ui.ac.ir (Y. Raeisi-Gahrooei), aminkh@eng.ui.ac.ir (A. Khodabakhshian), hooshmand_r@eng.ui.ac.ir (R.-A. Hooshmand). Electrical Power and Energy Systems 97 (2018) 363–371 Available online 04 January 2018 0142-0615/ © 2017 Elsevier Ltd. All rights reserved. T