Quantitative Quality Control from Qualitative Data: Control Charts with Latent Semantic Analysis Triss Ashton* Assistant Professor The University of Texas Pan American 1201 West University, Edinburg, TX 78539 e-mail: AshtonTA@utpa.edu (956) 665-3353 (956) 665-3367 (fax) * Corresponding Author Nicholas Evangelopoulos, Associate Professor University of North Texas 1155 Union Circle # 311160 Denton, TX 76203-5017 Victor R. Prybutok Regents Professor University of North Texas 1155 Union Circle # 311160 Denton, TX 76203-5017 ABSTRACT Large quantities of data, often referred to as Big Data, are now held by companies. This Big Data includes statements of customer opinion regarding product or service quality in an unstructured textual form. While many tools exist to extract meaningful information from Big Data, automation tools do not exist to monitor the ongoing conceptual content of that data. We use latent semantic analysis to extract concept factors related to service quality categories. Customer comments found in the data that express dissatisfaction are then considered as representing a non-conforming observation in a process. Once factors are extracted, proportions of nonconformities for service quality failure categories are plotted on a control chart. The results are easily interpreted and the approach allows for the quantitative evaluation of customer acceptance of system process improvement initiatives. Keywords Analysis of text data, big data, control charts, customer satisfaction, latent semantic analysis (LSA), statistical process control (SPC), text mining. For the final published version of this paper, See Quality & Quantity DOI DOI 10.1007/s11135-014-0036-5 available at http://link.springer.com/article/10.1007%2Fs11135-014-0036-5