Multivariate Curve Resolution of Wavelet and Fourier Compressed Spectra Peter de B. Harrington,* Paul J. Rauch, and Chunsheng Cai § Center for Intelligent Chemical Instrumentation, Chemistry Department, Ohio University, Athens, Ohio 45701-2979, Schaffner Manufacturing Company, Inc., 21 Herron Avenue, Pittsburgh, Pennsylvania 15202, and Aventis Pharmaceuticals, Mail Stop: C1-M0336 10236 Marion Park Drive, Kansas City, Missouri 64137. The multivariate curve resolution method SIMPLe to use Interactive Self-Modeling Mixture Analysis (SIMPLISMA) was applied to Fourier and wavelet compressed ion- mobility spectra. The spectra obtained from the SIM- PLISMA model were transformed back to their original representation, that is, uncompressed format. SIMPLIS- MA was able to model the same pure variables for the partial wavelet transform, although for the Fourier and complete wavelet transforms, satisfactory pure variables and models were not obtained. Data were acquired from two samples and two different ion mobility spectrometry (IMS) sensors. The first sample was thermally desorbed sodium γ-hydroxybutyrate (GHB), and the second sample was a liquid mixture of dicyclohexylamine (DCHA) and diethylmethylphosphonate (DEMP). The spectra were compressed to 6.3% of their original size. SIMPLISMA was applied to the compressed data in the Fourier and wavelet domains. An alternative method of normalizing SIMPLISMA spectra was devised that removes variation in scale between SIMPLISMA results obtained from uncompressed and compressed data. SIMPLISMA was able to accurately extract the spectral features and con- centration profiles directly from daublet compressed IMS data at a compression ratio of 93.7% with root-mean- square errors of reconstruction <3%. The daublet wavelet filters were selected, because they worked well when compared to coiflet and symmlet. The effects of the daublet filter width and compression ratio were evaluated with respect to reconstruction errors of the data sets and SIMPLISMA spectra. For these experiments, the daublet 14 filter performed well for the two data sets. Advances in analytical instrumentation have led to measure- ment systems that generate large quantities of data for single samples. Examples include hyphenated systems that include separation stages coupled to multichannel detectors, such as liquid chromatography-mass spectrometry (LC-MS) and LC-nuclear magnetic resonance spectrometry (LC-NMR). Some advances, such as the use of time-of-flight MS, furnish more resolution elements and can collect spectra at faster rates. Large volumes of data may be unwieldy for further online processing, especially if the processing is to be accomplished in real time. Another area of advancement is the miniaturization of chemical sensors. As sensors decrease in size and cost, they will find widespread use outside of the controlled environment of the analytical laboratory. The analysis of complex samples in intricate environments will also force the replacement of the classical approach of signal averaging a stable sensor response with dynamic modeling methods. The advantage of modeling the dynamic sensor response is that temporal information may be exploited to resolve analytes in a mixture or to correct instrument drift resulting from changing ambient conditions. The use of dynamic modeling requires storing individual spectra or data objects. As sensors decrease in size, their storage and processing capabilities may become limited. For some applications, the data may be transmitted from sensors using wireless communication. Bandwidth limits may be encountered that may restrict the rate at which data can be conveyed, thereby necessitating the use of compression. IMS sensors are amenable to miniaturization. These instru- ments are routinely used at airports for screening hand luggage for explosives, and have broad application for forensic, environ- mental, process, and industrial hygiene monitoring. Hand-held IMS sensors are commercially available that furnish detection limits below one part per million. By modeling a data set using methods such as principal component analysis (PCA) and SIMPLISMA, the model can provide a clear and complete perspective of large and intricate sets of measurements. Multivariate curve resolution methods build mathematical models from variations in a data set that resolve spectra (i.e., curves). The models estimate the number of independent components (i.e., analytes). Overlapping peaks in the spectra may be modeled as separate components. Models furnish clear perspectives into complex trends in data. 1 Modeling methods should be used routinely for spectrometric and multichannel measurements. Compression methods may be used to reduce the size of the spectra so that low capacity storage can be maintained and computational burdens alleviated. In many cases, compressing the data can also improve its quality by removing high-frequency noise components. In many compression methods, the data when * Corresponding author. ² Ohio University. Shaffner Manufacturing Company, Inc. § Aventis Pharmaceuticals. (1) Harrington, P. B.; Reese, E. S.; Rauch, P. J.; Hu, L.; Davis, D. M. Appl. Spectrosc. 1997, 51, 808-816. Anal. Chem. 2001, 73, 3247-3256 10.1021/ac000956s CCC: $20.00 © 2001 American Chemical Society Analytical Chemistry, Vol. 73, No. 14, July 15, 2001 3247 Published on Web 05/26/2001