Keith J. Dreyer, DO, PhD Mannudeep K. Kalra, MD, DNB Michael M. Maher, FFR (RCSI), MD, FRCR Autumn M. Hurier, BS Benjamin A. Asfaw, MHSA Thomas Schultz, BS Elkan F. Halpern, PhD James H. Thrall, MD Published online before print 10.1148/radiol.2341040049 Radiology 2005; 234:323–329 Abbreviations: BI-RADS = Breast Imaging Reporting and Data System CI = confidence interval LEXIMER = Lexicon Mediated Entropy Reduction 1 From the Division of Computing and Information Services, Department of Radiology, Massachusetts General Hospital and Harvard Medical School, 100 Charles River Plaza, Suite 471, Cambridge St, Boston, MA 02114. Re- ceived January 9, 2004; revision re- quested March 12; revision received April 6; accepted May 19. Address correspondence to K.J.D. (e-mail: kdreyer@partners.org). Authors stated no financial relation- ship to disclose. Author contributions: Guarantors of integrity of entire study, K.J.D., J.H.T.; study concepts, K.J.D., M.K.K., J.H.T.; study design, K.J.D., M.K.K.; literature research, M.K.K., K.J.D.; clinical studies, K.J.D., M.K.K., M.M.M., B.A.A., T.S.; experimental studies, K.J.D., A.M.H., T.S.; data ac- quisition, M.K.K., M.M.M., A.M.H., B.A.A., T.S., K.J.D.; data analysis/inter- pretation, M.K.K., E.F.H.; statistical analysis, M.K.K., E.F.H.; manuscript preparation, K.J.D., M.K.K., M.M.M., J.H.T., E.F.H.; manuscript definition of intellectual content, K.J.D., M.K.K., J.H.T.; manuscript editing, K.J.D., M.K.K., J.H.T., M.M.M.; manuscript revision/review, K.J.D., M.K.K., M.M.M.; manuscript final version approval, all au- thors © RSNA, 2004 Application of Recently Developed Computer Algorithm for Automatic Classification of Unstructured Radiology Reports: Validation Study 1 PURPOSE: To validate the accuracy of Lexicon Mediated Entropy Reduction (LEXIMER), a new information theory– based computer algorithm developed by the authors for independent analysis and classification of unstructured radiology reports based on the presence of clinically important findings (F T , where T represents “true”) and recommendations for subsequent action (R T ). MATERIALS AND METHODS: The study was approved by the Human Research Committee of the institutional review board. Consecutive de-identified radiology reports (n = 1059) comprising results of barium studies (n = 99), computed tomography (n = 107), mammography (n = 90), magnetic resonance imaging (n = 108), nuclear medicine (n = 99), positron emission tomography (n = 106), radi- ography (n = 212), ultrasonography (n = 131), and vascular procedures (n = 107) were independently analyzed by two radiologists and then with LEXIMER to cate- gorize the reports into F T and F T 0 (containing or not containing clinically important findings) categories and R T and R T 0 (containing or not containing recommendations for subsequent action) categories. Accuracy, sensitivity, specificity, and positive and negative predictive values of LEXIMER for placing reports into F T and F T 0 and R T and R T 0 categories were assessed by using appropriate statistical tests. RESULTS: There was strong interobserver concordance between the two radiolo- gists for placing radiology reports into F T and R T categories (= 0.9, P .01). For the LEXIMER program, accuracy, sensitivity, specificity, and positive and negative predictive values, respectively, were 97.5% (95% confidence interval [CI]: 96.6%, 98.5%), 98.9% (95% CI: 97.9%, 99.6%), 94.9% (95% CI: 93.1%, 96.0%), 97.5% (95% CI: 96.6%, 98.0%), and 97.7% (95% CI: 95.8%, 98.8%) for placing radiology reports into F T and F T 0 categories and 99.6% (95% CI: 99.2%, 99.9%), 98.2% (95% CI: 95.0%, 99.6%), 99.9% (95% CI: 99.4%, 99.99%), 99.4% (95% CI: 96.3%, 99.9%), and 99.7% (95% CI: 98.9%, 99.9%) for placing reports into R T and R T 0 categories. CONCLUSION: LEXIMER is an accurate automated engine for evaluating the percentage positivity of clinically important findings and rates of recommendation for subsequent action in unstructured radiology reports. © RSNA, 2004 An assessment of diagnostic yield (the frequency of clinically important findings at an imaging examination) and recommendations in radiology practice may help in determin- ing the appropriateness of high-cost and high-volume radiologic procedures. Without the widespread use of structured reporting, most institutions have vast electronic databases of 323 R adiology