A property-based model for lung cancer diagnosis Alma Barranco-Mendoza 1 , Deryck R. Persaud 2 , VerĂ³nica Dahl 3 Keywords: cancer diagnosis, biomarkers, molecular targets, logic programming, constraint handling rules, concept formation. 1 Introduction. To this day, lung cancer remains the leading cause of cancer death for both sexes: almost one- third of cancer deaths in men and almost one-quarter in women. [1] Survival rates for lung cancer are low. It is suggested that only about 15% of patients are diagnosed at the early stages. The average 5-year survival rate for patients that are diagnosed early is 48% compared to 15% for those who were diagnosed at the later stages. [2] It is well known that the survival rates can be improved by the early detection of pre-invasive lesions, which are believed to be the possible precursors of malignant tumours. Although new technology is allowing numerous early lesions to be detected, it is becoming clear that only a small percentage of these will actually progress to cancer. Currently a lot of work is being done on the image analysis field, however, at the early development stages of the lesion, the information that can be obtained from lung imaging analysis (X-ray, CAT scans, MRI, PET scans, etc.) is quite limited. To completely understand the evolution of normal epithelium into invasive neoplasia would require the understanding of the genetic relationship of the cells in a pre-invasive neoplastic lesion during the development into invasive cancer. In recent years, biological research has been done in the area of cancer genetics that has shown that cancer results from an accumulation of key mutations in expanding clones originating from tissue-specific stem cells. [3] The recent availability of the human genome sequence, and the development of high throughput genomic technologies and methods for isolating selected cell populations have started to give us the opportunities for understanding how human cancers develop. This information will drastically improve cancer diagnosis and treatment through the discovery of disease-specific molecular targets. As well, recent research has also been focusing on the detection of biomarkers obtained from serum and sputum proteomic analysis [4]. Unfortunately, there is not much work done in terms of computational tools to assist in the analysis of all the genetic and molecular information in addition to the radiological, serum and sputum data that could determine with better accuracy whether an early lesion would progress into cancer or not. As well, for a system to be more valuable it should be able to provide some kind of diagnosis even if given incomplete patient information, as not all tests can or will be done on said patient. Our research intends to address this with the development of a multidisciplinary property-based model for early lung cancer diagnosis. 2 Concept Formation Rules Concept Formation Rules (CFR) [5] is a directly executable new cognitive model of knowledge construction inspired in constructivist theory as well as in recent natural language processing 1 School of Computing Science, Simon Fraser University, 8888 University Drive, Burnaby, BC, V5A 1S6, Canada. E-mail: abm@cs.sfu.ca 2 Infogenetica Bioinformatics, 3197 Tahsis Avenue, Coquitlam, BC, V3B 6E2, Canada E-mail: deryck@infogenetica.com 3 School of Computing Science, Simon Fraser University, 8888 University Drive, Burnaby, BC, V5A 1S6, Canada. E-mail: veronica@cs.sfu.ca