Using WordNet and SUMO to Determine Source Domains of Conceptual Metaphors Siaw-Fong Chung Graduate Institute of Linguistics National Taiwan University No. 1, Sec. 4, Roosevelt Road Taipei, 106 Taiwan claricefong6376@hotmail.com Kathleen Ahrens Graduate Institute of Linguistics National Taiwan University No. 1, Sec. 4, Roosevelt Road Taipei, 106 Taiwan kathleenahrens@yahoo.com Chu-Ren Huang Graduate Institute of Linguistics Academia Sinica No.128, Sec. 2, Academia Road, Nankang Taipei, 115 Taiwan churen@gate.sinica.edu.tw Abstract In previous work by Ahrens et al. (2003) and Chung et al. (2003a, 2003b), the extraction of conceptual metaphors was carried out based on prototypicality. Based on this view, the most prototypical mappings of source to target domains in the corpora help determining the Mapping Principle of the conceptual metaphors (Ahrens 2002). However, this approach is limited in the aspect that the identification of source domains must be manually determined. Therefore, the purpose of this paper is to suggest a way to reduce the manual work required for the determination of the source domain as well as to operationalize the steps to define different source domains. This paper incorporates the WordNet lexical representation and SUMO ontology in the identification of the related nodes to a group of linguistics expressions. The lexicons in the WordNet and SUMO are linked through works such as Niles and Pease (2003). 1 Introduction The Conceptual Mapping (CM) Model (Ahrens 2002) proposed that a specific Mapping Principle can be generated through observing the source and target domain mappings in conceptual metaphors. For instance, the list of words in Table 1 below were suggested by Ahrens, Chung & Huang (2003) to be related to the source domain of PERSON. They extracted a total of 2000 instances from the Academic Sinica Balanced Corpus of Mandarin Chinese. From these instances, they extracted metaphorical expressions through observing the source-target domain mappings. For instance, chengzhang ‘growth’ below shows a mapping between the concrete source domain of PERSON and the abstract domain of ECONOMY. The analyses were carried out manually. Table 1: ECONOMY IS A PERSON 成長 (growth) 衰退 (dysfunction) 成長期 (growth period) 病狀 (symptoms) 命脈 (lifeblood) 衰頹(weakness and degeneration) 成長 (grow) 衰退 (to become dysfunctional) 復甦 (regain consciousness) 惡化 (deteriorate) 恢復 (recover) Ahrens, Chung and Huang (2003) then counted the number of instances of which these expressions occurred. Based on the most frequent (i.e., the most prototypical) mappings, the Mapping Principles (MP) were suggested. For instance, the MP for ECONOMY IS A PERSON in Table 1 is ‘economy is person because people have a life cycle and economy has growth cycle.’ In this paper, we further extend the work of Ahrens et al. (2003) but minimize the manual work required when determining the source domains. We suggest that the identification of source domains can be solidified through examining a) the WordNet senses and explanations and b) the SUMO nodes