World of Computer Science and Information Technology Journal (WCSIT) ISSN: 2221-0741 Vol. 1, No. 5, 193-197, 2011 193 Improving Semantic Schema Integration Zahra Sheikhnajdy Mehran Mohsenzadeh Mashalah Abbasi Dezfuli Department of Computer Engineering Department of Computer Engineering Department of Computer Engineering Islamic Azad University – Science & Islamic Azad University - Science & Islamic Azad University – Science & Research, Khouzestan, Iran Research Center, Tehran, Iran Research, Khouzestan, Iran z.sheikhnajdy@khouzestan.srbiau.ac.ir mohsenzadeh@srbiau.ac.ir Abbasi_masha@yahoo.com Abstract—Schema matching is a critical step in many applications, such as data warehouse loading, Online Analytical Process (OLAP), Data mining, semantic web and schema integration. This task is defined for finding the semantic correspondences between elements of two schemas. Recently, schema matching has found considerable interest in both research and practice. In this paper, some approaches for supporting semantic schema matching compared and then we suggest three solutions for improving semantic schema matching problem. Keywords—schema matching; element level matcher; structural level matcher; semantic ambiguities; step word; word sense disambiguation. I. INTRODUCTION Schema matching is the identification of database elements with similar meaning as preparation for subsequent database integration. A schema consists of a set of related elements, such as classes, or XML elements or attributes. The result of a Match operation is a mapping. A mapping consists of a set of mapping elements, each of which indicates that certain elements of schema S1 are related to certain elements of schema S2 [11]. Match is a schema manipulation operation that takes two schemas as input and returns a mapping that identifies corresponding elements in the two schemas [2], [3], [4], [5], [6], [7]. Generic schema matching system architecture shows in Fig. 1. Figure 1. Generic schema matching system architecture Schema matching is primarily studied as a piece of these other applications. For example, schema integration uses matching to find similar structures in heterogeneous schemas, which are then used as integration points. Data translation uses matching to find simple data transformations. Schema matching use in several application domains such as database application domain, for instance Data integration, Data warehousing, Data mining, E-commerce, Query processing, Peer data management, Model management and so on. Another application domain of schema matching is semantic web like Semantic web services and Xml/html to ontology. Manually schema matching is a time-consuming, error- prone, and therefore expensive process. Thus, a faster and less labor-intensive integration approach that does this job automated is needed. Over the past 20 years, different schema matching methods have been proposed and have been shown to be successful to various degrees. However, schema matching is an ongoing research area and the problem is not yet considered to be solved [12]. In this paper, several exiting schema matching introduced and compared with together, then three solutions for improving this approach introduced. The paper is organized as follows. Section II presents previous work and the basic characteristics of known matchers. Section III introduces our solutions and finally, conclusions and future work are discussed in section IV. II. EXITING SCHEMA MATCHING APROACHES In this section we present a classification of the major approaches to schema matching and describe the most popular ones. A. A Classification Of Schema Matching Approaches Schema matching is an important subtask of data integration. The core of schema matching is the operator Match which takes two schemas as input and produces a mapping between the elements of these schemas based on semantic correspondences. Implementing this operator requires an Application/Tool1 (semantic web) Application/Tool 2 (E-commerce) Application/Tool 3 (warehousing) Schema import/export Internal schema representation Generic match implementation General libraries Application/Tool 4 (schema integration)