Interoperability for Geospatial Analysis: a semantics and ontology-
based approach
Zarine Kemp
1
, Lei Tan
1
and Jacqueline Whalley
2
1
Computing Laboratory, University of Kent
Canterbury, Kent CT2 7RY, U.K.
2
School of Computer and Information Sciences, Auckland University of Technology,
Private Bag 92006, Auckland 1020, New Zealand
Z.Kemp@kent.ac.uk
Abstract
Information extraction and integration from
heterogeneous, autonomous data resources are major
requirements for many spatial applications. Geospatial
analysis for scientific discovery involves identification of
relevant information resources, extraction and fusion of
requisite subsets of the information, application of spatial
analytical techniques and visualization of the results in an
appropriate form. The motivating application domain
underlying this research is marine environmental
management, although the principles discussed apply to a
wide range of scientific disciplines. The research
discussed in the paper focuses on integration of data
sources, data exploration and interactive data analysis. A
knowledge base is used to capture the semantics of the
spatial, temporal and thematic dimensions at a domain
level, and the context-aware framework exploited to meet
the requirements of a varied and distributed user
community with differing objectives.
Keywords: information fusion, geospatial analysis,
knowledge base, ontologies, visualization.
1 Introduction
Information technologies such as the Internet and Grid
computing have revolutionized the way that data
resources are discovered and shared. In application
domains dependent on geospatial and scientific
information, reuse, sharing and dissemination of data is a
major requirement. These information repositories are
maintained by autonomous organizations, are
heterogeneous in structure and semantics and are used by
researchers and decision-makers in various contexts and
from different perspectives. Interoperability of data and
services underpins the next phase of the World Wide
Web. Research in distributed databases, integration of
structured and semi-structured data and technologies for
mediator and information brokers have enabled
syntactical and structural heterogeneities to be overcome.
Issues relating to semantic heterogeneity are also being
tackled using metadata, ontologies and thesauri to express
Copyright © 2007, Australian Computer Society, Inc. This
paper appeared at the Eighteenth Australian Database
Conference (ADC2007), Ballarat, Victoria, Australia.
Conferences in Research and Practice in Information
Technology (CRPIT), Vol. 63. James Bailey and Alan Fekete,
Eds. Reproduction for academic, not-for profit purposes
permitted provided this text is included.
salient concepts and knowledge within a domain of
discourse.
In this paper we describe the architecture and framework
of a system for environmental information systems. We
suggest that in the context of geospatial information
systems a data integration approach based on a global
monolithic view of data, and a foundational ontology, is
not an appropriate solution. We propose an architecture
that provides interoperability, querying and analysis
capabilities for a community of researchers while
maintaining the autonomy of participating data sources.
The middleware framework uses an adaptable, scalable
knowledge base to accommodate semantic heterogeneity
and provide analysis services.
The next subsection describes a motivating application
and the data sources in the test bed. Section 2 discusses
system requirements and related work. Section 3 presents
the system architecture and details of the knowledge base.
Section 4, illustrates the interaction model using example
queries and section 5 concludes the paper.
1.1 Motivating Application
The system discussed in this paper is based on a platform
for marine research and decision support but the
requirements and principles are equally applicable to a
wide range of application areas. It is intended as a
research hub for a community of scientists who pool their
information resources and use analytical and visualization
tools for monitoring and understanding the marine
ecosystem. For example, users may wish to retrieve
detailed information about the fishing industry, study
phenomena such as algal blooms, explore the changes in
biodiversity in a particular part of the ecosystem, retrieve
applicable legislation or investigate the effects of
anthropogenic activities on particular marine species.
We discuss, briefly, the content and structural
characteristics of the data sets in the research test bed
emphasizing the geo-referenced attributes of the
information stores.
Industrial activity data: the two main activities are
fisheries and aggregate dredging for the building industry.
Management of fishing activities is regulated by the
Common Fisheries Policy (CFP) legislation of the
European Union using sea areas defined by the
International Council for the Exploration of the Sea