20 International Journal of Web Applications Volume 1 Number 1 March 2009
TBL2RDF: HTML Table to RDF Translator
V. Kiran Kumar (1), K. Rajasekhara Rao (2)
(1) Computer Science Department, Dravidian University, Kuppam – 517 425, India
kirankumar.v@rediffmail.com
(2) Koneru Lakshmaiah College of Engineering, Green Fields, Vaddeswaram 522502, Guntur Dt., Andhra Pradesh, India
krr_it@yahoo.co.in
ABSTRACT: A large quantity of data on the web is published as HTML tables. Because these tables lack
machine-understandable information, their data cannot be processed by machines. This paper describes a translator,
a tool implemented as a Java application that allows anyone, even non-experts, to prepare data in HTML table format for use on
the Semantic Web. It does so by automatically translating data written in HTML table format into Resource
Description Framework Schema (RDFS). The XML representation of RDF has several advantages, among them compatibility
with existing Semantic Web standards.
Keywords: World Wide Web (WWW), Resource Description Framework (RDF), Resource Description Framework Schema
(RDFS), Ontology.
Received: 14 October 2008. Revised 19 November 2008. Accepted 2 December 2008
© 2009 D-line. All rights reserved.
1. Introduction
The goal of the Semantic Web is that web-based agents can process and “understand” data rather than merely display
it. A large quantity of data on the web is presented in HTML tables. Because these tables
lack machine-understandable information, their data cannot be used by web-based agents or other machines. RDF is
a language for representing resources on the web in a form that machines can process rather than merely display.
Hence, a tool was designed that translates purely syntactic HTML tables into RDF files. The main advantage of this
tool is that a layperson with little knowledge of RDF can easily create RDF documents simply by laying out the data in
HTML table format.
There are a number of automated conversion tools, such as D2RQ, java2rdf, javadoc2rdf, and Excel2rdf, that convert
application-specific formats into RDF so that the data can be used with RDF tools and integrated with other data. The tool
presented here differs in its input: it is designed as a Java application that takes an HTML file containing a table and produces an RDFS
ontology. Because much of the data on web pages is in table format, this tool helps convert that data
into an RDF ontology.
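To make the input and output concrete, the following is a minimal sketch of one way an HTML table could be mapped to RDF/XML. It is not the TBL2RDF implementation: the mapping (header cells become properties, each data row becomes a resource), the class name TableToRdfSketch, the namespace http://example.org/table#, and the row1, row2, ... identifiers are all assumptions made for illustration. The regular-expression parsing handles only simple, well-formed tables and does not decode HTML entities.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class TableToRdfSketch {
    // Hypothetical namespace for generated terms; not taken from the paper.
    static final String NS = "http://example.org/table#";

    private static final Pattern ROW = Pattern.compile(
            "<tr\\b[^>]*>(.*?)</tr>", Pattern.CASE_INSENSITIVE | Pattern.DOTALL);
    private static final Pattern CELL = Pattern.compile(
            "<t[hd]\\b[^>]*>(.*?)</t[hd]>", Pattern.CASE_INSENSITIVE | Pattern.DOTALL);

    // Extracts the text of every cell, row by row, dropping nested markup.
    static List<List<String>> parseRows(String html) {
        List<List<String>> rows = new ArrayList<>();
        Matcher r = ROW.matcher(html);
        while (r.find()) {
            List<String> cells = new ArrayList<>();
            Matcher c = CELL.matcher(r.group(1));
            while (c.find()) {
                cells.add(c.group(1).replaceAll("<[^>]+>", "").trim());
            }
            if (!cells.isEmpty()) {
                rows.add(cells);
            }
        }
        return rows;
    }

    // Turns a header label into a name usable as an XML element name.
    static String xmlName(String label) {
        String n = label.trim().replaceAll("[^A-Za-z0-9_]", "_");
        if (n.isEmpty() || Character.isDigit(n.charAt(0))) {
            n = "_" + n;
        }
        return n;
    }

    static String escape(String s) {
        return s.replace("&", "&amp;").replace("<", "&lt;").replace(">", "&gt;");
    }

    // Assumed mapping: first row = property names, later rows = resources.
    static String toRdfXml(List<List<String>> rows) {
        StringBuilder sb = new StringBuilder();
        sb.append("<?xml version=\"1.0\"?>\n");
        sb.append("<rdf:RDF xmlns:rdf=\"http://www.w3.org/1999/02/22-rdf-syntax-ns#\"\n");
        sb.append("         xmlns:ex=\"").append(NS).append("\">\n");
        List<String> props = new ArrayList<>();
        if (!rows.isEmpty()) {
            for (String header : rows.get(0)) {
                String p = xmlName(header);
                props.add(p);
                sb.append("  <rdf:Property rdf:about=\"").append(NS).append(p).append("\"/>\n");
            }
        }
        for (int i = 1; i < rows.size(); i++) {
            List<String> row = rows.get(i);
            sb.append("  <rdf:Description rdf:about=\"").append(NS)
              .append("row").append(i).append("\">\n");
            int n = Math.min(props.size(), row.size());
            for (int j = 0; j < n; j++) {
                sb.append("    <ex:").append(props.get(j)).append('>')
                  .append(escape(row.get(j)))
                  .append("</ex:").append(props.get(j)).append(">\n");
            }
            sb.append("  </rdf:Description>\n");
        }
        sb.append("</rdf:RDF>\n");
        return sb.toString();
    }

    public static void main(String[] args) {
        // Made-up sample data for illustration only.
        String html = "<table><tr><th>Name</th><th>City</th></tr>"
                    + "<tr><td>Asha</td><td>Kuppam</td></tr></table>";
        System.out.print(toRdfXml(parseRows(html)));
    }
}
```

Running main prints an RDF/XML document that declares one rdf:Property per column header and one rdf:Description per data row. A production tool would more likely use a real HTML parser and an RDF library instead of regular expressions and string building.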
The rest of the paper is organized as follows: Section 2 gives a brief description of the Resource Description Framework
(RDF) and RDFS. Section 3 discusses HTML tables. Section 4 discusses the tool architecture and its prototype. Section 5
illustrates the tool with a scenario, and Section 6 concludes and outlines future work on the tool.