ichirn 01 CULTURAI. HERITAGE a n d TECHNOLOGIES i n t h e THIRD M I L L E N N I U M 1 The "XMURepetti" Project: Encoding and Manipulation of Temporal Information in Historical Text Sources Fabio Grandi (*) and Federica Mandreoli (#) "'C.S.1.TE.-C.N.R. and D.E.I.S., University of Bologna, Italy E-mail: fgrandi@deis.unibo.it '" D.S.I., University of Modena and Reggio Emilia, Italy E-mail: fmandreoli@dsi.unimo.it 4BSTRACT The paper deals with the deployment of XML-related technologies in Cultural Heritage applications concerning the :ncoding of temporal semantics in the digital version of historical documents. Since written sources have often the same importance as material evidence in mcdicval archaeology, our approach can be applied to the development of tools for the support of archaeological research. In previous work, we developed an XMLIXSL infrastructure called "The Valid Web" for the definition and management of historical information within Web documents. In this paper we describe the application and extension of such an approach to the realization of the electronic version of Repetti's historical-geographical dictionary of Tuscany. The extension concerns the uniform management of temporal indeterminacy, the use of multiple calendars and granularities and the proposed solutions have been inspired by similar research done for temporal query languages. From the user viewpoint, the proposed XML extensions allow the addition of historical metainformation to the encoded text sources and their "intelligent" temporal navigation via standard Web browsers. The project also involves the definition of optimized search algorithms, storage and temporal indexing of XML-encoded Repetti's Dictionary items, implementation of a prototype. As a byproduct, also a tool for computer-aided temporal XML- encoding of text sources will be developed to be used by Cultural Heritage operators (e.g. archaeology researchers). KEYWORDS: XML, semantic Web, temporal data management, digital libraries. INTRODUCTION In the information processing field, XML [21] is becoming a new standard for data management and exchange over the Internet [I]. In particular, a great deal of interest concerns its adoption for the representation and integration of structured and unstructured data. Moreover, an outstanding (and very appealing for Cultural Heritage applications) feature is the capability of easily encoding semantic metainformation in XML documents, to be automatically used by advanced computer tools, like "intelligent" search engines (towards a Semantic Web [23]). In this context, our research interests focused in recent years on the introduction of temporal aspects into the Web, by adapting and extending