by Manuel Llavador and José H. Canós
A typical case of low-level interoperability, particularly frequent in the Digital Libraries world, is the federation of collections via metadata conversion. Roughly speaking, a federation consists of a number of repositories, each with its own format, which agree on a common format for metadata exchange. Any metadata record must then be transformed into the common format before it is sent as the result of a request. In this note, we report on XSMapper, a solution to the metadata conversion problem based on semantic mappings. Although it was developed to federate collections in a specific project, XSMapper is domain-independent and can be used in any context where an XML schema transformation is required.
Making distributed and heterogeneous systems interoperate has been a challenge for researchers and practitioners over the last decade. The complexity of the problem has led to solutions with increasing levels of sophistication, depending on the requirements imposed by the domains of the application. Different forms of middleware represent the most general solution for achieving full interoperability, but in some cases simpler solutions can be used. This is particularly the case when the requirement for interoperability originates from the heterogeneity of (meta)data formats, as often happens in the Digital Libraries world.
Such a problem arose during the development of BibShare, an environment for bibliography management funded by Microsoft Research Cambridge. BibShare allows users to collect bibliographic references, insert citations into documents and automatically generate a document's bibliography. Unlike former tools, BibShare works with a variety of word-processing systems, and permits references to be inserted not only from personal citation collections, but also from bibliography servers available on the Internet, such as DBLP. As might be expected, each collection has its own metadata format(s). In order to unify the result sets of federated searches and return these data to the user, each record retrieved must be converted to a common format. We call this the BibShare Bibliographic Format (BBF).
Given that XML is used to exchange data, the natural solution to the problem is to use XSL transformations between records. For a collection to be added to the BibShare Federation, the owner of the collection must create an XSL template that transforms the records to the BBF. However, writing an XSL template is not a trivial task, and any tool supporting template generation would represent a significant improvement to the federation process.
Since the problem of document transformation goes beyond the scope of BibShare, we developed a general solution to the problem. In its most general version, this can be stated as follows: given two XML Schemas S1 and S2 that represent respectively the source and target formats of a transformation, obtain as automatically as possible the XSL template that transforms S1-valid documents into S2-valid documents.
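To make the problem statement concrete, the following minimal Python sketch derives an XSL template from a flat table of element-to-element mappings. It is an illustration only and does not reproduce XSMapper's actual algorithm. The function name `build_stylesheet`, the source element `inproceedings`, the target element `record` and the field names in `mappings` are all hypothetical; in particular, they are not the real BBF element names.

```python
import xml.etree.ElementTree as ET

XSL_NS = "http://www.w3.org/1999/XSL/Transform"
ET.register_namespace("xsl", XSL_NS)


def build_stylesheet(source_root, target_root, mappings):
    """Return XSLT 1.0 text built from a {target_element: source_xpath} mapping.

    Hypothetical helper: one template matches the source root element and
    emits one target element per mapping entry.
    """
    stylesheet = ET.Element(f"{{{XSL_NS}}}stylesheet", {"version": "1.0"})
    template = ET.SubElement(
        stylesheet, f"{{{XSL_NS}}}template", {"match": "/" + source_root}
    )
    # Literal result element that wraps the converted record.
    record = ET.SubElement(template, target_root)
    for target_name, source_path in mappings.items():
        field = ET.SubElement(record, target_name)
        # xsl:value-of copies the text of the first node selected in the source.
        ET.SubElement(field, f"{{{XSL_NS}}}value-of", {"select": source_path})
    return ET.tostring(stylesheet, encoding="unicode")


# Assumed mapping from a DBLP-like record to an invented common format.
mappings = {"title": "title", "creator": "author", "date": "year"}
xsl_text = build_stylesheet("inproceedings", "record", mappings)
print(xsl_text)
```

The sketch handles only one-to-one mappings between single-valued elements. Repeated elements (such as several authors), nested structures and value conversions would need richer constructs such as `xsl:for-each`, which is part of what makes writing templates by hand laborious.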
XML Semantic Mapper (XSMapper) solves the problem based on the definition of semantic mappings between source and target schemas, following three steps (see figure):
Links:
Bibshare: http://www.bibshare.org
XSMapper: http://bibshare.dsic.upv.es/XSMapper.exe
Please contact:
José H. Canós, Technical University of Valencia / SpaRCIM
E-mail: jhcanos@dsic.upv.es
http://www.dsic.upv.es/~jhcanos