학술논문

XML Schema Directory: a data structure for XML data processing
Document Type
Conference
Source
Proceedings of the First International Conference on Web Information Systems Engineering Web information systems engineering Web Information Systems Engineering, 2000. Proceedings of the First International Conference on. 1:62-69 vol.1 2000
Subject
Computing and Processing
Communication, Networking and Broadcast Technologies
XML
Data structures
Data processing
Database languages
Engines
Information technology
Acceleration
Disaster management
Data mining
Web sites
Language
Abstract
The problem addressed in this paper is the execution of XML queries over a large collection of XML documents. This paper concentrates on how to develop the necessary infrastructure to effectively manipulate XML data and it proposes a data structure, named the XML Schema Directory (XSD), as an access means to XML repositories. The aim of XSD is to accelerate query processing by quickly finding the relevant set of XML documents for a given query. This is obtained by considering only a small number of relative XML schemata and consequently a limiting number of XML documents, rather than the entire corpus of XML documents. XML schema similarity is introduced as a way to determine the relevance among XML documents which belong to the same knowledge category. The proposed algorithms for maintaining the XSD structure do not require reorganisation and they may be efficiently used in practice. An alternative advantage of the XSD structure is that it may also be used as a method for facilitating browsing.