An Analysis of Semi-structured Data Schema Approaches
Abstract
The World Wide Web provides a labyrinth of resources easily navigable by humans. The context of the links traversed is closely related to our ability to decipher meaning between subtle but important differences in the use of a word or phrase. A vision for a future web includes the ability of a computer to decipher meaning within a specific context or knowledge domain. Current research is focusing on a set of web-enabled languages that will allow future web pages to contain the context or meaning of the information being displayed. These languages are attempting to provide both structure and meaning (schema and semantics) to semi-structured data. Do these languages provide the infrastructure needed to have the future web become a reality? Can we develop languages that accurately describe the schema and semantics of web-based information? Will common database theory and techniques, especially query processing, be able to leverage this future vision for the web?
Reference List
- Abiteboul, S., Buneman, P., and Suciu, D. (2000). Data on the Web: From Relations to Semistructured Data and XML. San Francisco, California: Morgan Kaufmann Publishers.
- Adler, S., Berglund, A., Caruso, J., Deach, S., Graham, T., Grosso, P., Gutentag, E., Milowski, A., Parnell, S., Richman, J., and Zilles, S. (2001). Extensible Stylesheet Language (XSL) Version 1.0, W3C Recommendation 15 October 2001, Copyright 2001 W3C. Retrieved January 2, 2003 from http://www.w3.org/TR/.
- Beckett, D., and McBride, B. (2003). RDF/XML Syntax Specification (Revised), W3C Working Draft 23 January 2003, Copyright 2003 W3C. Retrieved January 28, 2003 from http://www.w3.org/TR/.
- Bourret, R., Cowan, J., Macherius, I., and St. Laurant, S. (1999). Document Definition Markup Language (DDML) Specification, Version 1.0, W3C Note 19 January 1999. Retrieved January 2, 2003 from http://www.w3.org/TR/.
- Bray, T., Frankstron, C., and Malhotra, A. (1998). Document Content Description for XML, Submission to the World Wide Web Consortium 31 July 1998. Retrieved January 2, 2003 from http://www.w3.org/TR/.
- Bray, T., Paoli, J., Sperberg-McQueen, C.M., and Maler, E. (2000). Extensible Markup Language (XML) 1.0 (Second Edition), W3C Recommendation 6 October 2000, Copyright 2000 W3C. Retrieved January 2, 2003 from http://www.w3.org/TR/.
- Brickley, D., and Guha, R.V. (2003). RDF Vocabulary Description Language 1.0: RDF Schema, W3C Working Draft 23 January 2003, Copyright 2003 W3C. Retrieved January 27, 2003 from http://www.w3.org/TR/.
- Buneman, P., Davidson, S., Fan, W., Hara, C., and Tan, W. (2001). Keys for XML. Proceedings of the Tenth International Conference on World Wide Web, Hong Kong, Hong Kong, May 1-5, 2001, 201-210.
- Clark, J. (1999). XSL Transformations (XSLT) Version 1.0, W3C Recommendation 15 November 1999, Copyright 1999 W3C. Retrieved January 2, 2003 from http://www.w3.org/TR/.
- Connolly, D., van Harmelen, F., Horrocks, I., McGuinness, D.L., Patel-Schneider, P.F., and Stein, L.A. (2001). DAML+OIL (March 2001) Reference Description, W3C Note 18 December 2001. Retrieved February 1, 2003 from http://www.w3.org/TR/.
- Davidson, A., Fuchs, M., Hedin, M., Jain, M., Koistinen, J., Lloyd, C., Maloney, M., and Schwarzhof, K. (1999). Schema for Object-Oriented XML 2.0, W3C Note 30 July 1999. Retrieved January 7, 2003 from http://www.w3.org/TR/.
- Dean, M., Connolly, D., van Harmelen, F., Hendler, J., Horrocks, I., McGuinness, D.L., Patel-Schneider, P.F., and Stein, L.A. (2002). Web Ontology Language (OWL) Reference Version 1.0, W3C Working Draft 12 November 2002, Copyright 2002 W3C. Retrieved January 7, 2003 from http://www.w3.org/TR/.
- Decker, S., and Sintek, M. (2003). The Semantic Web Community Portal. Retrieved February 1, 2003 from http://www.semanticweb.org/.
- Fallside, D.C. (2001). XML Schema Part 0: Primer, W3C Recommendation 2 May 2001, Copyright 2001 W3C. Retrieved January 2, 2003 from http://www.w3.org/TR/.
- Fillies, C., Wood-Albrecht, G., and Weichardt, F. (2002). A Pragmatic Application of the Semantic Web using SemTalk. Proceedings of the Eleventh International Conference on World Wide Web, Honolulu, Hawaii, USA, May 7-11, 2002, 686-692.
- Golfarelli, M., Rizzi, S., and Vrdoljak, B. (2001). Data Warehouse Design from XML Sources. Proceedings of the Fourth International Workshop on Data Warehousing and OLPA, Atlanta, Georgia, USA, November, 2001, 40-47.
- Handschuh, S., and Staab, S. (2002). Authoring and Annotation of Weg Pages in CREAM. Proceedings of the Eleventh International Conference on World Wide Web, Honolulu, Hawaii, USA, May 7-11, 2002, 462-473.
- Heflin, J., Hendler, J., Luke, S., Gasarch, C., Zhendong, Q., Spector, L., and Rager, D. (2003). SHOE: Simple HTML Ontology Extensions. Retrieved February 1, 2003 from http://www.cs.umd.edu/projects/plus/SHOE/
- Heflin, J., Volz, R., and Dale, J. (2002). Requirements for a Web Ontology Language, W3C Working Draft 8 July 2002, Copyright 2002 W3C. Retrieved January 7, 2003 from http://www.w3.org/TR/.
- Laender, A.H., Ribeiro-Neto, B.A., da Silva, A.S., and Teixeira, J.S. (2002). A Brief Survey of Web Data Extraction Tools. ACM SIGMOD Record, 31(2), 83-92.
- Lassila, O., and Swick, R.R. (1999). Resource Description Framework (RDF) Model and Syntax Specification, W3C Recommendation 22 February 1999, Copyright 1997, 1998, 1999 W3C. Retrieved January 2, 2003 from http://www.w3.org/TR/.
- Layman, A., Jung, E., Maler, E., Thompson, H.S., Paoli, J., Tigue, J., Mikula, N.H., and DeRose, S. (1998). XML-Data, W3C Note 05 January 1998. Retrieved October 25, 2002 from http://www.w3.org/TR/.
- Lo, M., Chen, S., Padmanabhan, S., and Chung, J. (2001). XAS: A System for Accessing Componentized, Virtual XML Documents. Proceedings of the 23rd International Conference on Software Engineering, Toronto, Ontario, Canada, May 12-19, 2001, 493-502.
- Manola, F., Miller, E. and McBride, B. (2003). RDF Primer, W3C Working Draft 23 January 2003, Copyright 2003 W3C. Retrieved January 28, 2003 from http://www.w3.org/TR/.
- McBride, B. (2002). Resource Description Framework (RDF): Concepts and Abstract Syntax, W3C Working Draft 08 November 2002, Copyright 2002 W3C. Retrieved January 14, 2003 from http://www.w3.org/TR/.
- M ller, A. (2002). Document Structure Description 2.0, December 19, 2002, Copyright 2002 BRICS. Retrieved January 2, 2003 from http://www.brics.dk/DSD/dsd2.html/.
- Patel-Schneider, P.F., Hayes, P., Horrocks, I., and van Harmelen, F. (2002). Web Ontology Language (OWL) Abstract Syntax and Semantics, W3C Working Draft 8 November 2002, Copyright 2002 W3C. Retrieved January 7, 2003 from http://www.w3.org/TR/.
- Smith, M.K., McGuinness, D., Volz, R., and Welty, C. (2002). Web Ontology Language (OWL): Guide Version 1.0, W3C Working Draft 4 November 2002, Copyright 2002 W3C. Retrieved January 7, 2003 from http://www.w3.org/TR/.
- Su, H., Kuno, H., and Rundensteiner, E.A. (2001). Automating the Transformation of XML Documents. Proceedings of the Third International Workshop on Web Information and Data Management, Atlanta, Georgia, USA, November, 2001, 68-75.
- Vianu, V. (2001). A Web Odyssey: from Codd to XML. Proceedings of the Twentieth International Conference on Management of Data and Symposium on Principles of Database Systems, Santa Barbara, California, USA, May 21-24, 2001, 1-15.
Last updated on February 17, 2003.