Designing Good Semi-Structured Databases and Conceptual Modeling

Author(s): Sin Yeung Lee, Mong-Li Lee, Tok Wang Ling, Kalinichenko L.A.
Published: Conceptual Modeling - ER '99: Proc. of the 18th International Conference on Conceptual Modeling. LNCS 1728. -- Berlin-Heidelberg: Springer-Verlag, 1999. -- P. 131-145.
Semi-structured data has become prevalent with the growth of the Internet and other online information repositories. Many organizational databases are presented on the web as semi-structured data. Designing a "good" semi-structured database is increasingly crucial to prevent data redundancy, inconsistency, and update anomalies. In this paper, we define a semi-structured schema graph and identify the various anomalies that may occur in the graph. We propose a normal form for semi-structured schema graphs, S3-NF. We present two approaches to designing an S3-NF database: restructuring by decomposition and the ER approach. The first approach consists of a set of rules for decomposing a semi-structured schema graph into S3-NF. The second approach uses the ER model to remove anomalies at the semantic level.
