ACM SIGMOD Anthology ACM SIGMOD dblp.uni-trier.de

Data Management Support for Statistical Data Editing and Subset Selection.

Robert A. Burnett, James J. Thomas: Data Management Support for Statistical Data Editing and Subset Selection. SSDBM 1981: 88-102
@inproceedings{DBLP:conf/ssdbm/BurnettT81,
  author    = {Robert A. Burnett and
               James J. Thomas},
  editor    = {Harry K. T. Wong},
  title     = {Data Management Support for Statistical Data Editing and Subset
               Selection},
  booktitle = {Proceedings of the First LBL Workshop on Statistical Database
               Management, Melno Park, California, USA, December 2-4, 1981},
  publisher = {Lawrence Berkeley Laboratory},
  year      = {1981},
  pages     = {88-102},
  ee        = {db/conf/ssdbm/BurnettT81.html},
  crossref  = {DBLP:conf/ssdbm/81},
  bibsource = {DBLP, http://dblp.uni-trier.de}
}

Abstract

Statistical analysis of large data sets often involves an initial data editing and preparation phase to check the validity of individual data items, check for consistency among related data, correct erroneous data, and supply (impute) values for missing data where possible. During this preparatory phase of analysis , it is often necessary to partition the data set into a number of subsets by logical selection and/or random sampling techniques for purposes of hypothesis testing. This paper examines the data management support required by these editing and subsetting operations in terms of data descriptions, data manipulation functions, and logical and physical data structures. The design of a data management system which seeks to meet these requirements is described in detail. The system, called SDB, is built around a self-describing transposed file structure and supporting data access software. SDB representations of some logical data structures which are commonly encountered in statistical databases are also described. Experiences with a partial implementation of the system and its application in an interactive data editor have been encouraging.

ACM SIGMOD Anthology

CDROM Version: Load the CDROM "Volume 2 Issue 5, SSDBM, DBPL, KRDB, ADBIS, COOPIS, SIGBDP" and ... DVD Version: Load ACM SIGMOD Anthology DVD 1" and ...

Printed Edition

Harry K. T. Wong (Ed.): Proceedings of the First LBL Workshop on Statistical Database Management, Melno Park, California, USA, December 2-4, 1981. Lawrence Berkeley Laboratory 1982
Contents CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML

References

[1]
...
[2]
...
[3]
...
[4]
...
[5]
...
[6]
Ryosuke Hotaka, Masaaki Tsubaki: Self-Descriptive Relational Data Base. VLDB 1977: 415-426 CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[7]
Don S. Batory: On Searching Transposed Files. ACM Trans. Database Syst. 4(4): 531-544(1979) CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[8]
M. J. Turner, R. Hammond, P. Cotton: A DBMS for Large Statistical Databases. VLDB 1979: 319-327 CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML
[9]
...
[10]
Michael Stonebraker: Operating System Support for Database Management. Commun. ACM 24(7): 412-418(1981) CiteSeerX Google scholar pubzone.org BibTeX bibliographical record in XML

Copyright © Wed Dec 9 20:16:06 2009 by Michael Ley (ley@uni-trier.de)