20. ICDE 2004: Boston, MA, USA
Z. Meral Özsoyoglu, Stanley B. Zdonik (Eds.): Proceedings of the 20th International Conference on Data Engineering, ICDE 2004, 30 March - 2 April 2004, Boston, MA, USA. IEEE Computer Society 2004 ISBN 0-7695-2065-0
Keynote Speakers
Eric K. Neumann: Can A Semantic Web for Life Sciences Improve Drug Discovery? 2
Steven Hagan: Driving Forces in Database Technology. 3
David Lehman: Enabling Communities of Knowledge Workers. 4
Research Sessions
Indexing (I)
Nick Koudas, Beng Chin Ooi, Heng Tao Shen, Anthony K. H. Tung: LDC: Enabling Search By Partial Distance In A Hyper-Dimensional Space. 6-17
David B. Lomet: Simple, Robust and Highly Concurrent B-trees with Node Deletion. 18-27
Thanaa M. Ghanem, Rahul Shah, Mohamed F. Mokbel, Walid G. Aref, Jeffrey Scott Vitter: Bulk Operations for Space-Partitioning Trees. 29-40
Semi-Structured Data and XML (I)
Rajasekar Krishnamurthy, Venkatesan T. Chakaravarthy, Raghav Kaushik, Jeffrey F. Naughton: Recursive XML Schemas, Recursive XML Queries, and Relational Storage: XML-to-SQL Query Translation. 42-53
Ning Zhang, Varun Kacholia, M. Tamer Özsu: A Succinct Physical Storage Scheme for Efficient Evaluation of Path Queries in XML. 54-65
Xiaodong Wu, Mong-Li Lee, Wynne Hsu: A Prime Number Labeling Scheme for Dynamic Ordered XML Trees. 66-78
Data Mining (I)


James Caverlee, Ling Liu, David Buttler: Probe, Cluster, and Discover: Focused Extraction of QA-Pagelets from the Deep Web. 103-114
Query Processing (I)
Shimin Chen, Anastassia Ailamaki, Phillip B. Gibbons, Todd C. Mowry: Improving Hash Join Performance through Prefetching. 116-127
Gao Cong, Beng Chin Ooi, Kian-Lee Tan, Anthony K. H. Tung: Go Green: Recycle and Reuse Frequent Patterns. 128-139
Distributed, Parallel, Mobile (I)
Torsten Suel, Patrick Noel, Dimitre Trendafilov: Improved File Synchronization Techniques for Maintaining Large Replicated Collections over Slow Networks. 153-164
Ozgur D. Sahin, Abhishek Gupta, Divyakant Agrawal, Amr El Abbadi: A Peer-to-peer Framework for Caching Range Queries. 165-176
Per-Åke Larson, Jonathan Goldstein, Jingren Zhou: MTCache: Transparent Mid-Tier Database Caching in SQL Server. 177-188
Spatio-Temporal Querying

Jimeng Sun, Dimitris Papadias, Yufei Tao, Bin Liu: Querying about the Past, the Present, and the Future in Spatio-Temporal. 202-213
Yufei Tao, George Kollios, Jeffrey Considine, Feifei Li, Dimitris Papadias: Spatio-Temporal Aggregation Using Sketches. 214-225
Query Processing (II)
Surajit Chaudhuri, Venkatesh Ganti, Luis Gravano: Selectivity Estimation for String Predicates: Overcoming the Underestimation Problem. 227-238
Norman May, Sven Helmer, Guido Moerkotte: Nested Queries and Quantifiers in an Ordered Context. 239-250
Mohamed F. Mokbel, Ming Lu, Walid G. Aref: Hash-Merge Join: A Non-blocking Join Algorithm for Producing Fast and Early Join Results. 251-262
Semi-Structured Data and XML (II)
Neoklis Polyzotis, Minos N. Garofalakis, Yannis E. Ioannidis: Selectivity Estimation for XML Twigs. 264-275
Atsuyuki Morishima, Hiroyuki Kitagawa, Akira Matsumoto: A Machine Learning Approach to Rapid Development of XML Mapping Queries. 276-287
Indexing (II)
Dimitris Papadias, Qiongmao Shen, Yufei Tao, Kyriakos Mouratidis: Group Nearest Neighbor Queries. 301-312
Rui Zhang, Beng Chin Ooi, Kian-Lee Tan: Making the Pyramid Technique Robust to Query Types and Workloads. 313-324
Naresh Neelapala, Romil Mittal, Jayant R. Haritsa: SPINE: Putting Backbone into String Indexing. 325-336
Streams
Themistoklis Palpanas, Michail Vlachos, Eamonn J. Keogh, Dimitrios Gunopulos, Wagner Truppel: Online Amnesic Approximation of Streaming Time Series. 339-349
Brian Babcock, Mayur Datar, Rajeev Motwani: Load Shedding for Aggregation Queries over Data Streams. 350-361
Xuemin Lin, Hongjun Lu, Jian Xu, Jeffrey Xu Yu: Continuously Maintaining Quantile Summaries of the Most Recent N Elements over a Data Stream. 362-373
Data Mining (II)
Ding-Ying Chiu, Yi-Hung Wu, Arbee L. P. Chen: An Efficient Algorithm for Mining Frequent Sequences by a New Strategy without Support Counting. 375-386
Zaiqing Nie, Subbarao Kambhampati: A Frequency-based Approach for Mining Coverage Statistics in Data Integration. 387-398
Xiaoxin Yin, Jiawei Han, Jiong Yang, Philip S. Yu: CrossMine: Efficient Classification Across Multiple Database Relations. 399-410
Distributed, Parallel, Mobile (II)
Witold Litwin, Thomas J. E. Schwarz: Algebraic Signatures for Scalable Distributed Data Structures. 412-423
Songting Chen, Jun Chen, Xin Zhang, Elke A. Rundensteiner: Detection and Correction of Conflicting Source Updates for View Maintenance. 436-447
Streams and Sensors
Jeffrey Considine, Feifei Li, George Kollios, John W. Byers: Approximate Aggregation Techniques for Sensor Databases. 449-460
Surajit Chaudhuri, Arnd Christian König, Vivek R. Narasayya: SQLCM: A Continuous Monitoring Framework for Relational Database Engines. 473-484
Middleware, Workflow
Roger S. Barga, Shimin Chen, David B. Lomet: Improving Logging and Recovery Performance in Phoenix/App. 486-497
Mohamed F. Mokbel, Walid G. Aref, Khaled M. Elbassioni, Ibrahim Kamel: Scalable Multimedia Disk Scheduling. 498-509
Web Data Management
Sihem Amer-Yahia, Yannis Kotidis: Web-Services Architecture for Efficient XML Data Exchange. 523-534
David T. McWherter, Bianca Schroeder, Anastassia Ailamaki, Mor Harchol-Balter: Priority Mechanisms for OLTP and Transactional Web Applications. 535-546
Zhenyu Liu, Chang Luo, Junghoo Cho, Wesley W. Chu: A Probabilistic Approach to Metasearching with Adaptive Probing. 547-558
Middleware, Security


Radu Sion: Proving Ownership over Categorical Data. 584-595
Database Applications (I)

Dieter Gawlick, Dmitry Lenkov, Aravind Yalamanchi, Lucy Chernobrod: Applications for Expression Data in Relational Database System. 609-620
Benjamin Bin Yao, M. Tamer Özsu, Nitin Khandelwal: XBench Benchmark and Performance Testing of XML DBMSs. 621-632
Data Warehouse and OLAP
Zhenqiang Tan, Anthony K. H. Tung: Substructure Clustering on Sequential 3d Object Datasets. 634-645
H. V. Jagadish, Raymond T. Ng, Beng Chin Ooi, Anthony K. H. Tung: ItCompress: An Iterative Semantic Compression Algorithm. 646-657
Ying Feng, Divyakant Agrawal, Amr El Abbadi, Ahmed Metwally: Range CUBE: Efficient Cube Computation by Exploiting Data Correlation. 658-669
Semi-Structured Data and XML (III
Denilson Barbosa, Alberto O. Mendelzon, Leonid Libkin, Laurent Mignet, Marcelo Arenas: Efficient Incremental Validation of XML Documents. 671-682

Scientific, Biological Databases, Bio-informatics
Dennis Shasha, Jason Tsong-Li Wang, Sen Zhang: Unordered Tree Mining with Applications to Phylogeny. 708-719
Srikanta J. Bedathur, Jayant R. Haritsa: Engineering a Fast Online Persistent Suffix Tree Construction. 720-731
Xiaosong Ma, Marianne Winslett, John Norris, Xiangmin Jiao, Robert Fiedler: GODIVA: Lightweight Data Management for Scientific Visualization Applications. 732-743
Database Applications (II)
Kothuri Venkata Ravi Kanth, Siva Ravada, Ning An: Incorporating Updates in Domain Indexes: Experiences with Oracle Spatial R-trees. 745-753
Kuiyang Lou, S. Prabhakar, Karthik Ramani: Content-based Three-dimensional Engineering Shape Search. 754-765
Kai Xu, Xiaofang Zhou, Xuemin Lin: Direct Mesh: a Multiresolution Approach to Terrain Visualization. 766-776
Industrial Paper
Data Warehousing
Dennis Pedersen, Jesper Pedersen, Torben Bach Pedersen: Integrating XML Data in the TARGITOLAP System. 778-781
Bio-informatics
Peter Mork, Philip A. Bernstein: Adapting a Generic Match Algorithm to Align Ontologies of Human Anatomy. 787-790
Chung-Min Chen, Hira Agrawal, Munir Cochinwala, David Rosenbluth: Stream Query Processing for Healthcare Bio-sensor Applications. 791-794
Joel E. Richardson, James A. Kadin, Judith A. Blake, Carol J. Bult, Janan T. Eppig, Martin Ringwald: From Sipping on a Straw to Drinking from a Fire Hose: Data Integration in a Public Genome Database. 795-798
Enterprise Systems
Michael J. Carey: BEA Liquid Data for WebLogic: XML-Based Enterprise Information Integration. 800-803
David Reiner, Gil Press, Mike Lenaghan, David Barta, Rich Urmston: Information Lifecycle Management: The EMC Perspective. 804-807
Chito Jovellanos: ContextMetricsTM: Semantic and Syntactic Interoperability in Cross-Border Trading Systems. 808-811
Data and the Web
Jinyu Wang, Kongyi Zhou, K. Karun, Mark Scardina: Extending XML Database to Support Open XML. 813-816
Neeraj Agrawal, Rema Ananthanarayanan, Rahul Gupta, Sachindra Joshi, Raghu Krishnapuram, Sumit Negi: EShopMonitor: A Web Content Monitoring Tool. 817-820
Mike Hanlon, Johannes Klein, Robbert C. Van der Linden, Hansjörg Zeller: Publish/Subscribe in NonStop SQL: Transactional Streams in a Relational Context. 821-824
Poster Sessions
Indexing, Clustering, Data Mining

Chulyun Kim, Jong-Hwa Lim, Raymond T. Ng, Kyuseok Shim: SQUIRE: Sequential Pattern Mining with Quantities. 827
Elisa Bertino, Barbara Catania, Wen Qiang Wang: XJoin Index: Indexing XML Data for Efficient Handling of Branching Path Expressions. 828
Raghav Kaushik, Rajasekar Krishnamurthy, Jeffrey F. Naughton, Raghu Ramakrishnan: On the Integration of Structure Indexes and Inverted Lists. 829
Silvia Nittel, Kelvin T. Leung, Amy Braverman: Scaling Clustering Algorithms for Massive Data Sets using Data Streams. 830
Sandeep Gupta, Chinya V. Ravishankar: Using vTree Indices for Queries over Objects with Complex Motions. 831
Sanjay Chawla, Joseph G. Davis, Gaurav Pandey: On Local Pruning of Association Rules Using Directed Hypergraphs. 832
Wenyuan Li, Wee Keong Ng, Ee-Peng Lim: Spectral Analysis of Text Collection for Similarity-based Clustering. 833
Chien-Chung Huang, Shui-Lung Chuang, Lee-Feng Chien: Mining the Web for Generating Thematic Metadata from Textual Data. 834
Karin Kailing, Hans-Peter Kriegel, Stefan Schönauer, Thomas Seidl: Efficient Similarity Search in Large Databases of Tree Structured Objects. 835
Indexing, Query Processing, XML
Ilias Nitsos, Georgios Evangelidis, Dimitrios Dervos: Bitmap-Tree Indexing for Set Operations on Free Text. 837
Alin Deutsch, Yannis Papakonstantinou, Yu Xu: Minimization and Group-By Detection for Nested XQueries. 839
Bertram Ludäscher, Alan Nash: Web Service Composition Through Declarative Queries: The Case of Conjunctive Queries with Union and Negation. 840
Gary Kratkiewicz, Renu Kurien Bostwick, Geoffrey S. Knauth: Efficient Execution of Computation Modules in a Model with Massive Data. 841
Surajit Chaudhuri, Zhiyuan Chen, Kyuseok Shim, Yuqing Wu: Storing XML (with XSD) in SQL Databases: Interplay of Logical and Physical Designs. 842
Peter Rosenthal: A Type-Safe Object-Oriented Solution for the Dynamic Construction of Queries. 843

Boualem Benatallah, Mohand-Said Hacid, Hye-Young Paik, Christophe Rey, Farouk Toumani: Peering and Querying e-Catalog Communities. 846
Demo Sessions
Christian Wiesner, Alfons Kemper, Stefan Brandl: Dynamic Extensible Query Processing in Super-Peer Based P2P Systems. 848
Ilvio Bruder, Andre Zeitz, Holger Meyer, Birger Hänsel, Andreas Heuer: FLYINGDOC: An Architecture for Distributed, User-friendly, and Personalized Information Systems. 849
Moustafa A. Hammad, Mohamed F. Mokbel, Mohamed H. Ali, Walid G. Aref, Ann Christine Catlin, Ahmed K. Elmagarmid, Mohamed Y. Eltabakh, Mohamed G. Elfeky, Thanaa M. Ghanem, Robert Gwadera, Ihab F. Ilyas, Mirette S. Marzouk, Xiaopeng Xiong: Nile: A Query Processing Engine for Data Streams. 851
Avigdor Gal, Giovanni A. Modica, Hasan M. Jamil: OntoBuilder: Fully Automatic Extraction and Consolidation of Ontologies from Web Sources. 853
Amarnath Gupta, Bin Liu, Pilho Kim, Ramesh Jain: Using Stream Semantics for Continuous Queries in Media Stream Processors. 854
Omar Boucelma, Mehdi Essid, Zoé Lacroix, Julien Vinel, Jean-Yves Garinet, Abdelkader Bétari: VirGIS: Mediation for Geographical Information Systems. 855
Juliana Freire, Maya Ramanath, Lingzhi Zhang: A Flexible Infrastructure for Gathering XML Statistics and Estimating Query Cardinality. 857
Stefan Brecheisen, Hans-Peter Kriegel, Peer Kröger, Martin Pfeifle, Maximilian Viermetz, Marco Pötke: BOSS: Browsing OPTICS-Plots for Similarity Search. 858
Stéphane Lopes, Fabien De Marchi, Jean-Marc Petit: DBA Companion: A Tool for Logical Database Tuning. 859
Yong Ye, Xintao Wu, Kalpathi R. Subramanian, Liying Zhang: GenExplore: Interactive Exploration of Gene Interactions from Microarray Data. 860
Sudarshan Murthy, David Maier, Lois M. L. Delcambre, Shawn Bowers: Superimposed Applications using SPARCE. 861
Yannis Velegrakis, Renée J. Miller, Lucian Popa, John Mylopoulos: ToMAS: A System for Adapting Mappings while Schemas Evolve. 862
Radu Sion, Mikhail J. Atallah, Sunil Prabhakar: wmdb.: Rights Protection for Numeric Relational Data. 863
Panels
Daniela Florescu: Database Research for the Current Millennium. 866
Dieter Gawlick: Querying the Past, the Present, and the Future. 867
David B. Lomet: Database Kernel Research: What, if anything, is left to do? 868
Michael Stonebraker: Outrageous Ideas and/or Thoughts While Shaving. 869
Advanced Seminars
Baihua Zheng, Jianliang Xu, Wang-Chien Lee: Data Management in Location-Dependent Information Services. 871
Irini Fundulaki, Richard Hull, Bharat Kumar, Daniel F. Lieuwen, Arnaud Sahuguet: "My Personal Web": A Seminar on Personalization and Privacy for Web and Converged Services. 872


Wei Hong, Samuel Madden: Implementation and Research Issues in Query Processing for Wireless Sensor Networks . 876
Jian Pei, Shambhu J. Upadhyaya, Faisal Farooq, Venugopal Govindaraju: Data Mining for Intrusion Detection: Techniques, Applications and Systems. 877



