28. ICDE 2012:
Washington, DC, USA
Anastasios Kementsietsidis, Marcos Antonio Vaz Salles (Eds.):
IEEE 28th International Conference on Data Engineering (ICDE 2012), Washington, DC, USA (Arlington, Virginia), 1-5 April, 2012.
IEEE Computer Society 2012, ISBN 978-0-7685-4747-3
Keynotes
Panel Paper
Session 1:
Privacy
- Cuneyt Gurcan Akcora, Barbara Carminati, Elena Ferrari:
Privacy in Social Networks: How Risky is Your Social Graph?
9-19

- Graham Cormode, Cecilia M. Procopiuc, Divesh Srivastava, Entong Shen, Ting Yu:
Differentially Private Spatial Decompositions.
20-31

- Jia Xu, Zhenjie Zhang, Xiaokui Xiao, Yin Yang, Ge Yu:
Differentially Private Histogram Publication.
32-43

- Russell Paulet, Md. Golam Kaosar, Xun Yi, Elisa Bertino:
Privacy-Preserving and Content-Protecting Location Based Queries.
44-53

Session 2:
Web 2.0 Applications
Session 3:
Storage Management
- Aubrey Tatarowicz, Carlo Curino, Evan P. C. Jones, Sam Madden:
Lookup Tables: Fine-Grained Partitioning for Distributed Databases.
102-113

- Richard T. Snodgrass, Dengfeng Gao, Rui Zhang, Stephen W. Thomas:
Temporal Support for Persistent Stored Modules.
114-125

- Norifumi Nishikawa, Miyuki Nakano, Masaru Kitsuregawa:
Energy Efficient Storage Management Cooperated with Large Data Intensive Applications.
126-137

- Eric R. Schendel, Ye Jin, Neil Shah, Jackie Chen, Choong-Seock Chang, Seung-Hoe Ku, Stéphane Ethier, Scott Klasky, Robert Latham, Robert B. Ross, Nagiza F. Samatova:
ISOBAR Preconditioner for Effective and High-throughput Lossless Data Compression.
138-149

Session 4:
Data Streams Processing
- Badrish Chandramouli, David Maier, Jonathan Goldstein:
Physically Independent Stream Merging.
150-161

- Srikanta Tirthapura, David P. Woodruff:
A General Method for Estimating Correlated Aggregates over a Data Stream.
162-173

- Tingjian Ge, Fujun Liu:
Accuracy-Aware Uncertain Stream Databases.
174-185

- Lu An Tang, Yu Zheng, Jing Yuan, Jiawei Han, Alice Leung, Chih-Chieh Hung, Wen-Chih Peng:
On Discovery of Traveling Companions from Streaming Trajectories.
186-197

Session 5:
Graphs
- Dayu Yuan, Prasenjit Mitra, Huiwen Yu, C. Lee Giles:
Iterative Graph Feature Mining for Graph Indexing.
198-209

- Xiaoli Wang, Xiaofeng Ding, Anthony K. H. Tung, Shanshan Ying, Hai Jin:
An Efficient Graph Indexing Method.
210-221

- Changjiu Jin, Sourav S. Bhowmick, Byron Choi, Shuigeng Zhou:
PRAGUE: Towards Blending Practical Visual Subgraph Query Formulation and Query Processing.
222-233

- Walaa Eldin Moustafa, Amol Deshpande, Lise Getoor:
Ego-centric Graph Pattern Census.
234-245

Session 6:
Uncertain and Probabilistic Databases
Session 7:
Data Integration and Extraction
Session 8:
Spatio-Temporal Data Management
- Manish Singh, Qiang Zhu, H. V. Jagadish:
SWST: A Disk Based Index for Sliding Window Spatio-Temporal Data.
342-353

- Tobias Emrich, Hans-Peter Kriegel, Nikos Mamoulis, Matthias Renz, Andreas Züfle:
Querying Uncertain Spatio-Temporal Data.
354-365

- Jianzhong Qi, Rui Zhang, Lars Kulik, Dan Lin, Yuan Xue:
The Min-dist Location Selection Query.
366-377

- Jia Pan, Dinesh Manocha:
Bi-level Locality Sensitive Hashing for k-Nearest Neighbor Computation.
378-389

Session 9:
Query Processing
- Mert Akdere, Ugur Çetintemel, Matteo Riondato, Eli Upfal, Stanley B. Zdonik:
Learning-based Query Performance Modeling and Prediction.
390-401

- Günes Aluç, David DeHaan, Ivan T. Bowman:
Parametric Plan Caching Using Density-Based Clustering.
402-413

- Pit Fender, Guido Moerkotte, Thomas Neumann, Viktor Leis:
Effective and Robust Pruning for Top-Down Join Enumeration Algorithms.
414-425

- Anastasios Arvanitis, Georgia Koutrika:
Towards Preference-aware Relational Databases.
426-437

Session 10:
Location Aware Data Processing
- Hua Lu, Xin Cao, Christian S. Jensen:
A Foundation for Efficient Indoor Distance-Aware Query Processing.
438-449

- Justin J. Levandoski, Mohamed Sarwat, Ahmed Eldawy, Mohamed F. Mokbel:
LARS: A Location-Aware Recommender System.
450-461

- Miao Qiao, Hong Cheng, Lijun Chang, Jeffrey Xu Yu:
Approximate Shortest Distance Computing: A Query-Dependent Local Landmark Scheme.
462-473

- Guoliang Li, Jianhua Feng, Jing Xu:
DESKS: Direction-Aware Spatial Keyword Search.
474-485

Session 11:
Map-Reduce Based Data Processing
Session 12:
Social Media
- Guo-Jun Qi, Charu C. Aggarwal, Thomas S. Huang:
Community Detection with Edge Content in Social Media Networks.
534-545

- Chen Liu, Sai Wu, Shouxu Jiang, Anthony K. H. Tung:
Cross Domain Search by Exploiting Wikipedia.
546-557

- Junjie Yao, Bin Cui, Zijun Xue, Qingyun Liu:
Provenance-based Indexing Support in Micro-blog Platforms.
558-569

- Luke Dickens, Ian Molloy, Jorge Lobo, Pau-Chen Cheng, Alessandra Russo:
Learning Stochastic Models of Information Flow.
570-581

Session 13:
P2P and Distributed Processing
- Gang Chen, Tianlei Hu, Dawei Jiang, Peng Lu, Kian-Lee Tan, Hoang Tam Vo, Sai Wu:
BestPeer++: A Peer-to-Peer Based Large-Scale Data Processing Platform.
582-593

- Minqi Zhou, Heng Tao Shen, Xiaofang Zhou, Weining Qian, Aoying Zhou:
Effective Data Density Estimation in Ring-Based P2P Networks.
594-605

- Christos Doulkeridis, Akrivi Vlachou, Kjetil Nørvåg, Yannis Kotidis, Neoklis Polyzotis:
Processing of Rank Joins in Highly Distributed Systems.
606-617

- Lars Kolb, Andreas Thor, Erhard Rahm:
Load Balancing for MapReduce-based Entity Resolution.
618-629

Session 14:
XML and RDF Data Management
- Liang Jeff Chen, Philip A. Bernstein, Peter Carlin, Dimitrije Filipovic, Michael Rys, Nikita Shamgunov, James F. Terwilliger, Milos Todic, Sasa Tomasevic, Dragan Tomic:
Mapping XML to a Wide Sparse Table.
630-641

- Curtis E. Dyreson, Sourav S. Bhowmick:
Querying XML Data: As You Shape It.
642-653

- Yanghua Xiao, Ji Hong, Wanyun Cui, Zhenying He, Wei Wang, Guodong Feng:
Branch Code: A Labeling Scheme for Efficient Query Answering on Trees.
654-665

- Wangchao Le, Anastasios Kementsietsidis, Songyun Duan, Feifei Li:
Scalable Multi-query Optimization for SPARQL.
666-677

Session 15:
Performance
- Jiexing Li, Rimma V. Nehme, Jeffrey F. Naughton:
GSLPI: A Cost-Based Query Progress Indicator.
678-689

- Rui Zhang, Richard T. Snodgrass, Saumya Debray:
Micro-Specialization in DBMSes.
690-701

- Willis Lang, Srinath Shankar, Jignesh M. Patel, Ajay Kalhan:
Towards Multi-tenant Performance SLOs.
702-713

- David B. Lomet, Alan Fekete, Rui Wang, Peter Ward:
Multi-version Concurrency via Timestamp Range Conflict Management.
714-725

Session 16:
Data Extraction and Quality
Session 17:
Top-K Processing
Session 18:
Similarity
Session 19:
Text and Strings
- Fei Chen, Xixuan Feng, Christopher Re, Min Wang:
Optimizing Statistical Information Extraction Programs over Evolving Text.
870-881

- Chong Sun, Jeffrey F. Naughton, Siddharth Barman:
Approximate String Membership Checking: A Multiple Filter, Optimization-Based Approach.
882-893

- Charu C. Aggarwal, Yuchen Zhao, Philip S. Yu:
On Text Clustering with Side Information.
894-904

- Junfeng Zhou, Zhifeng Bao, Wei Wang, Tok Wang Ling, Ziyang Chen, Xudong Lin, Jingfeng Guo:
Fast SLCA and ELCA Computation for XML Keyword Queries Based on Set Intersection.
905-916

Session 20:
Query Processing II
- Nikolay Laptev, Carlo Zaniolo:
Optimization of Massive Pattern Queries by Dynamic Configuration Morphing.
917-928

- Shenoda Guirguis, Mohamed A. Sharaf, Panos K. Chrysanthis, Alexandros Labrinidis:
Three-Level Processing of Multiple Aggregate Continuous Queries.
929-940

- Farhan Tauheed, Laurynas Biveinis, Thomas Heinis, Felix Schürmann, Henry Markram, Anastasia Ailamaki:
Accelerating Range Queries for Brain Simulations.
941-952

- Junjie Yao, Bin Cui, Liansheng Hua, Yuxin Huang:
Keyword Query Reformulation on Structured Data.
953-964

Session 21:
Data Mining
Session 22:
Scientific Data, Analysis and Visualization
- Adam Seering, Philippe Cudré-Mauroux, Samuel Madden, Michael Stonebraker:
Efficient Versioning for Scientific Array Databases.
1013-1024

- Lu An Tang, Xiao Yu, Sangkyum Kim, Jiawei Han, Wen-Chih Peng, Yizhou Sun, Hector Gonzalez, Sebastian Seith:
Multidimensional Analysis of Atypical Events in Cyber-Physical Data.
1025-1036

- Fabian Keller, Emmanuel Müller, Klemens Böhm:
HiCS: High Contrast Subspaces for Density-Based Outlier Ranking.
1037-1048

- Yang Zhang, Srinivasan Parthasarathy:
Extracting Analyzing and Visualizing Triangle K-Core Motifs within Networks.
1049-1060

Session 23:
Similarity Search and Detection
- Min Soo Kim, Kyu-Young Whang, Yang-Sae Moon:
Horizontal Reduction: Instance-Level Dimensionality Reduction for Similarity Search in Large Document Databases.
1061-1072

- Uwe Draisbach, Felix Naumann, Sascha Szott, Oliver Wonneberg:
Adaptive Windows for Duplicate Detection.
1073-1083

- Jongwuk Lee, Hyunsouk Cho, Seung-won Hwang:
Efficient Dual-Resolution Layer Indexing for Top-k Queries.
1084-1095

- Reynold Cheng, Jian Gong, David W. Cheung, Jiefeng Cheng:
Evaluating Probabilistic Queries over Uncertain Matching.
1096-1107

Session 24:
Sensors Network and Trajectory
- Sabbas Burdakis, Antonios Deligiannakis:
Detecting Outliers in Sensor Networks Using the Geometric Approach.
1108-1119

- Mingwang Tang, Feifei Li, Jeff M. Phillips, Jeffrey Jestes:
Efficient Threshold Monitoring for Distributed Probabilistic Data.
1120-1131

- Dhaval Patel, Chang Sheng, Wynne Hsu, Mong-Li Lee:
Incorporating Duration Information for Trajectory Classification.
1132-1143

- Kai Zheng, Yu Zheng, Xing Xie, Xiaofang Zhou:
Reducing Uncertainty of Low-Sampling-Rate Trajectories.
1144-1155

Session 25:
Error Reduction and Data Security
- Mehmet Kuzu, Mohammad Saiful Islam, Murat Kantarcioglu:
Efficient Similarity Search over Encrypted Data.
1156-1167

- HweeHwa Pang, Xiaokui Xiao, Jialie Shen:
Obfuscating the Topical Intention in Enterprise Text Search.
1168-1179

- Katrin Eisenreich, Jochen Adamek, Philipp Rösch, Volker Markl, Gregor Hackenbroich:
Correlation Support for Risk Evaluation in Databases.
1180-1191

- Hyo-Sang Lim, Gabriel Ghinita, Elisa Bertino, Murat Kantarcioglu:
A Game-Theoretic Approach for High-Assurance of Data Trustworthiness in Sensor Networks.
1192-1203

Seminars
- Oktie Hassanzadeh, Anastasios Kementsietsidis, Yannis Velegrakis:
Data Management Issues on the Semantic Web.
1204-1206

- Emmanuel Müller, Stephan Günnemann, Ines Färber, Thomas Seidl:
Discovering Multiple Clustering Solutions: Grouping Objects in Different Views of the Data.
1207-1210

- Xin Luna Dong, Divesh Srivastava:
Detecting Clones, Copying and Reuse on the Web.
1211-1213

- Jiawei Han, Yizhou Sun, Xifeng Yan, Philip S. Yu:
Mining Knowledge from Data: An Information Network Analysis Approach.
1214-1217

- Arijit Khan, Yinghui Wu, Xifeng Yan:
Emerging Graph Queries in Linked Data.
1218-1221

- Jaideep Vaidya:
Boolean Matrix Decomposition Problem: Theory, Variations and Applications to Data Engineering.
1222-1224

Demo Group 1
- Thomas Kissinger, Hannes Voigt, Wolfgang Lehner:
SMIX Live - A Self-Managing Index Infrastructure for Dynamic Workloads.
1225-1228

- Mohammad Sadoghi, Rija Javed, Naif Tarafdar, Harsh Singh, Rohan Palaniappan, Hans-Arno Jacobsen:
Multi-query Stream Processing on FPGAs.
1229-1232

- Jia Xu, Qiushi Bai, Yu Gu, Anthony K. H. Tung, Guoren Wang, Ge Yu, Zhenjie Zhang:
EUDEMON: A System for Online Video Frame Copy Detection by Earth Mover's Distance.
1233-1236

- Meiyu Lu, Srinivas Bangalore, Graham Cormode, Marios Hadjieleftheriou, Divesh Srivastava:
A Dataset Search Engine for the Research Document Corpus.
1237-1240

- Keivan Kianmehr, Negar Koochakzadeh, Reda Alhajj:
AskFuzzy: Attractive Visual Fuzzy Query Builder.
1241-1244

- Ulrike Fischer, Frank Rosenthal, Wolfgang Lehner:
F2DB: The Flash-Forward Database System.
1245-1248

- Robert Ikeda, Junsang Cho, Charlie Fang, Semih Salihoglu, Satoshi Torikai, Jennifer Widom:
Provenance-Based Debugging and Drill-Down in Data-Oriented Workflows.
1249-1252

Demo Group 2
- Ahmed M. Aly, Asmaa Sallam, Bala M. Gnanasekaran, Long-Van Nguyen-Dinh, Walid G. Aref, Mourad Ouzzani, Arif Ghafoor:
M3: Stream Processing on Main-Memory MapReduce.
1253-1256

- Torsten Grust, Manuel Mayr:
A Deep Embedding of Queries into Ruby.
1257-1260

- Rubi Boim, Ohad Greenshpan, Tova Milo, Slava Novgorodov, Neoklis Polyzotis, Wang Chiew Tan:
Asking the Right Questions in Crowd Data Sourcing.
1261-1264

- Chunbin Lin, Jiaheng Lu, Tok Wang Ling, Bogdan Cautis:
LotusX: A Position-Aware XML Graphical Search System with Auto-Completion.
1265-1268

- Mehdi Kargar, Aijun An:
Efficient Top-k Keyword Search in Graphs with Polynomial Delay.
1269-1272

- Rui Li, Kin Hou Lei, Ravi Khadiwala, Kevin Chen-Chuan Chang:
TEDAS: A Twitter-based Event Detection and Analysis System.
1273-1276

- Fei Chiang, Periklis Andritsos, Erkang Zhu, Renée J. Miller:
AutoDict: Automated Dictionary Discovery.
1277-1280

Demo Group 3
- Barbara Carminati, Elena Ferrari, Jacopo Girardi:
Trust and Share: Trusted Information Sharing in Online Social Networks.
1281-1284

- Elke Achtert, Sascha Goldhofer, Hans-Peter Kriegel, Erich Schubert, Arthur Zimek:
Evaluation of Clusterings - Metrics and Visual Support.
1285-1288

- Mohamed Sarwat, Sameh Elnikety, Yuxiong He, Gabriel Kliot:
Horton: Online Query Execution Engine for Large Distributed Graphs.
1289-1292

- Peter M. Fischer, Jens Teubner:
MXQuery with Hardware Acceleration.
1293-1296

- Steffen Hirte, Andreas Seifert, Stephan Baumann, Daniel Klan, Kai-Uwe Sattler:
Data3 - A Kinect Interface for OLAP Using Complex Event Processing.
1297-1300

- Anisoara Nica, Ian Charlesworth, Maysum Panju:
Analyzing Query Optimization Process: Portraits of Join Enumeration Algorithms.
1301-1304

- Yonghui Xiao, James J. Gardner, Li Xiong:
DPCube: Releasing Differentially Private Data Cubes for Health Information.
1305-1308

Demo Group 4
- Roberto De Virgilio, Giorgio Orsi, Letizia Tanca, Riccardo Torlone:
NYAYA: A System Supporting the Uniform Management of Large Sets of Semantic Data.
1309-1312

- Songling Liu, Juan P. Cedeño, K. Selçuk Candan, Maria Luisa Sapino, Shengyu Huang, Xinsheng Li:
R2DB: A System for Querying and Visualizing Weighted RDF Graphs.
1313-1316

- Roger S. Barga, Jaliya Ekanayake, Wei Lu:
Project Daytona: Data Analytics as a Cloud Service.
1317-1320

- Isabel F. Cruz, Cosmin Stroe, Matteo Palmonari:
Interactive User Feedback in Ontology Matching Using Signature Vectors.
1321-1324

- Pawel Jurczyk, Li Xiong, Slawomir Goryczka:
DObjects+: Enabling Privacy-Preserving Data Federation Services.
1325-1328

- Kyriacos E. Pavlou, Richard T. Snodgrass:
DRAGOON: An Information Accountability System for High-Performance Databases.
1329-1332

- Kenneth P. Smith, Ameet Kini, William Wang, Chris Wolf, M. David Allen, Andrew Sillers:
Intuitive Interaction with Encrypted Query Execution in DataStorm.
1333-1336

Industrial Session 1:
Support for Large Scale Data Analytics
Industrial Session 2:
Evolving Platforms for New Applications
- Michael Busch, Krishna Gade, Brian Larson, Patrick Lok, Samuel Luckenbill, Jimmy Lin:
Earlybird: Real-Time Search at Twitter.
1360-1369

- Aditya Auradkar, Chavdar Botev, Shirshanka Das, Dave De Maagd, Alex Feinberg, Phanindra Ganti, Lei Gao, Bhaskar Ghosh, Kishore Gopalakrishna, Brendan Harris, Joel Koshy, Kevin Krawez, Jay Kreps, Shi Lu, Sunil Nagaraj, Neha Narkhede, Sasha Pachev, Igor Perisic, Lin Qiao, Tom Quiggle, Jun Rao, Bob Schulman, Abraham Sebastian, Oliver Seeliger, Adam Silberstein, Boris Shkolnik, Chinmay Soman, Roshan Sumbaly, Kapil Surlaker, Sajid Topiwala, Cuong Tran, Balaji Varadarajan, Jemiah Westerman, Zach White, David Zhang, Jason Zhang:
Data Infrastructure at LinkedIn.
1370-1381

- Claudio Jossen, Lukas Blunschi, Magdalini Mori, Donald Kossmann, Kurt Stockinger:
The Credit Suisse Meta-data Warehouse.
1382-1393

Industrial Session 3:
Indexing, Updates and Processing
- Zhen Hua Liu, Hui J. Chang, Balasubramanyam Sthanikam:
Efficient Support of XQuery Update Facility in XML Enabled RDBMS.
1394-1404

- Souripriya Das, Seema Sundara, Matthew Perry, Jagannathan Srinivasan, Jayanta Banerjee, Aravind Yalamanchi:
Making Unstructured Data SPARQL Using Semantic Indexing in Oracle Database.
1405-1416

- Sonia Bergamaschi, Matteo Interlandi, Mario Longo, Laura Po, Maurizio Vincini:
A Meta-language for MDX Queries in eLog Business Solution.
1417-1428

Last update Fri May 24 02:51:33 2013
CET by the DBLP Team —
Data released under the ODC-BY 1.0 license — See also our legal information page