ICS 2004: Saint Malo, France
Paul Feautrier, James R. Goodman, André Seznec (Eds.): Proceedings of the 18th Annual International Conference on Supercomputing, ICS 2004, Saint Malo, France, June 26 - July 01, 2004. ACM 2004 ISBN 1-58113-839-3
Cache
Sorin Iacobovici, Lawrence Spracklen, Sudarshan Kadambi, Yuan Chou, Santosh G. Abraham: Effective stream-based and execution-based data prefetching. 1-11
Wei Zhang: Enhancing data cache reliability by the addition of a small fully-associative replication cache. 12-19
Masamichi Takagi, Kei Hiraki: Inter-reference gap distribution replacement: an improved replacement algorithm for set-associative caches. 20-30
Applications 1
Chao Wang, Srinivasan Parthasarathy: Parallel algorithms for mining frequent structural motifs in scientific data. 31-40
Ananth Grama, Vivek Sarin: Impact of far-field interactions on performance of multipole-based preconditioners for sparse linear systems. 41-47
Andrey N. Chernikov, Nikos Chrisochoides: Practical and efficient point insertion scheduling method for parallel guaranteed quality delaunay refinement. 48-57
Input/Output
Florin Isaila, Guido Malpohl, Vlad Olaru, Gabor Szeder, Walter F. Tichy: Integrating collective I/O and cooperative caching into the "clusterfile" parallel file system. 58-67
Eduardo Pinheiro, Ricardo Bianchini: Energy conservation techniques for disk array-based servers. 68-78
Qingbo Zhu, Asim Shankar, Yuanyuan Zhou: PB-LRU: a self-tuning power aware storage cache replacement algorithm for conserving disk energy. 79-88
Distributed computing
Thierry Gautier, Hamid-Reza Hamidi: Automatic re-scheduling of dependencies in a RPC-based grid. 89-94
Suresh Aryangat, Henrique Andrade, Alan Sussman: Time and space optimization for processing groups of multi-dimensional scientific queries. 95-105
Software tools
Robert A. van Engelen, Johnnie Birch, Yixin Shou, Burt Walsh, Kyle Gallivan: A unified framework for nonlinear dependence testing and symbolic analysis. 106-115
Xianfeng Li, Hemendra Singh Negi, Tulika Mitra, Abhik Roychoudhury: Design space exploration of caches using compressed traces. 116-125
Wei Liu, Michael C. Huang: EXPERT: expedited simulation exploiting program behavior repetition. 126-135
Applications 2
Sascha Hunold, Thomas Rauber, Gudula Rünger: Multilevel hierarchical matrix multiplication on clusters. 136-145
Paul van der Mark, Lex Wolters, Gerard Cats: A dynamic application-driven data communication strategy. 146-153
Yoshihiro Nakajima, Mitsuhisa Sato, Hitoshi Gotoh, Taisuke Boku, Daisuke Takahashi: Implementation and performance evaluation of CONFLEX-G: grid-enabled molecular conformational space search program with OmniRPC. 154-163
Supercomputers
Arun Rodrigues, Richard C. Murphy, Peter M. Kogge, Keith D. Underwood: Characterizing a new class of threads in scientific applications for high end supercomputers. 164-174
Hongzhang Shan, Erich Strohmaier: Performance characteristics of the Cray X1 and their implications for application performance tuning. 175-183
Christian Bell, Wei-Yu Chen, Dan Bonachea, Katherine A. Yelick: Evaluating support for global address space languages on the Cray X1. 184-195
Microarchitectures
Ruchira Sasanka, Sarita V. Adve, Yen-Kuang Chen, Eric Debes: The energy efficiency of CMP vs. SMT for multimedia workloads. 196-206
Wessam Hassanein, José A. B. Fortes, Rudolf Eigenmann: Data forwarding through in-memory precomputation threads. 207-216
Yongxiang Liu, Anahita Shayesteh, Gokhan Memik, Glenn Reinman: Scaling the issue window with look-ahead latency prediction. 217-226
Compilers
Chris Bentley, Scott A. Watterson, David K. Lowenthal, Barry Rountree: Implicit java array bounds checking on 64-bit architecture. 227-236
Patrick Carribault, Albert Cohen: Applications of storage mapping optimization to register promotion. 247-256
Middleware for high performance computing
Ravi R. Iyer: CQoS: a framework for enabling QoS in shared caches of CMP platforms. 257-266
Saurabh Agarwal, Rahul Garg, Meeta Sharma Gupta, José E. Moreira: Adaptive incremental checkpointing for massively parallel systems. 277-286
Communication and consistency protocols
Jaydeep Marathe, Anita Nagarajan, Frank Mueller: Detailed cache coherence characterization for OpenMP benchmarks. 287-297
Ron Brightwell, Keith D. Underwood: An analysis of the impact of MPI overlap and independent progress. 298-305
Vadim Iosevich, Assaf Schuster: A comparison of sequential consistency with home-based lazy release consistency for software distributed shared memory. 306-315
Clustered microarchitectures
Fernando Latorre, José González, Antonio González: Back-end assignment schemes for clustered multithreaded processors. 316-325
Rajeev Balasubramonian: Cluster prefetch: tolerating on-chip wire delays in clustered microarchitectures. 326-335
David Petrou, Gregory R. Ganger, Garth A. Gibson: Cluster scheduling for explicitly-speculative tasks. 336-345



