ICS 2009:
Yorktown Heights, NY, USA
Michael Gschwind, Alexandru Nicolau, Valentina Salapura, José E. Moreira (Eds.):
Proceedings of the 23rd international conference on Supercomputing, 2009, Yorktown Heights, NY, USA, June 8-12, 2009.
ACM 2009, ISBN 978-1-60558-498-0
Keynote Address I
Keynote Address II
- Don G. Grice:
The roadrunner project and the importance of energy efficiency on the road to exascale computing.
2

Keynote Address III
Applications of the cell processor
Cache enhancement techniques
Optimizing parallel applications
Transactional memory I
Compilers
- Albert Hartono, Muthu Manikandan Baskaran, Cédric Bastoul, Albert Cohen, Sriram Krishnamoorthy, Boyana Norris, J. Ramanujam, P. Sadayappan:
Parametric multi-level tiling of imperfectly nested loops.
147-157

- Cheng Wang, Youfeng Wu, Edson Borin, Shiliang Hu, Wei Liu, Dave Sager, Tin-fook Ngai, Jesse Fang:
Dynamic parallelization of single-threaded binary programs using speculative slicing.
158-168

- Alexandru Nicolau, Guangqiang Li, Alexander V. Veidenbaum, Arun Kejariwal:
Synchronization optimizations for efficient execution on multi-cores.
169-180

- Jun Shirako, Jisheng M. Zhao, V. Krishna Nandivada, Vivek Sarkar:
Chunking parallel loops in the presence of synchronization.
181-192

High performance communications I
Accelerating applications with GPUs I
Architectures for High-Performance Computing
High-performance communications II
Storage solutions for supercomputing
- Ji-Yong Shin, Zenglin Xia, Ning-Yi Xu, Rui Gao, Xiongfei Cai, Seungryoul Maeng, Feng-Hsiung Hsu:
FTL design exploration in reconfigurable high-performance SSD for server applications.
338-349

- Henry M. Monti, Ali Raza Butt, Sudharshan S. Vazhkudai:
/scratch as a cache: rethinking HPC center scratch storage.
350-359

- Chao Jin, Hong Jiang, Dan Feng, Lei Tian:
P-Code: a new RAID-6 code with optimal properties.
360-369

- Chuanyi Liu, Yu Gu, Linchun Sun, Bin Yan, Dongsheng Wang:
R-ADMAD: high reliability provision for large-scale de-duplication archival storage systems.
370-379

Accelerating applications with GPUs II
Transactional memory II
Novel supercomputing applications
Power management
- Barry Rountree, David K. Lowenthal, Bronis R. de Supinski, Martin Schulz, Vincent W. Freeh, Tyler K. Bletsch:
Adagio: making DVS practical for complex HPC applications.
460-469

- Mohammad Arjomand, Hamid Sarbazi-Azad:
A comprehensive power-performance model for NoCs with multi-flit channel buffers.
470-478

- Andrew Herdrich, Ramesh Illikkal, Ravi R. Iyer, Donald Newell, Vineet Chadha, Jaideep Moses:
Rate-based QoS techniques for cache/memory in CMP platforms.
479-488

Posters
- Ahmad Faraj, Sameer Kumar, Brian E. Smith, Amith R. Mamidala, John A. Gunnels, Philip Heidelberger:
MPI collective communications on the blue gene/p supercomputer: algorithms and optimizations.
489-490

- James Poe, Clay Hughes, Tao Li:
TransMetric: architecture independent workload characterization for transactional memory benchmarks.
491-492

- Md. Mafijul Islam, Sally A. McKee, Per Stenström:
Cancellation of loads that return zero using zero-value caches.
493-494

- Huayong Wang, Henrique Andrade, Bugra Gedik, Kun-Lung Wu:
Auto-vectorization through code generation for stream processing applications.
495-496

- Aleksandr Ovcharenko, Onkar Sahni, Christopher D. Carothers, Kenneth E. Jansen, Mark S. Shephard:
Subdomain communication to increase scalability in large-scale scientific applications.
497-498

- Yasuo Ishii, Mary Inaba, Kei Hiraki:
Access map pattern matching for data cache prefetch.
499-500

- Karan Singh, Major Bhadauria, Sally A. McKee:
Prediction-based power estimation and scheduling for CMPs.
501-502

- Jih-Ching Chiu, Kai-Ming Yang, Yu-Liang Chou:
Design of a novel SIMD architecture by fusing operations and registers.
503-504

- Jian Li, Lixin Zhang, Charles Lefurgy, Richard R. Treumann, Wolfgang E. Denzel:
Thrifty interconnection network for HPC systems.
505-506

- Liang Gu, Xiaoming Li:
Performance modeling for DFT algorithms in FFTW.
507-508

- Major Bhadauria, Vincent M. Weaver, Sally A. McKee:
PARSEC: hardware profiling of emerging workloads for CMP design.
509-510

- Mohamed E. Hussein, Wael Abd-Almageed:
Approximate kernel matrix computation on GPUs forlarge scale learning applications.
511-512

- Diana Bautista, Julio Sahuquillo, Houcine Hassan, Salvador Petit, José Duato:
Dynamic task set partitioning based on balancing memory requirements to reduce power consumption.
513-514

- Alexandros Papakonstantinou, Karthik Gururaj, John A. Stratton, Deming Chen, Jason Cong, Wen-mei W. Hwu:
High-performance CUDA kernel execution on FPGAs.
515-516

- Angeles G. Navarro, Rafael Asenjo, Siham Tabik, Calin Cascaval:
Load balancing using work-stealing for pipeline parallelism in emerging applications.
517-518

- Shih-wei Liao, Tzu-Han Hung, Donald Nguyen, Hucheng Zhou, Chinyen Chou, Chia-Heng Tu:
Prefetch optimizations on large-scale applications via parameter value prediction.
519-520

- Scott Beamer, Krste Asanovic, Christopher Batten, Ajay Joshi, Vladimir Stojanovic:
Designing multi-socket systems using silicon photonics.
521-522

- Victor Lotrich, Norbert Flocke, Mark Ponton, Beverly A. Sanders, Erik Deumens, Rodney J. Bartlett, Ajith Perera:
An infrastructure for scalable and portable parallel programs for computational chemistry.
523-524

Last update Sat May 18 18:52:25 2013
CET by the DBLP Team —
Data released under the ODC-BY 1.0 license — See also our legal information page