PACT 2010:
Vienna, Austria
Valentina Salapura, Michael Gschwind, Jens Knoop (Eds.):
19th International Conference on Parallel Architecture and Compilation Techniques (PACT 2010), Vienna, Austria, September 11-15, 2010.
ACM 2010, ISBN 978-1-4503-0178-7
Keynote address I
- David A. Ferrucci:
Build Watson: an overview of DeepQA for the Jeopardy! challenge.
1-2

Keynote address II
Keynote address III
- Wen-mei W. Hwu:
Raising the level of many-core programming with compiler technology: meeting a grand challenge.
5-6

Power-aware design
- Victor Jiménez, Francisco J. Cazorla, Roberto Gioiosa, Mateo Valero, Carlos Boneti, Eren Kursun, Chen-Yong Cher, Canturk Isci, Alper Buyuktosunoglu, Pradip Bose:
Power and thermal characterization of POWER6 system.
7-18

- Karthik Ganesan, Jungho Jo, William Lloyd Bircher, Dimitris Kaseridis, Zhibin Yu, Lizy K. John:
System-level max power (SYMPO): a systematic approach for escalating system-level power consumption using synthetic benchmarks.
19-28

- Jonathan A. Winter, David H. Albonesi, Christine A. Shoemaker:
Scalable thread scheduling and global power management for heterogeneous many-core architectures.
29-40

- Matthew A. Watkins, David H. Albonesi:
Dynamically managed multithreaded reconfigurable architectures for chip multiprocessors.
41-52

Analysis & optimization
Caches & coherence I
- John H. Kelm, Matthew R. Johnson, Steven S. Lumetta, Sanjay J. Patel:
WAYPOINT: scaling coherence to thousand-core architectures.
99-110

- Daehoon Kim, Jeongseob Ahn, Jae-Hong Kim, Jaehyuk Huh:
Subspace snooping: filtering snoops with operating system support.
111-122

- Nick Barrow-Williams, Christian Fensch, Simon W. Moore:
Proximity coherence for chip multiprocessors.
123-134

- Hongzhou Zhao, Arrvindh Shriraman, Sandhya Dwarkadas:
SPACE: sharing pattern-based directory coherence for multicore scalability.
135-146

Parallelization and parallel programming I
- M. Aater Suleman, Moinuddin K. Qureshi, Khubaib, Yale N. Patt:
Feedback-directed pipeline parallelism.
147-156

- Zheng Li, Olivier Certner, José Duato, Olivier Temam:
Scalable hardware support for conditional parallelization.
157-168

- Jisheng Zhao, Jun Shirako, V. Krishna Nandivada, Vivek Sarkar:
Reducing task creation and termination overhead in explicitly parallel programs.
169-180

- Ganesh S. Dasika, Ankit Sethia, Vincentius Robby, Trevor N. Mudge, Scott A. Mahlke:
MEDICS: ultra-portable processing for medical image reconstruction.
181-192

Heterogeneous platforms and frameworks
- Jaejin Lee, Jungwon Kim, Sangmin Seo, Seungkyun Kim, Jung-Ho Park, Honggyu Kim, Thanh Tuan Dao, Yongjin Cho, Sung Jong Seo, Seung Hak Lee, Seung Mo Cho, Hyo Jung Song, Sang-Bum Suh, Jong-Deok Choi:
An OpenCL framework for heterogeneous multicores with local memory.
193-204

- Jayanth Gummaraju, Laurent Morichetti, Michael Houston, Ben Sander, Benedict R. Gaster, Bixia Zheng:
Twin peaks: a software platform for heterogeneous computing on general-purpose and graphics processors.
205-216

- Chuntao Hong, Dehao Chen, Wenguang Chen, Weimin Zheng, Haibo Lin:
MapCG: writing parallel program portable between CPU and GPU.
217-226

- Pradip Hari, John B. P. McCabe, Jonathan Banafato, Marcus Henry, Kevin Ko, Emmanouil Koukoumidis, Ulrich Kremer, Margaret Martonosi, Li-Shiuan Peh:
Adaptive spatiotemporal node selection in dynamic networks.
227-236

Scheduling and design optimization
- Di Xu, Chenggang Wu, Pen-Chung Yew:
On mitigating memory bandwidth contention through bandwidth-aware scheduling.
237-248

- Sergey Zhuravlev, Sergey Blagodurov, Alexandra Fedorova:
AKULA: a toolset for experimenting and developing thread placement algorithms on multicore systems.
249-260

- Sandeep Navada, Niket Kumar Choudhary, Eric Rotenberg:
Criticality-driven superscalar design space exploration.
261-272

- Srihari Cadambi, Abhinandan Majumdar, Michela Becchi, Srimat T. Chakradhar, Hans Peter Graf:
A programmable parallel accelerator for learning and classification.
273-284

Best papers
- Ferad Zyulkyarov, Srdjan Stipic, Tim Harris, Osman S. Unsal, Adrián Cristal, Ibrahim Hur, Mateo Valero:
Discovering and understanding performance bottlenecks in transactional applications.
285-294

- Changhui Lin, Vijay Nagarajan, Rajiv Gupta:
Efficient sequential consistency using conditional fences.
295-306

- Zheng Wang, Michael F. P. O'Boyle:
Partitioning streaming parallelism for multi-cores: a machine learning based approach.
307-318

- Manu Awasthi, David W. Nellans, Kshitij Sudan, Rajeev Balasubramonian, Al Davis:
Handling the problems and opportunities posed by multiple on-chip memory controllers.
319-330

Languages and compilers
- Amit Kumar, Lorenzo De Carli, Sung Jin Kim, Marc de Kruijf, Karthikeyan Sankaralingam, Cristian Estan, Somesh Jha:
Design and implementation of the PLUG architecture for programmable and efficient network lookups.
331-342

- Uday Bondhugula, Oktay Günlük, Sanjeeb Dash, Lakshminarayanan Renganarayanan:
A model for fusion and code motion in an automatic parallelizing compiler.
343-352

- Gregory Frederick Diamos, Andrew Kerr, Sudhakar Yalamanchili, Nathan Clark:
Ocelot: a dynamic optimization framework for bulk-synchronous applications in heterogeneous systems.
353-364

- William Thies, Saman P. Amarasinghe:
An empirical characterization of stream programs and its implications for language and compiler design.
365-376

Parallelization and parallel programming II
Speculation
- Rania H. Mameesh, Manoj Franklin:
Speculative-aware execution: a simple and efficient technique for utilizing multi-cores to improve single-thread performance.
421-430

- Walid J. Ghandour, Haitham Akkary, Wes Masri:
The potential of using dynamic information flow analysis in data value prediction.
431-442

- Tanausú Ramírez, Alex Pajuelo, Oliverio J. Santana, Onur Mutlu, Mateo Valero:
Efficient runahead threads.
443-452

- Yangchun Luo, Venkatesan Packirisamy, Wei-Chung Hsu, Antonia Zhai:
Energy efficient speculative threads: dynamic thread allocation in Same-ISA heterogeneous multicore systems.
453-464

Caches and coherence II
- Seth H. Pugsley, Josef B. Spjut, David W. Nellans, Rajeev Balasubramonian:
SWEL: hardware cache coherence protocols to map shared data onto shared caches.
465-476

- George Kurian, Jason E. Miller, James Psota, Jonathan Eastep, Jifeng Liu, Jürgen Michel, Lionel C. Kimerling, Anant Agarwal:
ATAC: a 1000-core cache-coherent processor with on-chip optical network.
477-488

- Samira Manabi Khan, Daniel A. Jiménez, Doug Burger, Babak Falsafi:
Using dead blocks as a virtual victim cache.
489-500

Data distribution and tiling
Poster session
- Ali Bakhoda, John Kim, Tor M. Aamodt:
On-chip network design considerations for compute accelerators.
535-536

- Rajesh Bordawekar, Uday Bondhugula, Ravi Rao:
Believe it or not!: mult-core CPUs can match GPU performance for a FLOP-intensive application!
537-538

- Muhammad Amber Hassaan, Martin Burtscher, Keshav Pingali:
Ordered and unordered algorithms for parallel breadth first search.
539-540

- Matthew Misler, Natalie D. Enright Jerger:
Moths: mobile threads for on-chip networks.
541-542

- Santhosh Sharma Ananthramu, Deepak Majeti, Sanjeev Kumar Aggarwal, Mainak Chaudhuri:
Improving speculative loop parallelization via selective squash and speculation reuse.
543-544

- Duane Merrill, Andrew S. Grimshaw:
Revisiting sorting for GPGPU stream architectures.
545-546

- Sandya S. Mannarswamy, Ramaswamy Govindarajan:
Analyzing cache performance bottlenecks of STM applications and addressing them with compiler's help.
547-548

- Mohammad Hammoud, Sangyeun Cho, Rami G. Melhem:
An intra-tile cache set balancing scheme.
549-550

- David Eklov, David Black-Schaffer, Erik Hagersten:
StatCC: a statistical cache contention model.
551-552

- Wenjing Ma, Gagan Agrawal:
An integer programming framework for optimizing shared memory use on GPUs.
553-554

- Rafael Ubal, Julio Sahuquillo, Salvador Petit, Pedro López, José Duato:
Exploiting subtrace-level parallelism in clustered processors.
555-556

- Sergey Blagodurov, Sergey Zhuravlev, Alexandra Fedorova, Ali Kamali:
A case for NUMA-aware contention management on multicore systems.
557-558

- Haibo Lin, Tao Liu, Huoding Li, Tong Chen, Lakshminarayanan Renganarayanan, Kevin O'Brien, Ling Shao:
DMATiler: revisiting loop tiling for direct memory access.
559-560

- Christian Bienia, Kai Li:
Scaling of the PARSEC benchmark inputs.
561-562

- Richard West, Puneet Zaroo, Carl A. Waldspurger, Xiao Zhang:
Online cache modeling for commodity multicore processors.
563-564

- Ahmed Abousamra, Rami G. Melhem, Alex K. Jones:
NoC-aware cache design for chip multiprocessors.
565-566

- Jun Lee, Sangmin Seo, Jaejin Lee:
A software-SVM-based transactional memory for multicore accelerator architectures with local memory.
567-568

- R. Manikantan, Kaushik Rajan, R. Govindarajan:
NUcache: a multicore cache organization based on next-use distance.
569-570

- Shantanu Gupta, Shuguang Feng, Amin Ansari, Ganesh S. Dasika, Scott A. Mahlke:
CoreGenesis: erasing core boundaries for robust and configurable performance.
571-572

- Rajkishore Barik, Jisheng Zhao, Vivek Sarkar:
Automatic vector instruction selection for dynamic compilation.
573-574

- Michael M. Lee, John Kim, Dennis Abts, Michael R. Marty, Jae W. Lee:
Approximating age-based arbitration in on-chip networks.
575-576

Last update Sat May 25 18:25:48 2013
CET by the DBLP Team —
Data released under the ODC-BY 1.0 license — See also our legal information page