17. PPOPP 2012:
New Orleans,
LA,
USA
J. Ramanujam, P. Sadayappan (Eds.):
Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP 2012, New Orleans, LA, USA, February 25-29, 2012.
ACM 2012, ISBN 978-1-4503-1160-1
GPU tools
Communication & SIMD optimization
Programming models
GPU algorithms
- Mario Méndez-Lojo, Martin Burtscher, Keshav Pingali:
A GPU implementation of inclusion-based points-to analysis.
107-116
- Duane Merrill, Michael Garland, Andrew S. Grimshaw:
Scalable GPU graph traversal.
117-128
- Yuan Zu, Ming Yang, Zhonghu Xu, Lin Wang, Xin Tian, Kunyang Peng, Qunfeng Dong:
GPU-based NFA implementation for memory efficient high speed regular expression matching.
129-140
Concurrent data structures
Parallel algorithms
Correctness and fault tolerance
- Guodong Li, Peng Li, Geoffrey Sawaya, Ganesh Gopalakrishnan, Indradeep Ghosh, Sreeranga P. Rajan:
GKLEE: concolic verification and test generation for GPUs.
215-224
- Peng Du, Aurelien Bouteiller, George Bosilca, Thomas Hérault, Jack Dongarra:
Algorithm-based fault tolerance for dense matrix factorizations.
225-234
- Jeremy D. Buhler, Kunal Agrawal, Peng Li, Roger D. Chamberlain:
Efficient deadlock avoidance for streaming computation with filtering.
235-246
Scheduling and synchronization
Poster session 1 (Monday)
- Muthu Manikandan Baskaran, Nicolas Vasilache, Benoît Meister, Richard Lethin:
Automatic communication optimizations through memory reuse strategies.
277-278
- Gu Liu, Hong An, Wenting Han, Xiaoqiang Li, Tao Sun, Wei Zhou, Xuechao Wei, Xulong Tang:
FlexBFS: a parallelism-aware implementation of breadth-first search on GPU.
279-280
- Michael Andersch, Chi Ching Chi, Ben H. H. Juurlink:
Programming parallel embedded and consumer applications in OpenMP superscalar.
281-282
- Jianlong Zhong, Bingsheng He:
An overview of Medusa: simplified graph processing on GPUs.
283-284
- Christophe Alias, Alain Darte, Alexandru Plesco:
Optimizing remote accesses for offloaded kernels: application to high-level synthesis for FPGA.
285-286
- Jian Tao, Marek Blazewicz, Steven R. Brandt:
Using GPU's to accelerate stencil-based computation kernels for the development of large scale scientific applications on heterogeneous systems.
287-288
- Bryan Marker, Andy Terrel, Jack Poulson, Don S. Batory, Robert A. van de Geijn:
Mechanizing the expert dense linear algebra developer.
289-290
- Cedric Nugteren, Henk Corporaal:
The boat hull model: adapting the roofline model to enable performance prediction for parallel computing.
291-292
- Min Feng, Rajiv Gupta, Laxmi N. Bhuyan:
Speculative parallelization on GPGPUs.
293-294
- Alexandra Jimborean, Philippe Clauss, Benoît Pradelle, Luis Mastrangelo, Vincent Loechner:
Adapting the polyhedral model as a framework for efficient speculative parallelization.
295-296
- Yifan Gong, Bingsheng He, Jianlong Zhong:
An overview of CMPI: network performance aware MPI in the cloud.
297-298
- Jungwon Kim, Sangmin Seo, Jun Lee, Jeongho Nah, Gangwon Jo, Jaejin Lee:
OpenCL as a unified programming model for heterogeneous CPU/GPU clusters.
299-300
- George Tzenakis, Angelos Papatriantafyllou, John Kesapides, Polyvios Pratikakis, Hans Vandierendonck, Dimitrios S. Nikolopoulos:
BDDT: : block-level dynamic dependence analysis for deterministic task-based parallelism.
301-302
- Shoaib Kamil, Derrick Coetzee, Scott Beamer, Henry Cook, Ekaterina Gonina, Jonathan Harper, Jeffrey Morlan, Armando Fox:
Portable parallel performance from sequential, productive, embedded domain-specific languages.
303-304
- Torsten Hoefler, Timo Schneider:
Communication-centric optimizations by dynamically detecting collective operations.
305-306
Poster session 2 (Tuesday)
- Donghui Zhang, Per-Åke Larson:
LHlf: lock-free linear hashing (poster paper).
307-308
- Shahar Timnat, Anastasia Braginsky, Alex Kogan, Erez Petrank:
Wait-free linked-lists.
309-310
- Minh Ngoc Dinh, David Abramson, Chao Jin, Andrew Gontarek, Bob Moench, Luiz De Rose:
Scalable parallel debugging with statistical assertions.
311-312
- Alexander Malkis, Anindya Banerjee:
Verification of software barriers.
313-314
- Anshul Mittal, Nikhil Jain, Thomas George, Yogish Sabharwal, Sameer Kumar:
Collective algorithms for sub-communicators.
315-316
- Joeri De Koster, Stefan Marr, Theo D'Hondt:
Synchronization views for event-loop actors.
317-318
- Zviad Metreveli, Nickolai Zeldovich, M. Frans Kaashoek:
CPHASH: a cache-partitioned hash table.
319-320
- John Robert Wernsing, Greg Stitt:
RACECAR: a heuristic for automatic function specialization on multi-core heterogeneous systems.
321-322
- Yujie Liu, Michael F. Spear:
A lock-free, array-based priority queue.
323-324
- Albert Noll, Thomas R. Gross:
An infrastructure for dynamic optimization of parallel programs.
325-326
- Fredrik Kjolstad, Torsten Hoefler, Marc Snir:
Automatic datatype generation and optimization.
327-328
- Jacob Burnim, Tayfun Elmas, George C. Necula, Koushik Sen:
NDetermin: inferring nondeterministic sequential specifications for parallelism correctness.
329-330
- Chang-Seo Park, Koushik Sen:
Concurrent breakpoints.
331-332
- Andrew Stone, John Dennis, Michelle Strout:
Establishing a Miniapp as a programmability proxy.
333-334
- Lei Jiang, Pragneshkumar B. Patel, George Ostrouchov, Ferdinand Jamitzky:
OpenMP-style parallelism in data-centered multicore computing with R.
335-336
- Yves Caniou, Daniel Diaz, Florian Richoux, Philippe Codognet, Salvador Abreu:
Performance analysis of parallel constraint-based local search.
337-338
Last update Thu May 24 04:40:44 2012
CET by the DBLP Team —
Data released under the ODC-BY 1.0 license — See also our legal information page