| 2013 | ||
|---|---|---|
| j10 | Torsten Hoefler, Kamil Iskra: Operating systems and runtime environments on supercomputers. IJHPCA 27(2): 123 (2013) | |
| c65 | Andrew Friedley, Torsten Hoefler, Greg Bronevetsky, Andrew Lumsdaine, Ching-Chen Ma: Ownership passing: efficient distributed memory programming on multi-core systems. PPOPP 2013: 177-186 | |
| 2012 | ||
| j9 | Torsten Hoefler, Kamil Iskra: Operating systems and runtime environments on supercomputers. IJHPCA 26(2): 93-94 (2012) | |
| j8 | Torsten Hoefler: Extensions for next-generation parallel programming models. Parallel Computing 38(1-2): 1 (2012) | |
| c64 | Torsten Hoefler, Timo Schneider: Runtime detection and optimization of collective communication patterns. PACT 2012: 263-272 | |
| c63 | Peter Gottschling, Torsten Hoefler: Productive Parallel Linear Algebra Programming with Unstructured Topology Adaption. CCGRID 2012: 9-16 | |
| c62 | Greg Bauer, Steven Gottlieb, Torsten Hoefler: Performance Modeling and Comparative Analysis of the MILC Lattice QCD Application su3_rmd. CCGRID 2012: 652-659 | |
| c61 | Simone Pellegrini, Torsten Hoefler, Thomas Fahringer: On the Effects of CPU Caches on MPI Point-to-Point Communications. CLUSTER 2012: 495-503 | |
| c60 | Kishor Kharbas, Donghoon Kim, Torsten Hoefler, Frank Mueller: Assessing HPC Failure Detectors for MPI Jobs. PDP 2012: 81-88 | |
| c59 | Torsten Hoefler, Timo Schneider: Communication-centric optimizations by dynamically detecting collective operations. PPOPP 2012: 305-306 | |
| c58 | Fredrik Kjolstad, Torsten Hoefler, Marc Snir: Automatic datatype generation and optimization. PPOPP 2012: 327-328 | |
| c57 | Simone Pellegrini, Torsten Hoefler, Thomas Fahringer: Exact Dependence Analysis for Increased Communication Overlap. EuroMPI 2012: 89-99 | |
| c56 | Timo Schneider, Robert Gerstenberger, Torsten Hoefler: Micro-applications for Communication Data Access Patterns and MPI Datatypes. EuroMPI 2012: 121-131 | |
| c55 | Torsten Hoefler, James Dinan, Darius Buntinas, Pavan Balaji, Brian W. Barrett, Ron Brightwell, William Gropp, Vivek Kale, Rajeev Thakur: Leveraging MPI's One-Sided Communication Interface for Shared-Memory Programming. EuroMPI 2012: 132-141 | |
| c54 | Torsten Hoefler, Timo Schneider: Optimization principles for collective neighborhood communications. SC 2012: 98 | |
| c53 | Vivek Kale, Todd Gamblin, Torsten Hoefler, Bronis R. de Supinski, William D. Gropp: Abstract: Slack-Conscious Lightweight Loop Scheduling for Improving Scalability of Bulk-synchronous MPI Applications. SC Companion 2012: 1392 | |
| 2011 | ||
| j7 | Torsten Hoefler, Rolf Rabenseifner, Hubert Ritzdorf, Bronis R. de Supinski, Rajeev Thakur, Jesper Larsson Träff: The scalable process topology interface of MPI 2.2. Concurrency and Computation: Practice and Experience 23(4): 293-310 (2011) | |
| j6 | Pavan Balaji, Darius Buntinas, David Goodell, William Gropp, Torsten Hoefler, Sameer Kumar, Ewing L. Lusk, Rajeev Thakur, Jesper Larsson Träff: Mpi on millions of Cores. Parallel Processing Letters 21(1): 45-60 (2011) | |
| c52 | Timo Schneider, Sven Eckelmann, Torsten Hoefler, Wolfgang Rehm: Kernel-Based Offload of Collective Operations - Implementation, Evaluation and Lessons Learned. Euro-Par (2) 2011: 264-275 | |
| c51 | ||
| c50 | Jeremiah Willcock, Torsten Hoefler, Nicholas Gerard Edmonds, Andrew Lumsdaine: Active pebbles: parallel programming for data-driven applications. ICS 2011: 235-244 | |
| c49 | Jens Domke, Torsten Hoefler, Wolfgang E. Nagel: Deadlock-Free Oblivious Routing for Arbitrary Topologies. IPDPS 2011: 616-627 | |
| c48 | ||
| c47 | Eric Holk, William E. Byrd, Jeremiah Willcock, Torsten Hoefler, Arun Chauhan, Andrew Lumsdaine: Kanor - A Declarative Language for Explicit Communication. PADL 2011: 190-204 | |
| c46 | Jeremiah Willcock, Torsten Hoefler, Nicholas Gerard Edmonds, Andrew Lumsdaine: Active pebbles: a programming model for highly parallel fine-grained data-driven computations. PPOPP 2011: 305-306 | |
| c45 | Vishwanath Venkatesan, Mohamad Chaarawi, Edgar Gabriel, Torsten Hoefler: Design and Evaluation of Nonblocking Collective I/O Operations. EuroMPI 2011: 90-98 | |
| c44 | William Gropp, Torsten Hoefler, Rajeev Thakur, Jesper Larsson Träff: Performance Expectations and Guidelines for MPI Derived Datatypes. EuroMPI 2011: 150-159 | |
| c43 | Torsten Hoefler, Marc Snir: Writing Parallel Libraries with MPI - Common Practice, Issues, and Extensions. EuroMPI 2011: 345-355 | |
| 2010 | ||
| j5 | Torsten Hoefler: Software and Hardware Techniques for Power-Efficient HPC Networking. Computing in Science and Engineering 12(6): 30-37 (2010) | |
| j4 | Torsten Hoefler, Timo Schneider, Andrew Lumsdaine: Accurately measuring overhead, communication time and progression of blocking and nonblocking collective operations at massive scale. IJPEDS 25(4): 241-258 (2010) | |
| c42 | Jeremiah Willcock, Torsten Hoefler, Nicholas Gerard Edmonds, Andrew Lumsdaine: AM++: a generalized active message framework. PACT 2010: 401-410 | |
| c41 | Torsten Hoefler: Bridging Performance Analysis Tools and Analytic Performance Modeling for HPC. Euro-Par Workshops 2010: 483-491 | |
| c40 | Nick Edmonds, Torsten Hoefler, Andrew Lumsdaine: A space-efficient parallel algorithm for computing betweenness centrality in distributed memory. HiPC 2010: 1-10 | |
| c39 | L. Baba Arimilli, Ravi Arimilli, Vicente Chung, Scott Clark, Wolfgang E. Denzel, Ben C. Drerup, Torsten Hoefler, Jody B. Joyner, Jerry Lewis, Jian Li, Nan Ni, Ramakrishnan Rajamony: The PERCS High-Performance Interconnect. Hot Interconnects 2010: 75-82 | |
| c38 | Torsten Hoefler, Timo Schneider, Andrew Lumsdaine: LogGOPSim: simulating large-scale applications in the LogGOPS model. HPDC 2010: 597-604 | |
| c37 | Torsten Hoefler, Christian Siebert, Andrew Lumsdaine: Scalable communication protocols for dynamic sparse data exchange. PPOPP 2010: 159-168 | |
| c36 | Torsten Hoefler, William Gropp, Rajeev Thakur, Jesper Larsson Träff: Toward Performance Models of MPI Implementations for Understanding Application Scaling Issues. EuroMPI 2010: 21-30 | |
| c35 | Torsten Hoefler, Greg Bronevetsky, Brian Barrett, Bronis R. de Supinski, Andrew Lumsdaine: Efficient MPI Support for Advanced Hybrid Programming Models. EuroMPI 2010: 50-61 | |
| c34 | Torsten Hoefler, Steven Gottlieb: Parallel Zero-Copy Algorithms for Fast Fourier Transform and Conjugate Gradient Using MPI Datatypes. EuroMPI 2010: 132-141 | |
| c33 | Torsten Hoefler, Timo Schneider, Andrew Lumsdaine: Characterizing the Influence of System Noise on Large-Scale Applications by Simulation. SC 2010: 1-11 | |
| 2009 | ||
| j3 | Torsten Hoefler, Timo Schneider, Andrew Lumsdaine: The Effect of Network Noise on Large-Scale Collective Communications. Parallel Processing Letters 19(4): 573-593 (2009) | |
| j2 | Torsten Hoefler, Timo Schneider, Andrew Lumsdaine: LogGP in theory and practice - An in-depth analysis of modern interconnection networks and benchmarking methods for collective operations. Simulation Modelling Practice and Theory 17(9): 1511-1521 (2009) | |
| c32 | Prabhanjan Kambadur, Anshul Gupta, Torsten Hoefler, Andrew Lumsdaine: Demand-driven execution of static directed acyclic graphs using task parallelism. HiPC 2009: 284-293 | |
| c31 | Torsten Hoefler, Timo Schneider, Andrew Lumsdaine: Optimized Routing for Large-Scale InfiniBand Networks. Hot Interconnects 2009: 103-111 | |
| c30 | Torsten Hoefler, Christian Siebert, Andrew Lumsdaine: Group Operation Assembly Language - A Flexible Way to Express Collective Communication. ICPP 2009: 574-581 | |
| c29 | Torsten Hoefler, Timo Schneider, Andrew Lumsdaine: A power-aware, application-based performance study of modern commodity cluster interconnection networks. IPDPS 2009: 1-7 | |
| c28 | Torsten Hoefler, Timo Schneider, Andrew Lumsdaine: The impact of network noise at large-scale communication performance. IPDPS 2009: 1-8 | |
| c27 | ||
| c26 | Christian Kaiser, Torsten Hoefler, Boris Bierbaum, Thomas Bemmerl: Implementation and analysis of nonblocking collective operations on SCI networks. IPDPS 2009: 1-7 | |
| c25 | Torsten Hoefler, Andrew Lumsdaine, Jack Dongarra: Towards Efficient MapReduce Using MPI. PVM/MPI 2009: 240-249 | |
| 2008 | ||
| c24 | Timo Schneider, Torsten Hoefler, Simon Wunderlich, Torsten Mehlan, Wolfgang Rehm: An Optimized ZGEMM Implementation for the Cell BE. PASA 2008: 113-122 | |
| c23 | Torsten Hoefler, Andrew Lumsdaine: Overlapping Communication and Computation with High Level Communication Routines. CCGRID 2008: 572-577 | |
| c22 | Torsten Hoefler, Timo Schneider, Andrew Lumsdaine: Multistage switches are not crossbars: Effects of static routing in high-performance networks. CLUSTER 2008: 116-125 | |
| c21 | Torsten Hoefler, Andrew Lumsdaine: Message progression in parallel computing - to thread or not to thread? CLUSTER 2008: 213-222 | |
| c20 | Patrick Geoffray, Torsten Hoefler: Adaptive Routing Strategies for Modern High Performance Networks. Hot Interconnects 2008: 165-172 | |
| c19 | Torsten Hoefler, Andrew Lumsdaine: Optimizing non-blocking collective operations for infiniband. IPDPS 2008: 1-8 | |
| c18 | Torsten Hoefler, Timo Schneider, Andrew Lumsdaine: Accurately measuring collective operations at massive scale. IPDPS 2008: 1-8 | |
| c17 | Torsten Hoefler, Florian Lorenzen, Andrew Lumsdaine: Sparse Non-blocking Collectives in Quantum Mechanical Calculations. PVM/MPI 2008: 55-63 | |
| c16 | Torsten Hoefler, Maraike Schellmann, Sergei Gorlatch, Andrew Lumsdaine: Communication Optimization for Medical Image Reconstruction Algorithms. PVM/MPI 2008: 75-83 | |
| c15 | Torsten Hoefler, Peter Gottschling, Andrew Lumsdaine: Leveraging non-blocking collective communication in high-performance applications. SPAA 2008: 113-115 | |
| 2007 | ||
| j1 | Torsten Hoefler, Peter Gottschling, Andrew Lumsdaine, Wolfgang Rehm: Optimizing a conjugate gradient solver with non-blocking collective operations. Parallel Computing 33(9): 624-633 (2007) | |
| c14 | Torsten Hoefler, Torsten Mehlan, Andrew Lumsdaine, Wolfgang Rehm: Netgauge: A Network Performance Measurement Framework. HPCC 2007: 659-671 | |
| c13 | Torsten Hoefler, Andre Lichei, Wolfgang Rehm: Low-Overhead LogGP Parameter Assessment for Modern Interconnection Networks. IPDPS 2007: 1-8 | |
| c12 | Torsten Hoefler, Christian Siebert, Wolfgang Rehm: A practically constant-time MPI Broadcast Algorithm for large-scale InfiniBand Clusters with Multicast. IPDPS 2007: 1-8 | |
| c11 | Torsten Hoefler, Prabhanjan Kambadur, Richard L. Graham, Galen M. Shipman, Andrew Lumsdaine: A Case for Standard Non-blocking Collective Operations. PVM/MPI 2007: 125-134 | |
| c10 | Torsten Hoefler, Andrew Lumsdaine, Wolfgang Rehm: Implementation and performance analysis of non-blocking collective operations for MPI. SC 2007: 52 | |
| 2006 | ||
| c9 | Torsten Hoefler, Torsten Mehlan, Frank Mietke, Wolfgang Rehm: Adding Low-Cost Hardware Barrier Support to Small Commodity Clusters. ARCS Workshops 2006: 343-350 | |
| c8 | Frank Mietke, Robert Rex, Robert Baumgartl, Torsten Mehlan, Torsten Hoefler, Wolfgang Rehm: Analysis of the Memory Registration Process in the Mellanox InfiniBand Software Stack. Euro-Par 2006: 124-133 | |
| c7 | Torsten Hoefler, Torsten Mehlan, Frank Mietke, Wolfgang Rehm: Fast barrier synchronization for InfiniBand/spl trade/. IPDPS 2006 | |
| c6 | Torsten Hoefler, Torsten Mehlan, Frank Mietke, Wolfgang Rehm: LogfP - a model for small messages in InfiniBand. IPDPS 2006 | |
| c5 | Torsten Hoefler, Jeffrey M. Squyres, Wolfgang Rehm, Andrew Lumsdaine: A Case for Non-blocking Collective Operations. ISPA Workshops 2006: 155-164 | |
| c4 | Torsten Mehlan, Jochen Strunk, Torsten Hoefler, Frank Mietke, Wolfgang Rehm: IRS - A Portable Interface for Reconfigurable Systems. PARELEC 2006: 187-191 | |
| c3 | Torsten Hoefler, Carsten Viertel, Torsten Mehlan, Frank Mietke, Wolfgang Rehm: Assessing Single-Message and Multi-Node Communication Performance of InfiniBand. PARELEC 2006: 227-232 | |
| c2 | Torsten Hoefler, Peter Gottschling, Wolfgang Rehm, Andrew Lumsdaine: Optimizing a Conjugate Gradient Solver with Non-Blocking Collective Operations. PVM/MPI 2006: 374-382 | |
| 2005 | ||
| c1 | Torsten Hoefler, Lavinio Cerquetti, Torsten Mehlan, Frank Mietke, Wolfgang Rehm: A Practical Approach to the Rating of Barrier Algorithms Using the LogP Model and Open MPI. ICPP Workshops 2005: 562-569 | |
Colors in the list of coauthors
Last update Wed May 22 03:24:40 2013 CET by the DBLP Team —
Data released under the ODC-BY 1.0 license — See also our legal information page