| 2012 | ||
|---|---|---|
| c61 | Milind Chabbi, John M. Mellor-Crummey: DeadSpy: a tool to pinpoint program inefficiencies. CGO 2012: 124-134 | |
| 2011 | ||
| c60 | ||
| c59 | Nathan R. Tallent, John M. Mellor-Crummey, Michael Franco, Reed Landrum, Laksono Adhianto: Scalable fine-grained call path tracing. ICS 2011: 63-74 | |
| c58 | Guohua Jin, John M. Mellor-Crummey, Laksono Adhianto, William N. Scherer III, Chaoran Yang: Implementation and Performance Evaluation of the HPC Challenge Benchmarks in Coarray Fortran 2.0. IPDPS 2011: 1089-1100 | |
| c57 | ||
| e2 | Keith D. Cooper, John M. Mellor-Crummey, Vivek Sarkar (Eds.): Languages and Compilers for Parallel Computing - 23rd International Workshop, LCPC 2010, Houston, TX, USA, October 7-9, 2010. Revised Selected Papers. Lecture Notes in Computer Science 6548, Springer 2011, isbn 978-3-642-19594-5 | |
| 2010 | ||
| j23 | Laksono Adhianto, S. Banerjee, Michael W. Fagan, Mark Krentel, Gabriel Marin, John M. Mellor-Crummey, Nathan R. Tallent: HPCTOOLKIT: tools for performance analysis of optimized parallel programs. Concurrency and Computation: Practice and Experience 22(6): 685-701 (2010) | |
| j22 | John M. Mellor-Crummey, William Gropp, Maurice Herlihy: Teaching parallel programming: a roundtable discussion. ACM Crossroads 17(1): 28-30 (2010) | |
| c56 | Laksono Adhianto, John M. Mellor-Crummey, Nathan R. Tallent: Effectively Presenting Call Path Profiles of Application Performance. ICPP Workshops 2010: 179-188 | |
| c55 | Nathan R. Tallent, John M. Mellor-Crummey, Allan Porterfield: Analyzing lock contention in multithreaded applications. PPOPP 2010: 269-280 | |
| c54 | Nathan R. Tallent, Laksono Adhianto, John M. Mellor-Crummey: Scalable Identification of Load Imbalance in Parallel Executions Using Call Path Profiles. SC 2010: 1-11 | |
| 2009 | ||
| j21 | Nathan R. Tallent, John M. Mellor-Crummey: Identifying Performance Bottlenecks in Work-Stealing Computations. IEEE Computer 42(11): 44-50 (2009) | |
| c53 | Nathan R. Tallent, John M. Mellor-Crummey, Michael W. Fagan: Binary analysis for measurement and attribution of program performance. PLDI 2009: 441-452 | |
| c52 | Nathan R. Tallent, John M. Mellor-Crummey: Effective performance measurement and analysis of multithreaded applications. PPOPP 2009: 229-240 | |
| c51 | Nathan R. Tallent, John M. Mellor-Crummey, Laksono Adhianto, Michael W. Fagan, Mark Krentel: Diagnosing performance bottlenecks in emerging petascale applications. SC 2009 | |
| 2008 | ||
| c50 | Gabriel Marin, John M. Mellor-Crummey: Pinpointing and Exploiting Opportunities for Enhancing Data Reuse. ISPASS 2008: 115-126 | |
| c49 | ||
| 2007 | ||
| c48 | Cristian Coarfa, John M. Mellor-Crummey, Nathan Froyd, Yuri Dotsenko: Scalability analysis of SPMD codes using expectations. ICS 2007: 13-22 | |
| c47 | Gabriel Marin, John M. Mellor-Crummey: Application Insight Through Performance Modeling. IPCCC 2007: 65-74 | |
| e1 | Katherine A. Yelick, John M. Mellor-Crummey (Eds.): Proceedings of the 12th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP 2007, San Jose, California, USA, March 14-17, 2007. ACM 2007, isbn 978-1-59593-602-8 | |
| 2006 | ||
| j20 | Yuri Dotsenko, Cristian Coarfa, Luay Nakhleh, John M. Mellor-Crummey, Usman Roshan: PRec-I-DCM3: a parallel framework for fast and accurate large-scale phylogeny reconstruction. IJBRA 2(4): 407-419 (2006) | |
| j19 | Cristian Coarfa, Yuri Dotsenko, John M. Mellor-Crummey: Experiences with Sweep3D implementations in Co-array Fortran. The Journal of Supercomputing 36(2): 101-121 (2006) | |
| j18 | Apan Qasem, Ken Kennedy, John M. Mellor-Crummey: Automatic tuning of whole applications using direct search and a performance-based transformation system. The Journal of Supercomputing 36(2): 183-196 (2006) | |
| 2005 | ||
| j17 | Guohua Jin, John M. Mellor-Crummey: Improving Performance by Reducing the Memory Footprint of Scientific Applications. IJHPCA 19(4): 433-451 (2005) | |
| j16 | Francine Berman, Henri Casanova, Andrew A. Chien, Keith D. Cooper, Holly Dail, Anshuman Dasgupta, W. Deng, Jack Dongarra, S. Lennart Johnsson, Ken Kennedy, Charles Koelbel, Bo Liu, Xin Liu, Anirban Mandal, Gabriel Marin, Mark Mazina, John M. Mellor-Crummey, Celso L. Mendes, Alex Olugbile, M. Patel, Daniel A. Reed, Zhiao Shi, Otto Sievert, Huaxia Xia, Asim YarKhan: New Grid Scheduling and Rescheduling Methods in the GrADS Project. International Journal of Parallel Programming 33(2-3): 209-229 (2005) | |
| j15 | Guohua Jin, John M. Mellor-Crummey: SFCGen: A framework for efficient generation of multi-dimensional space-filling curves by recursion. ACM Trans. Math. Softw. 31(1): 120-148 (2005) | |
| c46 | Guohua Jin, John M. Mellor-Crummey: Space-filling Curve Generation: A Table-based Approach. AMCS 2005: 40-46 | |
| c45 | Luay Nakhleh, Guohua Jin, Fengmei Zhao, John M. Mellor-Crummey: Reconstructing Phylogenetic Networks Using Maximum Parsimony. CSB 2005: 93-102 | |
| c44 | Anirban Mandal, Ken Kennedy, Charles Koelbel, Gabriel Marin, John M. Mellor-Crummey, Bo Liu, S. Lennart Johnsson: Scheduling strategies for mapping application workflows onto the grid. HPDC 2005: 125-134 | |
| c43 | Cristian Coarfa, Yuri Dotsenko, John M. Mellor-Crummey, Luay Nakhleh, Usman Roshan: PRec-I-DCM3: A Parallel Framework for Fast and Accurate Large Scale Phylogeny Reconstruction. ICPADS (2) 2005: 346-350 | |
| c42 | Nathan Froyd, John M. Mellor-Crummey, Robert J. Fowler: Low-overhead call path profiling of unmodified, optimized code. ICS 2005: 81-90 | |
| c41 | Daniel G. Chavarría-Miranda, Guohua Jin, John M. Mellor-Crummey: COTS Clusters vs. the Earth Simulator: An Application Study Using IMPACT-3D. IPDPS 2005 | |
| c40 | Michelle Mills Strout, John M. Mellor-Crummey, Paul D. Hovland: Representation-independent program analysis. PASTE 2005: 67-74 | |
| c39 | Daniel G. Chavarría-Miranda, John M. Mellor-Crummey: Effective communication coalescing for data-parallel applications. PPOPP 2005: 14-25 | |
| c38 | Cristian Coarfa, Yuri Dotsenko, John M. Mellor-Crummey, François Cantonnet, Tarek A. El-Ghazawi, Ashrujit Mohanti, Yiyi Yao, Daniel G. Chavarría-Miranda: An evaluation of global address space languages: co-array fortran and unified parallel C. PPOPP 2005: 36-47 | |
| 2004 | ||
| j14 | John M. Mellor-Crummey, John Garvin: Optimizing Sparse Matrix - Vector Product Computations Using Unroll and Jam. IJHPCA 18(2): 225-236 (2004) | |
| c37 | Yuri Dotsenko, Cristian Coarfa, John M. Mellor-Crummey: A Multi-Platform Co-Array Fortran Compiler. IEEE PACT 2004: 29-40 | |
| c36 | Anirban Mandal, Anshuman Dasgupta, Ken Kennedy, Mark Mazina, Charles Koelbel, Gabriel Marin, Keith D. Cooper, John M. Mellor-Crummey, Bo Liu, S. Lennart Johnsson: Scheduling workflow applications in GrADS. CCGRID 2004: 790-797 | |
| c35 | Keith D. Cooper, Anshuman Dasgupta, Ken Kennedy, Charles Koelbel, Anirban Mandal, Gabriel Marin, Mark Mazina, John M. Mellor-Crummey, Francine Berman, Henri Casanova, Andrew A. Chien, Holly Dail, Xin Liu, Alex Olugbile, Otto Sievert, Huaxia Xia, S. Lennart Johnsson, Bo Liu, M. Patel, Daniel A. Reed, W. Deng, Celso L. Mendes, Zhiao Shi, Asim YarKhan, Jack Dongarra: New Grid Scheduling and Rescheduling Methods in the GrADS Project. IPDPS Next Generation Software Program - NSFNGS - PI Workshop 2004 | |
| c34 | Yuri Dotsenko, Cristian Coarfa, John M. Mellor-Crummey, Daniel G. Chavarría-Miranda: Experiences with Co-array Fortran on Hardware Shared Memory Platforms. LCPC 2004: 332-347 | |
| c33 | Gabriel Marin, John M. Mellor-Crummey: Cross-architecture performance predictions for scientific applications using parameterized models. SIGMETRICS 2004: 2-13 | |
| 2003 | ||
| j13 | Daniel G. Chavarría-Miranda, John M. Mellor-Crummey: An Evaluation of Data-Parallel Compiler Support for Line-Sweep Applications. J. Instruction-Level Parallelism 5 (2003) | |
| j12 | Alain Darte, John M. Mellor-Crummey, Robert J. Fowler, Daniel G. Chavarría-Miranda: Generalized multipartitioning of multi-dimensional arrays for parallelizing line-sweep computations. J. Parallel Distrib. Comput. 63(9): 887-911 (2003) | |
| c32 | Cristian Coarfa, Yuri Dotsenko, Jason Eckhardt, John M. Mellor-Crummey: Co-array Fortran Performance and Potential: An NPB Experimental Study. LCPC 2003: 177-193 | |
| 2002 | ||
| j11 | John M. Mellor-Crummey, Vikram S. Adve, Bradley Broom, Daniel G. Chavarría-Miranda, Robert J. Fowler, Guohua Jin, Ken Kennedy, Qing Yi: Advanced optimization strategies in the Rice dHPF compiler. Concurrency and Computation: Practice and Experience 14(8-9): 741-767 (2002) | |
| j10 | John M. Mellor-Crummey, Robert J. Fowler, Gabriel Marin, Nathan R. Tallent: HPCVIEW: A Tool for Top-down Analysis of Node Performance. The Journal of Supercomputing 23(1): 81-104 (2002) | |
| c31 | Daniel G. Chavarría-Miranda, John M. Mellor-Crummey: An Evaluation of Data-Parallel Compiler Support for Line-Sweep Applications. IEEE PACT 2002: 7-17 | |
| c30 | Guohua Jin, John M. Mellor-Crummey: Experiences tuning SMG98: a semicoarsening multigrid benchmark based on the hypre library. ICS 2002: 305-314 | |
| c29 | Daniel G. Chavarría-Miranda, Alain Darte, Robert J. Fowler, John M. Mellor-Crummey: Generalized Multipartitioning for Multi-Dimensional Arrays. IPDPS 2002 | |
| c28 | Ken Kennedy, Mark Mazina, John M. Mellor-Crummey, Keith D. Cooper, Linda Torczon, Francine Berman, Andrew A. Chien, Holly Dail, Otto Sievert, Dave Angulo, Ian T. Foster, Ruth A. Aydt, Daniel A. Reed, Dennis Gannon, S. Lennart Johnsson, Carl Kesselman, Jack Dongarra, Sathish S. Vadhiyar, Richard Wolski: Toward a Framework for Preparing and Executing Adaptive Grid Programs. IPDPS 2002 | |
| 2001 | ||
| j9 | Francine Berman, Andrew A. Chien, Keith D. Cooper, Jack Dongarra, Ian T. Foster, Dennis Gannon, S. Lennart Johnsson, Ken Kennedy, Carl Kesselman, John M. Mellor-Crummey, Daniel A. Reed, Linda Torczon, Richard Wolski: The GrADS Project: Software Support for High-Level Grid Application Development. IJHPCA 15(4): 327-344 (2001) | |
| j8 | John M. Mellor-Crummey, David B. Whalley, Ken Kennedy: Improving Memory Hierarchy Performance for Irregular Applications Using Data and Computation Reorderings. International Journal of Parallel Programming 29(3): 217-247 (2001) | |
| j7 | Ken Kennedy, Bradley Broom, Keith D. Cooper, Jack Dongarra, Robert J. Fowler, Dennis Gannon, S. Lennart Johnsson, John M. Mellor-Crummey, Linda Torczon: Telescoping Languages: A Strategy for Automatic Generation of Scientific Problem-Solving Systems from Annotated Libraries. J. Parallel Distrib. Comput. 61(12): 1803-1826 (2001) | |
| c27 | Vikram S. Adve, John M. Mellor-Crummey: Advanced Code Generation for High Performance Fortran. Compiler Optimizations for Scalable Parallel Systems Languages 2001: 553-596 | |
| c26 | Daniel G. Chavarría-Miranda, John M. Mellor-Crummey, Trushar Sarang: Data-Parallel Compiler Support for Multipartitioning. Euro-Par 2001: 241-253 | |
| c25 | John M. Mellor-Crummey, Robert J. Fowler, David B. Whalley: Tools for application-oriented performance tuning. ICS 2001: 154-165 | |
| c24 | Guohua Jin, John M. Mellor-Crummey, Robert J. Fowler: Increasing temporal locality with skewing and recursive blocking. SC 2001: 43 | |
| c23 | John M. Mellor-Crummey, Robert J. Fowler, David B. Whalley: On providing useful information for analyzing and tuning applications. SIGMETRICS/Performance 2001: 332-333 | |
| 2000 | ||
| c22 | Kai Zhang, John M. Mellor-Crummey, Robert J. Fowler: Compilation and Runtime-Optimizations for Software Distributed Shared Memory. LCR 2000: 182-191 | |
| c21 | Daniel G. Chavarría-Miranda, John M. Mellor-Crummey: Toward Compiler Support for Scalable Parallelism Using Multipartitioning. LCR 2000: 272-284 | |
| 1999 | ||
| c20 | John M. Mellor-Crummey, David B. Whalley, Ken Kennedy: Improving memory hierarchy performance for irregular applications. International Conference on Supercomputing 1999: 425-433 | |
| c19 | Collin McCurdy, John M. Mellor-Crummey: An Evaluation of Computing Paradigms for N-Body Simulations on Distributed Memory Architectures. PPOPP 1999: 25-36 | |
| 1998 | ||
| j6 | John M. Mellor-Crummey, Vikram S. Adve: Simplifying Control Flow in Compiler-Generated Parallel Code. International Journal of Parallel Programming 26(5): 613-638 (1998) | |
| c18 | Bo Lu, John M. Mellor-Crummey: Compiler-Optimization of Implicit Reductions for Distributed Memory Multiprocessors. IPPS/SPDP 1998: 42-51 | |
| c17 | Vikram S. Adve, John M. Mellor-Crummey: Using Integer Sets for Data-Parallel Program Analysis and Optimization. PLDI 1998: 186-198 | |
| c16 | Vikram S. Adve, Guohua Jin, John M. Mellor-Crummey, Qing Yi: High Performance Fortran Compilation Techniques for Parallelizing Scientific Codes. SC 1998: 11 | |
| 1997 | ||
| c15 | John M. Mellor-Crummey, Vikram S. Adve: Simplifying Control Flow in Compiler-Generated Parallel Code. LCPC 1997: 235-239 | |
| c14 | Gerald Roth, John M. Mellor-Crummey, Ken Kennedy, R. Gregg Brickner: Compiling Stencils in High Performance Fortran. SC 1997: 12 | |
| 1995 | ||
| c13 | Ken Kennedy, John M. Mellor-Crummey, Gerald Roth: Optimizing Fortran 90 Shift Operations on Distributed-Memory Multicomputers. LCPC 1995: 161-175 | |
| c12 | Vikram S. Adve, John M. Mellor-Crummey, Mark Anderson, Ken Kennedy, Jhy-Chun Wang, Daniel A. Reed: An Integrated Compilation and Performance Analysis Environment for Data Parallel Programs. SC 1995: 50 | |
| 1994 | ||
| j5 | Michael L. Scott, John M. Mellor-Crummey: Fast, contention-free combining tree barriers for shared-memory multiprocessors. International Journal of Parallel Programming 22(4): 449-481 (1994) | |
| c11 | Seema Hiranandani, Ken Kennedy, John M. Mellor-Crummey, Ajay Sethi: Compilation techniques for block-cyclic distributions. International Conference on Supercomputing 1994: 392-403 | |
| 1993 | ||
| c10 | Mary W. Hall, John M. Mellor-Crummey, Alan Carle, René G. Rodríguez: FIAT: A Framework for Interprocedural Analysis and Transfomation. LCPC 1993: 522-545 | |
| c9 | John M. Mellor-Crummey: Compile-Time Support for Efficient Data Race Detection in Shared-Memory Parallel Programs. Workshop on Parallel and Distributed Debugging 1993: 129-139 | |
| 1992 | ||
| c8 | Ervan Darnell, John M. Mellor-Crummey, Ken Kennedy: Automatic software cache coherence through vectorization. ICS 1992: 129-138 | |
| 1991 | ||
| j4 | John M. Mellor-Crummey, Michael L. Scott: Algorithms for Scalable Synchronization on Shared-Memory Multiprocessors. ACM Trans. Comput. Syst. 9(1): 21-65 (1991) | |
| c7 | ||
| c6 | John M. Mellor-Crummey, Michael L. Scott: Scalable Reader-Writer Synchronization for Shared-Memory Multiprocessors. PPOPP 1991: 106-113 | |
| c5 | John M. Mellor-Crummey: On-the-fly detection of data races for programs with nested fork-join parallelism. SC 1991: 24-33 | |
| 1990 | ||
| j3 | Thomas J. LeBlanc, John M. Mellor-Crummey, Robert J. Fowler: Analyzing Parallel Program Executions Using Multiple Views. J. Parallel Distrib. Comput. 9(2): 203-217 (1990) | |
| c4 | Robert Hood, Ken Kennedy, John M. Mellor-Crummey: Parallel program debugging with on-the-fly anomaly detection. SC 1990: 74-81 | |
| 1989 | ||
| j2 | Thomas J. LeBlanc, John M. Mellor-Crummey, Neal M. Gafter, Lawrence A. Crowl, Peter C. Dibble: The Elmwood Multiprocessor Operating System. Softw., Pract. Exper. 19(11): 1029-1055 (1989) | |
| c3 | ||
| 1988 | ||
| c2 | ||
| c1 | Robert J. Fowler, Thomas J. LeBlanc, John M. Mellor-Crummey: An Integrated Approach to Parallel Program Debugging and Performance Analysis of Large-Scal Multiprocessors. Workshop on Parallel and Distributed Debugging 1988: 163-173 | |
| 1987 | ||
| j1 | Thomas J. LeBlanc, John M. Mellor-Crummey: Debugging Parallel Programs with Instant Replay. IEEE Trans. Computers 36(4): 471-482 (1987) | |
Data released under the ODC-BY 1.0 license — See also our legal information page