| 2013 | ||
|---|---|---|
| j10 | Wenjing Ma, Sriram Krishnamoorthy, Oreste Villa, Karol Kowalski, Gagan Agrawal: Optimizing tensor contraction expressions for hybrid CPU-GPU execution. Cluster Computing 16(1): 131-155 (2013) | |
| j9 | Nawab Ali, Sriram Krishnamoorthy, Mahantesh Halappanavar, Jeff Daily: Multi-Fault Tolerance for Cartesian Data Distributions. International Journal of Parallel Programming 41(3): 469-493 (2013) | |
| j8 | Marc-André Hermanns, Sriram Krishnamoorthy, Felix Wolf: A scalable infrastructure for the performance analysis of passive target synchronization. Parallel Computing 39(3): 132-145 (2013) | |
| 2012 | ||
| j7 | Jeff R. Hammond, Sriram Krishnamoorthy, Sameer Shende, Nichols A. Romero, Allen D. Malony: Performance characterization of global address space applications: a case study with NWChem. Concurrency and Computation: Practice and Experience 24(2): 135-154 (2012) | |
| j6 | Qingda Lu, Xiaoyang Gao, Sriram Krishnamoorthy, Gerald Baumgartner, J. Ramanujam, P. Sadayappan: Empirical performance model-driven data layout optimization and library call selection for tensor contraction expressions. J. Parallel Distrib. Comput. 72(3): 338-352 (2012) | |
| c57 | Daniel G. Chavarría-Miranda, Sriram Krishnamoorthy, Abhinav Vishnu: Global Futures: A Multithreaded Execution Model for Global Arrays-based Applications. CCGRID 2012: 393-401 | |
| c56 | Wenjing Ma, Sriram Krishnamoorthy, Gagan Agrawal: Parameterized micro-benchmarking: an auto-tuning approach for complex applications. Conf. Computing Frontiers 2012: 213-222 | |
| c55 | Jeff Daily, Sriram Krishnamoorthy, Ananth Kalyanaraman: Towards scalable optimal sequence homology detection. HiPC 2012: 1-8 | |
| c54 | Jonathan Lifflander, Sriram Krishnamoorthy, Laxmikant V. Kalé: Work stealing and persistence-based load balancers for iterative overdecomposed applications. HPDC 2012: 137-148 | |
| c53 | Ajay Panyala, Daniel G. Chavarría-Miranda, Sriram Krishnamoorthy: On the Use of Term Rewriting for Performance Ooptimization of Legacy HPC Applications. ICPP 2012: 399-409 | |
| c52 | Wenjing Ma, Sriram Krishnamoorthy: Data-driven fault tolerance for work stealing computations. ICS 2012: 79-90 | |
| c51 | Humayun Arafat, P. Sadayappan, James Dinan, Sriram Krishnamoorthy, Theresa L. Windus: Load Balancing of Dynamical Nucleation Theory Monte Carlo Simulations through Resource Sharing Barriers. IPDPS 2012: 285-295 | |
| c50 | James Dinan, Pavan Balaji, Jeff R. Hammond, Sriram Krishnamoorthy, Vinod Tipparaju: Supporting the Global Arrays PGAS Model Using MPI One-Sided Communication. IPDPS 2012: 739-750 | |
| 2011 | ||
| c49 | Wenjing Ma, Sriram Krishnamoorthy, Gagan Agrawal: Parameterized Micro-benchmarking: An Auto-tuning Approach for Complex Applications. PACT 2011: 181-182 | |
| c48 | Wenjing Ma, Sriram Krishnamoorthy, Gagan Agrawal: Practical Loop Transformations for Tensor Contraction Expressions on Multi-level Memory Hierarchies. CC 2011: 266-285 | |
| c47 | Nawab Ali, Sriram Krishnamoorthy, Mahantesh Halappanavar, Jeff Daily: Tolerating correlated failures for generalized Cartesian distributions via bipartite matching. Conf. Computing Frontiers 2011: 36 | |
| c46 | Nawab Ali, Sriram Krishnamoorthy, Niranjan Govind, Karol Kowalski, Ponnuswamy Sadayappan: Application-Specific Fault Tolerance via Data Access Characterization. Euro-Par (2) 2011: 340-352 | |
| c45 | Nawab Ali, Sriram Krishnamoorthy, Niranjan Govind, Bruce Palmer: A Redundant Communication Approach to Scalable Fault Tolerance in PGAS Programming Models. PDP 2011: 24-31 | |
| c44 | Vijay A. Saraswat, Prabhanjan Kambadur, Sreedhar B. Kodali, David Grove, Sriram Krishnamoorthy: Lifeline-based global load balancing. PPOPP 2011: 201-212 | |
| c43 | James Dinan, Sriram Krishnamoorthy, Pavan Balaji, Jeff R. Hammond, Manojkumar Krishnan, Vinod Tipparaju, Abhinav Vishnu: Noncollective Communicator Creation in MPI. EuroMPI 2011: 282-291 | |
| c42 | James Dinan, Pavan Balaji, Jeff R. Hammond, Sriram Krishnamoorthy, Vinod Tipparaju: Poster: High-level, one-sided programming models on MPI: a case study with global arrays and NWChem. SC Companion 2011: 37-38 | |
| c41 | Karol Kowalski, Sriram Krishnamoorthy, Ryan M. Olson, Vinod Tipparaju, Edoardo Aprà: Scalable implementations of accurate excited-state coupled cluster theories: application of high-level methods to porphyrin-based systems. SC 2011: 72 | |
| c40 | Ronald Minnich, Curtis L. Janssen, Sriram Krishnamoorthy, Andres Marquez, Wenjing Ma, Maya Gokhale, Ponnuswamy Sadayappan, Eric Van Hensbergen, Jonathan Appavoo, Jim McKie: Poster: FOX: a fault-oblivious extreme scale execution environment. SC Companion 2011: 91-92 | |
| 2010 | ||
| c39 | Sriram Krishnamoorthy, Khushbu Agarwal: Scalable Communication Trace Compression. CCGRID 2010: 408-417 | |
| c38 | James Dinan, Arjun Singri, P. Sadayappan, Sriram Krishnamoorthy: Selective Recovery from Failures in a Task Parallel Programming Model. CCGRID 2010: 709-714 | |
| c37 | Wenjing Ma, Sriram Krishnamoorthy, Oreste Villa, Karol Kowalski: Acceleration of Streamed Tensor Contraction Expressions on GPGPU-Based Clusters. CLUSTER 2010: 207-216 | |
| c36 | Long Chen, Oreste Villa, Sriram Krishnamoorthy, Guang R. Gao: Dynamic load balancing on single- and multi-GPU systems. IPDPS 2010: 1-12 | |
| c35 | Oreste Villa, Long Chen, Sriram Krishnamoorthy: High performance Molecular Dynamic simulation on single and multi-GPU systems. ISCAS 2010: 3805-3808 | |
| 2009 | ||
| j5 | Nagavijayalakshmi Vydyanathan, Sriram Krishnamoorthy, Gerald M. Sabin, Ümit V. Çatalyürek, Tahsin M. Kurç, P. Sadayappan, Joel H. Saltz: An Integrated Approach to Locality-Conscious Processor Allocation and Scheduling of Mixed-Parallel Applications. IEEE Trans. Parallel Distrib. Syst. 20(8): 1158-1172 (2009) | |
| c34 | Qingda Lu, Christophe Alias, Uday Bondhugula, Thomas Henretty, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan, Yongjian Chen, Haibo Lin, Tin-fook Ngai: Data Layout Transformation for Enhancing Data Locality on NUCA Chip Multiprocessors. PACT 2009: 348-357 | |
| c33 | Oreste Villa, Sriram Krishnamoorthy, Jarek Nieplocha, David M. Brown Jr.: Scalable transparent checkpoint-restart of global address space applications on virtual machines over infiniband. Conf. Computing Frontiers 2009: 197-206 | |
| c32 | Albert Hartono, Muthu Manikandan Baskaran, Cédric Bastoul, Albert Cohen, Sriram Krishnamoorthy, Boyana Norris, J. Ramanujam, P. Sadayappan: Parametric multi-level tiling of imperfectly nested loops. ICS 2009: 147-157 | |
| c31 | James Dinan, D. Brian Larkins, P. Sadayappan, Sriram Krishnamoorthy, Jarek Nieplocha: Scalable work stealing. SC 2009 | |
| 2008 | ||
| c30 | Uday Bondhugula, Muthu Manikandan Baskaran, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan: Automatic Transformations for Communication-Minimized Parallelization and Locality Optimization in the Polyhedral Model. CC 2008: 132-146 | |
| c29 | Jarek Nieplocha, Sriram Krishnamoorthy, Marat Valiev, Manojkumar Krishnan, Bruce Palmer, P. Sadayappan: Integrated Data and Task Management for Scientific Applications. ICCS (1) 2008: 20-31 | |
| c28 | Guojing Cong, Sreedhar B. Kodali, Sriram Krishnamoorthy, Doug Lea, Vijay A. Saraswat, Tong Wen: Solving Large, Irregular Graph Problems Using Adaptive Work-Stealing. ICPP 2008: 536-545 | |
| c27 | James Dinan, Sriram Krishnamoorthy, D. Brian Larkins, Jarek Nieplocha, P. Sadayappan: Scioto: A Framework for Global-View Task Parallelism. ICPP 2008: 586-593 | |
| c26 | Muthu Manikandan Baskaran, Uday Bondhugula, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan: A compiler framework for optimization of affine loop nests for gpgpus. ICS 2008: 225-234 | |
| c25 | Uday Bondhugula, Muthu Manikandan Baskaran, Albert Hartono, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan: Towards effective automatic parallelization for multicore systems. IPDPS 2008: 1-5 | |
| c24 | Muthu Manikandan Baskaran, Uday Bondhugula, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan: Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories. PPOPP 2008: 1-10 | |
| c23 | D. Brian Larkins, James Dinan, Sriram Krishnamoorthy, Srinivasan Parthasarathy, Atanas Rountev, P. Sadayappan: Global trees: a framework for linked data structures on distributed memory parallel systems. SC 2008: 57 | |
| 2007 | ||
| j4 | Xiaoyang Gao, Sriram Krishnamoorthy, Swarup Kumar Sahoo, Chi-Chung Lam, Gerald Baumgartner, J. Ramanujam, P. Sadayappan: Efficient search-space pruning for integrated fusion and tiling transformations. Concurrency and Computation: Practice and Experience 19(18): 2425-2443 (2007) | |
| c22 | Sriram Krishnamoorthy, Juan Piernas, Vinod Tipparaju, Jarek Nieplocha, P. Sadayappan: Non-collective parallel I/O for global address space programming models. CLUSTER 2007: 41-49 | |
| c21 | Sriram Krishnamoorthy, Ümit V. Çatalyürek, Jarek Nieplocha, Atanas Rountev, P. Sadayappan: A global address space framework for locality aware scheduling of block-sparse computations. IPDPS 2007: 1-8 | |
| c20 | Sriram Krishnamoorthy, Muthu Manikandan Baskaran, Uday Bondhugula, J. Ramanujam, Atanas Rountev, P. Sadayappan: Effective automatic parallelization of stencil computations. PLDI 2007: 235-244 | |
| 2006 | ||
| j3 | Sandhya Krishnan, Sriram Krishnamoorthy, Gerald Baumgartner, Chi-Chung Lam, J. Ramanujam, P. Sadayappan, Venkatesh Choppella: Efficient synthesis of out-of-core algorithms using a nonlinear optimization solver. J. Parallel Distrib. Comput. 66(5): 659-673 (2006) | |
| j2 | Sriram Krishnamoorthy, Gerald Baumgartner, Chi-Chung Lam, Jarek Nieplocha, P. Sadayappan: Layout transformation support for the disk resident arrays framework. The Journal of Supercomputing 36(2): 153-170 (2006) | |
| c19 | Qingda Lu, Sriram Krishnamoorthy, P. Sadayappan: Combining analytical and empirical approaches in tuning matrix transposition. PACT 2006: 233-242 | |
| c18 | Nagavijayalakshmi Vydyanathan, Sriram Krishnamoorthy, Gerald Sabin, Ümit V. Çatalyürek, Tahsin M. Kurç, P. Sadayappan, Joel H. Saltz: Locality Conscious Processor Allocation and Scheduling for Mixed Parallel Applications. CLUSTER 2006 | |
| c17 | Gaurav Khanna, Nagavijayalakshmi Vydyanathan, Ümit V. Çatalyürek, Tahsin M. Kurç, Sriram Krishnamoorthy, P. Sadayappan, Joel H. Saltz: Task Scheduling and File Replication for Data-Intensive Jobs with Batch-shared I/O. HPDC 2006: 241-252 | |
| c16 | Albert Hartono, Qingda Lu, Xiaoyang Gao, Sriram Krishnamoorthy, Marcel Nooijen, Gerald Baumgartner, David E. Bernholdt, Venkatesh Choppella, Russell M. Pitzer, J. Ramanujam, Atanas Rountev, P. Sadayappan: Identifying Cost-Effective Common Subexpressions to Reduce Operation Count in Tensor Contraction Evaluations. International Conference on Computational Science (1) 2006: 267-275 | |
| c15 | Nagavijayalakshmi Vydyanathan, Sriram Krishnamoorthy, Gerald Sabin, Ümit V. Çatalyürek, Tahsin M. Kurç, P. Sadayappan, Joel H. Saltz: An Integrated Approach for Processor Allocation and Scheduling of Mixed-Parallel Applications. ICPP 2006: 443-450 | |
| c14 | Sriram Krishnamoorthy, Ümit V. Çatalyürek, Jarek Nieplocha, Atanas Rountev, P. Sadayappan: An extensible global address space framework with decoupled task and data abstractions. IPDPS 2006 | |
| c13 | Sriram Krishnamoorthy, Ümit V. Çatalyürek, Jarek Nieplocha, P. Sadayappan: An approach to locality-conscious load balancing and transparent memory hierarchy management with a global-address-space parallel programming model. IPDPS 2006 | |
| c12 | Sriram Krishnamoorthy, Ümit V. Çatalyürek, Jarek Nieplocha, Atanas Rountev, P. Sadayappan: Data management and query - Hypergraph partitioning for automatic memory hierarchy management. SC 2006: 98 | |
| c11 | Michael Blocksome, Charles Archer, Todd Inglett, Patrick McCarthy, Michael Mundy, Joe Ratterman, A. Sidelnik, Brian E. Smith, George Almási, José G. Castaños, Derek Lieber, José E. Moreira, Sriram Krishnamoorthy, Vinod Tipparaju, Jarek Nieplocha: Blue Gene system software - Design and implementation of a one-sided communication interface for the IBM eServer Blue Gene® supercomputer. SC 2006: 120 | |
| 2005 | ||
| c10 | Sriram Krishnamoorthy, Jarek Nieplocha, P. Sadayappan: Data and Computation Abstractions for Dynamic and Irregular Computations. HiPC 2005: 258-269 | |
| c9 | Swarup Kumar Sahoo, Rajkiran Panuganti, Sriram Krishnamoorthy, P. Sadayappan: Cache Miss Characterization and Data Locality Optimization for Imperfectly Nested Loops on Shared Memory Multiprocessors. IPDPS 2005 | |
| c8 | Xiaoyang Gao, Sriram Krishnamoorthy, Swarup Kumar Sahoo, Chi-Chung Lam, Gerald Baumgartner, J. Ramanujam, P. Sadayappan: Efficient Search-Space Pruning for Integrated Fusion and Tiling Transformations. LCPC 2005: 215-229 | |
| c7 | Swarup Kumar Sahoo, Sriram Krishnamoorthy, Rajkiran Panuganti, P. Sadayappan: Integrated Loop Optimizations for Data Locality Enhancement of Tensor Contraction Expressions. SC 2005: 13 | |
| 2004 | ||
| j1 | Sriram Krishnamoorthy, Gerald Baumgartner, Daniel Cociorva, Chi-Chung Lam, P. Sadayappan: Efficient parallel out-of-core matrix transposition. IJHPCN 2(2/3/4): 110-119 (2004) | |
| c6 | Sriram Krishnamoorthy, Gerald Baumgartner, Chi-Chung Lam, Jarek Nieplocha, P. Sadayappan: Efficient Layout Transformation for Disk-Based Multidimensional Arrays. HiPC 2004: 386-398 | |
| c5 | Sandhya Krishnan, Sriram Krishnamoorthy, Gerald Baumgartner, Chi-Chung Lam, J. Ramanujam, P. Sadayappan, Venkatesh Choppella: Efficient Synthesis of Out-of-Core Algorithms Using a Nonlinear Optimization Solver. IPDPS 2004 | |
| c4 | Qingda Lu, Xiaoyang Gao, Sriram Krishnamoorthy, Gerald Baumgartner, J. Ramanujam, P. Sadayappan: Empirical Performance-Model Driven Data Layout Optimization. LCPC 2004: 72-86 | |
| 2003 | ||
| c3 | Sudha Srinivasan, Sriram Krishnamoorthy, P. Sadayappan: A Robust Scheduling Strategy for Moldable Scheduling of Parallel Jobs. CLUSTER 2003: 92-99 | |
| c2 | Sriram Krishnamoorthy, Gerald Baumgartner, Daniel Cociorva, Chi-Chung Lam, P. Sadayappan: Efficient Parallel Out-of-Core Matrix Transposition. CLUSTER 2003: 300-307 | |
| c1 | Sandhya Krishnan, Sriram Krishnamoorthy, Gerald Baumgartner, Daniel Cociorva, Chi-Chung Lam, P. Sadayappan, J. Ramanujam, David E. Bernholdt, Venkatesh Choppella: Data Locality Optimization for Synthesis of Efficient Out-of-Core Algorithms. HiPC 2003: 406-417 | |
Data released under the ODC-BY 1.0 license — See also our legal information page