| 2009 | ||
|---|---|---|
| 181 | Vijay S. Kumar, P. Sadayappan, Gaurang Mehta, Karan Vahi, Ewa Deelman, Varun Ratnakar, Jihie Kim, Yolanda Gil, Mary W. Hall, Tahsin M. Kurç, Joel H. Saltz: An integrated framework for performance-based optimization of scientific workflows. HPDC 2009: 177-186 | |
| 180 | Albert Hartono, Muthu Manikandan Baskaran, Cédric Bastoul, Albert Cohen, Sriram Krishnamoorthy, Boyana Norris, J. Ramanujam, P. Sadayappan: Parametric multi-level tiling of imperfectly nested loops. ICS 2009: 147-157 | |
| 179 | Muthu Manikandan Baskaran, Nagavijayalakshmi Vydyanathan, Uday Bondhugula, J. Ramanujam, Atanas Rountev, P. Sadayappan: Compiler-assisted dynamic scheduling for effective parallelization of loop nests on multicore processors. PPOPP 2009: 219-228 | |
| 178 | Nagavijayalakshmi Vydyanathan, Sriram Krishnamoorthy, Gerald M. Sabin, Ümit V. Çatalyürek, Tahsin M. Kurç, P. Sadayappan, Joel H. Saltz: An Integrated Approach to Locality-Conscious Processor Allocation and Scheduling of Mixed-Parallel Applications. IEEE Trans. Parallel Distrib. Syst. 20(8): 1158-1172 (2009) | |
| 2008 | ||
| 177 | P. Sadayappan, Manish Parashar, Ramamurthy Badrinath, Viktor K. Prasanna: High Performance Computing - HiPC 2008, 15th International Conference, Bangalore, India, December 17-20, 2008. Proceedings Springer 2008 | |
| 176 | Uday Bondhugula, Muthu Manikandan Baskaran, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan: Automatic Transformations for Communication-Minimized Parallelization and Locality Optimization in the Polyhedral Model. CC 2008: 132-146 | |
| 175 | N. Desai, Pavan Balaji, P. Sadayappan, Mohammad Islam: Are nonblocking networks really needed for high-end-computing workloads? CLUSTER 2008: 152-159 | |
| 174 | Nawab Ali, Ananth Devulapalli, Dennis Dalessandro, Pete Wyckoff, P. Sadayappan: An OSD-based approach to managing directory operations in parallel file systems. CLUSTER 2008: 175-184 | |
| 173 | Jiang Lin, Qingda Lu, Xiaoning Ding, Zhao Zhang, Xiaodong Zhang, P. Sadayappan: Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems. HPCA 2008: 367-378 | |
| 172 | Gaurav Khanna, Ümit V. Çatalyürek, Tahsin M. Kurç, P. Sadayappan, Joel H. Saltz, Rajkumar Kettimuthu, Ian T. Foster: Multi-hop path splitting and multi-pathing optimizations for data transfers over shared wide-area networks using gridFTP. HPDC 2008: 225-226 | |
| 171 | Jarek Nieplocha, Sriram Krishnamoorthy, Marat Valiev, Manojkumar Krishnan, Bruce Palmer, P. Sadayappan: Integrated Data and Task Management for Scientific Applications. ICCS (1) 2008: 20-31 | |
| 170 | Nagavijayalakshmi Vydyanathan, Ümit V. Çatalyürek, Tahsin M. Kurç, P. Sadayappan, Joel H. Saltz: A Duplication Based Algorithm for Optimizing Latency Under Throughput Constraints for Streaming Workflows. ICPP 2008: 254-261 | |
| 169 | James Dinan, Sriram Krishnamoorthy, D. Brian Larkins, Jarek Nieplocha, P. Sadayappan: Scioto: A Framework for Global-View Task Parallelism. ICPP 2008: 586-593 | |
| 168 | Muthu Manikandan Baskaran, Uday Bondhugula, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan: A compiler framework for optimization of affine loop nests for gpgpus. ICS 2008: 225-234 | |
| 167 | Gaurav Khanna, Ümit V. Çatalyürek, Tahsin M. Kurç, Rajkumar Kettimuthu, P. Sadayappan, Joel H. Saltz: A dynamic scheduling approach for coordinated wide-area data transfers using GridFTP. IPDPS 2008: 1-12 | |
| 166 | Uday Bondhugula, Muthu Manikandan Baskaran, Albert Hartono, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan: Towards effective automatic parallelization for multicore systems. IPDPS 2008: 1-5 | |
| 165 | Uday Bondhugula, Albert Hartono, J. Ramanujam, P. Sadayappan: A practical automatic polyhedral parallelizer and locality optimizer. PLDI 2008: 101-113 | |
| 164 | Muthu Manikandan Baskaran, Uday Bondhugula, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan: Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories. PPOPP 2008: 1-10 | |
| 163 | Gaurav Khanna, Ümit V. Çatalyürek, Tahsin M. Kurç, Rajkumar Kettimuthu, P. Sadayappan, Ian T. Foster, Joel H. Saltz: Using overlays for efficient data transfer over shared wide-area networks. SC 2008: 47 | |
| 162 | D. Brian Larkins, James Dinan, Sriram Krishnamoorthy, Srinivasan Parthasarathy, Atanas Rountev, P. Sadayappan: Global trees: a framework for linked data structures on distributed memory parallel systems. SC 2008: 57 | |
| 161 | Aniruddha G. Shet, P. Sadayappan, David E. Bernholdt, Jarek Nieplocha, Vinod Tipparaju: A framework for characterizing overlap of communication and computation in parallel applications. Cluster Computing 11(1): 75-90 (2008) | |
| 160 | James Dinan, Stephen Olivier, Gerald Sabin, Jan Prins, P. Sadayappan, Chau-Wen Tseng: A message passing benchmark for unbalanced applications. Simulation Modelling Practice and Theory 16(9): 1177-1189 (2008) | |
| 2007 | ||
| 159 | Sriram Krishnamoorthy, Juan Piernas, Vinod Tipparaju, Jarek Nieplocha, P. Sadayappan: Non-collective parallel I/O for global address space programming models. CLUSTER 2007: 41-49 | |
| 158 | Nagavijayalakshmi Vydyanathan, Ümit V. Çatalyürek, Tahsin M. Kurç, P. Sadayappan, Joel H. Saltz: Toward Optimizing Latency Under Throughput Constraints for Application Workflows on Clusters. Euro-Par 2007: 173-183 | |
| 157 | Gaurav Khanna, Ümit V. Çatalyürek, Tahsin M. Kurç, P. Sadayappan, Joel H. Saltz: Scheduling File Transfers for Data-Intensive Jobs on Heterogeneous Clusters. Euro-Par 2007: 214-223 | |
| 156 | Mohammad Islam, Pavan Balaji, Gerald Sabin, P. Sadayappan: Analyzing and Minimizing the Impact of Opportunity Cost in QoS-aware Job Scheduling. ICPP 2007: 42 | |
| 155 | Sriram Krishnamoorthy, Ümit V. Çatalyürek, Jarek Nieplocha, Atanas Rountev, P. Sadayappan: A global address space framework for locality aware scheduling of block-sparse computations. IPDPS 2007: 1-8 | |
| 154 | James Dinan, Stephen Olivier, Gerald Sabin, Jan Prins, P. Sadayappan, Chau-Wen Tseng: Dynamic Load Balancing of Unbalanced Computations Using Message Passing. IPDPS 2007: 1-8 | |
| 153 | Sriram Krishnamoorthy, Muthu Manikandan Baskaran, Uday Bondhugula, J. Ramanujam, Atanas Rountev, P. Sadayappan: Effective automatic parallelization of stencil computations. PLDI 2007: 235-244 | |
| 152 | Uday Bondhugula, J. Ramanujam, P. Sadayappan: Automatic mapping of nested loops to FPGAS. PPOPP 2007: 101-111 | |
| 151 | Ananth Devulapalli, Dennis Dalessandro, Pete Wyckoff, Nawab Ali, P. Sadayappan: Integrating parallel file systems with object-based storage devices. SC 2007: 27 | |
| 150 | Xiaoyang Gao, Sriram Krishnamoorthy, Swarup Kumar Sahoo, Chi-Chung Lam, Gerald Baumgartner, J. Ramanujam, P. Sadayappan: Efficient search-space pruning for integrated fusion and tiling transformations. Concurrency and Computation: Practice and Experience 19(18): 2425-2443 (2007) | |
| 2006 | ||
| 149 | Eduard Ayguadé, Gerald Baumgartner, J. Ramanujam, P. Sadayappan: Languages and Compilers for Parallel Computing, 18th International Workshop, LCPC 2005, Hawthorne, NY, USA, October 20-22, 2005, Revised Selected Papers Springer 2006 | |
| 148 | Aniruddha G. Shet, P. Sadayappan, David E. Bernholdt, Jarek Nieplocha, Vinod Tipparaju: A Performance Instrumentation Framework to Characterize Computation-Communication Overlap in Message-Passing Systems. CLUSTER 2006 | |
| 147 | Nagavijayalakshmi Vydyanathan, Sriram Krishnamoorthy, Gerald Sabin, Ümit V. Çatalyürek, Tahsin M. Kurç, P. Sadayappan, Joel H. Saltz: Locality Conscious Processor Allocation and Scheduling for Mixed Parallel Applications. CLUSTER 2006 | |
| 146 | Uday Bondhugula, Ananth Devulapalli, James Dinan, Joseph Fernando, Pete Wyckoff, Eric Stahlberg, P. Sadayappan: Hardware/Software Integration for FPGA-based All-Pairs Shortest-Paths. FCCM 2006: 152-164 | |
| 145 | Gaurav Khanna, Nagavijayalakshmi Vydyanathan, Ümit V. Çatalyürek, Tahsin M. Kurç, Sriram Krishnamoorthy, P. Sadayappan, Joel H. Saltz: Task Scheduling and File Replication for Data-Intensive Jobs with Batch-shared I/O. HPDC 2006: 241-252 | |
| 144 | Nagavijayalakshmi Vydyanathan, Sriram Krishnamoorthy, Gerald Sabin, Ümit V. Çatalyürek, Tahsin M. Kurç, P. Sadayappan, Joel H. Saltz: An Integrated Approach for Processor Allocation and Scheduling of Mixed-Parallel Applications. ICPP 2006: 443-450 | |
| 143 | Sriram Krishnamoorthy, Ümit V. Çatalyürek, Jarek Nieplocha, P. Sadayappan: An approach to locality-conscious load balancing and transparent memory hierarchy management with a global-address-space parallel programming model. IPDPS 2006 | |
| 142 | Sriram Krishnamoorthy, Ümit V. Çatalyürek, Jarek Nieplocha, Atanas Rountev, P. Sadayappan: An extensible global address space framework with decoupled task and data abstractions. IPDPS 2006 | |
| 141 | A. Allam, J. Ramanujam, Gerald Baumgartner, P. Sadayappan: Memory minimization for tensor contractions using integer linear programming. IPDPS 2006 | |
| 140 | Uday Bondhugula, Ananth Devulapalli, Joseph Fernando, Pete Wyckoff, P. Sadayappan: Parallel FPGA-based all-pairs shortest-paths in a directed graph. IPDPS 2006 | |
| 139 | Albert Hartono, Qingda Lu, Xiaoyang Gao, Sriram Krishnamoorthy, Marcel Nooijen, Gerald Baumgartner, David E. Bernholdt, Venkatesh Choppella, Russell M. Pitzer, J. Ramanujam, Atanas Rountev, P. Sadayappan: Identifying Cost-Effective Common Subexpressions to Reduce Operation Count in Tensor Contraction Evaluations. International Conference on Computational Science (1) 2006: 267-275 | |
| 138 | Gaurav Khanna, Ümit V. Çatalyürek, Tahsin M. Kurç, P. Sadayappan, Joel H. Saltz: A Data Locality Aware Online Scheduling Approach for I/O-Intensive Jobs with File Sharing. JSSPP 2006: 141-160 | |
| 137 | Gerald Sabin, Matthew Lang, P. Sadayappan: Moldable Parallel Job Scheduling Using Job Efficiency: An Iterative Approach. JSSPP 2006: 94-114 | |
| 136 | Stephen Olivier, Jun Huan, Jinze Liu, Jan Prins, James Dinan, P. Sadayappan, Chau-Wen Tseng: UTS: An Unbalanced Tree Search Benchmark. LCPC 2006: 235-250 | |
| 135 | Qingda Lu, Sriram Krishnamoorthy, P. Sadayappan: Combining analytical and empirical approaches in tuning matrix transposition. PACT 2006: 233-242 | |
| 134 | Jarek Nieplocha, Bruce Palmer, Manojkumar Krishnan, P. Sadayappan: M12 - Overview of the global arrays parallel software development toolkit. SC 2006: 226 | |
| 133 | Sriram Krishnamoorthy, Ümit V. Çatalyürek, Jarek Nieplocha, Atanas Rountev, P. Sadayappan: Data management and query - Hypergraph partitioning for automatic memory hierarchy management. SC 2006: 98 | |
| 132 | Sandhya Krishnan, Sriram Krishnamoorthy, Gerald Baumgartner, Chi-Chung Lam, J. Ramanujam, P. Sadayappan, Venkatesh Choppella: Efficient synthesis of out-of-core algorithms using a nonlinear optimization solver. J. Parallel Distrib. Comput. 66(5): 659-673 (2006) | |
| 131 | Christian Engelmann, Stephen L. Scott, David E. Bernholdt, Narasimha Raju Gottumukkala, Chokchai Leangsuksun, Jyothish Varma, Chao Wang, Frank Mueller, Aniruddha G. Shet, P. Sadayappan: MOLAR: adaptive runtime support for high-end computing operating and runtime systems. Operating Systems Review 40(2): 63-72 (2006) | |
| 130 | Sriram Krishnamoorthy, Gerald Baumgartner, Chi-Chung Lam, Jarek Nieplocha, P. Sadayappan: Layout transformation support for the disk resident arrays framework. The Journal of Supercomputing 36(2): 153-170 (2006) | |
| 2005 | ||
| 129 | Gaurav Khanna, Nagavijayalakshmi Vydyanathan, Tahsin M. Kurç, Ümit V. Çatalyürek, Pete Wyckoff, Joel H. Saltz, P. Sadayappan: A hypergraph partitioning based approach for scheduling of tasks with batch-shared I/O. CCGRID 2005: 792-799 | |
| 128 | Sriram Krishnamoorthy, Jarek Nieplocha, P. Sadayappan: Data and Computation Abstractions for Dynamic and Irregular Computations. HiPC 2005: 258-269 | |
| 127 | Vipin Chaudhary, P. Sadayappan: Message from the Chairs. ICPP Workshops 2005: 282 | |
| 126 | Swarup Kumar Sahoo, Rajkiran Panuganti, Sriram Krishnamoorthy, P. Sadayappan: Cache Miss Characterization and Data Locality Optimization for Imperfectly Nested Loops on Shared Memory Multiprocessors. IPDPS 2005 | |
| 125 | Albert Hartono, Alexander Sibiryakov, Marcel Nooijen, Gerald Baumgartner, David E. Bernholdt, So Hirata, Chi-Chung Lam, Russell M. Pitzer, J. Ramanujam, P. Sadayappan: Automated Operation Minimization of Tensor Contraction Expressions in Electronic Structure Calculations. International Conference on Computational Science (1) 2005: 155-164 | |
| 124 | Gerald Sabin, P. Sadayappan: Unfairness Metrics for Space-Sharing Parallel Job Schedulers. JSSPP 2005: 238-256 | |
| 123 | Xiaoyang Gao, Sriram Krishnamoorthy, Swarup Kumar Sahoo, Chi-Chung Lam, Gerald Baumgartner, J. Ramanujam, P. Sadayappan: Efficient Search-Space Pruning for Integrated Fusion and Tiling Transformations. LCPC 2005: 215-229 | |
| 122 | Xiaoyang Gao, Swarup Kumar Sahoo, Chi-Chung Lam, J. Ramanujam, Qingda Lu, Gerald Baumgartner, P. Sadayappan: Performance modeling and optimization of parallel out-of-core tensor contractions. PPOPP 2005: 266-276 | |
| 121 | Swarup Kumar Sahoo, Sriram Krishnamoorthy, Rajkiran Panuganti, P. Sadayappan: Integrated Loop Optimizations for Data Locality Enhancement of Tensor Contraction Expressions. SC 2005: 13 | |
| 120 | Rajkumar Kettimuthu, Vijay Subramani, Srividya Srinivasan, Thiagaraja Gopalsamy, Dhabaleswar K. Panda, P. Sadayappan: Selective preemption strategies for parallel job scheduling. IJHPCN 3(2/3): 122-152 (2005) | |
| 2004 | ||
| 119 | Mohammad Islam, Pavan Balaji, P. Sadayappan, Dhabaleswar K. Panda: Towards provision of quality of service guarantees in job scheduling. CLUSTER 2004: 245-254 | |
| 118 | Gerald Sabin, V. Sahasrabudhe, P. Sadayappan: On fairness in distributed job scheduling across multiple sites. CLUSTER 2004: 35-44 | |
| 117 | Nagavijayalakshmi Vydyanathan, Gaurav Khanna, Tahsin M. Kurç, Ümit V. Çatalyürek, Pete Wyckoff, Joel H. Saltz, P. Sadayappan: Use of PVFS for Efficient Execution of Jobs with Pipeline-Shared I/O. GRID 2004: 235-242 | |
| 116 | Sriram Krishnamoorthy, Gerald Baumgartner, Chi-Chung Lam, Jarek Nieplocha, P. Sadayappan: Efficient Layout Transformation for Disk-Based Multidimensional Arrays. HiPC 2004: 386-398 | |
| 115 | Gerald Sabin, Garima Kochhar, P. Sadayappan: Job Fairness in Non-Preemptive Job Scheduling. ICPP 2004: 186-194 | |
| 114 | Vipin Chaudhary, P. Sadayappan: Message from the Chairs: International Workshop on Compile and Run Time Techniques for Parallel Computing. ICPP Workshops 2004: 497 | |
| 113 | Qingda Lu, Jiesheng Wu, Dhabaleswar K. Panda, P. Sadayappan: Applying MPI Derived Datatypes to the NAS Benchmarks: A Case Study. ICPP Workshops 2004: 538-545 | |
| 112 | Sandhya Krishnan, Sriram Krishnamoorthy, Gerald Baumgartner, Chi-Chung Lam, J. Ramanujam, P. Sadayappan, Venkatesh Choppella: Efficient Synthesis of Out-of-Core Algorithms Using a Nonlinear Optimization Solver. IPDPS 2004 | |
| 111 | Qingda Lu, Xiaoyang Gao, Sriram Krishnamoorthy, Gerald Baumgartner, J. Ramanujam, P. Sadayappan: Empirical Performance-Model Driven Data Layout Optimization. LCPC 2004: 72-86 | |
| 110 | Sriram Krishnamoorthy, Gerald Baumgartner, Daniel Cociorva, Chi-Chung Lam, P. Sadayappan: Efficient parallel out-of-core matrix transposition. IJHPCN 2(2/3/4): 110-119 (2004) | |
| 109 | Sudha Srinivasan, Savitha Krishnamoorthy, P. Sadayappan: Robust scheduling of moldable parallel jobs. IJHPCN 2(2/3/4): 120-132 (2004) | |
| 2003 | ||
| 108 | Sriram Krishnamoorthy, Gerald Baumgartner, Daniel Cociorva, Chi-Chung Lam, P. Sadayappan: Efficient Parallel Out-of-Core Matrix Transposition. CLUSTER 2003: 300-307 | |
| 107 | Sudha Srinivasan, Sriram Krishnamoorthy, P. Sadayappan: A Robust Scheduling Strategy for Moldable Scheduling of Parallel Jobs. CLUSTER 2003: 92-99 | |
| 106 | Sandhya Krishnan, Sriram Krishnamoorthy, Gerald Baumgartner, Daniel Cociorva, Chi-Chung Lam, P. Sadayappan, J. Ramanujam, David E. Bernholdt, Venkatesh Choppella: Data Locality Optimization for Synthesis of Efficient Out-of-Core Algorithms. HiPC 2003: 406-417 | |
| 105 | Daniel Cociorva, Xiaoyang Gao, Sandhya Krishnan, Gerald Baumgartner, Chi-Chung Lam, P. Sadayappan, J. Ramanujam: Global Communication Optimization for Tensor Contraction Expressions under Memory Constraints. IPDPS 2003: 37 | |
| 104 | Mohammad Islam, Pavan Balaji, P. Sadayappan, Dhabaleswar K. Panda: QoPS: A QoS Based Scheme for Parallel Job Scheduling. JSSPP 2003: 252-268 | |
| 103 | Gerald Sabin, Rajkumar Kettimuthu, Arun Rajan, P. Sadayappan: Scheduling of Parallel Jobs in a Heterogeneous Multi-site Environement. JSSPP 2003: 87-104 | |
| 102 | Konstantin Berlin, Jun Huan, Mary Jacob, Garima Kochhar, Jan Prins, William Pugh, P. Sadayappan, Jaime Spacco, Chau-Wen Tseng: Evaluating the Impact of Programming Language Features on the Performance of Parallel Applications on Cluster Architectures. LCPC 2003: 194-208 | |
| 101 | Alina Bibireata, Sandhya Krishnan, Gerald Baumgartner, Daniel Cociorva, Chi-Chung Lam, P. Sadayappan, J. Ramanujam, David E. Bernholdt, Venkatesh Choppella: Memory-Constrained Data Locality Optimization for Tensor Contractions. LCPC 2003: 93-108 | |
| 2002 | ||
| 100 | Vijay Subramani, Rajkumar Kettimuthu, Srividya Srinivasan, Jeanette Johnston, P. Sadayappan: Selective Buddy Allocation for Scheduling Parallel Jobs on Clusters. CLUSTER 2002: 107- | |
| 99 | Vijay Subramani, Rajkumar Kettimuthu, Srividya Srinivasan, P. Sadayappan: Distributed Job Scheduling on Computational Grids Using Multiple Simultaneous Requests. HPDC 2002: 359- | |
| 98 | Srividya Srinivasan, Vijay Subramani, Rajkumar Kettimuthu, Praveen Holenarsipur, P. Sadayappan: Effective Selection of Partition Sizes for Moldable Scheduling of Parallel Jobs. HiPC 2002: 174-183 | |
| 97 | Thiagaraja Gopalsamy, Mukesh Singhal, Dhabaleswar K. Panda, P. Sadayappan: A Reliable Multicast Algorithm for Mobile Ad Hoc Networks. ICDCS 2002: 563-570 | |
| 96 | P. Sadayappan: Message from the Chair. ICPP Workshops 2002: 495-498 | |
| 95 | Srividya Srinivasan, Rajkumar Kettimuthu, Vijay Subramani, P. Sadayappan: Characterization of Backfilling Strategies for Parallel Job Scheduling. ICPP Workshops 2002: 514-522 | |
| 94 | Vipin Chaudhary, P. Sadayappan: Message from the Co-Chairs. ICPP Workshops 2002: 547-550 | |
| 93 | Gerald Baumgartner, David E. Bernholdt, Daniel Cociorva, Chi-Chung Lam, J. Ramanujam, Robert J. Harrison, Marcel Nooijen, P. Sadayappan: A Performance Optimization Framework for Compilation of Tensor Contraction Expressions into Parallel Programs. IPDPS 2002 | |
| 92 | Srividya Srinivasan, Rajkumar Kettimuthu, Vijay Subramani, P. Sadayappan: Selective Reservation Strategies for Backfill Job Scheduling. JSSPP 2002: 55-71 | |
| 91 | Daniel Cociorva, Gerald Baumgartner, Chi-Chung Lam, P. Sadayappan, J. Ramanujam: Memory-Constrained Communication Minimization for a Class of Array Computations. LCPC 2002: 1-15 | |
| 90 | Daniel Cociorva, Gerald Baumgartner, Chi-Chung Lam, P. Sadayappan, J. Ramanujam, Marcel Nooijen, David E. Bernholdt, Robert J. Harrison: Space-Time Trade-Off Optimization for a Class of Electronic Structure Calculations. PLDI 2002: 177-186 | |
| 89 | Gerald Baumgartner, David E. Bernholdt, Daniel Cociorva, Robert J. Harrison, So Hirata, Chi-Chung Lam, Marcel Nooijen, Russell M. Pitzer, J. Ramanujam, P. Sadayappan: A high-level approach to synthesis of high-performance codes for quantum chemistry. SC 2002: 1-10 | |
| 2001 | ||
| 88 | Daniel Cociorva, J. W. Wilkins, Gerald Baumgartner, P. Sadayappan, J. Ramanujam, Marcel Nooijen, David E. Bernholdt, Robert J. Harrison: Towards Automatic Synthesis of High-Performance Codes for Electronic Structure Calculations: Data Locality Optimization. HiPC 2001: 237-248 | |
| 87 | Mohammad Banikazemi, Jiuxing Liu, Dhabaleswar K. Panda, P. Sadayappan: Implementing TreadMarksover VIA on Myrinet and Gigabit Ethernet: Challenges, Design Experience, and Performance Evaluation. ICPP 2001: 167-174 | |
| 86 | Abhishek Gulati, Dhabaleswar K. Panda, P. Sadayappan, Pete Wyckoff: NIC-Based Rate Control for Proportional Bandwidth Allocation in Myrinet Clusters. ICPP 2001: 305-312 | |
| 85 | Daniel Cociorva, J. W. Wilkins, Chi-Chung Lam, Gerald Baumgartner, J. Ramanujam, P. Sadayappan: Loop optimization for a class of memory-constrained computations. ICS 2001: 103-113 | |
| 84 | Darius Buntinas, Dhabaleswar K. Panda, P. Sadayappan: Performance Benefits of NIC-Based Barrier on Myrinet/GM. IPDPS 2001: 166 | |
| 83 | Mohammad Banikazemi, Jiuxing Liu, S. Kutlug, P. Sadayappan, H. Shah, Dhabaleswar K. Panda: VIBe: A Micro-benchmark Suite for Evaluating Virtual Interface Architecture (VIA) Implementations. IPDPS 2001: 24 | |
| 82 | Darius Buntinas, Dhabaleswar K. Panda, P. Sadayappan: Fast NIC-Based Barrier over Myrinet/GM. IPDPS 2001: 52 | |
| 81 | Amit Singhal, Mohammad Banikazemi, P. Sadayappan, Dhabaleswar K. Panda: Efficient Multicast Algorithms for Heterogeneous Switch-based Irregular Networks of Workstations. IPDPS 2001: 71 | |
| 2000 | ||
| 80 | Vijay Moorthy, Dhabaleswar K. Panda, P. Sadayappan: Fast Collective Communication Algorithms for Reflective Memory Network Clusters. CANPC 2000: 100-114 | |
| 79 | Darius Buntinas, Dhabaleswar K. Panda, José Duato, P. Sadayappan: Broadcast/Multicast over Myrinet Using NIC-Assisted Multidestination Messages. CANPC 2000: 115-129 | |
| 78 | Praveen Holenarsipur, Vladimir Yarmolenko, José Duato, Dhabaleswar K. Panda, P. Sadayappan: Characterization and enhancement of Static Mapping Heuristics for Heterogeneous Systems. HiPC 2000: 37-48 | |
| 77 | P. Sadayappan: Message from the Chair. ICPP Workshops 2000: 391- | |
| 76 | Vladimir Yarmolenko, José Duato, Dhabaleswar K. Panda, P. Sadayappan: Characterization and Enhancement of Dynamic Mapping Heuristics for Heterogeneous Systems. ICPP Workshops 2000: 437- | |
| 75 | Arindam Paul, Wu-chi Feng, Dhabaleswar K. Panda, P. Sadayappan: Balancing Web Server Load for Adaptable Video Distribution. ICPP Workshops 2000: 469- | |
| 1999 | ||
| 74 | Matthew G. Jacunski, Vijay Moorthy, Peter P. Ware, Manoj Pillai, Dhabaleswar K. Panda, P. Sadayappan: Low Latency Message-Passing for Reflective Memory Networks. CANPC 1999: 211-224 | |
| 73 | Mohammad Banikazemi, Jayanthi Sampathkumar, Sandeep Prabhu, Dhabaleswar K. Panda, P. Sadayappan: Communication Modeling of Heterogeneous Networks of Workstations for Performance Characterization of Collective Operations. Heterogeneous Computing Workshop 1999: 125- | |
| 72 | Chi-Chung Lam, Daniel Cociorva, Gerald Baumgartner, P. Sadayappan: Memory-Optimal Evaluation of Expression Trees Involving Large Objects. HiPC 1999: 103-110 | |
| 71 | N. S. Sundar, S. Jayanthi, P. Sadayappan, Miguel Visbal: An Incremental Methodology for Parallelizing Legacy Stencil Codes on Message-Passing Computers. ICPP 1999: 302-310 | |
| 70 | Vijay Moorthy, Matthew G. Jacunski, Manoj Pillai, Peter P. Ware, Dhabaleswar K. Panda, Thomas W. Page Jr., P. Sadayappan, V. Nagarajan, Johns Daniel: Low-Latency Message Passing on Workstation Clusters using SCRAMNet. IPPS/SPDP 1999: 148-152 | |
| 69 | Matthew G. Jacunski, P. Sadayappan, Dhabaleswar K. Panda: All-to-All Broadcast on Switch-Based Clusters of Workstations. IPPS/SPDP 1999: 325-329 | |
| 68 | Chi-Chung Lam, Daniel Cociorva, Gerald Baumgartner, P. Sadayappan: Optimization of Memory Usage Requirement for a Class of Loops Implementing Multi-dimensional Integrals. LCPC 1999: 350-364 | |
| 67 | Chi-Chung Lam, P. Sadayappan, Daniel Cociorva, Mebarek Alouani, John Wilkins: Performance Optimization of a Class of Loops Involving Sums of Products of Sparse Arrays. PPSC 1999 | |
| 1998 | ||
| 66 | Zhiyuan Li, Pen-Chung Yew, Siddhartha Chatterjee, Chua-Huang Huang, P. Sadayappan, David C. Sehr: Languages and Compilers for Parallel Computing, 10th International Workshop, LCPC'97, Minneapolis, Minnesota, USA, August 7-9, 1997, Proceedings Springer 1998 | |
| 65 | Sandeep K. S. Gupta, Chua-Huang Huang, P. Sadayappan, Rodney W. Johnson: A technique for overlapping computation and communication for block recursive algorithms. Concurrency - Practice and Experience 10(2): 73-90 (1998) | |
| 64 | P. Sadayappan, Fikret Erçal, J. Ramanujam: Partitioning Graphs on Message-Passing Machines by Pairwise Mincut. Inf. Sci. 111(1-4): 223-237 (1998) | |
| 1997 | ||
| 63 | Chi-Chung Lam, P. Sadayappan, Rephael Wenger: Optimization of a Class of Multi-Dimensional Integrals on Parallel Machines. PPSC 1997 | |
| 62 | Chi-Chung Lam, Chua-Huang Huang, P. Sadayappan: Optimal Algorithms for All-to-All Personalized Communication on Rings and Two Dimensional Tori. J. Parallel Distrib. Comput. 43(1): 3-13 (1997) | |
| 61 | Chi-Chung Lam, P. Sadayappan, Rephael Wenger: On Optimizing a Class of Multi-Dimensional Loops with Reductions for Parallel Execution. Parallel Processing Letters 7(2): 157-168 (1997) | |
| 1996 | ||
| 60 | Chua-Huang Huang, P. Sadayappan, Utpal Banerjee, David Gelernter, Alexandru Nicolau, David A. Padua: Languages and Compilers for Parallel Computing, 8th International Workshop, LCPC'95, Columbus, Ohio, USA, August 10-12, 1995, Proceedings Springer 1996 | |
| 59 | N. S. Sundar, D. N. Jayasimha, Dhabaleswar K. Panda, P. Sadayappan: Hybrid Algorithms for Complete Exchange in 2D Meshes. International Conference on Supercomputing 1996: 181-188 | |
| 58 | Chi-Chung Lam, P. Sadayappan, Rephael Wenger: Optimal Reordering and Mapping of a Class of Nested-Loops for Parallel Execution. LCPC 1996: 315-329 | |
| 57 | S. D. Kaushik, Sanjay Sharma, Chua-Huang Huang, John R. Johnson, Rodney W. Johnson, P. Sadayappan: An Algebraic Theory for Modeling Directt Interconnection Networks. J. Inf. Sci. Eng. 12(1): 25-49 (1996) | |
| 56 | Sandeep K. S. Gupta, S. D. Kaushik, Chua-Huang Huang, P. Sadayappan: Compiling Array Expressions for Efficient Execution on Distributed-Memory Machines. J. Parallel Distrib. Comput. 32(2): 155-172 (1996) | |
| 55 | Sandeep K. S. Gupta, Chua-Huang Huang, P. Sadayappan, Rodney W. Johnson: A Framework for Generating Distributed-Memory Parallel Programs for Block Recursive Algorithms. J. Parallel Distrib. Comput. 34(2): 137-153 (1996) | |
| 54 | S. D. Kaushik, Chua-Huang Huang, P. Sadayappan: Efficient Index Set Generation for Compiling HPF Array Statements on Distributed-Memory Machines. J. Parallel Distrib. Comput. 38(2): 237-247 (1996) | |
| 53 | Himanshu Gupta, P. Sadayappan: Communication-Efficient Matrix Multiplication on Hypercubes. Parallel Computing 22(1): 75-99 (1996) | |
| 1995 | ||
| 52 | S. D. Kaushik, Chua-Huang Huang, J. Ramanujam, P. Sadayappan: Multi-phase array redistribution: modeling and evaluation. IPPS 1995: 441-445 | |
| 51 | S. D. Kaushik, Chua-Huang Huang, P. Sadayappan: Compiling Array Statements for Efficient Execution on Distributed-Memory Machines: Two-Level Mappings. LCPC 1995: 209-223 | |
| 50 | Bharat Kumar, Kalluri Eswar, P. Sadayappan, Chua-Huang Huang: A Clustering Algorithm for Parallel Sparse Cholesky Factorization. Parallel Processing Letters 5: 685-696 (1995) | |
| 49 | Bharat Kumar, Chua-Huang Huang, P. Sadayappan, Rodney W. Johnson: A Tensor Product Formulation of Strassen's Matrix Multiplication Algorithm with Memory Reduction. Scientific Programming 4(4): 275-289 (1995) | |
| 1994 | ||
| 48 | Sandeep K. S. Gupta, Chua-Huang Huang, Rodney W. Johnson, P. Sadayappan: Communication-Efficient Implementation of Block Recursive Algorithms on Distributed-Memory Machines. ICPADS 1994: 113-119 | |
| 47 | Alpesh Amin, P. Sadayappan, Murali Gudavalli: A Clustered Reduced Communication Element by Element Preconditioned Conjugate Gradient Algorithm for Finite Element Computations. IPPS 1994: 509-516 | |
| 46 | S. D. Kaushik, Chua-Huang Huang, Rodney W. Johnson, P. Sadayappan: An approach to communication-efficient data redistribution. International Conference on Supercomputing 1994: 364-373 | |
| 45 | Bharat Kumar, P. Sadayappan, Chua-Huang Huang: On sparse matrix reordering for parallel factorization. International Conference on Supercomputing 1994: 431-438 | |
| 44 | S. D. Kaushik, Chua-Huang Huang, P. Sadayappan: Incremental Generation of Index Sets for Array Statement Execution on Distributed-Memory Machines. LCPC 1994: 251-265 | |
| 43 | D. L. Dai, Sandeep K. S. Gupta, S. D. Kaushik, J. H. Lu, R. V. Singh, Chua-Huang Huang, P. Sadayappan, Rodney W. Johnson: EXTENT: a portable programming environment for designing and implementing high-performance block recursive algorithms. SC 1994: 49-58 | |
| 42 | Himanshu Gupta, P. Sadayappan: Communication Efficient Matrix Multiplication on Hypercubes. SPAA 1994: 320-329 | |
| 41 | Sandeep K. S. Gupta, Chua-Huang Huang, P. Sadayappan, Rodney W. Johnson: Implementing Fast Fourier Transforms on Distributed-Memory Multiprocessors Using Data Redistributions. Parallel Processing Letters 4: 477-488 (1994) | |
| 1993 | ||
| 40 | Debabrata Ghosh, S. K. Nandy, P. Sadayappan, K. Parthasarathy: Architectural Synthesis of Performance-Driven Multipliers with Accumulator Interleaving. DAC 1993: 303-307 | |
| 39 | Kalluri Eswar, P. Sadayappan, Chua-Huang Huang: Compile-Time Characterization of Recurrent Patterns in Irregular Computations. ICPP 1993: 148-155 | |
| 38 | Kalluri Eswar, P. Sadayappan, Chua-Huang Huang, V. Visvanathan: Supernodal Sparse Cholesky Facotrization on Distributed-Memory Multiprocessors. ICPP 1993: 18-22 | |
| 37 | Sandeep K. S. Gupta, S. D. Kaushik, S. Mufti, Sanjay Sharma, Chua-Huang Huang, P. Sadayappan: On Compiling Array Expressions for Efficient Execution on Distributed-Memory Machines. ICPP 1993: 301-305 | |
| 36 | S. K. Nandy, Ranjani Narayan, V. Visvanathan, P. Sadayappan, Prashant S. Chauhan: A Parallel Progressive Refinement Image Rendering Algorithm on a Scalable Multithreaded VLSI Processor Array. ICPP 1993: 94-97 | |
| 35 | Bharat Kumar, Chua-Huang Huang, Rodney W. Johnson, P. Sadayappan: A Tensor Product Formulation of Strassen's Matrix Multiplication Algorithm with Memory Reduction. IPPS 1993: 582-588 | |
| 34 | S. D. Kaushik, Chua-Huang Huang, Rodney W. Johnson, P. Sadayappan: A Methodology for Generating Efficient Disk-Based Algorithms from Tensor Product Formulas. LCPC 1993: 358-373 | |
| 33 | S. D. Kaushik, Chua-Huang Huang, John R. Johnson, Rodney W. Johnson, P. Sadayappan: Efficient transposition algorithms for large matrices. SC 1993: 656-665 | |
| 32 | Chua-Huang Huang, P. Sadayappan: Communication-Free Hyperplane Partitioning of Nested Loops. J. Parallel Distrib. Comput. 19(2): 90-102 (1993) | |
| 1992 | ||
| 31 | Sandeep K. S. Gupta, Chua-Huang Huang, P. Sadayappan, Rodney W. Johnson: On the Synthesis of Parallel Programs from Tensor Product Formulas for Block Recursive Algorithms. LCPC 1992: 264-280 | |
| 30 | S. D. Kaushik, Sanjay Sharma, Chua-Huang Huang, Jeremy R. Johnson, Rodney W. Johnson, P. Sadayappan: An Algebraic Theory for Modeling Direct Interconnection Networks. SC 1992: 488-497 | |
| 29 | Sanjay Sharma, Chua-Huang Huang, P. Sadayappan: On Data Dependence Analysis for Compiling Programs on Distributed-Memory Machines (Extended Abstract). SIGPLAN Workshop 1992: 13-16 | |
| 28 | Sandeep K. S. Gupta, S. D. Kaushik, Chua-Huang Huang, John R. Johnson, Rodney W. Johnson, P. Sadayappan: On the Automatic Generation of Data Distributions. SIGPLAN Workshop 1992: 82 | |
| 27 | Sandeep K. S. Gupta, S. D. Kaushik, Chua-Huang Huang, John R. Johnson, Rodney W. Johnson, P. Sadayappan: A Methodology for Generating Data Distributions to Optimize Communication. SPDP 1992: 436-441 | |
| 26 | Sailesh K. Rao, P. Sadayappan, Frank K. Hwang, Peter W. Shor: The Rectilinear Steiner Arborescence Problem. Algorithmica 7(2&3): 277-288 (1992) | |
| 25 | J. Ramanujam, P. Sadayappan: Tiling Multidimensional Itertion Spaces for Multicomputers. J. Parallel Distrib. Comput. 16(2): 108-120 (1992) | |
| 1991 | ||
| 24 | Kalluri Eswar, P. Sadayappan, V. Visvanathan: Multifrontal Factorization of Sparse Matrices on Shared-Memory Multiprocessors. ICPP (3) 1991: 159-166 | |
| 23 | Scott Whitman, P. Sadayappan: Computer Graphics Rendering on a Shared Memory Multiprocessor. ICPP (3) 1991: 197-200 | |
| 22 | Chua-Huang Huang, P. Sadayappan: Communication-Free Hyperplane Partitioning of Nested Loops. LCPC 1991: 186-200 | |
| 21 | V. Prasad Krothapalli, P. Sadayappan: Removal of Redundant Dependences in DOACROSS Lops with Constant Dependences. PPOPP 1991: 51-60 | |
| 20 | J. Ramanujam, P. Sadayappan: Tiling multidimensional iteration spaces for nonshared memory machines. SC 1991: 111-120 | |
| 19 | V. Prasad Krothapalli, P. Sadayappan: Removal of Redundant Dependences in DOACROSS Loops with Constant Dependences. IEEE Trans. Parallel Distrib. Syst. 2(3): 281-289 (1991) | |
| 18 | J. Ramanujam, P. Sadayappan: Compile-Time Techniques for Data Distribution in Distributed Memory Machines. IEEE Trans. Parallel Distrib. Syst. 2(4): 472-482 (1991) | |
| 1990 | ||
| 17 | V. Prasad Krothapalli, P. Sadayappan: Exploiting Parallelism Through Run-Time Analysis on a Vector Processor (Abstract). ACM Conference on Computer Science 1990: 434 | |
| 16 | J. Ramanujam, P. Sadayappan: Tiling of Iteration Spaces for Multicomputers. ICPP (2) 1990: 179-186 | |
| 15 | V. Prasad Krothapalli, P. Sadayappan: Dynamic Scheduling of DOACROSS Loops for Multiprocessors. PARBASE / Architectures 1990: 141-160 | |
| 14 | Fikret Erçal, J. Ramanujam, P. Sadayappan: Task Allocation onto a Hypercube by Recursive Mincut Bipartitioning. J. Parallel Distrib. Comput. 10(1): 35-44 (1990) | |
| 13 | P. Sadayappan, Fikret Erçal, J. Ramanujam: Cluster partitioning approaches to mapping parallel programs onto a hypercube. Parallel Computing 13(1): 1-16 (1990) | |
| 1989 | ||
| 12 | P. Sadayappan, V. Visvanathan: Efficient Sparse Matrix Factorization for Circuit Simulation on Vector Supercomputers. DAC 1989: 13-18 | |
| 11 | Amr Zaky, P. Sadayappan: Optimal Static Scheduling of Sequential Loops on Multiprocessors. ICPP (3) 1989: 130-137 | |
| 10 | Fikret Erçal, P. Sadayappan: One-to-one mapping of process graphs onto a hypercube. ICS 1989: 91-98 | |
| 9 | P. Sadayappan, V. Visvanathan: Efficient sparse matrix factorization for circuit simulation on vector supercomputers. IEEE Trans. on CAD of Integrated Circuits and Systems 8(12): 1276-1285 (1989) | |
| 1988 | ||
| 8 | P. Sadayappan, V. Visvanathan: Parallelization and performance evaluation of circuit simulation on a shared-memory multiprocessor. ICS 1988: 254-265 | |
| 7 | V. Prasad Krothapalli, P. Sadayappan: An approach to synchronization for parallel computing. ICS 1988: 573-581 | |
| 6 | Cevdet Aykanat, Füsun Özgüner, Fikret Erçal, P. Sadayappan: Iterative Algorithms for Solution of Large Sparse Systems of Linear Equations on Hypercubes. IEEE Trans. Computers 37(12): 1554-1568 (1988) | |
| 5 | P. Sadayappan, V. Visvanathan: Circuit Simulation on Shared-Memory Multiprocessors. IEEE Trans. Computers 37(12): 1634-1642 (1988) | |
| 1987 | ||
| 4 | P. Sadayappan, Fikret Erçal, Steven Martin: Mapping Finite Element Graphs onto Processor Meshes. ICPP 1987: 192-195 | |
| 3 | P. Sadayappan, Fikret Erçal: Cluster-Partitioning Approaches to Mapping Parallel Programs onto a Hypercube. ICS 1987: 475-497 | |
| 2 | P. Sadayappan, Fikret Erçal: Nearest-Neighbor Mapping of Finite Element Graphs onto Processor Meshes. IEEE Trans. Computers 36(12): 1408-1424 (1987) | |
| 1985 | ||
| 1 | V. Ashok, Roger L. Costello, P. Sadayappan: Modeling switch-level simulation using data flow. DAC 1985: 637-644 | |