| 2009 | ||
|---|---|---|
| 116 | Hassan Salamy, J. Ramanujam: A Framework for Task Scheduling and Memory Partitioning for Multi-Processor System-on-Chip. HiPEAC 2009: 263-277 | |
| 115 | Albert Hartono, Muthu Manikandan Baskaran, Cédric Bastoul, Albert Cohen, Sriram Krishnamoorthy, Boyana Norris, J. Ramanujam, P. Sadayappan: Parametric multi-level tiling of imperfectly nested loops. ICS 2009: 147-157 | |
| 114 | Qingda Lu, Christophe Alias, Uday Bondhugula, Thomas Henretty, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan, Yongjian Chen, Haibo Lin, Tin-fook Ngai: Data Layout Transformation for Enhancing Data Locality on NUCA Chip Multiprocessors. PACT 2009: 348-357 | |
| 113 | Muthu Manikandan Baskaran, Nagavijayalakshmi Vydyanathan, Uday Bondhugula, J. Ramanujam, Atanas Rountev, P. Sadayappan: Compiler-assisted dynamic scheduling for effective parallelization of loop nests on multicore processors. PPOPP 2009: 219-228 | |
| 2008 | ||
| 112 | Uday Bondhugula, Muthu Manikandan Baskaran, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan: Automatic Transformations for Communication-Minimized Parallelization and Locality Optimization in the Polyhedral Model. CC 2008: 132-146 | |
| 111 | Hassan Salamy, J. Ramanujam: Storage optimization through code size reduction for digital signal processors. ESTImedia 2008: 107-112 | |
| 110 | Hassan Salamy, J. Ramanujam: Optimal address register allocation for arrays in DSP applications. ESTImedia 2008: 67-72 | |
| 109 | Muthu Manikandan Baskaran, Uday Bondhugula, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan: A compiler framework for optimization of affine loop nests for gpgpus. ICS 2008: 225-234 | |
| 108 | Uday Bondhugula, Muthu Manikandan Baskaran, Albert Hartono, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan: Towards effective automatic parallelization for multicore systems. IPDPS 2008: 1-5 | |
| 107 | Uday Bondhugula, Albert Hartono, J. Ramanujam, P. Sadayappan: A practical automatic polyhedral parallelizer and locality optimizer. PLDI 2008: 101-113 | |
| 106 | Muthu Manikandan Baskaran, Uday Bondhugula, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan: Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories. PPOPP 2008: 1-10 | |
| 2007 | ||
| 105 | Jinpyo Hong, J. Ramanujam: Memory Offset Assignment for DSPs. ICESS 2007: 80-87 | |
| 104 | Sriram Krishnamoorthy, Muthu Manikandan Baskaran, Uday Bondhugula, J. Ramanujam, Atanas Rountev, P. Sadayappan: Effective automatic parallelization of stencil computations. PLDI 2007: 235-244 | |
| 103 | Uday Bondhugula, J. Ramanujam, P. Sadayappan: Automatic mapping of nested loops to FPGAS. PPOPP 2007: 101-111 | |
| 102 | Sai Pinnepalli, Jinpyo Hong, J. Ramanujam, Doris L. Carver: Code Size Optimization for Embedded Processors using Commutative Transformations. RTCSA 2007: 409-416 | |
| 101 | Xiaoyang Gao, Sriram Krishnamoorthy, Swarup Kumar Sahoo, Chi-Chung Lam, Gerald Baumgartner, J. Ramanujam, P. Sadayappan: Efficient search-space pruning for integrated fusion and tiling transformations. Concurrency and Computation: Practice and Experience 19(18): 2425-2443 (2007) | |
| 2006 | ||
| 100 | Eduard Ayguadé, Gerald Baumgartner, J. Ramanujam, P. Sadayappan: Languages and Compilers for Parallel Computing, 18th International Workshop, LCPC 2005, Hawthorne, NY, USA, October 20-22, 2005, Revised Selected Papers Springer 2006 | |
| 99 | A. Allam, J. Ramanujam, Gerald Baumgartner, P. Sadayappan: Memory minimization for tensor contractions using integer linear programming. IPDPS 2006 | |
| 98 | Albert Hartono, Qingda Lu, Xiaoyang Gao, Sriram Krishnamoorthy, Marcel Nooijen, Gerald Baumgartner, David E. Bernholdt, Venkatesh Choppella, Russell M. Pitzer, J. Ramanujam, Atanas Rountev, P. Sadayappan: Identifying Cost-Effective Common Subexpressions to Reduce Operation Count in Tensor Contraction Evaluations. International Conference on Computational Science (1) 2006: 267-275 | |
| 97 | Hassan Salamy, J. Ramanujam: An Effective Heuristic for Simple Offset Assignment with Variable Coalescing. LCPC 2006: 158-172 | |
| 96 | Mahmut T. Kandemir, J. Ramanujam, Ugur Sezer: Improving the energy behavior of block buffering using compiler optimizations. ACM Trans. Design Autom. Electr. Syst. 11(1): 228-250 (2006) | |
| 95 | Guilin Chen, Mahmut T. Kandemir, Mary Jane Irwin, J. Ramanujam: Reducing code size through address register assignment. ACM Trans. Embedded Comput. Syst. 5(1): 225-258 (2006) | |
| 94 | J. Ramanujam, Jinpyo Hong, Mahmut T. Kandemir, Amit Narayan, A. Agarwal: Estimating and reducing the memory requirements of signal processing codes for embedded systems. IEEE Transactions on Signal Processing 54(1): 286-294 (2006) | |
| 93 | Sandhya Krishnan, Sriram Krishnamoorthy, Gerald Baumgartner, Chi-Chung Lam, J. Ramanujam, P. Sadayappan, Venkatesh Choppella: Efficient synthesis of out-of-core algorithms using a nonlinear optimization solver. J. Parallel Distrib. Comput. 66(5): 659-673 (2006) | |
| 2005 | ||
| 92 | Albert Hartono, Alexander Sibiryakov, Marcel Nooijen, Gerald Baumgartner, David E. Bernholdt, So Hirata, Chi-Chung Lam, Russell M. Pitzer, J. Ramanujam, P. Sadayappan: Automated Operation Minimization of Tensor Contraction Expressions in Electronic Structure Calculations. International Conference on Computational Science (1) 2005: 155-164 | |
| 91 | Xiaoyang Gao, Sriram Krishnamoorthy, Swarup Kumar Sahoo, Chi-Chung Lam, Gerald Baumgartner, J. Ramanujam, P. Sadayappan: Efficient Search-Space Pruning for Integrated Fusion and Tiling Transformations. LCPC 2005: 215-229 | |
| 90 | Xiaoyang Gao, Swarup Kumar Sahoo, Chi-Chung Lam, J. Ramanujam, Qingda Lu, Gerald Baumgartner, P. Sadayappan: Performance modeling and optimization of parallel out-of-core tensor contractions. PPOPP 2005: 266-276 | |
| 2004 | ||
| 89 | Sandhya Krishnan, Sriram Krishnamoorthy, Gerald Baumgartner, Chi-Chung Lam, J. Ramanujam, P. Sadayappan, Venkatesh Choppella: Efficient Synthesis of Out-of-Core Algorithms Using a Nonlinear Optimization Solver. IPDPS 2004 | |
| 88 | Qingda Lu, Xiaoyang Gao, Sriram Krishnamoorthy, Gerald Baumgartner, J. Ramanujam, P. Sadayappan: Empirical Performance-Model Driven Data Layout Optimization. LCPC 2004: 72-86 | |
| 87 | Mahmut T. Kandemir, J. Ramanujam, Mary Jane Irwin, Narayanan Vijaykrishnan, Ismail Kadayif, Amisha Parikh: A compiler-based approach for dynamically managing scratch-pad memories in embedded systems. IEEE Trans. on CAD of Integrated Circuits and Systems 23(2): 243-260 (2004) | |
| 2003 | ||
| 86 | Mahmut T. Kandemir, Mary Jane Irwin, Guilin Chen, J. Ramanujam: Address Register Assignment for Reducing Code Size. CC 2003: 273-289 | |
| 85 | Sandhya Krishnan, Sriram Krishnamoorthy, Gerald Baumgartner, Daniel Cociorva, Chi-Chung Lam, P. Sadayappan, J. Ramanujam, David E. Bernholdt, Venkatesh Choppella: Data Locality Optimization for Synthesis of Efficient Out-of-Core Algorithms. HiPC 2003: 406-417 | |
| 84 | Daniel Cociorva, Xiaoyang Gao, Sandhya Krishnan, Gerald Baumgartner, Chi-Chung Lam, P. Sadayappan, J. Ramanujam: Global Communication Optimization for Tensor Contraction Expressions under Memory Constraints. IPDPS 2003: 37 | |
| 83 | Alina Bibireata, Sandhya Krishnan, Gerald Baumgartner, Daniel Cociorva, Chi-Chung Lam, P. Sadayappan, J. Ramanujam, David E. Bernholdt, Venkatesh Choppella: Memory-Constrained Data Locality Optimization for Tensor Contractions. LCPC 2003: 93-108 | |
| 82 | Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam, Prithviraj Banerjee: Reducing False Sharing and Improving Spatial Locality in a Unified Compilation Framework. IEEE Trans. Parallel Distrib. Syst. 14(4): 337-354 (2003) | |
| 2002 | ||
| 81 | Mahmut T. Kandemir, J. Ramanujam, Alok N. Choudhary: Exploiting shared scratch pad memory space in embedded multiprocessor systems. DAC 2002: 219-224 | |
| 80 | Gerald Baumgartner, David E. Bernholdt, Daniel Cociorva, Chi-Chung Lam, J. Ramanujam, Robert J. Harrison, Marcel Nooijen, P. Sadayappan: A Performance Optimization Framework for Compilation of Tensor Contraction Expressions into Parallel Programs. IPDPS 2002 | |
| 79 | Daniel Cociorva, Gerald Baumgartner, Chi-Chung Lam, P. Sadayappan, J. Ramanujam: Memory-Constrained Communication Minimization for a Class of Array Computations. LCPC 2002: 1-15 | |
| 78 | Daniel Cociorva, Gerald Baumgartner, Chi-Chung Lam, P. Sadayappan, J. Ramanujam, Marcel Nooijen, David E. Bernholdt, Robert J. Harrison: Space-Time Trade-Off Optimization for a Class of Electronic Structure Calculations. PLDI 2002: 177-186 | |
| 77 | Gerald Baumgartner, David E. Bernholdt, Daniel Cociorva, Robert J. Harrison, So Hirata, Chi-Chung Lam, Marcel Nooijen, Russell M. Pitzer, J. Ramanujam, P. Sadayappan: A high-level approach to synthesis of high-performance codes for quantum chemistry. SC 2002: 1-10 | |
| 76 | J. Ramanujam, Sandeep Deshpande, Jinpyo Hong, Mahmut T. Kandemir: A Heuristic for Clock Selection in High-Level Synthesis. VLSI Design 2002: 414-419 | |
| 75 | J. Ramanujam, Satish Krishnamurthy, Jinpyo Hong, Mahmut T. Kandemir: Address Code and Arithmetic Optimizations for Embedded Systems. VLSI Design 2002: 619-624 | |
| 74 | N. E. Crosbie, Mahmut T. Kandemir, Ibrahim Kolcu, J. Ramanujam, Alok N. Choudhary: Strategies for Improving Data Locality in Embedded Applications. VLSI Design 2002: 631- | |
| 73 | J. Ramanujam: Automatic Data Distribution. The Compiler Design Handbook 2002: 409-460 | |
| 72 | Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam: An I/O-Conscious Tiling Strategy for Disk-Resident Data Sets. The Journal of Supercomputing 21(3): 257-284 (2002) | |
| 2001 | ||
| 71 | J. Ramanujam: Integer Lattice Based Methods for Local Address Generation for Block-Cyclic Distributions. Compiler Optimizations for Scalable Parallel Systems Languages 2001: 597-648 | |
| 70 | J. Ramanujam, Jinpyo Hong, Mahmut T. Kandemir, Amit Narayan: Reducing Memory Requirements of Nested Loops for Embedded Systems. DAC 2001: 359-364 | |
| 69 | Mahmut T. Kandemir, J. Ramanujam, Mary Jane Irwin, Narayanan Vijaykrishnan, Ismail Kadayif, Amisha Parikh: Dynamic Management of Scratch-Pad Memory Space. DAC 2001: 690-695 | |
| 68 | Daniel Cociorva, J. W. Wilkins, Gerald Baumgartner, P. Sadayappan, J. Ramanujam, Marcel Nooijen, David E. Bernholdt, Robert J. Harrison: Towards Automatic Synthesis of High-Performance Codes for Electronic Structure Calculations: Data Locality Optimization. HiPC 2001: 237-248 | |
| 67 | Daniel Cociorva, J. W. Wilkins, Chi-Chung Lam, Gerald Baumgartner, J. Ramanujam, P. Sadayappan: Loop optimization for a class of memory-constrained computations. ICS 2001: 103-113 | |
| 66 | Mahmut T. Kandemir, J. Ramanujam, Ugur Sezer: Compiler support for block buffering. ISLPED 2001: 76-79 | |
| 65 | Ismail Kadayif, Mahmut T. Kandemir, Narayanan Vijaykrishnan, Mary Jane Irwin, J. Ramanujam: Morphable Cache Architectures: Potential Benefits. LCTES/OM 2001: 128-137 | |
| 64 | M. Narasimhan, J. Ramanujam: A fast approach to computing exact solutions to the resource-constrained scheduling problem. ACM Trans. Design Autom. Electr. Syst. 6(4): 490-500 (2001) | |
| 63 | Mahmut T. Kandemir, J. Ramanujam, Alok N. Choudhary, Prithviraj Banerjee: A Layout-Conscious Iteration Space Transformation Technique. IEEE Trans. Computers 50(12): 1321-1336 (2001) | |
| 62 | Mahmut T. Kandemir, J. Ramanujam: Data Relation Vectors: A New Abstraction for Data Optimizations. IEEE Trans. Computers 50(8): 798-810 (2001) | |
| 61 | Mahmut T. Kandemir, Prithviraj Banerjee, Alok N. Choudhary, J. Ramanujam, Eduard Ayguadé: Static and Dynamic Locality Optimizations Using Integer Linear Programming. IEEE Trans. Parallel Distrib. Syst. 12(9): 922-941 (2001) | |
| 60 | Siddharth Rele, Vipin Jain, Santosh Pande, J. Ramanujam: Compact and efficient code generation through program restructuringon limited memory embedded DSPs. IEEE Trans. on CAD of Integrated Circuits and Systems 20(4): 477-494 (2001) | |
| 2000 | ||
| 59 | M. Narasimhan, J. Ramanujam: On lower bounds for scheduling problems in high-level synthesis. DAC 2000: 546-551 | |
| 58 | Sunil Atri, J. Ramanujam, Mahmut T. Kandemir: Improving Offset Assignment on Embedded Processors Using Transformations. HiPC 2000: 367-374 | |
| 57 | Mahmut T. Kandemir, J. Ramanujam: Data Relation Vectors: A New Abstraction for Data Optimizations. IEEE PACT 2000: 227-236 | |
| 56 | Sunil Atri, J. Ramanujam, Mahmut T. Kandemir: Improving Offset Assignment for Embedded Processors. LCPC 2000: 158-172 | |
| 55 | Mahmut T. Kandemir, Alok N. Choudhary, Prithviraj Banerjee, J. Ramanujam, U. Nagaraj Shenoy: Minimizing Data and Synchronization Costs in One-Way Communication. IEEE Trans. Parallel Distrib. Syst. 11(12): 1232-1251 (2000) | |
| 54 | Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam, Meenakshi A. Kandaswamy: A Unified Framework for Optimizing Locality, Parallelism, and Communication in Out-of-Core Computations. IEEE Trans. Parallel Distrib. Syst. 11(7): 648-668 (2000) | |
| 53 | Mahmut T. Kandemir, J. Ramanujam, Alok N. Choudhary: Compiler Algorithms for Optimizing Locality and Parallelism on Shared and Distributed-Memory Machines. J. Parallel Distrib. Comput. 60(8): 924-965 (2000) | |
| 1999 | ||
| 52 | Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam: I/O-Conscious Tiling for Disk-Resident Data Sets. Euro-Par 1999: 430-439 | |
| 51 | Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam: Restructuring I/O-Intensive Computations for Locality. HPCN Europe 1999: 1097-1106 | |
| 50 | Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam: Compiler Optimizations for I/O-Intensive Computations. ICPP 1999: 164-171 | |
| 49 | Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam, Prithviraj Banerjee: A Framework for Interprocedural Locality Optimization Using Both Loop and Data Layout Transformations. ICPP 1999: 95-102 | |
| 48 | Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam, Prithviraj Banerjee: On Reducing False Sharing while Improving Locality on Shared Memory Multiprocessors. IEEE PACT 1999: 203-211 | |
| 47 | Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam, Prithviraj Banerjee: A Graph Based Framework to Detect Optimal Memory Layouts for Improving Data Locality. IPPS/SPDP 1999: 738-743 | |
| 46 | Mahmut T. Kandemir, Prithviraj Banerjee, Alok N. Choudhary, J. Ramanujam, Eduard Ayguadé: An integer linear programming approach for optimizing cache locality. International Conference on Supercomputing 1999: 500-509 | |
| 45 | Vipin Jain, Siddharth Rele, Santosh Pande, J. Ramanujam: Code Restructuring for Improving Real Time Response through Code Speed, Size Trade-offs on Limited Memory Embedded DSPs. LCPC 1999: 459-463 | |
| 44 | Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam, Prithviraj Banerjee: Improving Locality Using a Graph-Based Technique for Detecting Memory Layouts of Arrays. PPSC 1999 | |
| 43 | Mahmut T. Kandemir, Prithviraj Banerjee, Alok N. Choudhary, J. Ramanujam, U. Nagaraj Shenoy: A global communication optimization technique based on data-flow analysis and linear algebra. ACM Trans. Program. Lang. Syst. 21(6): 1251-1297 (1999) | |
| 42 | Mahmut T. Kandemir, J. Ramanujam, Alok N. Choudhary: Improving Cache Locality by a Combination of Loop and Data Transformation. IEEE Trans. Computers 48(2): 159-167 (1999) | |
| 41 | Mahmut T. Kandemir, Alok N. Choudhary, U. Nagaraj Shenoy, Prithviraj Banerjee, J. Ramanujam: A Linear Algebra Framework for Automatic Determination of Optimal Data Layouts. IEEE Trans. Parallel Distrib. Syst. 10(2): 115-135 (1999) | |
| 40 | Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam, Prithviraj Banerjee: A Matrix-Based Approach to Global Locality Optimization. J. Parallel Distrib. Comput. 58(2): 190-235 (1999) | |
| 1998 | ||
| 39 | Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam, U. Nagaraj Shenoy, Prithviraj Banerjee: Enhancing Spatial Locality via Data Layout Optimizations. Euro-Par 1998: 422-434 | |
| 38 | M. Narasimhan, J. Ramanujam: Improving the computational performance of ILP-based problems. ICCAD 1998: 593-596 | |
| 37 | Mahmut T. Kandemir, U. Nagaraj Shenoy, Prithviraj Banerjee, J. Ramanujam, Alok N. Choudhary: Minimizing Data and Synchronization Costs in One-Way Communication. ICPP 1998: 180-188 | |
| 36 | Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam, Prithviraj Banerjee: A Matrix-Based Approach to the Global Locality Optimization Problem. IEEE PACT 1998: 306-313 | |
| 35 | Mahmut T. Kandemir, Prithviraj Banerjee, Alok N. Choudhary, J. Ramanujam, U. Nagaraj Shenoy: A Generalized Framework for Global Communication Optimization. IPPS/SPDP 1998: 69-73 | |
| 34 | Mahmut T. Kandemir, Alok N. Choudhary, U. Nagaraj Shenoy, Prithviraj Banerjee, J. Ramanujam: A Hyperplane Based Approach for Optimizing Spatial Locality in Loop Nests. International Conference on Supercomputing 1998: 69-76 | |
| 33 | Mahmut T. Kandemir, J. Ramanujam, Alok N. Choudhary, Prithviraj Banerjee: A Loop Transformation Algorithm Based on Explicit Data Layout Representation for Optimizing Locality. LCPC 1998: 34-50 | |
| 32 | Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam: Improving Locality in Out-of-Core Computations Using Data Layout Transformations. LCR 1998: 359-366 | |
| 31 | Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam, Prithviraj Banerjee: Improving Locality Using Loop and Data Transformations in an Integrated Framework. MICRO 1998: 285-297 | |
| 30 | P. Sadayappan, Fikret Erçal, J. Ramanujam: Partitioning Graphs on Message-Passing Machines by Pairwise Mincut. Inf. Sci. 111(1-4): 223-237 (1998) | |
| 29 | Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam, Meenakshi A. Kandaswamy: Locality Optimization Algorithms for Compilation of Out-of-Core Codes. J. Inf. Sci. Eng. 14(1): 107-138 (1998) | |
| 28 | Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam, Rajesh Bordawekar: Compilation Techniques for Out-of-Core Parallel Computations. Parallel Computing 24(3-4): 597-628 (1998) | |
| 1997 | ||
| 27 | Mahmut T. Kandemir, J. Ramanujam, Alok N. Choudhary: Optimization of Out-of-Core Computations Using Chain Vectors. Euro-Par 1997: 601-608 | |
| 26 | Mahmut T. Kandemir, J. Ramanujam, Alok N. Choudhary: Improving the Performance of Out-of-Core Computations. ICPP 1997: 128-136 | |
| 25 | Mahmut T. Kandemir, J. Ramanujam, Alok N. Choudhary: Compiler Algorithms for Optimizing Locality and Parallelism on Shared and Distributed Memory Machines. IEEE PACT 1997: 236- | |
| 24 | Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam, Meenakshi A. Kandaswamy: A Unified Compiler Algorithm for Optimizing Locality, Parallelism and Communication in Out-of-core Computations. IOPADS 1997: 79-92 | |
| 23 | Mahmut T. Kandemir, J. Ramanujam, Alok N. Choudhary: A Compiler Algorithm for Optimizing Locality in Loop Nests. International Conference on Supercomputing 1997: 269-276 | |
| 22 | J. Ramanujam, Swaroop Dutta, Arun Venkatachar: Code Generation for Complex Subscripts in Data-Parallel Programs. LCPC 1997: 49-63 | |
| 21 | Arun Venkatachar, J. Ramanujam, Ashwath Thirumalai: Communication Generation for Block-Cyclic Distributions. Parallel Processing Letters 7(2): 195-202 (1997) | |
| 1996 | ||
| 20 | Rajesh Bordawekar, Alok N. Choudhary, J. Ramanujam: A Framework for Integrated Communication and I/O Placement. Euro-Par, Vol. I 1996: 541-552 | |
| 19 | Rajesh Bordawekar, Alok N. Choudhary, J. Ramanujam: Automatic Optimization of Communication in Compiling Out-of-Core Stencil Codes. International Conference on Supercomputing 1996: 366-373 | |
| 18 | Arun Venkatachar, J. Ramanujam, Ashwath Thirumalai: Generalized Overlap Regions for Communication Optimization in Data-Parallel Programs. LCPC 1996: 404-419 | |
| 17 | Rajeev Thakur, Alok N. Choudhary, J. Ramanujam: Efficient Algorithms for Array Redistribution. IEEE Trans. Parallel Distrib. Syst. 7(6): 587-594 (1996) | |
| 16 | Ashwath Thirumalai, J. Ramanujam: Efficient Computation of Address Sequences in Data Parallel Programs Using Closed Forms for Basis Vectors. J. Parallel Distrib. Comput. 38(2): 188-203 (1996) | |
| 15 | Rajesh Bordawekar, Alok N. Choudhary, J. Ramanujam: Compilation and Communication Strategies for Out-of-Core Programs on Distributed Memory Machines. J. Parallel Distrib. Comput. 38(2): 277-288 (1996) | |
| 1995 | ||
| 14 | J. Ramanujam, S. Vasanthakumar: Statement-level independent partitioning of uniform recurrences. IPPS 1995: 229-233 | |
| 13 | S. D. Kaushik, Chua-Huang Huang, J. Ramanujam, P. Sadayappan: Multi-phase array redistribution: modeling and evaluation. IPPS 1995: 441-445 | |
| 12 | Ashwath Thirumalai, J. Ramanujam: Fast Address Sequence Generation for Data-Parallel Programs Using Integer Lattices. LCPC 1995: 191-208 | |
| 11 | J. Ramanujam, Amit Narayan: Integrating Data Distribution and Loop Transformations. PPSC 1995: 668-673 | |
| 1994 | ||
| 10 | J. Ramanujam: Optimal Software Pipelining of Nested Loops. IPPS 1994: 335-342 | |
| 9 | J. Ramanujam, A. Mathew: Analysis of Event Synchronization in Parallel Programs. LCPC 1994: 300-315 | |
| 1992 | ||
| 8 | J. Ramanujam: Non-Unimodular Transformations of Nested Loops. SC 1992: 214-223 | |
| 7 | J. Ramanujam, P. Sadayappan: Tiling Multidimensional Itertion Spaces for Multicomputers. J. Parallel Distrib. Comput. 16(2): 108-120 (1992) | |
| 1991 | ||
| 6 | J. Ramanujam: A Linear Algebraic View of Loop Transformations and Their Interaction. PPSC 1991: 543-548 | |
| 5 | J. Ramanujam, P. Sadayappan: Tiling multidimensional iteration spaces for nonshared memory machines. SC 1991: 111-120 | |
| 4 | J. Ramanujam, P. Sadayappan: Compile-Time Techniques for Data Distribution in Distributed Memory Machines. IEEE Trans. Parallel Distrib. Syst. 2(4): 472-482 (1991) | |
| 1990 | ||
| 3 | J. Ramanujam, P. Sadayappan: Tiling of Iteration Spaces for Multicomputers. ICPP (2) 1990: 179-186 | |
| 2 | Fikret Erçal, J. Ramanujam, P. Sadayappan: Task Allocation onto a Hypercube by Recursive Mincut Bipartitioning. J. Parallel Distrib. Comput. 10(1): 35-44 (1990) | |
| 1 | P. Sadayappan, Fikret Erçal, J. Ramanujam: Cluster partitioning approaches to mapping parallel programs onto a hypercube. Parallel Computing 13(1): 1-16 (1990) | |