13th International Parallel Processing Symposium / 10th Symposium on Parallel and Distributed Processing (IPPS / SPDP '99), 12-16 April 1999, San Juan, Puerto Rico, Proceedings. IEEE Computer Society 1999 ISBN 0-7695-0143-5
Session 1: Algorithmic Paradigms and Primitives


Frank K. H. A. Dehne, David A. Hutchinson, Anil Maheshwari, Wolfgang Dittrich: Reducing I/O Complexity by Simulating Coarse Grained Parallel Algorithms. 14-20
Hettihe P. Dharmasena, Ramachandran Vaidyanathan: Lower Bounds on the Loading of Degree-2 Multiple Bus Networks for Binary-Tree Algorithms. 21-25
Koji Nakano, Stephan Olariu, Albert Y. Zomaya: A Time-Optimal Solution for the Path Cover Problem on Cographs. 26-30
Keqin Li, Victor Y. Pan: Parallel Matrix Multiplication on a Linear Array with a Reconfigurable Pipelined Bus System. 31-
Session 2: Latency Tolerance and Performance Modeling

Toni Cortes, Jesús Labarta: Linear Aggressive Prefetching: A Way to Increase the Performance of Cooperative Caches. 46-54
Ahmad Afsahi, Nikitas J. Dimopoulos: Hiding Communication Latency in Reconfigurable Message-Passing Environments. 55-60
Xian-He Sun, Dongmei He, Kirk W. Cameron, Yong Luo: A Factorial Performance Evaluation for Hierarchical Memory Systems. 70-74
Nor Jaidi Tuah, Mohan Kumar, Svetha Venkatesh: A Performance Model of Speculative Prefetching in Distributed Information Systems. 75-80
Session 3: Communication, Run-Time Systems
David K. Lowenthal, Michael James: Run-Time Selection of Block Size in Pipelined Parallel Programs. 82-87

Olaf Bonorden, Ben H. H. Juurlink, Ingo von Otte, Ingo Rieping: The Paderborn University BSP (PUB) Library - Design, Implementation and Performance. 99-104
Shridhar Diwan, Dennis Gannon: A Capabilities Based Communication Model for High-Performance Distributed Applications: The Open HPC++ Approach. 105-
Session 4: Scalable Computing
Keqin Li, Xian-He Sun: Average-Case Analysis of Isospeed Scalability of Parallel Computations on Multiprocessors. 112-116
Prosenjit Bose, Albert Chan, Frank K. H. A. Dehne, Markus Latzel: Coarse Grained Parallel Maximum Matching In Convex Bipartite Graphs. 125-129
Brian Grayson, Michael Dahlin, Vijaya Ramachandran: Experimental Evaluation of QSM, a Simple Shared-Memory Model. 130-
Session 5: Communication and Protocols for Clusters

Dimiter R. Avresky, Vladimir Shurbanov, Robert W. Horst, Pankaj Mehra: Performance Evaluation of the ServerNet R SAN under Self-Similar Traffic. 143-147
Vijay Moorthy, Matthew G. Jacunski, Manoj Pillai, Peter P. Ware, Dhabaleswar K. Panda, Thomas W. Page Jr., P. Sadayappan, V. Nagarajan, Johns Daniel: Low-Latency Message Passing on Workstation Clusters using SCRAMNet. 148-152
Sandhya Dwarkadas, Robert Stets, Nikos Hardavellas, Leonidas I. Kontothanassis, Rishiyur S. Nikhil: Cashmere-VLM: Remote Memory Paging for Software Distributed Shared Memory. 153-159
Walfredo Cirne, Keith Marzullo: The Computational Co-op: Gathering Clusters into a Metacomputer. 160-166
Session 6: Communication Libraries
Stephen R. Donaldson, Jonathan M. D. Hill, David B. Skillicorn: Exploiting Global Structure for Performance on Clusters. 176-182
Mohammad Banikazemi, Rama Govindaraju, Robert Blackmore, Dhabaleswar K. Panda: Implementing Efficient MPI on LAPI for IBM RS/6000 SP Systems: Experiences and Performance Evaluation. 183-190

Kivanc Dincer: Ubiquitous Message Passing Interface Implementation in Java: jmpi. 203-
Session 7: Routing and Broadcasting I


Danny Krizanc, Flaminia L. Luccio, Rajeev Raman: Dynamic Interval Routing on Asynchronous Rings. 225-232
Jerry L. Trahan, Anu G. Bourgeois, Ramachandran Vaidyanathan, Yi Pan: Optimally Scaling Permutation Routing on Reconfigurable Linear Arrays with Optical Buses. 233-
Session 8: Miscellaneous Architecture
José Duato, Antonio Robles, Federico Silla, Ramón Beivide: A Comparison of Router Architectures for Virtual Cut-Through and Wormhole Switching in a NOWEnvironment. 240-247
Alberto Ferreira de Souza, Peter Rounce: Dynamically Scheduling the Trace Produced During Program Execution into VLIW Instructions. 248-257
Jong Hyuk Choi, Kyu Ho Park: Segment Directory Enhancing the Limited Directory Cache Coherence Schemes. 258-267
Kichul Kim: Shuffle Memory System. 268-272
Rong Lin, Koji Nakano, Stephan Olariu, Albert Y. Zomaya: An Efficient VLSI Architecture Parallel Prefix Counting With Domino Logic. 273-
Session 9: Advanced Software for Applications Support
Luís Moura Silva, João Gabriel Silva: The Performance of Coordinated and Independent Checkpointing. 280-284
Igor Z. Milosavljevic, Marwan A. Jabri: Automatic Array Alignment in Parallel Matlab Scripts. 285-289
Lonnie R. Welch, Binoy Ravindran, Paul V. Werme, Michael W. Masters, Behrooz Shirazi, Prashant A. Shirolkar, Robert D. Harrison, D. Wayne Mills, Tuy Do, Judy Lafratta, Shafqat Anwar, Steve Sharp, Terry Sergeant, George Bilowus, Mark Swick, Jim Hoppel, Joseph M. Caruso: Distributed, Scalable, Dependable Real-Time Systems: Middleware Services and Applications. 297-301
Session 10: Routing and Broadcasting II
Jin Suk Kim, Eric Lehman, Frank Thomson Leighton: Oblivious Deadlock-Free Routing in a Faulty Hypercube. 312-319
Matthew G. Jacunski, P. Sadayappan, Dhabaleswar K. Panda: All-to-All Broadcast on Switch-Based Clusters of Workstations. 325-329
Xicheng Liu, Timothy J. Li, Wen Gao: VBMAR: Virtual Network Load Balanced Minimal Adaptive Routing. 330-
Session 11: Scientific Engineering Systems
Rupak Biswas, Leonid Oliker, Sajal K. Das, Daniel J. Harvey: Portable Parallel Programming for the Dynamic Load Balancing of Unstructured Grid Applications. 338-342
Tracy D. Braun, Anthony A. Maciejewski, Howard Jay Siegel: A Parallel Algorithm for Singular Value Decomposition as Applied to Failure Tolerant Manipulators. 343-349
Nordine Melab, El-Ghazali Talbi, Serge G. Petiton: A Parallel Adaptive version of the Block-based Gauss-Jordan Algorithm. 350-354
Gerd Heber, Guang R. Gao, Rupak Biswas: A New Approach to Parallel Dynamic Partitioning for Adaptive Unstructured Meshes. 360-364
Salvatore Filippone, Michele Colajanni, Dario Pascucci: An Object-Oriented Environment for Sparse Parallel Computation on Adaptive Grids. 365-
Session 12: Performance
JunSeong Kim, David J. Lilja: A Network Status Predictor to Support Dynamic Scheduling in Network-Based Computing Systems. 372-378
Thomas T. Kwan, Daniel A. Reed: Performance Evaluation of an Infrastructure for Worldwide Parallel Computing. 379-386
Aleksandar M. Bakic, Matt W. Mutka, Diane T. Rover: BRISK: A Portable and Flexible Distributed Instrumentation System. 387-391
Franco Zambonelli: An Efficient Logging Algorithm for Incremental Replay of Message. 392-398
Angkul Kongmunvattana, Nian-Feng Tzeng: Lazy Logging and Prefetch-Based Crash Recovery in Software Distributed Shared Memory Systems. 399-406
Magnus Broberg, Lars Lundberg, Håkan Grahn: Visualization and Performance Prediction of Multithreaded Solaris Programs by Tracing Kernel Threads. 407-
Industrial Track Presentation
John Robinson: Delivering On Standards, Balancing Portability and Performance. 416-
Session 13: Mesh Architecture
Jop F. Sibeyn: Better Deterministic Routing on Meshes. 420-425
Hong Shen: Efficient Parallel Algorithms for Selection and Multiselection on Mesh-Connected Computers. 426-430
Amitava Datta: Constant-Time Algorithm for Medial Axis Transform on the Reconfigurable Mesh. 431-435
Chi-Hsiang Yeh, Behrooz Parhami, Hua Lee, Emmanouel A. Varvarigos: 2.5n-Step Sorting on n*n Meshes in the Presence of o(sqrt(n)) Worst-Case Faults. 436-440
Chi-Hsiang Yeh, Behrooz Parhami, Emmanouel A. Varvarigos: The Recursive Grid Layout Scheme for VLSI Layout of Hierarchical Networks. 441-
Session 14: Signal Processing
Wei-keng Liao, Alok N. Choudhary, Donald Weiner, Pramod K. Varshney: Multi-Threaded Design and Implementation of Parallel Pipelined STAP on Parallel Computers with SMP Nodes. 448-452
Sang-Hwa Chung, Min-Uk Park, Hyung-Soon Kim: A Parallel Phoneme Recognition Algorithm Based on Continuous Hidden Markov Model. 453-457
Ashfaq A. Khokhar, Gerd Heber, Parimala Thulasiraman, Guang R. Gao: Load Adaptive Algorithms and Implementations for the 2D Discrete Wavelet Transform on Fine-Grain Multithreaded Architectures. 458-462
David R. Martinez: Application of Parallel Processors to Real-Time Sensor Array Processing. 463-469
Reinhard Lüling: Mapping Media Streams onto a Network of Servers. 470-476
Session 15: Program Optimization, Resource Allocation, Scheduling
Jeffrey S. Vetter, Karsten Schwan: Optimizations for Language-Directed Computational Steering. 486-491
Sergei Gorlatch, Christoph Wedler, Christian Lengauer: Optimization Rules for Programming with Collective Operations. 492-499
S. Chingchit, Mohan Kumar, Laxmi N. Bhuyan: A Flexible Clustering and Scheduling Scheme for Efficient Parallel Computation. 500-505
Arash Baratloo, Ayal Itzkovitz, Zvi M. Kedem, Yuanyuan Zhao: Mechanisms for Just-in-Time Allocation of Resources to Adaptive Parallel Programs. 506-512
Fangzhe Chang, Vijay Karamcheti, Zvi M. Kedem: Exploiting Application Tunability for Efficient, Predictable Parallel Resource Management. 749-
David Talby, Dror G. Feitelson: Supporting Priorities and Improving Utilization of the IBM SP Scheduler Using Slack-Based Backfilling. 513-
Session 16: Load Balancing and Distributed Computing
Arnold L. Rosenberg: Guidelines for Data-Parallel Cycle-Stealing in Networks of Workstations, II: On Maximizing Guaranteed Output. 520-524
Andrei Radulescu, Arjan J. C. van Gemund, Hai-Xiang Lin: LLB: A Fast and Effective Scheduling Algorithm for Distributed-Memory Systems. 525-530
Stefan Bischof, Ralf Ebner, Thomas Erlebach: Parallel Load Balancing for Problems with Good Bisectors. 531-538
Tatsuya Hayashi, Koji Nakano, Stephan Olariu: Randomized Initialization Protocols for Packet Radio Networks. 544-
Session 17: Data Mining and Databases
Sajal K. Das, Maria Cristina Pinotti: An Optimal Disk Allocation Strategy for Partial Match Queries on Non-Uniform Cartesian Product Files. 550-554
Mahesh K. Sreenivas, Khaled Alsabti, Sanjay Ranka: Parallel Out-of-Core Divide-and-Conquer Techniques with Application to Classification Trees. 555-562
Pedro E. López-de-Teruel, José M. García, Manuel E. Acacio, Óscar Cánovas Reverte: P-EDR: An Algorithm for Parallel Implementation of Parzen Density Estimation from Uncertain Observations. 563-568
Sanjay Goil, Alok N. Choudhary: Design and Implementation of a Scalable Parallel System for Multidimensional Analysis and OLAP. 576-581
Chialin Chang, Renato Ferreira, Alan Sussman, Joel H. Saltz: Infrastructure for Building Parallel Database Systems for Multi-Dimensional Data. 582-
Session 18: Compilers
Alessandro Marongiu, Paolo Palazzari, C. Casaccia: A New Memory-Saving Technique to Map System of Affine Recurrence Equations (SARE) onto Distributed Memory Systems. 590-596
Dhruva R. Chakrabarti, Prithviraj Banerjee: A Novel Compilation Framework for Supporting Semi-Regular Distributions in Hybrid Applications. 597-602
Xin Yuan, Rajiv Gupta, Rami G. Melhem: Compiler Analysis to Support Compiled Communication for HPF-Like Programs. 603-608
Pramod G. Joisha, Prithviraj Banerjee: PARADIGM (version 2.0): A New HPF Compilation System. 609-615
Christian Queinnec: Marshaling/Demarshaling as a Compilation/Interpretation Process. 616-
Industrial Track Presentation
Steve Casselman, John Schewel, Christophe Beaumont: IP Validation for FPGAs using Hardware Object TechnologyTM. 624-
Session 19: Biological and Discrete Systems
Robert E. Lynch, Dan C. Marinescu, Hong Lin, Timothy S. Baker: Parallel Algorithms for 3D Reconstruction of Asymmetric Objects from Electron Micrographs. 632-637
Pierre-Eric Bernard, Thierry Gautier, Denis Trystram: Large Scale Simulation of Parallel Molecular Dynamics. 638-644
Srinivas Aluru, Natsuhiko Futamura, Kishan Mehrotra: Parallel Biological Sequence Comparison Using Prefix Computations. 653-659
Session 20: Real-Time Simulation and Load Balancing
Maurizio A. Bonuccelli, M. Claudia Clò: EDD Algorithm Performance Guarantee for Periodic Hard-Real-Time Scheduling in Distributed Systems. 668-677
Jan Jonsson: A Robust Adaptive Metric for Deadline Assignment in Heterogeneous Distributed Real-Time Systems. 678-687
Charles Michael Johnston, Anthony T. Chronopoulos: A Communication Latency Hiding Parallelization of a Traffic Flow Simulation. 688-695
Narayanan V. Thondugulam, Dhananjai Madhava Rao, Radharamanan Radhakrishnan, Philip A. Wilsey: Relaxing Causal Constraints in PDES. 696-700
Luis Miguel Campos, Isaac D. Scherson: Rate of Change Load Balancing in Distributed and Parallel Systems. 701-707
Hwakyung Rim, Ju-wook Jang, Sung-Chun Kim: An Efficient Dynamic Load Balancing using the Dimension Exchange Method for Balancing of Quantized Loads on Hypercube Multiprocessors. 708-
Session 21: Miscellaneous Software
Ruth E. Anderson, Thu D. Nguyen, John Zahorjan: Cascaded Execution: Speeding Up Unparallelized Execution on Shared-Memory Multiprocessors. 714-719
Vítor Santos Costa: COWL: Copy-On-Write for Logic Programs. 720-727
João Luís Sobral, Alberto José Proença: Dynamic Grain-Size Adaptation on Object Oriented Parallel Programming The SCOOPP Approach. 728-732
Azzedine Boukerche, Sajal K. Das, Ajoy Kumar Datta, Timothy E. LeMaster: Implementation of a Virtual Time Synchronizer for Distributed Databases. 733-737
Mahmut T. Kandemir, Alok N. Choudhary, J. Ramanujam, Prithviraj Banerjee: A Graph Based Framework to Detect Optimal Memory Layouts for Improving Data Locality. 738-743
S. R. Prakash, Y. N. Srikant: Hyperplane Partitioning: An Approach to Global Data Partitioning for Distributed Memory Machines. 744-



