SC 1994: Washington, DC, USA
Gary M. Johnson (Ed.): Proceedings Supercomputing '94, Washington, DC, USA, November 14-18, 1994. IEEE Computer Society / ACM 1994 ISBN 0-8186-6605-6
Roundtables
Milissa M. Benincasa: Issues in applying high performance computing to real-time systems (roundtable session). 1-2
Thomas L. Marchioro, David Martin: Issues in undergraduate education in computational science and high performance computing. 3
Panels
Myron Ginsberg: Selecting "best" MPP: do benchmarks really tell the whole story? (panel). 4
Wallace Feurzeig, Nancy Roberts: Computer modeling in science education: toward a research and planning agenda (panel). 5-6
Plenary: Manufacturing and Design
F. Ron Bailey, Mike George, Dennis Koga, Pieter Buning, Paul Delzio, Robert Kulfan: Integration of numerical and experimental wind tunnels (IofNEWT). 7
Engineering Applications
Deb Banerjee, Thomas Tysinger, Wayne Smith: A scalable high-performance environment for fluid flow analysis on unstructured grids. 8-17
Eric Tomacruz, Jagesh V. Sanghavi, Alberto L. Sangiovanni-Vincentelli: A parallel iterative linear solver for solving irregular grid semiconductor device matrices. 24-33
Mathematical Algorithms
Ramesh C. Agarwal, Fred G. Gustavson, Mohammad Zubair: A high performance parallel algorithm for 1-D FFT. 34-40
Jonathan Eckstein: Control strategies for parallel mixed integer branch and bound. 41-48
D. L. Dai, Sandeep K. S. Gupta, S. D. Kaushik, J. H. Lu, R. V. Singh, Chua-Huang Huang, P. Sadayappan, Rodney W. Johnson: EXTENT: a portable programming environment for designing and implementing high-performance block recursive algorithms. 49-58
Languages
Wai-Mee Ching, Alexander Katz: An experimental APL compiler for a distributed memory parallel machine. 59-68
Steven Parkes, John A. Chandy, Prithviraj Banerjee: A library-based approach to portable, parallel, object-oriented programming: interface, implementation, and application. 69-78
Tomio Kamada, Satoshi Matsuoka, Akinori Yonezawa: Efficient parallel global garbage collection on massively parallel computers. 79-88
Communication Libraries
Allan D. Knies, William J. Harrod, F. Ray Barriuso, George B. Adams III: SLICC: a low latency interface for collective communications. 89-96
Shamik D. Sharma, Ravi Ponnusamy, Bongki Moon, Yuan-Shin Hwang, Raja Das, Joel H. Saltz: Run-time and compile-time support for adaptive irregular problems. 97-106
Michael Barnett, Lance Shuler, Satya Gupta, David G. Payne, Robert A. van de Geijn, Jerrell Watts: Building a high-performance collective communication library. 107-116
University Education
Sridhar V. Iyer, Alan Beck, Umberto Ravaioli, Jeffrey A. Terstriep: Distributed exploratorium for high performance computational techniques. 117-125
Curtis R. Cook, Cherri M. Pancake, Rebecca A. Walpole: Are expectations for parallelism too high? a survey of potential parallel users. 126-133
G. V. Wilson, N. B. MacDonald, C. Thornborrow, C. M. Brough: The development and operation of Edinburgh Parallel Computing Centre's Summer Scholarship Programme. 134-143
Invited Papers
Murali Beddhu, Min-Yee Jiang, Lafayette K. Taylor, David L. Whitfield: Towards computations of ocean flows using Navier-Stokes equations. 144-153
Networks (WAN/LAN)
Mengjou Lin, Jenwei Hsieh, David Hung-Chang Du, Joseph P. Thomas, James A. MacDonald: Distributed network computing over local ATM networks. 154-163
Chengchang Huang, Eric P. Kasten, Philip K. McKinley: Design and implementation of multicast operations for ATM-based high performance computing. 164-173
Mengjou Lin, Jenwei Hsieh, David Hung-Chang Du, James A. MacDonald: Performance of high-speed network I/O subsystems: case study of a fibre channel network. 174-183
Linear Algebra I
D. P. Koester, Sanjay Ranka, Geoffrey Fox: A parallel Gauss-Seidel algorithm for sparse power system matrices. 184-193
Osamu Tatebe, Yoshio Oyanagi: Efficient implementation of the multigrid preconditioned conjugate gradient method on distributed memory machines. 194-203
George Karypis, Anshul Gupta, Vipin Kumar: A parallel formulation of interior point algorithms. 204-213
Processor Allocation and Scheduling

Wanqian Liu, Virginia Mary Lo, Kurt J. Windisch, Bill Nitzberg: Non-contiguous processor allocation algorithms for distributed memory multicomputers. 227-236
G. N. Srinivasa Prasanna, Bruce R. Musicus: Generalized multiprocessor scheduling for directed acyclic graphs. 237-246
Hardware Performance

Leonidas I. Kontothanassis, Rabin A. Sugumar, Greg Faanes, James E. Smith, Michael L. Scott: Cache performance in vector supercomputers. 255-264
Roundtables

Diane T. Rover: Performance evaluation: integrating techniques and tools into environments and frameworks. 277-278
Panels
Donald A. B. Lindberg, John C. Toole, Paul R. Young, John S. Cavallini, Lee B. Holcomb, Richard J. Linn Jr., George R. Cotter, Helen M. Wood, Joan H. Novak, Alexis T. Poliakoff: The High Performance Computing and Communication (HPCC) Program: technologies for the National Information Infrastructure (panel). 279
Plenary: Environment
Gregory J. McRae: Realizing the benefits of high performance computing for solving environmental and other problems: what do we need beyond faster hardware? 280
Engineering Applications II
Charles J. Turner, Jennifer G. Turner: Adaptive data parallel methods for ecosystem monitoring. 281-290
Gordon Cameron, Brian J. N. Wylie, David McArthur: PARAMICS - moving vehicles on the connection machine. 291-300
Seminumerical Algorithms
Andrew Sohn, Mitsuhisa Sato, Shuichi Sakai, Yuetsu Kodama, Yoshinori Yamaguchi: Nonnumeric search results on the EM-4 distributed-memory multiprocessor. 301-310
Daniel V. Pryor, Steven A. Cuccaro, Michael Mascagni, M. L. Robinson: Implementation of a portable and reproducible parallel pseudorandom number generator. 311-319
Alexander V. Kozlov, Jaswinder Pal Singh: A parallel Lauritzen-Spiegelhalter algorithm for probabilistic inference. 320-329
Languages and Runtime Libraries
Jaspal Subhlok, David R. O'Hallaron, Thomas R. Gross, Peter A. Dinda, Jon A. Webb: Communication and memory requirements as the basis for mapping task and data parallel programs. 330-339
Jarek Nieplocha, Robert J. Harrison, Richard J. Littlefield: Global arrays: a portable "shared-memory" programming model for distributed memory computers. 340-349
Matthew Haines, David Cronk, Piyush Mehrotra: On the design of Chant: a talking threads package. 350-359
Scheduling and Operating Systems
Jhy-Chun Wang, Sanjay Ranka: Scheduling of unstructured communication on the Intel iPSC/860. 360-369
Alain Gefflaut, Christine Morin, Michel Banâtre: Tolerating node failures in cache only memory architectures. 370-379
Babak Falsafi, Alvin R. Lebeck, Steven K. Reinhardt, Ioannis Schoinas, Mark D. Hill, James R. Larus, Anne Rogers, David A. Wood: Application-specific protocols for user-level shared memory. 380-389
Cluster Computing
Jeremy Casas, Ravi B. Konuru, Steve W. Otto, Robert M. Prouty, Jonathan Walpole: Adaptive load migration systems for PVM. 390-399
Rod Fatoohi, Sisira Weeratunga: Performance evaluation of three distributed computing environments for scientific applications. 400-409
K. Castagnera, D. Cheng, Rod Fatoohi, E. Hook, B. Kramer, C. Manning, J. Musch, C. Niggley, William Saphir, D. Sheppard, M. Smith, Ian Stockdale, S. Welch, R. Williams, D. Yip: NAS experiences with a prototype cluster of workstations. 410-419
Physics Applications
Srinivas Aluru, Gurpur M. Prabhu, John L. Gustafson: Truly distribution-independent algorithms for the N-body problem. 420-428
Junichiro Makino, Makoto Taiji, Toshikazu Ebisuzaki, Daiichiro Sugimoto: GRAPE-4: a one-Tflops special-purpose computer for astrophysical N-body problem. 429-438
Ananth Grama, Vipin Kumar, Ahmed H. Sameh: Scalable parallel formulations of the Barnes-Hut method for n-body simulations. 439-448
Load Balancing and Domain Decomposition
Harpal Maini, Kishan Mehrotra, Chilukuri K. Mohan, Sanjay Ranka: Genetic algorithms for graph partitioning and incremental graph partitioning. 449-457
Chao-Wei Ou, Sanjay Ranka: Parallel incremental graph partitioning using linear programming. 458-467
Architecture
Teruo Utsumi, Masayuki Ikeda, Moriyuki Takamura: Architecture of the VPP500 parallel supercomputer. 478-487
Tzi-cker Chiueh: Sunder: a programmable hardware prefetch architecture for numerical loops. 488-497
Akhilesh Kumar, Phanindra K. Mannava, Laxmi N. Bhuyan: Efficient and scalable cache coherence schemes for shared memory hypercube multiprocessors. 498-507
Dependence Analysis/Loop Parallelization
A. Zaafrani, Mabo Robert Ito: Expressing cross-loop dependencies through hyperplane data dependence analysis. 508-517
Ding-Kai Chen, Josep Torrellas, Pen-Chung Yew: An efficient algorithm for the run-time parallelization of DOACROSS loops. 518-527
William Blume, Rudolf Eigenmann: The range test: a dependence test for symbolic, non-linear expressions. 528-537
Roundtables
Scott E. Fahlman, Lowell D. Harris, Clinton S. Potter, Richard A. Robb: Supercomputing support for advanced biomedical imaging (roundtable). 538-539
Plenary: Biology and Medicine
Christopher R. Johnson, Steven G. Parker: A computational steering model applied to problems in medicine. 540-549
Biomedical Applications
Calvin A. Johnson, Neil I. Weisenfeld, Benes L. Trus, J. F. Conway, Robert L. Martino, Alasdair C. Steven: Orientation determination in the 3D reconstruction of icosahedral viruses using a parallel computer. 550-559
Raimond L. Winslow, Dongming Cai, Ying-Cheng Lai: Factors controlling generation and propagation of pacemaker potentials in network models of mammalian SA node. 560-569
Cheng Che Chen, Jaswinder Pal Singh, William B. Poland, Russ B. Altman: Parallel protein structure determination from uncertain data. 570-579
Software Performance
Subhash Saini, Horst D. Simon: Applications performance under OSF/1 AD and SUNMOS on Intel Paragon XP/S-15. 580-589
Doug Burger, Rahmat S. Hyder, Barton P. Miller, David A. Wood: Paging tradeoffs in distributed-shared-memory multiprocessors. 590-599
Computer-Communication Interaction
Brian Tierney, William E. Johnston, Hanan Herzog, Gary Hoo, Guojun Jin, Jason Lee, Ling Tony Chen, Doron Rotem: Using high speed networks to enable distributed parallel image server systems. 610-619
Ronald Mraz: Reducing the variance of point to point transfers in the IBM 9076 parallel computer. 620-629
Stephen W. Turner, Lionel M. Ni, Betty H. C. Cheng: Time and/or space sharing in a workstation cluster environment. 630-639
High Performance I/O
David Kotz, Nils Nieuwejaar: Dynamic file-access characteristics of a production parallel scientific workload. 640-649
Kent E. Seamons, Marianne Winslett: An efficient abstract interface for multidimensional array I/O. 650-659
Barbara K. Pasquale, George C. Polyzos: Dynamic I/O characterization of I/O intensive scientific applications. 660-669
Gordon Bell Awards Session

John N. Shadid, Scott A. Hutchinson, Harry Moffat, Gary L. Hennigan, Bruce Hendrickson, Robert W. Leland: A 65+ Gflops/s unstructured finite element simulation of chemically reacting flows on the Intel Paragon. 673-679
David E. Womble, David S. Greenberg, Stephen R. Wheat, Robert E. Benner, Marc S. Ingber, Greg Henry, Satya Gupta: Applications of boundary element methods on the Intel Paragon. 680-684
H. Miyoshi, Masahiro Fukuda, T. Iwamiya, Takashi Nakamura, M. Tuchiya, Masahiro Yoshida, Kazuomi Yamamoto, Y. Yamamoto, S. Ogawa, Y. Matsuo, Takashi Yamane, M. Takamura, M. Ikeda, S. Okada, Y. Sakamoto, T. Kitamura, H. Hatama, M. Kishimoto: Development and achievement of NAL Numerical Wind Tunnel (NWT) for CFD computations. 685-692
Internal Interconnect
Rajendra V. Boppana, Suresh Chalasani: Fault-tolerant routing with non-adaptive wormhole algorithms in mesh networks. 693-702
Hong Xu, Yadong Gui, Lionel M. Ni: Optimal software multicast in wormhole-routed multistage networks. 703-712
Joonho Park, Brian W. O'Krafka, Stamatis Vassiliadis, José G. Delgado-Frias: Design and evaluation of a DAMQ multiprocessor network with self-compacting buffers. 713-722
Software Tools

Seema Hiranandani, Ken Kennedy, Chau-Wen Tseng, Scott K. Warren: The D editor: a new interactive parallel programming tool. 733-742
Manish Parashar, Salim Hariri, Tomasz Haupt, Geoffrey Fox: Interpreting the performance of HPF/Fortran 90D. 743-752
Compilers

Ravi Mirchandaney, Seema Hiranandani, Ajay Sethi: Improving the performance of DSM systems via compiler involvement. 763-772
Lynn Choi, Pen-Chung Yew: A compiler-directed cache coherence scheme with improved intertask locality. 773-782
Linear Algebra II
Edward Rothberg, Robert Schreiber: Improved load distribution in parallel sparse Cholesky factorization. 783-792
Makoto Nakanishi, Hiroshi Ina, Ken'ichi Miura: A high performance linear equation solver on the VPP500 parallel supercomputer. 803-810
Steven Hotovy, Wayne Pfeiffer, Thomas L. Sterling: Findings of the joint NSF-NASA initiative in evaluation. 811-813



