| 2013 | ||
|---|---|---|
| j50 | Ian C. Atkinson, Geng (Daniel) Liu, Nady Obeid, Keith R. Thulborn, Wen-mei W. Hwu: Rapid computation of sodium bioscales using gpu-accelerated image reconstruction. Int. J. Imaging Systems and Technology 23(1): 29-35 (2013) | |
| j49 | Xiaohuang Huang, Christopher I. Rodrigues, Stephen Jones, Ian Buck, Wen-mei W. Hwu: Scalable SIMD-parallel memory allocation for many-core machines. The Journal of Supercomputing 64(3): 1008-1020 (2013) | |
| 2012 | ||
| b2 | Hyesoon Kim, Richard W. Vuduc, Sara S. Baghsorkhi, JeeWhan Choi, Wen-mei W. Hwu: Performance Analysis and Tuning for General Purpose Graphics Processing Units (GPGPU). Synthesis Lectures on Computer Architecture, Morgan & Claypool Publishers 2012 | |
| j48 | Xiaolong Wu, Yun Heo, Izzat El Hajj, Wen-mei W. Hwu, Deming Chen, Jian Ma: TIGER: tiled iterative genome assembler. BMC Bioinformatics 13(S-19): S18 (2012) | |
| j47 | John A. Stratton, Christopher I. Rodrigues, I-Jui Sung, Li-Wen Chang, Nasser Anssari, Geng (Daniel) Liu, Wen-mei W. Hwu, Nady Obeid: Algorithm and Data Optimization Techniques for Scaling to Massively Threaded Systems. IEEE Computer 45(8): 26-32 (2012) | |
| j46 | I-Jui Sung, Nasser Anssari, John A. Stratton, Wen-mei W. Hwu: Data Layout Transformation Exploiting Memory-Level Parallelism in Structured Grid Many-Core Applications. International Journal of Parallel Programming 40(1): 4-24 (2012) | |
| c119 | Hee-Seok Kim, Minwook Ahn, John A. Stratton, Wen-mei W. Hwu: Design evaluation of OpenCL compiler framework for Coarse-Grained Reconfigurable Arrays. FPT 2012: 313-320 | |
| c118 | Kai-Wei Chang, Biplab Deka, Wen-mei W. Hwu, Dan Roth: Efficient Pattern-Based Time Series Classification on GPU. ICDM 2012: 131-140 | |
| c117 | Sara S. Baghsorkhi, Isaac Gelado, Matthieu Delahaye, Wen-mei W. Hwu: Efficient performance evaluation of memory hierarchy for highly multithreaded graphics processors. PPOPP 2012: 23-34 | |
| c116 | Li-Wen Chang, John A. Stratton, Hee-Seok Kim, Wen-mei W. Hwu: A scalable, numerically stable, high-performance tridiagonal solver using GPUs. SC 2012: 27 | |
| 2011 | ||
| j45 | Michael T. Showerman, Jeremy Enos, Craig P. Steffen, Sean Treichler, William Gropp, Wen-mei W. Hwu: EcoG: A Power-Efficient GPU Cluster Architecture for Scientific Computing. Computing in Science and Engineering 13(2): 83-87 (2011) | |
| c115 | Alexandros Papakonstantinou, Yun Liang, John A. Stratton, Karthik Gururaj, Deming Chen, Wen-mei W. Hwu, Jason Cong: Multilevel Granularity Parallelism Synthesis on FPGAs. FCCM 2011: 178-185 | |
| c114 | Li-Wen Chang, Men-Tzung Lo, Nasser Anssari, Ke-Hsin Hsu, Norden E. Huang, Wen-mei W. Hwu: Parallel implementation of Multi-dimensional Ensemble Empirical Mode Decomposition. ICASSP 2011: 1621-1624 | |
| c113 | Hee-Seok Kim, Shengzhao Wu, Li-Wen Chang, Wen-mei W. Hwu: A Scalable Tridiagonal Solver for GPUs. ICPP 2011: 444-453 | |
| c112 | Per Stenström, Doug Burger, Wen-mei W. Hwu, Vipin Kumar, Kunle Olukotun, David A. Padua, Burton Smith: Panel Statement. IPDPS 2011: 877 | |
| c111 | Xiaolong Wu, Jiading Gai, Fan Lam, Maojing Fu, Justin P. Haldar, Yue Zhuo, Zhi-Pei Liang, Wen-mei W. Hwu, Bradley P. Sutton: Impatient MRI: Illinois Massively Parallel Acceleration Toolkit for image reconstruction with enhanced throughput in MRI. ISBI 2011: 69-72 | |
| 2010 | ||
| b1 | David Blair Kirk, Wen-mei W. Hwu: Programming Massively Parallel Processors - A Hands-on Approach. Morgan Kaufmann 2010, isbn 978-0-12-381472-2, pp. I-XVIII, 1-258 | |
| j44 | Volodymyr V. Kindratenko, Robert Wilhelmson, Robert J. Brunner, Todd J. Martinez, Wen-mei W. Hwu: High-Performance Computing with Accelerators. Computing in Science and Engineering 12(4): 12-16 (2010) | |
| c110 | Xiaohuang Huang, Christopher I. Rodrigues, Stephen Jones, Ian Buck, Wen-mei W. Hwu: XMalloc: A Scalable Lock-free Dynamic Memory Allocator for Many-core Machines. CIT 2010: 1134-1139 | |
| c109 | Xiaolong Wu, Nady Obeid, Wen-mei W. Hwu: Exploiting More Parallelism from Applications Having Generalized Reductions on GPU Architectures. CIT 2010: 1175-1180 | |
| c108 | Wen-mei W. Hwu: Raising the level of many-core programming with compiler technology: meeting a grand challenge. PACT 2010: 5-6 | |
| c107 | I-Jui Sung, John A. Stratton, Wen-mei W. Hwu: Data layout transformation exploiting memory-level parallelism in structured grid many-core applications. PACT 2010: 513-522 | |
| c106 | Isaac Gelado, Javier Cabezas, Nacho Navarro, John E. Stone, Sanjay J. Patel, Wen-mei W. Hwu: An asymmetric distributed shared memory model for heterogeneous parallel systems. ASPLOS 2010: 347-358 | |
| c105 | John A. Stratton, Vinod Grover, Jaydeep Marathe, Bastiaan Aarts, Mike Murphy, Ziang Hu, Wen-mei W. Hwu: Efficient compilation of fine-grained SPMD-threaded programs for multicore CPUs. CGO 2010: 111-119 | |
| c104 | Lijuan Luo, Martin D. F. Wong, Wen-mei W. Hwu: An effective GPU implementation of breadth-first search. DAC 2010: 52-55 | |
| c103 | Yue Zhuo, Xiaolong Wu, Justin P. Haldar, Wen-mei W. Hwu, Zhi-Pei Liang, Bradley P. Sutton: Accelerating iterative field-compensated MR image reconstruction on GPUS. ISBI 2010: 820-823 | |
| c102 | Stephen M. Kofsky, Daniel R. Johnson, John A. Stratton, Wen-mei W. Hwu, Sanjay J. Patel, Steven S. Lumetta: Implementing a GPU Programming Model on a Non-GPU Accelerator Architecture. ISCA Workshops 2010: 40-51 | |
| c101 | Sara S. Baghsorkhi, Matthieu Delahaye, Sanjay J. Patel, William D. Gropp, Wen-mei W. Hwu: An adaptive performance modeling tool for GPU architectures. PPOPP 2010: 105-114 | |
| 2009 | ||
| j43 | Wen-mei W. Hwu, Christopher I. Rodrigues, Shane Ryoo, John A. Stratton: Compute Unified Device Architecture Application Suitability. Computing in Science and Engineering 11(3): 16-26 (2009) | |
| j42 | Hillery C. Hunter, Erik M. Nystrom, Daniel A. Connors, Wen-mei W. Hwu: Hardware-compiler co-design for adjustable data power savings. Microprocessors and Microsystems - Embedded Hardware Design 33(4): 244-253 (2009) | |
| c100 | John E. Stone, Jan Saam, David J. Hardy, Kirby L. Vandivort, Wen-mei W. Hwu, Klaus Schulten: High performance computation and interactive display of molecular orbitals on GPUs and multi-core CPUs. GPGPU 2009: 9-18 | |
| c99 | Albert Sidelnik, I-Jui Sung, Wanmin Wu, María Jesús Garzarán, Wen-mei W. Hwu, Klara Nahrstedt, David A. Padua, Sanjay J. Patel: Optimization of tele-immersion codes. GPGPU 2009: 85-93 | |
| c98 | Volodymyr V. Kindratenko, Jeremy Enos, Guochun Shi, Michael T. Showerman, Galen W. Arnold, John E. Stone, James C. Phillips, Wen-mei W. Hwu: GPU clusters for high-performance computing. CLUSTER 2009: 1-8 | |
| c97 | Alexandros Papakonstantinou, Karthik Gururaj, John A. Stratton, Deming Chen, Jason Cong, Wen-mei W. Hwu: High-performance CUDA kernel execution on FPGAs. ICS 2009: 515-516 | |
| c96 | Wen-mei W. Hwu: Many-core parallel computing - Can compilers and tools do the heavy lifting? IPDPS 2009: 1 | |
| c95 | Elijah Roberts, John E. Stone, Leonardo Sepulveda, Wen-mei W. Hwu, Zaida Luthey-Schulten: Long time-scale simulations of in vivo diffusion using GPU hardware. IPDPS 2009: 1-8 | |
| c94 | Wen-mei W. Hwu, Deepthi Nandakumar, Justin P. Haldar, Ian C. Atkinson, Bradley P. Sutton, Zhi-Pei Liang, Keith R. Thulborn: Accelerating MR Image Reconstruction on GPUS. ISBI 2009: 1283-1286 | |
| c93 | Alexandros Papakonstantinou, Karthik Gururaj, John A. Stratton, Deming Chen, Jason Cong, Wen-mei W. Hwu: FCUDA: Enabling efficient compilation of CUDA kernels onto FPGAs. SASP 2009: 35-42 | |
| 2008 | ||
| j41 | David Yeh, Li-Shiuan Peh, Shekhar Borkar, John A. Darringer, Anant Agarwal, Wen-mei W. Hwu: Thousand-Core Chips [Roundtable]. IEEE Design & Test of Computers 25(3): 272-278 (2008) | |
| j40 | Wen-mei W. Hwu, Kurt Keutzer, Timothy G. Mattson: The Concurrency Challenge. IEEE Design & Test of Computers 25(4): 312-320 (2008) | |
| j39 | Sam S. Stone, Justin P. Haldar, Stephanie C. Tsao, Wen-mei W. Hwu, Bradley P. Sutton, Zhi-Pei Liang: Accelerating advanced MRI reconstructions on GPUs. J. Parallel Distrib. Comput. 68(10): 1307-1318 (2008) | |
| j38 | Shane Ryoo, Christopher I. Rodrigues, Sam S. Stone, John A. Stratton, Sain-Zee Ueng, Sara S. Baghsorkhi, Wen-mei W. Hwu: Program optimization carving for GPU computing. J. Parallel Distrib. Comput. 68(10): 1389-1401 (2008) | |
| j37 | Sanjay J. Patel, Wen-mei W. Hwu: Guest Editors' Introduction: Accelerator Architectures. IEEE Micro 28(4): 4-12 (2008) | |
| c92 | Sam S. Stone, Justin P. Haldar, Stephanie C. Tsao, Wen-mei W. Hwu, Zhi-Pei Liang, Bradley P. Sutton: Accelerating advanced mri reconstructions on gpus. Conf. Computing Frontiers 2008: 261-272 | |
| c91 | Christopher I. Rodrigues, David J. Hardy, John E. Stone, Klaus Schulten, Wen-mei W. Hwu: GPU acceleration of cutoff pair potentials for molecular modeling applications. Conf. Computing Frontiers 2008: 273-282 | |
| c90 | Shane Ryoo, Christopher I. Rodrigues, Sam S. Stone, Sara S. Baghsorkhi, Sain-Zee Ueng, John A. Stratton, Wen-mei W. Hwu: Program optimization space pruning for a multithreaded gpu. CGO 2008: 195-204 | |
| c89 | Elaine Wah, Erik Johnson, Loretta Auvil, Umesh Thakkar, Wen-mei W. Hwu, David Blair Kirk, Thom H. Dunning, Sharon C. Glotzer: Visualization and Analysis of GPU Summer School Applicants and Participants. eScience 2008: 362-363 | |
| c88 | Isaac Gelado, John H. Kelm, Shane Ryoo, Steven S. Lumetta, Nacho Navarro, Wen-mei W. Hwu: CUBA: an architecture for efficient CPU/co-processor data communication. ICS 2008: 299-308 | |
| c87 | Sain-Zee Ueng, Melvin Lathara, Sara S. Baghsorkhi, Wen-mei W. Hwu: CUDA-Lite: Reducing GPU Programming Complexity. LCPC 2008: 1-15 | |
| c86 | John A. Stratton, Sam S. Stone, Wen-mei W. Hwu: MCUDA: An Efficient Implementation of CUDA Kernels for Multi-core CPUs. LCPC 2008: 16-30 | |
| c85 | Shane Ryoo, Christopher I. Rodrigues, Sara S. Baghsorkhi, Sam S. Stone, David Blair Kirk, Wen-mei W. Hwu: Optimization principles and application performance evaluation of a multithreaded GPU using CUDA. PPOPP 2008: 73-82 | |
| c84 | Alexandros Papakonstantinou, Deming Chen, Wen-mei W. Hwu: Application Acceleration with the Explicitly Parallel Operations System - the EPOS Processor. SASP 2008: 20-25 | |
| 2007 | ||
| j36 | Ravishankar K. Iyer, Zbigniew Kalbarczyk, Karthik Pattabiraman, William Healey, Wen-mei W. Hwu, Peter Klemperer, Reza Farivar: Toward Application-Aware Security and Reliability. IEEE Security & Privacy 5(1): 57-62 (2007) | |
| j35 | Shane Ryoo, Sain-Zee Ueng, Christopher I. Rodrigues, Robert E. Kidd, Matthew I. Frank, Wen-mei W. Hwu: Automatic Discovery of Coarse-Grained Parallelism in Media Applications. T. HiPEAC 1: 194-213 (2007) | |
| c83 | John H. Kelm, Isaac Gelado, Mark J. Murphy, Nacho Navarro, Steven S. Lumetta, Wen-mei W. Hwu: CIGAR: Application Partitioning for a CPU/Coprocessor Architecture. PACT 2007: 317-326 | |
| c82 | Lauren Sarno, Wen-mei W. Hwu, Craig Lund, Markus Levy, James R. Larus, James Reinders, Gordon Cameron, Chris Lennard, Takashi Yoshimori: Corezilla: Build and Tame the Multicore Beast? DAC 2007: 632-633 | |
| c81 | Wen-mei W. Hwu, Shane Ryoo, Sain-Zee Ueng, John H. Kelm, Isaac Gelado, Sam S. Stone, Robert E. Kidd, Sara S. Baghsorkhi, Aqeel Mahesri, Stephanie C. Tsao, Nacho Navarro, Steven S. Lumetta, Matthew I. Frank, Sanjay J. Patel: Implicitly Parallel Programming Models for Thousand-Core Microprocessors. DAC 2007: 754-759 | |
| c80 | Shane Ryoo, Christopher I. Rodrigues, Wen-mei W. Hwu: Iteration Disambiguation for Parallelism Identification in Time-Sliced Applications. LCPC 2007: 110-124 | |
| 2006 | ||
| j34 | Ronald D. Barnes, Shane Ryoo, Wen-mei W. Hwu: Tolerating Cache-Miss Latency with Multipass Pipelines. IEEE Micro 26(1): 40-47 (2006) | |
| j33 | Ronald D. Barnes, John W. Sias, Erik M. Nystrom, Sanjay J. Patel, Jose (Nacho) Navarro, Wen-mei W. Hwu: Beating In-Order Stalls with "Flea-Flicker" Two-Pass Pipelining. IEEE Trans. Computers 55(1): 18-33 (2006) | |
| 2005 | ||
| j32 | Wen-mei W. Hwu, Krishna V. Palem: Guest Editors' Introduction. IEEE Trans. Computers 54(10): 1185-1187 (2005) | |
| c79 | Wen-mei W. Hwu, Sanjay J. Patel: The Future of Computer Architecture Research: An Industrial Perspective. HPCA 2005: 264 | |
| c78 | Ronald D. Barnes, Shane Ryoo, Wen-mei W. Hwu: "Flea-flicker" Multipass Pipelining: An Alternative to the High-Power Out-of-Order Offense. MICRO 2005: 319-330 | |
| e1 | Thomas M. Conte, Nacho Navarro, Wen-mei W. Hwu, Mateo Valero, Theo Ungerer (Eds.): High Performance Embedded Architectures and Compilers, First International Conference, HiPEAC 2005, Barcelona, Spain, November 17-18, 2005, Proceedings. Lecture Notes in Computer Science 3793, Springer 2005, isbn 3-540-30317-0 | |
| 2004 | ||
| c77 | John W. Sias, Sain-Zee Ueng, Geoff A. Kent, Ian M. Steiner, Erik M. Nystrom, Wen-mei W. Hwu: Field-testing IMPACT EPIC research results in Itanium 2. ISCA 2004: 26-39 | |
| c76 | Lakshmi N. Chakrapani, John C. Gyllenhaal, Wen-mei W. Hwu, Scott A. Mahlke, Krishna V. Palem, Rodric M. Rabbah: Trimaran: An Infrastructure for Research in Instruction-Level Parallelism. LCPC 2004: 32-41 | |
| c75 | Erik M. Nystrom, Hong-Seok Kim, Wen-mei W. Hwu: Importance of heap specialization in pointer analysis. PASTE 2004: 43-48 | |
| c74 | Erik M. Nystrom, Hong-Seok Kim, Wen-mei W. Hwu: Bottom-Up and Top-Down Context-Sensitive Summary-Based Pointer Analysis. SAS 2004: 165-180 | |
| 2003 | ||
| j31 | Jeffrey P. Monks, Jean-Pierre Ebert, Wen-mei W. Hwu, Adam Wolisz: Energy saving and capacity improvement potential of power control in multi-hop wireless networks. Computer Networks 41(3): 313-330 (2003) | |
| c73 | Ronald D. Barnes, Erik M. Nystrom, John W. Sias, Sanjay J. Patel, Nacho Navarro, Wen-mei W. Hwu: Beating in-order stalls with "flea-flicker" two-pass pipelining. MICRO 2003: 387-398 | |
| 2002 | ||
| c72 | Hillery C. Hunter, Wen-mei W. Hwu: Code coverage and input variability: effects on architecture and compiler research. CASES 2002: 79-87 | |
| c71 | Ronald D. Barnes, Erik M. Nystrom, Matthew C. Merten, Wen-mei W. Hwu: Vacuum packing: extracting hardware-detected program phases for post-link optimization. MICRO 2002: 233-244 | |
| 2001 | ||
| j30 | Matthew C. Merten, Andrew R. Trick, Ronald D. Barnes, Erik M. Nystrom, Christopher N. George, John C. Gyllenhaal, Wen-mei W. Hwu: An Architectural Framework for Runtime Optimization. IEEE Trans. Computers 50(6): 567-589 (2001) | |
| c70 | Erik M. Nystrom, Ronald D. Barnes, Matthew C. Merten, Wen-mei W. Hwu: Code Reordering and Speculation Support for Dynamic Optimization System. IEEE PACT 2001: 163-174 | |
| c69 | Jeffrey P. Monks, Vaduvur Bharghavan, Wen-mei W. Hwu: A Power Controlled Multiple Access Protocol for Wireless Packet Networks. INFOCOM 2001: 219-228 | |
| c68 | Jeffrey P. Monks, Jean-Pierre Ebert, Adam Wolisz, Wen-mei W. Hwu: A Study of the Energy Saving and Capacity Improvement Potential of Power Control in Multi-Hop Wireless Networks. LCN 2001: 550-559 | |
| c67 | ||
| c66 | John W. Sias, Hillery C. Hunter, Wen-mei W. Hwu: Enhancing loop buffering of media and telecommunications applications using low-overhead predication. MICRO 2001: 262-273 | |
| 2000 | ||
| c65 | Daniel A. Connors, Hillery C. Hunter, Ben-Chung Cheng, Wen-mei W. Hwu: Hardware Support for Dynamic Management of Compiler-Directed Computation Reuse. ASPLOS 2000: 222-233 | |
| c64 | Matthew C. Merten, Andrew R. Trick, Erik M. Nystrom, Ronald D. Barnes, Wen-mei W. Hwu: A hardware mechanism for dynamic extraction and relayout of program hot spots. ISCA 2000: 59-70 | |
| c63 | Jeffrey P. Monks, Vaduvur Bharghavan, Wen-mei W. Hwu: Transmission Power Control for Multiple Access Wireless Packet Networks. LCN 2000: 12-21 | |
| c62 | John W. Sias, Wen-mei W. Hwu, David I. August: Accurate and efficient predicate analysis with binary decision diagrams. MICRO 2000: 112-123 | |
| c61 | Ben-Chung Cheng, Wen-mei W. Hwu: Modular interprocedural pointer analysis using access paths: design, implementation, and evaluation. PLDI 2000: 57-69 | |
| 1999 | ||
| j29 | Thomas M. Conte, Wen-mei W. Hwu, Mark Smotherman: Editor's Introduction. International Journal of Parallel Programming 27(5): 325-326 (1999) | |
| j28 | David I. August, Wen-mei W. Hwu, Scott A. Mahlke: The Partial Reverse If-Conversion Framework for Balancing Control Flow and Predication. International Journal of Parallel Programming 27(5): 381-423 (1999) | |
| j27 | Thomas M. Conte, Wen-mei W. Hwu, Mark Smotherman: Editors' Introduction. International Journal of Parallel Programming 27(6): 425-426 (1999) | |
| j26 | Teresa L. Johnson, Daniel A. Connors, Matthew C. Merten, Wen-mei W. Hwu: Run-Time Cache Bypassing. IEEE Trans. Computers 48(12): 1338-1354 (1999) | |
| c60 | Daniel A. Connors, Jean-Michel Puiatti, David I. August, Kevin M. Crozier, Wen-mei W. Hwu: An Architecture Framework for Introducing Predicated Execution into Embedded Microprocessors. Euro-Par 1999: 1301-1311 | |
| c59 | Matthew C. Merten, Andrew R. Trick, Christopher N. George, John C. Gyllenhaal, Wen-mei W. Hwu: A Hardware-Driven Profiling Scheme for Identifying Program Hot Spots to Support Runtime Optimization. ISCA 1999: 136-147 | |
| c58 | David I. August, John W. Sias, Jean-Michel Puiatti, Scott A. Mahlke, Daniel A. Connors, Kevin M. Crozier, Wen-mei W. Hwu: The Program Decision Logic Approach to Predicated Execution. ISCA 1999: 208-219 | |
| c57 | Ben-Chung Cheng, Wen-mei W. Hwu: An Empirical Study of Function Pointers Using SPEC Benchmarks. LCPC 1999: 490-493 | |
| c56 | Daniel A. Connors, Wen-mei W. Hwu: Compiler-Directed Dynamic Computation Reuse: Rationale and Initial Results. MICRO 1999: 158-169 | |
| c55 | Le-Chun Wu, Rajiv Mirani, Harish Patil, Bruce Olsen, Wen-mei W. Hwu: A New Framework for Debugging Globally Optimized Code. PLDI 1999: 181-191 | |
| 1998 | ||
| j25 | ||
| j24 | Steve Beaty, Wen-mei W. Hwu: Foreword to the Special Issue. International Journal of Parallel Programming 26(4): 345-347 (1998) | |
| j23 | John C. Gyllenhaal, Wen-mei W. Hwu, B. Ramakrishna Rau: Optimization of Machine Descriptions for Efficient Use. International Journal of Parallel Programming 26(4): 417-447 (1998) | |
| j22 | Thomas M. Conte, Mary Ann Hirsch, Wen-mei W. Hwu: Combining Trace Sampling with Single Pass Methods for Efficient Cache Simulation. IEEE Trans. Computers 47(6): 714-720 (1998) | |
| c54 | Brian L. Deitrich, Ben-Chung Cheng, Wen-mei W. Hwu: Improving Static Branch Prediction in a Compiler. IEEE PACT 1998: 214-221 | |
| c53 | Wen-mei W. Hwu, Yale N. Patt: Retrospective: HPSm, a High Performance Restricted Data Flow Architecture Having Minimal Functionality. 25 Years ISCA: Retrospectives and Reprints 1998: 43-44 | |
| c52 | Wen-mei W. Hwu: Retrospective: IMPACT: An Architectural Framework for Multiple-Instruction Issue. 25 Years ISCA: Retrospectives and Reprints 1998: 77-79 | |
| c51 | David I. August, Daniel A. Connors, Scott A. Mahlke, John W. Sias, Kevin M. Crozier, Ben-Chung Cheng, Patrick R. Eaton, Qudus B. Olaniran, Wen-mei W. Hwu: Integrated Predicated and Speculative Execution in the IMPACT EPIC Architecture. ISCA 1998: 227-237 | |
| c50 | Wen-mei W. Hwu, Yale N. Patt: HPSm, a High Performance Restricted Data Flow Architecture Having Minimal Functionality. 25 Years ISCA: Retrospectives and Reprints 1998: 300-308 | |
| c49 | Pohua P. Chang, Scott A. Mahlke, William Y. Chen, Nancy J. Warter, Wen-mei W. Hwu: IMPACT: An Architectural Framework for Multiple-Instruction-Issue Processors. 25 Years ISCA: Retrospectives and Reprints 1998: 408-417 | |
| c48 | Ben-Chung Cheng, Daniel A. Connors, Wen-mei W. Hwu: Compiler-Directed Early Load-Address Generation. MICRO 1998: 138-147 | |
| 1997 | ||
| j21 | Cheng-Hsueh A. Hsieh, Marie T. Conte, Teresa L. Johnson, John C. Gyllenhaal, Wen-mei W. Hwu: Optimizing NET Compilers for Improved Java Performance. IEEE Computer 30(6): 67-75 (1997) | |
| j20 | Richard E. Hank, Wen-mei W. Hwu, B. Ramakrishna Rau: Region-based compilation: Introduction, motivation, and initial experience. International Journal of Parallel Programming 25(2): 113-146 (1997) | |
| c47 | David I. August, Daniel A. Connors, John C. Gyllenhaal, Wen-mei W. Hwu: Architectural Support for Compiler-Synthesized Dynamic Branch Prediction Strategies: Rationale and Initial Results. HPCA 1997: 84-93 | |
| c46 | Teresa L. Johnson, Wen-mei W. Hwu: Run-Time Adaptive Cache Hierarchy Management via Reference Analysis. ISCA 1997: 315-326 | |
| c45 | Teresa L. Johnson, Matthew C. Merten, Wen-mei W. Hwu: Run-Time Spatial Locality Detection and Optimization. MICRO 1997: 57-64 | |
| c44 | David I. August, Wen-mei W. Hwu, Scott A. Mahlke: A Framework for Balancing Control Flow and Predication. MICRO 1997: 92-103 | |
| 1996 | ||
| c43 | Brian L. Deitrich, Wen-mei W. Hwu: Speculative Hedge: Regulating Compile-time Speculation Against Profile Variations. MICRO 1996: 70-79 | |
| c42 | Cheng-Hsueh A. Hsieh, John C. Gyllenhaal, Wen-mei W. Hwu: Java Bytecode to Native Code Translation: The Caffeine Prototype and Preliminary Results. MICRO 1996: 90-99 | |
| c41 | Daniel M. Lavery, Wen-mei W. Hwu: Modulo Scheduling of Loops in Control-intensive Non-numeric Programs. MICRO 1996: 126-137 | |
| c40 | John C. Gyllenhaal, Wen-mei W. Hwu, B. Ramakrishna Rau: Optimization of Machine Descriptions for Efficient Use. MICRO 1996: 349-358 | |
| 1995 | ||
| j19 | Thomas M. Conte, Wen-mei W. Hwu: Advances in Benchmarking Techniques: New Standards and Quantitative Metrics. Advances in Computers 41: 231-253 (1995) | |
| j18 | Chung-Chi Jim Li, Shyh-Kwei Chen, W. Kent Fuchs, Wen-mei W. Hwu: Compiler-Based Multiple Instruction Retry. IEEE Trans. Computers 44(1): 35-46 (1995) | |
| j17 | Pohua P. Chang, Daniel M. Lavery, Scott A. Mahlke, William Y. Chen, Wen-mei W. Hwu: The Importance of Prepass Code Scheduling for Superscalar and Superpipelined Processors. IEEE Trans. Computers 44(3): 353-370 (1995) | |
| j16 | Pohua P. Chang, Nancy J. Warter, Scott A. Mahlke, William Y. Chen, Wen-mei W. Hwu: Three Architecutral Models for Compiler-Controlled Speculative Execution. IEEE Trans. Computers 44(4): 481-494 (1995) | |
| j15 | Neal J. Alewine, Shyh-Kwei Chen, W. Kent Fuchs, Wen-mei W. Hwu: Compiler-Assisted Multiple Instruction Rollback Recovery Using a Read Buffer. IEEE Trans. Computers 44(9): 1096-1107 (1995) | |
| c39 | Roger A. Bringmann, Scott A. Mahlke, Wen-mei W. Hwu: A study of the effects of compiler-controlled speculation on instruction and data caches. HICSS (1) 1995: 211-220 | |
| c38 | Scott A. Mahlke, Richard E. Hank, James E. McCormick, David I. August, Wen-mei W. Hwu: A Comparison of Full and Partial Predicated Execution Support for ILP Processors. ISCA 1995: 138-150 | |
| c37 | Richard E. Hank, Wen-mei W. Hwu, B. Ramakrishna Rau: Region-based compilation: an introduction and motivation. MICRO 1995: 158-168 | |
| c36 | Daniel M. Lavery, Wen-mei W. Hwu: Unrolling-based optimizations for modulo scheduling. MICRO 1995: 327-337 | |
| 1994 | ||
| j14 | William Y. Chen, Scott A. Mahlke, Nancy J. Warter, Sadun Anik, Wen-mei W. Hwu: Profile-assisted instruction scheduling. International Journal of Parallel Programming 22(2): 151-181 (1994) | |
| j13 | Wen-mei W. Hwu, Alex Nicolau: From the guest editors. International Journal of Parallel Programming 22(3): 207-208 (1994) | |
| j12 | Sadun Anik, Wen-mei W. Hwu: Performance Implications of Synchronization Support for Parallel Fortran Programs. J. Parallel Distrib. Comput. 22(2): 202-215 (1994) | |
| j11 | Shyh-Kwei Chen, Neal J. Alewine, W. Kent Fuchs, Wen-mei W. Hwu: Incremental Compiler Transformations for Multiple Instruction Retry. Softw., Pract. Exper. 24(12): 1179-1198 (1994) | |
| j10 | Wen-mei W. Hwu, Thomas M. Conte: The Susceptibility of Programs to Context Switching. IEEE Trans. Computers 43(9): 994-1003 (1994) | |
| c35 | David M. Gallagher, William Y. Chen, Scott A. Mahlke, John C. Gyllenhaal, Wen-mei W. Hwu: Dynamic Memory Disambiguation Using the Memory Conflict Buffer. ASPLOS 1994: 183-193 | |
| c34 | Shyh-Kwei Chen, W. Kent Fuchs, Wen-mei W. Hwu: An Analytical Approach to Scheduling Code for Superscalar and VLIW Architectures. ICPP (1) 1994: 285-292 | |
| c33 | Yoji Yamada, John Gyllenhall, Grant E. Haab, Wen-mei W. Hwu: Data relocation and prefetching for programs with large data sets. MICRO 1994: 118-127 | |
| c32 | Scott A. Mahlke, Richard E. Hank, Roger A. Bringmann, John C. Gyllenhaal, David M. Gallagher, Wen-mei W. Hwu: Characterizing the impact of predicated execution on branch prediction. MICRO 1994: 217-227 | |
| 1993 | ||
| j9 | Aloke Gupta, Wen-mei W. Hwu: An execution Profiler for Window-oriented Applications. Softw., Pract. Exper. 23(5): 487-510 (1993) | |
| j8 | William Y. Chen, Pohua P. Chang, Thomas M. Conte, Wen-mei W. Hwu: The Effect of Code Expanding Optimizations on Instruction Cache Design. IEEE Trans. Computers 42(9): 1045-1057 (1993) | |
| j7 | Wen-mei W. Hwu, Scott A. Mahlke, William Y. Chen, Pohua P. Chang, Nancy J. Warter, Roger A. Bringmann, Roland G. Ouellette, Richard E. Hank, Tokuzo Kiyohara, Grant E. Haab, John G. Holm, Daniel M. Lavery: The superblock: An effective technique for VLIW and superscalar compilation. The Journal of Supercomputing 7(1-2): 229-248 (1993) | |
| j6 | Scott A. Mahlke, William Y. Chen, Roger A. Bringmann, Richard E. Hank, Wen-mei W. Hwu, B. Ramakrishna Rau, Michael S. Schlansker: Sentinel Scheduling for VLIW and Superscalar Processors. ACM Trans. Comput. Syst. 11(4): 376-408 (1993) | |
| c31 | W. Kent Fuchs, Wen-mei W. Hwu, Neal J. Alewine: Application of Compiler-Assisted Rollback Recovery to Speculative Execution Repair. Hardware and Software Architectures for Fault Tolerance 1993: 45-65 | |
| c30 | Tokuzo Kiyohara, Scott A. Mahlke, William Y. Chen, Roger A. Bringmann, Richard E. Hank, Sadun Anik, Wen-mei W. Hwu: Register Connection: A New Approach to Adding Registers into Instruction Set Architectures. ISCA 1993: 247-256 | |
| c29 | Roger A. Bringmann, Scott A. Mahlke, Richard E. Hank, John C. Gyllenhaal, Wen-mei W. Hwu: Speculative execution exception recovery using write-back suppression. MICRO 1993: 214-223 | |
| c28 | Richard E. Hank, Scott A. Mahlke, Roger A. Bringmann, John C. Gyllenhaal, Wen-mei W. Hwu: Superblock formation using static program analysis. MICRO 1993: 247-255 | |
| c27 | Nancy J. Warter, Scott A. Mahlke, Wen-mei W. Hwu, B. Ramakrishna Rau: Reverse If-Conversion. PLDI 1993: 290-299 | |
| 1992 | ||
| j5 | Pohua P. Chang, Scott A. Mahlke, William Y. Chen, Wen-mei W. Hwu: Profile-guided Automatic Inline Expansion for C Programs. Softw., Pract. Exper. 22(5): 349-369 (1992) | |
| j4 | Wen-mei W. Hwu, Pohua P. Chang: Efficient Instruction Sequencing with Inline Target Insertion. IEEE Trans. Computers 41(12): 1537-1551 (1992) | |
| c26 | Scott A. Mahlke, William Y. Chen, Wen-mei W. Hwu, B. Ramakrishna Rau, Michael S. Schlansker: Sentinel Scheduling for VLIW and Superscalar Processors. ASPLOS 1992: 238-247 | |
| c25 | Neal J. Alewine, Shyh-Kwei Chen, Chung-Chi Jim Li, W. Kent Fuchs, Wen-mei W. Hwu: Branch Recovery with Compiler-Assisted Multiple Instruction Retry. FTCS 1992: 66-73 | |
| c24 | William Y. Chen, Scott A. Mahlke, Wen-mei W. Hwu: Tolerating First Level Memory Access Latency in High-Performance Systems. ICPP (1) 1992: 36-43 | |
| c23 | Sadun Anik, Wen-mei W. Hwu: Executing Nested Parallel Loops on Shared-Memory Multiprocessors. ICPP (3) 1992: 241-244 | |
| c22 | William Y. Chen, Scott A. Mahlke, Wen-mei W. Hwu, Tokuzo Kiyohara, Pohua P. Chang: Tolerating data access latency with register preloading. ICS 1992: 104-113 | |
| c21 | William Y. Chen, Roger A. Bringmann, Scott A. Mahlke, Sadun Anik, Tokuzo Kiyohara, Nancy J. Warter, Daniel M. Lavery, Wen-mei W. Hwu, Richard E. Hank, John C. Gyllenhaal: Using Profile Information to Assist Advaced Compiler Optimization and Scheduling. LCPC 1992: 31-48 | |
| c20 | Scott A. Mahlke, William Y. Chen, John C. Gyllenhaal, Wen-mei W. Hwu: Compiler Code Transformations for Superscalar-Based High Performance Systems. SC 1992: 808-817 | |
| c19 | Aloke Gupta, Wen-mei W. Hwu: Xprof: Profiling the Execution of X Window Programs. SIGMETRICS 1992: 253-254 | |
| 1991 | ||
| j3 | ||
| j2 | Pohua P. Chang, Scott A. Mahlke, Wen-mei W. Hwu: Using Profile Information to Assist Classic Code Optimizations. Softw., Pract. Exper. 21(12): 1301-1321 (1991) | |
| c18 | Scott A. Mahlke, Nancy J. Warter, William Y. Chen, Pohua P. Chang, Wen-mei W. Hwu: The Effect of Compiler Optimizations on Available Parallelism in Scalar Programs. ICPP (2) 1991: 142-145 | |
| c17 | Pohua P. Chang, Scott A. Mahlke, William Y. Chen, Nancy J. Warter, Wen-mei W. Hwu: IMPACT: An Architectural Framework for Multiple-Instruction-Issue Processors. ISCA 1991: 266-275 | |
| c16 | Pohua P. Chang, William Y. Chen, Scott A. Mahlke, Wen-mei W. Hwu: Comparing Static and Dynamic Code Scheduling for Multiple-Instruction-Issue Processors. MICRO 1991: 25-33 | |
| c15 | William Y. Chen, Scott A. Mahlke, Pohua P. Chang, Wen-mei W. Hwu: Data Access Microarchitectures for Superscalar Processors with Compiler-Assisted Data Prefetching. MICRO 1991: 69-73 | |
| 1990 | ||
| c14 | Nancy J. Warter, Wen-mei W. Hwu: A software based approach to achieving optimal performance for signature control flow checking. FTCS 1990: 442-449 | |
| 1989 | ||
| c13 | Pohua P. Chang, Wen-mei W. Hwu: Control flow optimization for supercomputer scalar processing. ICS 1989: 145-153 | |
| c12 | Wen-mei W. Hwu, Thomas M. Conte, Pohua P. Chang: Comparing Software and Hardware Schemes For Reducing the Cost of Branches. ISCA 1989: 224-233 | |
| c11 | Wen-mei W. Hwu, Pohua P. Chang: Achieving High Instruction Cache Performance with an Optimizing Compiler. ISCA 1989: 242-251 | |
| c10 | P.-H. Chang, Wen-mei W. Hwu: Forward semantic: a compiler-assisted instruction fetch method for heavily pipelined processors. MICRO 1989: 188-198 | |
| c9 | Wen-mei W. Hwu, Pohua P. Chang: Inline Function Expansion for Compiling C Programs. PLDI 1989: 246-257 | |
| c8 | Wen-mei W. Hwu, Thomas M. Conte: A Simulation Study of Simultaneous Vector Prefetch Performance in Multiprocessor Memory Subsystems (Extended Abstract). SIGMETRICS 1989: 227 | |
| 1988 | ||
| c7 | Wen-mei W. Hwu, Pohua P. Chang: Exploiting Parallel Microprocessor Microarchitectures With a Compiler Code Generator. ISCA 1988: 45-53 | |
| c6 | Pohua P. Chang, Wen-mei W. Hwu: Trace selection for compiling large C application programs to microcode. MICRO 1988: 21-29 | |
| 1987 | ||
| j1 | Wen-mei W. Hwu, Yale N. Patt: Checkpoint Repair for High-Performance Out-of-Order Execution Machines. IEEE Trans. Computers 36(12): 1496-1514 (1987) | |
| c5 | Wen-mei W. Hwu, Yale N. Patt: Checkpoint Repair for Out-of-order Execution Machines. ISCA 1987: 18-26 | |
| c4 | Wen-mei W. Hwu, Yale N. Patt: Exploiting horizontal and vertical concurrency via the HPSm microprocessor. MICRO 1987: 154-161 | |
| c3 | James E. Wilson, Stephen W. Melvin, Michael Shebanow, Wen-mei W. Hwu, Yale N. Patt: On tuning the microarchitecture of an HPS implementation of the VAX. MICRO 1987: 162-167 | |
| 1986 | ||
| c2 | Yale N. Patt, Wen-mei W. Hwu, Stephen W. Melvin, Michael Shebanow, Chein Chen, Jiajuin Wei: Experiments with HPS, a Restricted Data Flow Microarchitecture for High Performance Computers. COMPCON 1986: 254-258 | |
| c1 | Wen-mei W. Hwu, Yale N. Patt: HPSm, a High Performance Restricted Data Flow Architecture Having Minimal Functionality. ISCA 1986: 297-306 | |
Colors in the list of coauthors
Last update Sat May 25 12:59:14 2013 CET by the DBLP Team —
Data released under the ODC-BY 1.0 license — See also our legal information page