Other Microarchitecture Publications
- Aniket Deshmukh, Chester Cai, and Yale Patt,
"Timely Efficient and Accurate Branch Precomputation,"
(slides)
The 57th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), November 2024.
- Aniket Deshmukh, Chester Cai, and Yale Patt,
"Alternate Path Fetch,"
The 51th Annual IEEE/ACM International Symposium on Computer Architecture (ISCA), June 2024.
- Stephen Pruett and Yale Patt,
"Branch Runahead: An Alternative to Branch Prediction for Impossible to Predict Branches,"
The 54th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), October 2021.
- Aniket Deshmukh and Yale Patt,
"Criticality Driven Fetch,"
The 54th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), October 2021.
- Siavash Zangeneh, Stephen Pruett, Sangkug Lym, and Yale Patt,
"BranchNet: A Convolutional Neural Network to Predict Hard-To-Predict Branches,"
The 53rd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), October 2020.
- Ben (Ching-Pei) Lin, Michael B. Healy, Rustam Miftakhutdinov, Philip G. Emma, and Yale Patt,
"Duplicon Cache: Mitigating Off-Chip Memory Bank and Bank Group Conflicts via Data Duplication,"
The 51st Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), October 2018.
- Milad Hashemi, Onur Mutlu, and Yale N. Patt,
"Continuous Runhaead: Transparent Hardware Acceleration for Memory Intensive Workloads,"
The 49th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), October 2016.
- Milad Hashemi, Khubaib, Eiman Ebrahimi, Onur Mutlu, and Yale N. Patt,
"Accelerating Dependent Cache Misses with an Enhanced Memory Controller,"
The 43rd ACM/IEEE International Symposium on Computer Architecture (ISCA) , June 2016.
- Milad Hashemi, and Yale N. Patt,
"Filtered Runahead Execution with a Runahead Buffer,"
The 48th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), December 2015.
- Milad Hashemi, Debbie Marr, Doug Carmean, and Yale N. Patt,
"Efficient Execution of Bursty Applications,"
IEEE Computer Architecture Letters (CAL), July 2015.
- Khubaib, M. Aater Suleman, Milad Hashemi, Chris Wilkerson, Yale N. Patt,
"MorphCore: An Energy-Efficient Microarchitecture for High Performance ILP and High Throughput TLP,"
Proceedings of the 45th Annual ACM/IEEE International Symposium on Microarchitecture (MICRO),
Vancouver, December 2012.
- Rustam Miftakhutdinov, Eiman Ebrahimi, Yale N. Patt,
"Predicting Performance Impact of DVFS for Realistic Memory Systems,"
Proceedings of the 45th Annual ACM/IEEE International Symposium on Microarchitecture (MICRO),
Vancouver, December 2012.
- Eiman Ebrahimi, Chang Joo Lee, Onur Mutlu, and Yale N. Patt,
"Prefetch-Aware Shared-Resource Management for Multi-Core Systems,"
Proceedings of the 38th International Symposium on Computer Architecture (ISCA),
San Jose, June 2011.
- M. Aater Suleman, Moinuddin K. Qureshi, Khubaib, and Yale N. Patt,
"Feedback-Directed Pipelined Parallelism"
Proceedings of the 19th International Conference on Parallel Architecture and Compilation Techniques (PACT),
Vienna, Austria, September 2010.
- M. Aater Suleman, Onur Mutlu, José A. Joao, Khubaib, and Yale N. Patt,
"Data Marshaling for Multi-core Architectures"
Proceedings of the 37th International Symposium on Computer Architecture (ISCA),
Saint-Malo, France, June 2010.
- Eiman Ebrahimi, Chang Joo Lee, Onur Mutlu, and Yale N. Patt,
"Fairness via Source Throttling: A Configurable and High-Performance Fairness Substrate for Multi-Core Memory Systems"
Proceedings of the 15th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS),
Pittsburgh, March 2010.
- Chang Joo Lee, Veynu Narasiman, Onur Mutlu, and Yale N. Patt,
"Improving Memory Bank-Level Parallelism in the Presence of Prefetching"
Proceedings of the 42nd International Symposium on Microarchitecture (MICRO),
New York, December 2009.
- Eiman Ebrahimi, Onur Mutlu, Chang Joo Lee, and Yale N. Patt,
"Coordinated Control of Multiple Prefetchers in Multi-Core Systems"
Proceedings of the 42nd International Symposium on Microarchitecture (MICRO),
New York, December 2009.
- José A. Joao, Onur Mutlu, and Yale N. Patt,
"Flexible Reference-Counting-Based Hardware Acceleration for Garbage Collection,"
Proceedings of the 36th Annual International Symposium on Computer Architecture (ISCA),
Austin, TX, June 2009.
- M. Aater Suleman, Onur Mutlu, Moinuddin K. Qureshi, and Yale N. Patt,
"Accelerating Critical Section Execution with Asymmetric Multi-Core Architectures"
Proceedings of the 14th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS),
Washington, DC, March 2009.
- Eiman Ebrahimi, Onur Mutlu, and Yale N. Patt,
"Techniques for Bandwidth-Efficient Prefetching of Linked Data Structures in Hybrid Prefetching Systems"
Proceedings of the 15th International Symposium on High-Performance Computer Architecture (HPCA),
Raleigh, NC, February 2009.
- Chang Joo Lee, Onur Mutlu, Veynu Narasiman, and Yale N. Patt,
"Prefetch-Aware DRAM Controllers"
Proceedings of the 41st International Symposium on Microarchitecture (MICRO),
Lake Como, Italy, September 2008.
- Chang Joo Lee, Hyesoon Kim, Onur Mutlu, and Yale N. Patt,
"Performance-Aware Speculation Control using Wrong Path Usefulness Prediction"
Proceedings of the 14th International Symposium on High-Performance Computer Architecture (HPCA),
Salt Lake City, UT, February 2008.
- Onur Mutlu, Hyesoon Kim, and Yale N. Patt,
"Efficient Runahead Execution: Power-Efficient Memory Latency Tolerance"
IEEE Micro, Special Issue: Micro's Top Picks from Microarchitecture Conferences (MICRO TOP PICKS), Vol. 26, No. 1, pages 10-20, January/February 2006.
- Onur Mutlu, Hyesoon Kim, David N. Armstrong, and Yale N. Patt,
"An Analysis of the Performance Impact of Wrong-Path Memory References on Out-of-Order and Runahead Execution Processors,"
IEEE Transactions on Computers (TC), Vol. 54, No. 12, pages 1556-1571, December 2005.
- Onur Mutlu, Hyesoon Kim, and Yale N. Patt,
"Address-Value Delta (AVD) Prediction: Increasing the Effectiveness of Runahead Execution by Exploiting Regular Memory Allocation Patterns,"
Proceedings of the 38th ACM/IEEE International Symposium on Microarchitecture,
Barcelona, Spain, November 2005.
- Onur Mutlu, Hyesoon Kim, David N. Armstrong, and Yale N. Patt,
"Using the First-Level Caches as Filters to Reduce the Pollution Caused by Speculative Memory References,"
International Journal of Parallel Programming (IJPP), Vol. 33, No. 5, pages 529-559, October 2005.
- Moinuddin K. Qureshi, Onur Mutlu, and Yale N. Patt,
"Microarchitecture-Based Introspection: A Technique for Transient-Fault Tolerance in Microprocessors,"
Proceedings of the International Conference on Dependable Systems and Networks, Yokohama, Japan, June 2005.
- Onur Mutlu, Hyesoon Kim, and Yale N. Patt,
"Techniques for Efficient Processing in Runahead Execution Engines,"
Proceedings of the 32nd International Symposium on Computer Architecture, Madison, WI, June 2005.
- Moinuddin K. Qureshi, David Thompson, and Yale N. Patt,
"The V-Way Cache: Demand Based Associativity via Global Replacement,"
Proceedings of the 32nd International Symposium on Computer Architecture, Madison, WI, June 2005.
- Onur Mutlu, Hyesoon Kim, Jared Stark, and Yale N. Patt,
"On Reusing the Results of Pre-Executed Instructions in a Runahead Execution Processor,"
Computer Architecture Letters, Vol. 4, January 2005.
- Onur Mutlu, Hyesoon Kim, David N. Armstrong, and Yale N. Patt,
"Cache Filtering Techniques to Reduce the Negative Impact of Useless Speculative Memory References on Processor Performance,"
Proceeedings of the 16th Symposium on Computer Architecture and High Performance Computing, Foz Do Iguacu, PR, Brazil, October 2004.
- Mary D. Brown and Yale N. Patt,
"Demand-Only Broadcast: Reducing Register File and Bypass Power in Clustered Execution Cores,"
Proceedings of the First Watson Conference on Interaction between Architecture, Circuits, and Compilers,
Yorktown Heights, NY, October 2004.
- Onur Mutlu, Hyesoon Kim, David N. Armstrong, and Yale N. Patt,
"Understanding The Effects of Wrong-path Memory References on Processor Performance,"
Proceedings of the 3rd Workshop on Memory Performance Issues, in conjunction with 31st International Symposium on Computer Architecture, Munchen, Germany, June 2004.
- Onur Mutlu, Jared Stark, Chris Wilkerson, and Yale N. Patt,
"Runahead Execution: An Effective Alternative to Large Instruction Windows,"
IEEE Micro, Special Issue: Micro's Top Picks from Microarchitecture Conferences,
Vol. 23, No. 6, pp. 20-25, November/December 2003.
- Paul Racunas and Yale N. Patt,
"Partitioned first-level cache design for clustered microarchitectures,"
Proceedings of the 17th Annual International Conference on Supercomputing,
San Francisco, CA, June 2003.
- Onur Mutlu, Jared Stark, Chris Wilkerson, and Yale N. Patt,
"Runahead Execution: An Alternative to Very Large Instruction Windows for Out-of-order Processors,"
Proceedings of the 9th International Symposium on High-Performance Computer Architecture,
Anaheim, CA, February 2003.
- Mary D. Brown and Yale N. Patt,
"Using Internal Redundant Representations and Limited Bypass to Support Pipelined Adders and Register Files,"
Proceedings of the 8th International Symposium on High-Performance Computer Architecture,
Cambridge, MA, February 2002.
- Mary D. Brown, Jared Stark, and Yale N. Patt,
"Select-Free Instruction Scheduling Logic,"
Proceedings of the 34th ACM/IEEE International Symposium on Microarchitecture,
Austin, Texas, December 2001.
- Jared Stark, Mary D. Brown, and Yale N. Patt,
"On Pipelining Dynamic Instruction Scheduling Logic,"
Proceedings of the 33rd ACM/IEEE International Symposium on Microarchitecture,
Monterey, California, December 2000.
- Eric Hao, Po-Yung Chang, Marius Evers, and Yale N. Patt,
"Increasing the Instruction Fetch Rate via Block-Structured Instruction Set Architectures,"
International Journal of Parallel Programming,
Vol. 26, No. 4, 1998
- Jared Stark, Paul B. Racunas, and Yale N. Patt,
"Reducing the Performance Impact of ICache Misses by Writing Instructions into the Reservation Stations Out-of-Order,"
Proceedings of the 30th ACM/IEEE International Symposium on Microarchitecture,
Research Triangle Park, North Carolina, November 1997.
- Yale N. Patt, Sanjay J. Patel, Marius Evers, Daniel H. Friendly, and Jared Stark,
"One Billion Transistors, One Uniprocessor, One Chip,"
IEEE Computer, September 1997.
- Eric Hao, Po-Yung Chang, Marius Evers, and Yale N. Patt,
"Increasing the Instruction Fetch Rate via Block-Structured Instruction Set Architectures,"
Proceedings of the 29th ACM/IEEE International Symposium on Microarchitecture,
Paris, France, November 1996.
- Yale N. Patt,
"The Microprocessor for Scientific Computing in the year 2000,"
IEEE Computational Science and Engineering,
Summer 1996.
- Yale Patt,
"First, Let's Get the Uniprocessor Right,"
MicroDesign Resources, Microprocessor Report,
August 1996.
- Stephen Melvin and Yale N. Patt,
Enhancing Instruction Scheduling With a Block-Structured ISA,
International Journal of Parallel Programming,
Vol. 23, No. 3, 1995.
- Eric Sprangle and Yale N. Patt,
"Facilitating Superscalar Processing via a Combined Static/Dynamic Register Renaming Scheme,"
Proceedings of the 27th International Symposium on Microarchitecture,
San Jose, California, November 1994.
- Michael Butler and Yale Patt,
"A Comparative Performance Evaluation of Various State maintenance Mechanisms,"
Proceedings of the 26th International Symposium and workshop on Microarchitecture,
Austin, TX, December 1993.
- Michael Butler and Yale Patt,
"An Investigation of the Performance of Various Dynamic Scheduling Techniques,"
Proceedings, 25th International Symposium and workshop on Microarchitecture,
Portland OR, November 1992.
- Tse-Yu Yeh and Yale Patt,
"A Comprehensive Instruction Fetch Mechanism for a Processor Supporting Speculative Execution,"
Proceedings, 25th International Symposium and workshop on Microarchitecture,
Portland OR, November 1992.
- Michael Butler, David Dyer, and Yale Patt,
"Toward the Specification of an ISA for High Performance Computing Engines -- Part I: The Hardware Perspective,"
Proceedings, 25th Hawaii International Conference on Systems Sciences,
Kauai, January 1992.
- Michael Butler and Yale Patt,
"The Effect of Real Data Cache Behavior on the Performance of a Microarchitecture that Supports Dynamic Scheduling,"
Proceedings, 24th International Symposium and workshop on Microarchitecture,
Albuquerque, November 1991.
- M. Butler, T-Y Yeh, Y. Patt, M. Alsup, H. Scales, and M. Shebanow,
"Single Instruction Stream Parallelism is Greater than Two,"
Proceedings of the 18th International Symposium on Computer Architecture,
May 1991.
- Stephen Melvin and Yale Patt,
"Exploiting Fine-Grained Parallelism through Combined Hardware and Software Techniques,"
Proceedings of the 18th International Symposium on Computer Architecture,
May 1991.
- Michael Butler and Yale Patt,
"An Area-efficient Register Alias Table for Implementing HPS,"
Proceedings of the International Conference on Parallel Processing,
August 1990.
- Yale Patt,
"Microarchitecture choices (implementation of the VAX),"
Proceedings of the 22nd annual international workshop on Microprogramming and Microarchitecture,
August 1989.
- Stephen W. Melvin and Yale Patt,
"Performance benefits of large execution atomic units in dynamically scheduled machines"
Proceedings of the 3rd international conference on Supercomputing,
June 1989.
- Ashok Singhal and Yale Patt,
"A high performance Prolog processor with multiple function units,"
Proceedings of the 16th International Symposium on Computer Architecture,
May 1989.
- John A. Swensen and Yale Patt,
"Hierarchical registers for scientific computers,"
Proceedings of the 2nd international conference on Supercomputing,
November 1988.
- Ashok Singhal and Yale Patt,
"Implementing a Prolog machine with multiple functional units,"
Proceedings of the 21st annual international workshop on Microprogramming and Microarchitecture,
November 1988.
- Stephen W. Melvin, Michael C. Shebanow, and Yale Patt,
"Hardware support for large atomic units in dynamically scheduled machines,"
Proceedings of the 21st annual international workshop on Microprogramming and Microarchitecture,
November 1988.
- Wen-mei W. Hwu and Yale Patt,
"Checkpoint repair for out-of-order execution machines,"
Proceedings of the 14th International Symposium on Computer Architecture,
June 1987.
- John Swensen and Yale Patt,
"Fast temporary storage for serial and parallel execution,"
Proceedings of the 14th International Symposium on Computer Architecture,
June 1987.
- Stephen W. Melvin and Yale Patt,
"A clarification of the dynamic/static interface,"
Proceedings of the 20th Annual Hawaii International Conference on System Sciences,
1987.
- Wen-mei W. Hwu and Yale Patt,
"Exploiting horizontal and vertical concurrency via the HPSm microprocessor,"
Proceedings of the 20th annual workshop on Microprogramming,
December 1987.
- James E. Wilson, Steve Melvin, Michael Shebanow, Wen-mei Hwu, and Yale Patt,
"On tuning the microarchitecture of an HPS implementation of the VAX,"
Proceedings of the 20th annual workshop on Microprogramming,
December 1987.
- Stephen W. Melvin and Yale Patt,
"SPAM: a microcode based tool for tracing operating system events,"
Proceedings of the 20th annual workshop on Microprogramming,
December 1987.
- Jeff Gee, Stephen W. Melvin and Yale Patt,
"The implementation of Prolog via VAX 8600 microcode,"
Proceedings of the 19th annual workshop on Microprogramming,
October 1986.
- Yale Patt, Stephen W. Melvin, Wen-mei Hwu, Michael C. Shebanow, Chien Chen, and Jiajuin Wei
"Run-time generation of HPS microinstructions from a VAX instruction stream,"
Proceedings of the 19th annual workshop on Microprogramming,
October 1986.
- Stephen W. Melvin and Yale Patt,
"A microcode-based environment for noninvasive performance analysis,"
Proceedings of the 19th annual workshop on Microprogramming,
October 1986.
- Wen-mei Hwu and Yale Patt,
"HPSm, a high performance restricted data flow architecture having minimal functionality,"
Proceedings of the 13th International Symposium on Computer Architecture,
June 1986.
- Barry Fagin, Yale Patt, Vason Srini, and Alvin Despain
"Compiling Prolog into microcode: a case study using the NCR/32-000,"
Proceedings of the 18th annual workshop on Microprogramming,
December 1985.
- Yale Patt, Wen-mei Hwu, and Michael Shebanow
"HPS, a new microarchitecture: rationale and introduction,"
Proceedings of the 18th annual workshop on Microprogramming,
December 1985.
- Yale Patt, Stephen W. Melvin, Wen-mei Hwu, and Michael Shebanow
"Critical issues regarding HPS, a high performance microarchitecture,"
Proceedings of the 18th annual workshop on Microprogramming,
December 1985.
- Yale Patt and John K. Ahlstrom
"Microcode and the protection of intellectual effort,"
Proceedings of the 18th annual workshop on Microprogramming,
December 1985.