IPSJ Online Transactions
Online ISSN : 1882-6660
ISSN-L : 1882-6660
Two-Step Physical Register Deallocation for Data Prefetching and Address Pre-Calculation
Akihiro YamamotoYusuke TanakaHideki AndoToshio Shimada
Author information
JOURNAL FREE ACCESS

2008 Volume 1 Pages 94-106

Details
Abstract
This paper proposes an instruction pre-execution scheme for a high performance processor, that reduces latency and early scheduling of loads. Our scheme exploits the difference between the amount of instruction-level parallelism available with an unlimited number of physical registers and that available with an actual number of physical registers. We introduce the two-step physical register deallocation scheme, which deallocates physical registers at the renaming stage as a first step, and eliminates pipeline stalls caused by a shortage of physical registers. Instructions wait for the final deallocation as a second step in the instruction window. While waiting, the scheme allows pre-execution of instructions, that enables prefetching of load data and early calculation of memory effective addresses. Our evaluation results show that our scheme improves the performance significantly, and achieves a 1.26 times speedup over a processor without a prefetcher. If combined with a stride prefetcher, it achieves a 1.18 times speedup over a processor with a stride prefetcher.
Content from these authors
© 2008 by the Information Processing Society of Japan
Previous article Next article
feedback
Top