computer science & information technology: Memory disambiguation hardware:

Monday, April 5, 2010

Memory disambiguation hardware:

1. INTRODUCTION

At high operating frequencies, modern out-of-order processors often need to buffer a very large number of instructions in order to overlap useful processing with the relatively long latencies of accesses to lower levels of the memory hierarchy. Processor features such as multithreading further increase the demand on instruction buffering capacity. However, increasing the number of in-flight instructions requires scaling up several microarchitectural structures, which has a significant impact on energy consumption, especially if a structure is accessed associatively. One such example is the logic that enforces correct memory-based dependences, commonly referred to as the load-store queue (LSQ) and typically implemented as two separate queues: the load queue (LQ) and the store queue (SQ). Conventional implementations of these queues hold complete addresses, and their entries are allocated in program order. To enable early execution of loads without compromising program correctness, memory instructions are tracked by the two queues, and associative searches are used to find the correct producer or to detect dependence violations. These associative searches are a major concern for the scalability of the queues. Not only does energy consumption increase with the size of the queue, but access latency also worsens, which may complicate the logic design. As a result, a range of implementations that avoid associative searches have been explored recently. The main observation behind these designs is that memory-based dependences are very infrequent, and hence, through clever filtering or prediction, it is possible to reduce the number of associative accesses. Sections 2 and 3 recap the conventional design of the LSQ and the main alternatives. Section 4 explores our proposals. Finally, Section 5 concludes.

2. CONVENTIONAL DESIGN

Modern out-of-order processors usually employ an array of sophisticated techniques that allow early execution of loads to improve performance. Almost all designs include techniques such as load bypassing and load forwarding. Both schemes allow early execution of a load once all preceding stores have calculated their addresses. More aggressive implementations go a step further and allow a load to execute while the address of a preceding store is not yet resolved. Such speculative execution is premature if a store that is earlier in program order writes to the location being loaded but executes after the load. Clearly, this speculation has to be applied in a way that does not compromise program correctness. Thus, the processor needs to detect, squash, and re-execute (or replay) premature loads and their dependents. To simplify implementation, processors typically replay many more instructions than strictly necessary (such as all instructions following the store [1]). This is acceptable because premature loads are rare in general, and extra logic is sometimes employed to further reduce their occurrence [2].
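The difference between the conservative and the aggressive issue policies described above can be sketched in a few lines. This is only an illustrative model, not the logic of any particular processor: the names `OlderStore`, `may_issue_load`, and `is_premature` are hypothetical, and addresses are compared for exact equality, which ignores access sizes and partial overlaps.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class OlderStore:
    """An in-flight store that precedes the load in program order (hypothetical model)."""
    address: Optional[int]  # None while the store address is still unresolved


def may_issue_load(older_stores: Sequence[OlderStore], speculative: bool) -> bool:
    """Decide whether a load may execute ahead of its older stores.

    Conservative policy: wait until every older store address is known.
    Speculative policy: issue anyway and rely on later violation detection.
    """
    all_resolved = all(s.address is not None for s in older_stores)
    return all_resolved or speculative


def is_premature(load_address: int, late_store_address: int) -> bool:
    """A speculatively issued load was premature if an older store that
    resolves afterwards turns out to write the same address (assumed exact match)."""
    return load_address == late_store_address
```

Under the speculative policy the load issues even with an unresolved older store, so the `is_premature` check stands in for the detection step that triggers a squash and replay.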

Dependence enforcement is achieved using an age-ordered load queue and an age-ordered store queue. A memory instruction of one type needs to check the queue of the opposite kind in an associative fashion (see Figure 1): a load searches the SQ to forward data from an earlier, in-flight store, and a store searches the LQ to identify loads that have executed prematurely (that is, loads that were wrongly speculated).
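The two associative searches can be modelled in software as linear scans over the queues. The sketch below is a simplified illustration under stated assumptions: the entry types and the functions `forward_from_sq` and `find_premature_loads` are hypothetical names, ages are program-order sequence numbers (smaller means older), addresses must match exactly, and the model ignores the case where a load already obtained its data from a store younger than the one being checked.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class StoreEntry:
    age: int                      # program-order sequence number (smaller = older)
    address: Optional[int]        # None until the address has been computed
    data: Optional[int] = None


@dataclass
class LoadEntry:
    age: int
    address: int
    executed: bool = False


def forward_from_sq(sq: List[StoreEntry], load: LoadEntry) -> Optional[int]:
    """Load-side search: return the data of the youngest store that is older
    than the load and writes the same address, or None if there is no match."""
    best: Optional[StoreEntry] = None
    for st in sq:
        if st.age < load.age and st.address == load.address:
            if best is None or st.age > best.age:
                best = st
    return best.data if best is not None else None


def find_premature_loads(lq: List[LoadEntry], store: StoreEntry) -> List[LoadEntry]:
    """Store-side search: return younger loads to the same address that have
    already executed, i.e. candidates for squash and replay."""
    return [
        ld for ld in lq
        if ld.executed and ld.age > store.age and ld.address == store.address
    ]
```

In hardware both searches compare the incoming address against every entry in parallel, which is the cost that grows with queue size; the loops here only reproduce the result of that comparison.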

[FIGURE 1 OMITTED]

3. LSQ: STATE OF THE ART

The LSQ is a hardware structure that exhibits two main problems: 1) its logic is complex because it involves associative comparison of wide operands, which implies high energy consumption, and 2) scaling the LSQ increases its access latency, which makes it hard to integrate into high-frequency designs. We can identify three different approaches to overcoming these problems. First, based on the observed behavior of memory instructions (dependences and forwarding are infrequent), many researchers have proposed filtering techniques to reduce the number of associative searches. Second, other designs adopt a two-level approach to disambiguation and forwarding. The guiding principle is largely the same: use a small first-level structure that can still perform the large majority of the work, backed up by a much larger second-level structure that corrects or complements it. Finally, other designs try to simplify or remove the associative hardware of the LSQ in search of simpler and cheaper management of load-store queue operations. In the following subsections we summarize the main contributions. Before reviewing them, we start with the most important memory dependence prediction techniques.
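As a rough illustration of the filtering idea, the sketch below uses a small table of counters indexed by a hash of the store address, so that a load can skip the associative SQ search whenever its counter is zero. This is an assumed, generic design in the spirit of a counting Bloom filter, not the mechanism of any specific proposal surveyed here; the class `StoreAddressFilter`, its method names, the table size, and the 8-byte word granularity are all hypothetical.

```python
class StoreAddressFilter:
    """Counting filter over in-flight store addresses (illustrative sketch).

    A zero counter guarantees that no in-flight store maps to that slot, so
    the associative search can be skipped. A non-zero counter may be a false
    positive caused by aliasing, so the full search is still required.
    """

    def __init__(self, num_counters: int = 64) -> None:
        self.counters = [0] * num_counters

    def _index(self, address: int) -> int:
        # Drop the byte offset within an assumed 8-byte word, then hash by modulo.
        return (address >> 3) % len(self.counters)

    def store_issued(self, address: int) -> None:
        self.counters[self._index(address)] += 1

    def store_retired(self, address: int) -> None:
        self.counters[self._index(address)] -= 1

    def may_match(self, address: int) -> bool:
        """Return True if an associative SQ search is still needed for this load."""
        return self.counters[self._index(address)] > 0
```

Because dependences are infrequent, most loads would see a zero counter in a scheme like this and avoid the search entirely; aliasing only costs an unnecessary search and never affects correctness.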
