CSC 456 Fall 2013/1d vb

From Expertiza_Wiki
Jump to navigation Jump to search

Trends in Pipelining

Introduction

Computing architectures have changed greatly over the relatively short span of a few decades. In the pursuit of good performance and economical cost, processor architectures have taken many forms. There have been many trends over the years relating to specific processor characteristics such as pipeline length. Some changes are based on technological limitations of the time period, and other decisions are based on hypothetical and real world performance research.

Factors Favoring Longer Pipeline Length

With each tick of the clock, the pipeline is advanced by one stage. Having a much longer pipeline allows for each individual step to be very short. Since each individual pipeline step is relatively small it is possible for the clock speed to be much faster since each step does not require as much time or work.

There is a secondary affect of longer pipelines as well. The resulting higher clock speed can also be used as a marketing point. The average user does not understand the metrics of raw processor power, but being able to compare two numbers such as 2.9Ghz vs 3.4Ghz is a simple way in which many attempt to understand different processors.

Factors Favoring Shorter Pipeline Length

The issue with increased pipeline length is the problem of incorrect branch predictions. The longer a pipeline is, the more stages of wasted processing have been wasted when a different branch is taken. Decreasing the pipeline length has resulted in lower clock frequencies, but equal or better IPC. A smaller pipeline suffers less of a loss for every bad prediction, and the overall performance is improved. With all processor properties, there is no simple "best" pipeline, there is always a bell curve pointing to the the most effective pipeline pipeline length for a given setup.

Latch delays can also play a role when you increase pipeline Length.. " When a performance increase is attempted by further increasing the number of stages for a processor in which the number of stages is more than 90% of the optimum for performance, the performance is found not to be increased unless the increasable latch overhead time is less than the overhead time. " [5] - An Analysis about Increasable Latch Overhead Time for Processor Pipeline Depth Increase

Power Consumption vs. Performance

Along with performance, power is another concern in microarchitectural design. There have been multiple studies on what optimal pipeline depth when considering both performance and power. Srinivasan stated that majority of power used is related to latches, including clocking and the leakage of power per latch. The number of latches grows super linearly with the number of pipeline stages according to Srinivasan,

An Example of Pipeline changes in Cray Systems

Pipeline Specifications of Cray Systems
Year Name Pipeline Length Number of Pipelines
1976 Cray 1 3 [2] 12
2006 IBM Cell BE 23 [7] 16
2012 Cray XK7 12 for scalar , 17 for vector [3] 6

An example of the Cell Pipeline

Sources

  1. The AMD Opteron Processor for Multiprocessor Servers
  2. Cray 1
  3. Cray SK7
  4. The Optimum Pipeline Depth for a Microprocessor
  5. An Analysis about Increasable Latch Overheard time for Processor Pipeline Depth Increase
  6. Optimum Power/Performance Pipeline Depth
  7. Cell Broadband Engine Architecture