CSC/ECE 506 Spring 2011/ch6a ep

Recent Architectures and their Cache Characteristics

Multicore processor architectures with both shared and private caches are common in desktop and server platforms from a variety of manufacturers. The table below illustrates the variety of core and cache designs in microprocessors from four major manufacturers:

Table 1: Recent Architectures and their Cache Characteristics [1][2][3][4]
Company	Processor	Cores	L1 Cache	L2 Cache	L3 Cache
AMD	Athlon 64 X2	2	128 KB x 2	512 KB x 2 or 1 MB x 2
AMD	Athlon 64 FX	2	128 KB x 2	1 MB x 2
AMD	Athlon X2	2	128 KB x 2	512 KB x 2
AMD	Phenom X3	3	128 KB x 3	512 KB x 3	2 MB
AMD	Phenom X4	4	128 KB x 4	512 KB x 4	2 MB
AMD	Athlon II X2	2	128 KB x 2	512 KB x 2 or 1 MB x 2
AMD	Athlon II X3	3	128 KB x 3	512 KB x 3
AMD	Athlon II X4	4	128 KB x 4	512 KB x 4
AMD	Phenom II X2	2	128 KB x 2	512 KB x 2	6 MB
AMD	Phenom II X3	3	128 KB x 3	512 KB x 3	6 MB
AMD	Phenom II X4	4	128 KB x 4	512 KB x 4	4-6 MB
AMD	Phenom II X6	6	128 KB x 6	512 KB x 6	6 MB
Intel	Pentium D	2	12+16KB x 2	2 MB
Intel	Celeron E	2	32 KB x 2	512 KB - 1 MB
Intel	Pentium D	2	32 KB x2	1 MB
Intel	Core 2 Duo	2	32 KB x2	2 - 4 MB
Intel	Core 2 Quad	4	32 KB x2	8 MB
Intel	Atom 330	2	32+24 KB x 2	512 KB x 2
Intel	Pentium E	2	32 KB x 2	2 MB
Intel	Core 2 Duo E	2	32 KB x 2	3-6 MB
Intel	Core 2 Quad	4	32 KB x 4	2-6 MB x 2
Intel	Core i3	2	32+32 KB x 2	256 KB x2	4 MB
Intel	Core i5 - 6 Series	2	32+32 KB x 2	256 KB x2	4 MB
Intel	Core i5 - 7 Series	4	32+32 KB x 4	256 KB x 4	8 MB
Intel	Core i5 - 2400 Series, Core i5 - 2500 Series	4	32+32 KB x 4	256 KB x 4	6 MB
Intel	Core i7 - 8 Series	4	32+32 KB x 4	256 KB x 4	8 MB
Intel	Core i7 - 9 Series	4	32+32 KB x 4	256 KB x 4	8 MB
Intel	Core i7 - 970	6	32+32 KB x 6	256 KB x 6	12 MB
Sun	UltraSPARC T1	8	8 K x 8 Inst. 16 K x 8 Data	3 MB
Sun	SPARC64 VI	2	128 K x 2 Inst. 128 K x 2 Data	5 MB
Sun	UltraSPARC T2	8	8 K x 4 Inst. 16 K x 4 Data	4 MB
Sun	SPARC64 VII	4	64 K x 4 Inst. 64 K x 4 Data	6 MB
Sun	SPARC T3	8	8 K x 8 Inst. 16 K x 8 Data	6 MB
Sun	SPARC64 VII+	2	128 K x 2 Inst. 128 K x 2 Data	12 MB
IBM	Power5	2	64 K x 2 Inst. 64 K x 2 Data	4 MB x 2	32 MB
IBM	Power7	4, 6, or 8	32+32 KB x C	256 kB x C	4 - 32 MB x C

Cache Write Policies[5]

In section 6.2.3, cache write hit policies and write miss policies were explored. The write hit policies, write-through and write-back, determine when the data written in the local cache is propagated to lower-level memory. The write miss policies, write-allocate and no-write-allocate, determine if a memory block is stored in a cache line after the write occurs. Note that since a miss occurred on write, the block is not already in a cache line.
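The difference between the two write hit policies shows up in how many writes reach lower-level memory. The following is a minimal sketch, not a model of any real processor: `TinyCache` and `count_memory_writes` are hypothetical names, the cache holds a single block, and write-allocate is assumed on a miss.

```python
class TinyCache:
    """Single-line cache model that counts writes reaching lower-level memory."""

    def __init__(self, write_back):
        self.write_back = write_back  # True: write-back, False: write-through
        self.tag = None               # block currently held (None = empty)
        self.dirty = False
        self.memory_writes = 0

    def write(self, block):
        if self.tag != block:         # write miss (write-allocate is assumed)
            self._evict()
            self.tag = block
        if self.write_back:
            self.dirty = True         # defer propagation until eviction
        else:
            self.memory_writes += 1   # write-through: propagate immediately

    def _evict(self):
        # A dirty block must be written to memory before it is replaced.
        if self.write_back and self.dirty:
            self.memory_writes += 1
        self.dirty = False

    def flush(self):
        self._evict()
        self.tag = None


def count_memory_writes(trace, write_back):
    """Replay a list of written block numbers and return the memory write count."""
    cache = TinyCache(write_back)
    for block in trace:
        cache.write(block)
    cache.flush()
    return cache.memory_writes
```

For the trace `[0, 0, 0, 1, 1, 0]`, the write-through model sends every write to memory, while the write-back model writes a block to memory only when it is evicted or flushed.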

Write-miss policies can also be expressed in terms of write-allocate, fetch-on-write, and write-before-hit. These can be used in combination with the same write-through and write-back hit policies to define the complete cache write policy.

Fetch-on-Write

On a write miss, the memory block is not found in the cache, so the block containing the address to be written is fetched from the lower level of the memory hierarchy before the write proceeds. Note that this is different from write-allocate: fetch-on-write determines whether the block is fetched from memory, and write-allocate determines whether the block is stored in a cache line. Certain write policies may allocate a line in the cache for data written by the CPU without retrieving the block from memory first.

No-Fetch-on-Write

On a write miss, the memory block is not fetched first from the lower level of the memory hierarchy. Therefore, the write can proceed without having to wait for the memory block to be returned.
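The distinction between fetching and allocating can be sketched as follows. This is an illustrative model only: `handle_write_miss` is a hypothetical helper, the per-word valid bits are an assumption about how a line allocated without a fetch might be tracked, and fetch-on-write is assumed to imply that a line is allocated.

```python
def handle_write_miss(memory_block, word_index, value,
                      fetch_on_write, write_allocate):
    """Return (line, fetched) for a write miss.

    line is None when no cache line is allocated; otherwise it is a list
    of (valid, data) pairs, one per word in the block.
    """
    if fetch_on_write:
        # Fetch the whole block first, then merge in the written word.
        line = [(True, word) for word in memory_block]
        line[word_index] = (True, value)
        return line, True
    if write_allocate:
        # Allocate a line without fetching: only the written word is valid.
        line = [(False, None)] * len(memory_block)
        line[word_index] = (True, value)
        return line, False
    # No fetch and no allocation: the write bypasses the cache.
    return None, False
```

With no fetch, the write does not wait on memory, but any later read of a word whose valid bit is clear would still have to go to the lower level.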

Write-Before-Hit

On a write, the data is written before the cache has determined whether the access is a hit or a miss. In this scenario, the tag and the data can be written simultaneously.

No-Write-Before-Hit

On a write, the write waits until the cache has determined whether the block being written to is in the cache.
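The two orderings described in the two sections above differ in when the data array is written relative to the tag comparison. The sketch below models a single direct-mapped line; `write_word` and its `wait_for_tag_check` flag are hypothetical names, and it assumes a cache that does not allocate on a write miss, so a line overwritten early by a missing write is simply invalidated.

```python
def write_word(line, tag, value, wait_for_tag_check):
    """Apply one write to a direct-mapped cache line.

    line is a dict with keys 'tag', 'valid', and 'data'.
    Returns True if the write hit in the cache.
    """
    hit = line["valid"] and line["tag"] == tag
    if wait_for_tag_check:
        # The data array is touched only once a hit is known.
        if hit:
            line["data"] = value
        return hit
    # The data array is written in parallel with the tag comparison.
    line["data"] = value
    if not hit:
        # The line now holds data for the wrong block (assumed
        # no-write-allocate), so it can no longer be trusted.
        line["valid"] = False
    return hit
```

Writing early removes the tag check from the write's critical path, at the cost of losing the resident line whenever the write turns out to be a miss.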

Write-Miss Policy Combinations

Four combinations of these three write-miss policies are employed to provide an overall write policy. These are expressed in the table below:
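As a rough illustration of how three yes/no choices collapse into four overall policies, the sketch below maps each combination to a name. The names are taken from Jouppi's paper [5] as best recalled and should be treated as an assumption, not as a transcription of the table; `classify_write_miss_policy` is a hypothetical helper.

```python
from itertools import product


def classify_write_miss_policy(fetch_on_write, write_allocate, write_before_hit):
    """Map the three write-miss choices to an overall policy name.

    Assumption: naming follows Jouppi [5]. Fetching on a write implies
    allocation, and allocating without fetching makes the third choice
    irrelevant, so only four distinct policies remain.
    """
    if fetch_on_write:
        return "fetch-on-write"
    if write_allocate:
        return "write-validate"
    if write_before_hit:
        return "write-invalidate"
    return "write-around"


# Enumerate all eight flag combinations and collect the distinct names.
policies = {classify_write_miss_policy(*flags)
            for flags in product([True, False], repeat=3)}
```

Enumerating all eight flag settings this way yields only four distinct policy names, which is consistent with the count given above.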

Prefetching

References

[1]: http://www.techarp.com/showarticle.aspx?artno=337
[2]: http://en.wikipedia.org/wiki/SPARC
[3]: http://en.wikipedia.org/wiki/POWER7
[4]: http://en.wikipedia.org/wiki/POWER5
[5]: Norman Jouppi, "Cache write policies and performance," Proc. 20th International Symposium on Computer Architecture (ACM Computer Architecture News 21:2), May 1993, pp. 191–201. http://www.hpl.hp.com/techreports/Compaq-DEC/WRL-91-12.pdf
[6]: Intel Corporation, "Intel® 64 and IA-32 Architectures Optimization Reference Manual," 1997-2011. http://www.intel.com/Assets/PDF/manual/248966.pdf
