|
|
(One intermediate revision by the same user not shown) |
Line 1: |
Line 1: |
| =<center>Cache sizes in multicore architectures</center>=
| | This page was mistakenly created. Please access this page at - http://pg.ece.ncsu.edu/mediawiki/index.php/CSC/ECE_506_Fall_2007/wiki2_5_as |
| | |
| ''Topic'' - Create a table of caches used in current multicore architectures,
| |
| including such parameters as number of levels, line size, size and
| |
| associativity of each level, latency of each level, whether each level
| |
| is shared, and coherence protocol used. Compare this with two or three
| |
| recent single-core designs.
| |
| | |
| | |
| {| border="1" cellpadding="5" cellspacing="0" align="center"
| |
| |+'''Detail of Caches'''
| |
| |-
| |
| ! colspan="6" style="background:#ffdead;" | Multicore Processors
| |
| |-
| |
| ! Processor Name
| |
| ! Number of Levels
| |
| ! Line Size
| |
| ! Cache Size
| |
| ! Associativity
| |
| ! Coherence Protocol
| |
| |-
| |
| | AMD Athlon 64 X2
| |
| | 2
| |
| | 64 bytes (for both L1 & L2)
| |
| | L1 - 64KB (Data) + 64KB (Instruction) per core<br/>L2 - 512KB to 1MB per core
| |
| | L1 - 2 way (Data and Instruction cache)<br/>L2 - 16 way associative
| |
| | Modified Owner Exclusive Shared Invalid (MOESI)
| |
| |-
| |
| | AMD Athlon 64 FX
| |
| | 2
| |
| | 64 bytes (for both L1 & L2)
| |
| | L1 - 64KB (Data) + 64KB (Instruction) per core<br/>L2 - 1MB per core
| |
| | L1 - 2 way (Data and Instruction cache)<br/>L2 - 16 way associative
| |
| | Modified Owner Exclusive Shared Invalid (MOESI)
| |
| |-
| |
| | AMD Athlon Opteron<br/>(marketed for servers)
| |
| | 2
| |
| | 64 bytes (for both L1 & L2)
| |
| | L1 - 64KB (Data) + 64KB (Instruction) per core<br/>L2 - 1MB per core
| |
| | L1 - 2 way (Data and Instruction cache)<br/>L2 - 16 way associative
| |
| | Modified Owner Exclusive Shared Invalid (MOESI)
| |
| |-
| |
| | Intel Pentium D
| |
| | 2
| |
| | L1 - 64 byte lines<br/>L2 - 128 byte lines
| |
| | L1 - 16 KB (data only. Instead of instruction cache, a "150KB trace cache" is used)<br/>L2 - 1MB or 2MB per core
| |
| | L1 - 4 way<br/>L2 - 8 way
| |
| | Modified Exclusive Shared Invalid (MESI)
| |
| |-
| |
| | Intel Pentium Dual Core
| |
| | 2
| |
| | L1 - 64 byte lines<br/>L2 - 64 byte lines
| |
| | L1 - 32 KB (both Data and Instruction cache)<br/>L2 - 1MB or 2MB per core
| |
| | L1 - 4 way<br/>L2 - 8 way
| |
| | Modified Exclusive Shared Invalid (MESI)
| |
| |-
| |
| | Intel Core 2 Duo
| |
| | 2
| |
| | L1 - 64 byte lines<br/>L2 - 64 byte lines
| |
| | L1 - 32 KB (each for Data and Instruction cache)<br/>L2 - 2MB or 4MB
| |
| | L1 - 4 way<br/>L2 - 8 way
| |
| | Modified Exclusive Shared Invalid (MESI)
| |
| |-
| |
| | Broadcom SiByte SB1250
| |
| | 2
| |
| | L1 - 32 byte lines<br/>L2 - 32 byte lines
| |
| | L1 - 32 KB (a piece for Data and Instruction caches)<br/>L2 - 512KB
| |
| | L1 - 2 way<br/>L2 - 4 way
| |
| | Modified Exclusive Shared Invalid (MESI)
| |
| |-
| |
| | Sun Microsystems UltraSPARC IV
| |
| | 2
| |
| | L1 - 128byte lines<br/>L2 - 128 byte lines
| |
| | L1 - 64KB data, 32KB instruction<br/>L2 - up to 16MB
| |
| | L2 - 2 way
| |
| | Modified Owner Exclusive Shared Invalid (MOESI)
| |
| |-
| |
| | IBM Cell Processor
| |
| | 2
| |
| | Not Available
| |
| | L1 - 32 KB (a piece for both data and instruction caches)<br/>L2 - 512KB
| |
| | L1 - 2 way instruction, 4 way data<br/>L2 - 8 way
| |
| | Modified Exclusive Shared Invalid (MESI)
| |
| |-
| |
| ! colspan="6" style="background:#ffdead;" | Singlecore Processors
| |
| |-
| |
| | AMD Athlon 64
| |
| | 2
| |
| | L1 - 64 byte lines<br/>L2 - 64 byte lines
| |
| | L1 - 64 KB (each for Data and Instruction cache)<br/>L2 - 512KB
| |
| | L1 - 2 way<br/>L2 - 16 way
| |
| | Modified Owner Exclusive Shared Invalid (MOESI)
| |
| |-
| |
| | AMD K6 / K6 III
| |
| | 2
| |
| | L1 - 32 byte lines<br/2>L2 -
| |
| | L1 - 32KB data (2-way associative), 32KB instruction (2-Way associative)<br/>L2 - 256KB
| |
| | L1 - 2 way<br/>L2 - 4 way
| |
| | Modified Exclusive Shared Invalid (MESI)
| |
| |-
| |
| | Intel Pentium 4
| |
| | 2
| |
| | L1 - 64 byte lines<br/>L2 - 128 byte lines
| |
| | L1 - 8 KB (data only. Instead of instruction cache, a "150KB trace cache" is used))<br/>L2 -256KB, 512KB or 1MB
| |
| | L1 - 4 way<br/>L2 - 8 way
| |
| | Modified Exclusive Shared Invalid (MESI)
| |
| |-
| |
| | Intel PentiumIII 600
| |
| | 2
| |
| | L1 - <br/>L2 -
| |
| | L1 - 16 KB data, 16K Instruction<br/>L2 - 256KB
| |
| | L1 - <br/>L2 - 8 way
| |
| |
| |
| |-
| |
| | Intel Core Solo
| |
| | 2
| |
| | L1 - <br/>L2 -
| |
| | L1 - 32 KB data, 32KB Instruction<br/>L2 - 2MB
| |
| | L1 - <br/>L2 -
| |
| |
| |
| |}
| |
| | |
| | |
| == Conclusion ==
| |
| | |
| | |
| | |
| | |
| == References ==
| |
| [1] http://www.amd.com/us-en/Processors/ProductInformation
| |
| | |
| [2] http://www.broadcom.com/products/Enterprise-Networking/Communications-Processors/BCM1250
| |
| | |
| [3] http://www-01.ibm.com/chips/techlib/techlib.nsf/products/PowerPC_970MP_Microprocessor
| |
| | |
| [4] http://www.intel.com/products/processor/xeon7000/documentation.htm?iid=products_xeon7000+tab_techdocs#datasheets
| |
| | |
| [5] http://www.sun.com/processors/UltraSPARC-IV/
| |
| | |
| [6] http://www.sun.com/processors/UltraSPARC-IV+/
| |
| | |
| [7] http://www.sun.com/processors/UltraSPARC-T1/
| |
| | |
| [8] http://www.streamprocessors.com/streamprocessors/Home/Products/Storm-1Family.html
| |
| | |
| [9] http://www.netlib.org/utk/papers/advanced-computers/pa-risc.html
| |
| | |
| [10] http://www.netlib.org/utk/papers/advanced-computers/power4.html
| |
| | |
| [11] http://www.netlib.org/utk/papers/advanced-computers/power5.html
| |
| | |
| [12] http://en.wikipedia.org/wiki/Cell_microprocessor
| |
| | |
| [13] http://techreport.com/articles.x/8236/2
| |
| | |
| [14] http://www.hardwaresecrets.com/article/481/9
| |