<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.expertiza.ncsu.edu/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Bachish2</id>
	<title>Expertiza_Wiki - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.expertiza.ncsu.edu/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Bachish2"/>
	<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=Special:Contributions/Bachish2"/>
	<updated>2026-06-29T04:02:51Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.41.0</generator>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62774</id>
		<title>CSC 456 Spring 2012/11b AB</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62774"/>
		<updated>2012-04-26T04:57:13Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: /* References */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Large-Scale Multiprocessors=&lt;br /&gt;
With modern technology, large-scale multiprocessors (LSMs) have become more prevalent.  There has been considerable research into networking topologies for connecting the processors, and several methods have been conceived to ensure coherence.  Additionally, there are numerous manufacturers who make the materials necessary to build LSMs.  In this article, we show examples of each of these.  &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Manufacturers==&lt;br /&gt;
&lt;br /&gt;
In order to build a LSM, you will need to choose the right processors, as well as the most appropriate cabinet(s) to place them in.  There are several different manufacturers of processors and cabinets that can be used in LSM configurations.  For example, Fujitsu's K computer (the number one ranked supercomputer on TOP500's November 2011 list) uses a configuration of 88,128 SPARC64 VIIIfx processors.  This means it has a total of 705,024 cores at its use.&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;  Additional examples of processors used in LSMs can be found in Table 1.&lt;br /&gt;
&lt;br /&gt;
[[File:kcomputer.jpg|thumb|right|none|upright=2|alt=alt text|Fujitsu's K Computer&amp;lt;ref name=&amp;quot;k computer image&amp;quot;/&amp;gt;]] &lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 1: Processor Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Processor&lt;br /&gt;
! Year&lt;br /&gt;
! Cores&lt;br /&gt;
! Clock Rate&lt;br /&gt;
! Architecture&lt;br /&gt;
|-&lt;br /&gt;
| Fujitsu&lt;br /&gt;
| SPARC64 VIIIfx&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2009&lt;br /&gt;
| 8&lt;br /&gt;
| 2.0 GHz&lt;br /&gt;
| SPARC&lt;br /&gt;
|-&lt;br /&gt;
| Intel&lt;br /&gt;
| Xeon 7500&amp;lt;ref name=&amp;quot;intel proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2010&lt;br /&gt;
| 8&lt;br /&gt;
| 1.733-2.667 GHz&lt;br /&gt;
| Nehalem&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| POWER7&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2010&lt;br /&gt;
| 8&lt;br /&gt;
| 2.4-4.25 GHz&lt;br /&gt;
| Power ISA v.2.06&lt;br /&gt;
|-&lt;br /&gt;
| AMD&lt;br /&gt;
| Opteron 6100&amp;lt;ref name=&amp;quot;amd proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2010&lt;br /&gt;
| 12&lt;br /&gt;
| 1.7-2.4 GHz&lt;br /&gt;
| Direct Connect 2.0&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Like processors, different manufacturers offer varying cabinet/server types, such as IBM's BladeCenter HT.  This particular model uses their CoolBlue technology, a set of tools that allows the user to have greater control over cooling and power use.  There are also some standard cabinet frames, such as 19-inch racks, which get their name from the 19-inch panels used in their design.  Typically, these racks allow for easy processor/server installation and removal. &amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 2: Cabinet Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Cabinet &lt;br /&gt;
! Blade Count&lt;br /&gt;
|-&lt;br /&gt;
| SuperMicro&lt;br /&gt;
| MP Superserver 8064B-TRLF&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| HP&lt;br /&gt;
| Integrity Superdome 2&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 32&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| BladeCenter HT&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Assembling==&lt;br /&gt;
&lt;br /&gt;
===Network Topology===&lt;br /&gt;
There are many different ways to connect the network of processors. Each network type has different properties and values related to their diameter, bisection bandwidth, and degree. The diameter of a network is the longest number of network hops between any pair of nodes. Bisection bandwidth refers to the minimum number of links that need to be cut to divide the network in half. The degree of a network refers to the number of in/out links on each node. The following figure displays some examples.&lt;br /&gt;
&lt;br /&gt;
[[File:NetworkTopologies.png|center|left|An example of possible network structures.&amp;lt;ref name = &amp;quot;topology&amp;quot;/&amp;gt;]]&lt;br /&gt;
&lt;br /&gt;
The following table gives some more detail on the different characteristics of some network types. &amp;quot;p&amp;quot; is the number of nodes, &amp;quot;d&amp;quot; is dimensions, and &amp;quot;k&amp;quot; is the number of nodes in each dimension.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 3: Network Properties&lt;br /&gt;
|-&lt;br /&gt;
! Topology&lt;br /&gt;
! Diameter&lt;br /&gt;
! Bandwidth&lt;br /&gt;
! Degree&lt;br /&gt;
! Example(s)&lt;br /&gt;
|-&lt;br /&gt;
| Ring&lt;br /&gt;
| p/2&lt;br /&gt;
| 2&lt;br /&gt;
| 2&lt;br /&gt;
| KSR-1, NUMA-chine &amp;lt;ref name=&amp;quot;ring&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| k-ary d Mesh&lt;br /&gt;
| 2(sqrt(p) - 1)&lt;br /&gt;
| sqrt(p)&lt;br /&gt;
| 4&lt;br /&gt;
| Intel Paragon, Cray T3D &amp;lt;ref name=&amp;quot;mesh&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Butterfly&lt;br /&gt;
| log_2(p)&lt;br /&gt;
| p/2&lt;br /&gt;
| 4&lt;br /&gt;
| BBN Butterfly&amp;lt;ref name=&amp;quot;butterfly&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| k-ary Fat Tree&lt;br /&gt;
| 2 x log_k(p)&lt;br /&gt;
| p/2&lt;br /&gt;
| k+1&lt;br /&gt;
| Xtreme-X&amp;lt;ref name=&amp;quot;xtreme&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Hypercube&lt;br /&gt;
| log_2(p)&lt;br /&gt;
| p/2&lt;br /&gt;
| log_2(p)&lt;br /&gt;
| nCUBE 1&amp;lt;ref name=&amp;quot;ncube&amp;quot;/&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Coherence==&lt;br /&gt;
&lt;br /&gt;
For LSMs that use a Distributed Shared Memory (DSM) architecture, cache coherence is an important issue. In 1990, researchers at the Massachusetts Institute of Technology showed that it was possible to build to build a coherent LSM using a directory-based approach with the Alewife multiprocessor.&amp;lt;ref name=&amp;quot;alewife&amp;quot;/&amp;gt;  A modern example is the Pittsburgh Supercomputing Center's Blacklight, a supercomputer with hardware-enabled shared coherent memory.&amp;lt;ref name=&amp;quot;blacklight&amp;quot;/&amp;gt;  &lt;br /&gt;
&lt;br /&gt;
On the other hand, some LSMs use distributed memory systems, meaning that each of the processors has its own private memory, making cache coherency a non-issue.  Fujitsu's K computer is an example of such a system.&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Another example of a memory design used by LSMs is Non Uniform Memory Access (NUMA).  NUMA has a coherent version of its system, called cache coherent NUMA (ccNUMA), where data and memory is accessed globally.&amp;lt;ref name=&amp;quot;ccNUMA&amp;quot;/&amp;gt;  The 2008 IBM Roadrunner supercomputer, which has 6480 Opteron processors and 12960 IBM Cell processors, uses ccNUMA.&amp;lt;ref name=&amp;quot;roadrunner&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/SPARC64_VIIIfx Fujitsu SPARC64 VIIIfx Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/Xeon#6500.2F7500-series_.22Beckton.22 Intel Xeon Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/Power7 IBM Power 7 Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/Opteron#Opteron_.2845_nm_SOI.29 AMD Opteron Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;&amp;gt;[http://www.supermicro.com/products/system/4U/8046/SYS-8046B-TRLF.cfm Supermicro Chassis]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;&amp;gt;[http://h20341.www2.hp.com/integrity/us/en/high-end/integrity-high-end-servers-superdome2.html HP Chassis]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;&amp;gt;[http://www-03.ibm.com/systems/bladecenter/hardware/chassis/bladeht/index.html IBM Chassis]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer&amp;quot;&amp;gt;[http://top500.org/lists/2011/11/press-release K Computer]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/19-inch_rack 19 inch Rack]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;alewife&amp;quot;&amp;gt;[http://webcache.googleusercontent.com/search?q=cache:-oLJbStOeAEJ:groups.csail.mit.edu/cag/pub/papers/chaiken-thesis.ps.Z+&amp;amp;cd=1&amp;amp;hl=en&amp;amp;ct=clnk&amp;amp;gl=us&amp;amp;client=firefox-a Cache Coherence Protocols for Large Scale Multiprocessors]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;blacklight&amp;quot;&amp;gt;[http://www.psc.edu/machines/sgi/uv/blacklight.php Blacklight Multiprocessor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;topology&amp;quot;&amp;gt;[http://en.wikibooks.org/wiki/Communication_Networks/Network_Topologies Network Topologies]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer image&amp;quot;&amp;gt;[http://www.top500.org/files/systems/k.jpg K Computer Image]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ring&amp;quot;&amp;gt;[http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.133.6894&amp;amp;rep=rep1&amp;amp;type=pdf Ring Network Example]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;mesh&amp;quot;&amp;gt;[http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.48.4149&amp;amp;rep=rep1&amp;amp;type=pdf Mesh Network Example]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ccNUMA&amp;quot;&amp;gt;[http://www.top500.org/2007_overview_recent_supercomputers/ccnuma_machines ccNuma Machines]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;roadrunner&amp;quot;&amp;gt;[http://www.leb.eei.uni-erlangen.de/winterakademie/2009/report/content/course02/pdf/0211.pdf Supercomputer Architecture]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;butterfly&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/BBN_Butterfly BBN Butterfly Supercomputer]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ncube&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/NCUBE nCUBE 1]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;xtreme&amp;quot;&amp;gt;[http://www.appro.com/products/supercomputers/xtreme-x_supercomputer/ Xtreme-X]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62773</id>
		<title>CSC 456 Spring 2012/11b AB</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62773"/>
		<updated>2012-04-26T04:56:41Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: /* Network Topology */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Large-Scale Multiprocessors=&lt;br /&gt;
With modern technology, large-scale multiprocessors (LSMs) have become more prevalent.  There has been considerable research into networking topologies for connecting the processors, and several methods have been conceived to ensure coherence.  Additionally, there are numerous manufacturers who make the materials necessary to build LSMs.  In this article, we show examples of each of these.  &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Manufacturers==&lt;br /&gt;
&lt;br /&gt;
In order to build a LSM, you will need to choose the right processors, as well as the most appropriate cabinet(s) to place them in.  There are several different manufacturers of processors and cabinets that can be used in LSM configurations.  For example, Fujitsu's K computer (the number one ranked supercomputer on TOP500's November 2011 list) uses a configuration of 88,128 SPARC64 VIIIfx processors.  This means it has a total of 705,024 cores at its use.&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;  Additional examples of processors used in LSMs can be found in Table 1.&lt;br /&gt;
&lt;br /&gt;
[[File:kcomputer.jpg|thumb|right|none|upright=2|alt=alt text|Fujitsu's K Computer&amp;lt;ref name=&amp;quot;k computer image&amp;quot;/&amp;gt;]] &lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 1: Processor Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Processor&lt;br /&gt;
! Year&lt;br /&gt;
! Cores&lt;br /&gt;
! Clock Rate&lt;br /&gt;
! Architecture&lt;br /&gt;
|-&lt;br /&gt;
| Fujitsu&lt;br /&gt;
| SPARC64 VIIIfx&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2009&lt;br /&gt;
| 8&lt;br /&gt;
| 2.0 GHz&lt;br /&gt;
| SPARC&lt;br /&gt;
|-&lt;br /&gt;
| Intel&lt;br /&gt;
| Xeon 7500&amp;lt;ref name=&amp;quot;intel proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2010&lt;br /&gt;
| 8&lt;br /&gt;
| 1.733-2.667 GHz&lt;br /&gt;
| Nehalem&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| POWER7&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2010&lt;br /&gt;
| 8&lt;br /&gt;
| 2.4-4.25 GHz&lt;br /&gt;
| Power ISA v.2.06&lt;br /&gt;
|-&lt;br /&gt;
| AMD&lt;br /&gt;
| Opteron 6100&amp;lt;ref name=&amp;quot;amd proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2010&lt;br /&gt;
| 12&lt;br /&gt;
| 1.7-2.4 GHz&lt;br /&gt;
| Direct Connect 2.0&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Like processors, different manufacturers offer varying cabinet/server types, such as IBM's BladeCenter HT.  This particular model uses their CoolBlue technology, a set of tools that allows the user to have greater control over cooling and power use.  There are also some standard cabinet frames, such as 19-inch racks, which get their name from the 19-inch panels used in their design.  Typically, these racks allow for easy processor/server installation and removal. &amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 2: Cabinet Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Cabinet &lt;br /&gt;
! Blade Count&lt;br /&gt;
|-&lt;br /&gt;
| SuperMicro&lt;br /&gt;
| MP Superserver 8064B-TRLF&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| HP&lt;br /&gt;
| Integrity Superdome 2&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 32&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| BladeCenter HT&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Assembling==&lt;br /&gt;
&lt;br /&gt;
===Network Topology===&lt;br /&gt;
There are many different ways to connect the network of processors. Each network type has different properties and values related to their diameter, bisection bandwidth, and degree. The diameter of a network is the longest number of network hops between any pair of nodes. Bisection bandwidth refers to the minimum number of links that need to be cut to divide the network in half. The degree of a network refers to the number of in/out links on each node. The following figure displays some examples.&lt;br /&gt;
&lt;br /&gt;
[[File:NetworkTopologies.png|center|left|An example of possible network structures.&amp;lt;ref name = &amp;quot;topology&amp;quot;/&amp;gt;]]&lt;br /&gt;
&lt;br /&gt;
The following table gives some more detail on the different characteristics of some network types. &amp;quot;p&amp;quot; is the number of nodes, &amp;quot;d&amp;quot; is dimensions, and &amp;quot;k&amp;quot; is the number of nodes in each dimension.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 3: Network Properties&lt;br /&gt;
|-&lt;br /&gt;
! Topology&lt;br /&gt;
! Diameter&lt;br /&gt;
! Bandwidth&lt;br /&gt;
! Degree&lt;br /&gt;
! Example(s)&lt;br /&gt;
|-&lt;br /&gt;
| Ring&lt;br /&gt;
| p/2&lt;br /&gt;
| 2&lt;br /&gt;
| 2&lt;br /&gt;
| KSR-1, NUMA-chine &amp;lt;ref name=&amp;quot;ring&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| k-ary d Mesh&lt;br /&gt;
| 2(sqrt(p) - 1)&lt;br /&gt;
| sqrt(p)&lt;br /&gt;
| 4&lt;br /&gt;
| Intel Paragon, Cray T3D &amp;lt;ref name=&amp;quot;mesh&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Butterfly&lt;br /&gt;
| log_2(p)&lt;br /&gt;
| p/2&lt;br /&gt;
| 4&lt;br /&gt;
| BBN Butterfly&amp;lt;ref name=&amp;quot;butterfly&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| k-ary Fat Tree&lt;br /&gt;
| 2 x log_k(p)&lt;br /&gt;
| p/2&lt;br /&gt;
| k+1&lt;br /&gt;
| Xtreme-X&amp;lt;ref name=&amp;quot;xtreme&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Hypercube&lt;br /&gt;
| log_2(p)&lt;br /&gt;
| p/2&lt;br /&gt;
| log_2(p)&lt;br /&gt;
| nCUBE 1&amp;lt;ref name=&amp;quot;ncube&amp;quot;/&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Coherence==&lt;br /&gt;
&lt;br /&gt;
For LSMs that use a Distributed Shared Memory (DSM) architecture, cache coherence is an important issue. In 1990, researchers at the Massachusetts Institute of Technology showed that it was possible to build to build a coherent LSM using a directory-based approach with the Alewife multiprocessor.&amp;lt;ref name=&amp;quot;alewife&amp;quot;/&amp;gt;  A modern example is the Pittsburgh Supercomputing Center's Blacklight, a supercomputer with hardware-enabled shared coherent memory.&amp;lt;ref name=&amp;quot;blacklight&amp;quot;/&amp;gt;  &lt;br /&gt;
&lt;br /&gt;
On the other hand, some LSMs use distributed memory systems, meaning that each of the processors has its own private memory, making cache coherency a non-issue.  Fujitsu's K computer is an example of such a system.&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Another example of a memory design used by LSMs is Non Uniform Memory Access (NUMA).  NUMA has a coherent version of its system, called cache coherent NUMA (ccNUMA), where data and memory is accessed globally.&amp;lt;ref name=&amp;quot;ccNUMA&amp;quot;/&amp;gt;  The 2008 IBM Roadrunner supercomputer, which has 6480 Opteron processors and 12960 IBM Cell processors, uses ccNUMA.&amp;lt;ref name=&amp;quot;roadrunner&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/SPARC64_VIIIfx Fujitsu SPARC64 VIIIfx Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/Xeon#6500.2F7500-series_.22Beckton.22 Intel Xeon Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/Power7 IBM Power 7 Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/Opteron#Opteron_.2845_nm_SOI.29 AMD Opteron Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;&amp;gt;[http://www.supermicro.com/products/system/4U/8046/SYS-8046B-TRLF.cfm Supermicro Chassis]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;&amp;gt;[http://h20341.www2.hp.com/integrity/us/en/high-end/integrity-high-end-servers-superdome2.html HP Chassis]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;&amp;gt;[http://www-03.ibm.com/systems/bladecenter/hardware/chassis/bladeht/index.html IBM Chassis]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer&amp;quot;&amp;gt;[http://top500.org/lists/2011/11/press-release K Computer]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/19-inch_rack 19 inch Rack]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;alewife&amp;quot;&amp;gt;[http://webcache.googleusercontent.com/search?q=cache:-oLJbStOeAEJ:groups.csail.mit.edu/cag/pub/papers/chaiken-thesis.ps.Z+&amp;amp;cd=1&amp;amp;hl=en&amp;amp;ct=clnk&amp;amp;gl=us&amp;amp;client=firefox-a Cache Coherence Protocols for Large Scale Multiprocessors]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;blacklight&amp;quot;&amp;gt;[http://www.psc.edu/machines/sgi/uv/blacklight.php Blacklight Multiprocessor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;topology&amp;quot;&amp;gt;[http://en.wikibooks.org/wiki/Communication_Networks/Network_Topologies Network Topologies]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer image&amp;quot;&amp;gt;[http://www.top500.org/files/systems/k.jpg K Computer Image]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ring&amp;quot;&amp;gt;[http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.133.6894&amp;amp;rep=rep1&amp;amp;type=pdf Ring Network Example]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;mesh&amp;quot;&amp;gt;[http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.48.4149&amp;amp;rep=rep1&amp;amp;type=pdf Mesh Network Example]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ccNUMA&amp;quot;&amp;gt;[http://www.top500.org/2007_overview_recent_supercomputers/ccnuma_machines ccNuma Machines]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;roadrunner&amp;quot;&amp;gt;[http://www.leb.eei.uni-erlangen.de/winterakademie/2009/report/content/course02/pdf/0211.pdf Supercomputer Architecture]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;butterfly&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/BBN_Butterfly BBN Butterfly Supercomputer]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ncube&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/NCUBE nCUBE 1]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62772</id>
		<title>CSC 456 Spring 2012/11b AB</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62772"/>
		<updated>2012-04-26T04:53:12Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: /* References */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Large-Scale Multiprocessors=&lt;br /&gt;
With modern technology, large-scale multiprocessors (LSMs) have become more prevalent.  There has been considerable research into networking topologies for connecting the processors, and several methods have been conceived to ensure coherence.  Additionally, there are numerous manufacturers who make the materials necessary to build LSMs.  In this article, we show examples of each of these.  &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Manufacturers==&lt;br /&gt;
&lt;br /&gt;
In order to build a LSM, you will need to choose the right processors, as well as the most appropriate cabinet(s) to place them in.  There are several different manufacturers of processors and cabinets that can be used in LSM configurations.  For example, Fujitsu's K computer (the number one ranked supercomputer on TOP500's November 2011 list) uses a configuration of 88,128 SPARC64 VIIIfx processors.  This means it has a total of 705,024 cores at its use.&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;  Additional examples of processors used in LSMs can be found in Table 1.&lt;br /&gt;
&lt;br /&gt;
[[File:kcomputer.jpg|thumb|right|none|upright=2|alt=alt text|Fujitsu's K Computer&amp;lt;ref name=&amp;quot;k computer image&amp;quot;/&amp;gt;]] &lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 1: Processor Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Processor&lt;br /&gt;
! Year&lt;br /&gt;
! Cores&lt;br /&gt;
! Clock Rate&lt;br /&gt;
! Architecture&lt;br /&gt;
|-&lt;br /&gt;
| Fujitsu&lt;br /&gt;
| SPARC64 VIIIfx&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2009&lt;br /&gt;
| 8&lt;br /&gt;
| 2.0 GHz&lt;br /&gt;
| SPARC&lt;br /&gt;
|-&lt;br /&gt;
| Intel&lt;br /&gt;
| Xeon 7500&amp;lt;ref name=&amp;quot;intel proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2010&lt;br /&gt;
| 8&lt;br /&gt;
| 1.733-2.667 GHz&lt;br /&gt;
| Nehalem&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| POWER7&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2010&lt;br /&gt;
| 8&lt;br /&gt;
| 2.4-4.25 GHz&lt;br /&gt;
| Power ISA v.2.06&lt;br /&gt;
|-&lt;br /&gt;
| AMD&lt;br /&gt;
| Opteron 6100&amp;lt;ref name=&amp;quot;amd proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2010&lt;br /&gt;
| 12&lt;br /&gt;
| 1.7-2.4 GHz&lt;br /&gt;
| Direct Connect 2.0&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Like processors, different manufacturers offer varying cabinet/server types, such as IBM's BladeCenter HT.  This particular model uses their CoolBlue technology, a set of tools that allows the user to have greater control over cooling and power use.  There are also some standard cabinet frames, such as 19-inch racks, which get their name from the 19-inch panels used in their design.  Typically, these racks allow for easy processor/server installation and removal. &amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 2: Cabinet Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Cabinet &lt;br /&gt;
! Blade Count&lt;br /&gt;
|-&lt;br /&gt;
| SuperMicro&lt;br /&gt;
| MP Superserver 8064B-TRLF&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| HP&lt;br /&gt;
| Integrity Superdome 2&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 32&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| BladeCenter HT&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Assembling==&lt;br /&gt;
&lt;br /&gt;
===Network Topology===&lt;br /&gt;
There are many different ways to connect the network of processors. Each network type has different properties and values related to their diameter, bisection bandwidth, and degree. The diameter of a network is the longest number of network hops between any pair of nodes. Bisection bandwidth refers to the minimum number of links that need to be cut to divide the network in half. The degree of a network refers to the number of in/out links on each node. The following figure displays some examples.&lt;br /&gt;
&lt;br /&gt;
[[File:NetworkTopologies.png|center|left|An example of possible network structures.&amp;lt;ref name = &amp;quot;topology&amp;quot;/&amp;gt;]]&lt;br /&gt;
&lt;br /&gt;
The following table gives some more detail on the different characteristics of some network types. &amp;quot;p&amp;quot; is the number of nodes, &amp;quot;d&amp;quot; is dimensions, and &amp;quot;k&amp;quot; is the number of nodes in each dimension.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 3: Network Properties&lt;br /&gt;
|-&lt;br /&gt;
! Topology&lt;br /&gt;
! Diameter&lt;br /&gt;
! Bandwidth&lt;br /&gt;
! Degree&lt;br /&gt;
! Example(s)&lt;br /&gt;
|-&lt;br /&gt;
| Ring&lt;br /&gt;
| p/2&lt;br /&gt;
| 2&lt;br /&gt;
| 2&lt;br /&gt;
| KSR-1, NUMA-chine &amp;lt;ref name=&amp;quot;ring&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| k-ary d Mesh&lt;br /&gt;
| 2(sqrt(p) - 1)&lt;br /&gt;
| sqrt(p)&lt;br /&gt;
| 4&lt;br /&gt;
| Intel Paragon, Cray T3D &amp;lt;ref name=&amp;quot;mesh&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Butterfly&lt;br /&gt;
| log_2(p)&lt;br /&gt;
| p/2&lt;br /&gt;
| 4&lt;br /&gt;
| BBN Butterfly&amp;lt;ref name=&amp;quot;butterfly&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| k-ary Tree&lt;br /&gt;
| 2 x log_k(p)&lt;br /&gt;
| 1&lt;br /&gt;
| k+1&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| Hypercube&lt;br /&gt;
| log_2(p)&lt;br /&gt;
| p/2&lt;br /&gt;
| log_2(p)&lt;br /&gt;
| nCUBE 1&amp;lt;ref name=&amp;quot;ncube&amp;quot;/&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Coherence==&lt;br /&gt;
&lt;br /&gt;
For LSMs that use a Distributed Shared Memory (DSM) architecture, cache coherence is an important issue. In 1990, researchers at the Massachusetts Institute of Technology showed that it was possible to build to build a coherent LSM using a directory-based approach with the Alewife multiprocessor.&amp;lt;ref name=&amp;quot;alewife&amp;quot;/&amp;gt;  A modern example is the Pittsburgh Supercomputing Center's Blacklight, a supercomputer with hardware-enabled shared coherent memory.&amp;lt;ref name=&amp;quot;blacklight&amp;quot;/&amp;gt;  &lt;br /&gt;
&lt;br /&gt;
On the other hand, some LSMs use distributed memory systems, meaning that each of the processors has its own private memory, making cache coherency a non-issue.  Fujitsu's K computer is an example of such a system.&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Another example of a memory design used by LSMs is Non Uniform Memory Access (NUMA).  NUMA has a coherent version of its system, called cache coherent NUMA (ccNUMA), where data and memory is accessed globally.&amp;lt;ref name=&amp;quot;ccNUMA&amp;quot;/&amp;gt;  The 2008 IBM Roadrunner supercomputer, which has 6480 Opteron processors and 12960 IBM Cell processors, uses ccNUMA.&amp;lt;ref name=&amp;quot;roadrunner&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/SPARC64_VIIIfx Fujitsu SPARC64 VIIIfx Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/Xeon#6500.2F7500-series_.22Beckton.22 Intel Xeon Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/Power7 IBM Power 7 Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/Opteron#Opteron_.2845_nm_SOI.29 AMD Opteron Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;&amp;gt;[http://www.supermicro.com/products/system/4U/8046/SYS-8046B-TRLF.cfm Supermicro Chassis]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;&amp;gt;[http://h20341.www2.hp.com/integrity/us/en/high-end/integrity-high-end-servers-superdome2.html HP Chassis]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;&amp;gt;[http://www-03.ibm.com/systems/bladecenter/hardware/chassis/bladeht/index.html IBM Chassis]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer&amp;quot;&amp;gt;[http://top500.org/lists/2011/11/press-release K Computer]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/19-inch_rack 19 inch Rack]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;alewife&amp;quot;&amp;gt;[http://webcache.googleusercontent.com/search?q=cache:-oLJbStOeAEJ:groups.csail.mit.edu/cag/pub/papers/chaiken-thesis.ps.Z+&amp;amp;cd=1&amp;amp;hl=en&amp;amp;ct=clnk&amp;amp;gl=us&amp;amp;client=firefox-a Cache Coherence Protocols for Large Scale Multiprocessors]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;blacklight&amp;quot;&amp;gt;[http://www.psc.edu/machines/sgi/uv/blacklight.php Blacklight Multiprocessor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;topology&amp;quot;&amp;gt;[http://en.wikibooks.org/wiki/Communication_Networks/Network_Topologies Network Topologies]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer image&amp;quot;&amp;gt;[http://www.top500.org/files/systems/k.jpg K Computer Image]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ring&amp;quot;&amp;gt;[http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.133.6894&amp;amp;rep=rep1&amp;amp;type=pdf Ring Network Example]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;mesh&amp;quot;&amp;gt;[http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.48.4149&amp;amp;rep=rep1&amp;amp;type=pdf Mesh Network Example]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ccNUMA&amp;quot;&amp;gt;[http://www.top500.org/2007_overview_recent_supercomputers/ccnuma_machines ccNuma Machines]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;roadrunner&amp;quot;&amp;gt;[http://www.leb.eei.uni-erlangen.de/winterakademie/2009/report/content/course02/pdf/0211.pdf Supercomputer Architecture]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;butterfly&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/BBN_Butterfly BBN Butterfly Supercomputer]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ncube&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/NCUBE nCUBE 1]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62771</id>
		<title>CSC 456 Spring 2012/11b AB</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62771"/>
		<updated>2012-04-26T04:52:42Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: /* Network Topology */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Large-Scale Multiprocessors=&lt;br /&gt;
With modern technology, large-scale multiprocessors (LSMs) have become more prevalent.  There has been considerable research into networking topologies for connecting the processors, and several methods have been conceived to ensure coherence.  Additionally, there are numerous manufacturers who make the materials necessary to build LSMs.  In this article, we show examples of each of these.  &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Manufacturers==&lt;br /&gt;
&lt;br /&gt;
In order to build a LSM, you will need to choose the right processors, as well as the most appropriate cabinet(s) to place them in.  There are several different manufacturers of processors and cabinets that can be used in LSM configurations.  For example, Fujitsu's K computer (the number one ranked supercomputer on TOP500's November 2011 list) uses a configuration of 88,128 SPARC64 VIIIfx processors.  This means it has a total of 705,024 cores at its use.&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;  Additional examples of processors used in LSMs can be found in Table 1.&lt;br /&gt;
&lt;br /&gt;
[[File:kcomputer.jpg|thumb|right|none|upright=2|alt=alt text|Fujitsu's K Computer&amp;lt;ref name=&amp;quot;k computer image&amp;quot;/&amp;gt;]] &lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 1: Processor Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Processor&lt;br /&gt;
! Year&lt;br /&gt;
! Cores&lt;br /&gt;
! Clock Rate&lt;br /&gt;
! Architecture&lt;br /&gt;
|-&lt;br /&gt;
| Fujitsu&lt;br /&gt;
| SPARC64 VIIIfx&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2009&lt;br /&gt;
| 8&lt;br /&gt;
| 2.0 GHz&lt;br /&gt;
| SPARC&lt;br /&gt;
|-&lt;br /&gt;
| Intel&lt;br /&gt;
| Xeon 7500&amp;lt;ref name=&amp;quot;intel proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2010&lt;br /&gt;
| 8&lt;br /&gt;
| 1.733-2.667 GHz&lt;br /&gt;
| Nehalem&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| POWER7&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2010&lt;br /&gt;
| 8&lt;br /&gt;
| 2.4-4.25 GHz&lt;br /&gt;
| Power ISA v.2.06&lt;br /&gt;
|-&lt;br /&gt;
| AMD&lt;br /&gt;
| Opteron 6100&amp;lt;ref name=&amp;quot;amd proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2010&lt;br /&gt;
| 12&lt;br /&gt;
| 1.7-2.4 GHz&lt;br /&gt;
| Direct Connect 2.0&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Like processors, different manufacturers offer varying cabinet/server types, such as IBM's BladeCenter HT.  This particular model uses their CoolBlue technology, a set of tools that allows the user to have greater control over cooling and power use.  There are also some standard cabinet frames, such as 19-inch racks, which get their name from the 19-inch panels used in their design.  Typically, these racks allow for easy processor/server installation and removal. &amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 2: Cabinet Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Cabinet &lt;br /&gt;
! Blade Count&lt;br /&gt;
|-&lt;br /&gt;
| SuperMicro&lt;br /&gt;
| MP Superserver 8064B-TRLF&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| HP&lt;br /&gt;
| Integrity Superdome 2&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 32&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| BladeCenter HT&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Assembling==&lt;br /&gt;
&lt;br /&gt;
===Network Topology===&lt;br /&gt;
There are many different ways to connect the network of processors. Each network type has different properties and values related to their diameter, bisection bandwidth, and degree. The diameter of a network is the longest number of network hops between any pair of nodes. Bisection bandwidth refers to the minimum number of links that need to be cut to divide the network in half. The degree of a network refers to the number of in/out links on each node. The following figure displays some examples.&lt;br /&gt;
&lt;br /&gt;
[[File:NetworkTopologies.png|center|left|An example of possible network structures.&amp;lt;ref name = &amp;quot;topology&amp;quot;/&amp;gt;]]&lt;br /&gt;
&lt;br /&gt;
The following table gives some more detail on the different characteristics of some network types. &amp;quot;p&amp;quot; is the number of nodes, &amp;quot;d&amp;quot; is dimensions, and &amp;quot;k&amp;quot; is the number of nodes in each dimension.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 3: Network Properties&lt;br /&gt;
|-&lt;br /&gt;
! Topology&lt;br /&gt;
! Diameter&lt;br /&gt;
! Bandwidth&lt;br /&gt;
! Degree&lt;br /&gt;
! Example(s)&lt;br /&gt;
|-&lt;br /&gt;
| Ring&lt;br /&gt;
| p/2&lt;br /&gt;
| 2&lt;br /&gt;
| 2&lt;br /&gt;
| KSR-1, NUMA-chine &amp;lt;ref name=&amp;quot;ring&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| k-ary d Mesh&lt;br /&gt;
| 2(sqrt(p) - 1)&lt;br /&gt;
| sqrt(p)&lt;br /&gt;
| 4&lt;br /&gt;
| Intel Paragon, Cray T3D &amp;lt;ref name=&amp;quot;mesh&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Butterfly&lt;br /&gt;
| log_2(p)&lt;br /&gt;
| p/2&lt;br /&gt;
| 4&lt;br /&gt;
| BBN Butterfly&amp;lt;ref name=&amp;quot;butterfly&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| k-ary Tree&lt;br /&gt;
| 2 x log_k(p)&lt;br /&gt;
| 1&lt;br /&gt;
| k+1&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| Hypercube&lt;br /&gt;
| log_2(p)&lt;br /&gt;
| p/2&lt;br /&gt;
| log_2(p)&lt;br /&gt;
| nCUBE 1&amp;lt;ref name=&amp;quot;ncube&amp;quot;/&amp;gt;&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Coherence==&lt;br /&gt;
&lt;br /&gt;
For LSMs that use a Distributed Shared Memory (DSM) architecture, cache coherence is an important issue. In 1990, researchers at the Massachusetts Institute of Technology showed that it was possible to build to build a coherent LSM using a directory-based approach with the Alewife multiprocessor.&amp;lt;ref name=&amp;quot;alewife&amp;quot;/&amp;gt;  A modern example is the Pittsburgh Supercomputing Center's Blacklight, a supercomputer with hardware-enabled shared coherent memory.&amp;lt;ref name=&amp;quot;blacklight&amp;quot;/&amp;gt;  &lt;br /&gt;
&lt;br /&gt;
On the other hand, some LSMs use distributed memory systems, meaning that each of the processors has its own private memory, making cache coherency a non-issue.  Fujitsu's K computer is an example of such a system.&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Another example of a memory design used by LSMs is Non Uniform Memory Access (NUMA).  NUMA has a coherent version of its system, called cache coherent NUMA (ccNUMA), where data and memory is accessed globally.&amp;lt;ref name=&amp;quot;ccNUMA&amp;quot;/&amp;gt;  The 2008 IBM Roadrunner supercomputer, which has 6480 Opteron processors and 12960 IBM Cell processors, uses ccNUMA.&amp;lt;ref name=&amp;quot;roadrunner&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/SPARC64_VIIIfx Fujitsu SPARC64 VIIIfx Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/Xeon#6500.2F7500-series_.22Beckton.22 Intel Xeon Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/Power7 IBM Power 7 Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/Opteron#Opteron_.2845_nm_SOI.29 AMD Opteron Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;&amp;gt;[http://www.supermicro.com/products/system/4U/8046/SYS-8046B-TRLF.cfm Supermicro Chassis]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;&amp;gt;[http://h20341.www2.hp.com/integrity/us/en/high-end/integrity-high-end-servers-superdome2.html HP Chassis]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;&amp;gt;[http://www-03.ibm.com/systems/bladecenter/hardware/chassis/bladeht/index.html IBM Chassis]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer&amp;quot;&amp;gt;[http://top500.org/lists/2011/11/press-release K Computer]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/19-inch_rack 19 inch Rack]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;alewife&amp;quot;&amp;gt;[http://webcache.googleusercontent.com/search?q=cache:-oLJbStOeAEJ:groups.csail.mit.edu/cag/pub/papers/chaiken-thesis.ps.Z+&amp;amp;cd=1&amp;amp;hl=en&amp;amp;ct=clnk&amp;amp;gl=us&amp;amp;client=firefox-a Cache Coherence Protocols for Large Scale Multiprocessors]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;blacklight&amp;quot;&amp;gt;[http://www.psc.edu/machines/sgi/uv/blacklight.php Blacklight Multiprocessor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;topology&amp;quot;&amp;gt;[http://en.wikibooks.org/wiki/Communication_Networks/Network_Topologies Network Topologies]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer image&amp;quot;&amp;gt;[http://www.top500.org/files/systems/k.jpg K Computer Image]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ring&amp;quot;&amp;gt;[http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.133.6894&amp;amp;rep=rep1&amp;amp;type=pdf Ring Network Example]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;mesh&amp;quot;&amp;gt;[http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.48.4149&amp;amp;rep=rep1&amp;amp;type=pdf Mesh Network Example]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ccNUMA&amp;quot;&amp;gt;[http://www.top500.org/2007_overview_recent_supercomputers/ccnuma_machines ccNuma Machines]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;roadrunner&amp;quot;&amp;gt;[http://www.leb.eei.uni-erlangen.de/winterakademie/2009/report/content/course02/pdf/0211.pdf Supercomputer Architecture]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;butterfly&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/BBN_Butterfly BBN Butterfly Supercomputer]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62770</id>
		<title>CSC 456 Spring 2012/11b AB</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62770"/>
		<updated>2012-04-26T04:49:12Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: /* Network Topology */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Large-Scale Multiprocessors=&lt;br /&gt;
With modern technology, large-scale multiprocessors (LSMs) have become more prevalent.  There has been considerable research into networking topologies for connecting the processors, and several methods have been conceived to ensure coherence.  Additionally, there are numerous manufacturers who make the materials necessary to build LSMs.  In this article, we show examples of each of these.  &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Manufacturers==&lt;br /&gt;
&lt;br /&gt;
In order to build a LSM, you will need to choose the right processors, as well as the most appropriate cabinet(s) to place them in.  There are several different manufacturers of processors and cabinets that can be used in LSM configurations.  For example, Fujitsu's K computer (the number one ranked supercomputer on TOP500's November 2011 list) uses a configuration of 88,128 SPARC64 VIIIfx processors.  This means it has a total of 705,024 cores at its use.&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;  Additional examples of processors used in LSMs can be found in Table 1.&lt;br /&gt;
&lt;br /&gt;
[[File:kcomputer.jpg|thumb|right|none|upright=2|alt=alt text|Fujitsu's K Computer&amp;lt;ref name=&amp;quot;k computer image&amp;quot;/&amp;gt;]] &lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 1: Processor Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Processor&lt;br /&gt;
! Year&lt;br /&gt;
! Cores&lt;br /&gt;
! Clock Rate&lt;br /&gt;
! Architecture&lt;br /&gt;
|-&lt;br /&gt;
| Fujitsu&lt;br /&gt;
| SPARC64 VIIIfx&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2009&lt;br /&gt;
| 8&lt;br /&gt;
| 2.0 GHz&lt;br /&gt;
| SPARC&lt;br /&gt;
|-&lt;br /&gt;
| Intel&lt;br /&gt;
| Xeon 7500&amp;lt;ref name=&amp;quot;intel proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2010&lt;br /&gt;
| 8&lt;br /&gt;
| 1.733-2.667 GHz&lt;br /&gt;
| Nehalem&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| POWER7&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2010&lt;br /&gt;
| 8&lt;br /&gt;
| 2.4-4.25 GHz&lt;br /&gt;
| Power ISA v.2.06&lt;br /&gt;
|-&lt;br /&gt;
| AMD&lt;br /&gt;
| Opteron 6100&amp;lt;ref name=&amp;quot;amd proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2010&lt;br /&gt;
| 12&lt;br /&gt;
| 1.7-2.4 GHz&lt;br /&gt;
| Direct Connect 2.0&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Like processors, different manufacturers offer varying cabinet/server types, such as IBM's BladeCenter HT.  This particular model uses their CoolBlue technology, a set of tools that allows the user to have greater control over cooling and power use.  There are also some standard cabinet frames, such as 19-inch racks, which get their name from the 19-inch panels used in their design.  Typically, these racks allow for easy processor/server installation and removal. &amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 2: Cabinet Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Cabinet &lt;br /&gt;
! Blade Count&lt;br /&gt;
|-&lt;br /&gt;
| SuperMicro&lt;br /&gt;
| MP Superserver 8064B-TRLF&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| HP&lt;br /&gt;
| Integrity Superdome 2&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 32&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| BladeCenter HT&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Assembling==&lt;br /&gt;
&lt;br /&gt;
===Network Topology===&lt;br /&gt;
There are many different ways to connect the network of processors. Each network type has different properties and values related to their diameter, bisection bandwidth, and degree. The diameter of a network is the longest number of network hops between any pair of nodes. Bisection bandwidth refers to the minimum number of links that need to be cut to divide the network in half. The degree of a network refers to the number of in/out links on each node. The following figure displays some examples.&lt;br /&gt;
&lt;br /&gt;
[[File:NetworkTopologies.png|center|left|An example of possible network structures.&amp;lt;ref name = &amp;quot;topology&amp;quot;/&amp;gt;]]&lt;br /&gt;
&lt;br /&gt;
The following table gives some more detail on the different characteristics of some network types. &amp;quot;p&amp;quot; is the number of nodes, &amp;quot;d&amp;quot; is dimensions, and &amp;quot;k&amp;quot; is the number of nodes in each dimension.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 3: Network Properties&lt;br /&gt;
|-&lt;br /&gt;
! Topology&lt;br /&gt;
! Diameter&lt;br /&gt;
! Bandwidth&lt;br /&gt;
! Degree&lt;br /&gt;
! Example(s)&lt;br /&gt;
|-&lt;br /&gt;
| Ring&lt;br /&gt;
| p/2&lt;br /&gt;
| 2&lt;br /&gt;
| 2&lt;br /&gt;
| KSR-1, NUMA-chine &amp;lt;ref name=&amp;quot;ring&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| k-ary d Mesh&lt;br /&gt;
| 2(sqrt(p) - 1)&lt;br /&gt;
| sqrt(p)&lt;br /&gt;
| 4&lt;br /&gt;
| Intel Paragon, Cray T3D &amp;lt;ref name=&amp;quot;mesh&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Butterfly&lt;br /&gt;
| log_2(p)&lt;br /&gt;
| p/2&lt;br /&gt;
| 4&lt;br /&gt;
| BBN Butterfly&amp;lt;ref name=&amp;quot;butterfly&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| k-ary Tree&lt;br /&gt;
| 2 x log_k(p)&lt;br /&gt;
| 1&lt;br /&gt;
| k+1&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| Fully Connected&lt;br /&gt;
| log_2(p)&lt;br /&gt;
| p/2&lt;br /&gt;
| log_2(p)&lt;br /&gt;
| &lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Coherence==&lt;br /&gt;
&lt;br /&gt;
For LSMs that use a Distributed Shared Memory (DSM) architecture, cache coherence is an important issue. In 1990, researchers at the Massachusetts Institute of Technology showed that it was possible to build to build a coherent LSM using a directory-based approach with the Alewife multiprocessor.&amp;lt;ref name=&amp;quot;alewife&amp;quot;/&amp;gt;  A modern example is the Pittsburgh Supercomputing Center's Blacklight, a supercomputer with hardware-enabled shared coherent memory.&amp;lt;ref name=&amp;quot;blacklight&amp;quot;/&amp;gt;  &lt;br /&gt;
&lt;br /&gt;
On the other hand, some LSMs use distributed memory systems, meaning that each of the processors has its own private memory, making cache coherency a non-issue.  Fujitsu's K computer is an example of such a system.&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Another example of a memory design used by LSMs is Non Uniform Memory Access (NUMA).  NUMA has a coherent version of its system, called cache coherent NUMA (ccNUMA), where data and memory is accessed globally.&amp;lt;ref name=&amp;quot;ccNUMA&amp;quot;/&amp;gt;  The 2008 IBM Roadrunner supercomputer, which has 6480 Opteron processors and 12960 IBM Cell processors, uses ccNUMA.&amp;lt;ref name=&amp;quot;roadrunner&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/SPARC64_VIIIfx Fujitsu SPARC64 VIIIfx Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/Xeon#6500.2F7500-series_.22Beckton.22 Intel Xeon Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/Power7 IBM Power 7 Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/Opteron#Opteron_.2845_nm_SOI.29 AMD Opteron Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;&amp;gt;[http://www.supermicro.com/products/system/4U/8046/SYS-8046B-TRLF.cfm Supermicro Chassis]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;&amp;gt;[http://h20341.www2.hp.com/integrity/us/en/high-end/integrity-high-end-servers-superdome2.html HP Chassis]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;&amp;gt;[http://www-03.ibm.com/systems/bladecenter/hardware/chassis/bladeht/index.html IBM Chassis]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer&amp;quot;&amp;gt;[http://top500.org/lists/2011/11/press-release K Computer]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/19-inch_rack 19 inch Rack]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;alewife&amp;quot;&amp;gt;[http://webcache.googleusercontent.com/search?q=cache:-oLJbStOeAEJ:groups.csail.mit.edu/cag/pub/papers/chaiken-thesis.ps.Z+&amp;amp;cd=1&amp;amp;hl=en&amp;amp;ct=clnk&amp;amp;gl=us&amp;amp;client=firefox-a Cache Coherence Protocols for Large Scale Multiprocessors]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;blacklight&amp;quot;&amp;gt;[http://www.psc.edu/machines/sgi/uv/blacklight.php Blacklight Multiprocessor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;topology&amp;quot;&amp;gt;[http://en.wikibooks.org/wiki/Communication_Networks/Network_Topologies Network Topologies]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer image&amp;quot;&amp;gt;[http://www.top500.org/files/systems/k.jpg K Computer Image]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ring&amp;quot;&amp;gt;[http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.133.6894&amp;amp;rep=rep1&amp;amp;type=pdf Ring Network Example]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;mesh&amp;quot;&amp;gt;[http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.48.4149&amp;amp;rep=rep1&amp;amp;type=pdf Mesh Network Example]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ccNUMA&amp;quot;&amp;gt;[http://www.top500.org/2007_overview_recent_supercomputers/ccnuma_machines ccNuma Machines]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;roadrunner&amp;quot;&amp;gt;[http://www.leb.eei.uni-erlangen.de/winterakademie/2009/report/content/course02/pdf/0211.pdf Supercomputer Architecture]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;butterfly&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/BBN_Butterfly BBN Butterfly Supercomputer]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62769</id>
		<title>CSC 456 Spring 2012/11b AB</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62769"/>
		<updated>2012-04-26T04:47:54Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: /* References */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Large-Scale Multiprocessors=&lt;br /&gt;
With modern technology, large-scale multiprocessors (LSMs) have become more prevalent.  There has been considerable research into networking topologies for connecting the processors, and several methods have been conceived to ensure coherence.  Additionally, there are numerous manufacturers who make the materials necessary to build LSMs.  In this article, we show examples of each of these.  &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Manufacturers==&lt;br /&gt;
&lt;br /&gt;
In order to build a LSM, you will need to choose the right processors, as well as the most appropriate cabinet(s) to place them in.  There are several different manufacturers of processors and cabinets that can be used in LSM configurations.  For example, Fujitsu's K computer (the number one ranked supercomputer on TOP500's November 2011 list) uses a configuration of 88,128 SPARC64 VIIIfx processors.  This means it has a total of 705,024 cores at its use.&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;  Additional examples of processors used in LSMs can be found in Table 1.&lt;br /&gt;
&lt;br /&gt;
[[File:kcomputer.jpg|thumb|right|none|upright=2|alt=alt text|Fujitsu's K Computer&amp;lt;ref name=&amp;quot;k computer image&amp;quot;/&amp;gt;]] &lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 1: Processor Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Processor&lt;br /&gt;
! Year&lt;br /&gt;
! Cores&lt;br /&gt;
! Clock Rate&lt;br /&gt;
! Architecture&lt;br /&gt;
|-&lt;br /&gt;
| Fujitsu&lt;br /&gt;
| SPARC64 VIIIfx&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2009&lt;br /&gt;
| 8&lt;br /&gt;
| 2.0 GHz&lt;br /&gt;
| SPARC&lt;br /&gt;
|-&lt;br /&gt;
| Intel&lt;br /&gt;
| Xeon 7500&amp;lt;ref name=&amp;quot;intel proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2010&lt;br /&gt;
| 8&lt;br /&gt;
| 1.733-2.667 GHz&lt;br /&gt;
| Nehalem&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| POWER7&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2010&lt;br /&gt;
| 8&lt;br /&gt;
| 2.4-4.25 GHz&lt;br /&gt;
| Power ISA v.2.06&lt;br /&gt;
|-&lt;br /&gt;
| AMD&lt;br /&gt;
| Opteron 6100&amp;lt;ref name=&amp;quot;amd proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2010&lt;br /&gt;
| 12&lt;br /&gt;
| 1.7-2.4 GHz&lt;br /&gt;
| Direct Connect 2.0&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Like processors, different manufacturers offer varying cabinet/server types, such as IBM's BladeCenter HT.  This particular model uses their CoolBlue technology, a set of tools that allows the user to have greater control over cooling and power use.  There are also some standard cabinet frames, such as 19-inch racks, which get their name from the 19-inch panels used in their design.  Typically, these racks allow for easy processor/server installation and removal. &amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 2: Cabinet Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Cabinet &lt;br /&gt;
! Blade Count&lt;br /&gt;
|-&lt;br /&gt;
| SuperMicro&lt;br /&gt;
| MP Superserver 8064B-TRLF&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| HP&lt;br /&gt;
| Integrity Superdome 2&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 32&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| BladeCenter HT&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Assembling==&lt;br /&gt;
&lt;br /&gt;
===Network Topology===&lt;br /&gt;
There are many different ways to connect the network of processors. Each network type has different properties and values related to their diameter, bisection bandwidth, and degree. The diameter of a network is the longest number of network hops between any pair of nodes. Bisection bandwidth refers to the minimum number of links that need to be cut to divide the network in half. The degree of a network refers to the number of in/out links on each node. The following figure displays some examples.&lt;br /&gt;
&lt;br /&gt;
[[File:NetworkTopologies.png|center|left|An example of possible network structures.&amp;lt;ref name = &amp;quot;topology&amp;quot;/&amp;gt;]]&lt;br /&gt;
&lt;br /&gt;
The following table gives some more detail on the different characteristics of some network types. &amp;quot;p&amp;quot; is the number of nodes, &amp;quot;d&amp;quot; is dimensions, and &amp;quot;k&amp;quot; is the number of nodes in each dimension.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 3: Network Properties&lt;br /&gt;
|-&lt;br /&gt;
! Topology&lt;br /&gt;
! Diameter&lt;br /&gt;
! Bandwidth&lt;br /&gt;
! Degree&lt;br /&gt;
! Example(s)&lt;br /&gt;
|-&lt;br /&gt;
| Ring&lt;br /&gt;
| p/2&lt;br /&gt;
| 2&lt;br /&gt;
| 2&lt;br /&gt;
| KSR-1, NUMA-chine &amp;lt;ref name=&amp;quot;ring&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| k-ary d Mesh&lt;br /&gt;
| 2(sqrt(p) - 1)&lt;br /&gt;
| sqrt(p)&lt;br /&gt;
| 4&lt;br /&gt;
| Intel Paragon, Cray T3D &amp;lt;ref name=&amp;quot;mesh&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Line&lt;br /&gt;
| p - 1&lt;br /&gt;
| 1&lt;br /&gt;
| 2&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| k-ary Tree&lt;br /&gt;
| 2 x log_k(p)&lt;br /&gt;
| 1&lt;br /&gt;
| k+1&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| Fully Connected&lt;br /&gt;
| log_2(p)&lt;br /&gt;
| p/2&lt;br /&gt;
| log_2(p)&lt;br /&gt;
| &lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Coherence==&lt;br /&gt;
&lt;br /&gt;
For LSMs that use a Distributed Shared Memory (DSM) architecture, cache coherence is an important issue. In 1990, researchers at the Massachusetts Institute of Technology showed that it was possible to build to build a coherent LSM using a directory-based approach with the Alewife multiprocessor.&amp;lt;ref name=&amp;quot;alewife&amp;quot;/&amp;gt;  A modern example is the Pittsburgh Supercomputing Center's Blacklight, a supercomputer with hardware-enabled shared coherent memory.&amp;lt;ref name=&amp;quot;blacklight&amp;quot;/&amp;gt;  &lt;br /&gt;
&lt;br /&gt;
On the other hand, some LSMs use distributed memory systems, meaning that each of the processors has its own private memory, making cache coherency a non-issue.  Fujitsu's K computer is an example of such a system.&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Another example of a memory design used by LSMs is Non Uniform Memory Access (NUMA).  NUMA has a coherent version of its system, called cache coherent NUMA (ccNUMA), where data and memory is accessed globally.&amp;lt;ref name=&amp;quot;ccNUMA&amp;quot;/&amp;gt;  The 2008 IBM Roadrunner supercomputer, which has 6480 Opteron processors and 12960 IBM Cell processors, uses ccNUMA.&amp;lt;ref name=&amp;quot;roadrunner&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/SPARC64_VIIIfx Fujitsu SPARC64 VIIIfx Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/Xeon#6500.2F7500-series_.22Beckton.22 Intel Xeon Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/Power7 IBM Power 7 Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/Opteron#Opteron_.2845_nm_SOI.29 AMD Opteron Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;&amp;gt;[http://www.supermicro.com/products/system/4U/8046/SYS-8046B-TRLF.cfm Supermicro Chassis]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;&amp;gt;[http://h20341.www2.hp.com/integrity/us/en/high-end/integrity-high-end-servers-superdome2.html HP Chassis]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;&amp;gt;[http://www-03.ibm.com/systems/bladecenter/hardware/chassis/bladeht/index.html IBM Chassis]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer&amp;quot;&amp;gt;[http://top500.org/lists/2011/11/press-release K Computer]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/19-inch_rack 19 inch Rack]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;alewife&amp;quot;&amp;gt;[http://webcache.googleusercontent.com/search?q=cache:-oLJbStOeAEJ:groups.csail.mit.edu/cag/pub/papers/chaiken-thesis.ps.Z+&amp;amp;cd=1&amp;amp;hl=en&amp;amp;ct=clnk&amp;amp;gl=us&amp;amp;client=firefox-a Cache Coherence Protocols for Large Scale Multiprocessors]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;blacklight&amp;quot;&amp;gt;[http://www.psc.edu/machines/sgi/uv/blacklight.php Blacklight Multiprocessor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;topology&amp;quot;&amp;gt;[http://en.wikibooks.org/wiki/Communication_Networks/Network_Topologies Network Topologies]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer image&amp;quot;&amp;gt;[http://www.top500.org/files/systems/k.jpg K Computer Image]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ring&amp;quot;&amp;gt;[http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.133.6894&amp;amp;rep=rep1&amp;amp;type=pdf Ring Network Example]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;mesh&amp;quot;&amp;gt;[http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.48.4149&amp;amp;rep=rep1&amp;amp;type=pdf Mesh Network Example]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ccNUMA&amp;quot;&amp;gt;[http://www.top500.org/2007_overview_recent_supercomputers/ccnuma_machines ccNuma Machines]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;roadrunner&amp;quot;&amp;gt;[http://www.leb.eei.uni-erlangen.de/winterakademie/2009/report/content/course02/pdf/0211.pdf Supercomputer Architecture]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;butterfly&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/BBN_Butterfly BBN Butterfly Supercomputer]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62768</id>
		<title>CSC 456 Spring 2012/11b AB</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62768"/>
		<updated>2012-04-26T04:39:35Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: /* References */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Large-Scale Multiprocessors=&lt;br /&gt;
With modern technology, large-scale multiprocessors (LSMs) have become more prevalent.  There has been considerable research into networking topologies for connecting the processors, and several methods have been conceived to ensure coherence.  Additionally, there are numerous manufacturers who make the materials necessary to build LSMs.  In this article, we show examples of each of these.  &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Manufacturers==&lt;br /&gt;
&lt;br /&gt;
In order to build a LSM, you will need to choose the right processors, as well as the most appropriate cabinet(s) to place them in.  There are several different manufacturers of processors and cabinets that can be used in LSM configurations.  For example, Fujitsu's K computer (the number one ranked supercomputer on TOP500's November 2011 list) uses a configuration of 88,128 SPARC64 VIIIfx processors.  This means it has a total of 705,024 cores at its use.&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;  Additional examples of processors used in LSMs can be found in Table 1.&lt;br /&gt;
&lt;br /&gt;
[[File:kcomputer.jpg|thumb|right|none|upright=2|alt=alt text|Fujitsu's K Computer&amp;lt;ref name=&amp;quot;k computer image&amp;quot;/&amp;gt;]] &lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 1: Processor Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Processor&lt;br /&gt;
! Year&lt;br /&gt;
! Cores&lt;br /&gt;
! Clock Rate&lt;br /&gt;
! Architecture&lt;br /&gt;
|-&lt;br /&gt;
| Fujitsu&lt;br /&gt;
| SPARC64 VIIIfx&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2009&lt;br /&gt;
| 8&lt;br /&gt;
| 2.0 GHz&lt;br /&gt;
| SPARC&lt;br /&gt;
|-&lt;br /&gt;
| Intel&lt;br /&gt;
| Xeon 7500&amp;lt;ref name=&amp;quot;intel proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2010&lt;br /&gt;
| 8&lt;br /&gt;
| 1.733-2.667 GHz&lt;br /&gt;
| Nehalem&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| POWER7&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2010&lt;br /&gt;
| 8&lt;br /&gt;
| 2.4-4.25 GHz&lt;br /&gt;
| Power ISA v.2.06&lt;br /&gt;
|-&lt;br /&gt;
| AMD&lt;br /&gt;
| Opteron 6100&amp;lt;ref name=&amp;quot;amd proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2010&lt;br /&gt;
| 12&lt;br /&gt;
| 1.7-2.4 GHz&lt;br /&gt;
| Direct Connect 2.0&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Like processors, different manufacturers offer varying cabinet/server types, such as IBM's BladeCenter HT.  This particular model uses their CoolBlue technology, a set of tools that allows the user to have greater control over cooling and power use.  There are also some standard cabinet frames, such as 19-inch racks, which get their name from the 19-inch panels used in their design.  Typically, these racks allow for easy processor/server installation and removal. &amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 2: Cabinet Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Cabinet &lt;br /&gt;
! Blade Count&lt;br /&gt;
|-&lt;br /&gt;
| SuperMicro&lt;br /&gt;
| MP Superserver 8064B-TRLF&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| HP&lt;br /&gt;
| Integrity Superdome 2&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 32&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| BladeCenter HT&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Assembling==&lt;br /&gt;
&lt;br /&gt;
===Network Topology===&lt;br /&gt;
There are many different ways to connect the network of processors. Each network type has different properties and values related to their diameter, bisection bandwidth, and degree. The diameter of a network is the longest number of network hops between any pair of nodes. Bisection bandwidth refers to the minimum number of links that need to be cut to divide the network in half. The degree of a network refers to the number of in/out links on each node. The following figure displays some examples.&lt;br /&gt;
&lt;br /&gt;
[[File:NetworkTopologies.png|center|left|An example of possible network structures.&amp;lt;ref name = &amp;quot;topology&amp;quot;/&amp;gt;]]&lt;br /&gt;
&lt;br /&gt;
The following table gives some more detail on the different characteristics of some network types. &amp;quot;p&amp;quot; is the number of nodes, &amp;quot;d&amp;quot; is dimensions, and &amp;quot;k&amp;quot; is the number of nodes in each dimension.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 3: Network Properties&lt;br /&gt;
|-&lt;br /&gt;
! Topology&lt;br /&gt;
! Diameter&lt;br /&gt;
! Bandwidth&lt;br /&gt;
! Degree&lt;br /&gt;
! Example(s)&lt;br /&gt;
|-&lt;br /&gt;
| Ring&lt;br /&gt;
| p/2&lt;br /&gt;
| 2&lt;br /&gt;
| 2&lt;br /&gt;
| KSR-1, NUMA-chine &amp;lt;ref name=&amp;quot;ring&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| k-ary d Mesh&lt;br /&gt;
| 2(sqrt(p) - 1)&lt;br /&gt;
| sqrt(p)&lt;br /&gt;
| 4&lt;br /&gt;
| Intel Paragon, Cray T3D &amp;lt;ref name=&amp;quot;mesh&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Line&lt;br /&gt;
| p - 1&lt;br /&gt;
| 1&lt;br /&gt;
| 2&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| k-ary Tree&lt;br /&gt;
| 2 x log_k(p)&lt;br /&gt;
| 1&lt;br /&gt;
| k+1&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| Fully Connected&lt;br /&gt;
| log_2(p)&lt;br /&gt;
| p/2&lt;br /&gt;
| log_2(p)&lt;br /&gt;
| &lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Coherence==&lt;br /&gt;
&lt;br /&gt;
For LSMs that use a Distributed Shared Memory (DSM) architecture, cache coherence is an important issue. In 1990, researchers at the Massachusetts Institute of Technology showed that it was possible to build to build a coherent LSM using a directory-based approach with the Alewife multiprocessor.&amp;lt;ref name=&amp;quot;alewife&amp;quot;/&amp;gt;  A modern example is the Pittsburgh Supercomputing Center's Blacklight, a supercomputer with hardware-enabled shared coherent memory.&amp;lt;ref name=&amp;quot;blacklight&amp;quot;/&amp;gt;  &lt;br /&gt;
&lt;br /&gt;
On the other hand, some LSMs use distributed memory systems, meaning that each of the processors has its own private memory, making cache coherency a non-issue.  Fujitsu's K computer is an example of such a system.&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Another example of a memory design used by LSMs is Non Uniform Memory Access (NUMA).  NUMA has a coherent version of its system, called cache coherent NUMA (ccNUMA), where data and memory is accessed globally.&amp;lt;ref name=&amp;quot;ccNUMA&amp;quot;/&amp;gt;  The 2008 IBM Roadrunner supercomputer, which has 6480 Opteron processors and 12960 IBM Cell processors, uses ccNUMA.&amp;lt;ref name=&amp;quot;roadrunner&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/SPARC64_VIIIfx Fujitsu SPARC64 VIIIfx Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/Xeon#6500.2F7500-series_.22Beckton.22 Intel Xeon Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/Power7 IBM Power 7 Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/Opteron#Opteron_.2845_nm_SOI.29 AMD Opteron Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;&amp;gt;[http://www.supermicro.com/products/system/4U/8046/SYS-8046B-TRLF.cfm Supermicro Chassis]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;&amp;gt;[http://h20341.www2.hp.com/integrity/us/en/high-end/integrity-high-end-servers-superdome2.html HP Chassis]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;&amp;gt;[http://www-03.ibm.com/systems/bladecenter/hardware/chassis/bladeht/index.html IBM Chassis]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer&amp;quot;&amp;gt;[http://top500.org/lists/2011/11/press-release K Computer]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/19-inch_rack 19 inch Rack]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;alewife&amp;quot;&amp;gt;[http://webcache.googleusercontent.com/search?q=cache:-oLJbStOeAEJ:groups.csail.mit.edu/cag/pub/papers/chaiken-thesis.ps.Z+&amp;amp;cd=1&amp;amp;hl=en&amp;amp;ct=clnk&amp;amp;gl=us&amp;amp;client=firefox-a Cache Coherence Protocols for Large Scale Multiprocessors]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;blacklight&amp;quot;&amp;gt;[http://www.psc.edu/machines/sgi/uv/blacklight.php Blacklight Multiprocessor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;topology&amp;quot;&amp;gt;[http://en.wikibooks.org/wiki/Communication_Networks/Network_Topologies Network Topologies]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer image&amp;quot;&amp;gt;[http://www.top500.org/files/systems/k.jpg K Computer Image]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ring&amp;quot;&amp;gt;[http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.133.6894&amp;amp;rep=rep1&amp;amp;type=pdf Ring Network Example]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;mesh&amp;quot;&amp;gt;[http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.48.4149&amp;amp;rep=rep1&amp;amp;type=pdf Mesh Network Example]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ccNUMA&amp;quot;&amp;gt;[http://www.top500.org/2007_overview_recent_supercomputers/ccnuma_machines ccNuma Machines]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;roadrunner&amp;quot;&amp;gt;[http://www.leb.eei.uni-erlangen.de/winterakademie/2009/report/content/course02/pdf/0211.pdf Supercomputer Architecture]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62767</id>
		<title>CSC 456 Spring 2012/11b AB</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62767"/>
		<updated>2012-04-26T04:32:09Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: /* References */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Large-Scale Multiprocessors=&lt;br /&gt;
With modern technology, large-scale multiprocessors (LSMs) have become more prevalent.  There has been considerable research into networking topologies for connecting the processors, and several methods have been conceived to ensure coherence.  Additionally, there are numerous manufacturers who make the materials necessary to build LSMs.  In this article, we show examples of each of these.  &lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
==Manufacturers==&lt;br /&gt;
&lt;br /&gt;
In order to build a LSM, you will need to choose the right processors, as well as the most appropriate cabinet(s) to place them in.  There are several different manufacturers of processors and cabinets that can be used in LSM configurations.  For example, Fujitsu's K computer (the number one ranked supercomputer on TOP500's November 2011 list) uses a configuration of 88,128 SPARC64 VIIIfx processors.  This means it has a total of 705,024 cores at its use.&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;  Additional examples of processors used in LSMs can be found in Table 1.&lt;br /&gt;
&lt;br /&gt;
[[File:kcomputer.jpg|thumb|right|none|upright=2|alt=alt text|Fujitsu's K Computer&amp;lt;ref name=&amp;quot;k computer image&amp;quot;/&amp;gt;]] &lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 1: Processor Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Processor&lt;br /&gt;
! Year&lt;br /&gt;
! Cores&lt;br /&gt;
! Clock Rate&lt;br /&gt;
! Architecture&lt;br /&gt;
|-&lt;br /&gt;
| Fujitsu&lt;br /&gt;
| SPARC64 VIIIfx&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2009&lt;br /&gt;
| 8&lt;br /&gt;
| 2.0 GHz&lt;br /&gt;
| SPARC&lt;br /&gt;
|-&lt;br /&gt;
| Intel&lt;br /&gt;
| Xeon 7500&amp;lt;ref name=&amp;quot;intel proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2010&lt;br /&gt;
| 8&lt;br /&gt;
| 1.733-2.667 GHz&lt;br /&gt;
| Nehalem&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| POWER7&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2010&lt;br /&gt;
| 8&lt;br /&gt;
| 2.4-4.25 GHz&lt;br /&gt;
| Power ISA v.2.06&lt;br /&gt;
|-&lt;br /&gt;
| AMD&lt;br /&gt;
| Opteron 6100&amp;lt;ref name=&amp;quot;amd proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 2010&lt;br /&gt;
| 12&lt;br /&gt;
| 1.7-2.4 GHz&lt;br /&gt;
| Direct Connect 2.0&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Like processors, different manufacturers offer varying cabinet/server types, such as IBM's BladeCenter HT.  This particular model uses their CoolBlue technology, a set of tools that allows the user to have greater control over cooling and power use.  There are also some standard cabinet frames, such as 19-inch racks, which get their name from the 19-inch panels used in their design.  Typically, these racks allow for easy processor/server installation and removal. &amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 2: Cabinet Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Cabinet &lt;br /&gt;
! Blade Count&lt;br /&gt;
|-&lt;br /&gt;
| SuperMicro&lt;br /&gt;
| MP Superserver 8064B-TRLF&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| HP&lt;br /&gt;
| Integrity Superdome 2&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 32&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| BladeCenter HT&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Assembling==&lt;br /&gt;
&lt;br /&gt;
===Network Topology===&lt;br /&gt;
There are many different ways to connect the network of processors. Each network type has different properties and values related to their diameter, bisection bandwidth, and degree. The diameter of a network is the longest number of network hops between any pair of nodes. Bisection bandwidth refers to the minimum number of links that need to be cut to divide the network in half. The degree of a network refers to the number of in/out links on each node. The following figure displays some examples.&lt;br /&gt;
&lt;br /&gt;
[[File:NetworkTopologies.png|center|left|An example of possible network structures.&amp;lt;ref name = &amp;quot;topology&amp;quot;/&amp;gt;]]&lt;br /&gt;
&lt;br /&gt;
The following table gives some more detail on the different characteristics of some network types. &amp;quot;p&amp;quot; is the number of nodes, &amp;quot;d&amp;quot; is dimensions, and &amp;quot;k&amp;quot; is the number of nodes in each dimension.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 3: Network Properties&lt;br /&gt;
|-&lt;br /&gt;
! Topology&lt;br /&gt;
! Diameter&lt;br /&gt;
! Bandwidth&lt;br /&gt;
! Degree&lt;br /&gt;
! Example(s)&lt;br /&gt;
|-&lt;br /&gt;
| Ring&lt;br /&gt;
| p/2&lt;br /&gt;
| 2&lt;br /&gt;
| 2&lt;br /&gt;
| KSR-1, NUMA-chine &amp;lt;ref name=&amp;quot;ring&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| k-ary d Mesh&lt;br /&gt;
| 2(sqrt(p) - 1)&lt;br /&gt;
| sqrt(p)&lt;br /&gt;
| 4&lt;br /&gt;
| Intel Paragon, Cray T3D &amp;lt;ref name=&amp;quot;mesh&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
| Line&lt;br /&gt;
| p - 1&lt;br /&gt;
| 1&lt;br /&gt;
| 2&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| k-ary Tree&lt;br /&gt;
| 2 x log_k(p)&lt;br /&gt;
| 1&lt;br /&gt;
| k+1&lt;br /&gt;
| &lt;br /&gt;
|-&lt;br /&gt;
| Fully Connected&lt;br /&gt;
| log_2(p)&lt;br /&gt;
| p/2&lt;br /&gt;
| log_2(p)&lt;br /&gt;
| &lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Coherence==&lt;br /&gt;
&lt;br /&gt;
For LSMs that use a Distributed Shared Memory (DSM) architecture, cache coherence is an important issue. In 1990, researchers at the Massachusetts Institute of Technology showed that it was possible to build to build a coherent LSM using a directory-based approach with the Alewife multiprocessor.&amp;lt;ref name=&amp;quot;alewife&amp;quot;/&amp;gt;  A modern example is the Pittsburgh Supercomputing Center's Blacklight, a supercomputer with hardware-enabled shared coherent memory.&amp;lt;ref name=&amp;quot;blacklight&amp;quot;/&amp;gt;  &lt;br /&gt;
&lt;br /&gt;
On the other hand, some LSMs use distributed memory systems, meaning that each of the processors has its own private memory, making cache coherency a non-issue.  Fujitsu's K computer is an example of such a system.&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Another example of a memory design used by LSMs is Non Uniform Memory Access (NUMA).  NUMA has a coherent version of its system, called cache coherent NUMA (ccNUMA), where data and memory is accessed globally.&amp;lt;ref name=&amp;quot;ccNUMA&amp;quot;/&amp;gt;  The 2008 IBM Roadrunner supercomputer, which has 6480 Opteron processors and 12960 IBM Cell processors, uses ccNUMA.&amp;lt;ref name=&amp;quot;roadrunner&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;&amp;gt;[http://en.wikipedia.org/wiki/SPARC64_VIIIfx Fujitsu Processor]&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Xeon#6500.2F7500-series_.22Beckton.22&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Power7&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Opteron#Opteron_.2845_nm_SOI.29&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;&amp;gt;http://www.supermicro.com/products/system/4U/8046/SYS-8046B-TRLF.cfm&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;&amp;gt;http://h20341.www2.hp.com/integrity/us/en/high-end/integrity-high-end-servers-superdome2.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;&amp;gt;http://www-03.ibm.com/systems/bladecenter/hardware/chassis/bladeht/index.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer&amp;quot;&amp;gt;http://top500.org/lists/2011/11/press-release&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/19-inch_rack&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;alewife&amp;quot;&amp;gt;http://webcache.googleusercontent.com/search?q=cache:-oLJbStOeAEJ:groups.csail.mit.edu/cag/pub/papers/chaiken-thesis.ps.Z+&amp;amp;cd=1&amp;amp;hl=en&amp;amp;ct=clnk&amp;amp;gl=us&amp;amp;client=firefox-a&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;blacklight&amp;quot;&amp;gt;http://www.psc.edu/machines/sgi/uv/blacklight.php&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;topology&amp;quot;&amp;gt;http://en.wikibooks.org/wiki/Communication_Networks/Network_Topologies&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer image&amp;quot;&amp;gt;http://www.top500.org/files/systems/k.jpg&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ring&amp;quot;&amp;gt;http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.133.6894&amp;amp;rep=rep1&amp;amp;type=pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;mesh&amp;quot;&amp;gt;http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.48.4149&amp;amp;rep=rep1&amp;amp;type=pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ccNUMA&amp;quot;&amp;gt;http://www.top500.org/2007_overview_recent_supercomputers/ccnuma_machines&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;roadrunner&amp;quot;&amp;gt;http://www.leb.eei.uni-erlangen.de/winterakademie/2009/report/content/course02/pdf/0211.pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62476</id>
		<title>CSC 456 Spring 2012/11b AB</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62476"/>
		<updated>2012-04-17T08:01:18Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: /* Network Topology */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Large-Scale Multiprocessors=&lt;br /&gt;
&lt;br /&gt;
==Manufacturers==&lt;br /&gt;
&lt;br /&gt;
In order to build a large-scale multiprocessor (LSM), you will need to choose the right processors, as well as the most appropriate cabinet(s) to place them in.  There are several different manufacturers of processors and cabinets that can be used in LSM configurations.  For example, Fujitsu's K computer (the number one ranked supercomputer on TOP500's November 2011 list) uses a configuration of 88,128 SPARC64 VIIIfx processors.  This means it has a total of 705,024 cores at its use&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.  Additional examples of processors used in LSMs can be found in Table 1.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 1: Processor Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Processor&lt;br /&gt;
! Cores&lt;br /&gt;
! Clock Rate&lt;br /&gt;
! Architecture&lt;br /&gt;
|-&lt;br /&gt;
| Fujitsu&lt;br /&gt;
| SPARC64 VIIIfx&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.0 GHz&lt;br /&gt;
| SPARC&lt;br /&gt;
|-&lt;br /&gt;
| Intel&lt;br /&gt;
| Xeon 7500&amp;lt;ref name=&amp;quot;intel proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 1.733-2.667 GHz&lt;br /&gt;
| Nehalem&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| POWER7&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.4-4.25 GHz&lt;br /&gt;
| Power ISA v.2.06&lt;br /&gt;
|-&lt;br /&gt;
| AMD&lt;br /&gt;
| Opteron 6100&amp;lt;ref name=&amp;quot;amd proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
| 1.7-2.4 GHz&lt;br /&gt;
| Direct Connect 2.0&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Like processors, different manufacturers offer varying cabinet/server types, such as IBM's BladeCenter HT.  This particular model uses their CoolBlue technology, a set of tools that allows the user to have greater control over cooling and power use.  There are also some standard cabinet frames, such as 19-inch racks, which get their name from the 19-inch panels used in their design.  Typically, these racks allow for easy processor/server installation and removal. &amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 2: Cabinet Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Cabinet &lt;br /&gt;
! Blade Count&lt;br /&gt;
|-&lt;br /&gt;
| SuperMicro&lt;br /&gt;
| MP Superserver 8064B-TRLF&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| HP&lt;br /&gt;
| Integrity Superdome 2&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 32&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| BladeCenter HT&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Assembling==&lt;br /&gt;
&lt;br /&gt;
===Network Topology===&lt;br /&gt;
There are many different ways to connect the network of processors. Each network type has different properties and values related to their diameter, bisection bandwidth, and degree. The diameter of a network is the longest number of network hops between any pair of nodes. Bisection bandwidth refers to the minimum number of links that need to be cut to divide the network in half. The degree of a network refers to the number of in/out links on each node. The following figure displays some examples.&lt;br /&gt;
&lt;br /&gt;
[[File:NetworkTopologies.png|center|left|An example of possible network structures.&amp;lt;ref name = &amp;quot;topology&amp;quot;/&amp;gt;]]&lt;br /&gt;
&lt;br /&gt;
The following table gives some more detail on the different characteristics of some network types. &amp;quot;p&amp;quot; is the number of nodes, &amp;quot;d&amp;quot; is dimensions, and &amp;quot;k&amp;quot; is the number of nodes in each dimension.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 3: Network Properties&lt;br /&gt;
|-&lt;br /&gt;
! Topology&lt;br /&gt;
! Diameter&lt;br /&gt;
! Bandwidth&lt;br /&gt;
! Degree&lt;br /&gt;
|-&lt;br /&gt;
| Ring&lt;br /&gt;
| p/2&lt;br /&gt;
| 2&lt;br /&gt;
| 2&lt;br /&gt;
|-&lt;br /&gt;
| k-ary d Mesh&lt;br /&gt;
| 2(sqrt(p) - 1)&lt;br /&gt;
| sqrt(p)&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| Line&lt;br /&gt;
| p - 1&lt;br /&gt;
| 1&lt;br /&gt;
| 2&lt;br /&gt;
|-&lt;br /&gt;
| k-ary Tree&lt;br /&gt;
| 2 x log_k(p)&lt;br /&gt;
| 1&lt;br /&gt;
| k+1&lt;br /&gt;
|-&lt;br /&gt;
| Fully Connected&lt;br /&gt;
| log_2(p)&lt;br /&gt;
| p/2&lt;br /&gt;
| log_2(p)&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Coherence==&lt;br /&gt;
&lt;br /&gt;
For LSMs that use a Distributed Shared Memory (DSM) architecture, cache coherence is an important issue. In 1990, researchers at the Massachusetts Institute of Technology showed that it was possible to build to build a coherent LSM using a directory-based approach with the Alewife multiprocessor &amp;lt;ref name=&amp;quot;alewife&amp;quot;/&amp;gt;.  A modern example is the Pittsburgh Supercomputing Center's Blacklight, a supercomputer with hardware-enabled shared coherent memory &amp;lt;ref name=&amp;quot;blacklight&amp;quot;/&amp;gt;.  &lt;br /&gt;
&lt;br /&gt;
On the other hand, some LSMs use distributed memory systems, meaning that each of the processors has its own private memory, making cache coherency a non-issue.  Fujitsu's K computer is an example of such a system &amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/SPARC64_VIIIfx&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Xeon#6500.2F7500-series_.22Beckton.22&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Power7&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Opteron#Opteron_.2845_nm_SOI.29&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;&amp;gt;http://www.supermicro.com/products/system/4U/8046/SYS-8046B-TRLF.cfm&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;&amp;gt;http://h20341.www2.hp.com/integrity/us/en/high-end/integrity-high-end-servers-superdome2.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;&amp;gt;http://www-03.ibm.com/systems/bladecenter/hardware/chassis/bladeht/index.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer&amp;quot;&amp;gt;http://top500.org/lists/2011/11/press-release&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/19-inch_rack&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;alewife&amp;quot;&amp;gt;http://webcache.googleusercontent.com/search?q=cache:-oLJbStOeAEJ:groups.csail.mit.edu/cag/pub/papers/chaiken-thesis.ps.Z+&amp;amp;cd=1&amp;amp;hl=en&amp;amp;ct=clnk&amp;amp;gl=us&amp;amp;client=firefox-a&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;blacklight&amp;quot;&amp;gt;http://www.psc.edu/machines/sgi/uv/blacklight.php&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;topology&amp;quot;&amp;gt;http://en.wikibooks.org/wiki/Communication_Networks/Network_Topologies&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62475</id>
		<title>CSC 456 Spring 2012/11b AB</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62475"/>
		<updated>2012-04-17T07:57:39Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: /* References */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Large-Scale Multiprocessors=&lt;br /&gt;
&lt;br /&gt;
==Manufacturers==&lt;br /&gt;
&lt;br /&gt;
In order to build a large-scale multiprocessor (LSM), you will need to choose the right processors, as well as the most appropriate cabinet(s) to place them in.  There are several different manufacturers of processors and cabinets that can be used in LSM configurations.  For example, Fujitsu's K computer (the number one ranked supercomputer on TOP500's November 2011 list) uses a configuration of 88,128 SPARC64 VIIIfx processors.  This means it has a total of 705,024 cores at its use&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.  Additional examples of processors used in LSMs can be found in Table 1.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 1: Processor Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Processor&lt;br /&gt;
! Cores&lt;br /&gt;
! Clock Rate&lt;br /&gt;
! Architecture&lt;br /&gt;
|-&lt;br /&gt;
| Fujitsu&lt;br /&gt;
| SPARC64 VIIIfx&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.0 GHz&lt;br /&gt;
| SPARC&lt;br /&gt;
|-&lt;br /&gt;
| Intel&lt;br /&gt;
| Xeon 7500&amp;lt;ref name=&amp;quot;intel proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 1.733-2.667 GHz&lt;br /&gt;
| Nehalem&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| POWER7&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.4-4.25 GHz&lt;br /&gt;
| Power ISA v.2.06&lt;br /&gt;
|-&lt;br /&gt;
| AMD&lt;br /&gt;
| Opteron 6100&amp;lt;ref name=&amp;quot;amd proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
| 1.7-2.4 GHz&lt;br /&gt;
| Direct Connect 2.0&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Like processors, different manufacturers offer varying cabinet/server types, such as IBM's BladeCenter HT.  This particular model uses their CoolBlue technology, a set of tools that allows the user to have greater control over cooling and power use.  There are also some standard cabinet frames, such as 19-inch racks, which get their name from the 19-inch panels used in their design.  Typically, these racks allow for easy processor/server installation and removal. &amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 2: Cabinet Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Cabinet &lt;br /&gt;
! Blade Count&lt;br /&gt;
|-&lt;br /&gt;
| SuperMicro&lt;br /&gt;
| MP Superserver 8064B-TRLF&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| HP&lt;br /&gt;
| Integrity Superdome 2&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 32&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| BladeCenter HT&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Assembling==&lt;br /&gt;
&lt;br /&gt;
===Network Topology===&lt;br /&gt;
There are many different ways to connect the network of processors. Each network type has different properties and values related to their diameter, bisection bandwidth, and degree. The diameter of a network is the longest number of network hops between any pair of nodes. Bisection bandwidth refers to the minimum number of links that need to be cut to divide the network in half. The degree of a network refers to the number of in/out links on each node. The following figure displays some examples.&lt;br /&gt;
&lt;br /&gt;
[[File:NetworkTopologies.png|center|left|An example of possible network structures.&amp;lt;ref name = &amp;quot;topology&amp;quot;/&amp;gt;]]&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 3: Network Properties&lt;br /&gt;
|-&lt;br /&gt;
! Topology&lt;br /&gt;
! Diameter&lt;br /&gt;
! Bandwidth&lt;br /&gt;
! Degree&lt;br /&gt;
|-&lt;br /&gt;
| Ring&lt;br /&gt;
| p/2&lt;br /&gt;
| 2&lt;br /&gt;
| 2&lt;br /&gt;
|-&lt;br /&gt;
| k-ary d Mesh&lt;br /&gt;
| 2(sqrt(p) - 1)&lt;br /&gt;
| sqrt(p)&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| Line&lt;br /&gt;
| p - 1&lt;br /&gt;
| 1&lt;br /&gt;
| 2&lt;br /&gt;
|-&lt;br /&gt;
| k-ary Tree&lt;br /&gt;
| 2 x log_k(p)&lt;br /&gt;
| 1&lt;br /&gt;
| k+1&lt;br /&gt;
|-&lt;br /&gt;
| Fully Connected&lt;br /&gt;
| log_2(p)&lt;br /&gt;
| p/2&lt;br /&gt;
| log_2(p)&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Coherence==&lt;br /&gt;
&lt;br /&gt;
For LSMs that use a Distributed Shared Memory (DSM) architecture, cache coherence is an important issue. In 1990, researchers at the Massachusetts Institute of Technology showed that it was possible to build to build a coherent LSM using a directory-based approach with the Alewife multiprocessor &amp;lt;ref name=&amp;quot;alewife&amp;quot;/&amp;gt;.  A modern example is the Pittsburgh Supercomputing Center's Blacklight, a supercomputer with hardware-enabled shared coherent memory &amp;lt;ref name=&amp;quot;blacklight&amp;quot;/&amp;gt;.  &lt;br /&gt;
&lt;br /&gt;
On the other hand, some LSMs use distributed memory systems, meaning that each of the processors has its own private memory, making cache coherency a non-issue.  Fujitsu's K computer is an example of such a system &amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/SPARC64_VIIIfx&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Xeon#6500.2F7500-series_.22Beckton.22&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Power7&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Opteron#Opteron_.2845_nm_SOI.29&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;&amp;gt;http://www.supermicro.com/products/system/4U/8046/SYS-8046B-TRLF.cfm&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;&amp;gt;http://h20341.www2.hp.com/integrity/us/en/high-end/integrity-high-end-servers-superdome2.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;&amp;gt;http://www-03.ibm.com/systems/bladecenter/hardware/chassis/bladeht/index.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer&amp;quot;&amp;gt;http://top500.org/lists/2011/11/press-release&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/19-inch_rack&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;alewife&amp;quot;&amp;gt;http://webcache.googleusercontent.com/search?q=cache:-oLJbStOeAEJ:groups.csail.mit.edu/cag/pub/papers/chaiken-thesis.ps.Z+&amp;amp;cd=1&amp;amp;hl=en&amp;amp;ct=clnk&amp;amp;gl=us&amp;amp;client=firefox-a&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;blacklight&amp;quot;&amp;gt;http://www.psc.edu/machines/sgi/uv/blacklight.php&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;topology&amp;quot;&amp;gt;http://en.wikibooks.org/wiki/Communication_Networks/Network_Topologies&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62474</id>
		<title>CSC 456 Spring 2012/11b AB</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62474"/>
		<updated>2012-04-17T07:57:29Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: /* Network Topology */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Large-Scale Multiprocessors=&lt;br /&gt;
&lt;br /&gt;
==Manufacturers==&lt;br /&gt;
&lt;br /&gt;
In order to build a large-scale multiprocessor (LSM), you will need to choose the right processors, as well as the most appropriate cabinet(s) to place them in.  There are several different manufacturers of processors and cabinets that can be used in LSM configurations.  For example, Fujitsu's K computer (the number one ranked supercomputer on TOP500's November 2011 list) uses a configuration of 88,128 SPARC64 VIIIfx processors.  This means it has a total of 705,024 cores at its use&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.  Additional examples of processors used in LSMs can be found in Table 1.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 1: Processor Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Processor&lt;br /&gt;
! Cores&lt;br /&gt;
! Clock Rate&lt;br /&gt;
! Architecture&lt;br /&gt;
|-&lt;br /&gt;
| Fujitsu&lt;br /&gt;
| SPARC64 VIIIfx&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.0 GHz&lt;br /&gt;
| SPARC&lt;br /&gt;
|-&lt;br /&gt;
| Intel&lt;br /&gt;
| Xeon 7500&amp;lt;ref name=&amp;quot;intel proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 1.733-2.667 GHz&lt;br /&gt;
| Nehalem&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| POWER7&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.4-4.25 GHz&lt;br /&gt;
| Power ISA v.2.06&lt;br /&gt;
|-&lt;br /&gt;
| AMD&lt;br /&gt;
| Opteron 6100&amp;lt;ref name=&amp;quot;amd proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
| 1.7-2.4 GHz&lt;br /&gt;
| Direct Connect 2.0&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Like processors, different manufacturers offer varying cabinet/server types, such as IBM's BladeCenter HT.  This particular model uses their CoolBlue technology, a set of tools that allows the user to have greater control over cooling and power use.  There are also some standard cabinet frames, such as 19-inch racks, which get their name from the 19-inch panels used in their design.  Typically, these racks allow for easy processor/server installation and removal. &amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 2: Cabinet Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Cabinet &lt;br /&gt;
! Blade Count&lt;br /&gt;
|-&lt;br /&gt;
| SuperMicro&lt;br /&gt;
| MP Superserver 8064B-TRLF&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| HP&lt;br /&gt;
| Integrity Superdome 2&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 32&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| BladeCenter HT&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Assembling==&lt;br /&gt;
&lt;br /&gt;
===Network Topology===&lt;br /&gt;
There are many different ways to connect the network of processors. Each network type has different properties and values related to their diameter, bisection bandwidth, and degree. The diameter of a network is the longest number of network hops between any pair of nodes. Bisection bandwidth refers to the minimum number of links that need to be cut to divide the network in half. The degree of a network refers to the number of in/out links on each node. The following figure displays some examples.&lt;br /&gt;
&lt;br /&gt;
[[File:NetworkTopologies.png|center|left|An example of possible network structures.&amp;lt;ref name = &amp;quot;topology&amp;quot;/&amp;gt;]]&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 3: Network Properties&lt;br /&gt;
|-&lt;br /&gt;
! Topology&lt;br /&gt;
! Diameter&lt;br /&gt;
! Bandwidth&lt;br /&gt;
! Degree&lt;br /&gt;
|-&lt;br /&gt;
| Ring&lt;br /&gt;
| p/2&lt;br /&gt;
| 2&lt;br /&gt;
| 2&lt;br /&gt;
|-&lt;br /&gt;
| k-ary d Mesh&lt;br /&gt;
| 2(sqrt(p) - 1)&lt;br /&gt;
| sqrt(p)&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| Line&lt;br /&gt;
| p - 1&lt;br /&gt;
| 1&lt;br /&gt;
| 2&lt;br /&gt;
|-&lt;br /&gt;
| k-ary Tree&lt;br /&gt;
| 2 x log_k(p)&lt;br /&gt;
| 1&lt;br /&gt;
| k+1&lt;br /&gt;
|-&lt;br /&gt;
| Fully Connected&lt;br /&gt;
| log_2(p)&lt;br /&gt;
| p/2&lt;br /&gt;
| log_2(p)&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Coherence==&lt;br /&gt;
&lt;br /&gt;
For LSMs that use a Distributed Shared Memory (DSM) architecture, cache coherence is an important issue. In 1990, researchers at the Massachusetts Institute of Technology showed that it was possible to build to build a coherent LSM using a directory-based approach with the Alewife multiprocessor &amp;lt;ref name=&amp;quot;alewife&amp;quot;/&amp;gt;.  A modern example is the Pittsburgh Supercomputing Center's Blacklight, a supercomputer with hardware-enabled shared coherent memory &amp;lt;ref name=&amp;quot;blacklight&amp;quot;/&amp;gt;.  &lt;br /&gt;
&lt;br /&gt;
On the other hand, some LSMs use distributed memory systems, meaning that each of the processors has its own private memory, making cache coherency a non-issue.  Fujitsu's K computer is an example of such a system &amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/SPARC64_VIIIfx&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Xeon#6500.2F7500-series_.22Beckton.22&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Power7&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Opteron#Opteron_.2845_nm_SOI.29&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;&amp;gt;http://www.supermicro.com/products/system/4U/8046/SYS-8046B-TRLF.cfm&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;&amp;gt;http://h20341.www2.hp.com/integrity/us/en/high-end/integrity-high-end-servers-superdome2.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;&amp;gt;http://www-03.ibm.com/systems/bladecenter/hardware/chassis/bladeht/index.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer&amp;quot;&amp;gt;http://top500.org/lists/2011/11/press-release&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/19-inch_rack&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;alewife&amp;quot;&amp;gt;http://webcache.googleusercontent.com/search?q=cache:-oLJbStOeAEJ:groups.csail.mit.edu/cag/pub/papers/chaiken-thesis.ps.Z+&amp;amp;cd=1&amp;amp;hl=en&amp;amp;ct=clnk&amp;amp;gl=us&amp;amp;client=firefox-a&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;blacklight&amp;quot;&amp;gt;http://www.psc.edu/machines/sgi/uv/blacklight.php&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;topology&amp;quot;&amp;gt;http://learn-networking.com/network-design/a-guide-to-network-topology&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62473</id>
		<title>CSC 456 Spring 2012/11b AB</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62473"/>
		<updated>2012-04-17T07:53:09Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: /* Network Topology */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Large-Scale Multiprocessors=&lt;br /&gt;
&lt;br /&gt;
==Manufacturers==&lt;br /&gt;
&lt;br /&gt;
In order to build a large-scale multiprocessor (LSM), you will need to choose the right processors, as well as the most appropriate cabinet(s) to place them in.  There are several different manufacturers of processors and cabinets that can be used in LSM configurations.  For example, Fujitsu's K computer (the number one ranked supercomputer on TOP500's November 2011 list) uses a configuration of 88,128 SPARC64 VIIIfx processors.  This means it has a total of 705,024 cores at its use&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.  Additional examples of processors used in LSMs can be found in Table 1.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 1: Processor Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Processor&lt;br /&gt;
! Cores&lt;br /&gt;
! Clock Rate&lt;br /&gt;
! Architecture&lt;br /&gt;
|-&lt;br /&gt;
| Fujitsu&lt;br /&gt;
| SPARC64 VIIIfx&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.0 GHz&lt;br /&gt;
| SPARC&lt;br /&gt;
|-&lt;br /&gt;
| Intel&lt;br /&gt;
| Xeon 7500&amp;lt;ref name=&amp;quot;intel proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 1.733-2.667 GHz&lt;br /&gt;
| Nehalem&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| POWER7&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.4-4.25 GHz&lt;br /&gt;
| Power ISA v.2.06&lt;br /&gt;
|-&lt;br /&gt;
| AMD&lt;br /&gt;
| Opteron 6100&amp;lt;ref name=&amp;quot;amd proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
| 1.7-2.4 GHz&lt;br /&gt;
| Direct Connect 2.0&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Like processors, different manufacturers offer varying cabinet/server types, such as IBM's BladeCenter HT.  This particular model uses their CoolBlue technology, a set of tools that allows the user to have greater control over cooling and power use.  There are also some standard cabinet frames, such as 19-inch racks, which get their name from the 19-inch panels used in their design.  Typically, these racks allow for easy processor/server installation and removal. &amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 2: Cabinet Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Cabinet &lt;br /&gt;
! Blade Count&lt;br /&gt;
|-&lt;br /&gt;
| SuperMicro&lt;br /&gt;
| MP Superserver 8064B-TRLF&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| HP&lt;br /&gt;
| Integrity Superdome 2&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 32&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| BladeCenter HT&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Assembling==&lt;br /&gt;
&lt;br /&gt;
===Network Topology===&lt;br /&gt;
There are many different ways to connect the network of processors. Each network type has different properties and values related to their diameter, bandwidth, and degree. The following figure displays some examples.&lt;br /&gt;
[[File:NetworkTopologies.png|center|left|An example of possible network structures.&amp;lt;ref name = &amp;quot;topology&amp;quot;/&amp;gt;]]&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 3: Network Properties&lt;br /&gt;
|-&lt;br /&gt;
! Topology&lt;br /&gt;
! Diameter&lt;br /&gt;
! Bandwidth&lt;br /&gt;
! Degree&lt;br /&gt;
|-&lt;br /&gt;
| Ring&lt;br /&gt;
| p/2&lt;br /&gt;
| 2&lt;br /&gt;
| 2&lt;br /&gt;
|-&lt;br /&gt;
| k-ary d Mesh&lt;br /&gt;
| 2(sqrt(p) - 1)&lt;br /&gt;
| sqrt(p)&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| Line&lt;br /&gt;
| p - 1&lt;br /&gt;
| 1&lt;br /&gt;
| 2&lt;br /&gt;
|-&lt;br /&gt;
| k-ary Tree&lt;br /&gt;
| 2 x log_k(p)&lt;br /&gt;
| 1&lt;br /&gt;
| k+1&lt;br /&gt;
|-&lt;br /&gt;
| Fully Connected&lt;br /&gt;
| log_2(p)&lt;br /&gt;
| p/2&lt;br /&gt;
| log_2(p)&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Coherence==&lt;br /&gt;
&lt;br /&gt;
For LSMs that use a Distributed Shared Memory (DSM) architecture, cache coherence is an important issue. In 1990, researchers at the Massachusetts Institute of Technology showed that it was possible to build to build a coherent LSM using a directory-based approach with the Alewife multiprocessor &amp;lt;ref name=&amp;quot;alewife&amp;quot;/&amp;gt;.  A modern example is the Pittsburgh Supercomputing Center's Blacklight, a supercomputer with hardware-enabled shared coherent memory &amp;lt;ref name=&amp;quot;blacklight&amp;quot;/&amp;gt;.  &lt;br /&gt;
&lt;br /&gt;
On the other hand, some LSMs use distributed memory systems, meaning that each of the processors has its own private memory, making cache coherency a non-issue.  Fujitsu's K computer is an example of such a system &amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/SPARC64_VIIIfx&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Xeon#6500.2F7500-series_.22Beckton.22&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Power7&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Opteron#Opteron_.2845_nm_SOI.29&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;&amp;gt;http://www.supermicro.com/products/system/4U/8046/SYS-8046B-TRLF.cfm&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;&amp;gt;http://h20341.www2.hp.com/integrity/us/en/high-end/integrity-high-end-servers-superdome2.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;&amp;gt;http://www-03.ibm.com/systems/bladecenter/hardware/chassis/bladeht/index.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer&amp;quot;&amp;gt;http://top500.org/lists/2011/11/press-release&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/19-inch_rack&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;alewife&amp;quot;&amp;gt;http://webcache.googleusercontent.com/search?q=cache:-oLJbStOeAEJ:groups.csail.mit.edu/cag/pub/papers/chaiken-thesis.ps.Z+&amp;amp;cd=1&amp;amp;hl=en&amp;amp;ct=clnk&amp;amp;gl=us&amp;amp;client=firefox-a&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;blacklight&amp;quot;&amp;gt;http://www.psc.edu/machines/sgi/uv/blacklight.php&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;topology&amp;quot;&amp;gt;http://learn-networking.com/network-design/a-guide-to-network-topology&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62472</id>
		<title>CSC 456 Spring 2012/11b AB</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62472"/>
		<updated>2012-04-17T07:48:55Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: /* Network Topology */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Large-Scale Multiprocessors=&lt;br /&gt;
&lt;br /&gt;
==Manufacturers==&lt;br /&gt;
&lt;br /&gt;
In order to build a large-scale multiprocessor (LSM), you will need to choose the right processors, as well as the most appropriate cabinet(s) to place them in.  There are several different manufacturers of processors and cabinets that can be used in LSM configurations.  For example, Fujitsu's K computer (the number one ranked supercomputer on TOP500's November 2011 list) uses a configuration of 88,128 SPARC64 VIIIfx processors.  This means it has a total of 705,024 cores at its use&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.  Additional examples of processors used in LSMs can be found in Table 1.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 1: Processor Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Processor&lt;br /&gt;
! Cores&lt;br /&gt;
! Clock Rate&lt;br /&gt;
! Architecture&lt;br /&gt;
|-&lt;br /&gt;
| Fujitsu&lt;br /&gt;
| SPARC64 VIIIfx&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.0 GHz&lt;br /&gt;
| SPARC&lt;br /&gt;
|-&lt;br /&gt;
| Intel&lt;br /&gt;
| Xeon 7500&amp;lt;ref name=&amp;quot;intel proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 1.733-2.667 GHz&lt;br /&gt;
| Nehalem&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| POWER7&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.4-4.25 GHz&lt;br /&gt;
| Power ISA v.2.06&lt;br /&gt;
|-&lt;br /&gt;
| AMD&lt;br /&gt;
| Opteron 6100&amp;lt;ref name=&amp;quot;amd proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
| 1.7-2.4 GHz&lt;br /&gt;
| Direct Connect 2.0&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Like processors, different manufacturers offer varying cabinet/server types, such as IBM's BladeCenter HT.  This particular model uses their CoolBlue technology, a set of tools that allows the user to have greater control over cooling and power use.  There are also some standard cabinet frames, such as 19-inch racks, which get their name from the 19-inch panels used in their design.  Typically, these racks allow for easy processor/server installation and removal. &amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 2: Cabinet Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Cabinet &lt;br /&gt;
! Blade Count&lt;br /&gt;
|-&lt;br /&gt;
| SuperMicro&lt;br /&gt;
| MP Superserver 8064B-TRLF&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| HP&lt;br /&gt;
| Integrity Superdome 2&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 32&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| BladeCenter HT&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Assembling==&lt;br /&gt;
&lt;br /&gt;
===Network Topology===&lt;br /&gt;
There are many different ways to connect the network of processors. Each network type has different properties and values related to their diameter, bandwidth, and degree. The following figure displays some examples.&lt;br /&gt;
[[File:NetworkTopologies.png|center|left|An example of possible network structures.&amp;lt;ref name = &amp;quot;topology&amp;quot;/&amp;gt;]]&lt;br /&gt;
&lt;br /&gt;
==Coherence==&lt;br /&gt;
&lt;br /&gt;
For LSMs that use a Distributed Shared Memory (DSM) architecture, cache coherence is an important issue. In 1990, researchers at the Massachusetts Institute of Technology showed that it was possible to build to build a coherent LSM using a directory-based approach with the Alewife multiprocessor &amp;lt;ref name=&amp;quot;alewife&amp;quot;/&amp;gt;.  A modern example is the Pittsburgh Supercomputing Center's Blacklight, a supercomputer with hardware-enabled shared coherent memory &amp;lt;ref name=&amp;quot;blacklight&amp;quot;/&amp;gt;.  &lt;br /&gt;
&lt;br /&gt;
On the other hand, some LSMs use distributed memory systems, meaning that each of the processors has its own private memory, making cache coherency a non-issue.  Fujitsu's K computer is an example of such a system &amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/SPARC64_VIIIfx&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Xeon#6500.2F7500-series_.22Beckton.22&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Power7&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Opteron#Opteron_.2845_nm_SOI.29&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;&amp;gt;http://www.supermicro.com/products/system/4U/8046/SYS-8046B-TRLF.cfm&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;&amp;gt;http://h20341.www2.hp.com/integrity/us/en/high-end/integrity-high-end-servers-superdome2.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;&amp;gt;http://www-03.ibm.com/systems/bladecenter/hardware/chassis/bladeht/index.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer&amp;quot;&amp;gt;http://top500.org/lists/2011/11/press-release&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/19-inch_rack&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;alewife&amp;quot;&amp;gt;http://webcache.googleusercontent.com/search?q=cache:-oLJbStOeAEJ:groups.csail.mit.edu/cag/pub/papers/chaiken-thesis.ps.Z+&amp;amp;cd=1&amp;amp;hl=en&amp;amp;ct=clnk&amp;amp;gl=us&amp;amp;client=firefox-a&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;blacklight&amp;quot;&amp;gt;http://www.psc.edu/machines/sgi/uv/blacklight.php&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;topology&amp;quot;&amp;gt;http://learn-networking.com/network-design/a-guide-to-network-topology&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=File:NetworkTopologies.png&amp;diff=62471</id>
		<title>File:NetworkTopologies.png</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=File:NetworkTopologies.png&amp;diff=62471"/>
		<updated>2012-04-17T07:46:56Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62470</id>
		<title>CSC 456 Spring 2012/11b AB</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62470"/>
		<updated>2012-04-17T07:45:53Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: /* Network Topology */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Large-Scale Multiprocessors=&lt;br /&gt;
&lt;br /&gt;
==Manufacturers==&lt;br /&gt;
&lt;br /&gt;
In order to build a large-scale multiprocessor (LSM), you will need to choose the right processors, as well as the most appropriate cabinet(s) to place them in.  There are several different manufacturers of processors and cabinets that can be used in LSM configurations.  For example, Fujitsu's K computer (the number one ranked supercomputer on TOP500's November 2011 list) uses a configuration of 88,128 SPARC64 VIIIfx processors.  This means it has a total of 705,024 cores at its use&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.  Additional examples of processors used in LSMs can be found in Table 1.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 1: Processor Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Processor&lt;br /&gt;
! Cores&lt;br /&gt;
! Clock Rate&lt;br /&gt;
! Architecture&lt;br /&gt;
|-&lt;br /&gt;
| Fujitsu&lt;br /&gt;
| SPARC64 VIIIfx&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.0 GHz&lt;br /&gt;
| SPARC&lt;br /&gt;
|-&lt;br /&gt;
| Intel&lt;br /&gt;
| Xeon 7500&amp;lt;ref name=&amp;quot;intel proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 1.733-2.667 GHz&lt;br /&gt;
| Nehalem&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| POWER7&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.4-4.25 GHz&lt;br /&gt;
| Power ISA v.2.06&lt;br /&gt;
|-&lt;br /&gt;
| AMD&lt;br /&gt;
| Opteron 6100&amp;lt;ref name=&amp;quot;amd proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
| 1.7-2.4 GHz&lt;br /&gt;
| Direct Connect 2.0&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Like processors, different manufacturers offer varying cabinet/server types, such as IBM's BladeCenter HT.  This particular model uses their CoolBlue technology, a set of tools that allows the user to have greater control over cooling and power use.  There are also some standard cabinet frames, such as 19-inch racks, which get their name from the 19-inch panels used in their design.  Typically, these racks allow for easy processor/server installation and removal. &amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 2: Cabinet Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Cabinet &lt;br /&gt;
! Blade Count&lt;br /&gt;
|-&lt;br /&gt;
| SuperMicro&lt;br /&gt;
| MP Superserver 8064B-TRLF&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| HP&lt;br /&gt;
| Integrity Superdome 2&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 32&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| BladeCenter HT&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Assembling==&lt;br /&gt;
&lt;br /&gt;
===Network Topology===&lt;br /&gt;
There are many different ways to connect the network of processors. Each network type has different properties and values related to their diameter, bandwidth, and degree. The following figure displays some examples.&lt;br /&gt;
[[File:NetworkTopologies.jpg|center|left|An example of possible network structures.&amp;lt;ref name = &amp;quot;topology&amp;quot;/&amp;gt;]]&lt;br /&gt;
&lt;br /&gt;
==Coherence==&lt;br /&gt;
&lt;br /&gt;
For LSMs that use a Distributed Shared Memory (DSM) architecture, cache coherence is an important issue. In 1990, researchers at the Massachusetts Institute of Technology showed that it was possible to build to build a coherent LSM using a directory-based approach with the Alewife multiprocessor &amp;lt;ref name=&amp;quot;alewife&amp;quot;/&amp;gt;.  A modern example is the Pittsburgh Supercomputing Center's Blacklight, a supercomputer with hardware-enabled shared coherent memory &amp;lt;ref name=&amp;quot;blacklight&amp;quot;/&amp;gt;.  &lt;br /&gt;
&lt;br /&gt;
On the other hand, some LSMs use distributed memory systems, meaning that each of the processors has its own private memory, making cache coherency a non-issue.  Fujitsu's K computer is an example of such a system &amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/SPARC64_VIIIfx&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Xeon#6500.2F7500-series_.22Beckton.22&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Power7&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Opteron#Opteron_.2845_nm_SOI.29&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;&amp;gt;http://www.supermicro.com/products/system/4U/8046/SYS-8046B-TRLF.cfm&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;&amp;gt;http://h20341.www2.hp.com/integrity/us/en/high-end/integrity-high-end-servers-superdome2.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;&amp;gt;http://www-03.ibm.com/systems/bladecenter/hardware/chassis/bladeht/index.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer&amp;quot;&amp;gt;http://top500.org/lists/2011/11/press-release&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/19-inch_rack&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;alewife&amp;quot;&amp;gt;http://webcache.googleusercontent.com/search?q=cache:-oLJbStOeAEJ:groups.csail.mit.edu/cag/pub/papers/chaiken-thesis.ps.Z+&amp;amp;cd=1&amp;amp;hl=en&amp;amp;ct=clnk&amp;amp;gl=us&amp;amp;client=firefox-a&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;blacklight&amp;quot;&amp;gt;http://www.psc.edu/machines/sgi/uv/blacklight.php&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;topology&amp;quot;&amp;gt;http://learn-networking.com/network-design/a-guide-to-network-topology&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62469</id>
		<title>CSC 456 Spring 2012/11b AB</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62469"/>
		<updated>2012-04-17T07:40:54Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: /* Network Topology */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Large-Scale Multiprocessors=&lt;br /&gt;
&lt;br /&gt;
==Manufacturers==&lt;br /&gt;
&lt;br /&gt;
In order to build a large-scale multiprocessor (LSM), you will need to choose the right processors, as well as the most appropriate cabinet(s) to place them in.  There are several different manufacturers of processors and cabinets that can be used in LSM configurations.  For example, Fujitsu's K computer (the number one ranked supercomputer on TOP500's November 2011 list) uses a configuration of 88,128 SPARC64 VIIIfx processors.  This means it has a total of 705,024 cores at its use&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.  Additional examples of processors used in LSMs can be found in Table 1.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 1: Processor Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Processor&lt;br /&gt;
! Cores&lt;br /&gt;
! Clock Rate&lt;br /&gt;
! Architecture&lt;br /&gt;
|-&lt;br /&gt;
| Fujitsu&lt;br /&gt;
| SPARC64 VIIIfx&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.0 GHz&lt;br /&gt;
| SPARC&lt;br /&gt;
|-&lt;br /&gt;
| Intel&lt;br /&gt;
| Xeon 7500&amp;lt;ref name=&amp;quot;intel proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 1.733-2.667 GHz&lt;br /&gt;
| Nehalem&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| POWER7&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.4-4.25 GHz&lt;br /&gt;
| Power ISA v.2.06&lt;br /&gt;
|-&lt;br /&gt;
| AMD&lt;br /&gt;
| Opteron 6100&amp;lt;ref name=&amp;quot;amd proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
| 1.7-2.4 GHz&lt;br /&gt;
| Direct Connect 2.0&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Like processors, different manufacturers offer varying cabinet/server types, such as IBM's BladeCenter HT.  This particular model uses their CoolBlue technology, a set of tools that allows the user to have greater control over cooling and power use.  There are also some standard cabinet frames, such as 19-inch racks, which get their name from the 19-inch panels used in their design.  Typically, these racks allow for easy processor/server installation and removal. &amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 2: Cabinet Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Cabinet &lt;br /&gt;
! Blade Count&lt;br /&gt;
|-&lt;br /&gt;
| SuperMicro&lt;br /&gt;
| MP Superserver 8064B-TRLF&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| HP&lt;br /&gt;
| Integrity Superdome 2&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 32&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| BladeCenter HT&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Assembling==&lt;br /&gt;
&lt;br /&gt;
===Network Topology===&lt;br /&gt;
There are many different ways to connect the network of processors. Each network type has different properties and values related to their diameter, bandwidth, and degree. The following figure displays some examples.&lt;br /&gt;
[[File:network-topology.jpg|center|left|An example of possible network structures.&amp;lt;ref name = &amp;quot;topology&amp;quot;/&amp;gt;]]&lt;br /&gt;
&lt;br /&gt;
==Coherence==&lt;br /&gt;
&lt;br /&gt;
For LSMs that use a Distributed Shared Memory (DSM) architecture, cache coherence is an important issue. In 1990, researchers at the Massachusetts Institute of Technology showed that it was possible to build to build a coherent LSM using a directory-based approach with the Alewife multiprocessor &amp;lt;ref name=&amp;quot;alewife&amp;quot;/&amp;gt;.  A modern example is the Pittsburgh Supercomputing Center's Blacklight, a supercomputer with hardware-enabled shared coherent memory &amp;lt;ref name=&amp;quot;blacklight&amp;quot;/&amp;gt;.  &lt;br /&gt;
&lt;br /&gt;
On the other hand, some LSMs use distributed memory systems, meaning that each of the processors has its own private memory, making cache coherency a non-issue.  Fujitsu's K computer is an example of such a system &amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/SPARC64_VIIIfx&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Xeon#6500.2F7500-series_.22Beckton.22&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Power7&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Opteron#Opteron_.2845_nm_SOI.29&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;&amp;gt;http://www.supermicro.com/products/system/4U/8046/SYS-8046B-TRLF.cfm&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;&amp;gt;http://h20341.www2.hp.com/integrity/us/en/high-end/integrity-high-end-servers-superdome2.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;&amp;gt;http://www-03.ibm.com/systems/bladecenter/hardware/chassis/bladeht/index.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer&amp;quot;&amp;gt;http://top500.org/lists/2011/11/press-release&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/19-inch_rack&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;alewife&amp;quot;&amp;gt;http://webcache.googleusercontent.com/search?q=cache:-oLJbStOeAEJ:groups.csail.mit.edu/cag/pub/papers/chaiken-thesis.ps.Z+&amp;amp;cd=1&amp;amp;hl=en&amp;amp;ct=clnk&amp;amp;gl=us&amp;amp;client=firefox-a&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;blacklight&amp;quot;&amp;gt;http://www.psc.edu/machines/sgi/uv/blacklight.php&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;topology&amp;quot;&amp;gt;http://learn-networking.com/network-design/a-guide-to-network-topology&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62468</id>
		<title>CSC 456 Spring 2012/11b AB</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62468"/>
		<updated>2012-04-17T07:35:29Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: /* Network Topology */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Large-Scale Multiprocessors=&lt;br /&gt;
&lt;br /&gt;
==Manufacturers==&lt;br /&gt;
&lt;br /&gt;
In order to build a large-scale multiprocessor (LSM), you will need to choose the right processors, as well as the most appropriate cabinet(s) to place them in.  There are several different manufacturers of processors and cabinets that can be used in LSM configurations.  For example, Fujitsu's K computer (the number one ranked supercomputer on TOP500's November 2011 list) uses a configuration of 88,128 SPARC64 VIIIfx processors.  This means it has a total of 705,024 cores at its use&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.  Additional examples of processors used in LSMs can be found in Table 1.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 1: Processor Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Processor&lt;br /&gt;
! Cores&lt;br /&gt;
! Clock Rate&lt;br /&gt;
! Architecture&lt;br /&gt;
|-&lt;br /&gt;
| Fujitsu&lt;br /&gt;
| SPARC64 VIIIfx&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.0 GHz&lt;br /&gt;
| SPARC&lt;br /&gt;
|-&lt;br /&gt;
| Intel&lt;br /&gt;
| Xeon 7500&amp;lt;ref name=&amp;quot;intel proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 1.733-2.667 GHz&lt;br /&gt;
| Nehalem&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| POWER7&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.4-4.25 GHz&lt;br /&gt;
| Power ISA v.2.06&lt;br /&gt;
|-&lt;br /&gt;
| AMD&lt;br /&gt;
| Opteron 6100&amp;lt;ref name=&amp;quot;amd proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
| 1.7-2.4 GHz&lt;br /&gt;
| Direct Connect 2.0&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Like processors, different manufacturers offer varying cabinet/server types, such as IBM's BladeCenter HT.  This particular model uses their CoolBlue technology, a set of tools that allows the user to have greater control over cooling and power use.  There are also some standard cabinet frames, such as 19-inch racks, which get their name from the 19-inch panels used in their design.  Typically, these racks allow for easy processor/server installation and removal. &amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 2: Cabinet Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Cabinet &lt;br /&gt;
! Blade Count&lt;br /&gt;
|-&lt;br /&gt;
| SuperMicro&lt;br /&gt;
| MP Superserver 8064B-TRLF&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| HP&lt;br /&gt;
| Integrity Superdome 2&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 32&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| BladeCenter HT&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Assembling==&lt;br /&gt;
&lt;br /&gt;
===Network Topology===&lt;br /&gt;
There are many different ways to connect the network of processors. The following figure displays some examples.&lt;br /&gt;
[[File:network-topology.jpg|center|left|An example of possible network structures.&amp;lt;ref name = &amp;quot;topology&amp;quot;/&amp;gt;]]&lt;br /&gt;
&lt;br /&gt;
==Coherence==&lt;br /&gt;
&lt;br /&gt;
For LSMs that use a Distributed Shared Memory (DSM) architecture, cache coherence is an important issue. In 1990, researchers at the Massachusetts Institute of Technology showed that it was possible to build to build a coherent LSM using a directory-based approach with the Alewife multiprocessor &amp;lt;ref name=&amp;quot;alewife&amp;quot;/&amp;gt;.  A modern example is the Pittsburgh Supercomputing Center's Blacklight, a supercomputer with hardware-enabled shared coherent memory &amp;lt;ref name=&amp;quot;blacklight&amp;quot;/&amp;gt;.  &lt;br /&gt;
&lt;br /&gt;
On the other hand, some LSMs use distributed memory systems, meaning that each of the processors has its own private memory, making cache coherency a non-issue.  Fujitsu's K computer is an example of such a system &amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/SPARC64_VIIIfx&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Xeon#6500.2F7500-series_.22Beckton.22&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Power7&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Opteron#Opteron_.2845_nm_SOI.29&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;&amp;gt;http://www.supermicro.com/products/system/4U/8046/SYS-8046B-TRLF.cfm&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;&amp;gt;http://h20341.www2.hp.com/integrity/us/en/high-end/integrity-high-end-servers-superdome2.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;&amp;gt;http://www-03.ibm.com/systems/bladecenter/hardware/chassis/bladeht/index.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer&amp;quot;&amp;gt;http://top500.org/lists/2011/11/press-release&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/19-inch_rack&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;alewife&amp;quot;&amp;gt;http://webcache.googleusercontent.com/search?q=cache:-oLJbStOeAEJ:groups.csail.mit.edu/cag/pub/papers/chaiken-thesis.ps.Z+&amp;amp;cd=1&amp;amp;hl=en&amp;amp;ct=clnk&amp;amp;gl=us&amp;amp;client=firefox-a&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;blacklight&amp;quot;&amp;gt;http://www.psc.edu/machines/sgi/uv/blacklight.php&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;topology&amp;quot;&amp;gt;http://learn-networking.com/network-design/a-guide-to-network-topology&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62467</id>
		<title>CSC 456 Spring 2012/11b AB</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62467"/>
		<updated>2012-04-17T07:35:17Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: /* Network Topology */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Large-Scale Multiprocessors=&lt;br /&gt;
&lt;br /&gt;
==Manufacturers==&lt;br /&gt;
&lt;br /&gt;
In order to build a large-scale multiprocessor (LSM), you will need to choose the right processors, as well as the most appropriate cabinet(s) to place them in.  There are several different manufacturers of processors and cabinets that can be used in LSM configurations.  For example, Fujitsu's K computer (the number one ranked supercomputer on TOP500's November 2011 list) uses a configuration of 88,128 SPARC64 VIIIfx processors.  This means it has a total of 705,024 cores at its use&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.  Additional examples of processors used in LSMs can be found in Table 1.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 1: Processor Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Processor&lt;br /&gt;
! Cores&lt;br /&gt;
! Clock Rate&lt;br /&gt;
! Architecture&lt;br /&gt;
|-&lt;br /&gt;
| Fujitsu&lt;br /&gt;
| SPARC64 VIIIfx&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.0 GHz&lt;br /&gt;
| SPARC&lt;br /&gt;
|-&lt;br /&gt;
| Intel&lt;br /&gt;
| Xeon 7500&amp;lt;ref name=&amp;quot;intel proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 1.733-2.667 GHz&lt;br /&gt;
| Nehalem&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| POWER7&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.4-4.25 GHz&lt;br /&gt;
| Power ISA v.2.06&lt;br /&gt;
|-&lt;br /&gt;
| AMD&lt;br /&gt;
| Opteron 6100&amp;lt;ref name=&amp;quot;amd proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
| 1.7-2.4 GHz&lt;br /&gt;
| Direct Connect 2.0&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Like processors, different manufacturers offer varying cabinet/server types, such as IBM's BladeCenter HT.  This particular model uses their CoolBlue technology, a set of tools that allows the user to have greater control over cooling and power use.  There are also some standard cabinet frames, such as 19-inch racks, which get their name from the 19-inch panels used in their design.  Typically, these racks allow for easy processor/server installation and removal. &amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 2: Cabinet Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Cabinet &lt;br /&gt;
! Blade Count&lt;br /&gt;
|-&lt;br /&gt;
| SuperMicro&lt;br /&gt;
| MP Superserver 8064B-TRLF&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| HP&lt;br /&gt;
| Integrity Superdome 2&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 32&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| BladeCenter HT&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Assembling==&lt;br /&gt;
&lt;br /&gt;
===Network Topology===&lt;br /&gt;
There are many different ways to connect the network of processors. The following figure displays some examples.&lt;br /&gt;
[[File:network-topology.jpg|thumb|left|An example of possible network structures.&amp;lt;ref name = &amp;quot;topology&amp;quot;/&amp;gt;]]&lt;br /&gt;
&lt;br /&gt;
==Coherence==&lt;br /&gt;
&lt;br /&gt;
For LSMs that use a Distributed Shared Memory (DSM) architecture, cache coherence is an important issue. In 1990, researchers at the Massachusetts Institute of Technology showed that it was possible to build to build a coherent LSM using a directory-based approach with the Alewife multiprocessor &amp;lt;ref name=&amp;quot;alewife&amp;quot;/&amp;gt;.  A modern example is the Pittsburgh Supercomputing Center's Blacklight, a supercomputer with hardware-enabled shared coherent memory &amp;lt;ref name=&amp;quot;blacklight&amp;quot;/&amp;gt;.  &lt;br /&gt;
&lt;br /&gt;
On the other hand, some LSMs use distributed memory systems, meaning that each of the processors has its own private memory, making cache coherency a non-issue.  Fujitsu's K computer is an example of such a system &amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/SPARC64_VIIIfx&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Xeon#6500.2F7500-series_.22Beckton.22&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Power7&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Opteron#Opteron_.2845_nm_SOI.29&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;&amp;gt;http://www.supermicro.com/products/system/4U/8046/SYS-8046B-TRLF.cfm&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;&amp;gt;http://h20341.www2.hp.com/integrity/us/en/high-end/integrity-high-end-servers-superdome2.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;&amp;gt;http://www-03.ibm.com/systems/bladecenter/hardware/chassis/bladeht/index.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer&amp;quot;&amp;gt;http://top500.org/lists/2011/11/press-release&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/19-inch_rack&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;alewife&amp;quot;&amp;gt;http://webcache.googleusercontent.com/search?q=cache:-oLJbStOeAEJ:groups.csail.mit.edu/cag/pub/papers/chaiken-thesis.ps.Z+&amp;amp;cd=1&amp;amp;hl=en&amp;amp;ct=clnk&amp;amp;gl=us&amp;amp;client=firefox-a&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;blacklight&amp;quot;&amp;gt;http://www.psc.edu/machines/sgi/uv/blacklight.php&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;topology&amp;quot;&amp;gt;http://learn-networking.com/network-design/a-guide-to-network-topology&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62466</id>
		<title>CSC 456 Spring 2012/11b AB</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62466"/>
		<updated>2012-04-17T07:34:00Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: /* Network Topology */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Large-Scale Multiprocessors=&lt;br /&gt;
&lt;br /&gt;
==Manufacturers==&lt;br /&gt;
&lt;br /&gt;
In order to build a large-scale multiprocessor (LSM), you will need to choose the right processors, as well as the most appropriate cabinet(s) to place them in.  There are several different manufacturers of processors and cabinets that can be used in LSM configurations.  For example, Fujitsu's K computer (the number one ranked supercomputer on TOP500's November 2011 list) uses a configuration of 88,128 SPARC64 VIIIfx processors.  This means it has a total of 705,024 cores at its use&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.  Additional examples of processors used in LSMs can be found in Table 1.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 1: Processor Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Processor&lt;br /&gt;
! Cores&lt;br /&gt;
! Clock Rate&lt;br /&gt;
! Architecture&lt;br /&gt;
|-&lt;br /&gt;
| Fujitsu&lt;br /&gt;
| SPARC64 VIIIfx&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.0 GHz&lt;br /&gt;
| SPARC&lt;br /&gt;
|-&lt;br /&gt;
| Intel&lt;br /&gt;
| Xeon 7500&amp;lt;ref name=&amp;quot;intel proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 1.733-2.667 GHz&lt;br /&gt;
| Nehalem&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| POWER7&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.4-4.25 GHz&lt;br /&gt;
| Power ISA v.2.06&lt;br /&gt;
|-&lt;br /&gt;
| AMD&lt;br /&gt;
| Opteron 6100&amp;lt;ref name=&amp;quot;amd proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
| 1.7-2.4 GHz&lt;br /&gt;
| Direct Connect 2.0&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Like processors, different manufacturers offer varying cabinet/server types, such as IBM's BladeCenter HT.  This particular model uses their CoolBlue technology, a set of tools that allows the user to have greater control over cooling and power use.  There are also some standard cabinet frames, such as 19-inch racks, which get their name from the 19-inch panels used in their design.  Typically, these racks allow for easy processor/server installation and removal. &amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 2: Cabinet Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Cabinet &lt;br /&gt;
! Blade Count&lt;br /&gt;
|-&lt;br /&gt;
| SuperMicro&lt;br /&gt;
| MP Superserver 8064B-TRLF&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| HP&lt;br /&gt;
| Integrity Superdome 2&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 32&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| BladeCenter HT&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Assembling==&lt;br /&gt;
&lt;br /&gt;
===Network Topology===&lt;br /&gt;
[[File:network-topology.jpg|thumb|left|An example of possible network structures.&amp;lt;ref name = &amp;quot;topology&amp;quot;/&amp;gt;]]&lt;br /&gt;
&lt;br /&gt;
==Coherence==&lt;br /&gt;
&lt;br /&gt;
For LSMs that use a Distributed Shared Memory (DSM) architecture, cache coherence is an important issue. In 1990, researchers at the Massachusetts Institute of Technology showed that it was possible to build to build a coherent LSM using a directory-based approach with the Alewife multiprocessor &amp;lt;ref name=&amp;quot;alewife&amp;quot;/&amp;gt;.  A modern example is the Pittsburgh Supercomputing Center's Blacklight, a supercomputer with hardware-enabled shared coherent memory &amp;lt;ref name=&amp;quot;blacklight&amp;quot;/&amp;gt;.  &lt;br /&gt;
&lt;br /&gt;
On the other hand, some LSMs use distributed memory systems, meaning that each of the processors has its own private memory, making cache coherency a non-issue.  Fujitsu's K computer is an example of such a system &amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/SPARC64_VIIIfx&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Xeon#6500.2F7500-series_.22Beckton.22&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Power7&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Opteron#Opteron_.2845_nm_SOI.29&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;&amp;gt;http://www.supermicro.com/products/system/4U/8046/SYS-8046B-TRLF.cfm&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;&amp;gt;http://h20341.www2.hp.com/integrity/us/en/high-end/integrity-high-end-servers-superdome2.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;&amp;gt;http://www-03.ibm.com/systems/bladecenter/hardware/chassis/bladeht/index.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer&amp;quot;&amp;gt;http://top500.org/lists/2011/11/press-release&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/19-inch_rack&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;alewife&amp;quot;&amp;gt;http://webcache.googleusercontent.com/search?q=cache:-oLJbStOeAEJ:groups.csail.mit.edu/cag/pub/papers/chaiken-thesis.ps.Z+&amp;amp;cd=1&amp;amp;hl=en&amp;amp;ct=clnk&amp;amp;gl=us&amp;amp;client=firefox-a&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;blacklight&amp;quot;&amp;gt;http://www.psc.edu/machines/sgi/uv/blacklight.php&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;topology&amp;quot;&amp;gt;http://learn-networking.com/network-design/a-guide-to-network-topology&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62465</id>
		<title>CSC 456 Spring 2012/11b AB</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62465"/>
		<updated>2012-04-17T07:30:45Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: /* Network Topology */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Large-Scale Multiprocessors=&lt;br /&gt;
&lt;br /&gt;
==Manufacturers==&lt;br /&gt;
&lt;br /&gt;
In order to build a large-scale multiprocessor (LSM), you will need to choose the right processors, as well as the most appropriate cabinet(s) to place them in.  There are several different manufacturers of processors and cabinets that can be used in LSM configurations.  For example, Fujitsu's K computer (the number one ranked supercomputer on TOP500's November 2011 list) uses a configuration of 88,128 SPARC64 VIIIfx processors.  This means it has a total of 705,024 cores at its use&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.  Additional examples of processors used in LSMs can be found in Table 1.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 1: Processor Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Processor&lt;br /&gt;
! Cores&lt;br /&gt;
! Clock Rate&lt;br /&gt;
! Architecture&lt;br /&gt;
|-&lt;br /&gt;
| Fujitsu&lt;br /&gt;
| SPARC64 VIIIfx&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.0 GHz&lt;br /&gt;
| SPARC&lt;br /&gt;
|-&lt;br /&gt;
| Intel&lt;br /&gt;
| Xeon 7500&amp;lt;ref name=&amp;quot;intel proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 1.733-2.667 GHz&lt;br /&gt;
| Nehalem&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| POWER7&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.4-4.25 GHz&lt;br /&gt;
| Power ISA v.2.06&lt;br /&gt;
|-&lt;br /&gt;
| AMD&lt;br /&gt;
| Opteron 6100&amp;lt;ref name=&amp;quot;amd proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
| 1.7-2.4 GHz&lt;br /&gt;
| Direct Connect 2.0&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Like processors, different manufacturers offer varying cabinet/server types, such as IBM's BladeCenter HT.  This particular model uses their CoolBlue technology, a set of tools that allows the user to have greater control over cooling and power use.  There are also some standard cabinet frames, such as 19-inch racks, which get their name from the 19-inch panels used in their design.  Typically, these racks allow for easy processor/server installation and removal. &amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 2: Cabinet Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Cabinet &lt;br /&gt;
! Blade Count&lt;br /&gt;
|-&lt;br /&gt;
| SuperMicro&lt;br /&gt;
| MP Superserver 8064B-TRLF&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| HP&lt;br /&gt;
| Integrity Superdome 2&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 32&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| BladeCenter HT&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Assembling==&lt;br /&gt;
&lt;br /&gt;
===Network Topology===&lt;br /&gt;
[[File:network-topology.jpg|An example of possible network structures.&amp;lt;ref name = &amp;quot;topology&amp;quot;/&amp;gt;]]&lt;br /&gt;
&lt;br /&gt;
==Coherence==&lt;br /&gt;
&lt;br /&gt;
For LSMs that use a Distributed Shared Memory (DSM) architecture, cache coherence is an important issue. In 1990, researchers at the Massachusetts Institute of Technology showed that it was possible to build to build a coherent LSM using a directory-based approach with the Alewife multiprocessor &amp;lt;ref name=&amp;quot;alewife&amp;quot;/&amp;gt;.  A modern example is the Pittsburgh Supercomputing Center's Blacklight, a supercomputer with hardware-enabled shared coherent memory &amp;lt;ref name=&amp;quot;blacklight&amp;quot;/&amp;gt;.  &lt;br /&gt;
&lt;br /&gt;
On the other hand, some LSMs use distributed memory systems, meaning that each of the processors has its own private memory, making cache coherency a non-issue.  Fujitsu's K computer is an example of such a system &amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/SPARC64_VIIIfx&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Xeon#6500.2F7500-series_.22Beckton.22&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Power7&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Opteron#Opteron_.2845_nm_SOI.29&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;&amp;gt;http://www.supermicro.com/products/system/4U/8046/SYS-8046B-TRLF.cfm&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;&amp;gt;http://h20341.www2.hp.com/integrity/us/en/high-end/integrity-high-end-servers-superdome2.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;&amp;gt;http://www-03.ibm.com/systems/bladecenter/hardware/chassis/bladeht/index.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer&amp;quot;&amp;gt;http://top500.org/lists/2011/11/press-release&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/19-inch_rack&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;alewife&amp;quot;&amp;gt;http://webcache.googleusercontent.com/search?q=cache:-oLJbStOeAEJ:groups.csail.mit.edu/cag/pub/papers/chaiken-thesis.ps.Z+&amp;amp;cd=1&amp;amp;hl=en&amp;amp;ct=clnk&amp;amp;gl=us&amp;amp;client=firefox-a&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;blacklight&amp;quot;&amp;gt;http://www.psc.edu/machines/sgi/uv/blacklight.php&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;topology&amp;quot;&amp;gt;http://learn-networking.com/network-design/a-guide-to-network-topology&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62464</id>
		<title>CSC 456 Spring 2012/11b AB</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62464"/>
		<updated>2012-04-17T07:30:12Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Large-Scale Multiprocessors=&lt;br /&gt;
&lt;br /&gt;
==Manufacturers==&lt;br /&gt;
&lt;br /&gt;
In order to build a large-scale multiprocessor (LSM), you will need to choose the right processors, as well as the most appropriate cabinet(s) to place them in.  There are several different manufacturers of processors and cabinets that can be used in LSM configurations.  For example, Fujitsu's K computer (the number one ranked supercomputer on TOP500's November 2011 list) uses a configuration of 88,128 SPARC64 VIIIfx processors.  This means it has a total of 705,024 cores at its use&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.  Additional examples of processors used in LSMs can be found in Table 1.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 1: Processor Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Processor&lt;br /&gt;
! Cores&lt;br /&gt;
! Clock Rate&lt;br /&gt;
! Architecture&lt;br /&gt;
|-&lt;br /&gt;
| Fujitsu&lt;br /&gt;
| SPARC64 VIIIfx&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.0 GHz&lt;br /&gt;
| SPARC&lt;br /&gt;
|-&lt;br /&gt;
| Intel&lt;br /&gt;
| Xeon 7500&amp;lt;ref name=&amp;quot;intel proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 1.733-2.667 GHz&lt;br /&gt;
| Nehalem&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| POWER7&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.4-4.25 GHz&lt;br /&gt;
| Power ISA v.2.06&lt;br /&gt;
|-&lt;br /&gt;
| AMD&lt;br /&gt;
| Opteron 6100&amp;lt;ref name=&amp;quot;amd proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
| 1.7-2.4 GHz&lt;br /&gt;
| Direct Connect 2.0&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Like processors, different manufacturers offer varying cabinet/server types, such as IBM's BladeCenter HT.  This particular model uses their CoolBlue technology, a set of tools that allows the user to have greater control over cooling and power use.  There are also some standard cabinet frames, such as 19-inch racks, which get their name from the 19-inch panels used in their design.  Typically, these racks allow for easy processor/server installation and removal. &amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 2: Cabinet Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Cabinet &lt;br /&gt;
! Blade Count&lt;br /&gt;
|-&lt;br /&gt;
| SuperMicro&lt;br /&gt;
| MP Superserver 8064B-TRLF&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| HP&lt;br /&gt;
| Integrity Superdome 2&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 32&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| BladeCenter HT&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Assembling==&lt;br /&gt;
&lt;br /&gt;
===Network Topology===&lt;br /&gt;
[[File:network-topology.jpg|An example of possible network structures.]]&amp;lt;ref name = &amp;quot;topology&amp;quot;&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Coherence==&lt;br /&gt;
&lt;br /&gt;
For LSMs that use a Distributed Shared Memory (DSM) architecture, cache coherence is an important issue. In 1990, researchers at the Massachusetts Institute of Technology showed that it was possible to build to build a coherent LSM using a directory-based approach with the Alewife multiprocessor &amp;lt;ref name=&amp;quot;alewife&amp;quot;/&amp;gt;.  A modern example is the Pittsburgh Supercomputing Center's Blacklight, a supercomputer with hardware-enabled shared coherent memory &amp;lt;ref name=&amp;quot;blacklight&amp;quot;/&amp;gt;.  &lt;br /&gt;
&lt;br /&gt;
On the other hand, some LSMs use distributed memory systems, meaning that each of the processors has its own private memory, making cache coherency a non-issue.  Fujitsu's K computer is an example of such a system &amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/SPARC64_VIIIfx&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Xeon#6500.2F7500-series_.22Beckton.22&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Power7&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Opteron#Opteron_.2845_nm_SOI.29&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;&amp;gt;http://www.supermicro.com/products/system/4U/8046/SYS-8046B-TRLF.cfm&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;&amp;gt;http://h20341.www2.hp.com/integrity/us/en/high-end/integrity-high-end-servers-superdome2.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;&amp;gt;http://www-03.ibm.com/systems/bladecenter/hardware/chassis/bladeht/index.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer&amp;quot;&amp;gt;http://top500.org/lists/2011/11/press-release&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/19-inch_rack&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;alewife&amp;quot;&amp;gt;http://webcache.googleusercontent.com/search?q=cache:-oLJbStOeAEJ:groups.csail.mit.edu/cag/pub/papers/chaiken-thesis.ps.Z+&amp;amp;cd=1&amp;amp;hl=en&amp;amp;ct=clnk&amp;amp;gl=us&amp;amp;client=firefox-a&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;blacklight&amp;quot;&amp;gt;http://www.psc.edu/machines/sgi/uv/blacklight.php&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;topology&amp;quot;&amp;gt;http://learn-networking.com/network-design/a-guide-to-network-topology&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62463</id>
		<title>CSC 456 Spring 2012/11b AB</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62463"/>
		<updated>2012-04-17T07:29:36Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: /* Network Topology */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Large-Scale Multiprocessors=&lt;br /&gt;
&lt;br /&gt;
==Manufacturers==&lt;br /&gt;
&lt;br /&gt;
In order to build a large-scale multiprocessor (LSM), you will need to choose the right processors, as well as the most appropriate cabinet(s) to place them in.  There are several different manufacturers of processors and cabinets that can be used in LSM configurations.  For example, Fujitsu's K computer (the number one ranked supercomputer on TOP500's November 2011 list) uses a configuration of 88,128 SPARC64 VIIIfx processors.  This means it has a total of 705,024 cores at its use&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.  Additional examples of processors used in LSMs can be found in Table 1.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 1: Processor Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Processor&lt;br /&gt;
! Cores&lt;br /&gt;
! Clock Rate&lt;br /&gt;
! Architecture&lt;br /&gt;
|-&lt;br /&gt;
| Fujitsu&lt;br /&gt;
| SPARC64 VIIIfx&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.0 GHz&lt;br /&gt;
| SPARC&lt;br /&gt;
|-&lt;br /&gt;
| Intel&lt;br /&gt;
| Xeon 7500&amp;lt;ref name=&amp;quot;intel proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 1.733-2.667 GHz&lt;br /&gt;
| Nehalem&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| POWER7&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.4-4.25 GHz&lt;br /&gt;
| Power ISA v.2.06&lt;br /&gt;
|-&lt;br /&gt;
| AMD&lt;br /&gt;
| Opteron 6100&amp;lt;ref name=&amp;quot;amd proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
| 1.7-2.4 GHz&lt;br /&gt;
| Direct Connect 2.0&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Like processors, different manufacturers offer varying cabinet/server types, such as IBM's BladeCenter HT.  This particular model uses their CoolBlue technology, a set of tools that allows the user to have greater control over cooling and power use.  There are also some standard cabinet frames, such as 19-inch racks, which get their name from the 19-inch panels used in their design.  Typically, these racks allow for easy processor/server installation and removal. &amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 2: Cabinet Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Cabinet &lt;br /&gt;
! Blade Count&lt;br /&gt;
|-&lt;br /&gt;
| SuperMicro&lt;br /&gt;
| MP Superserver 8064B-TRLF&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| HP&lt;br /&gt;
| Integrity Superdome 2&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 32&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| BladeCenter HT&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Assembling==&lt;br /&gt;
&lt;br /&gt;
===Network Topology===&lt;br /&gt;
[[File:network-topology.jpg|An example of possible network structures.&amp;lt;ref name = &amp;quot;topology&amp;quot;&amp;gt;]]&lt;br /&gt;
&lt;br /&gt;
==Coherence==&lt;br /&gt;
&lt;br /&gt;
For LSMs that use a Distributed Shared Memory (DSM) architecture, cache coherence is an important issue. In 1990, researchers at the Massachusetts Institute of Technology showed that it was possible to build to build a coherent LSM using a directory-based approach with the Alewife multiprocessor &amp;lt;ref name=&amp;quot;alewife&amp;quot;/&amp;gt;.  A modern example is the Pittsburgh Supercomputing Center's Blacklight, a supercomputer with hardware-enabled shared coherent memory &amp;lt;ref name=&amp;quot;blacklight&amp;quot;/&amp;gt;.  &lt;br /&gt;
&lt;br /&gt;
On the other hand, some LSMs use distributed memory systems, meaning that each of the processors has its own private memory, making cache coherency a non-issue.  Fujitsu's K computer is an example of such a system &amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/SPARC64_VIIIfx&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Xeon#6500.2F7500-series_.22Beckton.22&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Power7&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Opteron#Opteron_.2845_nm_SOI.29&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;&amp;gt;http://www.supermicro.com/products/system/4U/8046/SYS-8046B-TRLF.cfm&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;&amp;gt;http://h20341.www2.hp.com/integrity/us/en/high-end/integrity-high-end-servers-superdome2.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;&amp;gt;http://www-03.ibm.com/systems/bladecenter/hardware/chassis/bladeht/index.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer&amp;quot;&amp;gt;http://top500.org/lists/2011/11/press-release&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/19-inch_rack&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;alewife&amp;quot;&amp;gt;http://webcache.googleusercontent.com/search?q=cache:-oLJbStOeAEJ:groups.csail.mit.edu/cag/pub/papers/chaiken-thesis.ps.Z+&amp;amp;cd=1&amp;amp;hl=en&amp;amp;ct=clnk&amp;amp;gl=us&amp;amp;client=firefox-a&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;blacklight&amp;quot;&amp;gt;http://www.psc.edu/machines/sgi/uv/blacklight.php&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;topology&amp;quot;&amp;gt;http://learn-networking.com/network-design/a-guide-to-network-topology&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62462</id>
		<title>CSC 456 Spring 2012/11b AB</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62462"/>
		<updated>2012-04-17T07:29:08Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: /* References */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Large-Scale Multiprocessors=&lt;br /&gt;
&lt;br /&gt;
==Manufacturers==&lt;br /&gt;
&lt;br /&gt;
In order to build a large-scale multiprocessor (LSM), you will need to choose the right processors, as well as the most appropriate cabinet(s) to place them in.  There are several different manufacturers of processors and cabinets that can be used in LSM configurations.  For example, Fujitsu's K computer (the number one ranked supercomputer on TOP500's November 2011 list) uses a configuration of 88,128 SPARC64 VIIIfx processors.  This means it has a total of 705,024 cores at its use&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.  Additional examples of processors used in LSMs can be found in Table 1.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 1: Processor Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Processor&lt;br /&gt;
! Cores&lt;br /&gt;
! Clock Rate&lt;br /&gt;
! Architecture&lt;br /&gt;
|-&lt;br /&gt;
| Fujitsu&lt;br /&gt;
| SPARC64 VIIIfx&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.0 GHz&lt;br /&gt;
| SPARC&lt;br /&gt;
|-&lt;br /&gt;
| Intel&lt;br /&gt;
| Xeon 7500&amp;lt;ref name=&amp;quot;intel proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 1.733-2.667 GHz&lt;br /&gt;
| Nehalem&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| POWER7&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.4-4.25 GHz&lt;br /&gt;
| Power ISA v.2.06&lt;br /&gt;
|-&lt;br /&gt;
| AMD&lt;br /&gt;
| Opteron 6100&amp;lt;ref name=&amp;quot;amd proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
| 1.7-2.4 GHz&lt;br /&gt;
| Direct Connect 2.0&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Like processors, different manufacturers offer varying cabinet/server types, such as IBM's BladeCenter HT.  This particular model uses their CoolBlue technology, a set of tools that allows the user to have greater control over cooling and power use.  There are also some standard cabinet frames, such as 19-inch racks, which get their name from the 19-inch panels used in their design.  Typically, these racks allow for easy processor/server installation and removal. &amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 2: Cabinet Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Cabinet &lt;br /&gt;
! Blade Count&lt;br /&gt;
|-&lt;br /&gt;
| SuperMicro&lt;br /&gt;
| MP Superserver 8064B-TRLF&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| HP&lt;br /&gt;
| Integrity Superdome 2&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 32&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| BladeCenter HT&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Assembling==&lt;br /&gt;
&lt;br /&gt;
===Network Topology===&lt;br /&gt;
[[File:network-topology.jpg|An example of possible network structures.]]&lt;br /&gt;
&lt;br /&gt;
==Coherence==&lt;br /&gt;
&lt;br /&gt;
For LSMs that use a Distributed Shared Memory (DSM) architecture, cache coherence is an important issue. In 1990, researchers at the Massachusetts Institute of Technology showed that it was possible to build to build a coherent LSM using a directory-based approach with the Alewife multiprocessor &amp;lt;ref name=&amp;quot;alewife&amp;quot;/&amp;gt;.  A modern example is the Pittsburgh Supercomputing Center's Blacklight, a supercomputer with hardware-enabled shared coherent memory &amp;lt;ref name=&amp;quot;blacklight&amp;quot;/&amp;gt;.  &lt;br /&gt;
&lt;br /&gt;
On the other hand, some LSMs use distributed memory systems, meaning that each of the processors has its own private memory, making cache coherency a non-issue.  Fujitsu's K computer is an example of such a system &amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/SPARC64_VIIIfx&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Xeon#6500.2F7500-series_.22Beckton.22&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Power7&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Opteron#Opteron_.2845_nm_SOI.29&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;&amp;gt;http://www.supermicro.com/products/system/4U/8046/SYS-8046B-TRLF.cfm&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;&amp;gt;http://h20341.www2.hp.com/integrity/us/en/high-end/integrity-high-end-servers-superdome2.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;&amp;gt;http://www-03.ibm.com/systems/bladecenter/hardware/chassis/bladeht/index.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer&amp;quot;&amp;gt;http://top500.org/lists/2011/11/press-release&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/19-inch_rack&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;alewife&amp;quot;&amp;gt;http://webcache.googleusercontent.com/search?q=cache:-oLJbStOeAEJ:groups.csail.mit.edu/cag/pub/papers/chaiken-thesis.ps.Z+&amp;amp;cd=1&amp;amp;hl=en&amp;amp;ct=clnk&amp;amp;gl=us&amp;amp;client=firefox-a&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;blacklight&amp;quot;&amp;gt;http://www.psc.edu/machines/sgi/uv/blacklight.php&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;topology&amp;quot;&amp;gt;http://learn-networking.com/network-design/a-guide-to-network-topology&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=File:Network-topology.jpg&amp;diff=62461</id>
		<title>File:Network-topology.jpg</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=File:Network-topology.jpg&amp;diff=62461"/>
		<updated>2012-04-17T07:28:02Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62460</id>
		<title>CSC 456 Spring 2012/11b AB</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/11b_AB&amp;diff=62460"/>
		<updated>2012-04-17T07:27:30Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: /* Network Topology */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;=Large-Scale Multiprocessors=&lt;br /&gt;
&lt;br /&gt;
==Manufacturers==&lt;br /&gt;
&lt;br /&gt;
In order to build a large-scale multiprocessor (LSM), you will need to choose the right processors, as well as the most appropriate cabinet(s) to place them in.  There are several different manufacturers of processors and cabinets that can be used in LSM configurations.  For example, Fujitsu's K computer (the number one ranked supercomputer on TOP500's November 2011 list) uses a configuration of 88,128 SPARC64 VIIIfx processors.  This means it has a total of 705,024 cores at its use&amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.  Additional examples of processors used in LSMs can be found in Table 1.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 1: Processor Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Processor&lt;br /&gt;
! Cores&lt;br /&gt;
! Clock Rate&lt;br /&gt;
! Architecture&lt;br /&gt;
|-&lt;br /&gt;
| Fujitsu&lt;br /&gt;
| SPARC64 VIIIfx&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.0 GHz&lt;br /&gt;
| SPARC&lt;br /&gt;
|-&lt;br /&gt;
| Intel&lt;br /&gt;
| Xeon 7500&amp;lt;ref name=&amp;quot;intel proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 1.733-2.667 GHz&lt;br /&gt;
| Nehalem&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| POWER7&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 8&lt;br /&gt;
| 2.4-4.25 GHz&lt;br /&gt;
| Power ISA v.2.06&lt;br /&gt;
|-&lt;br /&gt;
| AMD&lt;br /&gt;
| Opteron 6100&amp;lt;ref name=&amp;quot;amd proc&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
| 1.7-2.4 GHz&lt;br /&gt;
| Direct Connect 2.0&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
Like processors, different manufacturers offer varying cabinet/server types, such as IBM's BladeCenter HT.  This particular model uses their CoolBlue technology, a set of tools that allows the user to have greater control over cooling and power use.  There are also some standard cabinet frames, such as 19-inch racks, which get their name from the 19-inch panels used in their design.  Typically, these racks allow for easy processor/server installation and removal. &amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|+ Table 2: Cabinet Manufacturers&lt;br /&gt;
|-&lt;br /&gt;
! Manufacturer&lt;br /&gt;
! Cabinet &lt;br /&gt;
! Blade Count&lt;br /&gt;
|-&lt;br /&gt;
| SuperMicro&lt;br /&gt;
| MP Superserver 8064B-TRLF&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 4&lt;br /&gt;
|-&lt;br /&gt;
| HP&lt;br /&gt;
| Integrity Superdome 2&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 32&lt;br /&gt;
|-&lt;br /&gt;
| IBM&lt;br /&gt;
| BladeCenter HT&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;/&amp;gt;&lt;br /&gt;
| 12&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Assembling==&lt;br /&gt;
&lt;br /&gt;
===Network Topology===&lt;br /&gt;
[[File:network-topology.jpg|An example of possible network structures.]]&lt;br /&gt;
&lt;br /&gt;
==Coherence==&lt;br /&gt;
&lt;br /&gt;
For LSMs that use a Distributed Shared Memory (DSM) architecture, cache coherence is an important issue. In 1990, researchers at the Massachusetts Institute of Technology showed that it was possible to build to build a coherent LSM using a directory-based approach with the Alewife multiprocessor &amp;lt;ref name=&amp;quot;alewife&amp;quot;/&amp;gt;.  A modern example is the Pittsburgh Supercomputing Center's Blacklight, a supercomputer with hardware-enabled shared coherent memory &amp;lt;ref name=&amp;quot;blacklight&amp;quot;/&amp;gt;.  &lt;br /&gt;
&lt;br /&gt;
On the other hand, some LSMs use distributed memory systems, meaning that each of the processors has its own private memory, making cache coherency a non-issue.  Fujitsu's K computer is an example of such a system &amp;lt;ref name=&amp;quot;k computer&amp;quot;/&amp;gt;.&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;fujitsu proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/SPARC64_VIIIfx&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Xeon#6500.2F7500-series_.22Beckton.22&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Power7&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd proc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Opteron#Opteron_.2845_nm_SOI.29&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;supermicro chassis&amp;quot;&amp;gt;http://www.supermicro.com/products/system/4U/8046/SYS-8046B-TRLF.cfm&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;hp chassis&amp;quot;&amp;gt;http://h20341.www2.hp.com/integrity/us/en/high-end/integrity-high-end-servers-superdome2.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ibm chassis&amp;quot;&amp;gt;http://www-03.ibm.com/systems/bladecenter/hardware/chassis/bladeht/index.html&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;k computer&amp;quot;&amp;gt;http://top500.org/lists/2011/11/press-release&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;19 inch rack&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/19-inch_rack&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;alewife&amp;quot;&amp;gt;http://webcache.googleusercontent.com/search?q=cache:-oLJbStOeAEJ:groups.csail.mit.edu/cag/pub/papers/chaiken-thesis.ps.Z+&amp;amp;cd=1&amp;amp;hl=en&amp;amp;ct=clnk&amp;amp;gl=us&amp;amp;client=firefox-a&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;blacklight&amp;quot;&amp;gt;http://www.psc.edu/machines/sgi/uv/blacklight.php&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=Chapter_4a:_Brandon_Chisholm,_Chris_Barile&amp;diff=60700</id>
		<title>Chapter 4a: Brandon Chisholm, Chris Barile</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=Chapter_4a:_Brandon_Chisholm,_Chris_Barile&amp;diff=60700"/>
		<updated>2012-03-29T03:38:38Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Automatic Parallelism is the process of automatically converting sequential code into code that will make use of multiple processors. One main reason for implementing automatic parallelism is to save time and energy compared to converting the code manually.&amp;lt;ref name=&amp;quot;wiki&amp;quot; /&amp;gt; There are several techniques that have been created for parallelizing code, but each has limitations.&lt;br /&gt;
&lt;br /&gt;
== Techniques ==&lt;br /&gt;
===Profile-Driven Parallelism===&lt;br /&gt;
This technique involves running the program while recording each memory access in detail. The memory accesses are then looked at to determine what code can be parallelized. This technique is simple, but does not cover all the possible input combinations. These may lead to false positives in which code segments can be labelled as parallelizeable when they actually aren't.&amp;lt;ref name=&amp;quot;chia&amp;quot; /&amp;gt;&lt;br /&gt;
===Polyhedral Transformation===&lt;br /&gt;
Polyhedral transformation involves representing loop iterations from the source code as lattice points in something called a polytope. This polytope is then transformed into a more optimized form. This approach is limited when pointers are found inside loops. The following pictures represent a polytope before and after the transformation.&amp;lt;ref name=&amp;quot;chia&amp;quot; /&amp;gt;&lt;br /&gt;
[[File:polytope_model.png|thumb|center|200px|Polytope model, unskewed&amp;lt;ref name=&amp;quot;polytope&amp;quot;/&amp;gt;]]&lt;br /&gt;
[[File:polytope_model_skewed.png|thumb|center|200px|Polytope model, skewed&amp;lt;ref name=&amp;quot;polytope&amp;quot;/&amp;gt;]]&lt;br /&gt;
&lt;br /&gt;
===Automatic Program Exploration===&lt;br /&gt;
Also known as &amp;quot;concolic execution&amp;quot;, &amp;quot;mixed concrete and symbolic execution&amp;quot;, or &amp;quot;symbolic execution&amp;quot;, automatic program exploration is originally a bug finding technique that provides full coverage of the program. The first step in this approach is to symbolically represent the inputs to the program. Then, each statement that has a symbolic variable is executed by symbolic manipulation. If a branch condition is found with a symbolic variable, then both paths are conceptually taken. The last step is using the path constraints to find the concrete values that allow the program to go down both paths. This is done until there are no more possible inputs. &amp;lt;ref name=&amp;quot;chia&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Scalar and Array Analysis===&lt;br /&gt;
Scalar and Array Analysis are actually two different forms of analysis that are often used together. Scalar Analysis checks for dependencies between scalar variables in code. A dependency can be defined as &amp;quot;when a memory location written on one iteration of a loop is accessed (read or write) on a different iteration.&amp;quot;&amp;lt;ref name=&amp;quot;dipa&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Array Analysis is then used on code that could not be parallelized using Scalar Analysis. Array Analysis looks for arrays that can be privatized.&amp;lt;ref name=&amp;quot;dipa&amp;quot; /&amp;gt; Privatization means to allow each thread its own private copy of the array in question, in whole or in part. This is only possible if there are no dependencies in the array.&amp;lt;ref name=&amp;quot;dipa&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Commutativity Analysis===&lt;br /&gt;
One problem in parallelizing code is that synchronization may be required in some programs to prevent errors in computation. Commutativity Analysis is used to determine which operations can be performed out of order without affecting the results of the code. It is based on the mathematical concept of commutativity, where if the result of running two operations is the same, regardless of which order they are performed in.&amp;lt;ref name=&amp;quot;dipa&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
===Low Level Virtual Machine===&lt;br /&gt;
LLVM works by taking C/C++ code and compiling it into LLVM IR bytecode. Optimizations are done at the IR level and then a code generator brings it back into native code to be executed.&amp;lt;ref name=&amp;quot;chia&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== Limitations ==&lt;br /&gt;
One of the major limitations of automatic parallelization is that a computer lacks the insight into the overall intention of a program that a human would have. The programmer understands what the program must do, and can use that to determine if there are alternate approaches or algorithms for parallelizing the code. Even when parallelizing manually, a programmer may not have enough insight into parallel programming, and will need the assistance of an expert to improve code performance.&amp;lt;ref name=&amp;quot;ncsa&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;chia&amp;quot;&amp;gt;http://www.eecs.berkeley.edu/%7Echiayuan/cs262a/cs262a_parallel.pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;dipa&amp;quot;&amp;gt;http://www.csc.villanova.edu/%7Etway/publications/DiPasquale_Masplas05_Paper5.pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;wiki&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Automatic_parallelization&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ncsa&amp;quot;&amp;gt;http://www.ncsa.illinois.edu/extremeideas/site/on_the_limits_of_automatic_parallelization&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;polytope&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Polytope_model&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=Chapter_4a:_Brandon_Chisholm,_Chris_Barile&amp;diff=60367</id>
		<title>Chapter 4a: Brandon Chisholm, Chris Barile</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=Chapter_4a:_Brandon_Chisholm,_Chris_Barile&amp;diff=60367"/>
		<updated>2012-03-21T13:52:47Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: Added &amp;quot;LLVM&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Automatic Parallelism is the process of automatically converting sequential code into code that will make use of multiple processors. One main reason for implementing automatic parallelism is to save time and energy compared to converting the code manually.&amp;lt;ref name=&amp;quot;wiki&amp;quot; /&amp;gt; There are several techniques that have been created for parallelizing code, but each has limitations.&lt;br /&gt;
&lt;br /&gt;
== Techniques ==&lt;br /&gt;
===Profile-Driven Parallelism===&lt;br /&gt;
This technique involves running the program while recording each memory access in detail. The memory accesses are then looked at to determine what code can be parallelized. This technique is simple, but does not cover all the possible input combinations. These may lead to false positives in which code segments can be labelled as parallelizeable when they actually aren't.&amp;lt;ref name=&amp;quot;chia&amp;quot; /&amp;gt;&lt;br /&gt;
===Polyhedral Transformation===&lt;br /&gt;
Polyhedral transformation involves representing loop iterations from the source code as lattice points in something called a polytope. This polytope is then transformed into a more optimized form. This approach is limited when pointers are found inside loops. The following pictures represent a polytope before and after the transformation.&amp;lt;ref name=&amp;quot;chia&amp;quot; /&amp;gt;&lt;br /&gt;
[[File:polytope_model.png|thumb|center|200px|Polytope model, unskewed]]&lt;br /&gt;
[[File:polytope_model_skewed.png|thumb|center|200px|Polytope model, skewed]]&lt;br /&gt;
&lt;br /&gt;
===Automatic Program Exploration===&lt;br /&gt;
Also known as &amp;quot;concolic execution&amp;quot;, &amp;quot;mixed concrete and symbolic execution&amp;quot;, or &amp;quot;symbolic execution&amp;quot;, automatic program exploration is originally a bug finding technique that provides full coverage of the program. The firs0 tstep in this approach is to symbolically represent the inputs to the program. Then, each statement that has a symbolic variable is executed by symbolic manipulation. If a branch condition is found with a symbolic variable, then both paths are conceptually taken. The last step is using the path constraints to find the concrete values that allow the program to go down both paths. This is done until there are no more possible inputs. &amp;lt;ref name=&amp;quot;chia&amp;quot; /&amp;gt;&lt;br /&gt;
===Scalar and Array Analysis===&lt;br /&gt;
===Commutativity Analysis===&lt;br /&gt;
===Low Level Virtual Machine===&lt;br /&gt;
LLVM works by taking C/C++ code and compiling it into LLVM IR bytecode. Optimizations are done at the IR level and then a code generator brings it back into native code to be executed.&lt;br /&gt;
&lt;br /&gt;
== Limitations ==&lt;br /&gt;
One of the major limitations of automatic parallelization is that a computer lacks the insight into the overall intention of a program that a human would have. The programmer understands what the program must do, and can use that to determine if there are alternate approaches or algorithms for parallelizing the code. Even when parallelizing manually, a programmer may not have enough insight into parallel programming, and will need the assistance of an expert to improve code performance.&amp;lt;ref name=&amp;quot;ncsa&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;chia&amp;quot;&amp;gt;http://www.eecs.berkeley.edu/%7Echiayuan/cs262a/cs262a_parallel.pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;dipa&amp;quot;&amp;gt;http://www.csc.villanova.edu/%7Etway/publications/DiPasquale_Masplas05_Paper5.pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;wiki&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Automatic_parallelization&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ncsa&amp;quot;&amp;gt;http://www.ncsa.illinois.edu/extremeideas/site/on_the_limits_of_automatic_parallelization&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=Chapter_4a:_Brandon_Chisholm,_Chris_Barile&amp;diff=60366</id>
		<title>Chapter 4a: Brandon Chisholm, Chris Barile</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=Chapter_4a:_Brandon_Chisholm,_Chris_Barile&amp;diff=60366"/>
		<updated>2012-03-21T13:34:33Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Automatic Parallelism is the process of automatically converting sequential code into code that will make use of multiple processors. One main reason for implementing automatic parallelism is to save time and energy compared to converting the code manually.&amp;lt;ref name=&amp;quot;wiki&amp;quot; /&amp;gt; There are several techniques that have been created for parallelizing code, but each has limitations.&lt;br /&gt;
&lt;br /&gt;
== Techniques ==&lt;br /&gt;
===Profile-Driven Parallelism===&lt;br /&gt;
This technique involves running the program while recording each memory access in detail. The memory accesses are then looked at to determine what code can be parallelized. This technique is simple, but does not cover all the possible input combinations. These may lead to false positives in which code segments can be labelled as parallelizeable when they actually aren't.&amp;lt;ref name=&amp;quot;chia&amp;quot; /&amp;gt;&lt;br /&gt;
===Polyhedral Transformation===&lt;br /&gt;
Polyhedral transformation involves representing loop iterations from the source code as lattice points in something called a polytope. This polytope is then transformed into a more optimized form. This approach is limited when pointers are found inside loops. The following pictures represent a polytope before and after the transformation.&amp;lt;ref name=&amp;quot;chia&amp;quot; /&amp;gt;&lt;br /&gt;
[[File:polytope_model.png|thumb|200px|Polytope model, unskewed]]&lt;br /&gt;
[[File:polytope_model_skewed.png|thumb|200px|Polytope model, skewed]]&lt;br /&gt;
&lt;br /&gt;
===Automatic Program Exploration===&lt;br /&gt;
Also known as &amp;quot;concolic execution&amp;quot;, &amp;quot;mixed concrete and symbolic execution&amp;quot;, or &amp;quot;symbolic execution&amp;quot;, automatic program exploration is originally a bug finding technique that provides full coverage of the program. The firs0 tstep in this approach is to symbolically represent the inputs to the program. Then, each statement that has a symbolic variable is executed by symbolic manipulation. If a branch condition is found with a symbolic variable, then both paths are conceptually taken. The last step is using the path constraints to find the concrete values that allow the program to go down both paths. This is done until there are no more possible inputs. &amp;lt;ref name=&amp;quot;chia&amp;quot; /&amp;gt;&lt;br /&gt;
===Scalar and Array Analysis===&lt;br /&gt;
===Commutativity Analysis===&lt;br /&gt;
===Low Level Virtual Machine===&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Limitations ==&lt;br /&gt;
One of the major limitations of automatic parallelization is that a computer lacks the insight into the overall intention of a program that a human would have. The programmer understands what the program must do, and can use that to determine if there are alternate approaches or algorithms for parallelizing the code. Even when parallelizing manually, a programmer may not have enough insight into parallel programming, and will need the assistance of an expert to improve code performance.&amp;lt;ref name=&amp;quot;ncsa&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;chia&amp;quot;&amp;gt;http://www.eecs.berkeley.edu/%7Echiayuan/cs262a/cs262a_parallel.pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;dipa&amp;quot;&amp;gt;http://www.csc.villanova.edu/%7Etway/publications/DiPasquale_Masplas05_Paper5.pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;wiki&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Automatic_parallelization&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ncsa&amp;quot;&amp;gt;http://www.ncsa.illinois.edu/extremeideas/site/on_the_limits_of_automatic_parallelization&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=Chapter_4a:_Brandon_Chisholm,_Chris_Barile&amp;diff=60365</id>
		<title>Chapter 4a: Brandon Chisholm, Chris Barile</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=Chapter_4a:_Brandon_Chisholm,_Chris_Barile&amp;diff=60365"/>
		<updated>2012-03-21T13:32:38Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Automatic Parallelism is the process of automatically converting sequential code into code that will make use of multiple processors. One main reason for implementing automatic parallelism is to save time and energy compared to converting the code manually.&amp;lt;ref name=&amp;quot;wiki&amp;quot; /&amp;gt; There are several techniques that have been created for parallelizing code, but each has limitations.&lt;br /&gt;
&lt;br /&gt;
== Techniques ==&lt;br /&gt;
===Profile-Driven Parallelism===&lt;br /&gt;
This technique involves running the program while recording each memory access in detail. The memory accesses are then looked at to determine what code can be parallelized. This technique is simple, but does not cover all the possible input combinations. These may lead to false positives in which code segments can be labelled as parallelizeable when they actually aren't.&amp;lt;ref name=&amp;quot;chia&amp;quot; /&amp;gt;&lt;br /&gt;
===Polyhedral Transformation===&lt;br /&gt;
Polyhedral transformation involves representing loop iterations from the source code as lattice points in something called a polytope. This polytope is then transformed into a more optimized form. This approach is limited when pointers are found inside loops. The following pictures represent a polytope before and after the transformation.&amp;lt;ref name=&amp;quot;chia&amp;quot; /&amp;gt;&lt;br /&gt;
[[File:polytope_model.png|thumb|right|200px|Polytope model, unskewed]]&lt;br /&gt;
[[File:polytope_model_skewed.png|thumb|right|200px|Polytope model, skewed]]&lt;br /&gt;
&lt;br /&gt;
===Automatic Program Exploration===&lt;br /&gt;
Also known as &amp;quot;concolic execution&amp;quot;, &amp;quot;mixed concrete and symbolic execution&amp;quot;, or &amp;quot;symbolic execution&amp;quot;, automatic program exploration is originally a bug finding technique that provides full coverage of the program. The firs0 tstep in this approach is to symbolically represent the inputs to the program. Then, each statement that has a symbolic variable is executed by symbolic manipulation. If a branch condition is found with a symbolic variable, then both paths are conceptually taken. The last step is using the path constraints to find the concrete values that allow the program to go down both paths. This is done until there are no more possible inputs. &amp;lt;ref name=&amp;quot;chia&amp;quot; /&amp;gt;&lt;br /&gt;
===Scalar and Array Analysis===&lt;br /&gt;
===Commutativity Analysis===&lt;br /&gt;
===Low Level Virtual Machine===&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Limitations ==&lt;br /&gt;
One of the major limitations of automatic parallelization is that a computer lacks the insight into the overall intention of a program that a human would have. The programmer understands what the program must do, and can use that to determine if there are alternate approaches or algorithms for parallelizing the code. Even when parallelizing manually, a programmer may not have enough insight into parallel programming, and will need the assistance of an expert to improve code performance.&amp;lt;ref name=&amp;quot;ncsa&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;chia&amp;quot;&amp;gt;http://www.eecs.berkeley.edu/%7Echiayuan/cs262a/cs262a_parallel.pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;dipa&amp;quot;&amp;gt;http://www.csc.villanova.edu/%7Etway/publications/DiPasquale_Masplas05_Paper5.pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;wiki&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Automatic_parallelization&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ncsa&amp;quot;&amp;gt;http://www.ncsa.illinois.edu/extremeideas/site/on_the_limits_of_automatic_parallelization&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=Chapter_4a:_Brandon_Chisholm,_Chris_Barile&amp;diff=60364</id>
		<title>Chapter 4a: Brandon Chisholm, Chris Barile</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=Chapter_4a:_Brandon_Chisholm,_Chris_Barile&amp;diff=60364"/>
		<updated>2012-03-21T13:32:12Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: Added &amp;quot;Automatic program exploration&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Automatic Parallelism is the process of automatically converting sequential code into code that will make use of multiple processors. One main reason for implementing automatic parallelism is to save time and energy compared to converting the code manually.&amp;lt;ref name=&amp;quot;wiki&amp;quot; /&amp;gt; There are several techniques that have been created for parallelizing code, but each has limitations.&lt;br /&gt;
&lt;br /&gt;
== Techniques ==&lt;br /&gt;
===Profile-Driven Parallelism===&lt;br /&gt;
This technique involves running the program while recording each memory access in detail. The memory accesses are then looked at to determine what code can be parallelized. This technique is simple, but does not cover all the possible input combinations. These may lead to false positives in which code segments can be labelled as parallelizeable when they actually aren't.&amp;lt;ref name=&amp;quot;chia&amp;quot; /&amp;gt;&lt;br /&gt;
===Polyhedral Transformation===&lt;br /&gt;
Polyhedral transformation involves representing loop iterations from the source code as lattice points in something called a polytope. This polytope is then transformed into a more optimized form. This approach is limited when pointers are found inside loops. The following pictures represent a polytope before and after the transformation.&amp;lt;ref name=&amp;quot;chia&amp;quot; /&amp;gt;&lt;br /&gt;
[[File:polytope_model.png|thumb|right|200px|Polytope model, unskewed]]&lt;br /&gt;
[[File:polytope_model_skewed.png|thumb|right|200px|Polytope model, skewed]]&lt;br /&gt;
&lt;br /&gt;
===Automatic Program Exploration===&lt;br /&gt;
Also known as &amp;quot;concolic execution&amp;quot;, &amp;quot;mixed concrete and symbolic execution&amp;quot;, or &amp;quot;symbolic execution&amp;quot;, automatic program exploration is originally a bug finding technique that provides full coverage of the program. The firs tstep in this approach is to symbolically represent the inputs to the program. Then, each statement that has a symbolic variable is executed by symbolic manipulation. If a branch condition is found with a symbolic variable, then both paths are conceptually taken. The last step is using the path constraints to find the concrete values that allow the program to go down both paths. This is done until there are no more possible inputs. &lt;br /&gt;
===Scalar and Array Analysis===&lt;br /&gt;
===Commutativity Analysis===&lt;br /&gt;
===Low Level Virtual Machine===&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Limitations ==&lt;br /&gt;
One of the major limitations of automatic parallelization is that a computer lacks the insight into the overall intention of a program that a human would have. The programmer understands what the program must do, and can use that to determine if there are alternate approaches or algorithms for parallelizing the code. Even when parallelizing manually, a programmer may not have enough insight into parallel programming, and will need the assistance of an expert to improve code performance.&amp;lt;ref name=&amp;quot;ncsa&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;chia&amp;quot;&amp;gt;http://www.eecs.berkeley.edu/%7Echiayuan/cs262a/cs262a_parallel.pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;dipa&amp;quot;&amp;gt;http://www.csc.villanova.edu/%7Etway/publications/DiPasquale_Masplas05_Paper5.pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;wiki&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Automatic_parallelization&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ncsa&amp;quot;&amp;gt;http://www.ncsa.illinois.edu/extremeideas/site/on_the_limits_of_automatic_parallelization&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=Chapter_4a:_Brandon_Chisholm,_Chris_Barile&amp;diff=60363</id>
		<title>Chapter 4a: Brandon Chisholm, Chris Barile</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=Chapter_4a:_Brandon_Chisholm,_Chris_Barile&amp;diff=60363"/>
		<updated>2012-03-21T13:18:40Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Automatic Parallelism is the process of automatically converting sequential code into code that will make use of multiple processors. One main reason for implementing automatic parallelism is to save time and energy compared to converting the code manually.&amp;lt;ref name=&amp;quot;wiki&amp;quot; /&amp;gt; There are several techniques that have been created for parallelizing code, but each has limitations.&lt;br /&gt;
&lt;br /&gt;
== Techniques ==&lt;br /&gt;
===Profile-Driven Parallelism===&lt;br /&gt;
This technique involves running the program while recording each memory access in detail. The memory accesses are then looked at to determine what code can be parallelized. This technique is simple, but does not cover all the possible input combinations. These may lead to false positives in which code segments can be labelled as parallelizeable when they actually aren't.&lt;br /&gt;
===Polyhedral Transformation===&lt;br /&gt;
Polyhedral transformation involves representing loop iterations from the source code as lattice points in something called a polytope. This polytope is then transformed into a more optimized form. This approach is limited when pointers are found inside loops.&lt;br /&gt;
[[File:polytope_model.png|thumb|right|200px|Polytope model, unskewed]]&lt;br /&gt;
[[File:polytope_model_skewed.png|thumb|right|200px|Polytope model, skewed]]&lt;br /&gt;
&lt;br /&gt;
===Automatic Program Exploration===&lt;br /&gt;
===Scalar and Array Analysis===&lt;br /&gt;
===Commutativity Analysis===&lt;br /&gt;
===Low Level Virtual Machine===&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Limitations ==&lt;br /&gt;
One of the major limitations of automatic parallelization is that a computer lacks the insight into the overall intention of a program that a human would have. The programmer understands what the program must do, and can use that to determine if there are alternate approaches or algorithms for parallelizing the code. Even when parallelizing manually, a programmer may not have enough insight into parallel programming, and will need the assistance of an expert to improve code performance.&amp;lt;ref name=&amp;quot;ncsa&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;chia&amp;quot;&amp;gt;http://www.eecs.berkeley.edu/%7Echiayuan/cs262a/cs262a_parallel.pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;dipa&amp;quot;&amp;gt;http://www.csc.villanova.edu/%7Etway/publications/DiPasquale_Masplas05_Paper5.pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;wiki&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Automatic_parallelization&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ncsa&amp;quot;&amp;gt;http://www.ncsa.illinois.edu/extremeideas/site/on_the_limits_of_automatic_parallelization&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=Chapter_4a:_Brandon_Chisholm,_Chris_Barile&amp;diff=60362</id>
		<title>Chapter 4a: Brandon Chisholm, Chris Barile</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=Chapter_4a:_Brandon_Chisholm,_Chris_Barile&amp;diff=60362"/>
		<updated>2012-03-21T13:10:23Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: Added &amp;quot;profile-driven&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Automatic Parallelism is the process of automatically converting sequential code into code that will make use of multiple processors. One main reason for implementing automatic parallelism is to save time and energy compared to converting the code manually.&amp;lt;ref name=&amp;quot;wiki&amp;quot; /&amp;gt; There are several techniques that have been created for parallelizing code, but each has limitations.&lt;br /&gt;
&lt;br /&gt;
== Techniques ==&lt;br /&gt;
===Profile-Driven Parallelism===&lt;br /&gt;
This technique involves running the program while closely looking at each memory access. The memory accesses are then looked at to determine what code can be parallelized. This technique is simple, but does not cover all the possible input combinations. These may lead to false positives in which code segments can be labelled as parallelizeable when they actually aren't.&lt;br /&gt;
===Polyhedral Transformation===&lt;br /&gt;
[[File:polytope_model.png|thumb|right|200px|Polytope model, unskewed]]&lt;br /&gt;
[[File:polytope_model_skewed.png|thumb|right|200px|Polytope model, skewed]]&lt;br /&gt;
&lt;br /&gt;
===Automatic Program Exploration===&lt;br /&gt;
===Scalar and Array Analysis===&lt;br /&gt;
===Commutativity Analysis===&lt;br /&gt;
===Low Level Virtual Machine===&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Limitations ==&lt;br /&gt;
One of the major limitations of automatic parallelization is that a computer lacks the insight into the overall intention of a program that a human would have. The programmer understands what the program must do, and can use that to determine if there are alternate approaches or algorithms for parallelizing the code. Even when parallelizing manually, a programmer may not have enough insight into parallel programming, and will need the assistance of an expert to improve code performance.&amp;lt;ref name=&amp;quot;ncsa&amp;quot; /&amp;gt;&lt;br /&gt;
&lt;br /&gt;
== References ==&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;chia&amp;quot;&amp;gt;http://www.eecs.berkeley.edu/%7Echiayuan/cs262a/cs262a_parallel.pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;dipa&amp;quot;&amp;gt;http://www.csc.villanova.edu/%7Etway/publications/DiPasquale_Masplas05_Paper5.pdf&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;wiki&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Automatic_parallelization&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;ncsa&amp;quot;&amp;gt;http://www.ncsa.illinois.edu/extremeideas/site/on_the_limits_of_automatic_parallelization&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58969</id>
		<title>CSC 456 Spring 2012/ch1 BC</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58969"/>
		<updated>2012-02-20T18:58:45Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;==Processor Improvement==&lt;br /&gt;
&lt;br /&gt;
From page 2 of Solihin, the change in transistor counts from 1971-2006 is mentioned to have increased from 2,300 to 167 million, a 72,608x increase. From 2006-2012 the increase in the number of transistors on a chip has grown from 167 million to 2.6 billion, a 15x increase.&amp;lt;ref name=&amp;quot;trans count&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
Also from page 2, it is mentioned that the clock frequency of processors grew from 750 KHz to 2.4 GHz between 1971-2006, a 3,200x increase. From 2006-2012 the clock frequency has increased from 2.4ghz to 5.2, a 2.2x increase.&amp;lt;ref name=&amp;quot;proc chrono&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
The following facts have been updated from the original list on page 4 of Solihin:&lt;br /&gt;
&lt;br /&gt;
*IBM now has the 16-core processor Power PC A2&lt;br /&gt;
*Intel has the 10 core Xeon E7&lt;br /&gt;
*AMD has the 16 Opteron Interlagos&lt;br /&gt;
*Sun has the 8-core Niagara.&amp;lt;ref name=&amp;quot;xeon&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;powerpc&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;amd&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;sun&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==Tables==&lt;br /&gt;
&lt;br /&gt;
The following table shows the improvement in Intel processors since 1971. The bold items are Intel processors that have come out since the book was written. Original table can be found on page 4 of the book.&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Evolution of Intel Processors&amp;lt;ref name=&amp;quot;intel procs&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
!From&lt;br /&gt;
!Procs&lt;br /&gt;
!Specifications&lt;br /&gt;
!New Features&lt;br /&gt;
|-&lt;br /&gt;
|1971&lt;br /&gt;
|4004&lt;br /&gt;
|740KHz, 2300 transistors, 10 micrometers, 640B addressable memory, 4KB program memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1978&lt;br /&gt;
|8086&lt;br /&gt;
|16-bit, 5-10MHz, 29000 transistors at 3 micrometers, 1MB addressable memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1982&lt;br /&gt;
|80286&lt;br /&gt;
|8-12.5MHz&lt;br /&gt;
|Virtual memory and protection mode&lt;br /&gt;
|-&lt;br /&gt;
|1985&lt;br /&gt;
|386&lt;br /&gt;
|32-bit, 16-33MHz, 275K transistors, 4GB addressable memory&lt;br /&gt;
|Pipelining&lt;br /&gt;
|-&lt;br /&gt;
|1989&lt;br /&gt;
|486&lt;br /&gt;
|25-100MHz, 1.5M transistors&lt;br /&gt;
|FPU integration&lt;br /&gt;
|-&lt;br /&gt;
|1993&lt;br /&gt;
|Pentium&lt;br /&gt;
|60-200MHz&lt;br /&gt;
|On-chip L1 caches and SMP suport&lt;br /&gt;
|-&lt;br /&gt;
|1995&lt;br /&gt;
|Pentium Pro&lt;br /&gt;
|16KB L1 caches, 5.5M transistors&lt;br /&gt;
|OOO execution&lt;br /&gt;
|-&lt;br /&gt;
|1997&lt;br /&gt;
|Pentium MMX&lt;br /&gt;
|233-450MHz, 32KB L1 cache, 4.5M transistors&lt;br /&gt;
|Dynamic branch prediction, MMX instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|1999&lt;br /&gt;
|Pentium III&lt;br /&gt;
|450-1400MHz, 256KB L2 cache on chip, 28M transistors&lt;br /&gt;
|SSE instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|2000&lt;br /&gt;
|Pentium IV&lt;br /&gt;
|1.4-3GHz, 55M transistors&lt;br /&gt;
|Hyperpipelining and SMT&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|2006&lt;br /&gt;
|Xeon&lt;br /&gt;
|64-bit, 2GHz, 167M transistors, 4MB L2 cache on chip&lt;br /&gt;
|Dual-core and virtualization support&lt;br /&gt;
|-&lt;br /&gt;
|'''2008'''&lt;br /&gt;
|'''Intel Core i7'''&lt;br /&gt;
|'''64-bit, 3.2GHz, 730M transistors, 4 core'''&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|'''2010'''&lt;br /&gt;
|'''Intel Xeon &amp;quot;Nehalem-EX&amp;quot;'''&lt;br /&gt;
|'''64-bit, 2.66GHz, 2300M transistors, 8 core'''&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|'''2011'''&lt;br /&gt;
|'''Intel Xeon E7'''&lt;br /&gt;
|'''64-bit, 2.67GHz, 2600M transistors, 10 core'''&lt;br /&gt;
|'''First Intel chip with 10 processors'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
The following table is a revised version of the one on page 8 of Solihin listing some examples of current high end multicore processors.&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Examples of Current Multicore Processors&amp;lt;ref name=&amp;quot;z196&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;xeon&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
!Name&lt;br /&gt;
!# Cores&lt;br /&gt;
!Clock Freq&lt;br /&gt;
!Clock Type&lt;br /&gt;
!Caches&lt;br /&gt;
!Chip Power&lt;br /&gt;
|-&lt;br /&gt;
|IBM z196&lt;br /&gt;
|4 cores&lt;br /&gt;
|5.3GHz&lt;br /&gt;
|OOO Superscalar&lt;br /&gt;
|128KB L1, 1.5MB L2, 24MB L3, 192MB L4&lt;br /&gt;
|1800W&lt;br /&gt;
|-&lt;br /&gt;
|Intel Xeon E&lt;br /&gt;
|10 cores&lt;br /&gt;
|2.67GHz&lt;br /&gt;
|SIMD&lt;br /&gt;
|64KB L1, 256KB L2, 30MB L3&lt;br /&gt;
|130W&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==Top Trends==&lt;br /&gt;
&lt;br /&gt;
===Super Computers===&lt;br /&gt;
The top rated super computer for November 2011 is Japan's &amp;quot;K Computer&amp;quot;, rated by top500.org.&amp;lt;ref name=&amp;quot;top2011&amp;quot;/&amp;gt; The K Computer operates at 10.51 petaflop/s with 705,024 cores and is the first to break the 10 petaflop/s barrier. Just one year earlier the highest rated supercomputer was China's Tianhe-1A at 2.57 petaflop/s, 1/4 as fast as the K Computer.&lt;br /&gt;
&lt;br /&gt;
This is the second top500 list in a row that the K Computer stayed in first place. In fact, none of the top 10 changed, which is mentioned here by TOP500 editor Erich Strohmaier, &amp;quot;This is the first time since we began publishing the list back in 1993 that the top 10 systems showed no turnover&amp;quot;.&amp;lt;ref name=&amp;quot;top2011&amp;quot;/&amp;gt; This shows that the improvement in potential processing power has started to slow down. This observation combined with the reduction in transistor count and clock frequency growth described earlier would suggest that we are approaching the &amp;quot;wall&amp;quot; of processing capabilities.&lt;br /&gt;
&lt;br /&gt;
===Cluster Computing===&lt;br /&gt;
Since June 2006, the trending supercomputer architecture has gone towards cluster computing. Cluster architecture usage has gone from 72% in 2006 to 82% in 2011, while Constellation has gone from 7.6% to 0%.&amp;lt;ref name=&amp;quot;topstats&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;trans count&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Transistor_count &amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;proc chrono&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Microprocessor_chronologyref &amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel procs&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/List_of_Intel_microprocessors &amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;z196&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/IBM_z196_(microprocessor) &amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;xeon&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Nehalem_(microarchitecture)#Westmere &amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;powerpc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/PowerPC_A2 &amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd&amp;quot;&amp;gt;http://www.tomshardware.com/news/interlagos-bulldozer-opteron-16-core-valencia,13984.html &amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;sun&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/UltraSPARC_T1 &amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;top2011&amp;quot;&amp;gt;http://top500.org/lists/2011/11&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;topstats&amp;quot;&amp;gt;http://i.top500.org/stats&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58534</id>
		<title>CSC 456 Spring 2012/ch1 BC</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58534"/>
		<updated>2012-02-13T01:55:59Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;From 2006-2012 the increase in the number of transistors on a chip has grown from 167 million to 2.6 billion, a 15x increase.&amp;lt;ref name=&amp;quot;trans count&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
From 2006-2012 the clock frequency has increased from 2.4ghz to 5.2, a 2.2x increase.&amp;lt;ref name=&amp;quot;proc chrono&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
IBM now has the 16-core processor Power PC A2, Intel has the 10 core Xeon E7, AMD has the 16 Opteron Interlagos, and Sun has the 8-core Niagara.&amp;lt;ref name=&amp;quot;xeon&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;powerpc&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;amd&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;sun&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Evolution of Intel Processors&amp;lt;ref name=&amp;quot;intel procs&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
!From&lt;br /&gt;
!Procs&lt;br /&gt;
!Specifications&lt;br /&gt;
!New Features&lt;br /&gt;
|-&lt;br /&gt;
|1971&lt;br /&gt;
|4004&lt;br /&gt;
|740KHz, 2300 transistors, 10 micrometers, 640B addressable memory, 4KB program memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1978&lt;br /&gt;
|8086&lt;br /&gt;
|16-bit, 5-10MHz, 29000 transistors at 3 micrometers, 1MB addressable memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1982&lt;br /&gt;
|80286&lt;br /&gt;
|8-12.5MHz&lt;br /&gt;
|Virtual memory and protection mode&lt;br /&gt;
|-&lt;br /&gt;
|1985&lt;br /&gt;
|386&lt;br /&gt;
|32-bit, 16-33MHz, 275K transistors, 4GB addressable memory&lt;br /&gt;
|Pipelining&lt;br /&gt;
|-&lt;br /&gt;
|1989&lt;br /&gt;
|486&lt;br /&gt;
|25-100MHz, 1.5M transistors&lt;br /&gt;
|FPU integration&lt;br /&gt;
|-&lt;br /&gt;
|1993&lt;br /&gt;
|Pentium&lt;br /&gt;
|60-200MHz&lt;br /&gt;
|On-chip L1 caches and SMP suport&lt;br /&gt;
|-&lt;br /&gt;
|1995&lt;br /&gt;
|Pentium Pro&lt;br /&gt;
|16KB L1 caches, 5.5M transistors&lt;br /&gt;
|OOO execution&lt;br /&gt;
|-&lt;br /&gt;
|1997&lt;br /&gt;
|Pentium MMX&lt;br /&gt;
|233-450MHz, 32KB L1 cache, 4.5M transistors&lt;br /&gt;
|Dynamic branch prediction, MMX instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|1999&lt;br /&gt;
|Pentium III&lt;br /&gt;
|450-1400MHz, 256KB L2 cache on chip, 28M transistors&lt;br /&gt;
|SSE instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|2000&lt;br /&gt;
|Pentium IV&lt;br /&gt;
|1.4-3GHz, 55M transistors&lt;br /&gt;
|Hyperpipelining and SMT&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|2006&lt;br /&gt;
|Xeon&lt;br /&gt;
|64-bit, 2GHz, 167M transistors, 4MB L2 cache on chip&lt;br /&gt;
|Dual-core and virtualization support&lt;br /&gt;
|-&lt;br /&gt;
|'''2008'''&lt;br /&gt;
|'''Intel Core i7'''&lt;br /&gt;
|'''64-bit, 3.2GHz, 730M transistors, 4 core'''&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|'''2010'''&lt;br /&gt;
|'''Intel Xeon &amp;quot;Nehalem-EX&amp;quot;'''&lt;br /&gt;
|'''64-bit, 2.66GHz, 2300M transistors, 8 core'''&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|'''2011'''&lt;br /&gt;
|'''Intel Xeon E7'''&lt;br /&gt;
|'''64-bit, 2.67GHz, 2600M transistors, 10 core'''&lt;br /&gt;
|'''First Intel chip with 10 processors'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Examples of Current Multicore Processors&amp;lt;ref name=&amp;quot;z196&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;xeon&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
!Name&lt;br /&gt;
!# Cores&lt;br /&gt;
!Clock Freq&lt;br /&gt;
!Clock Type&lt;br /&gt;
!Caches&lt;br /&gt;
!Chip Power&lt;br /&gt;
|-&lt;br /&gt;
|IBM z196&lt;br /&gt;
|4 cores&lt;br /&gt;
|5.3GHz&lt;br /&gt;
|OOO Superscalar&lt;br /&gt;
|128KB L1, 1.5MB L2, 24MB L3, 192MB L4&lt;br /&gt;
|1800W&lt;br /&gt;
|-&lt;br /&gt;
|Intel Xeon E&lt;br /&gt;
|10 cores&lt;br /&gt;
|2.67GHz&lt;br /&gt;
|SIMD&lt;br /&gt;
|64KB L1, 256KB L2, 30MB L3&lt;br /&gt;
|130W&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
==References==&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;trans count&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Transistor_count &amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;proc chrono&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Microprocessor_chronologyref &amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel procs&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/List_of_Intel_microprocessors &amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;z196&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/IBM_z196_(microprocessor) &amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;xeon&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Nehalem_(microarchitecture)#Westmere &amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;powerpc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/PowerPC_A2 &amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd&amp;quot;&amp;gt;http://www.tomshardware.com/news/interlagos-bulldozer-opteron-16-core-valencia,13984.html &amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;sun&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/UltraSPARC_T1 &amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58531</id>
		<title>CSC 456 Spring 2012/ch1 BC</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58531"/>
		<updated>2012-02-13T01:48:25Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;From 2006-2012 the increase in the number of transistors on a chip has grown from 167 million to 2.6 billion, a 15x increase.&amp;lt;ref name=&amp;quot;trans count&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
From 2006-2012 the clock frequency has increased from 2.4ghz to 5.2, a 2.2x increase.&amp;lt;ref name=&amp;quot;proc chrono&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
IBM now has the 16-core processor Power PC A2, Intel has the 10 core Xeon E7, AMD has the 16 Opteron Interlagos, and Sun has the 8-core Niagara.&amp;lt;ref name=&amp;quot;xeon&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;powerpc&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;amd&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;sun&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Evolution of Intel Processors&amp;lt;ref name=&amp;quot;intel procs&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
!From&lt;br /&gt;
!Procs&lt;br /&gt;
!Specifications&lt;br /&gt;
!New Features&lt;br /&gt;
|-&lt;br /&gt;
|1971&lt;br /&gt;
|4004&lt;br /&gt;
|740KHz, 2300 transistors, 10 micrometers, 640B addressable memory, 4KB program memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1978&lt;br /&gt;
|8086&lt;br /&gt;
|16-bit, 5-10MHz, 29000 transistors at 3 micrometers, 1MB addressable memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1982&lt;br /&gt;
|80286&lt;br /&gt;
|8-12.5MHz&lt;br /&gt;
|Virtual memory and protection mode&lt;br /&gt;
|-&lt;br /&gt;
|1985&lt;br /&gt;
|386&lt;br /&gt;
|32-bit, 16-33MHz, 275K transistors, 4GB addressable memory&lt;br /&gt;
|Pipelining&lt;br /&gt;
|-&lt;br /&gt;
|1989&lt;br /&gt;
|486&lt;br /&gt;
|25-100MHz, 1.5M transistors&lt;br /&gt;
|FPU integration&lt;br /&gt;
|-&lt;br /&gt;
|1993&lt;br /&gt;
|Pentium&lt;br /&gt;
|60-200MHz&lt;br /&gt;
|On-chip L1 caches and SMP suport&lt;br /&gt;
|-&lt;br /&gt;
|1995&lt;br /&gt;
|Pentium Pro&lt;br /&gt;
|16KB L1 caches, 5.5M transistors&lt;br /&gt;
|OOO execution&lt;br /&gt;
|-&lt;br /&gt;
|1997&lt;br /&gt;
|Pentium MMX&lt;br /&gt;
|233-450MHz, 32KB L1 cache, 4.5M transistors&lt;br /&gt;
|Dynamic branch prediction, MMX instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|1999&lt;br /&gt;
|Pentium III&lt;br /&gt;
|450-1400MHz, 256KB L2 cache on chip, 28M transistors&lt;br /&gt;
|SSE instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|2000&lt;br /&gt;
|Pentium IV&lt;br /&gt;
|1.4-3GHz, 55M transistors&lt;br /&gt;
|Hyperpipelining and SMT&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|2006&lt;br /&gt;
|Xeon&lt;br /&gt;
|64-bit, 2GHz, 167M transistors, 4MB L2 cache on chip&lt;br /&gt;
|Dual-core and virtualization support&lt;br /&gt;
|-&lt;br /&gt;
|'''2008'''&lt;br /&gt;
|'''Intel Core i7'''&lt;br /&gt;
|'''64-bit, 3.2GHz, 730M transistors, 4 core'''&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|'''2010'''&lt;br /&gt;
|'''Intel Xeon &amp;quot;Nehalem-EX&amp;quot;'''&lt;br /&gt;
|'''64-bit, 2.66GHz, 2300M transistors, 8 core'''&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|'''2011'''&lt;br /&gt;
|'''Intel Xeon E7'''&lt;br /&gt;
|'''64-bit, 2.67GHz, 2600M transistors, 10 core'''&lt;br /&gt;
|'''First Intel chip with 10 processors'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Examples of Current Multicore Processors&amp;lt;ref name=&amp;quot;z196&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;xeon&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
!Name&lt;br /&gt;
!# Cores&lt;br /&gt;
!Clock Freq&lt;br /&gt;
!Clock Type&lt;br /&gt;
!Caches&lt;br /&gt;
!Chip Power&lt;br /&gt;
|-&lt;br /&gt;
|IBM z196&lt;br /&gt;
|4 cores&lt;br /&gt;
|5.3GHz&lt;br /&gt;
|OOO Superscalar&lt;br /&gt;
|128KB L1, 1.5MB L2, 24MB L3, 192MB L4&lt;br /&gt;
|1800W&lt;br /&gt;
|-&lt;br /&gt;
|Intel Xeon E&lt;br /&gt;
|10 cores&lt;br /&gt;
|2.67GHz&lt;br /&gt;
|SIMD&lt;br /&gt;
|64KB L1, 256KB L2, 30MB L3&lt;br /&gt;
|130W&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;trans count&amp;quot;&amp;gt; http://en.wikipedia.org/wiki/Transistor_count&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;proc chrono&amp;quot; http://en.wikipedia.org/wiki/Microprocessor_chronologyref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel procs&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/List_of_Intel_microprocessors&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;z196&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/IBM_z196_(microprocessor)&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;xeon&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Nehalem_(microarchitecture)#Westmere&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;powerpc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/PowerPC_A2&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd&amp;quot;&amp;gt;http://www.tomshardware.com/news/interlagos-bulldozer-opteron-16-core-valencia,13984.html&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;sun&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/UltraSPARC_T1&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58530</id>
		<title>CSC 456 Spring 2012/ch1 BC</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58530"/>
		<updated>2012-02-13T01:45:29Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;From 2006-2012 the increase in the number of transistors on a chip has grown from 167 million to 2.6 billion, a 15x increase.&amp;lt;ref name=&amp;quot;trans count&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
From 2006-2012 the clock frequency has increased from 2.4ghz to 5.2, a 2.2x increase.&amp;lt;ref name=&amp;quot;proc chrono&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
IBM now has the 16-core processor Power PC A2, Intel has the 10 core Xeon E7, AMD has the 16 Opteron Interlagos, and Sun has the 8-core Niagara.&amp;lt;ref name=&amp;quot;xeon&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;powerpc&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;amd&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;sun&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Evolution of Intel Processors&amp;lt;ref name=&amp;quot;intel procs&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
!From&lt;br /&gt;
!Procs&lt;br /&gt;
!Specifications&lt;br /&gt;
!New Features&lt;br /&gt;
|-&lt;br /&gt;
|1971&lt;br /&gt;
|4004&lt;br /&gt;
|740KHz, 2300 transistors, 10 micrometers, 640B addressable memory, 4KB program memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1978&lt;br /&gt;
|8086&lt;br /&gt;
|16-bit, 5-10MHz, 29000 transistors at 3 micrometers, 1MB addressable memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1982&lt;br /&gt;
|80286&lt;br /&gt;
|8-12.5MHz&lt;br /&gt;
|Virtual memory and protection mode&lt;br /&gt;
|-&lt;br /&gt;
|1985&lt;br /&gt;
|386&lt;br /&gt;
|32-bit, 16-33MHz, 275K transistors, 4GB addressable memory&lt;br /&gt;
|Pipelining&lt;br /&gt;
|-&lt;br /&gt;
|1989&lt;br /&gt;
|486&lt;br /&gt;
|25-100MHz, 1.5M transistors&lt;br /&gt;
|FPU integration&lt;br /&gt;
|-&lt;br /&gt;
|1993&lt;br /&gt;
|Pentium&lt;br /&gt;
|60-200MHz&lt;br /&gt;
|On-chip L1 caches and SMP suport&lt;br /&gt;
|-&lt;br /&gt;
|1995&lt;br /&gt;
|Pentium Pro&lt;br /&gt;
|16KB L1 caches, 5.5M transistors&lt;br /&gt;
|OOO execution&lt;br /&gt;
|-&lt;br /&gt;
|1997&lt;br /&gt;
|Pentium MMX&lt;br /&gt;
|233-450MHz, 32KB L1 cache, 4.5M transistors&lt;br /&gt;
|Dynamic branch prediction, MMX instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|1999&lt;br /&gt;
|Pentium III&lt;br /&gt;
|450-1400MHz, 256KB L2 cache on chip, 28M transistors&lt;br /&gt;
|SSE instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|2000&lt;br /&gt;
|Pentium IV&lt;br /&gt;
|1.4-3GHz, 55M transistors&lt;br /&gt;
|Hyperpipelining and SMT&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|2006&lt;br /&gt;
|Xeon&lt;br /&gt;
|64-bit, 2GHz, 167M transistors, 4MB L2 cache on chip&lt;br /&gt;
|Dual-core and virtualization support&lt;br /&gt;
|-&lt;br /&gt;
|'''2008'''&lt;br /&gt;
|'''Intel Core i7'''&lt;br /&gt;
|'''64-bit, 3.2GHz, 730M transistors, 4 core'''&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|'''2010'''&lt;br /&gt;
|'''Intel Xeon &amp;quot;Nehalem-EX&amp;quot;'''&lt;br /&gt;
|'''64-bit, 2.66GHz, 2300M transistors, 8 core'''&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|'''2011'''&lt;br /&gt;
|'''Intel Xeon E7'''&lt;br /&gt;
|'''64-bit, 2.67GHz, 2600M transistors, 10 core'''&lt;br /&gt;
|'''First Intel chip with 10 processors'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Examples of Current Multicore Processors&amp;lt;ref name=&amp;quot;z196&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;xeon&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
!Name&lt;br /&gt;
!# Cores&lt;br /&gt;
!Clock Freq&lt;br /&gt;
!Clock Type&lt;br /&gt;
!Caches&lt;br /&gt;
!Chip Power&lt;br /&gt;
|-&lt;br /&gt;
|IBM z196&lt;br /&gt;
|4 cores&lt;br /&gt;
|5.3GHz&lt;br /&gt;
|OOO Superscalar&lt;br /&gt;
|128KB L1, 1.5MB L2, 24MB L3, 192MB L4&lt;br /&gt;
|1800W&lt;br /&gt;
|-&lt;br /&gt;
|Intel Xeon E&lt;br /&gt;
|10 cores&lt;br /&gt;
|2.67GHz&lt;br /&gt;
|SIMD&lt;br /&gt;
|64KB L1, 256KB L2, 30MB L3&lt;br /&gt;
|130W&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;trans count&amp;quot;&amp;gt; http://en.wikipedia.org/wiki/Transistor_count&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;proc chrono&amp;quot; http://en.wikipedia.org/wiki/Microprocessor_chronologyref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel procs&amp;quot;&amp;gt;&amp;lt;http://en.wikipedia.org/wiki/List_of_Intel_microprocessors&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;z196&amp;quot;&amp;gt;&amp;lt;http://en.wikipedia.org/wiki/IBM_z196_(microprocessor)&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;xeon&amp;quot;&amp;gt;&amp;lt;http://en.wikipedia.org/wiki/Nehalem_(microarchitecture)#Westmere&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;powerpc&amp;quot;&amp;gt;&amp;lt;http://en.wikipedia.org/wiki/PowerPC_A2&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd&amp;quot;&amp;gt;&amp;lt;http://www.tomshardware.com/news/interlagos-bulldozer-opteron-16-core-valencia,13984.html&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;sun&amp;quot;&amp;gt;&amp;lt;http://en.wikipedia.org/wiki/UltraSPARC_T1&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58529</id>
		<title>CSC 456 Spring 2012/ch1 BC</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58529"/>
		<updated>2012-02-13T01:43:42Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;From 2006-2012 the increase in the number of transistors on a chip has grown from 167 million to 2.6 billion, a 15x increase.&amp;lt;ref name=&amp;quot;trans count&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
From 2006-2012 the clock frequency has increased from 2.4ghz to 5.2, a 2.2x increase.&amp;lt;ref name=&amp;quot;proc chrono&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
IBM now has the 16-core processor Power PC A2, Intel has the 10 core Xeon E7, AMD has the 16 Opteron Interlagos, and Sun has the 8-core Niagara.&amp;lt;ref name=&amp;quot;xeon&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;powerpc&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;amd&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;sun&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Evolution of Intel Processors&amp;lt;ref name=&amp;quot;intel procs&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
!From&lt;br /&gt;
!Procs&lt;br /&gt;
!Specifications&lt;br /&gt;
!New Features&lt;br /&gt;
|-&lt;br /&gt;
|1971&lt;br /&gt;
|4004&lt;br /&gt;
|740KHz, 2300 transistors, 10 micrometers, 640B addressable memory, 4KB program memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1978&lt;br /&gt;
|8086&lt;br /&gt;
|16-bit, 5-10MHz, 29000 transistors at 3 micrometers, 1MB addressable memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1982&lt;br /&gt;
|80286&lt;br /&gt;
|8-12.5MHz&lt;br /&gt;
|Virtual memory and protection mode&lt;br /&gt;
|-&lt;br /&gt;
|1985&lt;br /&gt;
|386&lt;br /&gt;
|32-bit, 16-33MHz, 275K transistors, 4GB addressable memory&lt;br /&gt;
|Pipelining&lt;br /&gt;
|-&lt;br /&gt;
|1989&lt;br /&gt;
|486&lt;br /&gt;
|25-100MHz, 1.5M transistors&lt;br /&gt;
|FPU integration&lt;br /&gt;
|-&lt;br /&gt;
|1993&lt;br /&gt;
|Pentium&lt;br /&gt;
|60-200MHz&lt;br /&gt;
|On-chip L1 caches and SMP suport&lt;br /&gt;
|-&lt;br /&gt;
|1995&lt;br /&gt;
|Pentium Pro&lt;br /&gt;
|16KB L1 caches, 5.5M transistors&lt;br /&gt;
|OOO execution&lt;br /&gt;
|-&lt;br /&gt;
|1997&lt;br /&gt;
|Pentium MMX&lt;br /&gt;
|233-450MHz, 32KB L1 cache, 4.5M transistors&lt;br /&gt;
|Dynamic branch prediction, MMX instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|1999&lt;br /&gt;
|Pentium III&lt;br /&gt;
|450-1400MHz, 256KB L2 cache on chip, 28M transistors&lt;br /&gt;
|SSE instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|2000&lt;br /&gt;
|Pentium IV&lt;br /&gt;
|1.4-3GHz, 55M transistors&lt;br /&gt;
|Hyperpipelining and SMT&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|2006&lt;br /&gt;
|Xeon&lt;br /&gt;
|64-bit, 2GHz, 167M transistors, 4MB L2 cache on chip&lt;br /&gt;
|Dual-core and virtualization support&lt;br /&gt;
|-&lt;br /&gt;
|'''2008'''&lt;br /&gt;
|'''Intel Core i7'''&lt;br /&gt;
|'''64-bit, 3.2GHz, 730M transistors, 4 core'''&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|'''2010'''&lt;br /&gt;
|'''Intel Xeon &amp;quot;Nehalem-EX&amp;quot;'''&lt;br /&gt;
|'''64-bit, 2.66GHz, 2300M transistors, 8 core'''&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|'''2011'''&lt;br /&gt;
|'''Intel Xeon E7'''&lt;br /&gt;
|'''64-bit, 2.67GHz, 2600M transistors, 10 core'''&lt;br /&gt;
|'''First Intel chip with 10 processors'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Examples of Current Multicore Processors&amp;lt;ref name=&amp;quot;z196&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;xeon&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
!Name&lt;br /&gt;
!# Cores&lt;br /&gt;
!Clock Freq&lt;br /&gt;
!Clock Type&lt;br /&gt;
!Caches&lt;br /&gt;
!Chip Power&lt;br /&gt;
|-&lt;br /&gt;
|IBM z196&lt;br /&gt;
|4 cores&lt;br /&gt;
|5.3GHz&lt;br /&gt;
|OOO Superscalar&lt;br /&gt;
|128KB L1, 1.5MB L2, 24MB L3, 192MB L4&lt;br /&gt;
|1800W&lt;br /&gt;
|-&lt;br /&gt;
|Intel Xeon E&lt;br /&gt;
|10 cores&lt;br /&gt;
|2.67GHz&lt;br /&gt;
|SIMD&lt;br /&gt;
|64KB L1, 256KB L2, 30MB L3&lt;br /&gt;
|130W&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;trans count&amp;quot;&amp;gt;&amp;lt;http://en.wikipedia.org/wiki/Transistor_count&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;proc chrono&amp;quot;&amp;gt;&amp;lt;http://en.wikipedia.org/wiki/Microprocessor_chronologyref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel procs&amp;quot;&amp;gt;&amp;lt;http://en.wikipedia.org/wiki/List_of_Intel_microprocessors&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;z196&amp;quot;&amp;gt;&amp;lt;http://en.wikipedia.org/wiki/IBM_z196_(microprocessor)&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;xeon&amp;quot;&amp;gt;&amp;lt;http://en.wikipedia.org/wiki/Nehalem_(microarchitecture)#Westmere&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;powerpc&amp;quot;&amp;gt;&amp;lt;http://en.wikipedia.org/wiki/PowerPC_A2&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd&amp;quot;&amp;gt;&amp;lt;http://www.tomshardware.com/news/interlagos-bulldozer-opteron-16-core-valencia,13984.html&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;sun&amp;quot;&amp;gt;&amp;lt;http://en.wikipedia.org/wiki/UltraSPARC_T1&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58526</id>
		<title>CSC 456 Spring 2012/ch1 BC</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58526"/>
		<updated>2012-02-13T01:38:01Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;From 2006-2012 the increase in the number of transistors on a chip has grown from 167 million to 2.6 billion, a 15x increase.&amp;lt;ref name=&amp;quot;trans count&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
From 2006-2012 the clock frequency has increased from 2.4ghz to 5.2, a 2.2x increase.&amp;lt;ref name=&amp;quot;proc chrono&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
IBM now has the 16-core processor Power PC A2, Intel has the 10 core Xeon E7, AMD has the 16 Opteron Interlagos, and Sun has the 8-core Niagara.&amp;lt;ref name=&amp;quot;xeon&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;powerpc&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;amd&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;sun&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Evolution of Intel Processors&amp;lt;ref name=&amp;quot;intel procs&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
!From&lt;br /&gt;
!Procs&lt;br /&gt;
!Specifications&lt;br /&gt;
!New Features&lt;br /&gt;
|-&lt;br /&gt;
|1971&lt;br /&gt;
|4004&lt;br /&gt;
|740KHz, 2300 transistors, 10 micrometers, 640B addressable memory, 4KB program memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1978&lt;br /&gt;
|8086&lt;br /&gt;
|16-bit, 5-10MHz, 29000 transistors at 3 micrometers, 1MB addressable memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1982&lt;br /&gt;
|80286&lt;br /&gt;
|8-12.5MHz&lt;br /&gt;
|Virtual memory and protection mode&lt;br /&gt;
|-&lt;br /&gt;
|1985&lt;br /&gt;
|386&lt;br /&gt;
|32-bit, 16-33MHz, 275K transistors, 4GB addressable memory&lt;br /&gt;
|Pipelining&lt;br /&gt;
|-&lt;br /&gt;
|1989&lt;br /&gt;
|486&lt;br /&gt;
|25-100MHz, 1.5M transistors&lt;br /&gt;
|FPU integration&lt;br /&gt;
|-&lt;br /&gt;
|1993&lt;br /&gt;
|Pentium&lt;br /&gt;
|60-200MHz&lt;br /&gt;
|On-chip L1 caches and SMP suport&lt;br /&gt;
|-&lt;br /&gt;
|1995&lt;br /&gt;
|Pentium Pro&lt;br /&gt;
|16KB L1 caches, 5.5M transistors&lt;br /&gt;
|OOO execution&lt;br /&gt;
|-&lt;br /&gt;
|1997&lt;br /&gt;
|Pentium MMX&lt;br /&gt;
|233-450MHz, 32KB L1 cache, 4.5M transistors&lt;br /&gt;
|Dynamic branch prediction, MMX instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|1999&lt;br /&gt;
|Pentium III&lt;br /&gt;
|450-1400MHz, 256KB L2 cache on chip, 28M transistors&lt;br /&gt;
|SSE instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|2000&lt;br /&gt;
|Pentium IV&lt;br /&gt;
|1.4-3GHz, 55M transistors&lt;br /&gt;
|Hyperpipelining and SMT&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|2006&lt;br /&gt;
|Xeon&lt;br /&gt;
|64-bit, 2GHz, 167M transistors, 4MB L2 cache on chip&lt;br /&gt;
|Dual-core and virtualization support&lt;br /&gt;
|-&lt;br /&gt;
|'''2008'''&lt;br /&gt;
|'''Intel Core i7'''&lt;br /&gt;
|'''64-bit, 3.2GHz, 730M transistors, 4 core'''&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|'''2010'''&lt;br /&gt;
|'''Intel Xeon &amp;quot;Nehalem-EX&amp;quot;'''&lt;br /&gt;
|'''64-bit, 2.66GHz, 2300M transistors, 8 core'''&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|'''2011'''&lt;br /&gt;
|'''Intel Xeon E7'''&lt;br /&gt;
|'''64-bit, 2.67GHz, 2600M transistors, 10 core'''&lt;br /&gt;
|'''First Intel chip with 10 processors'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Examples of Current Multicore Processors&amp;lt;ref name=&amp;quot;z196&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;xeon&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
!Name&lt;br /&gt;
!# Cores&lt;br /&gt;
!Clock Freq&lt;br /&gt;
!Clock Type&lt;br /&gt;
!Caches&lt;br /&gt;
!Chip Power&lt;br /&gt;
|-&lt;br /&gt;
|IBM z196&lt;br /&gt;
|4 cores&lt;br /&gt;
|5.3GHz&lt;br /&gt;
|OOO Superscalar&lt;br /&gt;
|128KB L1, 1.5MB L2, 24MB L3, 192MB L4&lt;br /&gt;
|1800W&lt;br /&gt;
|-&lt;br /&gt;
|Intel Xeon E&lt;br /&gt;
|10 cores&lt;br /&gt;
|2.67GHz&lt;br /&gt;
|SIMD&lt;br /&gt;
|64KB L1, 256KB L2, 30MB L3&lt;br /&gt;
|130W&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;trans count&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Transistor_count&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;proc chrono&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Microprocessor_chronologyref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel procs&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/List_of_Intel_microprocessors&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;z196&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/IBM_z196_(microprocessor)&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;xeon&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Nehalem_(microarchitecture)#Westmere&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;powerpc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/PowerPC_A2&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd&amp;quot;&amp;gt;http://www.tomshardware.com/news/interlagos-bulldozer-opteron-16-core-valencia,13984.html&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;sun&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/UltraSPARC_T1&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58525</id>
		<title>CSC 456 Spring 2012/ch1 BC</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58525"/>
		<updated>2012-02-13T01:37:39Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;From 2006-2012 the increase in the number of transistors on a chip has grown from 167 million to 2.6 billion, a 15x increase.&amp;lt;ref name=&amp;quot;trans count&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
From 2006-2012 the clock frequency has increased from 2.4ghz to 5.2, a 2.2x increase.&amp;lt;ref name=&amp;quot;proc chrono&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
IBM now has the 16-core processor Power PC A2, Intel has the 10 core Xeon E7, AMD has the 16 Opteron Interlagos, and Sun has the 8-core Niagara.&amp;lt;ref name=&amp;quot;xeon&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;powerpc&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;amd&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;sun&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Evolution of Intel Processors&lt;br /&gt;
|-&lt;br /&gt;
!From&lt;br /&gt;
!Procs&lt;br /&gt;
!Specifications&lt;br /&gt;
!New Features&lt;br /&gt;
|-&lt;br /&gt;
|1971&lt;br /&gt;
|4004&lt;br /&gt;
|740KHz, 2300 transistors, 10 micrometers, 640B addressable memory, 4KB program memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1978&lt;br /&gt;
|8086&lt;br /&gt;
|16-bit, 5-10MHz, 29000 transistors at 3 micrometers, 1MB addressable memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1982&lt;br /&gt;
|80286&lt;br /&gt;
|8-12.5MHz&lt;br /&gt;
|Virtual memory and protection mode&lt;br /&gt;
|-&lt;br /&gt;
|1985&lt;br /&gt;
|386&lt;br /&gt;
|32-bit, 16-33MHz, 275K transistors, 4GB addressable memory&lt;br /&gt;
|Pipelining&lt;br /&gt;
|-&lt;br /&gt;
|1989&lt;br /&gt;
|486&lt;br /&gt;
|25-100MHz, 1.5M transistors&lt;br /&gt;
|FPU integration&lt;br /&gt;
|-&lt;br /&gt;
|1993&lt;br /&gt;
|Pentium&lt;br /&gt;
|60-200MHz&lt;br /&gt;
|On-chip L1 caches and SMP suport&lt;br /&gt;
|-&lt;br /&gt;
|1995&lt;br /&gt;
|Pentium Pro&lt;br /&gt;
|16KB L1 caches, 5.5M transistors&lt;br /&gt;
|OOO execution&lt;br /&gt;
|-&lt;br /&gt;
|1997&lt;br /&gt;
|Pentium MMX&lt;br /&gt;
|233-450MHz, 32KB L1 cache, 4.5M transistors&lt;br /&gt;
|Dynamic branch prediction, MMX instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|1999&lt;br /&gt;
|Pentium III&lt;br /&gt;
|450-1400MHz, 256KB L2 cache on chip, 28M transistors&lt;br /&gt;
|SSE instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|2000&lt;br /&gt;
|Pentium IV&lt;br /&gt;
|1.4-3GHz, 55M transistors&lt;br /&gt;
|Hyperpipelining and SMT&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|2006&lt;br /&gt;
|Xeon&lt;br /&gt;
|64-bit, 2GHz, 167M transistors, 4MB L2 cache on chip&lt;br /&gt;
|Dual-core and virtualization support&lt;br /&gt;
|-&lt;br /&gt;
|'''2008'''&lt;br /&gt;
|'''Intel Core i7'''&lt;br /&gt;
|'''64-bit, 3.2GHz, 730M transistors, 4 core'''&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|'''2010'''&lt;br /&gt;
|'''Intel Xeon &amp;quot;Nehalem-EX&amp;quot;'''&lt;br /&gt;
|'''64-bit, 2.66GHz, 2300M transistors, 8 core'''&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|'''2011'''&lt;br /&gt;
|'''Intel Xeon E7'''&lt;br /&gt;
|'''64-bit, 2.67GHz, 2600M transistors, 10 core'''&lt;br /&gt;
|'''First Intel chip with 10 processors'''&lt;br /&gt;
|}&amp;lt;ref name=&amp;quot;intel procs&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Examples of Current Multicore Processors&amp;lt;ref name=&amp;quot;z196&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;xeon&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
!Name&lt;br /&gt;
!# Cores&lt;br /&gt;
!Clock Freq&lt;br /&gt;
!Clock Type&lt;br /&gt;
!Caches&lt;br /&gt;
!Chip Power&lt;br /&gt;
|-&lt;br /&gt;
|IBM z196&lt;br /&gt;
|4 cores&lt;br /&gt;
|5.3GHz&lt;br /&gt;
|OOO Superscalar&lt;br /&gt;
|128KB L1, 1.5MB L2, 24MB L3, 192MB L4&lt;br /&gt;
|1800W&lt;br /&gt;
|-&lt;br /&gt;
|Intel Xeon E&lt;br /&gt;
|10 cores&lt;br /&gt;
|2.67GHz&lt;br /&gt;
|SIMD&lt;br /&gt;
|64KB L1, 256KB L2, 30MB L3&lt;br /&gt;
|130W&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;trans count&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Transistor_count&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;proc chrono&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Microprocessor_chronologyref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel procs&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/List_of_Intel_microprocessors&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;z196&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/IBM_z196_(microprocessor)&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;xeon&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Nehalem_(microarchitecture)#Westmere&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;powerpc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/PowerPC_A2&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd&amp;quot;&amp;gt;http://www.tomshardware.com/news/interlagos-bulldozer-opteron-16-core-valencia,13984.html&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;sun&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/UltraSPARC_T1&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58524</id>
		<title>CSC 456 Spring 2012/ch1 BC</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58524"/>
		<updated>2012-02-13T01:36:56Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;From 2006-2012 the increase in the number of transistors on a chip has grown from 167 million to 2.6 billion, a 15x increase.&amp;lt;ref name=&amp;quot;trans count&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
From 2006-2012 the clock frequency has increased from 2.4ghz to 5.2, a 2.2x increase.&amp;lt;ref name=&amp;quot;proc chrono&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
IBM now has the 16-core processor Power PC A2, Intel has the 10 core Xeon E7, AMD has the 16 Opteron Interlagos, and Sun has the 8-core Niagara.&amp;lt;ref name=&amp;quot;xeon&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;powerpc&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;amd&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;sun&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Evolution of Intel Processors&lt;br /&gt;
|-&lt;br /&gt;
!From&lt;br /&gt;
!Procs&lt;br /&gt;
!Specifications&lt;br /&gt;
!New Features&lt;br /&gt;
|-&lt;br /&gt;
|1971&lt;br /&gt;
|4004&lt;br /&gt;
|740KHz, 2300 transistors, 10 micrometers, 640B addressable memory, 4KB program memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1978&lt;br /&gt;
|8086&lt;br /&gt;
|16-bit, 5-10MHz, 29000 transistors at 3 micrometers, 1MB addressable memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1982&lt;br /&gt;
|80286&lt;br /&gt;
|8-12.5MHz&lt;br /&gt;
|Virtual memory and protection mode&lt;br /&gt;
|-&lt;br /&gt;
|1985&lt;br /&gt;
|386&lt;br /&gt;
|32-bit, 16-33MHz, 275K transistors, 4GB addressable memory&lt;br /&gt;
|Pipelining&lt;br /&gt;
|-&lt;br /&gt;
|1989&lt;br /&gt;
|486&lt;br /&gt;
|25-100MHz, 1.5M transistors&lt;br /&gt;
|FPU integration&lt;br /&gt;
|-&lt;br /&gt;
|1993&lt;br /&gt;
|Pentium&lt;br /&gt;
|60-200MHz&lt;br /&gt;
|On-chip L1 caches and SMP suport&lt;br /&gt;
|-&lt;br /&gt;
|1995&lt;br /&gt;
|Pentium Pro&lt;br /&gt;
|16KB L1 caches, 5.5M transistors&lt;br /&gt;
|OOO execution&lt;br /&gt;
|-&lt;br /&gt;
|1997&lt;br /&gt;
|Pentium MMX&lt;br /&gt;
|233-450MHz, 32KB L1 cache, 4.5M transistors&lt;br /&gt;
|Dynamic branch prediction, MMX instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|1999&lt;br /&gt;
|Pentium III&lt;br /&gt;
|450-1400MHz, 256KB L2 cache on chip, 28M transistors&lt;br /&gt;
|SSE instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|2000&lt;br /&gt;
|Pentium IV&lt;br /&gt;
|1.4-3GHz, 55M transistors&lt;br /&gt;
|Hyperpipelining and SMT&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|2006&lt;br /&gt;
|Xeon&lt;br /&gt;
|64-bit, 2GHz, 167M transistors, 4MB L2 cache on chip&lt;br /&gt;
|Dual-core and virtualization support&lt;br /&gt;
|-&lt;br /&gt;
|'''2008'''&lt;br /&gt;
|'''Intel Core i7'''&lt;br /&gt;
|'''64-bit, 3.2GHz, 730M transistors, 4 core'''&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|'''2010'''&lt;br /&gt;
|'''Intel Xeon &amp;quot;Nehalem-EX&amp;quot;'''&lt;br /&gt;
|'''64-bit, 2.66GHz, 2300M transistors, 8 core'''&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|'''2011'''&lt;br /&gt;
|'''Intel Xeon E7'''&lt;br /&gt;
|'''64-bit, 2.67GHz, 2600M transistors, 10 core'''&lt;br /&gt;
|'''First Intel chip with 10 processors'''&lt;br /&gt;
|}&amp;lt;ref name=&amp;quot;intel procs&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Examples of Current Multicore Processors&lt;br /&gt;
|-&lt;br /&gt;
!Name&lt;br /&gt;
!# Cores&lt;br /&gt;
!Clock Freq&lt;br /&gt;
!Clock Type&lt;br /&gt;
!Caches&lt;br /&gt;
!Chip Power&lt;br /&gt;
|-&lt;br /&gt;
|IBM z196&lt;br /&gt;
|4 cores&lt;br /&gt;
|5.3GHz&lt;br /&gt;
|OOO Superscalar&lt;br /&gt;
|128KB L1, 1.5MB L2, 24MB L3, 192MB L4&lt;br /&gt;
|1800W&lt;br /&gt;
|-&lt;br /&gt;
|Intel Xeon E&lt;br /&gt;
|10 cores&lt;br /&gt;
|2.67GHz&lt;br /&gt;
|SIMD&lt;br /&gt;
|64KB L1, 256KB L2, 30MB L3&lt;br /&gt;
|130W&lt;br /&gt;
|}&amp;lt;ref name=&amp;quot;z196&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;xeon&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;trans count&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Transistor_count&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;proc chrono&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Microprocessor_chronologyref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel procs&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/List_of_Intel_microprocessors&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;z196&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/IBM_z196_(microprocessor)&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;xeon&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Nehalem_(microarchitecture)#Westmere&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;powerpc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/PowerPC_A2&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd&amp;quot;&amp;gt;http://www.tomshardware.com/news/interlagos-bulldozer-opteron-16-core-valencia,13984.html&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;sun&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/UltraSPARC_T1&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58523</id>
		<title>CSC 456 Spring 2012/ch1 BC</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58523"/>
		<updated>2012-02-13T01:34:07Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;From 2006-2012 the increase in the number of transistors on a chip has grown from 167 million to 2.6 billion, a 15x increase.&amp;lt;ref name=&amp;quot;trans count&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
From 2006-2012 the clock frequency has increased from 2.4ghz to 5.2, a 2.2x increase.&amp;lt;ref name=&amp;quot;proc chrono&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
IBM now has the 16-core processor Power PC A2, Intel has the 10 core Xeon E7, AMD has the 16 Opteron Interlagos, and Sun has the 8-core Niagara.&amp;lt;ref name=&amp;quot;xeon&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;powerpc&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;amd&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;sun&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Evolution of Intel Processors&amp;lt;ref name=&amp;quot;z196&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;xeon&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
!From&lt;br /&gt;
!Procs&lt;br /&gt;
!Specifications&lt;br /&gt;
!New Features&lt;br /&gt;
|-&lt;br /&gt;
|1971&lt;br /&gt;
|4004&lt;br /&gt;
|740KHz, 2300 transistors, 10 micrometers, 640B addressable memory, 4KB program memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1978&lt;br /&gt;
|8086&lt;br /&gt;
|16-bit, 5-10MHz, 29000 transistors at 3 micrometers, 1MB addressable memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1982&lt;br /&gt;
|80286&lt;br /&gt;
|8-12.5MHz&lt;br /&gt;
|Virtual memory and protection mode&lt;br /&gt;
|-&lt;br /&gt;
|1985&lt;br /&gt;
|386&lt;br /&gt;
|32-bit, 16-33MHz, 275K transistors, 4GB addressable memory&lt;br /&gt;
|Pipelining&lt;br /&gt;
|-&lt;br /&gt;
|1989&lt;br /&gt;
|486&lt;br /&gt;
|25-100MHz, 1.5M transistors&lt;br /&gt;
|FPU integration&lt;br /&gt;
|-&lt;br /&gt;
|1993&lt;br /&gt;
|Pentium&lt;br /&gt;
|60-200MHz&lt;br /&gt;
|On-chip L1 caches and SMP suport&lt;br /&gt;
|-&lt;br /&gt;
|1995&lt;br /&gt;
|Pentium Pro&lt;br /&gt;
|16KB L1 caches, 5.5M transistors&lt;br /&gt;
|OOO execution&lt;br /&gt;
|-&lt;br /&gt;
|1997&lt;br /&gt;
|Pentium MMX&lt;br /&gt;
|233-450MHz, 32KB L1 cache, 4.5M transistors&lt;br /&gt;
|Dynamic branch prediction, MMX instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|1999&lt;br /&gt;
|Pentium III&lt;br /&gt;
|450-1400MHz, 256KB L2 cache on chip, 28M transistors&lt;br /&gt;
|SSE instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|2000&lt;br /&gt;
|Pentium IV&lt;br /&gt;
|1.4-3GHz, 55M transistors&lt;br /&gt;
|Hyperpipelining and SMT&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|2006&lt;br /&gt;
|Xeon&lt;br /&gt;
|64-bit, 2GHz, 167M transistors, 4MB L2 cache on chip&lt;br /&gt;
|Dual-core and virtualization support&lt;br /&gt;
|-&lt;br /&gt;
|'''2008'''&lt;br /&gt;
|'''Intel Core i7'''&lt;br /&gt;
|'''64-bit, 3.2GHz, 730M transistors, 4 core'''&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|'''2010'''&lt;br /&gt;
|'''Intel Xeon &amp;quot;Nehalem-EX&amp;quot;'''&lt;br /&gt;
|'''64-bit, 2.66GHz, 2300M transistors, 8 core'''&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|'''2011'''&lt;br /&gt;
|'''Intel Xeon E7'''&lt;br /&gt;
|'''64-bit, 2.67GHz, 2600M transistors, 10 core'''&lt;br /&gt;
|'''First Intel chip with 10 processors'''&lt;br /&gt;
|}.&amp;lt;ref name=&amp;quot;intel procs&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Examples of Current Multicore Processors&lt;br /&gt;
|-&lt;br /&gt;
!Name&lt;br /&gt;
!# Cores&lt;br /&gt;
!Clock Freq&lt;br /&gt;
!Clock Type&lt;br /&gt;
!Caches&lt;br /&gt;
!Chip Power&lt;br /&gt;
|-&lt;br /&gt;
|IBM z196&lt;br /&gt;
|4 cores&lt;br /&gt;
|5.3GHz&lt;br /&gt;
|OOO Superscalar&lt;br /&gt;
|128KB L1, 1.5MB L2, 24MB L3, 192MB L4&lt;br /&gt;
|1800W&lt;br /&gt;
|-&lt;br /&gt;
|Intel Xeon E&lt;br /&gt;
|10 cores&lt;br /&gt;
|2.67GHz&lt;br /&gt;
|SIMD&lt;br /&gt;
|64KB L1, 256KB L2, 30MB L3&lt;br /&gt;
|130W&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;trans count&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Transistor_count&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;proc chrono&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Microprocessor_chronologyref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel procs&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/List_of_Intel_microprocessors&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;z196&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/IBM_z196_(microprocessor)&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;xeon&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Nehalem_(microarchitecture)#Westmere&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;powerpc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/PowerPC_A2&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd&amp;quot;&amp;gt;http://www.tomshardware.com/news/interlagos-bulldozer-opteron-16-core-valencia,13984.html&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;sun&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/UltraSPARC_T1&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58522</id>
		<title>CSC 456 Spring 2012/ch1 BC</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58522"/>
		<updated>2012-02-13T01:33:43Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;From 2006-2012 the increase in the number of transistors on a chip has grown from 167 million to 2.6 billion, a 15x increase.&amp;lt;ref name=&amp;quot;trans count&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
From 2006-2012 the clock frequency has increased from 2.4ghz to 5.2, a 2.2x increase.&amp;lt;ref name=&amp;quot;proc chrono&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
IBM now has the 16-core processor Power PC A2, Intel has the 10 core Xeon E7, AMD has the 16 Opteron Interlagos, and Sun has the 8-core Niagara.&amp;lt;ref name=&amp;quot;xeon&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;powerpc&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;amd&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;sun&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Evolution of Intel Processors&amp;lt;ref name=&amp;quot;z196/&amp;gt;&amp;lt;ref name=&amp;quot;xeon&amp;quot;/&amp;gt;&lt;br /&gt;
|-&lt;br /&gt;
!From&lt;br /&gt;
!Procs&lt;br /&gt;
!Specifications&lt;br /&gt;
!New Features&lt;br /&gt;
|-&lt;br /&gt;
|1971&lt;br /&gt;
|4004&lt;br /&gt;
|740KHz, 2300 transistors, 10 micrometers, 640B addressable memory, 4KB program memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1978&lt;br /&gt;
|8086&lt;br /&gt;
|16-bit, 5-10MHz, 29000 transistors at 3 micrometers, 1MB addressable memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1982&lt;br /&gt;
|80286&lt;br /&gt;
|8-12.5MHz&lt;br /&gt;
|Virtual memory and protection mode&lt;br /&gt;
|-&lt;br /&gt;
|1985&lt;br /&gt;
|386&lt;br /&gt;
|32-bit, 16-33MHz, 275K transistors, 4GB addressable memory&lt;br /&gt;
|Pipelining&lt;br /&gt;
|-&lt;br /&gt;
|1989&lt;br /&gt;
|486&lt;br /&gt;
|25-100MHz, 1.5M transistors&lt;br /&gt;
|FPU integration&lt;br /&gt;
|-&lt;br /&gt;
|1993&lt;br /&gt;
|Pentium&lt;br /&gt;
|60-200MHz&lt;br /&gt;
|On-chip L1 caches and SMP suport&lt;br /&gt;
|-&lt;br /&gt;
|1995&lt;br /&gt;
|Pentium Pro&lt;br /&gt;
|16KB L1 caches, 5.5M transistors&lt;br /&gt;
|OOO execution&lt;br /&gt;
|-&lt;br /&gt;
|1997&lt;br /&gt;
|Pentium MMX&lt;br /&gt;
|233-450MHz, 32KB L1 cache, 4.5M transistors&lt;br /&gt;
|Dynamic branch prediction, MMX instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|1999&lt;br /&gt;
|Pentium III&lt;br /&gt;
|450-1400MHz, 256KB L2 cache on chip, 28M transistors&lt;br /&gt;
|SSE instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|2000&lt;br /&gt;
|Pentium IV&lt;br /&gt;
|1.4-3GHz, 55M transistors&lt;br /&gt;
|Hyperpipelining and SMT&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|2006&lt;br /&gt;
|Xeon&lt;br /&gt;
|64-bit, 2GHz, 167M transistors, 4MB L2 cache on chip&lt;br /&gt;
|Dual-core and virtualization support&lt;br /&gt;
|-&lt;br /&gt;
|'''2008'''&lt;br /&gt;
|'''Intel Core i7'''&lt;br /&gt;
|'''64-bit, 3.2GHz, 730M transistors, 4 core'''&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|'''2010'''&lt;br /&gt;
|'''Intel Xeon &amp;quot;Nehalem-EX&amp;quot;'''&lt;br /&gt;
|'''64-bit, 2.66GHz, 2300M transistors, 8 core'''&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|'''2011'''&lt;br /&gt;
|'''Intel Xeon E7'''&lt;br /&gt;
|'''64-bit, 2.67GHz, 2600M transistors, 10 core'''&lt;br /&gt;
|'''First Intel chip with 10 processors'''&lt;br /&gt;
|}.&amp;lt;ref name=&amp;quot;intel procs&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Examples of Current Multicore Processors&lt;br /&gt;
|-&lt;br /&gt;
!Name&lt;br /&gt;
!# Cores&lt;br /&gt;
!Clock Freq&lt;br /&gt;
!Clock Type&lt;br /&gt;
!Caches&lt;br /&gt;
!Chip Power&lt;br /&gt;
|-&lt;br /&gt;
|IBM z196&lt;br /&gt;
|4 cores&lt;br /&gt;
|5.3GHz&lt;br /&gt;
|OOO Superscalar&lt;br /&gt;
|128KB L1, 1.5MB L2, 24MB L3, 192MB L4&lt;br /&gt;
|1800W&lt;br /&gt;
|-&lt;br /&gt;
|Intel Xeon E&lt;br /&gt;
|10 cores&lt;br /&gt;
|2.67GHz&lt;br /&gt;
|SIMD&lt;br /&gt;
|64KB L1, 256KB L2, 30MB L3&lt;br /&gt;
|130W&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;trans count&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Transistor_count&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;proc chrono&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Microprocessor_chronologyref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel procs&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/List_of_Intel_microprocessors&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;z196&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/IBM_z196_(microprocessor)&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;xeon&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Nehalem_(microarchitecture)#Westmere&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;powerpc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/PowerPC_A2&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd&amp;quot;&amp;gt;http://www.tomshardware.com/news/interlagos-bulldozer-opteron-16-core-valencia,13984.html&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;sun&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/UltraSPARC_T1&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58521</id>
		<title>CSC 456 Spring 2012/ch1 BC</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58521"/>
		<updated>2012-02-13T01:33:01Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;From 2006-2012 the increase in the number of transistors on a chip has grown from 167 million to 2.6 billion, a 15x increase.&amp;lt;ref name=&amp;quot;trans count&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
From 2006-2012 the clock frequency has increased from 2.4ghz to 5.2, a 2.2x increase.&amp;lt;ref name=&amp;quot;proc chrono&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
IBM now has the 16-core processor Power PC A2, Intel has the 10 core Xeon E7, AMD has the 16 Opteron Interlagos, and Sun has the 8-core Niagara.&amp;lt;ref name=&amp;quot;xeon&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;powerpc&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;amd&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;sun&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Evolution of Intel Processors&lt;br /&gt;
|-&lt;br /&gt;
!From&lt;br /&gt;
!Procs&lt;br /&gt;
!Specifications&lt;br /&gt;
!New Features&lt;br /&gt;
|-&lt;br /&gt;
|1971&lt;br /&gt;
|4004&lt;br /&gt;
|740KHz, 2300 transistors, 10 micrometers, 640B addressable memory, 4KB program memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1978&lt;br /&gt;
|8086&lt;br /&gt;
|16-bit, 5-10MHz, 29000 transistors at 3 micrometers, 1MB addressable memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1982&lt;br /&gt;
|80286&lt;br /&gt;
|8-12.5MHz&lt;br /&gt;
|Virtual memory and protection mode&lt;br /&gt;
|-&lt;br /&gt;
|1985&lt;br /&gt;
|386&lt;br /&gt;
|32-bit, 16-33MHz, 275K transistors, 4GB addressable memory&lt;br /&gt;
|Pipelining&lt;br /&gt;
|-&lt;br /&gt;
|1989&lt;br /&gt;
|486&lt;br /&gt;
|25-100MHz, 1.5M transistors&lt;br /&gt;
|FPU integration&lt;br /&gt;
|-&lt;br /&gt;
|1993&lt;br /&gt;
|Pentium&lt;br /&gt;
|60-200MHz&lt;br /&gt;
|On-chip L1 caches and SMP suport&lt;br /&gt;
|-&lt;br /&gt;
|1995&lt;br /&gt;
|Pentium Pro&lt;br /&gt;
|16KB L1 caches, 5.5M transistors&lt;br /&gt;
|OOO execution&lt;br /&gt;
|-&lt;br /&gt;
|1997&lt;br /&gt;
|Pentium MMX&lt;br /&gt;
|233-450MHz, 32KB L1 cache, 4.5M transistors&lt;br /&gt;
|Dynamic branch prediction, MMX instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|1999&lt;br /&gt;
|Pentium III&lt;br /&gt;
|450-1400MHz, 256KB L2 cache on chip, 28M transistors&lt;br /&gt;
|SSE instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|2000&lt;br /&gt;
|Pentium IV&lt;br /&gt;
|1.4-3GHz, 55M transistors&lt;br /&gt;
|Hyperpipelining and SMT&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|2006&lt;br /&gt;
|Xeon&lt;br /&gt;
|64-bit, 2GHz, 167M transistors, 4MB L2 cache on chip&lt;br /&gt;
|Dual-core and virtualization support&lt;br /&gt;
|-&lt;br /&gt;
|'''2008'''&lt;br /&gt;
|'''Intel Core i7'''&lt;br /&gt;
|'''64-bit, 3.2GHz, 730M transistors, 4 core'''&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|'''2010'''&lt;br /&gt;
|'''Intel Xeon &amp;quot;Nehalem-EX&amp;quot;'''&lt;br /&gt;
|'''64-bit, 2.66GHz, 2300M transistors, 8 core'''&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|'''2011'''&lt;br /&gt;
|'''Intel Xeon E7'''&lt;br /&gt;
|'''64-bit, 2.67GHz, 2600M transistors, 10 core'''&lt;br /&gt;
|'''First Intel chip with 10 processors'''&lt;br /&gt;
|}.&amp;lt;ref name=&amp;quot;intel procs&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Examples of Current Multicore Processors&lt;br /&gt;
|-&lt;br /&gt;
!Name&lt;br /&gt;
!# Cores&lt;br /&gt;
!Clock Freq&lt;br /&gt;
!Clock Type&lt;br /&gt;
!Caches&lt;br /&gt;
!Chip Power&lt;br /&gt;
|-&lt;br /&gt;
|IBM z196&lt;br /&gt;
|4 cores&lt;br /&gt;
|5.3GHz&lt;br /&gt;
|OOO Superscalar&lt;br /&gt;
|128KB L1, 1.5MB L2, 24MB L3, 192MB L4&lt;br /&gt;
|1800W&lt;br /&gt;
|-&lt;br /&gt;
|Intel Xeon E&lt;br /&gt;
|10 cores&lt;br /&gt;
|2.67GHz&lt;br /&gt;
|SIMD&lt;br /&gt;
|64KB L1, 256KB L2, 30MB L3&lt;br /&gt;
|130W&lt;br /&gt;
|}&amp;lt;ref name=&amp;quot;z196/&amp;gt;&amp;lt;ref name=&amp;quot;xeon&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;trans count&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Transistor_count&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;proc chrono&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Microprocessor_chronologyref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel procs&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/List_of_Intel_microprocessors&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;z196&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/IBM_z196_(microprocessor)&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;xeon&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Nehalem_(microarchitecture)#Westmere&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;powerpc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/PowerPC_A2&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd&amp;quot;&amp;gt;http://www.tomshardware.com/news/interlagos-bulldozer-opteron-16-core-valencia,13984.html&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;sun&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/UltraSPARC_T1&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58519</id>
		<title>CSC 456 Spring 2012/ch1 BC</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58519"/>
		<updated>2012-02-13T01:32:36Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;From 2006-2012 the increase in the number of transistors on a chip has grown from 167 million to 2.6 billion, a 15x increase.&amp;lt;ref name=&amp;quot;trans count&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
From 2006-2012 the clock frequency has increased from 2.4ghz to 5.2, a 2.2x increase.&amp;lt;ref name=&amp;quot;proc chrono&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
IBM now has the 16-core processor Power PC A2, Intel has the 10 core Xeon E7, AMD has the 16 Opteron Interlagos, and Sun has the 8-core Niagara.&amp;lt;ref name=&amp;quot;xeon&amp;quot;/&amp;gt;&amp;lt;ref name=&amp;quot;powerpc&amp;quot;&amp;gt;&amp;lt;ref name=&amp;quot;amd&amp;quot;&amp;gt;&amp;lt;ref name=&amp;quot;sun&amp;quot;&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Evolution of Intel Processors&lt;br /&gt;
|-&lt;br /&gt;
!From&lt;br /&gt;
!Procs&lt;br /&gt;
!Specifications&lt;br /&gt;
!New Features&lt;br /&gt;
|-&lt;br /&gt;
|1971&lt;br /&gt;
|4004&lt;br /&gt;
|740KHz, 2300 transistors, 10 micrometers, 640B addressable memory, 4KB program memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1978&lt;br /&gt;
|8086&lt;br /&gt;
|16-bit, 5-10MHz, 29000 transistors at 3 micrometers, 1MB addressable memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1982&lt;br /&gt;
|80286&lt;br /&gt;
|8-12.5MHz&lt;br /&gt;
|Virtual memory and protection mode&lt;br /&gt;
|-&lt;br /&gt;
|1985&lt;br /&gt;
|386&lt;br /&gt;
|32-bit, 16-33MHz, 275K transistors, 4GB addressable memory&lt;br /&gt;
|Pipelining&lt;br /&gt;
|-&lt;br /&gt;
|1989&lt;br /&gt;
|486&lt;br /&gt;
|25-100MHz, 1.5M transistors&lt;br /&gt;
|FPU integration&lt;br /&gt;
|-&lt;br /&gt;
|1993&lt;br /&gt;
|Pentium&lt;br /&gt;
|60-200MHz&lt;br /&gt;
|On-chip L1 caches and SMP suport&lt;br /&gt;
|-&lt;br /&gt;
|1995&lt;br /&gt;
|Pentium Pro&lt;br /&gt;
|16KB L1 caches, 5.5M transistors&lt;br /&gt;
|OOO execution&lt;br /&gt;
|-&lt;br /&gt;
|1997&lt;br /&gt;
|Pentium MMX&lt;br /&gt;
|233-450MHz, 32KB L1 cache, 4.5M transistors&lt;br /&gt;
|Dynamic branch prediction, MMX instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|1999&lt;br /&gt;
|Pentium III&lt;br /&gt;
|450-1400MHz, 256KB L2 cache on chip, 28M transistors&lt;br /&gt;
|SSE instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|2000&lt;br /&gt;
|Pentium IV&lt;br /&gt;
|1.4-3GHz, 55M transistors&lt;br /&gt;
|Hyperpipelining and SMT&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|2006&lt;br /&gt;
|Xeon&lt;br /&gt;
|64-bit, 2GHz, 167M transistors, 4MB L2 cache on chip&lt;br /&gt;
|Dual-core and virtualization support&lt;br /&gt;
|-&lt;br /&gt;
|'''2008'''&lt;br /&gt;
|'''Intel Core i7'''&lt;br /&gt;
|'''64-bit, 3.2GHz, 730M transistors, 4 core'''&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|'''2010'''&lt;br /&gt;
|'''Intel Xeon &amp;quot;Nehalem-EX&amp;quot;'''&lt;br /&gt;
|'''64-bit, 2.66GHz, 2300M transistors, 8 core'''&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|'''2011'''&lt;br /&gt;
|'''Intel Xeon E7'''&lt;br /&gt;
|'''64-bit, 2.67GHz, 2600M transistors, 10 core'''&lt;br /&gt;
|'''First Intel chip with 10 processors'''&lt;br /&gt;
|}.&amp;lt;ref name=&amp;quot;intel procs&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Examples of Current Multicore Processors&lt;br /&gt;
|-&lt;br /&gt;
!Name&lt;br /&gt;
!# Cores&lt;br /&gt;
!Clock Freq&lt;br /&gt;
!Clock Type&lt;br /&gt;
!Caches&lt;br /&gt;
!Chip Power&lt;br /&gt;
|-&lt;br /&gt;
|IBM z196&lt;br /&gt;
|4 cores&lt;br /&gt;
|5.3GHz&lt;br /&gt;
|OOO Superscalar&lt;br /&gt;
|128KB L1, 1.5MB L2, 24MB L3, 192MB L4&lt;br /&gt;
|1800W&lt;br /&gt;
|-&lt;br /&gt;
|Intel Xeon E&lt;br /&gt;
|10 cores&lt;br /&gt;
|2.67GHz&lt;br /&gt;
|SIMD&lt;br /&gt;
|64KB L1, 256KB L2, 30MB L3&lt;br /&gt;
|130W&lt;br /&gt;
|}&amp;lt;ref name=&amp;quot;z196/&amp;gt;&amp;lt;ref name=&amp;quot;xeon&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;trans count&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Transistor_count&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;proc chrono&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Microprocessor_chronologyref&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;intel procs&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/List_of_Intel_microprocessors&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;z196&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/IBM_z196_(microprocessor)&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;xeon&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/Nehalem_(microarchitecture)#Westmere&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;powerpc&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/PowerPC_A2&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;amd&amp;quot;&amp;gt;http://www.tomshardware.com/news/interlagos-bulldozer-opteron-16-core-valencia,13984.html&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;sun&amp;quot;&amp;gt;http://en.wikipedia.org/wiki/UltraSPARC_T1&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58084</id>
		<title>CSC 456 Spring 2012/ch1 BC</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58084"/>
		<updated>2012-02-06T19:05:30Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;From 2006-2012 the increase in the number of transistors on a chip has grown from 167 million to 2.6 billion, a 15x increase.&lt;br /&gt;
&lt;br /&gt;
From 2006-2012 the clock frequency has increased from 2.4ghz to 5.2, a 2.2x increase.&lt;br /&gt;
&lt;br /&gt;
IBM now has the 16-core processor Power PC A2, Intel has the 10 core Xeon E7, AMD has the 16 Opteron Interlagos, and Sun has the 8-core Niagara.&amp;lt;ref name=&amp;quot;Proc Specs&amp;quot;/&amp;gt;&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Evolution of Intel Processors&lt;br /&gt;
|-&lt;br /&gt;
!From&lt;br /&gt;
!Procs&lt;br /&gt;
!Specifications&lt;br /&gt;
!New Features&lt;br /&gt;
|-&lt;br /&gt;
|1971&lt;br /&gt;
|4004&lt;br /&gt;
|740KHz, 2300 transistors, 10 micrometers, 640B addressable memory, 4KB program memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1978&lt;br /&gt;
|8086&lt;br /&gt;
|16-bit, 5-10MHz, 29000 transistors at 3 micrometers, 1MB addressable memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1982&lt;br /&gt;
|80286&lt;br /&gt;
|8-12.5MHz&lt;br /&gt;
|Virtual memory and protection mode&lt;br /&gt;
|-&lt;br /&gt;
|1985&lt;br /&gt;
|386&lt;br /&gt;
|32-bit, 16-33MHz, 275K transistors, 4GB addressable memory&lt;br /&gt;
|Pipelining&lt;br /&gt;
|-&lt;br /&gt;
|1989&lt;br /&gt;
|486&lt;br /&gt;
|25-100MHz, 1.5M transistors&lt;br /&gt;
|FPU integration&lt;br /&gt;
|-&lt;br /&gt;
|1993&lt;br /&gt;
|Pentium&lt;br /&gt;
|60-200MHz&lt;br /&gt;
|On-chip L1 caches and SMP suport&lt;br /&gt;
|-&lt;br /&gt;
|1995&lt;br /&gt;
|Pentium Pro&lt;br /&gt;
|16KB L1 caches, 5.5M transistors&lt;br /&gt;
|OOO execution&lt;br /&gt;
|-&lt;br /&gt;
|1997&lt;br /&gt;
|Pentium MMX&lt;br /&gt;
|233-450MHz, 32KB L1 cache, 4.5M transistors&lt;br /&gt;
|Dynamic branch prediction, MMX instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|1999&lt;br /&gt;
|Pentium III&lt;br /&gt;
|450-1400MHz, 256KB L2 cache on chip, 28M transistors&lt;br /&gt;
|SSE instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|2000&lt;br /&gt;
|Pentium IV&lt;br /&gt;
|1.4-3GHz, 55M transistors&lt;br /&gt;
|Hyperpipelining and SMT&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|2006&lt;br /&gt;
|Xeon&lt;br /&gt;
|64-bit, 2GHz, 167M transistors, 4MB L2 cache on chip&lt;br /&gt;
|Dual-core and virtualization support&lt;br /&gt;
|-&lt;br /&gt;
|'''2008'''&lt;br /&gt;
|'''Intel Core i7'''&lt;br /&gt;
|'''64-bit, 3.2GHz, 730M transistors, 4 core'''&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|'''2010'''&lt;br /&gt;
|'''Intel Xeon &amp;quot;Nehalem-EX&amp;quot;'''&lt;br /&gt;
|'''64-bit, 2.66GHz, 2300M transistors, 8 core'''&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|'''2011'''&lt;br /&gt;
|'''Intel Xeon E7'''&lt;br /&gt;
|'''64-bit, 2.67GHz, 2600M transistors, 10 core'''&lt;br /&gt;
|'''First Intel chip with 10 processors'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Examples of Current Multicore Processors&lt;br /&gt;
|-&lt;br /&gt;
!Name&lt;br /&gt;
!# Cores&lt;br /&gt;
!Clock Freq&lt;br /&gt;
!Clock Type&lt;br /&gt;
!Caches&lt;br /&gt;
!Chip Power&lt;br /&gt;
|-&lt;br /&gt;
|z196&lt;br /&gt;
|4 cores&lt;br /&gt;
|5.3GHz&lt;br /&gt;
|OOO Superscalar&lt;br /&gt;
|128KB L1, 1.5MB L2, 24MB L3, 192MB L4&lt;br /&gt;
|1800W&lt;br /&gt;
|-&lt;br /&gt;
|Intel Xeon E&lt;br /&gt;
|10 cores&lt;br /&gt;
|2.67GHz&lt;br /&gt;
|SIMD&lt;br /&gt;
|64KB L1, 256KB L2, 30MB L3&lt;br /&gt;
|130W&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&amp;lt;references&amp;gt;&lt;br /&gt;
&amp;lt;ref name=&amp;quot;Proc Specs&amp;quot;&amp;gt;This is where the processor information .&amp;lt;/ref&amp;gt;&lt;br /&gt;
&amp;lt;/references&amp;gt;&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58062</id>
		<title>CSC 456 Spring 2012/ch1 BC</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58062"/>
		<updated>2012-02-06T18:54:08Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;From 2006-2012 the increase in the number of transistors on a chip has grown from 167 million to 2.6 billion, a 15x increase.&lt;br /&gt;
&lt;br /&gt;
From 2006-2012 the clock frequency has increased from 2.4ghz to 5.2, a 2.2x increase.&lt;br /&gt;
&lt;br /&gt;
IBM now has the 16-core processor Power PC A2, Intel has the 10 core Xeon E7, AMD has the 16 Opteron Interlagos, and Sun has the 8-core Niagara.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Evolution of Intel Processors&lt;br /&gt;
|-&lt;br /&gt;
!From&lt;br /&gt;
!Procs&lt;br /&gt;
!Specifications&lt;br /&gt;
!New Features&lt;br /&gt;
|-&lt;br /&gt;
|1971&lt;br /&gt;
|4004&lt;br /&gt;
|740KHz, 2300 transistors, 10 micrometers, 640B addressable memory, 4KB program memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1978&lt;br /&gt;
|8086&lt;br /&gt;
|16-bit, 5-10MHz, 29000 transistors at 3 micrometers, 1MB addressable memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1982&lt;br /&gt;
|80286&lt;br /&gt;
|8-12.5MHz&lt;br /&gt;
|Virtual memory and protection mode&lt;br /&gt;
|-&lt;br /&gt;
|1985&lt;br /&gt;
|386&lt;br /&gt;
|32-bit, 16-33MHz, 275K transistors, 4GB addressable memory&lt;br /&gt;
|Pipelining&lt;br /&gt;
|-&lt;br /&gt;
|1989&lt;br /&gt;
|486&lt;br /&gt;
|25-100MHz, 1.5M transistors&lt;br /&gt;
|FPU integration&lt;br /&gt;
|-&lt;br /&gt;
|1993&lt;br /&gt;
|Pentium&lt;br /&gt;
|60-200MHz&lt;br /&gt;
|On-chip L1 caches and SMP suport&lt;br /&gt;
|-&lt;br /&gt;
|1995&lt;br /&gt;
|Pentium Pro&lt;br /&gt;
|16KB L1 caches, 5.5M transistors&lt;br /&gt;
|OOO execution&lt;br /&gt;
|-&lt;br /&gt;
|1997&lt;br /&gt;
|Pentium MMX&lt;br /&gt;
|233-450MHz, 32KB L1 cache, 4.5M transistors&lt;br /&gt;
|Dynamic branch prediction, MMX instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|1999&lt;br /&gt;
|Pentium III&lt;br /&gt;
|450-1400MHz, 256KB L2 cache on chip, 28M transistors&lt;br /&gt;
|SSE instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|2000&lt;br /&gt;
|Pentium IV&lt;br /&gt;
|1.4-3GHz, 55M transistors&lt;br /&gt;
|Hyperpipelining and SMT&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|2006&lt;br /&gt;
|Xeon&lt;br /&gt;
|64-bit, 2GHz, 167M transistors, 4MB L2 cache on chip&lt;br /&gt;
|Dual-core and virtualization support&lt;br /&gt;
|-&lt;br /&gt;
|'''2008'''&lt;br /&gt;
|'''Intel Core i7'''&lt;br /&gt;
|'''64-bit, 3.2GHz, 730M transistors, 4 core'''&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|'''2010'''&lt;br /&gt;
|'''Intel Xeon &amp;quot;Nehalem-EX&amp;quot;'''&lt;br /&gt;
|'''64-bit, 2.66GHz, 2300M transistors, 8 core'''&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|'''2011'''&lt;br /&gt;
|'''Intel Xeon E7'''&lt;br /&gt;
|'''64-bit, 2.67GHz, 2600M transistors, 10 core'''&lt;br /&gt;
|'''First Intel chip with 10 processors'''&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Examples of Current Multicore Processors&lt;br /&gt;
|-&lt;br /&gt;
!Name&lt;br /&gt;
!# Cores&lt;br /&gt;
!Clock Freq&lt;br /&gt;
!Clock Type&lt;br /&gt;
!Caches&lt;br /&gt;
!Chip Power&lt;br /&gt;
|-&lt;br /&gt;
|z196&lt;br /&gt;
|4 cores&lt;br /&gt;
|5.3GHz&lt;br /&gt;
|OOO Superscalar&lt;br /&gt;
|128KB L1, 1.5MB L2, 24MB L3, 192MB L4&lt;br /&gt;
|1800W&lt;br /&gt;
|-&lt;br /&gt;
|Intel Xeon E&lt;br /&gt;
|10 cores&lt;br /&gt;
|2.67GHz&lt;br /&gt;
|SIMD&lt;br /&gt;
|64KB L1, 256KB L2, 30MB L3&lt;br /&gt;
|130W&lt;br /&gt;
|}&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58055</id>
		<title>CSC 456 Spring 2012/ch1 BC</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=58055"/>
		<updated>2012-02-06T18:41:34Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;From 2006-2012 the increase in the number of transistors on a chip has grown from 167 million to 2.6 billion, a 15x increase.&lt;br /&gt;
&lt;br /&gt;
From 2006-2012 the clock frequency has increased from 2.4ghz to 5.2, a 2.2x increase.&lt;br /&gt;
&lt;br /&gt;
IBM now has the 16-core processor Power PC A2, Intel has the 10 core Xeon E7, AMD has the 16 Opteron Interlagos, and Sun has the 8-core Niagara.&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
!Evolution of Intel Processors&lt;br /&gt;
|-&lt;br /&gt;
!From&lt;br /&gt;
!Procs&lt;br /&gt;
!Specifications&lt;br /&gt;
!New Features&lt;br /&gt;
|-&lt;br /&gt;
|1971&lt;br /&gt;
|4004&lt;br /&gt;
|740 KHz, 2300 transistors, 10 micrometers, 640B addressable memory, 4 KB program memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1978&lt;br /&gt;
|8086&lt;br /&gt;
|16 bit, 5-10 MHz, 29000 transistors at 3 micrometers, 1MB addressable memory&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|1982&lt;br /&gt;
|80286&lt;br /&gt;
|8-12.5 MHz&lt;br /&gt;
|Virtual memory and protection mode&lt;br /&gt;
|-&lt;br /&gt;
|1985&lt;br /&gt;
|386&lt;br /&gt;
|32 bit, 16-33MHz, 275K transistors, 4 GB addressable memory&lt;br /&gt;
|Pipelining&lt;br /&gt;
|-&lt;br /&gt;
|1989&lt;br /&gt;
|486&lt;br /&gt;
|25-100MHz, 1.5M transistors&lt;br /&gt;
|FPU integration&lt;br /&gt;
|-&lt;br /&gt;
|1993&lt;br /&gt;
|Pentium&lt;br /&gt;
|60-200 MHz&lt;br /&gt;
|On-chip L1 caches and SMP suport&lt;br /&gt;
|-&lt;br /&gt;
|1995&lt;br /&gt;
|Pentium Pro&lt;br /&gt;
|16KB L1 caches, 5.5M transistors&lt;br /&gt;
|OOO execution&lt;br /&gt;
|-&lt;br /&gt;
|1997&lt;br /&gt;
|Pentium MMX&lt;br /&gt;
|233-450 MHz, 32KB L1 cache, 4.5M transistors&lt;br /&gt;
|Dynamic branch prediction, MMX instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|199&lt;br /&gt;
|Pentium III&lt;br /&gt;
|450-1400MHz, 256KB L2 cache on chip, 28M transistors&lt;br /&gt;
|SSE instruction sets&lt;br /&gt;
|-&lt;br /&gt;
|2000&lt;br /&gt;
|Pentium IV&lt;br /&gt;
|1.4-3 GHz, 55M transistors&lt;br /&gt;
|Hyperpipelining and SMT&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|2006&lt;br /&gt;
|Xeon&lt;br /&gt;
|64 bit, 2 GHz, 167M transistors, 4MB L2 cache on chip&lt;br /&gt;
|Dual-core and virtualization support&lt;br /&gt;
|-&lt;br /&gt;
|2008&lt;br /&gt;
|Intel Core i7&lt;br /&gt;
|64bit, 3.2GHz, 730m transistors, 4 core&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|2010&lt;br /&gt;
|Intel Xeon &amp;quot;Nehalem-EX&amp;quot;&lt;br /&gt;
|64bit, 2.66ghz, 2300m transistors, 8 core&lt;br /&gt;
|&lt;br /&gt;
|-	&lt;br /&gt;
|2011&lt;br /&gt;
|Intel Xeon E7&lt;br /&gt;
|64bit, 2.67ghz, 2600m transistors, 10 core	&lt;br /&gt;
|first Intel chip with 10 processors&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
z196&lt;br /&gt;
4 cores&lt;br /&gt;
5.3ghz&lt;br /&gt;
OOO Superscalar&lt;br /&gt;
128kb L1&lt;br /&gt;
1.5mb L2&lt;br /&gt;
24mb L3&lt;br /&gt;
192mb L4&lt;br /&gt;
1800 watts&lt;br /&gt;
&lt;br /&gt;
Xeon E&lt;br /&gt;
10 cores&lt;br /&gt;
2.67ghz&lt;br /&gt;
SIMD&lt;br /&gt;
64kb l1&lt;br /&gt;
256kb l2&lt;br /&gt;
30mbl3&lt;br /&gt;
130W&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2011&amp;diff=57300</id>
		<title>CSC 456 Spring 2011</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2011&amp;diff=57300"/>
		<updated>2012-01-30T18:25:50Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;[[Chapter 6: Joshua Mohundro, Patrick Wong]]&amp;lt;br /&amp;gt;&lt;br /&gt;
[[Chapter 6: Allison Hamann, Chris Barile]]&amp;lt;br /&amp;gt;&lt;br /&gt;
[[CSC 456 Spring 2012/ch1 BC]]&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2011&amp;diff=57299</id>
		<title>CSC 456 Spring 2011</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2011&amp;diff=57299"/>
		<updated>2012-01-30T18:25:30Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;[[Chapter 6: Joshua Mohundro, Patrick Wong]]&amp;lt;br /&amp;gt;&lt;br /&gt;
[[Chapter 6: Allison Hamann, Chris Barile]]&lt;br /&gt;
[[CSC 456 Spring 2012/ch1 BC]]&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
	<entry>
		<id>https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=57298</id>
		<title>CSC 456 Spring 2012/ch1 BC</title>
		<link rel="alternate" type="text/html" href="https://wiki.expertiza.ncsu.edu/index.php?title=CSC_456_Spring_2012/ch1_BC&amp;diff=57298"/>
		<updated>2012-01-30T18:23:14Z</updated>

		<summary type="html">&lt;p&gt;Bachish2: Created page with &amp;quot;Placeholder&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Placeholder&lt;/div&gt;</summary>
		<author><name>Bachish2</name></author>
	</entry>
</feed>