CSC/ECE 506 Fall 2007/wiki1 1.3.3 1.3.4 chase2007: Difference between revisions

From Expertiza_Wiki
Jump to navigation Jump to search
No edit summary
No edit summary
Line 55: Line 55:
'''Communication Time = Overhead + Occupancy + Network Delay'''
'''Communication Time = Overhead + Occupancy + Network Delay'''
'''Overlap = The portion of the communication operation that is performed concurrently with other useful work.'''
'''Overlap = The portion of the communication operation that is performed concurrently with other useful work.'''
===<u>BSP Model:</u>===
'''Bulk Synchronization Model:'''
'''The BSP model is a series of supersteps.  superstep on a BSP machine is given by W + G*H + L'''
'''W is the maximum possible work for a given processor'''
'''H is the maximum bytes sent or recieved by a processor'''
'''G is the number of available processors'''
'''L is the time required for the barrier synchronization'''
===<u>LogP Model:</u>===


===<u>References:</u>===
===<u>References:</u>===
Line 60: Line 72:
    
    
http://en.wikipedia.org/wiki/Computer_performance
http://en.wikipedia.org/wiki/Computer_performance
The BSP Model
http://wwwcs.uni-paderborn.de/fachbereich/AG/agmadh/WWW/bono/paper/nestedbsp/node6.html
LogP: Towards a Realistic Model of Parallel Computation
http://cs315b-wiki.stanford.edu/images/8/8b/Logp.pdf

Revision as of 23:16, 10 September 2007

Introduction to Parallel Computer Architecture ->Fundamental Design Issues -> Communication and Replication (section 1.3.3)

Replication:

Replication is the creation of a local copy of data to help enable parallelism.

Communication:

Communication occurs when data written from one process is read by another process. Replication helps to avoid unnecessary communication.


Introduction to Parallel Computer Architecture ->Fundamental Design Issues -> Performance (section 1.3.4)

Computer Performance:

Computer performance is a measure of the output of a computer with with respect to time and resources used.

Performance metrics:

Following are the important metrics used to measure a computer's performance:

1.Latency: The time taken to perform an operation
2.Bandwidth:The rate at which the operations are performed
3.Cost: The impact these operations have on the execution time of the program

All the above metrics can be used to define a uniprocessor systems where a single CPU operates.

However, in the context of parallel computers, it becomes difficult to express the performance in above stated metrics. The reason for this is the communication between the processors that occurs mostly in the form of data transfers between the processors. So, to completely define the performace of a parallel computer, the following metrics are also considered.

Data Transfer Time:

It is the time taken for initiation of a data transfer and the time required for actual data transfer. So the Data Transfer Time can be given as:

Transfer Time (n) = T+(n/B)
where
n = Amount of Data (in bytes)
B = Transfer Rate of the component moving the data (bytes per second)
T = Start up cost, a constant

Overhead and Occupancy:

The data transfer operations are initiated by the processor through communication assist
The overhead is the time the processor spends initiating the transfer of data. This may be a fixed cost, if the processor imply has to tell the communication assist to start. The overhead can also be linear with Tranfer time, if the processor has to copy the data into the assist. The occupancy is the time it takes for the data to pass through the slowest componant on the communication path. The occupancy limits how frequently communication operations can be initiated.The next data transfer will have to wait untill the critical resource is no longer occupied before it can use the same resource.

Communication Cost:

It is the time that the processor spends in communicating with other processors. It can be given by the following:

Communication Cost = Frequency of Communication * (Communication Time - Overlap)

Frequency of Communication = Number of communications per unit of work Communication Time = Overhead + Occupancy + Network Delay Overlap = The portion of the communication operation that is performed concurrently with other useful work.

BSP Model:

Bulk Synchronization Model: The BSP model is a series of supersteps. superstep on a BSP machine is given by W + G*H + L W is the maximum possible work for a given processor H is the maximum bytes sent or recieved by a processor G is the number of available processors L is the time required for the barrier synchronization

LogP Model:

References:

Parallel Computer Architecture- A Hardware/Software Approach by David E Culler, Jaswinder Pal Singh and Anoop Guptha

http://en.wikipedia.org/wiki/Computer_performance

The BSP Model http://wwwcs.uni-paderborn.de/fachbereich/AG/agmadh/WWW/bono/paper/nestedbsp/node6.html

LogP: Towards a Realistic Model of Parallel Computation http://cs315b-wiki.stanford.edu/images/8/8b/Logp.pdf