Computational Physics Lectures: How to optimize codes, from vectorization to parallelization

Loading [MathJax]/extensions/TeX/boldsymbol.js

Contents

Speedup and memory

The speedup on $p$ processors can be greater than $p$ if memory usage is optimal! Consider the case of a memorybound computation with $M$ words of memory

If $M/p$ fits into cache while $M$ does not, the time to access memory will be different in the two cases:
$T_1$ uses the main memory bandwidth
$T_p$ uses the appropriate cache bandwidth

«
1
...
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
...
119
»