The key is choosing the correct baseline for comparison
- For our serial vs. vectorization examples, using compiler-provided vectorization, the baseline is simple; the same code, with vectorization turned off
- For parallel applications, this is much harder:
- Choice of algorithm, decomposition, performance of baseline case etc.