-------------------------------------------------------------
This system uses 8 bytes per DOUBLE PRECISION word.
-------------------------------------------------------------
Array size = 32000000, Offset = 0
Total memory required = 732.4 MB.
Each test is run 10 times, but only
the *best* time for each is used.
-------------------------------------------------------------
Number of Threads requested = 4
Number of Threads requested = 4
Number of Threads requested = 4
Number of Threads requested = 4
-------------------------------------------------------------
Your clock granularity/precision appears to be 1 microseconds.
Each test below will take on the order of 41514 microseconds.
   (= 41514 clock ticks)
Increase the size of the arrays if this shows that
you are not getting at least 20 clock ticks per test.
-------------------------------------------------------------
WARNING -- The above is only a rough guideline.
For best results, please be sure you know the
precision of your system timer.
-------------------------------------------------------------
Function      Rate (MB/s)   Avg time     Min time     Max time
Copy:        7132.0329       0.0718       0.0718       0.0719
Scale:       8615.4708       0.0595       0.0594       0.0595
Add:         8396.4578       0.0916       0.0915       0.0919
Triad:       8259.4042       0.0931       0.0930       0.0932
-------------------------------------------------------------
Solution Validates
-------------------------------------------------------------
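The figures reported above can be cross-checked with a little arithmetic. The sketch below is not part of the benchmark; it assumes the usual STREAM accounting, in which Copy and Scale touch two arrays per iteration, Add and Triad touch three, the rate is bytes moved divided by the minimum time with 1 MB = 10^6 bytes, and the "Total memory required" figure uses 2^20 bytes per MB. The names `ARRAYS_TOUCHED`, `total_memory_mib`, and `rate_mb_per_s` are hypothetical helpers introduced only for this illustration. Because the printed minimum times are rounded to four decimals, the recomputed rates agree with the reported ones only approximately.

```python
# Cross-check of the reported figures (illustrative sketch, not benchmark code).
BYTES_PER_WORD = 8          # from "8 bytes per DOUBLE PRECISION word"
ARRAY_SIZE = 32_000_000     # from "Array size = 32000000"

# Assumed STREAM accounting: arrays read plus arrays written per kernel.
ARRAYS_TOUCHED = {"Copy": 2, "Scale": 2, "Add": 3, "Triad": 3}

# Min time (seconds) and reported rate (MB/s), copied from the table above.
MIN_TIME_S = {"Copy": 0.0718, "Scale": 0.0594, "Add": 0.0915, "Triad": 0.0930}
REPORTED_MB_S = {"Copy": 7132.0329, "Scale": 8615.4708,
                 "Add": 8396.4578, "Triad": 8259.4042}


def total_memory_mib(n=ARRAY_SIZE, word=BYTES_PER_WORD, arrays=3):
    """Memory for the three work arrays, in units of 2**20 bytes."""
    return arrays * word * n / 2**20


def rate_mb_per_s(kernel):
    """Bytes moved by one pass of the kernel divided by its minimum time."""
    bytes_moved = ARRAYS_TOUCHED[kernel] * BYTES_PER_WORD * ARRAY_SIZE
    return bytes_moved / MIN_TIME_S[kernel] / 1e6


if __name__ == "__main__":
    print(f"Total memory: {total_memory_mib():.1f} MB")
    for kernel, reported in REPORTED_MB_S.items():
        print(f"{kernel:6s} recomputed {rate_mb_per_s(kernel):10.1f} MB/s, "
              f"reported {reported:10.1f} MB/s")
```

Under these assumptions the recomputed memory footprint matches the 732.4 MB line, and each recomputed rate lands within a fraction of a percent of the table value.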
