Merge pull request #42 from ilyakurdyukov/master
Added Rockchip RK3528 results
Showing 64 changed files with 2,311 additions and 0 deletions.
33 changes: 33 additions & 0 deletions
results/rockchip-arm-rk3528-h96max_m1/STREAM_arm-native_O2_mp1.stdout_stderr.log
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,33 @@ | ||
------------------------------------------------------------- | ||
STREAM version $Revision: 5.10 $ | ||
------------------------------------------------------------- | ||
This system uses 8 bytes per array element. | ||
------------------------------------------------------------- | ||
Array size = 80000000 (elements), Offset = 0 (elements) | ||
Memory per array = 610.4 MiB (= 0.6 GiB). | ||
Total memory required = 1831.1 MiB (= 1.8 GiB). | ||
Each kernel will be executed 20 times. | ||
The *best* time for each kernel (excluding the first iteration) | ||
will be used to compute the reported bandwidth. | ||
------------------------------------------------------------- | ||
Number of Threads requested = 1 | ||
Number of Threads counted = 1 | ||
------------------------------------------------------------- | ||
Your clock granularity/precision appears to be 1 microseconds. | ||
Each test below will take on the order of 426975 microseconds. | ||
(= 426975 clock ticks) | ||
Increase the size of the arrays if this shows that | ||
you are not getting at least 20 clock ticks per test. | ||
------------------------------------------------------------- | ||
WARNING -- The above is only a rough guideline. | ||
For best results, please be sure you know the | ||
precision of your system timer. | ||
------------------------------------------------------------- | ||
Function Best Rate MB/s Avg time Min time Max time | ||
Copy: 3705.9 0.347040 0.345396 0.348966 | ||
Scale: 2958.8 0.447160 0.432607 0.455397 | ||
Add: 2521.5 0.761948 0.761448 0.762583 | ||
Triad: 2227.8 0.862771 0.861836 0.863757 | ||
------------------------------------------------------------- | ||
Solution Validates: avg error less than 1.000000e-13 on all three arrays | ||
------------------------------------------------------------- |
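
The "Best Rate MB/s" column in the log above can be cross-checked against the "Min time" column. The following is a minimal sketch, assuming the standard STREAM accounting in which Copy and Scale each touch two arrays per element and Add and Triad touch three, with MB meaning 10^6 bytes. The dictionary and function names are illustrative and are not part of the benchmark itself; the timings are copied from the single-thread O2 log.

```python
# Values taken from the log header above.
BYTES_PER_ELEMENT = 8
ARRAY_SIZE = 80_000_000

# Arrays read or written per element by each kernel (assumed STREAM accounting).
ARRAYS_TOUCHED = {"Copy": 2, "Scale": 2, "Add": 3, "Triad": 3}

# "Min time" column (seconds) from the O2, one-thread run.
MIN_TIME = {
    "Copy": 0.345396,
    "Scale": 0.432607,
    "Add": 0.761448,
    "Triad": 0.861836,
}


def best_rate_mb_s(kernel: str, min_time: float) -> float:
    """Return bandwidth in MB/s (1 MB = 10**6 bytes) for one kernel."""
    bytes_moved = ARRAYS_TOUCHED[kernel] * BYTES_PER_ELEMENT * ARRAY_SIZE
    return 1.0e-6 * bytes_moved / min_time


rates = {k: round(best_rate_mb_s(k, t), 1) for k, t in MIN_TIME.items()}

# Memory footprint of one array in MiB, as reported in the log header.
mib_per_array = round(ARRAY_SIZE * BYTES_PER_ELEMENT / 2**20, 1)

for kernel, rate in rates.items():
    print(f"{kernel}: {rate} MB/s")
print(f"Memory per array: {mib_per_array} MiB")
```

Under these assumptions the computed rates agree with the logged values to the printed precision, which suggests the reported bandwidth is simply bytes moved divided by the best (minimum) time.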
33 changes: 33 additions & 0 deletions
results/rockchip-arm-rk3528-h96max_m1/STREAM_arm-native_O2_mp2.stdout_stderr.log
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,33 @@ | ||
------------------------------------------------------------- | ||
STREAM version $Revision: 5.10 $ | ||
------------------------------------------------------------- | ||
This system uses 8 bytes per array element. | ||
------------------------------------------------------------- | ||
Array size = 80000000 (elements), Offset = 0 (elements) | ||
Memory per array = 610.4 MiB (= 0.6 GiB). | ||
Total memory required = 1831.1 MiB (= 1.8 GiB). | ||
Each kernel will be executed 20 times. | ||
The *best* time for each kernel (excluding the first iteration) | ||
will be used to compute the reported bandwidth. | ||
------------------------------------------------------------- | ||
Number of Threads requested = 2 | ||
Number of Threads counted = 2 | ||
------------------------------------------------------------- | ||
Your clock granularity/precision appears to be 1 microseconds. | ||
Each test below will take on the order of 315566 microseconds. | ||
(= 315566 clock ticks) | ||
Increase the size of the arrays if this shows that | ||
you are not getting at least 20 clock ticks per test. | ||
------------------------------------------------------------- | ||
WARNING -- The above is only a rough guideline. | ||
For best results, please be sure you know the | ||
precision of your system timer. | ||
------------------------------------------------------------- | ||
Function Best Rate MB/s Avg time Min time Max time | ||
Copy: 4534.9 0.283810 0.282256 0.286187 | ||
Scale: 4379.6 0.296852 0.292267 0.300219 | ||
Add: 4034.9 0.478230 0.475843 0.482446 | ||
Triad: 3868.4 0.498390 0.496334 0.501047 | ||
------------------------------------------------------------- | ||
Solution Validates: avg error less than 1.000000e-13 on all three arrays | ||
------------------------------------------------------------- |
33 changes: 33 additions & 0 deletions
results/rockchip-arm-rk3528-h96max_m1/STREAM_arm-native_O2_mp3.stdout_stderr.log
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,33 @@ | ||
------------------------------------------------------------- | ||
STREAM version $Revision: 5.10 $ | ||
------------------------------------------------------------- | ||
This system uses 8 bytes per array element. | ||
------------------------------------------------------------- | ||
Array size = 80000000 (elements), Offset = 0 (elements) | ||
Memory per array = 610.4 MiB (= 0.6 GiB). | ||
Total memory required = 1831.1 MiB (= 1.8 GiB). | ||
Each kernel will be executed 20 times. | ||
The *best* time for each kernel (excluding the first iteration) | ||
will be used to compute the reported bandwidth. | ||
------------------------------------------------------------- | ||
Number of Threads requested = 3 | ||
Number of Threads counted = 3 | ||
------------------------------------------------------------- | ||
Your clock granularity/precision appears to be 1 microseconds. | ||
Each test below will take on the order of 277948 microseconds. | ||
(= 277948 clock ticks) | ||
Increase the size of the arrays if this shows that | ||
you are not getting at least 20 clock ticks per test. | ||
------------------------------------------------------------- | ||
WARNING -- The above is only a rough guideline. | ||
For best results, please be sure you know the | ||
precision of your system timer. | ||
------------------------------------------------------------- | ||
Function Best Rate MB/s Avg time Min time Max time | ||
Copy: 4993.7 0.257680 0.256321 0.258705 | ||
Scale: 4940.4 0.260631 0.259089 0.261941 | ||
Add: 4675.7 0.412184 0.410635 0.414083 | ||
Triad: 4591.9 0.419038 0.418129 0.420066 | ||
------------------------------------------------------------- | ||
Solution Validates: avg error less than 1.000000e-13 on all three arrays | ||
------------------------------------------------------------- |
33 changes: 33 additions & 0 deletions
results/rockchip-arm-rk3528-h96max_m1/STREAM_arm-native_O2_mp4.stdout_stderr.log
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,33 @@ | ||
------------------------------------------------------------- | ||
STREAM version $Revision: 5.10 $ | ||
------------------------------------------------------------- | ||
This system uses 8 bytes per array element. | ||
------------------------------------------------------------- | ||
Array size = 80000000 (elements), Offset = 0 (elements) | ||
Memory per array = 610.4 MiB (= 0.6 GiB). | ||
Total memory required = 1831.1 MiB (= 1.8 GiB). | ||
Each kernel will be executed 20 times. | ||
The *best* time for each kernel (excluding the first iteration) | ||
will be used to compute the reported bandwidth. | ||
------------------------------------------------------------- | ||
Number of Threads requested = 4 | ||
Number of Threads counted = 4 | ||
------------------------------------------------------------- | ||
Your clock granularity/precision appears to be 1 microseconds. | ||
Each test below will take on the order of 274307 microseconds. | ||
(= 274307 clock ticks) | ||
Increase the size of the arrays if this shows that | ||
you are not getting at least 20 clock ticks per test. | ||
------------------------------------------------------------- | ||
WARNING -- The above is only a rough guideline. | ||
For best results, please be sure you know the | ||
precision of your system timer. | ||
------------------------------------------------------------- | ||
Function Best Rate MB/s Avg time Min time Max time | ||
Copy: 5043.6 0.254899 0.253789 0.256534 | ||
Scale: 5025.7 0.256102 0.254689 0.258703 | ||
Add: 5030.6 0.383381 0.381664 0.385595 | ||
Triad: 5005.5 0.386038 0.383580 0.388375 | ||
------------------------------------------------------------- | ||
Solution Validates: avg error less than 1.000000e-13 on all three arrays | ||
------------------------------------------------------------- |
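
The four O2 logs above (one to four threads) can be summarised as a scaling ratio per kernel. This is a small sketch using only the "Best Rate MB/s" values from the one-thread and four-thread O2 runs; the variable names are illustrative, and the ratio is just a convenience metric, not something the benchmark reports.

```python
# "Best Rate MB/s" from the O2 logs, one thread vs. four threads.
O2_MP1 = {"Copy": 3705.9, "Scale": 2958.8, "Add": 2521.5, "Triad": 2227.8}
O2_MP4 = {"Copy": 5043.6, "Scale": 5025.7, "Add": 5030.6, "Triad": 5005.5}

# Ratio of four-thread to one-thread bandwidth for each kernel.
speedup = {k: O2_MP4[k] / O2_MP1[k] for k in O2_MP1}

for kernel, ratio in speedup.items():
    print(f"{kernel}: {ratio:.2f}x")
```

With these numbers every kernel ends up near 5,000 MB/s at four threads, so the kernels that are slowest on one thread (Add, Triad) show the largest ratios.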
33 changes: 33 additions & 0 deletions
results/rockchip-arm-rk3528-h96max_m1/STREAM_arm-native_O3_mp1.stdout_stderr.log
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,33 @@ | ||
------------------------------------------------------------- | ||
STREAM version $Revision: 5.10 $ | ||
------------------------------------------------------------- | ||
This system uses 8 bytes per array element. | ||
------------------------------------------------------------- | ||
Array size = 80000000 (elements), Offset = 0 (elements) | ||
Memory per array = 610.4 MiB (= 0.6 GiB). | ||
Total memory required = 1831.1 MiB (= 1.8 GiB). | ||
Each kernel will be executed 20 times. | ||
The *best* time for each kernel (excluding the first iteration) | ||
will be used to compute the reported bandwidth. | ||
------------------------------------------------------------- | ||
Number of Threads requested = 1 | ||
Number of Threads counted = 1 | ||
------------------------------------------------------------- | ||
Your clock granularity/precision appears to be 1 microseconds. | ||
Each test below will take on the order of 408100 microseconds. | ||
(= 408100 clock ticks) | ||
Increase the size of the arrays if this shows that | ||
you are not getting at least 20 clock ticks per test. | ||
------------------------------------------------------------- | ||
WARNING -- The above is only a rough guideline. | ||
For best results, please be sure you know the | ||
precision of your system timer. | ||
------------------------------------------------------------- | ||
Function Best Rate MB/s Avg time Min time Max time | ||
Copy: 3904.9 0.334820 0.327797 0.411225 | ||
Scale: 3396.0 0.386586 0.376910 0.414282 | ||
Add: 2707.9 0.711492 0.709045 0.720314 | ||
Triad: 2839.5 0.678142 0.676165 0.689665 | ||
------------------------------------------------------------- | ||
Solution Validates: avg error less than 1.000000e-13 on all three arrays | ||
------------------------------------------------------------- |
33 changes: 33 additions & 0 deletions
results/rockchip-arm-rk3528-h96max_m1/STREAM_arm-native_O3_mp2.stdout_stderr.log
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,33 @@ | ||
------------------------------------------------------------- | ||
STREAM version $Revision: 5.10 $ | ||
------------------------------------------------------------- | ||
This system uses 8 bytes per array element. | ||
------------------------------------------------------------- | ||
Array size = 80000000 (elements), Offset = 0 (elements) | ||
Memory per array = 610.4 MiB (= 0.6 GiB). | ||
Total memory required = 1831.1 MiB (= 1.8 GiB). | ||
Each kernel will be executed 20 times. | ||
The *best* time for each kernel (excluding the first iteration) | ||
will be used to compute the reported bandwidth. | ||
------------------------------------------------------------- | ||
Number of Threads requested = 2 | ||
Number of Threads counted = 2 | ||
------------------------------------------------------------- | ||
Your clock granularity/precision appears to be 1 microseconds. | ||
Each test below will take on the order of 313171 microseconds. | ||
(= 313171 clock ticks) | ||
Increase the size of the arrays if this shows that | ||
you are not getting at least 20 clock ticks per test. | ||
------------------------------------------------------------- | ||
WARNING -- The above is only a rough guideline. | ||
For best results, please be sure you know the | ||
precision of your system timer. | ||
------------------------------------------------------------- | ||
Function Best Rate MB/s Avg time Min time Max time | ||
Copy: 4191.3 0.306857 0.305393 0.308419 | ||
Scale: 4133.5 0.311589 0.309667 0.313781 | ||
Add: 3663.5 0.525060 0.524083 0.525854 | ||
Triad: 3677.9 0.522822 0.522031 0.523269 | ||
------------------------------------------------------------- | ||
Solution Validates: avg error less than 1.000000e-13 on all three arrays | ||
------------------------------------------------------------- |
33 changes: 33 additions & 0 deletions
results/rockchip-arm-rk3528-h96max_m1/STREAM_arm-native_O3_mp3.stdout_stderr.log
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,33 @@ | ||
------------------------------------------------------------- | ||
STREAM version $Revision: 5.10 $ | ||
------------------------------------------------------------- | ||
This system uses 8 bytes per array element. | ||
------------------------------------------------------------- | ||
Array size = 80000000 (elements), Offset = 0 (elements) | ||
Memory per array = 610.4 MiB (= 0.6 GiB). | ||
Total memory required = 1831.1 MiB (= 1.8 GiB). | ||
Each kernel will be executed 20 times. | ||
The *best* time for each kernel (excluding the first iteration) | ||
will be used to compute the reported bandwidth. | ||
------------------------------------------------------------- | ||
Number of Threads requested = 3 | ||
Number of Threads counted = 3 | ||
------------------------------------------------------------- | ||
Your clock granularity/precision appears to be 1 microseconds. | ||
Each test below will take on the order of 276467 microseconds. | ||
(= 276467 clock ticks) | ||
Increase the size of the arrays if this shows that | ||
you are not getting at least 20 clock ticks per test. | ||
------------------------------------------------------------- | ||
WARNING -- The above is only a rough guideline. | ||
For best results, please be sure you know the | ||
precision of your system timer. | ||
------------------------------------------------------------- | ||
Function Best Rate MB/s Avg time Min time Max time | ||
Copy: 4936.9 0.260336 0.259270 0.264532 | ||
Scale: 4944.7 0.260846 0.258865 0.262494 | ||
Add: 4468.8 0.430892 0.429646 0.432803 | ||
Triad: 4457.6 0.431594 0.430727 0.436648 | ||
------------------------------------------------------------- | ||
Solution Validates: avg error less than 1.000000e-13 on all three arrays | ||
------------------------------------------------------------- |
33 changes: 33 additions & 0 deletions
results/rockchip-arm-rk3528-h96max_m1/STREAM_arm-native_O3_mp4.stdout_stderr.log
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,33 @@ | ||
------------------------------------------------------------- | ||
STREAM version $Revision: 5.10 $ | ||
------------------------------------------------------------- | ||
This system uses 8 bytes per array element. | ||
------------------------------------------------------------- | ||
Array size = 80000000 (elements), Offset = 0 (elements) | ||
Memory per array = 610.4 MiB (= 0.6 GiB). | ||
Total memory required = 1831.1 MiB (= 1.8 GiB). | ||
Each kernel will be executed 20 times. | ||
The *best* time for each kernel (excluding the first iteration) | ||
will be used to compute the reported bandwidth. | ||
------------------------------------------------------------- | ||
Number of Threads requested = 4 | ||
Number of Threads counted = 4 | ||
------------------------------------------------------------- | ||
Your clock granularity/precision appears to be 1 microseconds. | ||
Each test below will take on the order of 269657 microseconds. | ||
(= 269657 clock ticks) | ||
Increase the size of the arrays if this shows that | ||
you are not getting at least 20 clock ticks per test. | ||
------------------------------------------------------------- | ||
WARNING -- The above is only a rough guideline. | ||
For best results, please be sure you know the | ||
precision of your system timer. | ||
------------------------------------------------------------- | ||
Function Best Rate MB/s Avg time Min time Max time | ||
Copy: 5004.6 0.256931 0.255766 0.259589 | ||
Scale: 5030.7 0.256262 0.254440 0.259680 | ||
Add: 4918.7 0.394988 0.390346 0.407445 | ||
Triad: 4927.9 0.393266 0.389622 0.411951 | ||
------------------------------------------------------------- | ||
Solution Validates: avg error less than 1.000000e-13 on all three arrays | ||
------------------------------------------------------------- |
33 changes: 33 additions & 0 deletions
results/rockchip-arm-rk3528-h96max_m1/STREAM_arm-native_Ofast_mp1.stdout_stderr.log
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,33 @@ | ||
------------------------------------------------------------- | ||
STREAM version $Revision: 5.10 $ | ||
------------------------------------------------------------- | ||
This system uses 8 bytes per array element. | ||
------------------------------------------------------------- | ||
Array size = 80000000 (elements), Offset = 0 (elements) | ||
Memory per array = 610.4 MiB (= 0.6 GiB). | ||
Total memory required = 1831.1 MiB (= 1.8 GiB). | ||
Each kernel will be executed 20 times. | ||
The *best* time for each kernel (excluding the first iteration) | ||
will be used to compute the reported bandwidth. | ||
------------------------------------------------------------- | ||
Number of Threads requested = 1 | ||
Number of Threads counted = 1 | ||
------------------------------------------------------------- | ||
Your clock granularity/precision appears to be 1 microseconds. | ||
Each test below will take on the order of 415095 microseconds. | ||
(= 415095 clock ticks) | ||
Increase the size of the arrays if this shows that | ||
you are not getting at least 20 clock ticks per test. | ||
------------------------------------------------------------- | ||
WARNING -- The above is only a rough guideline. | ||
For best results, please be sure you know the | ||
precision of your system timer. | ||
------------------------------------------------------------- | ||
Function Best Rate MB/s Avg time Min time Max time | ||
Copy: 3886.4 0.331473 0.329358 0.333419 | ||
Scale: 3374.3 0.389186 0.379337 0.404814 | ||
Add: 2702.1 0.712719 0.710569 0.719447 | ||
Triad: 2838.0 0.677480 0.676524 0.678186 | ||
------------------------------------------------------------- | ||
Solution Validates: avg error less than 1.000000e-13 on all three arrays | ||
------------------------------------------------------------- |
33 changes: 33 additions & 0 deletions
results/rockchip-arm-rk3528-h96max_m1/STREAM_arm-native_Ofast_mp2.stdout_stderr.log
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,33 @@ | ||
------------------------------------------------------------- | ||
STREAM version $Revision: 5.10 $ | ||
------------------------------------------------------------- | ||
This system uses 8 bytes per array element. | ||
------------------------------------------------------------- | ||
Array size = 80000000 (elements), Offset = 0 (elements) | ||
Memory per array = 610.4 MiB (= 0.6 GiB). | ||
Total memory required = 1831.1 MiB (= 1.8 GiB). | ||
Each kernel will be executed 20 times. | ||
The *best* time for each kernel (excluding the first iteration) | ||
will be used to compute the reported bandwidth. | ||
------------------------------------------------------------- | ||
Number of Threads requested = 2 | ||
Number of Threads counted = 2 | ||
------------------------------------------------------------- | ||
Your clock granularity/precision appears to be 1 microseconds. | ||
Each test below will take on the order of 313294 microseconds. | ||
(= 313294 clock ticks) | ||
Increase the size of the arrays if this shows that | ||
you are not getting at least 20 clock ticks per test. | ||
------------------------------------------------------------- | ||
WARNING -- The above is only a rough guideline. | ||
For best results, please be sure you know the | ||
precision of your system timer. | ||
------------------------------------------------------------- | ||
Function Best Rate MB/s Avg time Min time Max time | ||
Copy: 4409.2 0.292001 0.290305 0.293500 | ||
Scale: 4239.1 0.303478 0.301953 0.304883 | ||
Add: 3857.3 0.498701 0.497764 0.499437 | ||
Triad: 3850.6 0.499403 0.498620 0.500366 | ||
------------------------------------------------------------- | ||
Solution Validates: avg error less than 1.000000e-13 on all three arrays | ||
------------------------------------------------------------- |
33 changes: 33 additions & 0 deletions
results/rockchip-arm-rk3528-h96max_m1/STREAM_arm-native_Ofast_mp3.stdout_stderr.log
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,33 @@ | ||
------------------------------------------------------------- | ||
STREAM version $Revision: 5.10 $ | ||
------------------------------------------------------------- | ||
This system uses 8 bytes per array element. | ||
------------------------------------------------------------- | ||
Array size = 80000000 (elements), Offset = 0 (elements) | ||
Memory per array = 610.4 MiB (= 0.6 GiB). | ||
Total memory required = 1831.1 MiB (= 1.8 GiB). | ||
Each kernel will be executed 20 times. | ||
The *best* time for each kernel (excluding the first iteration) | ||
will be used to compute the reported bandwidth. | ||
------------------------------------------------------------- | ||
Number of Threads requested = 3 | ||
Number of Threads counted = 3 | ||
------------------------------------------------------------- | ||
Your clock granularity/precision appears to be 1 microseconds. | ||
Each test below will take on the order of 277614 microseconds. | ||
(= 277614 clock ticks) | ||
Increase the size of the arrays if this shows that | ||
you are not getting at least 20 clock ticks per test. | ||
------------------------------------------------------------- | ||
WARNING -- The above is only a rough guideline. | ||
For best results, please be sure you know the | ||
precision of your system timer. | ||
------------------------------------------------------------- | ||
Function Best Rate MB/s Avg time Min time Max time | ||
Copy: 4950.3 0.259600 0.258569 0.260578 | ||
Scale: 4905.6 0.263071 0.260927 0.264708 | ||
Add: 4473.1 0.430423 0.429233 0.431693 | ||
Triad: 4482.6 0.429325 0.428320 0.430483 | ||
------------------------------------------------------------- | ||
Solution Validates: avg error less than 1.000000e-13 on all three arrays | ||
------------------------------------------------------------- |