7 | | * Extremely poor scaling on Bluestreak |
8 | | |
9 | | * Memory errors with more nodes (remedied with different hypre library) |
10 | | |
11 | | * Delicate balance between speeding up sims with more nodes, and encountering memory issues that kill the sims (remedied with different hypre library) |
12 | | |
13 | | || Nodes || Frames || Time to make 1 frame || Notes || |
14 | | || 32 || 32-39 || 3 hours || wall-time ran out || |
15 | | || 256 || 39-48 || 1 hours || memory died 48.4 || |
16 | | || 512 || 48-49 || 1 hours || memory died 49.5 || |
17 | | || '''32''' || '''49-50''' || '''4 hours''' || '''walltime ran out''' || |
18 | | |
19 | | [[Image(chart_1.png)]] |
20 | | |
21 | | As the number of nodes increases, more patches are made, leading to more ghost zones. Thus, the global info as reported in standard out (that includes physical and ghost zones) increases with nodes. |
22 | | |
23 | | [[Image(chart_2.png)]] |
24 | | |
25 | | Peak goes down, as the amount of info is distributed over more and more cores. |
26 | | |
27 | | [[Image(chart_4.png)]] |
28 | | |
29 | | Percent efficiency as given in the standard out. |
30 | | |
31 | | |
32 | | [[Image(chart_3.png)]] |
33 | | |
34 | | The time to write to file increases with nodes. This can affect scaling computation (see http://astrobear.pas.rochester.edu/trac/astrobear/wiki/u/erica/ScalingBluestreak for details). |
35 | | |
36 | | The memory error is given in 2 places: 1) end of astrobear.log, and 2) a 'core' file that is written to the run directory. I am attaching as an example, the memory error reports for Shear15, nodes 256. |
| 60 | == Colliding Flows Run - Hydro, Shear Angle 15 == |
| 61 | |
| 62 | * Extremely poor scaling on Bluestreak |
| 63 | |
| 64 | * Memory errors with more nodes (remedied with different hypre library) |
| 65 | |
| 66 | * Delicate balance between speeding up sims with more nodes, and encountering memory issues that kill the sims (remedied with different hypre library) |
| 67 | |
| 68 | || Nodes || Frames || Time to make 1 frame || Notes || |
| 69 | || 32 || 32-39 || 3 hours || wall-time ran out || |
| 70 | || 256 || 39-48 || 1 hours || memory died 48.4 || |
| 71 | || 512 || 48-49 || 1 hours || memory died 49.5 || |
| 72 | || '''32''' || '''49-50''' || '''4 hours''' || '''walltime ran out''' || |
| 73 | |
| 74 | [[Image(chart_1.png)]] |
| 75 | |
| 76 | As the number of nodes increases, more patches are made, leading to more ghost zones. Thus, the global info as reported in standard out (that includes physical and ghost zones) increases with nodes. |
| 77 | |
| 78 | [[Image(chart_2.png)]] |
| 79 | |
| 80 | Peak goes down, as the amount of info is distributed over more and more cores. |
| 81 | |
| 82 | [[Image(chart_4.png)]] |
| 83 | |
| 84 | Percent efficiency as given in the standard out. |
| 85 | |
| 86 | |
| 87 | [[Image(chart_3.png)]] |
| 88 | |
| 89 | The time to write to file increases with nodes. This can affect scaling computation (see http://astrobear.pas.rochester.edu/trac/astrobear/wiki/u/erica/ScalingBluestreak for details). |
| 90 | |
| 91 | The memory error is given in 2 places: 1) end of astrobear.log, and 2) a 'core' file that is written to the run directory. I am attaching as an example, the memory error reports for Shear15, nodes 256. |