Load balancing algorithm

I made some minor adjustments to the load balancing algorithm to match the description in the paper and the initial results are encouraging… Now the threading / dynamic load balancing is 10% faster on 8 cores… Need to do some scaling runs on 32 / 64 to confirm that this trend persists. Also trying to figure out the right functional form to fit the scaling data.

Weak Scaling

Weak scaling results for various levels of AMR for 1282 cells per proc per level

And the weak scaling results for 3 levels of AMR with 642, 1282, and 5122 cells per proc per level

If we let represent the total number of cells to update, and represent the computational cost per cell and be the additional computational work that is incurred with processors, and let be the wall time then

Furthermore if then for we can taylor expand which combined with the scaling results gives

Also

However, if we assume that then

The problem is we don't have enough data points to determine which functional fit is accurate so it is difficult to project.

So I went back and looked at different possible fit functions and plotted the results…

The best fit is the first one and if it holds, implies performance drops to 50% at or 1.74 KProcs and to 25% at 5.3 GProcs! Granted this is weak scaling and your resolution at 5.3 GProcs would be 36 MCells2 and simulating one crossing time would take 12 yrs instead of 3 yrs… but that's why strong scaling is more important… or being able to weakly scale when the workload per processor is only a few cells.

Why would the scaling be proportional to ? If we consider the distribution of grids at each time step and assume that the workload distribution follows a normal distribution with some standard deviation , then the time to complete each step is not given by the mean of the workload but by the max of the workload . The average maximum value is given by the last order statistic where is the number of processors. I couldn't find an expression for the last order statistic for a normal distribution, but monte-carlo experiments give a fitting function . Given the best fit this implies that

Attachments (6)

Download all attachments as: .zip

Comments

No comments.