Czarships/ExternalResources/RenewCollidingFlow: Scaling_Test_04152014.tex

File Scaling_Test_04152014.tex, 2.9 KB (added by Baowei Liu, 11 years ago)

Scaling Test to be updated

Line 
1\documentclass{article}
2\usepackage{amssymb,amsmath}
3\usepackage{verbatim}
4\usepackage{graphicx}
5\usepackage{caption}
6\usepackage[utf8]{inputenc}
7\usepackage[T1]{fontenc}
8%\usepackage[square, comma, sort&compress]{natbib}
9\usepackage[square,sort,comma,numbers]{natbib}
10\usepackage{ctable}
11\usepackage{subfigure}
12
13\begin{document}
14
15\title{Code Performance of AstroBEAR2.0 on Stampede}
16\author{Adam Frank, Jonathan J. Carroll-Nellenback, Baowei Liu, Mart\'{i}n Huarte-Espinosa}
17
18\maketitle
19
20--- Will update by Baowei
21
22%Here we present strong scaling results for AstroBEAR. In Figure \ref{fig:diskKr}, we report scaling test results (with Disk module) on Kraken at NICS. Each compute node of Kraken has two six-core AMD Opterons, so we use $120$, $240$, $480$, $960$ and $1200$ cores. The resolution we used for these test are $128^{3} + 5$ level AMR which is same as the computation we are planning to do. The strong scaling test plot of the current code shows a slope $-0.848$ (Figure \ref{fig:diskKr}) while the slope for perfect scaling is $-1$. This shows AstroBEAR has an excellent scaling on Kraken. In Figure \ref{fig:jetSt}, we report scaling test results (with 3D Jet module) on Stampede at TACC. Each compute node of Stampede has two Intel E5 8-core (Sandy Bridge) processors and an Intel Xeon Phi 61-core (Knights Corner) coprocessor, so we use $128$, $256$, $512$, and $1024$ cores. The resolution we used for these test are $21\times105\times21 + 5$ level AMR which is same as the computation we are planning to do. The strong scaling test plot of the current code shows a slope $-0.101$ for Hydro jet and $-0.967$ for MHD jet (Figure \ref{fig:jetSt}) while the slope for perfect scaling is $-1$. This shows AstroBEAR has an excellent scaling on Stampede also.
23
24Here we present the strong and weak scaling results for AstrobEAR on Stampede. We optimized the code and implemented a news Subgrid algorithm which gives higher filling fraction ratio when doing high level AMR. This optimization improves the code behavior as shown in Figure (\ref{fig:CFstrong})
25\begin{figure}[ht]
26\centering
27\subfigure[Colliding Flow Module on Stampede]{
28\includegraphics[scale=0.45]{CFstrongOnStampede.eps}
29\label{fig:CFstrong}
30}
31%\subfigure[Colliding Flow Module with old Subgrid algorithm on Kraken]{
32%\includegraphics[scale=0.45]{strongScalingOnKraken.eps}
33%\label{fig:oldAlg}
34}
35\captionof{figure}{Current strong scaling behavior for AstroBEAR with Colliding Flows module on Stampdede. For these tests we run with the exact same resolution as our proposed production runs ($40^{3}$ + 5 levels AMR for Colliding Flows and $21\times105\times21$ + 5 levels AMR for Jet), but for much shorter final time (1\% of one frame while the production runs need 1000 frames) and without IOs. The current code with the Disk and Hydro and MHD Jet modules shows excellent scaling on Kraken and Stampede (with slope=$-0.848$, $-1.01$ and $-0.967$ respectively while the perfect scaling has a slope -1). }
36\end{figure}
37
38\end{document}