Tree-based mesh-refinement GPU-accelerated tsunami  simulator for real-time operation

Arce Acuña, Marlon; Aoki, Takayuki

doi:https://doi.org/10.5194/nhess-18-2561-2018

Articles | Volume 18, issue 9

https://doi.org/10.5194/nhess-18-2561-2018

Articles | Volume 18, issue 9

Research article

21 Sep 2018

Research article |

| 21 Sep 2018

Tree-based mesh-refinement GPU-accelerated tsunami simulator for real-time operation

Marlon Arce Acuña and Takayuki Aoki

Abstract

This paper presents a fast and accurate tsunami real-time operational model to compute across ocean-wide simulations completely on GPU (graphics processing unit). The spherical shallow water equations are solved using the method of characteristics and upwind cubic interpolation to provide high accuracy and stability. A customized, user interactive, tree-based mesh-refinement method is implemented based on distance from the coast and focal areas to generate a memory-efficient domain with resolutions of up to 50 m. Three specialized and optimized GPU kernels (Wet, Wall and Inundation) are developed to compute the domain block mesh. Multi-GPU is used to further speed up the computation, and a weighted Hilbert space-filling curve is used to produce a balanced workload. Hindcasting of the 2004 Indonesian tsunami is presented to validate and compare the agreement of the arrival times and main peaks at several gauges. Inundation maps are also produced for Kamala and Hambantota to validate the accuracy of our model. Test runs on three Tesla P100 cards on Tsubame 3.0 could fully simulate 10 h in just under 10 min wall-clock time.

How to cite

How to cite.

Dates

Received: 20 Oct 2017 – Discussion started: 10 Nov 2017 – Revised: 16 Jul 2018 – Accepted: 13 Aug 2018 – Published: 21 Sep 2018

1 Introduction

The turn of the 21st century showed us, as never before, the reality of the terrible and devastating damage and death that tsunamis can cause. In 2004, a massive earthquake off Sumatra Island of magnitude M_w=9.0 on the Richter scale triggered a tsunami with deadly consequences. According to the World Health Organization, the death toll for these events exceeds 200 000 (WHO, 2014) in several countries spread along the Indian Ocean. Not much later in 2011, a tsunami triggered by a M_w=9.0 earthquake on the east coast of Japan in the Tohoku region produced yet another disaster. Over 15 000 people died from these events, with massive destruction of port and city infrastructure, housing, and telecommunications. Additionally, the subsequent nuclear crisis was due to the tsunami-induced damage of several reactors in the Fukushima nuclear power plant (Motoki and Toshihiro, 2012).

These events highlight the importance of developing accurate and fast tsunami-forecasting models. For several decades, efforts have been made to develop such models. These can be classified into two main groups: depth-average (i) hydrostatic and (ii) non-hydrostatic long wave equations. Hydrostatic models for the shallow water equations (SWEs) started by solving their linear form based on finite difference methods (FDMs), following the work of Hansen (1956) and Fischer (1959) in the 1950s. The TUNAMI (Tohoku University's Numerical Analysis Model for Investigation) (Imamura et al., 1995) came from these initial steps but solved the shallow water equations in a nonlinear form instead, formulated them in a flux-conservative way for mass conservation and also introduced a discharge computation (Imamura, 1996) for the elevation near the shoreline. In a very similar manner, the ALASKA-tectonic (GI'-T) and Landslide models (GI'-L) were introduced, which solved the nonlinear shallow water and used leapfrog FDM (Nicolsky et al., 2011) similar to TUNAMI. Later came MOST (Method of Splitting Tsunami) (Titov and Synolakis, 1995), an extensively used model for tsunami simulation that tried to incorporate the effect of dispersion during simulations (Burwell et al., 2007). It was original because it introduced a function to add points in the shoreline to improve tracking. Recently, MOST has been ported for GPU computing (Vazhenin et al., 2013). A more recent model is GeoClaw, which implements a unique approach to deal with the issue of transferring fluid kinematics throughout nested grids by refining specified cells during simulation to improve resolution in those areas (Berger and LeVeque, 1998). More recent models incorporate a real-time application such as RIFT (Real-Time Inundation Forecasting of Tsunamis) (Wang et al., 2012). Like several of the previous models, a leapfrog scheme is also used for these real-time models, and a linear SWE is solved in certain areas for lighter computation. COMCOT (Cornell Multi-grid Coupled Tsunami Model) from Cornell University is another example using this approach (Liu, 1998). EasyWave is another model (Babeyko, 2017) which employs linear approximations to improve speed and employs a leapfrog scheme as its numerical scheme. The latest version of EasyWave introduced GPU to accelerate parts of the existing CPU code. More recently, GPU-based models have been developed, like NAMI DANCE (Zaytsev et al., 2006) in its latest version. Additionally, a better known GPU model, Tsunami-HySEA (Macías et al., 2017), has been extensively tested and is currently used by the Centro di Allerta Tsunami (CAT) in Italy.

In order to include the effect of pressure, since the 1990s, some models have taken the direction of solving non-hydrostatic models using the depth-integrated Boussinesq equations (BEs) instead of the SWEs for tsunami propagation. Initial efforts considered them to be weak nonlinear models (Peregrine, 1967); however, models for nonlinear equations were also developed not long after (for instance, Nwogu, 1993; Lynett et al., 2002). Solving the Boussinesq equation is, in general, more computationally demanding than solving the SWEs and in order to reduce the computational time, some techniques have been implemented, such as using parallel clusters or introducing nested grids. An example of this is FUNWAVE-TVD (Shi et al., 2012), which is an extended version of FUNWAVE, a run-up and propagation model based on fully nonlinear and dispersive Boussinesq equations (Wei et al., 1995). FUNWAVE introduced a nested-grid method, and its later version was fully parallelized using MPI-FORTRAN. A well-known non-hydrostatic model which also implements two-way grid nesting is NEOWAVE (Non-hydrostatic Evolution of Ocean WAVE; Yamazaki et al., 2011). Another one of these models is BOSZ (Roeber and Cheung, 2012), which combines the dispersive effect from the BEs with the shock-capturing ability of the nonlinear SWEs. BOSZ is mainly used for nearshore simulation, since it is based on Cartesian coordinates and not suited for large areas. Additionally, it does not implement nested grids.

Recently, efforts to solve the modeling equations in three dimensions have been made as well. Although these models tend to capture difficult coastlines very well and can include multiple fluids or even materials, the computation cost is still so great that it makes it only possible to apply them effectively in small areas and it is not viable for transoceanic propagations. Some examples are SELFEs (semi-implicit Eulerian–Lagrangian finite elements; Zhang and Baptista, 2008; Abadie et al., 2010, 2012; Horrillo et al., 2013).

In this work, we present a new approach for a tsunami operational model that retains a high degree of the complexities of the physics involved and delivers a fast and accurate simulation. This speed also enables real-time operation: a user can start forecasting simultaneously as a tsunami event occurs. Results are generated faster than real time. The main goal is to accomplish a wide-area, ocean-size computation in short time while using resources efficiently. Our model, referred to hereinafter as TRITON-G (Tsunami Refinement and Inundation Real-Time Operational Numerical Model for GPU), implements a full-GPU computing approach for the whole tsunami model, composed of generation, propagation and inundation. Specialized kernels are developed for each part of the tsunami computation, and multi-GPU is used for further acceleration. Load balance is obtained using a weighted Hilbert space-filling curve. TRITON-G solves the nonlinear spherical shallow water equations across the entire domain to preserve the complexity of the propagation and the effects near the coastline. The method of characteristics with directional splitting and a third-order interpolation semi-Lagrangian numerical scheme is used to solve the governing equations. This allows for high accuracy and minimizes effects of numerical dispersion and diffusion while also giving the ability to choose a larger time step compared to using a Runge–Kutta scheme and at the same time permits a light stencil suitable for fast computation. We implement a tree-based block refinement to generate a computational mesh that is flexible, light and can track complex coastlines. Customized refinements by distance and focal area were developed, which permitting an efficient use of memory and computational resources. In a collaborative project with RIMES (Regional Integrated Multi-Hazard Early Warning System, 2017), we utilize their existing databases for bathymetry and fault sources where available and successfully deployed TRITON-G as their tsunami forecast operational model.

This article is organized as follows. A review of the governing equations is given in Sect. 2. The numerical method and boundaries are explained in Sect. 3. In Sect. 4, a description of tree-based refinement and its customization is given. The topography and bathymetry used are also described. GPU and parallel computing are covered in Sect. 5. In Sect. 6, we present comparison results with a known benchmark inundation problem. In Sect. 7, we present several numerical results including TRITON-G validation with existing tsunami propagation data and run-up measurements. Section 8 presents the conclusions of this study. Results from several standard inundation benchmark problems are included in the “Appendix”.

2 Governing equations

The spherical nonlinear shallow water equations (SSWEs) are used to compute the tsunami propagation. In small, specific areas where inundation needs to be computed, the Cartesian coordinate version of the SWEs are solved instead (see Toro 2010). The SSWE (Williamson et al., 1992; Swarztrauber et al., 1997) can be written as

\frac{\partial h}{\partial t} + \frac{1}{a \cos θ} \frac{\partial}{\partial λ} (h u) + \frac{1}{a} \frac{\partial}{\partial θ} (h v) - \frac{h v}{a} \tan θ = 0,

\begin{array}{l} \frac{\partial h u}{\partial t} & + \frac{1}{a \cos θ} \frac{\partial}{\partial λ} (h u^{2} + \frac{g}{2} h^{2}) + \frac{1}{a} \frac{\partial h u v}{\partial θ} - \frac{h v}{a} \tan θ \\ - (f + \frac{u}{a} \tan θ) h v + \frac{g h}{a \cos θ} \frac{\partial z}{\partial λ} + τ_{λ} = 0, \end{array}

\begin{array}{l} \frac{\partial h v}{\partial t} & + \frac{1}{a \cos θ} \frac{\partial h v u}{\partial λ} + \frac{1}{a} \frac{\partial}{\partial θ} (h v^{2} + \frac{g}{2} h^{2}) - \frac{h v^{2}}{a} \tan θ \\ (1) & + (f + \frac{u}{a} \tan θ) h u + \frac{g h}{a} \frac{\partial z}{\partial θ} + τ_{θ} = 0, \end{array}

where λ stands for the longitude coordinate, θ for the latitude coordinate, h is the water depth, hu and hv are the momentum in longitude and latitude, respectively, with corresponding velocities u and v, g is gravity, a is the radius of the Earth, z is the bathymetry (submarine topography), f is the Coriolis force defined as f=2Ωsin θ with Ω being the rotation rate of the Earth and τ is the bottom friction term. The bottom friction is determined using the Manning formula:

\begin{array}{l} τ_{λ} = \frac{g n^{2}}{h^{7 / 3}} h u \sqrt{(h u)^{2} + (h v)^{2}}, \\ (2) & τ_{θ} = \frac{g n^{2}}{h^{7 / 3}} h v \sqrt{(h u)^{2} + (h v)^{2}}, \end{array}

where n is the Manning's roughness coefficient. The default value used for n is 0.025 across all domains except for specific areas where more detailed values in the coastline are given in a database. The parameters used in this work are $a = 6.37122 \times 10^{6}$ [m], $Ω = 7.292 \times 10^{- 5}$ [s⁻¹] and g=9.81 [m s⁻²].

3 Numerical methods and boundary conditions

3.1 Methods of characteristics for SSWEs

The SSWEs are solved using the method of characteristics (MOC). A method developed in the 1960s, explained in detail by Rusanov (1963). MOC is applied to reduce hyperbolic partial differential equations, such as the SSWEs, to a family of ordinary differential equations. A traditional approach when using MOC is to introduce a dimensional splitting (Nakamura et al., 2001) in the 2-dimensional equations to create a smaller stencil and lighter computation. A numerical scheme is regarded as well-balanced, or satisfying the C-property (Bermúdez and Vázquez, 1994) if it preserves steady states at rest, for instance, the undisturbed surface of lake. When the fluid is at rest, i.e., $u (x, t) = 0$ then the constant water height H defined as $H (x, t) = h (x, t) + z (x)$ represents a steady state that should hold in time and not produce spurious oscillations (LeVeque, 1998). In order to make the model well-balanced, the SSWEs are solved for H during the simulation to guarantee this steady state. The original variable h is simply obtained back from the expression $h = H - z$ .

In order to apply the method of characteristics, first the SSWEs Eq. (1) are rewritten in vector form as

\begin{matrix} (3) & \frac{\partial U}{\partial t} + A \frac{\partial U}{\partial λ} + B \frac{\partial U}{\partial θ} + S = 0 \end{matrix}

with

\begin{array}{l} U = [\begin{array}{c} h \\ h u \\ h v \end{array}] \\ A = \frac{1}{a \cos θ} [\begin{array}{lll} 0 & 1 & 0 \\ Γ^{2} - u^{2} & 2 u & 0 \\ - u v & v & u \end{array}] \\ B = \frac{1}{a} [\begin{array}{lll} 0 & 0 & 1 \\ - u v & v & u \\ Γ^{2} - v^{2} & 0 & 2 v \end{array}] \\ S = [\begin{array}{l} \frac{- h v \tan θ}{a} \\ - (f + \frac{u}{a} \tan θ) h v - \frac{h u v}{a} \tan θ + \frac{g h}{a \cos θ} \frac{\partial z}{\partial λ} \\ (f + \frac{u}{a} \tan θ) h u - \frac{h v^{2}}{a} \tan θ + \frac{g h}{a} \frac{\partial z}{\partial θ} \end{array}], \end{array}

where $Γ \equiv \sqrt{g h}$ . Using the directional splitting technique on Eq. (1), three equations are produced: an equation for each coordinate (longitude λ and latitude θ) and a third for the source term S. The latter equation simply represents an ordinary partial differential equation for the source term while, Eqs. (4) and (10) for the coordinates are in advection form. These last two equations are written in diagonal form in order to find the Riemann invariants and characteristics curves; a detailed description of this procedure can be found in Ogata and Takashi (2004) or Stoker (1992). The equation for the longitude coordinate λ given by

\begin{matrix} (4) & \frac{\partial U}{\partial t} + A \frac{\partial U}{\partial λ} = 0 \end{matrix}

has eigenvalues Λ given by

\begin{matrix} (5) & Λ_{\pm}^{λ} = \frac{1}{a \cos θ} (u + Γ), Λ_{3}^{λ} = \frac{1}{a \cos θ} u, \end{matrix}

which inserted in the diagonal form of Eq. (4) leads to

\begin{matrix} (6) & \frac{D^{\pm}}{D t} (Γ \pm \frac{u}{2}) = 0, \end{matrix}

where D∕Dt represents the material derivative. Equation (6) means that the solution at a given grid point i is determined from two characteristic curves along C⁺ and C⁻ (Fig. 1). The result at a time n+1 can be found by adding and subtracting the expressions in Eq. (6) respectively to obtain

\begin{array}{l} (7) & Γ_{i}^{n + 1} = \frac{1}{2} \{Γ^{+} + Γ^{-} + \frac{1}{2} (u^{+} - u^{-})\} \\ (8) & u_{i}^{n + 1} = \frac{1}{2} \{u^{+} + u^{-} + 2 (Γ^{+} + Γ^{-})\}, \end{array}

where Γ^± and u^± are the values at a time n, at positions which might not necessarily lie on a grid point. An interpolation is applied in order to determine these values, and with them solve Eqs. (7) and (8).

Following a similar procedure as Yabe and Aoki (1991), Yabe et al. (2001) and Utsumi et al. (1997), we utilize a cubic-polynomial approximation on the grid profile to find the interpolated values. The polynomial is defined as

\begin{matrix} (9) & F (λ) = a λ^{3} + b λ^{2} + c λ + d \end{matrix}

with

\begin{array}{l} u Δ t > 0 \{\begin{cases} a = \frac{f_{i + 1} - 3 f_{i} + 3 f_{i - 1} - f_{i - 2}}{6 Δ λ^{3}} \\ b = \frac{f_{i + 1} - 2 f_{i} + f_{i - 1}}{2 Δ λ^{2}} \\ c = \frac{2 f_{i + 1} + 3 f_{i} - 6 f_{i - 1} + f_{i - 2}}{6 Δ λ} \\ d = f_{i} \end{cases} \\ u Δ t \leq 0 \{\begin{cases} a = \frac{f_{i + 2} - 3 f_{i + 1} + 3 f_{i} - f_{i - 1}}{6 Δ λ^{3}} \\ b = \frac{f_{i + 1} - 2 f_{i} + f_{i - 1}}{2 Δ λ^{2}} \\ c = \frac{- f_{i + 2} + 6 f_{i + 1} - 3 f_{i} - 2 f_{i - 1}}{6 Δ λ} \\ d = f_{i} \end{cases} . \end{array}

A similar analysis can be made for the latitude equation θ obtained from the splitting method, given by

\begin{matrix} (10) & \frac{\partial U}{\partial t} + B \frac{\partial U}{\partial θ} = 0 \end{matrix}

with analogous results for the eigenvalues and curves

\begin{array}{l} (11) & Λ_{\pm}^{θ} = \frac{1}{a} (v + Γ), Λ_{3}^{θ} = \frac{1}{a} v, \\ (12) & \frac{D^{\pm}}{D t} (Γ \pm \frac{v}{2}) = 0 . \end{array}

From which similar expressions as Eqs. (7) and (8) can be found in order to estimate the values for h and hv.

https://www.nat-hazards-earth-syst-sci.net/18/2561/2018/nhess-18-2561-2018-f01

Figure 1Space–time diagram showing the characteristic curves C^± where black points represent the grid points, white points represent the values Γ^± and u^± at time n to be interpolated to find Γⁿ⁺¹ and uⁿ⁺¹.

Tree-based mesh-refinement GPU-accelerated tsunami simulator for real-time operation

3.1 Methods of characteristics for SSWEs

3.2 Run-up calculation

3.3 Tsunami source model

3.4 Boundary conditions

4.1 Customized mesh generation

4.2 Halo exchange

4.3 Topography and bathymetry

5.1 SSWE GPU kernels

5.1.1 Halo update on GPU

5.1.2 Specialized kernel types

5.2 Space-filling curve and multi-GPU

5.3 Variables and rendering output

5.3.1 Subcycling implementation

5.3.2 Runtime performance

6.1 Benchmark problem no. 9: Okushiri Island tsunami – field

6.1.1 Problem setup

6.1.2 Tasks to be performed

6.1.3 Numerical results

Run-up around Aonae

Arrival of first wave to Aonae

Two waves arriving at Aonae

Tide gauge comparison at Iwanai and Esashi

Maximum run-up around Okushiri

Run-up height at Hamatsumae

Run-up height at a valley north of Monai

7.1 Indonesian 2004 tsunami hindcast

7.1.1 Tide gauge comparison

7.1.2 Inundation map comparison

A1 Problem setup

A1.1 Tasks to be performed

A1.2 Numerical results

A2 Benchmark problem no. 6: solitary wave on a conical island – laboratory

A2.1 Tasks to be performed

A2.2 Problem setup

A2.3 Numerical results

A3 Benchmark problem no. 7: the tsunami run-up onto a complex 3-D beach – laboratory

A3.1 Problem setup

A3.2 Tasks to be performed

A3.3 Numerical results