We present a highly optimized thread-safe lattice Boltzmann model in which the non-equilibrium part of the distribution function is locally reconstructed via recursivity of Hermite polynomials. Such a procedure allows the explicit incorporation of non-equilibrium moments of the distribution up to the order supported by the lattice. Thus, the proposed approach increases accuracy and stability at low viscosities without compromising performance and amenability to parallelization with respect to standard lattice Boltzmann models. The high-order thread-safe lattice Boltzmann is tested on two types of turbulent flows, namely, the turbulent channel flow at Re-tau=180 and the axisymmetric turbulent jet at Re = 7000; it delivers results in excellent agreement with reference data [direct numerical simulations (DNS), theory, and experiments] and (a) achieves peak performance [ similar to 5x10(12) floating point operations (FLOP) per second and an arithmetic intensity of similar to 7 FLOP/byte on a single graphic processing unit] by significantly reducing the memory footprint, (b) retains the algorithmic simplicity of standard lattice Boltzmann computing, and (c) allows to perform stable simulations at vanishingly low viscosities. Our findings open attractive prospects for high-performance simulations of realistic turbulent flows on GPU-based architectures. Such expectations are confirmed by excellent agreement among lattice Boltzmann, experimental, and DNS reference data.
Montessori, A., La Rocca, M., Amati, G., Lauricella, M., Tiribocchi, A., Succi, S. (2024). High-order thread-safe lattice Boltzmann model for high performance computing turbulent flow simulations. PHYSICS OF FLUIDS, 36(3) [10.1063/5.0202155].
High-order thread-safe lattice Boltzmann model for high performance computing turbulent flow simulations
Montessori, Andrea
;La Rocca, Michele;
2024-01-01
Abstract
We present a highly optimized thread-safe lattice Boltzmann model in which the non-equilibrium part of the distribution function is locally reconstructed via recursivity of Hermite polynomials. Such a procedure allows the explicit incorporation of non-equilibrium moments of the distribution up to the order supported by the lattice. Thus, the proposed approach increases accuracy and stability at low viscosities without compromising performance and amenability to parallelization with respect to standard lattice Boltzmann models. The high-order thread-safe lattice Boltzmann is tested on two types of turbulent flows, namely, the turbulent channel flow at Re-tau=180 and the axisymmetric turbulent jet at Re = 7000; it delivers results in excellent agreement with reference data [direct numerical simulations (DNS), theory, and experiments] and (a) achieves peak performance [ similar to 5x10(12) floating point operations (FLOP) per second and an arithmetic intensity of similar to 7 FLOP/byte on a single graphic processing unit] by significantly reducing the memory footprint, (b) retains the algorithmic simplicity of standard lattice Boltzmann computing, and (c) allows to perform stable simulations at vanishingly low viscosities. Our findings open attractive prospects for high-performance simulations of realistic turbulent flows on GPU-based architectures. Such expectations are confirmed by excellent agreement among lattice Boltzmann, experimental, and DNS reference data.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.