Keywords :
Domain decomposition methods; Helmholtz equation; High-performance scientific computing; Optimized restricted additive Schwarz; Optimized Schwarz method; Additive Schwarz; Domain-decomposition methods; Helmholtz problems; Helmholtz's equations; High performance scientific computing; Large-scales; Multiple source; Nonoverlapping domain decomposition; Optimized restricted additive schwarz; Optimized Schwarz methods; Numerical Analysis; Modeling and Simulation; Physics and Astronomy (miscellaneous); Physics and Astronomy (all); Computer Science Applications; Computational Mathematics; Applied Mathematics
Abstract :
[en] Solving large-scale Helmholtz problems discretized with high-order finite elements is notoriously difficult, especially in 3D where direct factorization of the system matrix is very expensive and memory demanding, and robust convergence of iterative methods is difficult to obtain. Domain decomposition methods (DDM) constitute one of the most promising strategies so far, by combining direct and iterative approaches: using direct solvers on overlapping or non-overlapping subdomains, as a preconditioner for a Krylov subspace method on the original Helmholtz system or as an iterative solver on a substructured problem involving field values or Lagrange multipliers on the interfaces between the subdomains. In this work we compare the computational performance of non-overlapping substructured DDM and Optimized Restricted Additive Schwarz (ORAS) preconditioners for solving large-scale Helmholtz problems with multiple sources, as is encountered, e.g., in frequency-domain Full Waveform Inversion. We show on a realistic geophysical test-case that, when appropriately tuned, the non-overlapping methods can reduce the convergence gap sufficiently to significantly outperform the overlapping methods.
Funding text :
The first author gratefully acknowledges financial support from the Fonds de la Recherche Scientifique - FNRS (Belgium) through a FRIA doctoral fellowship. The research was funded in part by the Walloon Region through the Win2Wal EXPANSION project (Grant No. 2010161 ). Computational resources have been provided by the Consortium des Equipements de Calcul Intensif (CECI), funded by the Fonds de la Recherche Scientifique de Belgique (F.R.S.-FNRS) under Grant No. 2.5020.11 and by the Walloon Region. The present research benefited from computational resources made available on Lucia, the Tier-1 supercomputer of the Walloon Region, infrastructure funded by the Walloon Region under the grant agreement n 1910247. We acknowledge LUMI-BE LUMI-BE is joint effort from BELSPO (federal), SPW Economie, Emploi, Recherche (Wallonia), Department of Economy, Science & Innovation (Flanders) and Innoviris (Brussels). for awarding this project access to the LUMI supercomputer, owned by the EuroHPC Joint Undertaking, hosted by CSC (Finland) and the LUMI consortium through a LUMI-BE Regular Access call.
Scopus citations®
without self-citations
0