Preconditioner strategies

1. Relaxation

Split into lower, diagonal, upper parts: A=L+D+U.

1.1. Jacobi

Cheapest preconditioner: P1=D1.

# sequential
pc-type=jacobi
# parallel
pc-type=block_jacobi
bash

1.2. Successive over-relaxation (SOR)

(L+1ωD)xn+1=[(1ω1)DU]xn+ωbP1=k iterations starting with x0=0
  • Implemented as a sweep.

  • ω=1 corresponds to Gauss-Seidel.

  • Very effective at removing high-frequency components of residual.

# sequential
pc-type=sor
bash

2. Factorization

Two phases

  • symbolic factorization: find where fill occurs, only uses sparsity pattern.

  • numeric factorization: compute factors.

2.1. LU decomposition

  • preconditioner.

  • Expensive, for m×m sparse matrix with bandwidth b, traditionally requires O(mb2) time and O(mb) space.

    • Bandwidth scales as md1d in d-dimensions.

    • Optimal in 2D: O(mlogm) space, O(m3/2) time.

    • Optimal in 3D: O(m4/3) space, O(m2) time.

  • Symbolic factorization is problematic in parallel.

2.2. Incomplete LU

  • Allow a limited number of levels of fill: ILU(k).

  • Only allow fill for entries that exceed threshold: ILUT.

  • Usually poor scaling in parallel.

  • No guarantees.

3. 1-level Domain decomposition

Domain size $$L$$, subdomain size $$H$$, element size $$h$$
  • Overlapping/Schwarz

    • Solve Dirichlet problems on overlapping subdomains.

    • No overlap: itsO(LHh).

    • Overlap δ:its(LHδ).

  • Neumann-Neumann

    • Solve Neumann problems on non-overlapping subdomains.

    • itsO(LH(1+logHh)).

    • Tricky null space issues (floating subdomains).

    • Need subdomain matrices, not globally assembled matrix.

Multilevel variants knock off the leading LH.
Both overlapping and nonoverlapping with this bound.
  • BDDC and FETI-DP

    • Neumann problems on subdomains with coarse grid correction.

    • itsO(1+logHh).

4. Multigrid

4.1. Introduction

Hierarchy: Interpolation and restriction operators Π:XcoarseXfineΠ:XfineXcoarse

  • Geometric: define problem on multiple levels, use grid to compute hierarchy.

  • Algebraic: define problem only on finest level, use matrix structure to build hierarchy.

Galerkin approximation

Assemble this matrix: Acoarse=ΠAfineΠ

Application of multigrid preconditioner (V-cycle)

  • Apply pre-smoother on fine level (any preconditioner).

  • Restrict residual to coarse level with Π.

  • Solve on coarse level Acoarsex=r.

  • Interpolate result back to fine level with \Pi^\uparrow.

  • Apply post-smoother on fine level (any preconditioner).

4.2. Multigrid convergence properties

  • Textbook: P1A is spectrally equivalent to identity

    • Constant number of iterations to converge up to discretization error.

  • Most theory applies to SPD systems

    • variable coefficients (e.g. discontinuous): low energy interpolants.

    • mesh- and/or physics-induced anisotropy: semi-coarsening/line smoothers.

    • complex geometry: difficult to have meaningful coarse levels.

  • Deeper algorithmic difficulties

    • nonsymmetric (e.g. advection, shallow water, Euler).

    • indefinite (e.g. incompressible flow, Helmholtz).

  • Performance considerations

    • Aggressive coarsening is critical in parallel.

    • Most theory uses SOR smoothers, ILU often more robust.

    • Coarsest level usually solved semi-redundantly with direct solver.

  • Multilevel Schwarz is essentially the same with different language

    • assume strong smoothers, emphasize aggressive coarsening.