NonlinearSystem

The NonlinearSystem object holds the equation system created by the normal FEM process (e.g. the Matrix and RHS vector) to be solved. Normally MOOSE uses PETSc to store and solve this system. This object is where you will find the callback routines used by the PETSc solvers.

You may find some additional documentation relevant to both NonlinearSystem and NonlinearEigenSystem in NonlinearSystemBase.

Solving Non-linear Systems

Application of the finite element method converts the PDE(s) into a system of nonlinear equations, $R_i(u_h) = 0$, $i = 1, \ldots, N$, to solve for the coefficients $u_j$, $j = 1, \ldots, N$.

  • Newton's method has good convergence properties; we use it to solve this system of nonlinear equations.

  • Newton's method is a "root finding" method: it finds zeros of nonlinear equations.

  • Newton's Method in "Update Form" for finding roots of the scalar equation $f(x) = 0$, $f(x): \mathbb{R} \rightarrow \mathbb{R}$ reads: $f'(x_n)\,\delta x_{n+1} = -f(x_n)$, $x_{n+1} = x_n + \delta x_{n+1}$.

  • We don't have just one scalar equation: we have a system of nonlinear equations.

  • This leads to the following form of Newton's Method: $\mathbf{J}(\vec{u}_n)\,\delta\vec{u}_{n+1} = -\vec{R}(\vec{u}_n)$, $\vec{u}_{n+1} = \vec{u}_n + \delta\vec{u}_{n+1}$,

  • where $\mathbf{J}(\vec{u}_n)$ is the Jacobian matrix evaluated at the current iterate: $J_{ij}(\vec{u}_n) = \dfrac{\partial R_i(\vec{u}_n)}{\partial u_j}$.

  • Note that $\dfrac{\partial u_h}{\partial u_j} = \phi_j$ and $\dfrac{\partial \left(\nabla u_h\right)}{\partial u_j} = \nabla \phi_j$.

Jacobian Definition

An efficient Newton solve, e.g. one that requires few "nonlinear" iterations, requires an accurate Jacobian matrix or an accurate approximation of its action on a vector. When no explicit matrix is formed for the Jacobian and only its action on a vector is computed, the algorithm is commonly referred to as matrix-free (PETSc jargon) or Jacobian-free (MOOSE jargon). The default solve algorithm in MOOSE is PJFNK, or Preconditioned Jacobian-Free Newton-Krylov. "Krylov" refers to the iterative algorithm used to solve the linearized system that arises at each nonlinear iteration of the Newton algorithm. For more information on solving linear systems, please see Solving Linear Systems. Even if a Jacobian-free nonlinear algorithm is chosen, a good preconditioning matrix is typically still needed. Building that matrix can be accomplished automatically using automatic differentiation and/or manually with hand-coded Jacobians.
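For example, the nonlinear solve algorithm is selected with the solve_type parameter in the [Executioner] block of the input file (a minimal illustration):

```
[Executioner]
  type = Steady
  # default: assemble a matrix only to precondition the matrix-free solve
  solve_type = 'PJFNK'
  # alternative: assemble the Jacobian and use it directly in Newton's method
  # solve_type = 'NEWTON'
[]
```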

One can elect to sacrifice some computing speed and calculate Jacobians automatically using automatic differentiation (AD). MOOSE employs the DualNumber class from the MetaPhysicL package to enable AD. If an application developer wants to make use of AD, they should inherit from ADKernel instead of Kernel. Additionally, when coupling in variables, the adCoupled* methods should be used. For example, to retrieve a coupled value, adCoupledValue("v") should be used in the ADKernel constructor instead of coupledValue("v"); adCoupledGradient should replace coupledGradient, etc. An example of coupling in an AD variable can be found in ADCoupledConvection.C and ADCoupledConvection.h. Moreover, material properties that may depend on the nonlinear variables should be retrieved using getADMaterialProperty instead of getMaterialProperty, and they should be declared in materials using declareADProperty. Example AD material source and header files can be found here and here; example kernel source and header files that use AD material properties can be found here and here. The type central to AD computing objects is ADReal, which is defined in MooseTypes.
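As a rough sketch of this pattern (the class name ADExampleConvection and its coupled variable "v" are hypothetical, modeled on the ADCoupledConvection example), note that no computeQpJacobian method needs to be written:

```
#include "ADKernel.h"

// Hypothetical AD kernel: couples in the gradient of a variable "v" and
// relies on dual numbers to compute the Jacobian automatically.
class ADExampleConvection : public ADKernel
{
public:
  static InputParameters validParams();
  ADExampleConvection(const InputParameters & parameters);

protected:
  virtual ADReal computeQpResidual() override;

  // AD analogue of coupledGradient("v")
  const ADVariableGradient & _grad_v;
};

InputParameters
ADExampleConvection::validParams()
{
  InputParameters params = ADKernel::validParams();
  params.addRequiredCoupledVar("v", "The coupled velocity-like variable");
  return params;
}

ADExampleConvection::ADExampleConvection(const InputParameters & parameters)
  : ADKernel(parameters), _grad_v(adCoupledGradient("v"))
{
}

ADReal
ADExampleConvection::computeQpResidual()
{
  // The dual numbers in _grad_v and _grad_u carry derivative information,
  // so this single method provides both the residual and the Jacobian.
  return _test[_i][_qp] * _grad_v[_qp] * _grad_u[_qp];
}
```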

Traditional Hand-coded Jacobians

Finite element shape functions are introduced in the documentation section Shape Functions. There we outline how our primary variables are summations of those shape functions multiplied by constant coefficients, which are our degrees of freedom. At the end of Solving Non-linear Systems we gave an explicit illustration of how the derivative of a variable $u$ with respect to its $j$th degree of freedom ($u_j$) is equal to the $j$th shape function $\phi_j$. Similarly, the derivative of $\nabla u$ with respect to $u_j$ is equal to $\nabla \phi_j$. The code expression _phi[_j][_qp] represents $\phi_j$ in any MOOSE framework residual and Jacobian computing objects such as kernels and boundary conditions.

Any MOOSE kernel may have an arbitrary number of variables coupled into it. If these coupled variables use the same shape function family and order, then their associated $\phi_j$s will be equivalent. However, if $u$ and $v$ use different shape functions, then $\phi_{j,u} \neq \phi_{j,v}$. As a developer, however, you do not in most cases have to worry about these differences in $\phi_j$: MOOSE automatically updates the object member variable _phi to use the shape functions of the variable for which the Jacobian is currently being computed. However, if the primary variable $u$ is a scalar-valued (single-component) finite element variable and the coupled variable $v$ is a vector-valued (multi-component) finite element variable (or vice versa), then you must introduce an additional member variable to represent the shape functions of the vector-valued (scalar-valued) variable. The name of this variable is up to the developer, but we suggest a _standard_ prefix for scalar-valued finite element variables and a _vector_ prefix for vector-valued finite element variables. The _standard_ prefix is suggested over _scalar_ so as not to be confused with a MooseVariableScalar, which only has a single value over the entire spatial domain. An example constructor for a standard kernel that couples in a vector-valued FE variable is shown below:


```
EFieldAdvection::EFieldAdvection(const InputParameters & parameters)
  : Kernel(parameters),
    _efield_id(coupled("efield")),
    _efield(coupledVectorValue("efield")),
    _efield_var(*getVectorVar("efield", 0)),
    _vector_phi(_assembly.phi(_efield_var)),
    _mobility(getParam<Real>("mobility"))
{
}
```

The associated declarations are:


```
  const unsigned int _efield_id;
  const VectorVariableValue & _efield;
  VectorMooseVariable & _efield_var;
  const VectorVariablePhiValue & _vector_phi;
  const Real _mobility;
  Real _sgn;
```

The residual, on-diagonal Jacobian, and off-diagonal Jacobian methods are, respectively:


```
Real
EFieldAdvection::computeQpResidual()
{
  return -_grad_test[_i][_qp] * _sgn * _mobility * _efield[_qp] * _u[_qp];
}
```

and


```
Real
EFieldAdvection::computeQpJacobian()
{
  return -_grad_test[_i][_qp] * _sgn * _mobility * _efield[_qp] * _phi[_j][_qp];
}
```

and


```
Real
EFieldAdvection::computeQpOffDiagJacobian(unsigned int jvar)
{
  if (jvar == _efield_id)
    return -_grad_test[_i][_qp] * _sgn * _mobility * _vector_phi[_j][_qp] * _u[_qp];
  else
    return 0;
}
```

An example constructor for a vector kernel that couples in a scalar-valued FE variable is shown below:

```
VectorCoupledGradientTimeDerivative::VectorCoupledGradientTimeDerivative(
    const InputParameters & parameters)
  : VectorKernel(parameters),
    _grad_v_dot(coupledGradientDot("v")),
    _d_grad_v_dot_dv(coupledDotDu("v")),
    _v_id(coupled("v")),
    _v_var(*getVar("v", 0)),
    _standard_grad_phi(_assembly.gradPhi(_v_var))
{
}
```
The associated declarations are:

```
  const VariableGradient & _grad_v_dot;
  const VariableValue & _d_grad_v_dot_dv;
  const unsigned _v_id;
  MooseVariable & _v_var;
  const VariablePhiGradient & _standard_grad_phi;
```

Residual and off-diagonal Jacobian methods are, respectively:


```
Real
VectorCoupledGradientTimeDerivative::computeQpResidual()
{
  return _test[_i][_qp] * _grad_v_dot[_qp];
}
```

and


```
Real
VectorCoupledGradientTimeDerivative::computeQpOffDiagJacobian(unsigned jvar)
{
  if (jvar == _v_id)
    return _test[_i][_qp] * _d_grad_v_dot_dv[_qp] * _standard_grad_phi[_j][_qp];
  else
    return 0.;
}
```

Note (Flexibility):

Note that only one member is needed to represent the shape functions for standard MooseVariables and one for VectorMooseVariables. For example, if the vector variables v and w are coupled into a standard kernel for u, only a single _vector_phi member needs to be added; there is no need for both a _v_phi and a _w_phi. _vector_phi will be automatically updated to represent the shape functions of whichever vector variable the Jacobian is currently being computed for.

Newton for a Simple Equation

  • Consider the convection-diffusion equation with nonlinear $k$, $\vec{\beta}$, and $f$: $-\nabla \cdot k(u) \nabla u + \vec{\beta}(u) \cdot \nabla u = f(u)$.

  • The $i$th component of the residual vector is: $R_i(u_h) = \left(\nabla \psi_i, k(u_h) \nabla u_h\right) + \left(\psi_i, \vec{\beta}(u_h) \cdot \nabla u_h\right) - \left(\psi_i, f(u_h)\right)$.

  • Using the previously-defined rules for $\dfrac{\partial u_h}{\partial u_j}$ and $\dfrac{\partial \left(\nabla u_h\right)}{\partial u_j}$, the $(i,j)$ entry of the Jacobian is then: $J_{ij}(u_h) = \left(\nabla \psi_i, \dfrac{\partial k(u_h)}{\partial u_j} \nabla u_h\right) + \left(\nabla \psi_i, k(u_h) \nabla \phi_j\right) + \left(\psi_i, \dfrac{\partial \vec{\beta}(u_h)}{\partial u_j} \cdot \nabla u_h\right) + \left(\psi_i, \vec{\beta}(u_h) \cdot \nabla \phi_j\right) - \left(\psi_i, \dfrac{\partial f(u_h)}{\partial u_j}\right)$.

  • Note that even for this "simple" equation, the Jacobian entries are nontrivial: they depend on the partial derivatives of $k$, $\vec{\beta}$, and $f$, which may be difficult or time-consuming to compute analytically.

  • In a multiphysics setting with many coupled equations and complicated material properties, the Jacobian might be extremely difficult to determine.

Chain Rule

  • In the previous section, the term $\dfrac{\partial f(u_h)}{\partial u_j}$ was used, where $f$ was a nonlinear forcing function.

  • The chain rule allows us to write this term as $\dfrac{\partial f(u_h)}{\partial u_j} = \dfrac{\partial f}{\partial u_h} \dfrac{\partial u_h}{\partial u_j} = \dfrac{\partial f}{\partial u_h} \phi_j$.

  • If a functional form of $f$ is known, e.g. $f(u) = \sin(u)$, this formula implies that its Jacobian contribution is given by $-\left(\psi_i, \cos(u_h)\, \phi_j\right)$, matching the sign of the $-\left(\psi_i, f(u_h)\right)$ term in the residual above; a hand-coded version is sketched below.
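The following sketch shows what this contribution looks like in kernel code (the class SinForcingKernel is hypothetical; only the two relevant methods are shown, in the same excerpt style as the examples above, and std::sin/std::cos require <cmath>):

```
Real
SinForcingKernel::computeQpResidual()
{
  // -(psi_i, f(u_h)) with f(u) = sin(u)
  return -_test[_i][_qp] * std::sin(_u[_qp]);
}

Real
SinForcingKernel::computeQpJacobian()
{
  // chain rule: the derivative with respect to u_j is -(psi_i, cos(u_h) * phi_j)
  return -_test[_i][_qp] * std::cos(_u[_qp]) * _phi[_j][_qp];
}
```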

Jacobian-Free Newton-Krylov

  • $\mathbf{J}(\vec{u}_n)\,\delta \vec{u}_{n+1} = -\vec{R}(\vec{u}_n)$ is a linear system solved during each Newton step.

  • For simplicity, we can write this linear system as $\mathbf{A}\vec{x} = \vec{b}$, where $\mathbf{A} \equiv \mathbf{J}(\vec{u}_n)$, $\vec{x} \equiv \delta \vec{u}_{n+1}$, and $\vec{b} \equiv -\vec{R}(\vec{u}_n)$.

  • We employ an iterative Krylov method (e.g. GMRES) to produce a sequence of iterates $\vec{x}_k \rightarrow \vec{x}$, $k = 1, 2, \ldots$

  • $\mathbf{A}$ and $\vec{b}$ remain fixed during the iterative process.

  • The "linear residual" at step $k$ is defined as $\vec{\rho}_k \equiv \mathbf{A}\vec{x}_k - \vec{b}$.

  • MOOSE prints the norm of this vector, $\|\vec{\rho}_k\|$, at each iteration if you set print_linear_residuals = true in the Outputs block.

  • The "nonlinear residual" printed by MOOSE is $\|\vec{R}(\vec{u}_n)\|$.

  • By iterate $k$, the Krylov method has constructed the subspace $\mathcal{K}_k = \operatorname{span}\left\{\vec{b}, \mathbf{A}\vec{b}, \mathbf{A}^2\vec{b}, \ldots, \mathbf{A}^{k-1}\vec{b}\right\}$.

  • Different Krylov methods produce the iterates $\vec{x}_k$ in different ways:

  • Conjugate Gradients: $\vec{\rho}_k$ is orthogonal to $\mathcal{K}_k$.

  • GMRES/MINRES: $\vec{\rho}_k$ has minimum norm for $\vec{x}_k$ in $\mathcal{K}_k$.

  • Biconjugate Gradients: $\vec{\rho}_k$ is orthogonal to $\mathcal{K}_k(\mathbf{A}^T)$.

  • $\mathbf{J}$ is never explicitly needed to construct the subspace; only the action of $\mathbf{J}$ on a vector is required.

  • This action can be approximated by: $\mathbf{J}\vec{v} \approx \dfrac{\vec{R}(\vec{u} + \epsilon \vec{v}) - \vec{R}(\vec{u})}{\epsilon}$.

  • This form has many advantages: no need to compute analytic derivatives to form $\mathbf{J}$, no time needed to assemble $\mathbf{J}$ (just residual computations), and no space needed to store $\mathbf{J}$. A standalone sketch of this matrix-free product follows.
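To make the approximation concrete, here is a small self-contained sketch (plain C++, not the MOOSE API; all names are illustrative) of the matrix-free Jacobian action:

```
#include <cstddef>
#include <functional>
#include <vector>

// Approximate the Jacobian action J*v ~ (R(u + eps*v) - R(u)) / eps
// using only residual evaluations, as in Jacobian-free Newton-Krylov.
std::vector<double>
jacobianAction(const std::function<std::vector<double>(const std::vector<double> &)> & R,
               const std::vector<double> & u,
               const std::vector<double> & v,
               const double eps = 1e-8)
{
  std::vector<double> perturbed(u.size());
  for (std::size_t i = 0; i < u.size(); ++i)
    perturbed[i] = u[i] + eps * v[i];

  const std::vector<double> Ru = R(u);            // R(u), often already available
  const std::vector<double> Rueps = R(perturbed); // one extra residual evaluation

  std::vector<double> Jv(u.size());
  for (std::size_t i = 0; i < u.size(); ++i)
    Jv[i] = (Rueps[i] - Ru[i]) / eps;
  return Jv;
}
```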

Solving Linear Systems

You will commonly hear of two ways to solve an implicit linear system of equations: directly or iteratively. A typical direct solver performs an LU factorization. Direct solves are a great tool for solving small to medium sized systems; however, they are extremely expensive when applied to large-scale problems. To solve large-scale systems, iterative methods must be used. The most successful iterative methods are Krylov methods, which work by finding a solution to $\mathbf{A}\vec{x} = \vec{b}$ within a space called the Krylov subspace, spanned by images of $\vec{b}$ under powers of $\mathbf{A}$. Two of the most used Krylov algorithms are Conjugate Gradient and GMRES. Conjugate Gradient generally only works for symmetric positive-definite matrices. Because of its greater flexibility, GMRES is the default linear solution algorithm in PETSc and consequently in MOOSE.

Augmenting Sparsity

One of the callback routines mentioned above is NonlinearSystemBase::augmentSparsity, which, as its name suggests, augments the sparsity pattern of the matrix. Currently this method adds sparsity coming from MOOSE Constraint objects. It does this by querying geometric connectivity information between secondary and primary boundary pairs, and then querying the DofMap attached to the NonlinearSystemBase (through the libMesh NonlinearImplicitSystem) for the dof indices that exist on the elements attached to the secondary/primary nodes. The geometric connectivity information comes from NearestNodeLocators held by GeometricSearchData objects in the FEProblemBase and DisplacedProblem (the latter only if there are mesh displacements). In the future, sparsity augmentation from constraints will occur through RelationshipManagers rather than through the augmentSparsity method.

Computing Residual and Jacobian Together

The default behavior in MOOSE is to have separate functions compute the residual and Jacobian. However, with the advent of automatic differentiation (AD) it can make sense to use a single function to compute the residual and Jacobian simultaneously. At the local residual object level, AD already computes the residual and Jacobian simultaneously, with the dual number at the core of AD holding both the value (residual) and the derivatives (Jacobian). Simultaneous evaluation of the residual and Jacobian using a single function can be triggered by setting "residual_and_jacobian_together" to true. What this does in the background is funnel the (generally AD) computed local residuals and Jacobians into the global residual vector and Jacobian matrix, respectively, when PETSc calls the libMesh/MOOSE residual/function evaluation routine. Then, when PETSc calls the libMesh/MOOSE Jacobian evaluation routine, we simply return because the global matrix has already been computed.
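For example (assuming, as with the other solver options shown in this document, that the parameter is set in the [Executioner] block):

```
[Executioner]
  type = Transient
  solve_type = 'NEWTON'
  residual_and_jacobian_together = true
[]
```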

Computing the residual and Jacobian together has shown 20% gains for Navier-Stokes finite volume simulations in which automatic differentiation is leveraged even during standard residual evaluations. Computing the residual and Jacobian together may also be advantageous in simulations with quite a few nonlinear iterations per timestep, for which the cost of an additional Jacobian evaluation during the final residual evaluation is amortized. Simulations in which material property calculations are very expensive may be good candidates as well.

Reusing Preconditioners

Unpreconditioned GMRES and other iterative methods can converge very slowly. To improve convergence, PETSc and other iterative solver packages apply a preconditioner to the system of equations/sparse matrix before applying the iterative solver.

A great number of preconditioners exist, but multigrid methods are often among the best choices for problems without significant hyperbolic character. The HYPRE package, specifically the BoomerAMG preconditioner, is often a good choice for a preconditioner to condition the system of equations resulting from the MOOSE simulation.

A direct factorization of the sparse system of equations makes a very good preconditioner: an iterative method using the factorized matrix as a preconditioner will typically converge to machine precision in a single iteration. However, as noted above, factorizing the sparse system of equations for a large simulation is computationally expensive.

One option is to form a preconditioner once and then reuse it to solve the linearized system many times. The preconditioner can be carried over through nonlinear iterations and even across time steps. MOOSE allows the user to do this with the reuse_preconditioner flag. Setting


```
  reuse_preconditioner = true
  reuse_preconditioner_max_linear_its = 20
```

in the [Executioner] block will reuse the same preconditioner until the number of linear iterations required to solve the linearized system of equations exceeds 20. If the number of linear iterations exceeds reuse_preconditioner_max_linear_its, the solver does not immediately stop iterating on the current linearized system. Instead, it continues until it either successfully solves the current system or reaches l_max_its, and then forms a new preconditioner for the next nonlinear iteration.

Using these parameters in combination with a direct factorization of the system can be very efficient. The following is an example of how to direct PETSc and MOOSE to solve the equations with this combination:


```
  petsc_options_iname = '-pc_type -pc_factor_mat_solver_package -ksp_type'
  petsc_options_value = 'lu superlu_dist gmres'

  reuse_preconditioner = true
  reuse_preconditioner_max_linear_its = 20
```

This solver strategy can be very effective when the system Jacobian does not change very much from nonlinear iteration to nonlinear iteration and/or from time step to time step. The heuristic is also most effective when the cost of solving the linearized system is a large fraction of the total simulation time. As such, it can be especially beneficial when using an expensive preconditioner, like a direct solver, as shown in this example.

There are two differences between reuse_preconditioner and setting up preconditioner reuse directly in PETSc with the -snes_lag_preconditioner_persists and -snes_lag_preconditioner options:

  1. -snes_lag_preconditioner X will recalculate a new preconditioner every X linear iterations, regardless of the progress of the linear solve. reuse_preconditioner_max_linear_its = X will continue to reuse the same preconditioner until the number of linear iterations required to solve the linearized equations exceeds X.

  2. By default libMesh deletes the PETSc SNES instance after each time step. This means that, regardless of how the reuse options are set, the solver cannot retain the preconditioner across time steps. reuse_preconditioner alters this behavior to retain the SNES instance so that preconditioner reuse can be carried across time steps.

Preconditioner reuse is also different from modified Newton methods, which can be configured with the PETSc -snes_lag_jacobian and -snes_lag_jacobian_persists options. Preconditioner reuse affects how PETSc solves the linearized system of equations formed at each nonlinear iteration. Ideally, if the reused preconditioner achieves the requested l_tol precision before iterating more than l_max_its times, preconditioner reuse will not affect the convergence of the nonlinear iterations compared to a case with the reuse option off. As described above, preconditioner reuse aims to decrease the time required to solve the linearized equations at each nonlinear iteration by reducing the number of times the solver needs to setup the potentially-expensive linear preconditioner.

By contrast, modified Newton methods will affect the nonlinear convergence of the system without affecting how PETSc solves the linearized system of equations. The goal of modified Newton methods is to reduce the time required to solve the nonlinear equations by forming a new Jacobian matrix less often.
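For reference, a modified Newton setup might lag Jacobian recomputation with PETSc options along these lines (the lag value of 2 is illustrative, rebuilding the Jacobian every second nonlinear iteration):

```
  petsc_options_iname = '-snes_lag_jacobian'
  petsc_options_value = '2'
```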

Put another way, preconditioner reuse aims to speed up solving the linear system of equations while modified Newton methods aim to accelerate solving the nonlinear equations.