1.3. Classification of equations

$\renewcommand{\Re}{\operatorname{Re}}$ $\renewcommand{\Im}{\operatorname{Im}}$ $\newcommand{\erf}{\operatorname{erf}}$ $\newcommand{\dag}{\dagger}$ $\newcommand{\const}{\mathrm{const}}$

1.3. Classification of equations

  1. Linear and nonlinear equations
  2. Elliptic, hyperbolic and parabolic equations
  3. Scope of this Textbook

Linear and nonlinear equations

Equations of the form \begin{equation} Lu = f(\mathbf{x}) \label{eq-1.3.1} \end{equation} where $Lu$ is a partial differential expression linear with respect to unknown function $u$ is called linear equation (or linear system). This equation is linear homogeneous equation if $f=0$ and linear inhomogeneous equation otherwise. For example, \begin{equation} Lu := a_{11} u_{xx} + 2a_{12} u_{xy}+a_{22} u_{yy}+ a_1u_x+a_2 u_y +a u = f(\mathbf{x}) \label{eq-1.3.2} \end{equation} is linear; if all coefficients $a_{jk}$, $a_j$, $a$ are constant, we call it linear equation with constant coefficients; otherwise we talk about variable coefficients.

Otherwise equation is called nonlinear. However there is a more subtle classification of such equations. Equations of the type (\ref{eq-1.3.1}) where the right-hand expression $f$ depend on the lower-order derivatives are called semilinear, equations where both coefficients and right-hand expression depend on the lower-order derivatives are called quasilinear. For example \begin{equation} Lu := a_{11}(x,y) u_{xx} + 2a_{12}(x,y) u_{xy}+a_{22}(x,y) u_{yy} = f(x,y,u,u_x,u_y) \label{eq-1.3.3} \end{equation} is semilinear, and \begin{multline} Lu := a_{11}(x,y,u,u_x,u_y) u_{xx} + 2a_{12}(x,y,u,u_x,u_y) u_{xy}+a_{22}(x,y,u,u_x,u_y) u_{yy} \\= f(x,y,u,u_x,u_y)\qquad \label{eq-1.3.4} \end{multline} is quasilinear, while \begin{equation} F(x,y,u,u_x,u_y,u_{xx},u_{xy},u_{yx}) =0 \label{eq-1.3.5} \end{equation} is general nonlinear.

Elliptic, hyperbolic and parabolic equations


Consider second order equation (\ref{eq-1.3.2}): \begin{equation} Lu := \sum_{1\le i,j\le n} a_{ij} u_{x_ix_j} + \text{l.o.t.} = f(\mathbf{x}) \label{eq-1.3.6} \end{equation} where l.o.t. means lower order terms with $a_{ij}=a_{ji}$. Let us change variables $\mathbf{x}=\mathbf{x}(\mathbf{x}')$. Then the matrix of principal coefficients $A=\begin{pmatrix} a_{11} & \dots &a_{1n}\\ \vdots & \ddots &\vdots \\ a_{n1}&\dots & a_{nn}\end{pmatrix}$ in the new coordinate system becomes $A'= Q^* A Q$ where $Q=T^{*\,-1}$ and $T=\left( \frac{\partial x_i}{\partial x'_j}\right)_{i,j=1,\dots,n}$ is a Jacobi matrix. The proof easily follows from the chain rule (Calculus II).

Therefore if the principal coefficients are real and constant, by a linear change of variables matrix of the principal coefficients could be reduced to the diagonal form, where diagonal elements could be either $1$, or $-1$ or $0$. Multiplying equation by $-1$ if needed we can assume that there are at least as many $1$ as $-1$. In particular as $n=2$ the principal part becomes either $u_{xx}+u_{yy}$, or $u_{xx}-u_{yy}$, or $u_{xx}$ and such equations are called elliptic, hyperbolic, and parabolic respectively (there will be always second derivative as otherwise it would be the first order equation).

This terminology comes from the curves of the second order conical sections: if $a_{11}a_{22}-a_{12}^2>0$ equation $a_{11}\xi ^2+2a_{12}\xi\eta + a_{22}\eta ^2+a_1\xi +a_2\eta =c$ generically defines an ellipse, if $a_{11}a_{22}-a_{12}^2<0$ this equation generically defines a hyperbole and if $a_{11}a_{22}-a_{12}^2=0$ it defines a parabole.

Let us consider equations in different dimensions:


If we consider only $2$-nd order equations with constant real coefficients then in appropriate coordinates they will look like either \begin{equation} u_{xx}+u_{yy}+\text{l.o.t} =f \label{eq-1.3.7} \end{equation} or \begin{equation} u_{xx}-u_{yy}+\text{l.o.t.} =f, \label{eq-1.3.8} \end{equation} where l.o.t. nean lower order terms, and we call such equations elliptic and hyperbolic respectively.

What to do if one of the 2-nd derivatives is missing? We get parabolic equations \begin{equation} u_{xx}-cu_{y}+\text{l.o.t.} =f. \label{eq-1.3.9} \end{equation} with $c\ne 0$ (we do not consider $cu_y$ as a lower order term here) and IVP $u|_{y=0}=g$ is well-posed in the direction of $y>0$ if $c>0$ and in direction $y<0$ if $c<0$. We can dismiss $c=0$ as not-interesting.

However this classification leaves out very important Schrödinger equation \begin{equation} u_{xx} +i c u_y=0 \label{eq-1.3.10} \end{equation} with real $c\ne 0$. For it IVP $u|_{y=0}=g$ is well-posed in both directions $y>0$ and $y<0$ but it lacks many properties of parabolic equations (like maximum principle or mollification; still it has interesting properties on its own).


Again, if we consider only $2$-nd order equations with constant real coefficients then in appropriate coordinates they will look like either \begin{equation} u_{xx}+u_{yy}+u_{zz}+\text{l.o.t} =f \label{eq-1.3.11} \end{equation} or \begin{equation} u_{xx}+u_{yy}-u_{zz}+\text{l.o.t.} =f. \label{eq-1.3.12} \end{equation} and we call such equations elliptic and hyperbolic respectively.

Also we get parabolic equations like \begin{equation} u_{xx}+u_{yy}-cu_z+\text{l.o.t.} =f. \label{eq-1.3.13} \end{equation} What about \begin{equation} u_{xx}-u_{yy}-cu_z+\text{l.o.t.} =f? \label{eq-1.3.14} \end{equation} Algebraist-formalist would call it parabolic-hyperbolic but since this equation exhibits no interesting analytic properties (unless one considers lack of such properties interesting; in particular, IVP is ill-posed in both directions) it would be a perversion.

Yes, there will be Schrödinger equation \begin{equation} u_{xx} +u_{yy}+i c u_z=0 \label{eq-1.3.15} \end{equation} with real $c\ne 0$ but $u_{xx} -u_{yy}+i c u_z=0$ would also have IVP $u|_{z=0}=g$ well-posed in both directions.


Here we would get also elliptic \begin{equation} u_{xx}+u_{yy}+u_{zz}+u_{tt}+\text{l.o.t.} =f, \label{eq-1.3.16} \end{equation} hyperbolic \begin{equation} u_{xx}+u_{yy}+u_{zz}-u_{tt}+\text{l.o.t.} =f, \label{eq-1.3.17} \end{equation} but also ultrahyperbolic \begin{equation} u_{xx}+u_{yy}-u_{zz}-u_{tt}+\text{l.o.t.} =f \label{eq-1.3.18} \end{equation} which exhibits some interesting analytic properties but these equations are way less important than elliptic, hyperbolic or parabolic.

Parabolic and Schrödinger will be here as well.

Remark 1. The notions of elliptic, hyperbolic or parabolic equations are generalized to higher dimensions (trivially) and to higher-order equations but most of the randomly written equations do not belong to any of these types and there is no reason to classify them.

There is no complete classifications of PDEs and cannot be because any reasonable classification should not be based on how equation looks like but on the reasonable analytic properties it exhibits (which IVP or BVP are well-posed etc).

Equations of the variable type

To make things even more complicated there are equations changing types from point to point, f.e. Tricomi equation \begin{equation} u_{xx}+xu_{yy}=0 \label{eq-1.3.19} \end{equation} which is elliptic as $x>0$ and hyperbolic as $x<0$ and at $x=0$ has a "parabolic degeneration". It is a toy-model describing stationary transsonic flow of gas. These equations are called equations of the variable type (aka mixed equations).

Our purpose was not to give exact definitions but to explain a situation.

Scope of this Textbook

$\Leftarrow$  $\Uparrow$  $\Rightarrow$