3.3 The Inverse Function Theorem

Assume that \(S\) is an open subset of \(\R^{n+k}\) and that \(\mathbf F:S\to \R^k\) is a function of class \(C^1\). Assume also that \((\mathbf a, \mathbf b)\) is a point in \(S\) such that \[ \mathbf F(\mathbf a, \mathbf b) = {\bf 0} \qquad\text{ and } \qquad \det D_\mathbf y \mathbf F(\mathbf a, \mathbf b) \ne 0. \]

(i). There exist \(r_0,r_1>0\) such that for every \(\mathbf x\in \R^n\) with \(|\mathbf x-\mathbf a|< r_0\), there exists a unique \(\mathbf y\in \R^k\) with \(|\mathbf y - \mathbf b|< r_1\) such that \[\begin{equation}\label{ImFT.eq1} \mathbf F(\mathbf x, \mathbf y) = \bf0. \end{equation}\] In other words, equation \(\eqref{ImFT.eq1}\) implicitly defines a function \(\mathbf y = \mathbf f(\mathbf x)\) for \(\mathbf x\in \R^n\) near \(\mathbf a\), with \(\mathbf y = \mathbf f(\mathbf x)\) close to \(\mathbf b\). Note in particular that \(\mathbf f(\mathbf a) = \mathbf b\).

(ii). Moreover, the function \(\mathbf f:B(r_0, \mathbf a)\to B(r_1,\mathbf b)\subset \R^k\) from part (i) above is of class \(C^1\), and its derivatives may be determined by implicit differentiation.

Assume that \(\mathbf F\) satisfies the hypotheses of the Implicit Function Theorem, and define \(\mathbf G:U\to \R^{n+k}\) by \[ G(\mathbf x, \mathbf y) = (\mathbf x, \mathbf F(\mathbf x, \mathbf y)). \] Recalling that all vectors are column vectors by default, this means that \[\begin{equation}\label{G.def} G(\mathbf x, \mathbf y) = \left( \begin{array}{c} x_1\\\ \vdots\\\ x_n\\\ F_1(\mathbf x, \mathbf y)\\\ \vdots\\\ F_k(\mathbf x, \mathbf y) \end{array} \right) . \end{equation}\]

Claim 1. \(\det D\mathbf G(\mathbf a, \mathbf b) = \det D_y\mathbf F(\mathbf a, \mathbf b)\).

This is a linear algebra exercise. Note that \[ D\mathbf G \ = \ \left( \begin{array}{ccccccc} 1&0&\cdots&0&0&\cdots &0\\\ 0&1&\cdots&0&0&\cdots &0\\\ \vdots&\vdots&\ddots&0&0&\cdots &0\\\ 0&0&\cdots&1&0&\cdots &0\\\ \partial_{x_1}F_1&\partial_2 F_1&\cdots&\partial_{x_n}F_1&\partial_{y_1}F_1&\cdots &\partial_{y_k}F_1 \\\ \vdots &\vdots&\cdots&\vdots&\vdots&\cdots &\vdots\\\ \partial_{x_1}F_k&\partial_2 F_k&\cdots&\partial_{x_n}F_k&\partial_{y_1}F_k&\cdots &\partial_{y_k}F_k \\\ \end{array} \right) . \]

Here \(D\mathbf G\) denotes the \((n+k)\times (n+k)\) matrix of derivatives of all components of \(\mathbf G\) with respect to all variables \(x_1,\ldots, x_n, y_1,\ldots, y_k\). If you recall block matrices from linear algebra, you can see why Claim 1 is true. If you want to write a detailed proof, induction on \(n\) is an option.

Thus our assumption \(\det D_y\mathbf F(\mathbf a, \mathbf b)\ne 0\) implies that \(\det D\mathbf G(\mathbf a, \mathbf b) \ne 0\). Therefore according to the Inverse Function Theorem, there are open sets \(M\subset U\) and \(N\subset \R^{n+k}\) such that \((\mathbf a, \mathbf b)\in M\) and \(\mathbf G:M\to N\) is invertible, with inverse of class \(C^1\).

Next, for \(\mathbf x\) such that \((\mathbf x, {\bf 0})\in N\), define \(\mathbf f(\mathbf x)\) by \[ \mathbf G^{-1}(\mathbf x, {\bf 0}) = (\mathbf x, \mathbf f(\mathbf x)) = \left( \begin{array}{c} x_1\\\ \vdots\\\ x_n\\\ f_1(\mathbf x) \\\ \vdots\\\ f_k(\mathbf x) \end{array} \right) . \] This \(\mathbf f\) turns out to be the implicit function whose existence we are trying to prove. Its definition says that \[ \mathbf y = \mathbf f(\mathbf x) \qquad \iff\qquad (\mathbf x,\mathbf y)\in M\text{ and }G(\mathbf x, \mathbf y) = (\mathbf x , {\bf 0}). \] And of course \(\mathbf G(\mathbf x, \mathbf y) = (\mathbf x, {\bf 0 })\) is equivalent to \(\mathbf F(\mathbf x, \mathbf y) = \bf 0\).

Since \(\mathbf G^{-1}\) is \(C^1\), the same is true of \(\mathbf f\).

To complete the proof, it is still necessary to worry about a few details related to the choice of \(r_0,r_1\) in the conclusion of the theorem, but what we have said above is the main point.

3.3 Transformations and the Inverse Function Theorem

Transformations

Visualizing Transformations

Example 1.

Example 2.

Important Coordinate Systems

Polar coordinates in \(\R^2\)

Spherical coordinates in \(\R^3\)

Cylindrical coordinates in \(\R^3\)

The Inverse Function Theorem

The proof of the Inverse Function Theorem

Implicit Function Theorem as a corollary

Problems

Basic

Advanced