跳到主要内容

1 Limits and Continuity of Multivariate Functions

1.1 Examples of Multivariate Functions

Multivariate Function

An mm-variable function is a mapping f:DRf : D \to \mathbb{R}, where DD is Rm\mathbb{R}^m or a subset of it, and the function value yy and independent variables x1,x2,,xmx_1, x_2, \dots, x_m satisfy y=f(x1,x2,,xm)y = f(x_1, x_2, \dots, x_m).

Multivariate Mapping

The values of multiple variables y1,,yny_1, \dots, y_n are determined by a set of independent variables x1,,xmx_1, \dots, x_m:

(y1yn)=(f1(x1,,xm)fn(x1,,xm)),i.e.,{y1=f1(x1,,xm),yn=fn(x1,,xm).\begin{equation*} \begin{pmatrix} y_1 \\ \vdots \\ y_n \end{pmatrix} = \begin{pmatrix} f_1(x_1, \dots, x_m) \\ \vdots \\ f_n(x_1, \dots, x_m) \end{pmatrix}, \quad \text{i.e.,} \quad \begin{cases} y_1 = f_1(x_1, \dots, x_m), \\ \vdots \\ y_n = f_n(x_1, \dots, x_m). \end{cases} \end{equation*}

1.2 Distance and Limits of Sequences in Rm\mathbb{R}^m

Distance on Rm\mathbb{R}^m

A bivariate function d:Rm×RmRd : \mathbb{R}^m \times \mathbb{R}^m \to \mathbb{R} is called a distance on Rm\mathbb{R}^m if it satisfies:

  1. For any x,yRm\mathbf{x}, \mathbf{y} \in \mathbb{R}^m, d(x,y)=d(y,x)0d(\mathbf{x}, \mathbf{y}) = d(\mathbf{y}, \mathbf{x}) \ge 0;

  2. For any x,yRm\mathbf{x}, \mathbf{y} \in \mathbb{R}^m, d(x,y)=0d(\mathbf{x}, \mathbf{y}) = 0 if and only if x=y\mathbf{x} = \mathbf{y};

  3. For any x,y,zRm\mathbf{x}, \mathbf{y}, \mathbf{z} \in \mathbb{R}^m, d(x,z)d(x,y)+d(y,z)d(\mathbf{x}, \mathbf{z}) \le d(\mathbf{x}, \mathbf{y}) + d(\mathbf{y}, \mathbf{z}).

  • For r>0r > 0, denote

    B(x,r):={yRmd(x,y)<r},\begin{equation*} B(\mathbf{x}, r) := \{\mathbf{y} \in \mathbb{R}^m \mid d(\mathbf{x}, \mathbf{y}) < r\}, \end{equation*}

    which is called the open ball centered at x\mathbf{x} with radius rr.

  • A distance dd on Rm\mathbb{R}^m is said to be translation-invariant if for any x,y,zRm\mathbf{x}, \mathbf{y}, \mathbf{z} \in \mathbb{R}^m,

d(x+z,y+z)=d(x,y).\begin{equation*} d(\mathbf{x} + \mathbf{z}, \mathbf{y} + \mathbf{z}) = d(\mathbf{x}, \mathbf{y}). \end{equation*}

Norm on Rm\mathbb{R}^m

A function :RmR\|\cdot\| : \mathbb{R}^m \to \mathbb{R} is called a norm on Rm\mathbb{R}^m if it satisfies:

  1. For any x,yRm\mathbf{x}, \mathbf{y} \in \mathbb{R}^m, x+yx+y\|\mathbf{x} + \mathbf{y}\| \le \|\mathbf{x}\| + \|\mathbf{y}\|;
  2. For any xRm\mathbf{x} \in \mathbb{R}^m, x0\|\mathbf{x}\| \ge 0; x=0\|\mathbf{x}\| = 0 if and only if x=0\mathbf{x} = \mathbf{0};
  3. For any xRm\mathbf{x} \in \mathbb{R}^m and any λR\lambda \in \mathbb{R}, λx=λx\|\lambda \mathbf{x}\| = |\lambda| \|\mathbf{x}\|.

Matrix Norm

Given a norm \|\cdot\| in Rn\mathbb{R}^n and a square matrix AA of order nn, the matrix norm of AA is defined as:

A=maxv=1Av.\begin{align*} \|A\| = \max_{\|\mathbf{v}\|=1} \|A\mathbf{v}\|. \end{align*}

Also

A=maxv0Avv.\begin{align*} \|A\| = \max_{\mathbf{v}\neq 0} \frac{\|A\mathbf{v}\|}{\|\mathbf{v}\|}. \end{align*}

Theorem 1.1.

  1. For any matrix AA, the norm is always non-negative, and it is zero if and only if the matrix itself is the zero matrix.
A0,andA=0    A=0\begin{align*} \|A\| \geq 0, \quad \text{and} \quad \|A\| = 0 \iff A = \mathbf{0} \end{align*}
  1. Multiplying a matrix by a scalar α\alpha scales the norm by the absolute value of that scalar.
αA=αA\begin{align*} \|\alpha A\| = |\alpha| \cdot \|A\| \end{align*}
  1. The norm of the sum of two matrices is less than or equal to the sum of their individual norms.
A+BA+B\begin{align*} \|A + B\| \leq \|A\| + \|B\| \end{align*}
  1. For induced norms, the norm of a product of matrices is bounded by the product of their norms.
ABAB\begin{align*} \|AB\| \leq \|A\| \cdot \|B\| \end{align*}
  1. By the very definition of the induced norm, the transformation of any vector v\mathbf{v} by matrix AA is bounded by the matrix norm:
AvAv\begin{align*} \|A\mathbf{v}\| \leq \|A\| \cdot \|\mathbf{v}\| \end{align*}

Inner Product on Rm\mathbb{R}^m

A bivariate function ,:Rm×RmR\langle \cdot, \cdot \rangle : \mathbb{R}^m \times \mathbb{R}^m \to \mathbb{R} is called an inner product on Rm\mathbb{R}^m if it satisfies:

  1. For any x,yRm\mathbf{x}, \mathbf{y} \in \mathbb{R}^m, x,y=y,x\langle \mathbf{x}, \mathbf{y} \rangle = \langle \mathbf{y}, \mathbf{x} \rangle;
  2. For any x,y,zRm\mathbf{x}, \mathbf{y}, \mathbf{z} \in \mathbb{R}^m and any α,βR\alpha, \beta \in \mathbb{R}, αx+βy,z=αx,z+βy,z;\begin{equation*} \langle \alpha \mathbf{x} + \beta \mathbf{y}, \mathbf{z} \rangle = \alpha \langle \mathbf{x}, \mathbf{z} \rangle + \beta \langle \mathbf{y}, \mathbf{z} \rangle; \end{equation*}
  3. For any xRm\mathbf{x} \in \mathbb{R}^m, x,x0\langle \mathbf{x}, \mathbf{x} \rangle \ge 0; and x,x=0\langle \mathbf{x}, \mathbf{x} \rangle = 0 if and only if x=0\mathbf{x} = \mathbf{0}.

Definition. For any p1p \ge 1,

xp:=(x1p++xmp)1p\begin{equation*} \|\mathbf{x}\|_p := (|x^1|^p + \dots + |x^m|^p)^{\frac{1}{p}} \end{equation*}

defines a norm on Rm\mathbb{R}^m ,which induces a distance on Rm\mathbb{R}^m:

dp(x,y):=(x1y1p++xmymp)1p.\begin{equation*} d_p(\mathbf{x}, \mathbf{y}) := (|x^1 - y^1|^p + \dots + |x^m - y^m|^p)^{\frac{1}{p}}. \end{equation*}

Theorem 1.2.

  1. x:=max{x1,,xm}\|\mathbf{x}\|_\infty := \max\{|x^1|, \dots, |x^m|\}.

  2. xxpm1px\|\mathbf{x}\|_\infty \le \|\mathbf{x}\|_p \le m^{\frac{1}{p}} \|\mathbf{x}\|_\infty.

  3. 1m1qxqxpm1pxq\frac{1}{m^{\frac{1}{q}}} \|\mathbf{x}\|_q \le \|\mathbf{x}\|_p \le m^{\frac{1}{p}} \|\mathbf{x}\|_q.

Corollary 1. Any norm \|\cdot\| on Rm\mathbb{R}^m is equivalent to the norm \|\cdot\|_\infty,

Limits of Sequences in Rm\mathbb{R}^m

Let {xn}n1\{\mathbf{x}_n\}_{n \ge 1} be a sequence in Rm\mathbb{R}^m.

  1. The sequence {xn}n1\{\mathbf{x}_n\}_{n \ge 1} is said to be bounded if there exists M>0M > 0 such that for any positive integer nn, xnM\|\mathbf{x}_n\| \le M.

  2. The sequence {xn}n1\{\mathbf{x}_n\}_{n \ge 1} is said to be convergent if there exists ARmA \in \mathbb{R}^m such that for any ε>0\varepsilon > 0, there exists a positive integer N(ε)>0N(\varepsilon) > 0 such that for any positive integer nn, whenever nN(ε)n \ge N(\varepsilon), we have xnA<ε\|\mathbf{x}_n - A\| < \varepsilon. Such an AA is called a limit of the sequence {xn}n1\{\mathbf{x}_n\}_{n \ge 1}. We denote A=limn+xnA = \lim\limits_{n \to +\infty} \mathbf{x}_n.

  3. The sequence {xn}n1\{\mathbf{x}_n\}_{n \ge 1} is said to be a Cauchy sequence if for any ε>0\varepsilon > 0, there exists a positive integer N(ε)>0N(\varepsilon) > 0 such that for any positive integers p,qp, q, whenever p,qN(ε)p, q \ge N(\varepsilon), we have xpxq<ε\|\mathbf{x}_p - \mathbf{x}_q\| < \varepsilon.

Theorem 1.3. Let {xn}\{\mathbf{x}_n\} be a sequence in Rm\mathbb{R}^m, where xn=(xn1,,xnm)T\mathbf{x}_n = (x_n^1, \dots, x_n^m)^T. Then:

  1. {xn}n1\{\mathbf{x}_n\}_{n \ge 1} is a bounded sequence if and only if for any 1km1 \le k \le m, the sequence {xnk}n1\{x_n^k\}_{n \ge 1} is a bounded sequence.
  2. {xn}n1\{\mathbf{x}_n\}_{n \ge 1} is a Cauchy sequence if and only if for any 1km1 \le k \le m, the sequence {xnk}n1\{x_n^k\}_{n \ge 1} is a Cauchy sequence.
  3. limn+xn=A\lim\limits_{n \to +\infty} \mathbf{x}_n = A if and only if for any 1km1 \le k \le m, limn+xnk=Ak\lim\limits_{n \to +\infty} x_n^k = A_k.

Theorem 1.4.

  1. A sequence in Rm\mathbb{R}^m is convergent if and only if it is a Cauchy sequence.
  2. Any bounded sequence in Rm\mathbb{R}^m has a convergent subsequence.

Closed Set

A set ERmE \subseteq \mathbb{R}^m is said to be a closed set if: for any sequence {xn}n1\{\mathbf{x}_n\}_{n \ge 1} in EE, if the sequence has a limit limn+xn\lim\limits_{n \to +\infty} \mathbf{x}_n, then this limit is also in EE.

1.3 Continuity of Multivariable Functions

Continuity of Multivariable Function

Let ERmE \subseteq \mathbb{R}^m. A mapping f:ERpf: E \to \mathbb{R}^p is said to be continuous at x0E\mathbf{x}_0 \in E, if for any ϵ>0\epsilon > 0, there exists a δ>0\delta > 0 such that for any xE\mathbf{x} \in E, as long as

xx0<δ\begin{align*} \|\mathbf{x} - \mathbf{x}_0\| < \delta \end{align*}

the following inequality holds:

f(x)f(x0)<ϵ\begin{align*} \|f(\mathbf{x}) - f(\mathbf{x}_0)\|_* < \epsilon \end{align*}

Here, \|\cdot\| and \|\cdot\|_* denote the norms in the domain space and the codomain space of ff, respectively.

  • ff is said to be continuous on E0EE_0 \subseteq E, if ff is continuous at every point x0E0\mathbf{x}_0 \in E_0.
  • ff is called a continuous mapping, if it is continuous on its entire domain EE.

Theorem 1.5. Any linear mapping on Rm\mathbb{R}^m is continuous.

Theorem 1.6. Any bilinear mapping B:Rm×RnRpB: \mathbb{R}^m \times \mathbb{R}^n \to \mathbb{R}^p, i.e., α1,α2,β1,β2R,x1,x2,xRm,y1,y2,yRn\forall \alpha_1, \alpha_2, \beta_1, \beta_2 \in \mathbb{R}, \mathbf{x}_1, \mathbf{x}_2, \mathbf{x} \in \mathbb{R}^m, \mathbf{y}_1, \mathbf{y}_2, \mathbf{y} \in \mathbb{R}^n

B(α1x1+α2x2,y)=α1B(x1,y)+α2B(x2,y),B(x,β1y1+β2y2)=β1B(x,y1)+β2B(x,y2),\begin{align*} B(\alpha_1 \mathbf{x}_1 + \alpha_2 \mathbf{x}_2, \mathbf{y}) &= \alpha_1 B(\mathbf{x}_1, \mathbf{y}) + \alpha_2 B(\mathbf{x}_2, \mathbf{y}), \\ B(\mathbf{x}, \beta_1 \mathbf{y}_1 + \beta_2 \mathbf{y}_2) &= \beta_1 B(\mathbf{x}, \mathbf{y}_1) + \beta_2 B(\mathbf{x}, \mathbf{y}_2), \end{align*}

is continuous.

Corollary 1. Any multilinear mapping is a continuous mapping.

Theorem 1.7. Vector addition, scalar multiplication, and the inner product in Rm\mathbb{R}^m, as well as real number multiplication, are all continuous mappings.

Theorem 1.8. Let ERmE \subseteq \mathbb{R}^m, FRpF \subseteq \mathbb{R}^p, and f:ERpf: E \to \mathbb{R}^p be continuous at x0E\mathbf{x}_0 \in E. If g:FRqg: F \to \mathbb{R}^q is continuous at y0=f(x0)F\mathbf{y}_0 = f(\mathbf{x}_0) \in F, then the composite mapping

gf:Ef1(F)Rq\begin{align*} g \circ f: E \cap f^{-1}(F) \to \mathbb{R}^q \end{align*}

is continuous at x0\mathbf{x}_0.

Theorem 1.9. The determinant is a continuous function of the matrix; therefore, small perturbations of an invertible matrix remain invertible.

Theorem 1.10. Let ERmE \subseteq \mathbb{R}^m and f:ERpf: E \to \mathbb{R}^p be a continuous mapping. Then:

  1. For any bounded closed subset KK of EE, the image f(K)f(K) is also a bounded closed set.

  2. If p=1p = 1, then for any non-empty bounded closed subset KK of EE, ff attains both a maximum value and a minimum value on KK.

Theorem 1.11. Let ,\langle \cdot, \cdot \rangle be an inner product on Rn\mathbb{R}^n, and let AA be a symmetric matrix of order nn, i.e.,

Ax,y=x,Ay,x,yRn.\begin{align*} \langle A\mathbf{x}, \mathbf{y} \rangle = \langle \mathbf{x}, A\mathbf{y} \rangle, \quad \forall \mathbf{x}, \mathbf{y} \in \mathbb{R}^n. \end{align*}

There exist nn orthonormal unit vectors v1,,vn\mathbf{v}_1, \dots, \mathbf{v}_n in Rn\mathbb{R}^n and real numbers λ1,,λn\lambda_1, \dots, \lambda_n such that

Avk=λkvk,k=1,,n.\begin{align*} A\mathbf{v}_k = \lambda_k \mathbf{v}_k, \quad k = 1, \dots, n. \end{align*}

Theorem 1.12. Let ERmE \subseteq \mathbb{R}^m and f:ERpf: E \to \mathbb{R}^p be a continuous mapping. For any bounded closed subset KK of EE, ff is uniformly continuous on KK.

This means that for any ϵ>0\epsilon > 0, there exists a δ>0\delta > 0 such that for any xK\mathbf{x} \in K and any yE\mathbf{y} \in E, as long as

xy<δ\begin{align*} \|\mathbf{x} - \mathbf{y}\| < \delta \end{align*}

the following holds:

f(x)f(y)<ϵ\begin{align*} \|f(\mathbf{x}) - f(\mathbf{y})\|_* < \epsilon \end{align*}

Here, \|\cdot\| and \|\cdot\|_* denote the norms in the domain space and the codomain space of ff, respectively.

Path

A set ERmE \subseteq \mathbb{R}^m is called a path-connected set if, for any x,yE\mathbf{x}, \mathbf{y} \in E, there exists a continuous mapping f:[0,1]Rmf : [0, 1] \to \mathbb{R}^m such that

f(0)=x,f(1)=y,and f(t)E for all t[0,1].\begin{align*} f(0) = \mathbf{x}, \quad f(1) = \mathbf{y}, \quad \text{and } f(t) \in E \text{ for all } t \in [0, 1]. \end{align*}

Such a continuous mapping ff is called a path from x\mathbf{x} to y\mathbf{y} in EE.

1.4 Limits of Multivariable Functions

Limits of Multivariable Function

Let ERmE \subseteq \mathbb{R}^m. A point x0Rm\mathbf{x}_0 \in \mathbb{R}^m is called a cluster point (or accumulation point) of EE if, for any ϵ>0\epsilon > 0, there exists xE\mathbf{x} \in E such that

0<d(x,x0)<ϵ\begin{align*} 0 < d(\mathbf{x}, \mathbf{x}_0) < \epsilon \end{align*}

If x0\mathbf{x}_0 is both a cluster point of ERmE \subseteq \mathbb{R}^m and a cluster point of its complement Ec=RmEE^c = \mathbb{R}^m \setminus E (meaning every neighborhood of x0\mathbf{x}_0 contains points from both EE and RmE\mathbb{R}^m \setminus E), then x0\mathbf{x}_0 is called a boundary point of EE. The set of all boundary points of EE is denoted as E\partial E and is called the boundary set of EE.

Let EE be a subset of the domain of mapping ff, and let x0Rm\mathbf{x}_0 \in \mathbb{R}^m be a cluster point of EE. We say that ARp\mathbf{A} \in \mathbb{R}^p is the limit of f(x)f(\mathbf{x}) as xE\mathbf{x} \in E approaches x0\mathbf{x}_0, denoted as

limExx0f(x)=A\begin{align*} \lim_{E \ni \mathbf{x} \to \mathbf{x}_0} f(\mathbf{x}) = \mathbf{A} \end{align*}

if for any ϵ>0\epsilon > 0, there exists a δ>0\delta > 0 such that for any xE\mathbf{x} \in E, as long as 0<d(x,x0)<δ0 < d(\mathbf{x}, \mathbf{x}_0) < \delta, we have

d2(f(x),A)<ϵ\begin{align*} d_2(f(\mathbf{x}), \mathbf{A}) < \epsilon \end{align*}

Here, d2d_2 is the distance in the codomain space of ff.

  • When EE is the entire domain of mapping ff, limExx0f(x)\lim\limits_{E \ni \mathbf{x} \to \mathbf{x}_0} f(\mathbf{x}) is simply written as limxx0f(x)\lim\limits_{\mathbf{x} \to \mathbf{x}_0} f(\mathbf{x}).

Theorem 1.13. Let EE be the domain of the composite function gfg \circ f, with limExx0f(x)=y0\lim\limits_{E \ni \mathbf{x} \to \mathbf{x}_0} f(\mathbf{x}) = \mathbf{y}_0 and limyy0g(y)=A\lim\limits_{\mathbf{y} \to \mathbf{y}_0} g(\mathbf{y}) = A. Define

g~(y)={g(y),yy0;A,y=y0.\begin{align*} \tilde{g}(\mathbf{y}) = \begin{cases} g(\mathbf{y}), & \mathbf{y} \neq \mathbf{y}_0; \\ A, & \mathbf{y} = \mathbf{y}_0. \end{cases} \end{align*}

Then limxx0g~(f(x))=A\lim\limits_{\mathbf{x} \to \mathbf{x}_0} \tilde{g}(f(\mathbf{x})) = A. In particular, if gg is continuous at y0\mathbf{y}_0, then

limxx0g(f(x))=A.\begin{align*} \lim_{\mathbf{x} \to \mathbf{x}_0} g(f(\mathbf{x})) = A. \end{align*}