Unit C4: Directional Derivatives, Tangent Planes, Linearization第 C4 单元:方向导数(directional derivative)、切平面(tangent plane)与线性化(linearization)
From the gradient to the geometry it controls: rates of change in any direction, tangent planes to surfaces, and the linear approximations that make multivariable calculus computable.从梯度(gradient)出发,掌握它所支配的几何:任意方向上的变化率、曲面的切平面,以及让多元微积分可计算的线性近似。
gradient)转化为几何工具。你将把方向导数(directional derivative)计算为 $\nabla f \cdot \mathbf{u}$,读出最速上升方向,为函数图像和等值面(level surface)写出切平面(tangent plane),并用线性化(linearization)与微分(differential)做估值和误差传播。请始终牢记一个事实:梯度垂直于等值集,并指向增长最快的方向。
Directional Derivatives方向导数(directional derivative)
Partial derivatives measure the rate of change of $f$ along the coordinate axes. The directional derivative generalizes this to the rate of change along any unit vector, which is the central object of this unit.偏导数(partial derivative)度量 $f$ 沿坐标轴方向的变化率。方向导数(directional derivative)把它推广为沿任意单位向量(unit vector)方向的变化率,这是本单元的核心对象。
unit vector)$\mathbf{u}$ 方向的方向导数,是当你沿直线 $\mathbf{a} + t\mathbf{u}$ 离开 $\mathbf{a}$ 时 $f$ 的瞬时变化率。它是一个数,回答了:沿方向 $\mathbf{u}$ 每移动单位距离,$f$ 变化多快?When $f$ is differentiable at $\mathbf{a}$, this limit is computed without returning to the definition. Composing $f$ with the line $\mathbf{r}(t) = \mathbf{a} + t\mathbf{u}$ and applying the chain rule gives the gradient dot-product formula.当 $f$ 在 $\mathbf{a}$ 处可微(differentiable)时,这个极限无需回到定义即可计算。把 $f$ 与直线 $\mathbf{r}(t) = \mathbf{a} + t\mathbf{u}$ 复合,再用链式法则(Chain Rule),即得梯度点积公式。
normalize):$\mathbf{u} = \mathbf{v} / |\mathbf{v}|$。跳过归一化会把答案放大 $|\mathbf{v}|$ 倍,结果便不再具有"每单位距离的变化率"这一含义。It is worth seeing why the gradient formula is true rather than only memorizing it. Define the single-variable slice $g(t) = f(\mathbf{a} + t\mathbf{u})$. By the limit definition, $D_{\mathbf{u}} f(\mathbf{a})$ is exactly $g'(0)$. If $f$ is differentiable at $\mathbf{a}$, the multivariable chain rule applied to $g(t) = f(x(t), y(t))$ with $x(t) = a_1 + t u_1$ and $y(t) = a_2 + t u_2$ gives $g'(t) = f_x\, x'(t) + f_y\, y'(t) = f_x u_1 + f_y u_2$. Evaluating at $t = 0$ returns $\nabla f(\mathbf{a}) \cdot \mathbf{u}$. The whole content of the formula is that, for a differentiable function, the rate of change along a line is the projection of the gradient onto the line.值得弄清梯度公式为何成立,而不只是死记。定义单变量切片 $g(t) = f(\mathbf{a} + t\mathbf{u})$。由极限定义,$D_{\mathbf{u}} f(\mathbf{a})$ 恰为 $g'(0)$。若 $f$ 在 $\mathbf{a}$ 处可微,对 $g(t) = f(x(t), y(t))$(其中 $x(t) = a_1 + t u_1$,$y(t) = a_2 + t u_2$)应用多元链式法则,得 $g'(t) = f_x\, x'(t) + f_y\, y'(t) = f_x u_1 + f_y u_2$。在 $t = 0$ 处取值即得 $\nabla f(\mathbf{a}) \cdot \mathbf{u}$。公式的全部内容是:对可微函数而言,沿一条直线的变化率就是梯度在该直线上的投影。
Worked Example 1.1: a directional derivative from the gradient例题 1.1:由梯度求方向导数
Let $f(x,y) = x^2 y + 3y$. Find $D_{\mathbf{u}} f$ at $(1,2)$ in the direction of $\mathbf{v} = \langle 3, 4 \rangle$.设 $f(x,y) = x^2 y + 3y$。求 $f$ 在 $(1,2)$ 处沿 $\mathbf{v} = \langle 3, 4 \rangle$ 方向的 $D_{\mathbf{u}} f$。
First the gradient: $f_x = 2xy$, $f_y = x^2 + 3$, so $\nabla f(1,2) = \langle 4, 4 \rangle$.先求梯度:$f_x = 2xy$,$f_y = x^2 + 3$,故 $\nabla f(1,2) = \langle 4, 4 \rangle$。
Normalize: $|\mathbf{v}| = 5$, so $\mathbf{u} = \langle 3/5, 4/5 \rangle$. Then归一化:$|\mathbf{v}| = 5$,故 $\mathbf{u} = \langle 3/5, 4/5 \rangle$。于是
$$ D_{\mathbf{u}} f(1,2) = \langle 4,4 \rangle \cdot \langle 3/5, 4/5 \rangle = \tfrac{12}{5} + \tfrac{16}{5} = \tfrac{28}{5}. $$Worked Example 1.2: a three-variable directional derivative例题 1.2:三变量的方向导数
Let $f(x,y,z) = xy^2 z^3$. Find $D_{\mathbf{u}} f$ at the point $(2,-1,1)$ in the direction toward $(0,0,0)$, that is along $\mathbf{v} = \langle 0,0,0\rangle - \langle 2,-1,1\rangle = \langle -2, 1, -1\rangle$.设 $f(x,y,z) = xy^2 z^3$。求 $f$ 在点 $(2,-1,1)$ 处沿指向 $(0,0,0)$ 的方向的 $D_{\mathbf{u}} f$,即沿 $\mathbf{v} = \langle 0,0,0\rangle - \langle 2,-1,1\rangle = \langle -2, 1, -1\rangle$。
The partials are $f_x = y^2 z^3$, $f_y = 2xy z^3$, $f_z = 3xy^2 z^2$. At $(2,-1,1)$:各偏导数为 $f_x = y^2 z^3$,$f_y = 2xy z^3$,$f_z = 3xy^2 z^2$。在 $(2,-1,1)$ 处:
$$ \nabla f(2,-1,1) = \langle (1)(1),\ 2(2)(-1)(1),\ 3(2)(1)(1)\rangle = \langle 1,\ -4,\ 6\rangle. $$Normalize the direction: $|\mathbf{v}| = \sqrt{4 + 1 + 1} = \sqrt6$, so $\mathbf{u} = \tfrac{1}{\sqrt6}\langle -2,1,-1\rangle$. Then把方向归一化:$|\mathbf{v}| = \sqrt{4 + 1 + 1} = \sqrt6$,故 $\mathbf{u} = \tfrac{1}{\sqrt6}\langle -2,1,-1\rangle$。于是
$$ D_{\mathbf{u}} f = \langle 1,-4,6\rangle \cdot \tfrac{1}{\sqrt6}\langle -2,1,-1\rangle = \frac{-2 - 4 - 6}{\sqrt6} = \frac{-12}{\sqrt6} = -2\sqrt6. $$The negative sign tells us $f$ is decreasing as we head from $(2,-1,1)$ toward the origin, at a rate of $2\sqrt6 \approx 4.90$ units of $f$ per unit of distance.负号表明:从 $(2,-1,1)$ 朝原点方向走时 $f$ 在减小,速率为每单位距离减少 $2\sqrt6 \approx 4.90$ 个单位的 $f$。
Worked Example 1.3: recovering a partial derivative as a special direction例题 1.3:把偏导数看作特殊方向的特例
Directional derivatives generalize partials, so the partials must reappear as special cases. Take $f(x,y) = e^{x}\sin y$ and the standard basis direction $\mathbf{u} = \mathbf{i} = \langle 1, 0\rangle$ at a general point $(x,y)$.方向导数是偏导数的推广,因此偏导数必定作为特例重新出现。取 $f(x,y) = e^{x}\sin y$,在一般点 $(x,y)$ 处沿标准基方向 $\mathbf{u} = \mathbf{i} = \langle 1, 0\rangle$。
$\nabla f = \langle e^x \sin y,\ e^x \cos y\rangle$, and since $\mathbf{i}$ is already a unit vector,$\nabla f = \langle e^x \sin y,\ e^x \cos y\rangle$,由于 $\mathbf{i}$ 已是单位向量,
$$ D_{\mathbf{i}} f = \nabla f \cdot \langle 1,0\rangle = e^x \sin y = f_x. $$Likewise $D_{\mathbf{j}} f = f_y = e^x \cos y$. This is the sanity check that anchors the whole subject: the directional derivative in the direction of a coordinate axis is just the partial derivative for that variable. If a formula ever fails this test, it is wrong.同理 $D_{\mathbf{j}} f = f_y = e^x \cos y$。这是贯穿整个主题的检验基准:沿坐标轴方向的方向导数就是该变量的偏导数。任何公式若通不过这一检验,就是错的。
The Gradient and Steepest Ascent梯度(gradient)与最速上升
The three facts follow from a single trigonometric identity. The steepest descent direction is $-\nabla f$, and any direction perpendicular to $\nabla f$ keeps $f$ momentarily constant, which is exactly the tangent direction to the level curve.这三个事实都来自同一个三角恒等式。最速下降方向是 $-\nabla f$,任何垂直于 $\nabla f$ 的方向都使 $f$ 瞬时保持不变,而这正是等值线(level curve)的切线方向。
Going deeper: why $\nabla f$ is the steepest-ascent direction深入探讨:为何 $\nabla f$ 是最速上升方向
Fix a point where $\nabla f \neq \mathbf{0}$. For any unit vector $\mathbf{u}$, the Cauchy-Schwarz inequality gives固定一个满足 $\nabla f \neq \mathbf{0}$ 的点。对任意单位向量 $\mathbf{u}$,由柯西-施瓦茨不等式(Cauchy-Schwarz inequality)得
Since $-1 \le \cos\theta \le 1$, the value ranges over $[-|\nabla f|,\, |\nabla f|]$. The maximum $|\nabla f|$ is attained exactly when $\cos\theta = 1$, that is when $\mathbf{u}$ is parallel to $\nabla f$. The minimum is attained at $\mathbf{u} = -\nabla f / |\nabla f|$, and $D_{\mathbf{u}} f = 0$ precisely when $\cos\theta = 0$, i.e. $\mathbf{u}\perp\nabla f$. If $\nabla f = \mathbf{0}$ every directional derivative is zero and the point is critical.由于 $-1 \le \cos\theta \le 1$,该值取遍 $[-|\nabla f|,\, |\nabla f|]$。最大值 $|\nabla f|$ 恰在 $\cos\theta = 1$ 时取得,即 $\mathbf{u}$ 与 $\nabla f$ 平行时。最小值在 $\mathbf{u} = -\nabla f / |\nabla f|$ 处取得;而 $D_{\mathbf{u}} f = 0$ 当且仅当 $\cos\theta = 0$,即 $\mathbf{u}\perp\nabla f$。若 $\nabla f = \mathbf{0}$,则每个方向导数都为零,该点为临界点(critical point)。
Worked Example 2.1: hottest direction to walk例题 2.1:往哪个方向走升温最快
Temperature is $T(x,y) = 100 - x^2 - 2y^2$. At $(2,1)$, in which direction does $T$ increase fastest, and how fast?温度为 $T(x,y) = 100 - x^2 - 2y^2$。在 $(2,1)$ 处,$T$ 沿哪个方向增长最快,增长有多快?
$\nabla T = \langle -2x, -4y \rangle$, so $\nabla T(2,1) = \langle -4, -4 \rangle$. The steepest-ascent direction is $\langle -4,-4\rangle$, or as a unit vector $\langle -1/\sqrt2, -1/\sqrt2\rangle$. The maximum rate is $|\nabla T| = \sqrt{16+16} = 4\sqrt2$ degrees per unit distance.$\nabla T = \langle -2x, -4y \rangle$,故 $\nabla T(2,1) = \langle -4, -4 \rangle$。最速上升方向是 $\langle -4,-4\rangle$,写成单位向量为 $\langle -1/\sqrt2, -1/\sqrt2\rangle$。最大速率为 $|\nabla T| = \sqrt{16+16} = 4\sqrt2$ 度每单位距离。
Worked Example 2.2: a prescribed rate of change in a chosen direction例题 2.2:在选定方向上达到指定的变化率
Let $f(x,y) = x e^{y}$. At the point $P=(2,0)$, find a unit vector $\mathbf{u}$ for which $D_{\mathbf{u}} f(P) = 1$, and explain when no such direction exists.设 $f(x,y) = x e^{y}$。在点 $P=(2,0)$ 处,求使 $D_{\mathbf{u}} f(P) = 1$ 的单位向量 $\mathbf{u}$,并说明何时不存在这样的方向。
First, $\nabla f = \langle e^y,\ x e^y\rangle$, so $\nabla f(2,0) = \langle 1, 2\rangle$ and $|\nabla f| = \sqrt5$. Writing $D_{\mathbf{u}} f = |\nabla f|\cos\theta = \sqrt5\cos\theta$, we need $\sqrt5 \cos\theta = 1$, hence $\cos\theta = 1/\sqrt5 \approx 0.447$, giving $\theta \approx 63.4^\circ$ measured from the gradient. A prescribed rate $r$ is achievable as a directional derivative precisely when $|r| \le |\nabla f|$, because $D_{\mathbf{u}} f$ ranges over $[-|\nabla f|, |\nabla f|]$. Here $1 \le \sqrt5$, so two directions work (one on each side of $\nabla f$). For a concrete one, rotate the unit gradient $\langle 1/\sqrt5, 2/\sqrt5\rangle$ by $\theta = 63.4^\circ$. Asking for a rate above $\sqrt5$ would be impossible: no direction can beat the steepest one.首先,$\nabla f = \langle e^y,\ x e^y\rangle$,故 $\nabla f(2,0) = \langle 1, 2\rangle$,$|\nabla f| = \sqrt5$。写成 $D_{\mathbf{u}} f = |\nabla f|\cos\theta = \sqrt5\cos\theta$,需要 $\sqrt5 \cos\theta = 1$,从而 $\cos\theta = 1/\sqrt5 \approx 0.447$,即从梯度方向起量得 $\theta \approx 63.4^\circ$。指定速率 $r$ 能作为方向导数实现,当且仅当 $|r| \le |\nabla f|$,因为 $D_{\mathbf{u}} f$ 取遍 $[-|\nabla f|, |\nabla f|]$。此处 $1 \le \sqrt5$,故有两个方向可行(梯度两侧各一个)。具体取一个:把单位梯度 $\langle 1/\sqrt5, 2/\sqrt5\rangle$ 旋转 $\theta = 63.4^\circ$ 即可。要求速率超过 $\sqrt5$ 则不可能:没有方向能胜过最陡的方向。
Worked Example 2.3: steepest ascent on a hillside例题 2.3:山坡上的最速上升
The height of a hill is $h(x,y) = 200 - \tfrac{1}{100}(3x^2 + 2y^2)$ meters, with $x,y$ in meters. A hiker stands above the ground point $(60, 40)$. Find the bearing of steepest ascent and the slope encountered in that direction.某山丘的高度为 $h(x,y) = 200 - \tfrac{1}{100}(3x^2 + 2y^2)$ 米,其中 $x,y$ 以米为单位。一名登山者站在地面点 $(60, 40)$ 正上方。求最速上升的方位以及该方向上的坡度。
$\nabla h = \langle -\tfrac{6x}{100}, -\tfrac{4y}{100}\rangle = \langle -0.06x, -0.04y\rangle$. At $(60,40)$: $\nabla h = \langle -3.6,\ -1.6\rangle$. Steepest ascent points along $\langle -3.6, -1.6\rangle$, i.e. back toward smaller $x$ and $y$, which makes sense because the summit is at the origin where $h$ is largest. The slope in that direction is$\nabla h = \langle -\tfrac{6x}{100}, -\tfrac{4y}{100}\rangle = \langle -0.06x, -0.04y\rangle$。在 $(60,40)$ 处:$\nabla h = \langle -3.6,\ -1.6\rangle$。最速上升沿 $\langle -3.6, -1.6\rangle$,即朝更小的 $x$ 和 $y$ 回退,这很合理,因为山顶在原点,那里 $h$ 最大。该方向上的坡度为
$$ |\nabla h| = \sqrt{(-3.6)^2 + (-1.6)^2} = \sqrt{12.96 + 2.56} = \sqrt{15.52} \approx 3.94. $$So the hiker rises about $3.94$ meters of height per meter of horizontal travel in the steepest direction. Walking perpendicular to $\nabla h$, along $\langle 1.6, -3.6\rangle$, keeps elevation momentarily constant: that is a contour line of the hill.因此登山者沿最陡方向每水平前进 $1$ 米,高度约上升 $3.94$ 米。沿垂直于 $\nabla h$ 的方向 $\langle 1.6, -3.6\rangle$ 行走则瞬时保持海拔不变:那就是山丘的一条等高线。
Tangent Planes切平面(tangent plane)
tangent plane)。该平面由两个偏导数构成,它们给出平面在 $x$ 与 $y$ 方向上的斜率。可微性正是保证该平面在局部很好地贴合曲面的条件。This is the graph version of linearization. The same plane can be written with the gradient, which makes the perpendicularity to $\nabla F$ for a level surface (Section 6) transparent. There is a useful bridge between the two viewpoints: a graph $z = f(x,y)$ is itself a level surface of the three-variable function $F(x,y,z) = f(x,y) - z$ at level $0$. Then $\nabla F = \langle f_x, f_y, -1\rangle$, so the surface normal is $\langle f_x, f_y, -1\rangle$ and the plane $f_x(x-a) + f_y(y-b) - (z - f(a,b)) = 0$ rearranges to exactly the formula above. The $-1$ in the third slot is the signature of a graph.这是线性化(linearization)的图像版本。同一平面也可用梯度写出,这使第 6 节中它与等值面法向量 $\nabla F$ 的垂直关系一目了然。两种视角之间有一座有用的桥梁:图像 $z = f(x,y)$ 本身就是三变量函数 $F(x,y,z) = f(x,y) - z$ 在水平值 $0$ 处的等值面。于是 $\nabla F = \langle f_x, f_y, -1\rangle$,故曲面法向量为 $\langle f_x, f_y, -1\rangle$,而平面 $f_x(x-a) + f_y(y-b) - (z - f(a,b)) = 0$ 整理后恰好是上面的公式。第三个分量上的 $-1$ 是"图像"这一情形的标志。
Worked Example 3.1: tangent plane to a paraboloid例题 3.1:抛物面的切平面
Find the tangent plane to $z = x^2 + y^2$ at $(1, 2, 5)$.求 $z = x^2 + y^2$ 在 $(1, 2, 5)$ 处的切平面。
$f_x = 2x$, $f_y = 2y$, so $f_x(1,2) = 2$ and $f_y(1,2) = 4$. With $f(1,2) = 5$:$f_x = 2x$,$f_y = 2y$,故 $f_x(1,2) = 2$,$f_y(1,2) = 4$。又 $f(1,2) = 5$:
$$ z = 5 + 2(x-1) + 4(y-2) = 2x + 4y - 5. $$Check: at $(1,2)$ this gives $z = 2 + 8 - 5 = 5$, matching the point on the surface.检验:在 $(1,2)$ 处得 $z = 2 + 8 - 5 = 5$,与曲面上的点一致。
Worked Example 3.2: tangent plane to a non-polynomial graph例题 3.2:非多项式图像的切平面
Find the tangent plane to $z = \ln(x^2 + y^2)$ at the point above $(1, 0)$.求 $z = \ln(x^2 + y^2)$ 在 $(1, 0)$ 正上方那一点处的切平面。
First the base value: $f(1,0) = \ln(1) = 0$, so the surface point is $(1,0,0)$. The partials are先求基准值:$f(1,0) = \ln(1) = 0$,故曲面点为 $(1,0,0)$。各偏导数为
$$ f_x = \frac{2x}{x^2+y^2}, \qquad f_y = \frac{2y}{x^2+y^2}. $$At $(1,0)$: $f_x = 2/1 = 2$ and $f_y = 0/1 = 0$. The tangent plane is在 $(1,0)$ 处:$f_x = 2/1 = 2$,$f_y = 0/1 = 0$。切平面为
$$ z = 0 + 2(x-1) + 0(y-0) = 2(x-1) = 2x - 2. $$Notice the plane does not involve $y$ at all, because the surface is locally flat in the $y$ direction at this point: $f_y(1,0) = 0$. That is geometrically sensible since $(1,0)$ sits on the $x$-axis, a symmetry line of $\ln(x^2+y^2)$.注意该平面完全不含 $y$,因为在这一点曲面沿 $y$ 方向局部是平的:$f_y(1,0) = 0$。这在几何上很合理,因为 $(1,0)$ 位于 $x$ 轴上,而 $x$ 轴是 $\ln(x^2+y^2)$ 的一条对称线。
Worked Example 3.3: the normal line to a graph例题 3.3:图像的法线
The tangent plane comes paired with a normal line. For $z = x^2 + y^2$ at $(1,2,5)$, using the graph normal $\langle f_x, f_y, -1\rangle = \langle 2, 4, -1\rangle$, the normal line through $(1,2,5)$ is切平面总与一条法线(normal line)配对出现。对 $z = x^2 + y^2$ 在 $(1,2,5)$ 处,用图像法向量 $\langle f_x, f_y, -1\rangle = \langle 2, 4, -1\rangle$,过 $(1,2,5)$ 的法线为
This line is perpendicular to the tangent plane $2x + 4y - z = 5$ found by rearranging Worked Example 3.1. A quick check: the plane's coefficient vector $\langle 2, 4, -1\rangle$ matches the direction of the normal line, confirming they are perpendicular and parallel respectively, as they must be.该直线垂直于把例题 3.1 整理后得到的切平面 $2x + 4y - z = 5$。快速检验:平面的系数向量 $\langle 2, 4, -1\rangle$ 与法线的方向一致,证实它们分别相互垂直、相互平行,正如理应如此。
Linear Approximation and Differentials线性近似与微分(differential)
linearization)$L(x,y)$,就是以切平面为图像的那个函数。对 $(a,b)$ 附近的点,$f(x,y) \approx L(x,y)$,这让我们无需精确计算 $f$ 就能估值并传播小误差。total differential)The differential $dz$ is the change predicted by the tangent plane when the inputs change by $dx$ and $dy$. It is the workhorse of error estimation: if measured quantities carry small uncertainties $dx, dy$, then $dz$ estimates the resulting uncertainty in $z$.微分(differential)$dz$ 是当输入变化 $dx$ 和 $dy$ 时切平面所预测的改变量。它是误差估计的主力工具:若测量量带有小的不确定度 $dx, dy$,则 $dz$ 估计出 $z$ 中由此产生的不确定度。
Worked Example 4.1: estimating a value例题 4.1:估计一个数值
Use linearization to estimate $\sqrt{(3.02)^2 + (3.97)^2}$.用线性化估计 $\sqrt{(3.02)^2 + (3.97)^2}$。
Let $f(x,y) = \sqrt{x^2 + y^2}$ at $(a,b) = (3,4)$, where $f(3,4) = 5$. The partials are $f_x = x/\sqrt{x^2+y^2}$ and $f_y = y/\sqrt{x^2+y^2}$, so $f_x(3,4) = 3/5$, $f_y(3,4) = 4/5$.取 $f(x,y) = \sqrt{x^2 + y^2}$,基点 $(a,b) = (3,4)$,其中 $f(3,4) = 5$。偏导数为 $f_x = x/\sqrt{x^2+y^2}$,$f_y = y/\sqrt{x^2+y^2}$,故 $f_x(3,4) = 3/5$,$f_y(3,4) = 4/5$。
$$ L(3.02, 3.97) = 5 + \tfrac{3}{5}(0.02) + \tfrac{4}{5}(-0.03) = 5 + 0.012 - 0.024 = 4.988. $$Worked Example 4.2: linearizing a product例题 4.2:对乘积做线性化
Estimate $(2.01)^3 (0.98)^4$ using a linear approximation.用线性近似估计 $(2.01)^3 (0.98)^4$。
Let $f(x,y) = x^3 y^4$ at $(a,b) = (2,1)$, where $f(2,1) = 8$. The partials are $f_x = 3x^2 y^4$ and $f_y = 4x^3 y^3$, so $f_x(2,1) = 3(4)(1) = 12$ and $f_y(2,1) = 4(8)(1) = 32$. The increments are $dx = 0.01$, $dy = -0.02$. Then取 $f(x,y) = x^3 y^4$,基点 $(a,b) = (2,1)$,其中 $f(2,1) = 8$。偏导数为 $f_x = 3x^2 y^4$,$f_y = 4x^3 y^3$,故 $f_x(2,1) = 3(4)(1) = 12$,$f_y(2,1) = 4(8)(1) = 32$。增量为 $dx = 0.01$,$dy = -0.02$。于是
$$ L(2.01, 0.98) = 8 + 12(0.01) + 32(-0.02) = 8 + 0.12 - 0.64 = 7.48. $$For comparison the exact value is $(2.01)^3(0.98)^4 \approx 8.1206 \times 0.92237 \approx 7.491$, so the linear estimate $7.48$ is within about $0.15\%$. The size of the error scales with the second derivatives times the square of the increments, which is why small increments make the estimate sharp.作为对照,精确值为 $(2.01)^3(0.98)^4 \approx 8.1206 \times 0.92237 \approx 7.491$,故线性估计 $7.48$ 的误差约在 $0.15\%$ 以内。误差大小与二阶导数乘以增量的平方成比例,这正是增量越小估计越精确的原因。
Worked Example 4.3: percentage error propagation例题 4.3:百分比误差的传播
The period of a pendulum is $T = 2\pi\sqrt{L/g}$. If $L$ is measured with a relative error up to $0.5\%$ and $g$ with a relative error up to $0.1\%$, bound the relative error in $T$.单摆的周期为 $T = 2\pi\sqrt{L/g}$。若 $L$ 的相对误差不超过 $0.5\%$,$g$ 的相对误差不超过 $0.1\%$,请给出 $T$ 的相对误差上界。
Take logarithms before differentiating, a standard trick for products and powers: $\ln T = \ln(2\pi) + \tfrac12\ln L - \tfrac12 \ln g$. The differential is微分前先取对数,这是处理乘积与幂的标准技巧:$\ln T = \ln(2\pi) + \tfrac12\ln L - \tfrac12 \ln g$。其微分为
$$ \frac{dT}{T} = \frac{1}{2}\frac{dL}{L} - \frac{1}{2}\frac{dg}{g}. $$In the worst case the two contributions add in magnitude:在最坏情况下,两项贡献在量级上相加:
$$ \left|\frac{dT}{T}\right| \le \frac{1}{2}(0.005) + \frac{1}{2}(0.001) = 0.0025 + 0.0005 = 0.003 = 0.3\%. $$The logarithmic differential converts each input's relative error into a weighted contribution, and the weights are exactly the exponents in the formula, here $+\tfrac12$ for $L$ and $-\tfrac12$ for $g$.对数微分把每个输入的相对误差转化为一项加权贡献,而权重正是公式中的指数,此处 $L$ 为 $+\tfrac12$,$g$ 为 $-\tfrac12$。
Going deeper: differentials and error propagation深入探讨:微分与误差传播
A rectangle is measured as $x = 30$ cm and $y = 24$ cm, each with a possible error of $\pm 0.1$ cm. Estimate the maximum error in the computed area $A = xy$.一个矩形测得 $x = 30$ cm,$y = 24$ cm,每个测量都可能有 $\pm 0.1$ cm 的误差。估计所算面积 $A = xy$ 的最大误差。
$dA = A_x\, dx + A_y\, dy = y\, dx + x\, dy$. At $(30,24)$ with $|dx|, |dy| \le 0.1$:$dA = A_x\, dx + A_y\, dy = y\, dx + x\, dy$。在 $(30,24)$ 处,且 $|dx|, |dy| \le 0.1$:
$$ |dA| \le 24(0.1) + 30(0.1) = 2.4 + 3.0 = 5.4 \ \text{cm}^2. $$So the area $720\ \text{cm}^2$ carries an estimated uncertainty of about $5.4\ \text{cm}^2$, a relative error near $0.75\%$. The differential turns input tolerances into an output tolerance via a single linear formula.因此面积 $720\ \text{cm}^2$ 的估计不确定度约为 $5.4\ \text{cm}^2$,相对误差接近 $0.75\%$。微分通过一条线性公式把输入的容差转化为输出的容差。
The Chain Rule Revisited再看链式法则(Chain Rule)
Chain Rule)则能处理曲线路径以及多层变量链。The pattern is mechanical once you draw the tree: sum over each path from $f$ to the independent variable, multiplying the partial derivatives along each branch. Writing $df/dt = \nabla f \cdot \mathbf{r}'(t)$ also recovers the directional derivative when $\mathbf{r}'(t)$ is a unit vector.一旦画出依赖树,这个套路就很机械:对从 $f$ 到该自变量的每条路径求和,沿每条分支把偏导数相乘。把式子写成 $df/dt = \nabla f \cdot \mathbf{r}'(t)$,当 $\mathbf{r}'(t)$ 是单位向量时还能重新得到方向导数。
Worked Example 5.1: chain rule along a path例题 5.1:沿路径的链式法则
Let $f(x,y) = x^2 y$ with $x = \cos t$, $y = \sin t$. Find $df/dt$ at $t = 0$.设 $f(x,y) = x^2 y$,且 $x = \cos t$,$y = \sin t$。求 $t = 0$ 处的 $df/dt$。
$f_x = 2xy$, $f_y = x^2$, and $x'(t) = -\sin t$, $y'(t) = \cos t$. So$f_x = 2xy$,$f_y = x^2$,且 $x'(t) = -\sin t$,$y'(t) = \cos t$。于是
$$ \frac{df}{dt} = (2xy)(-\sin t) + (x^2)(\cos t). $$At $t = 0$: $x = 1$, $y = 0$, $\sin 0 = 0$, $\cos 0 = 1$, giving $df/dt = (0)(0) + (1)(1) = 1$.在 $t = 0$ 处:$x = 1$,$y = 0$,$\sin 0 = 0$,$\cos 0 = 1$,得 $df/dt = (0)(0) + (1)(1) = 1$。
Worked Example 5.2: two independent variables (polar coordinates)例题 5.2:两个自变量(极坐标)
Let $f(x,y) = x^2 + y^2$ with $x = r\cos\theta$ and $y = r\sin\theta$. Compute $\partial f/\partial r$ and $\partial f/\partial\theta$ by the chain rule and confirm against direct substitution.设 $f(x,y) = x^2 + y^2$,且 $x = r\cos\theta$,$y = r\sin\theta$。用链式法则求 $\partial f/\partial r$ 和 $\partial f/\partial\theta$,并与直接代入的结果相互印证。
$f_x = 2x$, $f_y = 2y$. The inner partials are $x_r = \cos\theta$, $y_r = \sin\theta$, $x_\theta = -r\sin\theta$, $y_\theta = r\cos\theta$. Then$f_x = 2x$,$f_y = 2y$。内层偏导数为 $x_r = \cos\theta$,$y_r = \sin\theta$,$x_\theta = -r\sin\theta$,$y_\theta = r\cos\theta$。于是
$$ \frac{\partial f}{\partial r} = 2x\cos\theta + 2y\sin\theta = 2r\cos^2\theta + 2r\sin^2\theta = 2r, $$ $$ \frac{\partial f}{\partial\theta} = 2x(-r\sin\theta) + 2y(r\cos\theta) = -2r^2\cos\theta\sin\theta + 2r^2\sin\theta\cos\theta = 0. $$Direct substitution gives $f = r^2\cos^2\theta + r^2\sin^2\theta = r^2$, so indeed $f_r = 2r$ and $f_\theta = 0$. The vanishing $\theta$ derivative is the statement that $f$ is rotationally symmetric: it depends only on the radius.直接代入得 $f = r^2\cos^2\theta + r^2\sin^2\theta = r^2$,故确有 $f_r = 2r$,$f_\theta = 0$。对 $\theta$ 的导数为零,正说明 $f$ 具有旋转对称性:它只依赖于半径。
Worked Example 5.3: implicit differentiation from the chain rule例题 5.3:由链式法则得到隐函数求导
The chain rule yields the implicit-function formula. Suppose $y$ is defined implicitly by $F(x,y) = 0$. Differentiate with respect to $x$, treating $y = y(x)$:链式法则给出隐函数(implicit differentiation)公式。设 $y$ 由 $F(x,y) = 0$ 隐式确定。把 $y = y(x)$,对 $x$ 求导:
Apply it to the folium-style curve $x^3 + y^3 = 6xy$ at the point $(3,3)$. Let $F = x^3 + y^3 - 6xy$. Then $F_x = 3x^2 - 6y$ and $F_y = 3y^2 - 6x$. At $(3,3)$: $F_x = 27 - 18 = 9$ and $F_y = 27 - 18 = 9$, so把它用于笛卡尔叶形线(folium)$x^3 + y^3 = 6xy$ 在点 $(3,3)$ 处。设 $F = x^3 + y^3 - 6xy$。则 $F_x = 3x^2 - 6y$,$F_y = 3y^2 - 6x$。在 $(3,3)$ 处:$F_x = 27 - 18 = 9$,$F_y = 27 - 18 = 9$,故
The tangent line to the curve at $(3,3)$ has slope $-1$. This is the multivariable chain rule doing implicit differentiation in one clean step, with no need to solve for $y$.曲线在 $(3,3)$ 处的切线斜率为 $-1$。这就是多元链式法则一步干净地完成隐函数求导,无需解出 $y$。
Gradients and Level Surfaces梯度与等值面(level surface)
level surface)$F = k$ 上某点处的梯度 $\nabla F$ 与该曲面正交。仅凭这一事实,就能由梯度直接得到曲面的法向量,进而得到切平面,而无需解出 $z$。The normal line to the surface at $\mathbf{a}$ runs in the direction $\nabla F(\mathbf{a})$. In two variables the same principle says $\nabla f$ is perpendicular to the level curve $f = k$, which is why moving perpendicular to $\nabla f$ keeps $f$ constant (Section 2).曲面在 $\mathbf{a}$ 处的法线(normal line)沿方向 $\nabla F(\mathbf{a})$。在二维情形下同样的原理表明 $\nabla f$ 垂直于等值线 $f = k$,这也正是为何沿垂直于 $\nabla f$ 的方向移动会保持 $f$ 不变(见第 2 节)。
Going deeper: why $\nabla F \perp$ the level surface深入探讨:为何 $\nabla F$ 垂直于等值面
Let $\mathbf{r}(t)$ be any differentiable curve lying entirely in the level surface $F(x,y,z) = k$ with $\mathbf{r}(t_0) = \mathbf{a}$. Then $F(\mathbf{r}(t)) = k$ is constant, so differentiating with the chain rule:设 $\mathbf{r}(t)$ 是完全位于等值面 $F(x,y,z) = k$ 内的任一可微曲线,且 $\mathbf{r}(t_0) = \mathbf{a}$。则 $F(\mathbf{r}(t)) = k$ 为常数,用链式法则求导:
$$ \frac{d}{dt}F(\mathbf{r}(t)) = \nabla F(\mathbf{r}(t)) \cdot \mathbf{r}'(t) = 0. $$At $t = t_0$ this gives $\nabla F(\mathbf{a}) \cdot \mathbf{r}'(t_0) = 0$. Since $\mathbf{r}'(t_0)$ can be the tangent vector of any curve through $\mathbf{a}$ in the surface, $\nabla F(\mathbf{a})$ is orthogonal to every tangent vector, hence normal to the surface. The tangent plane is the plane through $\mathbf{a}$ with normal $\nabla F(\mathbf{a})$.在 $t = t_0$ 处得 $\nabla F(\mathbf{a}) \cdot \mathbf{r}'(t_0) = 0$。由于 $\mathbf{r}'(t_0)$ 可以是曲面内过 $\mathbf{a}$ 的任意曲线的切向量,故 $\nabla F(\mathbf{a})$ 与每个切向量都正交,因而是曲面的法向量。切平面就是过 $\mathbf{a}$ 且以 $\nabla F(\mathbf{a})$ 为法向量的平面。
Worked Example 6.1: tangent plane to a sphere例题 6.1:球面的切平面
Find the tangent plane to $x^2 + y^2 + z^2 = 9$ at $(2, 1, 2)$.求 $x^2 + y^2 + z^2 = 9$ 在 $(2, 1, 2)$ 处的切平面。
Let $F = x^2 + y^2 + z^2$. Then $\nabla F = \langle 2x, 2y, 2z \rangle$, so $\nabla F(2,1,2) = \langle 4, 2, 4 \rangle$. The plane is设 $F = x^2 + y^2 + z^2$。则 $\nabla F = \langle 2x, 2y, 2z \rangle$,故 $\nabla F(2,1,2) = \langle 4, 2, 4 \rangle$。切平面为
$$ 4(x-2) + 2(y-1) + 4(z-2) = 0, \quad \text{i.e.}\quad 2x + y + 2z = 9. $$This is consistent with the geometry: the radius to $(2,1,2)$ is the normal to the sphere there.这与几何一致:指向 $(2,1,2)$ 的半径就是球面在该处的法向量。
Worked Example 6.2: tangent plane and normal line to an ellipsoid例题 6.2:椭球面的切平面与法线
Find the tangent plane and the normal line to the ellipsoid $\dfrac{x^2}{4} + y^2 + \dfrac{z^2}{9} = 3$ at $(2, 1, 3)$.求椭球面 $\dfrac{x^2}{4} + y^2 + \dfrac{z^2}{9} = 3$ 在 $(2, 1, 3)$ 处的切平面与法线。
Let $F = \tfrac{x^2}{4} + y^2 + \tfrac{z^2}{9}$. Then $\nabla F = \langle \tfrac{x}{2},\ 2y,\ \tfrac{2z}{9}\rangle$. At $(2,1,3)$: $\nabla F = \langle 1,\ 2,\ \tfrac{2}{3}\rangle$. The tangent plane is设 $F = \tfrac{x^2}{4} + y^2 + \tfrac{z^2}{9}$。则 $\nabla F = \langle \tfrac{x}{2},\ 2y,\ \tfrac{2z}{9}\rangle$。在 $(2,1,3)$ 处:$\nabla F = \langle 1,\ 2,\ \tfrac{2}{3}\rangle$。切平面为
$$ 1(x-2) + 2(y-1) + \tfrac{2}{3}(z-3) = 0 \quad\Longrightarrow\quad x + 2y + \tfrac{2}{3}z = 6, $$or, clearing the fraction, $3x + 6y + 2z = 18$. The normal line uses the same direction $\langle 1, 2, \tfrac23\rangle$ (or its scalar multiple $\langle 3, 6, 2\rangle$):或者去分母得 $3x + 6y + 2z = 18$。法线沿同一方向 $\langle 1, 2, \tfrac23\rangle$(或其标量倍 $\langle 3, 6, 2\rangle$):
$$ (x,y,z) = (2,1,3) + t\langle 3, 6, 2\rangle. $$A quick verification: the point $(2,1,3)$ satisfies $3(2)+6(1)+2(3) = 6+6+6 = 18$, so it lies on the plane, as it must.快速验证:点 $(2,1,3)$ 满足 $3(2)+6(1)+2(3) = 6+6+6 = 18$,故它在该平面上,正如理应如此。
Worked Example 6.3: angle of intersection of two surfaces例题 6.3:两曲面的相交角
The surfaces $x^2 + y^2 + z^2 = 6$ and $x^2 + y^2 - z = 0$ both pass through $(1, 1, 2)$. Find the angle between them there, defined as the angle between their normal vectors.曲面 $x^2 + y^2 + z^2 = 6$ 与 $x^2 + y^2 - z = 0$ 都过 $(1, 1, 2)$。求它们在该处的夹角,定义为两个法向量之间的夹角。
For $F = x^2 + y^2 + z^2$: $\nabla F = \langle 2x, 2y, 2z\rangle$, so $\nabla F(1,1,2) = \langle 2, 2, 4\rangle$. For $G = x^2 + y^2 - z$: $\nabla G = \langle 2x, 2y, -1\rangle$, so $\nabla G(1,1,2) = \langle 2, 2, -1\rangle$. The angle $\phi$ between the normals satisfies对 $F = x^2 + y^2 + z^2$:$\nabla F = \langle 2x, 2y, 2z\rangle$,故 $\nabla F(1,1,2) = \langle 2, 2, 4\rangle$。对 $G = x^2 + y^2 - z$:$\nabla G = \langle 2x, 2y, -1\rangle$,故 $\nabla G(1,1,2) = \langle 2, 2, -1\rangle$。法向量之间的夹角 $\phi$ 满足
$$ \cos\phi = \frac{\nabla F \cdot \nabla G}{|\nabla F||\nabla G|} = \frac{4 + 4 - 4}{\sqrt{4+4+16}\,\sqrt{4+4+1}} = \frac{4}{\sqrt{24}\,\sqrt{9}} = \frac{4}{3\sqrt{24}} = \frac{4}{6\sqrt6} = \frac{2}{3\sqrt6}. $$Numerically $\cos\phi = 2/(3\sqrt6) \approx 0.2722$, so $\phi \approx 74.2^\circ$. The surfaces meet at roughly $74$ degrees at that point. The gradient is doing all the geometric work: it converts each implicit surface into a concrete normal direction.数值上 $\cos\phi = 2/(3\sqrt6) \approx 0.2722$,故 $\phi \approx 74.2^\circ$。两曲面在该点约以 $74$ 度相交。梯度承担了全部几何工作:它把每个隐式曲面转化为一个具体的法向量方向。
Going Deeper深入探讨
The unifying statement of this unit is the differentiability condition: near $(a,b)$,本单元的统一表述就是可微性条件:在 $(a,b)$ 附近,
This says the tangent plane (the linear term $\nabla f \cdot \mathbf{h}$) approximates $f$ with error that vanishes faster than $|\mathbf{h}|$. A useful sufficient condition: if $f_x$ and $f_y$ exist and are continuous near $\mathbf{a}$, then $f$ is differentiable at $\mathbf{a}$, and all of the formulas in this unit apply.这表明切平面(线性项 $\nabla f \cdot \mathbf{h}$)以比 $|\mathbf{h}|$ 更快趋于零的误差逼近 $f$。一个有用的充分条件:若 $f_x$ 和 $f_y$ 在 $\mathbf{a}$ 附近存在且连续,则 $f$ 在 $\mathbf{a}$ 处可微,本单元的全部公式都适用。
Going deeper: why continuous partials force differentiability深入探讨:为何偏导数连续就能保证可微
We prove the standard sufficient condition: if $f_x$ and $f_y$ exist and are continuous on a disk around $\mathbf{a} = (a,b)$, then $f$ is differentiable at $\mathbf{a}$. The engine is the one-variable Mean Value Theorem applied one coordinate at a time.我们证明标准的充分条件:若 $f_x$ 和 $f_y$ 在 $\mathbf{a} = (a,b)$ 的某个圆盘上存在且连续,则 $f$ 在 $\mathbf{a}$ 处可微。核心引擎是逐个坐标地应用一元中值定理(Mean Value Theorem)。
Write the increment $f(a+h, b+k) - f(a,b)$ and split it through the corner point $(a+h, b)$:写出增量 $f(a+h, b+k) - f(a,b)$,并经由拐角点 $(a+h, b)$ 拆分:
$$ \Delta f = \big[f(a+h, b+k) - f(a+h, b)\big] + \big[f(a+h, b) - f(a,b)\big]. $$Apply the Mean Value Theorem to each bracket. In the first, only the second coordinate changes, so there is some $k^\ast$ between $0$ and $k$ with $f(a+h, b+k) - f(a+h, b) = f_y(a+h, b+k^\ast)\,k$. In the second, only the first coordinate changes, so there is some $h^\ast$ between $0$ and $h$ with $f(a+h, b) - f(a,b) = f_x(a+h^\ast, b)\,h$. Hence对每个方括号应用中值定理。在第一个里只有第二个坐标变化,故存在介于 $0$ 与 $k$ 之间的某个 $k^\ast$,使 $f(a+h, b+k) - f(a+h, b) = f_y(a+h, b+k^\ast)\,k$。在第二个里只有第一个坐标变化,故存在介于 $0$ 与 $h$ 之间的某个 $h^\ast$,使 $f(a+h, b) - f(a,b) = f_x(a+h^\ast, b)\,h$。于是
$$ \Delta f = f_x(a+h^\ast, b)\,h + f_y(a+h, b+k^\ast)\,k. $$Now subtract the proposed linear part $f_x(a,b)h + f_y(a,b)k$:现在减去拟用的线性部分 $f_x(a,b)h + f_y(a,b)k$:
$$ \Delta f - \big[f_x(a,b)h + f_y(a,b)k\big] = \underbrace{\big[f_x(a+h^\ast,b) - f_x(a,b)\big]}_{=\,\varepsilon_1}\,h + \underbrace{\big[f_y(a+h,b+k^\ast) - f_y(a,b)\big]}_{=\,\varepsilon_2}\,k. $$Because $f_x$ and $f_y$ are continuous at $(a,b)$, and because $(a+h^\ast, b) \to (a,b)$ and $(a+h, b+k^\ast) \to (a,b)$ as $(h,k) \to (0,0)$, both $\varepsilon_1 \to 0$ and $\varepsilon_2 \to 0$. Finally, since $|h| \le |\mathbf{h}|$ and $|k| \le |\mathbf{h}|$ where $\mathbf{h} = (h,k)$,由于 $f_x$ 和 $f_y$ 在 $(a,b)$ 处连续,且当 $(h,k) \to (0,0)$ 时 $(a+h^\ast, b) \to (a,b)$、$(a+h, b+k^\ast) \to (a,b)$,故 $\varepsilon_1 \to 0$ 且 $\varepsilon_2 \to 0$。最后,由于 $|h| \le |\mathbf{h}|$ 且 $|k| \le |\mathbf{h}|$,其中 $\mathbf{h} = (h,k)$,
$$ \frac{\big|\Delta f - (f_x(a,b)h + f_y(a,b)k)\big|}{|\mathbf{h}|} \le |\varepsilon_1| + |\varepsilon_2| \longrightarrow 0. $$That is exactly the differentiability condition with $\nabla f(\mathbf{a})\cdot\mathbf{h} = f_x h + f_y k$ as the linear term. So $C^1$ implies differentiable, which is the workhorse theorem every formula in this unit quietly relies on.这正是以 $\nabla f(\mathbf{a})\cdot\mathbf{h} = f_x h + f_y k$ 为线性项的可微性条件。于是 $C^1$ 蕴含可微,这是本单元每条公式默默依赖的主力定理。
Going deeper: directional derivatives without differentiability深入探讨:有方向导数却不可微
Consider考虑
$$ f(x,y) = \begin{cases} \dfrac{x^2 y}{x^2 + y^2}, & (x,y)\neq(0,0), \\[4pt] 0, & (x,y)=(0,0). \end{cases} $$Along any unit direction $\mathbf{u} = \langle a, b\rangle$ we compute沿任意单位方向 $\mathbf{u} = \langle a, b\rangle$ 计算
$$ D_{\mathbf{u}}f(0,0) = \lim_{t\to 0}\frac{f(ta,tb)}{t} = \lim_{t\to 0}\frac{1}{t}\cdot\frac{t^3 a^2 b}{t^2(a^2+b^2)} = \frac{a^2 b}{a^2+b^2} = a^2 b. $$So every directional derivative exists at the origin. Yet if the gradient formula held we would need $D_{\mathbf{u}}f = \nabla f(0,0)\cdot\mathbf{u}$ to be linear in $\mathbf{u}$. Here $f_x(0,0) = 0$ and $f_y(0,0) = 0$, so the formula predicts $0$ for every direction, contradicting $a^2 b$ (take $a=b=1/\sqrt2$, giving $1/(2\sqrt2)\neq 0$). Hence $f$ is not differentiable at the origin, even though all directional derivatives exist.所以原点处每个方向导数都存在。然而若梯度公式成立,就需要 $D_{\mathbf{u}}f = \nabla f(0,0)\cdot\mathbf{u}$ 关于 $\mathbf{u}$ 是线性的。此处 $f_x(0,0) = 0$,$f_y(0,0) = 0$,故公式对每个方向都预测为 $0$,与 $a^2 b$ 矛盾(取 $a=b=1/\sqrt2$,得 $1/(2\sqrt2)\neq 0$)。因此尽管全部方向导数都存在,$f$ 在原点仍不可微。
Worked Example 7.1: partials exist but the function is not even continuous例题 7.1:偏导数存在,函数却连连续都不是
The previous counterexample still had all directional derivatives. This one is more dramatic: both partials exist at the origin, yet the function is discontinuous there, so it cannot possibly be differentiable. Let上一个反例好歹还拥有全部方向导数。这一个更极端:两个偏导数在原点都存在,函数却在那里不连续,因而绝不可能可微。设
$$ g(x,y) = \begin{cases} \dfrac{xy}{x^2 + y^2}, & (x,y)\neq(0,0), \\[4pt] 0, & (x,y)=(0,0). \end{cases} $$The partial $g_x(0,0)$ uses only the slice $y = 0$, where $g(x,0) = 0$ for all $x$, so $g_x(0,0) = 0$. By symmetry $g_y(0,0) = 0$. Both partials exist.偏导数 $g_x(0,0)$ 只用到切片 $y = 0$,在该切片上对所有 $x$ 都有 $g(x,0) = 0$,故 $g_x(0,0) = 0$。由对称性 $g_y(0,0) = 0$。两个偏导数都存在。
But approach the origin along $y = x$: then $g(x,x) = \dfrac{x^2}{2x^2} = \dfrac12$ for every $x \neq 0$, whereas along $y = 0$ the value is $0$. The two path limits disagree, so $\lim_{(x,y)\to(0,0)} g$ does not exist and $g$ is discontinuous at the origin. Since differentiability implies continuity, $g$ is not differentiable at $(0,0)$ despite having both partial derivatives there. This is the cleanest possible warning that "partials exist" carries almost no analytic weight on its own.但沿 $y = x$ 趋近原点:对每个 $x \neq 0$ 有 $g(x,x) = \dfrac{x^2}{2x^2} = \dfrac12$,而沿 $y = 0$ 该值为 $0$。两条路径的极限不一致,故 $\lim_{(x,y)\to(0,0)} g$ 不存在,$g$ 在原点不连续。由于可微蕴含连续,尽管 $g$ 在 $(0,0)$ 处两个偏导数都存在,它在那里仍不可微。这是最干净不过的警示:单凭"偏导数存在"几乎不带任何分析上的分量。
Flashcards闪卡
Unit Quiz单元测验
Readiness Checklist学习自测清单
Tap each item you can do without notes. 点击你无需参考资料即可完成的项目。0 / 8 mastered0 / 8 已掌握
- Compute a directional derivative as a normalized gradient dot product.把方向导数计算为归一化后的梯度点积。
- State the steepest-ascent direction and the maximum rate of change at a point.说出某点处的最速上升方向和最大变化率。
- Write the tangent plane to a graph z = f(x,y) using the partial derivatives.用偏导数写出图像 z = f(x,y) 的切平面。
- Build the linearization L(x,y) and use it to estimate a nearby value.构造线性化 L(x,y) 并用它估计附近的数值。
- Use the total differential dz to estimate change and propagate measurement error.用全微分 dz 估计改变量并传播测量误差。
- Apply the multivariable chain rule along a path and for chains of variables.沿路径以及对多层变量链应用多元链式法则。
- Find the tangent plane and normal line to a level surface using the gradient.用梯度求等值面的切平面与法线。
- Explain why directional derivatives can exist where f is not differentiable.解释为何在 f 不可微处方向导数仍可能存在。