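Taylor's Theorem for Multivariable Functions
Theorem 1
Let $f : \mathbb{R}^{n} \to \mathbb{R}$ be a $C^{k}$ function with $k \ge 2$, and let $\mathbf{a} = (a_{1}, \dots, a_{n}) \in \mathbb{R}^{n}$. Then there exist $C^{k-2}$ functions $h_{ij}$ satisfying the following.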
$$ f(\mathbf{x}) = f(\mathbf{a}) + \sum_{i} (x_{i} - a_{i})\dfrac{\partial f}{\partial x_{i}}(\mathbf{a}) + \sum_{i,j}h_{ij}(\mathbf{x})(x_{i} - a_{i}) (x_{j} - a_{j}) $$
Explanation
This generalizes Taylor's theorem to functions of several variables.
Second-order expansion
$$ \begin{align*} f(\mathbf{x}) &= f(\mathbf{a}) + \sum\limits_{i=1}^{n} (x_{i} - a_{i}) \dfrac{\partial f}{\partial x_{i}}(\mathbf{a}) + \dfrac{1}{2!}\sum\limits_{i,j=1}^{n} (x_{i} - a_{i})(x_{j} - a_{j}) \dfrac{\partial^{2} f}{\partial x_{i} \partial x_{j}}(\mathbf{a}) + \text{Remainder} \\ &= f(\mathbf{a}) + (\mathbf{x} - \mathbf{a})^{T} \nabla f (\mathbf{a}) + \dfrac{1}{2!}(\mathbf{x} - \mathbf{a})^{T} H(\mathbf{a}) (\mathbf{x} - \mathbf{a}) + \text{Remainder} \end{align*} $$
Here $\nabla f$ is the gradient of $f$ and $H$ is the Hessian of $f$.
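The second-order expansion can be checked numerically. The sketch below is only an illustration: the test function $f(x, y) = e^{x}\sin y$, the base point, and the direction are assumptions chosen for this example and do not come from the text. If the expansion is right, the gap between $f$ and its second-order Taylor polynomial should shrink roughly like the cube of the step size.

```python
import math

# Illustrative test function (an assumption for this example): f(x, y) = e^x sin(y)
def f(x, y):
    return math.exp(x) * math.sin(y)

def grad_f(x, y):
    # Gradient of f, computed by hand
    e = math.exp(x)
    return (e * math.sin(y), e * math.cos(y))

def hess_f(x, y):
    # Hessian of f, computed by hand
    e = math.exp(x)
    return ((e * math.sin(y), e * math.cos(y)),
            (e * math.cos(y), -e * math.sin(y)))

def taylor2(a, x):
    """Second-order Taylor polynomial of f about a, evaluated at x."""
    d = (x[0] - a[0], x[1] - a[1])
    g = grad_f(*a)
    H = hess_f(*a)
    linear = sum(d[i] * g[i] for i in range(2))
    quad = sum(d[i] * H[i][j] * d[j] for i in range(2) for j in range(2))
    return f(*a) + linear + 0.5 * quad

a = (0.3, 0.7)            # hypothetical base point
direction = (1.0, -2.0)   # hypothetical direction of x - a

errors = []
for s in (1e-1, 1e-2, 1e-3):
    x = (a[0] + s * direction[0], a[1] + s * direction[1])
    errors.append(abs(f(*x) - taylor2(a, x)))
    print(s, errors[-1])
```

Shrinking the step by a factor of 10 should reduce the printed error by a factor of about 1000, which is the behavior expected of a third-order remainder.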
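The following forms, which write the remainder term explicitly, are also useful. The forms involving only $\nabla f$ hold for $C^{1}$ functions, and the form involving $H$ holds for $C^{2}$ functions.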
$$ f(\mathbf{x} + \mathbf{p}) = f(\mathbf{x}) + \mathbf{p}^{T}\nabla f(\mathbf{x} + t \mathbf{p}) \quad \text{for some } t \in (0,1) $$
$$ f(\mathbf{x} + \mathbf{p}) = f(\mathbf{x}) + \mathbf{p}^{T}\nabla f(\mathbf{x}) + \dfrac{1}{2!}\mathbf{p}^{T} H(\mathbf{x} + t \mathbf{p}) \mathbf{p} \quad \text{for some } t \in (0,1) $$
$$ f(\mathbf{x} + \mathbf{p}) = f(\mathbf{x}) + \int_{0}^{1}\mathbf{p}^{T}\nabla f (\mathbf{x} + t\mathbf{p})dt $$
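The integral form can be verified numerically as well. The following is a minimal sketch under stated assumptions: the test function $f(x, y) = e^{x}\sin y$, the points `x` and `p`, and the use of a composite Simpson rule are all choices made for this illustration, not part of the original text.

```python
import math

# Illustrative test function (an assumption for this example): f(x, y) = e^x sin(y)
def f(x, y):
    return math.exp(x) * math.sin(y)

def grad_f(x, y):
    # Gradient of f, computed by hand
    e = math.exp(x)
    return (e * math.sin(y), e * math.cos(y))

def directional_integrand(x, p, t):
    """p^T grad f(x + t p), the integrand of the integral form."""
    g = grad_f(x[0] + t * p[0], x[1] + t * p[1])
    return p[0] * g[0] + p[1] * g[1]

def simpson(func, lo, hi, n=200):
    """Composite Simpson rule with n (even) subintervals."""
    h = (hi - lo) / n
    total = func(lo) + func(hi)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * func(lo + k * h)
    return total * h / 3

x = (0.3, 0.7)    # hypothetical base point
p = (0.5, -1.2)   # hypothetical displacement

lhs = f(x[0] + p[0], x[1] + p[1]) - f(*x)
rhs = simpson(lambda t: directional_integrand(x, p, t), 0.0, 1.0)
print(lhs, rhs)
```

The two printed values should agree up to the quadrature error, reflecting that the integrand is exactly the derivative of $t \mapsto f(\mathbf{x} + t\mathbf{p})$.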
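Proof
By the fundamental theorem of calculus applied to $t \mapsto f(t(\mathbf{x} - \mathbf{a}) + \mathbf{a})$, together with the chain rule,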
$$ \begin{align*} f(\mathbf{x}) - f(\mathbf{a}) =&\ \int_{0}^{1} \dfrac{d}{dt} \left[ f(t(\mathbf{x} - \mathbf{a}) + \mathbf{a}) \right]dt \\ =&\ \int_{0}^{1} \left( \sum_{i} \dfrac{\partial f}{\partial x_{i}}\left( t(\mathbf{x} - \mathbf{a}) + \mathbf{a} \right)(x_{i}-a_{i}) \right) dt & \text{by } \href{https://freshrimpsushi.github.io/posts/3134}{\text{chain rule}} \\ =&\ \sum_{i}(x_{i} - a_{i}) \int_{0}^{1} \left( \dfrac{\partial f}{\partial x_{i}}\left( t(\mathbf{x} - \mathbf{a}) + \mathbf{a} \right) \right) dt \end{align*} $$
Denote the integral by $g_{i}(\mathbf{x})$. That is, setting $g_{i}(\mathbf{x}) = \displaystyle \int_{0}^{1} \dfrac{\partial f}{\partial x_{i}}\left( t(\mathbf{x} - \mathbf{a}) + \mathbf{a} \right) dt$, we obtain
$$ \begin{equation} f(\mathbf{x}) - f(\mathbf{a}) = \sum_{i}(x_{i} - a_{i}) \int_{0}^{1} \left( \dfrac{\partial f}{\partial x_{i}}\left( t(\mathbf{x} - \mathbf{a}) + \mathbf{a} \right) \right) dt = \sum_{i} g_{i}(\mathbf{x}) (x_{i} - a_{i}) \end{equation} $$
The value of $g_{i}$ at $\mathbf{a}$ is as follows.
$$ g_{i}(\mathbf{a}) = \int_{0}^{1} \dfrac{\partial f}{\partial x_{i}} \left(t(\mathbf{a} - \mathbf{a}) + \mathbf{a} \right) dt = \int_{0}^{1} \dfrac{\partial f}{\partial x_{i}}\left( \mathbf{a} \right) dt = \dfrac{\partial f}{\partial x_{i}}\left( \mathbf{a} \right) $$
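Each $g_{i}$ is a $C^{k-1}$ function, so applying the argument that led to $(1)$ to $g_{i}$ in place of $f$ gives $C^{k-2}$ functions $h_{ij}(\mathbf{x}) = \displaystyle \int_{0}^{1} \dfrac{\partial g_{i}}{\partial x_{j}}\left( t(\mathbf{x} - \mathbf{a}) + \mathbf{a} \right) dt$ such that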
$$ g_{i}(\mathbf{x}) - g_{i}(\mathbf{a}) = \sum_{j} h_{ij}(\mathbf{x}) (x_{j}-a_{j}) $$
Putting these together,
$$ \begin{align*} f(\mathbf{x}) =&\ f(\mathbf{a}) + \sum_{i}g_{i}(\mathbf{x})(x_{i}-a_{i}) \\ =&\ f(\mathbf{a}) + \sum_{i}\left( g_{i}(\mathbf{a}) + \sum_{j} h_{ij}(\mathbf{x}) (x_{j}-a_{j}) \right)(x_{i}-a_{i}) \\ =&\ f(\mathbf{a}) + \sum_{i} g_{i}(\mathbf{a})(x_{i}-a_{i}) + \sum_{i,j} h_{ij}(\mathbf{x})(x_{i}-a_{i})(x_{j}-a_{j}) \\ =&\ f(\mathbf{a}) + \sum_{i} \dfrac{\partial f}{\partial x_{i}}\left( \mathbf{a} \right)(x_{i}-a_{i}) + \sum_{i,j} h_{ij}(\mathbf{x})(x_{i}-a_{i})(x_{j}-a_{j}) \end{align*} $$
■
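The construction in the proof can also be tried out numerically. The sketch below makes several assumptions that are not in the original text: the test function $f(x, y) = e^{x}\sin y$, the points `a` and `x`, and the composite Simpson rule used for the integrals. It also writes $h_{ij}$ as the double integral $\int_{0}^{1}\int_{0}^{1} t\, \dfrac{\partial^{2} f}{\partial x_{i} \partial x_{j}}\left( st(\mathbf{x} - \mathbf{a}) + \mathbf{a} \right) dt\, ds$, which is what one gets by applying the same construction to $g_{i}$ and differentiating under the integral sign.

```python
import math

# Illustrative test function (an assumption for this example): f(x, y) = e^x sin(y)
def f(x, y):
    return math.exp(x) * math.sin(y)

def grad_f(x, y):
    e = math.exp(x)
    return (e * math.sin(y), e * math.cos(y))

def hess_f(x, y):
    e = math.exp(x)
    return ((e * math.sin(y), e * math.cos(y)),
            (e * math.cos(y), -e * math.sin(y)))

def simpson(func, n=100):
    """Composite Simpson rule on [0, 1] with n (even) subintervals."""
    h = 1.0 / n
    total = func(0.0) + func(1.0)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * func(k * h)
    return total * h / 3

def g(i, a, x):
    """g_i(x): integral over t of (df/dx_i)(a + t (x - a))."""
    return simpson(lambda t: grad_f(a[0] + t * (x[0] - a[0]),
                                    a[1] + t * (x[1] - a[1]))[i])

def h(i, j, a, x):
    """h_ij(x): double integral of t (d^2 f / dx_i dx_j)(a + s t (x - a))."""
    def inner(s):
        return simpson(lambda t: t * hess_f(a[0] + s * t * (x[0] - a[0]),
                                            a[1] + s * t * (x[1] - a[1]))[i][j])
    return simpson(inner)

a = (0.3, 0.7)    # hypothetical expansion point
x = (0.8, -0.5)   # hypothetical evaluation point
d = (x[0] - a[0], x[1] - a[1])

# Equation (1): f(x) = f(a) + sum_i g_i(x) (x_i - a_i)
first_form = f(*a) + sum(g(i, a, x) * d[i] for i in range(2))

# Theorem: f(x) = f(a) + sum_i df/dx_i(a) (x_i - a_i) + sum_ij h_ij(x) (x_i - a_i)(x_j - a_j)
theorem_form = (f(*a)
                + sum(grad_f(*a)[i] * d[i] for i in range(2))
                + sum(h(i, j, a, x) * d[i] * d[j]
                      for i in range(2) for j in range(2)))

# g_i(a) should equal df/dx_i(a)
g_at_a = [g(i, a, a) for i in range(2)]

print(f(*x), first_form, theorem_form)
```

All three printed values should agree up to quadrature error. Note that the identity is exact for every $\mathbf{x}$, not only for $\mathbf{x}$ near $\mathbf{a}$, because the $h_{ij}$ depend on $\mathbf{x}$.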
See also
Richard S. Millman and George D. Parker, Elements of Differential Geometry (1977), pp. 213-214