HW 5 for CS 4220
You may (and probably should) talk about problems with the each other, with the TAs, and with me, providing attribution for any good ideas you might get. Your final write-up should be your own.
Some preliminary reminders
Consider three $C^2$ functions $\phi, f, g : \mathbb{R}^n \rightarrow \mathbb{R}$ given by
$$\phi(x) = f(x) g(x)$$
We will denote partial derivatives compactly by $f_{,i} = \partial f(x)/\partial x_i$ and $f_{,ij} = \partial^2 f/(\partial x_i \partial x_j)$ (and similarly with $g$ and $\phi$). In this case, the product rule gives us
$$\begin{align*} \phi_{,i} &= f_{,i} g + f g_{,i} \\ \phi_{,ij} &= f_{,ij} g + f_{,i} g_{,j} + f_{,j} g_{,i} + f g_{,ij} \end{align*}$$
or, in terms of gradients and Hessians,
$$\begin{align*} \nabla \phi &= (\nabla f) g + f (\nabla g) \\ H_{\phi} &= H_f g + (\nabla f) (\nabla g)^T + (\nabla g) (\nabla f)^T + f H_g \end{align*}$$
We also know that if $f : \mathbb{R}^n \rightarrow \mathbb{R}$ and $h : \mathbb{R} \rightarrow \mathbb{R}$ and $\psi(x) = h(f(x))$, then the chain rule gives
$$\begin{align*} \psi_{,i}(x) &= h'(f(x)) f_{,i}(x) \\ \psi_{,ij}(x) &= h''(f(x)) f_{,i}(x) f_{,j}(x) + h'(f(x)) f_{,ij}(x) \end{align*}$$
or, in terms of gradients and Hessians,
$$\begin{align*} \nabla \psi(x) &= h'(f(x)) \nabla f(x) \\ H_{\psi}(x) &= h''(f(x)) \nabla f(x) \nabla f(x)^T + h'(f(x)) H_f \end{align*}$$
Questions
Consider the objective function $\phi : \mathbb{R}^n \rightarrow \mathbb{R}$ given by
$$\phi(x) = \exp(-\|x^2\|/2) (c^T x - d)$$
where $c \in \mathbb{R}^n$ and $d \in \mathbb{R}$. Assume $c \neq 0$
Complete the following code to compute $\nabla \phi(x)$ and $H_{\phi}(x)$. We have given a tester to sanity check your result – what is this tester doing?
Complete the Newton iteration code for solving $\nabla \phi(x) = 0$, and give a semilog plot to illustrate quadratic convergence. In the test case, does this converge to a local minimum, maximum, or saddle of $\phi$?
Argue that any stationary point of $\phi$ must occur at $x = \alpha c$ where $\alpha$ is a root of the quadratic $\alpha^2 \|c\|^2 - d \alpha - 1 = 0$. Check that this is true for the solution computed in question 2.
ϕ (generic function with 1 method)