but h12+h22h1h2=∣∣h∣∣h1h2 is not a o(∣∣h∣∣) cause if we use (xn,yn)=(1/n,1/n)⇒∣∣h∣∣h1h2/∣∣h∣∣=1/2 !
Gradient
The gradient is the unique vector ∇f such that dxf(h)=<∇f(x)∣h>, i.e
∇f(x)=∂x1f(x)∂x2f(x)⋮∂xnf(x)
Gradient
The Jacobian matrix is the generalization of the gradient but for function f that outputs in multidimensional space,
Jf(x)=∂x1f1(x)∂x1f2(x)⋮∂x1fm(x)∂x2f1(x)∂x2f2(x)⋮∂x2fm(x)⋯⋯⋱⋯∂xnf1(x)∂xnf2(x)⋮∂xnfm(x)
Hessian
The Hessian matrix is the second order of the gradient,