Newton's method for minimisation returns a critical point

33 views (last 30 days)
Dussan Radonich on 10 Nov 2020
Edited: Bruno Luong on 11 Nov 2020
I am trying to implement Newton's method to find the minima of the Himmelblau function.
The code works most of the time, but in cases like this one, where my initial guess is (0.5, 1), it returns a critical point of the function that is not a minimum. I understand this is because the gradient becomes 0 there and no new points are generated.
Now my question would be: is this normal with this method? Is there a way of getting around this problem?
Thanks for any help
close all; clear; clc
% Initialisation of variables to use
x0 = [0.5;1];
tol = 1e-4;
maxits = 50;
% Himmelblau function
him = @(x,y) (x.^2 + y - 11).^2 + (x + y.^2 - 7).^2;
% Gradient of the Himmelblau
grad_him = @(x,y) [4*x.^3 + 4*x.*y - 42*x + 2*y.^2 - 14; 4*y.^3 + 4*x.*y - 26*y + 2*x.^2 - 22];
% Hessian matrix of the Himmelblau
hessian_him = @(x,y) [12*x.^2 + 4*y - 42, 4*x + 4*y; 4*x + 4*y, 12*y.^2 + 4*x - 26];
% Call to newton's function and displaying our results accordingly
[r, iters, flag] = newton_min(grad_him,hessian_him,x0,tol,maxits);
fprintf ("<strong>Newton's method</strong>\n\n");
switch (flag)
    case 0
        fprintf("There was convergence on f\n\n");
        fprintf("The minimum found is: \n");
        disp(r);
        fprintf("It took %d iterations.\n\n", iters);
    case 1
        fprintf("There was convergence on x\n\n");
        fprintf("The minimum found is: \n");
        disp(r);
        fprintf("It took %d iterations.\n\n", iters);
    otherwise
        fprintf("There was no convergence\n\n");
end
function [r, iters, flag] = newton_min(dg, ddg, x0, tol, maxits)
    x = x0(1); y = x0(2);
    r = NaN;
    flag = -1;                          % -1 signals no convergence
    for iters = 1 : maxits
        x_old = [x; y];
        % Newton step: solve Hessian * step = gradient and move by -step
        x_new = x_old - (ddg(x,y) \ dg(x,y));
        % Convergence test on the gradient norm
        if norm(dg(x,y)) < tol
            flag = 0;
            r = x_new;
            return;
        end
        % Convergence test on the step size
        if norm(x_new - x_old) <= (tol + eps*norm(x_new))
            flag = 1;
            r = x_new;
            return;
        end
        x = x_new(1);
        y = x_new(2);
    end
end

Accepted Answer

Matt J on 10 Nov 2020
Yes, it's normal.
30 Comments
Matt J on 11 Nov 2020
"But that is not the point where the gradient would be zero, it is the critical point (-0.1280, -1.9537)."
Yes, but as long as the algorithm goes downhill from (0.5, 1) at every iteration, it can never approach the inflection point (-0.1280, -1.9537). The inflection point lies uphill from your initial point:
>> him(0.5,1)
ans =
125.3125
>> him(-0.1280, -1.9537)
ans =
178.3372
Dussan Radonich on 11 Nov 2020
Great guys, I got it! Thank you so much


More Answers (2)

J. Alex Lee on 10 Nov 2020
Yes, this looks normal: you are only asking to zero the gradient of the function, so naturally that includes non-optimal points where the gradient is the zero vector.
You can use a non-gradient minimizer, like fminsearch, to seek local minima.
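For example, a minimal sketch of that idea, assuming the him handle defined in the question is on the workspace (him_vec and p_min are just illustrative names):
% fminsearch minimizes a function of a single vector argument,
% so wrap the two-argument him handle from the question
him_vec = @(p) him(p(1), p(2));
[p_min, f_min] = fminsearch(him_vec, [0.5; 1]);
% starting from (0.5, 1) this should land near one of the four minima of
% the Himmelblau function (around (3, 2)), where the function value is ~0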
1 Comment
Dussan Radonich on 10 Nov 2020
Thank you, but the idea is not to use fminsearch, as I am trying to compare Newton's method against fminsearch.



Bruno Luong on 10 Nov 2020
Edited: Bruno Luong on 10 Nov 2020
"Now my question would be, is this normal with this method?"
Your code juts shows it: yes it is normal.
Now in practice it is very rare that one falls on a stationary point that is not a local minimum. As soon as you work with a non-academic objective function. You won't ever get the gradient == 0 exactly.
"Is there a way of getting around this problem?"
All the book I read about optmization, no one care about this specific problem,since as I said it only happens in academic example. However, many methods will compute for each iteration an approximation of the Hessian, and the positiveness of the Hessian is either enforced or monitored. The Hessian that has negative eigenvalues like yours at (0.5,1) will has automatically a special treatment to escape from non-mimum.
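As a rough sketch of that idea (an illustration only, not a quote from any particular book): check the eigenvalues of the Hessian at the current iterate and, when it is not positive definite, shift it by a multiple of the identity before solving for the step, Levenberg-style, so that the step is a descent direction again. Using the grad_him and hessian_him handles from the question:
% Hessian at the starting point from the question
H = hessian_him(0.5, 1);
eig(H)                                % both eigenvalues are negative here
% Modified Newton step: shift the Hessian until it is positive definite
g   = grad_him(0.5, 1);
lam = max(0, 1e-3 - min(eig(H)));     % smallest eigenvalue of H + lam*eye(2) becomes >= 1e-3
p   = -(H + lam*eye(2)) \ g;          % with a positive definite matrix, p is a descent direction
Dropping a step like this into the newton_min loop, in place of the plain ddg(x,y)\dg(x,y) solve, would make the iteration move downhill from (0.5, 1) instead of converging to the non-minimum stationary point.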
