THE LIFTED NEWTON METHOD AND ITS APPLICATION IN OPTIMIZATION

被引:52
作者
Albersmeyer, Jan [1 ]
Diehl, Moritz [2 ,3 ]
机构
[1] Univ Heidelberg, Simulat & Optimizat Team, Interdisciplinary Ctr Sci Comp IWR, D-69120 Heidelberg, Germany
[2] Katholieke Univ Leuven, Elect Engn Dept ESAT, B-3001 Louvain, Belgium
[3] Katholieke Univ Leuven, Optimizat Engn Ctr OPTEC, B-3001 Louvain, Belgium
关键词
nonlinear optimization; constrained optimization; shooting methods; Newton-type methods; SQP methods; ALGORITHMS;
D O I
10.1137/080724885
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We present a new "lifting" approach for the solution of nonlinear optimization problems (NLPs) that have objective and constraint functions with intermediate variables. Introducing these as additional degrees of freedom into the original problem, combined with adding suitable new constraints to ensure equivalence of the problems, we propose to solve this augmented system instead of the original system by a Newton-type method. This often offers advantages in terms of convergence rates and region of attraction. The main contribution of this article is an efficient algorithmic trick to generate the quantities needed for a Newton-type method on the augmented ("lifted") system with (a) almost no additional computational cost per iteration compared to a nonlifted Newton method, and (b) with negligible programming burden. We derive lifted schemes for Newton's method, as well as for constrained Gauss-Newton and adjoint based sequential quadratic programming (SQP) methods, and show equivalence of the new efficiently lifted approaches with "full-space" lifted Newton iterations. We establish conditions on the intermediate functions that imply faster local quadratic convergence for lifted versus nonlifted Newton iterations, a phenomenon often observed in practice but not yet explained theoretically. Finally, we compare numerically the behavior of the lifted approach with the nonlifted approach on several test problems, including a large scale example with 27 million intermediate variables. The algorithms and examples are available as open-source code in the C++ package LiftOpt.
引用
收藏
页码:1655 / 1684
页数:30
相关论文
共 16 条
[1]  
[Anonymous], 2000, FRONTIERS APPL MATH
[2]  
[Anonymous], 1988, BONNER MATH SCHRIFTE
[3]  
BOCK HG, 1987, BONNER MATHEMATISCHE, V183
[4]  
BOCK HG, 1978, Z ANGEWANDTE MATH ME, V58, P407
[5]  
COPSEY D, WAVE MODEL SIMULATIO
[6]  
Diehl M, 2006, LECT NOTES CONTR INF, V340, P65
[7]   An online active set strategy to overcome the limitations of explicit MPC [J].
Ferreau, H. J. ;
Bock, H. G. ;
Diehl, M. .
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2008, 18 (08) :816-830
[8]   Recursive trust-region methods for multiscale nonlinear optimization [J].
Gratton, Serge ;
Sartenaer, Annick ;
Toint, Philippe L. .
SIAM JOURNAL ON OPTIMIZATION, 2008, 19 (01) :414-444
[9]   Algorithm 755: ADOL-C: A package for the automatic differentiation of algorithms written in C/C++ [J].
Griewank, A ;
Juedes, D ;
Utke, J .
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1996, 22 (02) :131-167
[10]   LEAST SQUARES PARAMETER ESTIMATION IN CHAOTIC DIFFERENTIAL EQUATIONS [J].
Kallrath, Josef ;
Schloeder, Johannes P. ;
Bock, Hans Georg .
CELESTIAL MECHANICS & DYNAMICAL ASTRONOMY, 1993, 56 (1-2) :353-371