Engineering Optimization: Theory and Practice, Fourth Edition

(Martin Jones) #1
Convex and Concave Functions 781

Theorem A.1A functionf (X)is convex if for any two pointsX 1 andX 2 , we have


f(X 2 )≥f(X 1 ) +∇fT(X 1 )(X 2 −X 1 )

Proof: Iff (X)is convex, we have by definition


f[λX 2 + ( 1 −λ)X 1 ] ≤λf (X 2 ) +( 1 −λ)f (X 1 )

thatis,


f[X 1 + λ(X 2 −X 1 ) ]≤f(X 1 ) +λ[f(X 2 ) −f(X 1 )] (A.3)

This inequality can be rewritten as


f (X 2 ) −f(X 1 )≥

{

f[X 1 + λ(X 2 −X 1 ) ]−f(X 1 )
λ(X 2 −X 1 )

}

(X 2 −X 1 ) (A.4)

By definingX= λ(X 2 −X 1 ) the inequality (A.4) can be written as,


f (X 2 ) −f(X 1 )≥

f[X 1 + X]−f(X 1 )
X

(X 2 −X 1 ) (A.5)

By taking the limit asX→ 0 , inequality (A.5) becomes


f (X 2 ) −f(X 1 ) ≥∇fT(X 1 )(X 2 −X 1 ) (A.6)

which can be seen to be the desired result. Iff (X)is concave, the opposite type of
inequality holds true in (A.6).


Theorem A.2A functionf (X)is convex if the Hessian matrixH(X)=[∂^2 f ( X)/
∂xi∂xj] is positive semidefinite.


Proof: From Taylor’s theorem we have


f (X∗+ h)=f(X∗)+

∑n

i= 1

hi

∂f
∂xi

(X∗)

+

1

2!

∑n

i= 1

∑n

j= 1

hihj

∂^2 f
∂xi∂xj





X=X∗+θh

(A.7)

where 0< θ <1. By lettingX∗=X 1 ,X∗+h=X 2 andh=X 2 −X 1 , Eq. (A.7) can
be rewritten as


f (X 2 ) =f(X 1 ) +∇fT(X 1 )(X 2 −X 1 )+^12 (X 2 −X 1 )T
×H{X 1 + θ(X 2 −X 1 )}(X 2 −X 1 ) (A.8)

It can be seen that inequality (A.6) is satisfied [and hencef (X)will be convex] ifH(X)
is positive semidefinite. Further, ifH(X) is positive definite, the functionf (X)will be
strictly convex. It can also be proved that iff (X)is concave, the Hessian matrix is
negative semidefinite.

Free download pdf