a local minimax method using virtual geometric objects ...j.zhou/ljz-virt-1.pdf · complexity of...

24
Journal of Scientific Computing (2019) 78:202–225 https://doi.org/10.1007/s10915-018-0774-y A Local Minimax Method Using Virtual Geometric Objects: Part I—For Finding Saddles Zhaoxiang Li 1 · Bingbing Ji 2 · Jianxin Zhou 2 Received: 21 December 2016 / Accepted: 22 June 2018 / Published online: 17 July 2018 © Springer Science+Business Media, LLC, part of Springer Nature 2018 Abstract By a dynamics of points on virtual geometric objects such as curves, surfaces, etc., with a flexible endpoint, this paper is to develop a new local minimax method for finding the first few unconstrained saddles of a functional, so that different types of saddle point problems in infinite-dimensional spaces can be solved. Since the mathematical framework of the method is general, it covers several existing algorithms in the literature. Algorithm justifications including a strong energy dissipation law and convergence are established. The new algorithm is implemented and tested on several benchmark examples commonly used in the literature to show its stability and efficiency, and then applied to numerically compute saddles of a semilinear elliptic PDE for both (focusing) M-type and (defocusing) W-type cases, where it is shown that those virtual geometric objects can be easily defined without knowing their explicit expressions or representative (interpolation) points and the method can be easily extended to find k-saddles or modified for other purposes, e.g., to compute constrained k- saddles. Keywords Saddles · Local minimax method · Virtual geometric objects · Convergence · Implementation Mathematics Subject Classification 65N20 · 58E05 · 35A15 · 49M05 Zhaoxiang Li: Supported in part by NSF of China (Nos. 11771298 and 11671251). B Zhaoxiang Li [email protected] B Jianxin Zhou [email protected] Bingbing Ji [email protected] 1 Math. Dept., Shanghai Normal University, Shanghai, China 2 Math. Dept., Texas A&M University, College Station, TX 77843, USA 123

Upload: others

Post on 29-Oct-2019

17 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: A Local Minimax Method Using Virtual Geometric Objects ...j.zhou/LJZ-Virt-1.pdf · complexity of the method, gives us a great flexibility to choose preferred smooth curves for different

Journal of Scientific Computing (2019) 78:202–225https://doi.org/10.1007/s10915-018-0774-y

A Local Minimax Method Using Virtual Geometric Objects:Part I—For Finding Saddles

Zhaoxiang Li1 · Bingbing Ji2 · Jianxin Zhou2

Received: 21 December 2016 / Accepted: 22 June 2018 / Published online: 17 July 2018© Springer Science+Business Media, LLC, part of Springer Nature 2018

AbstractBy a dynamics of points on virtual geometric objects such as curves, surfaces, etc., with aflexible endpoint, this paper is to develop a new local minimax method for finding the firstfew unconstrained saddles of a functional, so that different types of saddle point problems ininfinite-dimensional spaces can be solved. Since the mathematical framework of the methodis general, it covers several existing algorithms in the literature. Algorithm justificationsincluding a strong energy dissipation law and convergence are established. The new algorithmis implemented and tested on several benchmark examples commonly used in the literatureto show its stability and efficiency, and then applied to numerically compute saddles of asemilinear elliptic PDE for both (focusing) M-type and (defocusing) W-type cases, whereit is shown that those virtual geometric objects can be easily defined without knowing theirexplicit expressions or representative (interpolation) points and the method can be easilyextended to find k-saddles or modified for other purposes, e.g., to compute constrained k-saddles.

Keywords Saddles · Local minimax method · Virtual geometric objects · Convergence ·Implementation

Mathematics Subject Classification 65N20 · 58E05 · 35A15 · 49M05

Zhaoxiang Li: Supported in part by NSF of China (Nos. 11771298 and 11671251).

B Zhaoxiang [email protected]

B Jianxin [email protected]

Bingbing [email protected]

1 Math. Dept., Shanghai Normal University, Shanghai, China

2 Math. Dept., Texas A&M University, College Station, TX 77843, USA

123

Page 2: A Local Minimax Method Using Virtual Geometric Objects ...j.zhou/LJZ-Virt-1.pdf · complexity of the method, gives us a great flexibility to choose preferred smooth curves for different

Journal of Scientific Computing (2019) 78:202–225 203

1 Introduction

It has been physically observed and mathematically verified that excited states, as unstablelocal equilibria, with various configurations and performance indices at different energylevels exist in many excitation/reaction/transition processes in physics, chemistry, biology,etc. Nowadays new (synchrotronic, laser, etc.) technologies can be developed to induce, reachor control some of those excited states so that they become long-lived as to be stable forpractical purposes. Such excited states are called metastable. Thus search and study of thosemetastable states for new applications are of great interests in modern science and advancedengineering [1–22] and call for the development of numerical algorithms to compute suchsolutions.

Let H be a Hilbert space with its inner product 〈·, ·〉 and norm ‖ · ‖. Let J : H → R

be a C1 functional. A point u∗ ∈ H is called a critical point of J if its Frechet derivative,J ′(u∗) = 0. The most well-studied critical points are the local extrema of J on which theclassical calculus of variations and numerical methods focus. Critical points u∗ that are notlocal extrema of J are called saddle points, or saddles, i.e., in any neighborhood of u∗ thereare v,w s.t. J (v) < J (u∗) < J (w). An index-k saddle point or k-saddle is a critical pointthat is a local maximum of J in a k-dimensional subspace and a local minimum of J in thecorresponding k-co-dimensional subspace. Critical points correspond to local equilibriumstates in a physical process. Thus mathematically, a ground state as a stable local equilibriumis a localminimumpoint or 0-saddle, excited states, as unstable local equilibria, correspond tosaddles and metastable states are among the first few saddles. Comparing to a local minimumcomputation, numerical search for saddles is much more challenging due to their instabilityand multiplicity.

Many algorithms are proposed in computational physics/chemistry/biology to find 1-saddles in various applications [3–5,7–17,21–28]. Those algorithms can be classified inmany different ways depending on different natures of the problems or people’s knowledgeabout the problems, such as:

(1) Finite-dimensional versus infinite dimensional;(2) Unconstrained versus constrained;(3) Local optimization versus global optimization;(4) Knowing (two localminima) the starting state ur (reactant) and the final state u p (product)

versus knowing only (a local minimum) the starting state ur or W-type versus M-type;(5) Autonomous system versus non autonomous system;(6) Using only J ′ versus using both J ′ and J ′′;(7) min1−cod −max1−d versus max1−d −min1−cod ;(8) Algorithm stability: with versus without dissipation low;(9) Algorithm adaptivity: for a specific problem versus for a general class of problems, etc.

As a motivation to this paper, we consider finding soliton solutions to a semilinearSchrödinger equation, which leads to solving a non-autonomous semilinear elliptic PDE:

− �u(x) + λu(x) + κ|x |r |u(x)|p−1u(x) = 0, (1.1)

in H10 (�) = W 1,2

0 (�) where � is an open bounded domain in Rn and p > 1, λ, κ, r are

prescribed parameters. Its variational functional is

J (u) =∫

[1

2(|∇u(x)|2 + λu2(x)) + κ

p + 1|x |r |u(x)|p+1

]dx . (1.2)

123

Page 3: A Local Minimax Method Using Virtual Geometric Objects ...j.zhou/LJZ-Virt-1.pdf · complexity of the method, gives us a great flexibility to choose preferred smooth curves for different

204 Journal of Scientific Computing (2019) 78:202–225

k−saddle

k−saddle

Fig. 1 Function profiles of M-type (left) where ∩-shape in [v1, . . . , vk ], M-shape in [v1, . . . , vk ]⊥ versusW-type (right) where ∪-shape in [v1, . . . , vk ]⊥, W-shape in [v1, . . . , vk ]. They are not upside-down to eachother due to the significant difference in space dimensions

Thus solutions to (1.1) correspond to critical points of J in (1.2). Let 0 < μ1 <

μ2 < · · · be the eigenvalues of −� satisfying zero Dirichlet boundary condition (B.C.)and {v1, v2, . . .} be their corresponding eigenfunctions. The system (1.1) is called focus-ing (M-type) if κ < 0 and −μk+1 < λ < −μk (k = 0, 1, 2, . . . , μ0 = −∞)and defocusing (W-type) if κ > 0 and μk < −λ < μk+1 (k = 1, 2, . . . ,). Thosetwo cases are very different in their physical natures and mathematical structures, referFig. 1. For both types, 0 is the only k-saddle. All non-trivial saddles have index > k(< k) for M-type (W-type). In particular, for the M-type with λ > −μ1, J has a moun-tain pass structure and 0 is the only local minimum so the mountain pass/linking typeapproaches or methods can be applied [29–31]. However when λ < −μ1, J does nothave a mountain pass structure, the mountain pass type approach or methods cannot beapplied; In particular for the W-type, J has two local minima but no mountain pass struc-ture, thus the mountain pass/linking type approach or algorithms are not applicable. Inthe literature, those two cases have to be treated by two very different variational meth-ods.

Due to its high nonlinearity/multiplicity of the problem in an infinite dimensional space,finding an initial guess sufficiently close to an unknown saddle is commonly viewedas too difficult to do so. Thus Newton-type or other local convergence based methodsalone even they are very fast are not suitable for solving such multiple solution prob-lems. By our experience and analysis [32], although having used in the literature forsolving finite-dimensional problems, a max1−d −min1−cod algorithm is unstable for find-ing a saddle in an infinite-dimensional space. Since a stable algorithm can always befollowed by a Newton or other fast locally convergent method to accelerate its conver-gence, in this paper, we focus on developing a new stable local min-max algorithm (LMM)in Sects. 2 and 3 using only J ′ for finding unconstrained k-saddles (k = 1, 2, . . .) ofvery different variational problems in Hilbert spaces. It is known that the energy dis-sipation law is an important property for an algorithm related dynamics to be stable.However for the algorithm to converge, when a numerical approximation of the dynamicsis involved, the dissipation law alone is not enough, one needs a stronger energy dissi-pation law, called stepsize rule, in particular, in an infinite-dimensional space. Hence weestablish LMM’s mathematical justification including a solution characterization, a stepsizerule and a convergence result in Sects. 2, 4 and 5. In Sect. 6, the algorithm is imple-mented and tested first on several benchmark problems commonly used in the literatureto show its stability and efficiency, and then on our model problem with two very dif-ferent W-type and M-type variational structures. Since the geometric objects used in thealgorithm are virtual, we show how easily those geometric objects can be defined with-out knowing their explicit expressions or representative points. We can also use somespecial ones, e.g., those defined by the Nehari manifold, to speedup algorithm conver-gence.

123

Page 4: A Local Minimax Method Using Virtual Geometric Objects ...j.zhou/LJZ-Virt-1.pdf · complexity of the method, gives us a great flexibility to choose preferred smooth curves for different

Journal of Scientific Computing (2019) 78:202–225 205

2 A Local Minimax Characterization of 1-Saddles

Let P(t, s) be a t-parametrized family of smooth curves in variable s connecting ur to u ptwhere ur is a local minimum of J and u pt can be another local minimum u p or a fixed ormoving point with J (u pt ) ≤ J (ur ) and ‖ur − u pt ‖ > δ > 0. Thus for each t ≥ 0, P(t, s)is a smooth curve in the variable s. We may assume 0 ≤ s ≤ 1 with P(t, 0) = ur andP(t, 1) = u pt . For each t ≥ 0, let s(t) ∈ (0, 1) be the first local maximum of J (P(t, s)) ins. Since ur is a local minimum of J (P(t, s)) in s and J (u pt ) ≤ J (ur ), such an s(t) alwaysexists with 0 < δ < s(t) < 1 for some δ > 0 if ur is nondegenerate. We have

d J (P(t, s))

ds|s=s(t) = J ′(P(t, s(t)))P ′

s (t, s(t)) = 0. (2.1)

Once the value s(t) and the direction P ′s (t, s(t)) are specified as in (2.1), for the t-parametrized

family of curves P(t, s) to evolve in t in a regular way or to avoid a sliding, we need to assignamoving direction. Let Ht be the hyperplane normal to P ′

s(t, s(t)) at P(t, s(t)) and called thenormal plane of the curve P(t, s) at P(t, s(t)) [33]. Since P ′

t (t, s(t)) defines the direction ofthe t-parametrized family of curves P(t, s) moving away locally from the point P(t, s(t)),for this evolution in t to be regular or to avoid a sliding, the moving direction should be onHt . By (2.1), J ′(P(t, s(t))) ∈ Ht . Thus we set P ′

t (t, s(t)) = −J ′(P(t, s(t)))/Ct where andthroughout of the paper we denote Ct = max{1, ‖J ′(P(t, s(t)))‖}. Here a negative sign isused since we hope that the curves P(t, s) evolves in t along certain negative gradient flow,so that the value of J (P(t, s(t)) will be strictly decreasing (or obey the energy dissipationlaw), and the scalar Ct is introduced to enhance the stability of this evolution in searchingfor an unstable 1-saddle. Thus we propose the following 1-saddle search system:

〈J ′(P(t, s(t))), P ′s (t, s(t))〉 = 0, (2.2)

P ′t (t, s(t)) = −J ′(P(t, s(t)))/Ct (2.3)

starting from an initial point P(0, s(0)) on a given initial smooth curve P(t, s). It is importantto note that this system is a dynamics of t-parametrized points P(t, s(t)) starting from aninitial point P(0, s(0)) not a dynamics of t-parametrized family of smooth curves P(t, s).So there are infinitely many smooth curves satisfying the system (2.2)–(2.3) and we do nothave to know their explicit expressions or representative points. For this reason, we call thosecurves virtual. This feature will be clearer in the algorithm implementation. It reduces thecomplexity of the method, gives us a great flexibility to choose preferred smooth curvesfor different purposes or to satisfy certain constraints if exist and also enables us to easilyextend the method for finding k-saddles. In addition, we assume that the scalar function0 < s(t) < 1 is locally Lipschitz continuous. Since for each t > 0, the Eq. (2.2) can beused to solved for s(t), a local maximum of J (P(t, s))) in s, by using the implicit functiontheorem, a condition can always be proposed so that s is locally C1.

Note that (2.2) is achieved by taking a local maximum of J along the curve P(t, s) in s and(2.3) indicates that this system follows a negative gradient flow and leads to a local minimumof J (P(t, s(t))) in t . Thus the system (2.2)–(2.3) is a new local minimax principle for a1-saddle. Modifications of the system (2.2)–(2.3) can be developed for other purposes. Alsodifferent discrete realizations of this system in t may lead to different numerical algorithmsfor finding a 1-saddle of different types of functionals. When a discretization is used alongt , since a 1-saddle is an unstable solution, we do not want to go too fast to loose algorithmstability in a search process. It is known that the energy dissipation law is important foralgorithm stability, but it alone is not enough for an algorithm to converge, in particular, in an

123

Page 5: A Local Minimax Method Using Virtual Geometric Objects ...j.zhou/LJZ-Virt-1.pdf · complexity of the method, gives us a great flexibility to choose preferred smooth curves for different

206 Journal of Scientific Computing (2019) 78:202–225

infinite-dimensional space. We need to setup a stepsize rule, a stronger version of the energydissipation law.

Lemma 2.1 (Stepsize Rule) If P(t0, s(t0)) is not a critical point, then there exists s0 > 0 s.t.when 0 < t ′ < s0, we have a stepsize rule

J (P(t0 + t ′, s(t0 + t ′))) − J (P(t0, s(t0))) <−t ′

4‖J ′(P(t0, s(t0)))‖2/Ct0 .

Furthermore, if P(tk, s(tk)) → P(t0, s(t0)) as tk → t0, then there exists N > 0 s.t. when0 < t ′ < s0/2, k > N, we have a uniform stepsize rule

J (P(tk + t ′, s(tk + t ′))) − J (P(tk, s(tk))) <−t ′

4‖J ′(P(tk, s(tk)))‖2/Ctk .

Proof Note

P(t0 + t ′, s(t0 + t ′)) − P(t0, s(t0))

= P ′t (t0, s(t0))t

′ + P ′s(t0, s(t0))(s(t0 + t ′) − s(t0)) + o(t ′ + |s(t0 + t ′) − s(t0)|)

= −J ′(P(t0, s(t0)))t′/Ct0 + P ′

s (t0, s(t0))(s(t0 + t ′)− s(t0)) + o(t ′ + |s(t0 + t ′) − s(t0)|)

and we assume that s(t) is locally Lipschitz continuous, |s(t0 + t ′) − s(t0)| ≤ �0t ′. Thus‖P(t0 + t ′, s(t0 + t ′)) − P(t0, s(t0))‖ = O(|t ′|) and we have

J (P(t0 + t ′, s(t0 + t ′))) − J (P(t0, s(t0)))

= 〈J ′(P(t0, s(t0))), P(t0 + t ′, s(t0 + t ′)) − P(t0, s(t0))〉+ o(‖P(t0 + t ′, s(t0 + t ′)) − P(t0, s(t0))‖)

= 〈J ′(P(t0, s(t0))), P′t (t0, s(t0))t

′ + P ′s(t0, s(t0))(s(t0 + t ′) − s(t0))〉 + o(|t ′|)

= − t ′‖J ′(P(t0, s(t0)))‖2/Ct0 + o(|t ′|) (by(2.2) and (2.3)).

Hence there is s0 > 0 s.t. when 0 < t ′ < s0, we have

J (P(t0 + t ′, s(t0 + t ′))) − J (P(t0, s(t0))) <−t ′

4‖J ′(P(t0, s(t0)))‖2/Ct0 .

Since J , J ′ are continuous, the last conclusion follows directly from P(tk, s(tk)) →P(t0, s(t0)) as tk → t0 and the stepsize rule in the first part. ��Theorem 2.1 (Local Minimax Characterization) Let t0 = arg loc-mint>0 J (P(t, s(t))).Then P(t0, s(t0)) is a saddle point.

Proof Suppose not, since P(t0, s(t0)) is a local maximum of J along the smooth curveP(t0, s) in s, it cannot be a local minimum of J either. Then by Lemma 2.1, there existss0 > 0 s.t. when 0 < t ′ < s0, we have

J (P(t0 + t ′, s(t0 + t ′))) − J (P(t0, s(t0))) <−t ′

4‖J ′(P(t0, s(t0)))‖2/Ct0 ,

which leads to a contradiction to t0 = arg loc-mint>0 J (P(t, s(t))). ��Remark 2.1 It is clear that the above local minimax characterization implies a local minimaxmethod (LMM) for a 1-saddle. It is also interesting to indicate the significant differencesbetween LMM (2.2)–(2.3) and the well-known minimax method characterized by the moun-tain pass lemma ([34], 1973). In the latter, u pt = u p is fixed and it assumes a mountain

123

Page 6: A Local Minimax Method Using Virtual Geometric Objects ...j.zhou/LJZ-Virt-1.pdf · complexity of the method, gives us a great flexibility to choose preferred smooth curves for different

Journal of Scientific Computing (2019) 78:202–225 207

pass structure, the min and max are all in the global sense, i.e., P(t, s(t)) must be a globalmaximum of J on a continuous curve P(t, s) and the min must be taken over all possiblecontinuous paths connecting ur and u p . Numerically this is impossible. While in the former,u pt is flexible and it does not assume a mountain pass structure, the min and max are all inlocal sense, and the paths can be any one-parameter family of smooth curves connecting urand u pt . Thus LMM (2.2)–(2.3) has clear advantages in numerical algorithm implementation.It is described as below.

3 A Local MinimaxMethod (LMM)

Assume that ur is a local minimum of J and u p0 is either another fixed local minimum of Jor any point with J (u p0) ≤ J (ur ). Given λ, ε, τk > 0 with τk → 0,

∑∞k=0 τk = +∞.

Step 1 Let P(0, s) be a given smooth curve s.t. P(0, 0) = ur , P(0, 1) = u p0 and s(0) bethe first local maximum of J (P(0, s)), namely, s(0) = argmaxs>0 J (P(0, s)). Setk = 0, t0 = 0, uk = P(tk, s(tk));

Step 2 Evaluate dk = J ′(uk). If ‖dk‖ < ε,then output uk and stop, otherwise continue.Step 3 For t ′ = λ

2m ,m = 1, 2, . . ., let uk(t ′) = P(tk, s(tk)) − t ′dk/Ck . Let u pk ,t ′ be a localminimum of J or chosen in a continuous way in t ′ s.t. J (u pk ,t ′) ≤ J (ur ). Constructa smooth curve P(tk + t ′, s) passing through ur , uk(t ′), u pk ,t ′ . Use s(tk) as an initialguess to solve for s(tk + t ′) = argmaxs>0 J (P(tk + t ′, s)). Denote

t ′k = max{t ′ = λ

2m≤ τk |m ∈ N , J (P(tk + t ′, s(tk + t ′))) − J (uk) ≤ − t ′

4‖dk‖2/Ctk },

tk+1 = tk + t ′k and u(tk+1) = P(tk+1, s(tk+1));Step 4 Set k = k + 1 and go to Step 2.

Remark 3.1 (1) In Step 3, the first line is a discrete version of the condition (2.3),namely, a finite difference in t-variable P(t + t ′, s(t)) ≈ P(t, s(t)) − t ′ J ′(t, s(t))is used to approximate the condition (2.3), −J ′(P(t, s(t)) = P ′

t (t, s(t)) =limt ′→0

P(t+t ′,s(t))−P(t,s(t))t ′ . Thus for algorithm stability, we choose the approximation

u(t ′) = P(t + t ′, s(t)) ≈ P(t, s(t)) − t ′ J ′(t, s(t))/Ct .

The condition (2.2) is always satisfied at a local maximum J (P(tk + t ′, s) in s. Sincealong a smooth path J (P(t, s)), there can be multiple selections of local maxima of J in s.Each one is called a peak selection. An oscillation among different peak selections destroysthe continuity of P in s. In order to avoid such an unnecessary oscillation and to hold thecontinuity of P(t, s(t)) in t , it is very important in Step 3 of the algorithm to use the previousvalue s(tk) as an initial guess to consistently trace a peak selection. On the other hand, inStep 1, by starting with s > 0 small, we intend to choose the peak selection which is closestto the local minimum ur , though it is not necessary;

(2) In an extreme case, just like a negative gradient flow may be stuck at a saddle u∗ notnecessarily a local minimum (0-saddle), the algorithm may be stuck at a k-saddle u∗(k > 1) not necessarily a 1-saddle. In this rare case, we check J ′′(u∗). It will have at leasttwo negative eigenvalues. We may use their eigenfunctions, two decreasing directions ofJ at u∗, to construct a decreasing direction v− orthogonal to P ′

s (t∗, s(t∗)), then move

along u∗ + t ′v− to stay away from u∗ and continue the algorithm iteration to search fora 1-saddle. Since the algorithm is descending, it will not come back to u∗.

123

Page 7: A Local Minimax Method Using Virtual Geometric Objects ...j.zhou/LJZ-Virt-1.pdf · complexity of the method, gives us a great flexibility to choose preferred smooth curves for different

208 Journal of Scientific Computing (2019) 78:202–225

4 Convergence Analysis

To prove the convergence of the algorithm, we need the Palais–Smale (PS) condition

Definition 4.1 A function J ∈ C1(H ,R) satisfies the Palais–Smale (PS) condition if anysequence {uk} ⊂ H with {J (uk)} bounded and J ′(uk) → 0 has a convergent subsequence.

Using the notation in the new algorithm, we have the following convergence result.

Theorem 4.1 Let tk+1 = tk + t ′k where 0 < t ′k ≤ τk, τk → 0,∑∞

k=0 τk = +∞ and uk =P(tk, s(tk)) be the sequence generated by the algorithm with ε = 0. Then

(a) t ′k J ′(uk)/Ctk → 0 as k → ∞; Furthermore if J satisfies the PS condition, then(b) There is a subsequence uk j → u∗, a saddle of J ;

Denote K = {u ∈ H : J ′(u) = 0, J (u) = J (u∗)}.(c) Any convergent subsequence of {uk} converges to a point of K ;(d) If in addition, ‖P ′

s (tk, s(tk))‖ is bounded and the scalar function s(t) is Lipschitz contin-uous, then {uk} ∩ K �= ∅ is connected and dis(uk, K ) → 0 as k → ∞.

Proof Since J (ur ) < J (uk+1) < J (uk), we have

J (ur ) − J (u0) ≤ limk→∞ J (uk) − J (u0) =

∞∑k=0

(J (uk+1) − J (uk)) < −1

4

∞∑k=0

t ′k‖J ′(uk)‖2/Ctk .

Thus t ′k‖J ′(uk)‖2/Ctk → 0. It leads to t ′k J ′(uk)/Ctk → 0 if ‖J ′(uk)‖ ≥ 1 or ‖J ′(uk)‖ < 1and 0 < t ′k < τk → 0. So (a) is verified.

To prove (b), suppose there is η > 0 s.t. ‖J ′(uk)‖ > η, k = 1, 2, . . ., we may assumeη < 1

2 . Thus Ctk = 1, η > η2 and we have

J (ur ) − J (u0) < −1

4

∞∑k=0

t ′k‖J ′(uk)‖2/Ctk ≤ −η2

4

∞∑k=0

t ′k = −η2

4

∞∑k=0

|tk+1 − tk |. (4.1)

That is, {tk} is a Cauchy sequence. We obtain tk → t∗. By the continuity, we have uk =P(tk, s(tk)) → u∗ = P(t∗, s(t∗)) and ‖J ′(P(t∗, s(t∗)))‖ ≥ η, i.e., u∗ is not a criticalpoint. Then Lemma 2.1 states that there exist s0 > 0, N > 0 s.t. when k > N , we haves02 ≤ t ′k < τk → 0, a contradiction. Thus there must be a subsequence J ′(uki ) → 0. Since{J (uk)} is bounded and J ′(uk j ) → 0, by the PS condition, (b) is proved.

To establish (c), let {uk j } ⊂ {uk} be any subsequence with uk j → u′. If J ′(u′) �= 0, wecan pass to a subsequence if necessary, then t ′k J ′(uk)/Ctk → 0 and ‖J ′(uk j )‖ > η > 0 leadto t ′k j → 0, a contradiction to Lemma 2.1 under uk j → u′. Thus J ′(u′) = 0 must hold. Since

J (uk) is monotonically decreasing, it leads to J (u′) = J (u∗), i.e., u′ ∈ K .To prove (d), by (b), we have

uk+1 − uk = P(tk+1, s(tk+1)) − P(tk, s(tk))

= P ′t (tk, s(tk))t

′k + P ′

s (tk, s(tk))(s(tk+1) − s(tk)) + o(|t ′k | + |s(tk+1) − s(tk)|)= −J ′(uk)t ′k/Ctk + P ′

s(tk, s(tk))(s(tk+1) − s(tk)) + o(|t ′k | + |s(tk+1) − s(tk)|).Note that s(t) is Lipschitz continuous, as k → ∞, we have 0 < t ′k ≤ τk → 0,

‖J ′(uk)/Ctk‖ ≤ 1, ‖P ′s(tk, s(tk))‖ ≤ M, |s(tk+1) − s(tk))| ≤ �(t ′k) → 0

and then there is �1 > 0 s.t.

‖uk+1 − uk‖ = ‖P(tk+1, s(tk+1)) − P(tk, s(tk))‖ ≤ �1(t′k) → 0 as k → ∞. (4.2)

123

Page 8: A Local Minimax Method Using Virtual Geometric Objects ...j.zhou/LJZ-Virt-1.pdf · complexity of the method, gives us a great flexibility to choose preferred smooth curves for different

Journal of Scientific Computing (2019) 78:202–225 209

It is clear from (b) that u∗ ∈ {uk} ∩ K �= ∅. We let I ⊂ N = {1, 2, . . .} and call∑i∈I ‖ui+1 − ui‖ the total distance traveled by the subsequence {ui }i∈I . For any given

η > 0, let i ∈ I ⊂ N denote the whole index set in N with ‖J ′(ui )‖ > η. Since J (ur ) <

J (uk+1) < J (uk), similar to (4.1), we have

J (ur ) − J (u0) ≤∑i∈I

[J (ui+1) − J (ui )] < −1

4

∑i∈I

t ′i‖J ′(ui )‖2/Cti ≤ −η2

4

∑i∈I

t ′i (4.3)

and then by (4.2) ∑i∈I

‖ui+1 − ui‖ ≤ �1∑i∈I

t ′i < +∞, (4.4)

i.e., the total distance traveled by {ui }i∈I is finite.If there is δ3 > 0 s.t. there are infinitely many points u in {uk} with ‖u − u∗‖ > δ3.

By the inequality (4.2), for any 0 < δ1 < δ2 < δ3, there is K0 > 0 s.t. when k > K0,‖uk+1 −uk‖ < 1

2 (δ2 − δ1), i.e., two consecutive points uk, uk+1 cannot jump over the regionR12 = {u ∈ H : δ1 < ‖u∗ − u‖ < δ2}. Since u∗ is a limit point of {uk} and there areinfinitely many points u in {uk} satisfying ‖u∗ −u‖ > δ3, the sequence {uk} has to enterR12

for infinitely many times. Denote {uki } ⊂ {uk} ∩R12. So each time {uk} entersR12 at uki ithas to travel at least 1

2 (δ2 − δ1) distance before it leavesR12. Thus the total distance traveledby {uki } is infinite. However by (4.4), for any δ > 0, the total distance traveled by all thepoints u ∈ {ui } with ‖J ′(u)‖ > δ is finite. Thus there must be a subsequence {uki ′ } ⊂ {uki }s.t. J ′(uki ′ ) → 0 and J (uki ′ ) → J (u∗). By the PS condition, there is a subsequence, denotedby {uki ′ } again, s.t. uki ′ → u′ with J ′(u′) = 0 and J (u′) = J (u∗). Then u′ ∈ K andδ1 ≤ ‖u∗ − u′‖ ≤ δ2. Since 0 < δ1 < δ2 < δ3 can be any such numbers and u∗ can be anylimit point of {uk} by (c), it follows that (1) for any δ3 > 0, there are at most a finite numberof points u ∈ {uk} s.t. dis(u, K ) > δ3 and (2) {uk} ∩ K must be connected. Finally (d) isverified. ��Remark 4.1 The condition that the scalar function s(t) is Lipschitz continuity in (d) of The-orem 4.1 is important to obtain a sequence convergence from a subsequence convergence.Image that if s(t) ∈ (0, 1) is only continuous but oscillates for infinitely many times withunbounded |s′(tk)|, then there could be many subsequences of {s(tk)} converging to dif-ferent points in [0, 1]. In this case we can have only the subsequence convergence (b) butnot the sequence convergence (d) in Theorem 4.1. Conclusion (d) implies uk → u∗ if u∗is isolated. The condition that ‖P ′

s(tk, s(tk))‖ is bounded will be verified in our numericalimplementation for both M and W-types.

We have established the mathematical justification of LMM using virtual curves for finding1-saddles. If 2-saddles or the first few saddles are interested, the above LMM can be easilyextended to find 2-saddles or more while the convergent results still hold.

5 Finding 2-Saddles

Let P(t, s1, s2) be a t-parametrized family of smooth 2D-surfaces in variables s1, s2 con-necting ur , u pt , us where ur , u pt are the same as before and us is a previously foundproper 1-saddle of J (not a local maximum of J on P(t, s1, s2). Such a structure isnecessary for a 2-saddle to exist and to be numerically computable). We may assumeP(t, 0, 0) = ur , P(t, 1, 0) = u pt , P(t, 0, 1) = us . Denote s = (s1, s2) and P(t, s(t)) =

123

Page 9: A Local Minimax Method Using Virtual Geometric Objects ...j.zhou/LJZ-Virt-1.pdf · complexity of the method, gives us a great flexibility to choose preferred smooth curves for different

210 Journal of Scientific Computing (2019) 78:202–225

P(t, s1(t), s2(t)) where s(t) = (s1(t), s2(t)) is a local maximum point of J on the surfaceP(t, s) closest to us . By the chain rule, we have

J ′(P(t, s1(t), s2(t)))P′s1(t, s(t)) = 0, J ′(P(t, s1(t), s2(t)))P

′s2(t, s(t)) = 0. (5.1)

Let Ht = {v ∈ H : v⊥P ′s1(t, s(t)), v⊥P ′

s2(t, s(t))} be the normal space of the surface P(t, s)at P(t, s(t)). Since P ′

t (t, s(t)) is the direction of this t-parametrized family of surfacesmovingaway locally from the point P(t, s(t)) and we want this evolution to be nonsliding and alsoto follow a negative gradient flow, we choose P ′

t (t, s(t)) = −J ′(P(t, s(t)))/Ct ∈ Ht . Thuswe propose the following 2-saddle search system

J ′(P(t, s(t)))P ′s1(t, s(t)) = 0, J ′(P(t, s(t)))P ′

s2(t, s(t)) = 0, (5.2)

P ′t (t, s(t)) = −J ′(P(t, s(t)))/Ct (5.3)

starting from an initial point P(0, s(0)) on an initial surface P(0, s). Again the system (5.2)–(5.3) is a system of points P(t, s(t)) not a dynamics of surfaces P(t, s). There are infinitelymany surfaces satisfying the system and we do not have to know their explicit expressions orrepresentative points (too expensive to trace such points in numerical computation). For thisreason, we call those surfaces virtual. We denote P ′

s(t, s(t)) = (P ′s1(t, s(t)), P

′s2(t, s(t)))

and assume s(t) be locally Lipschitz continuous in t , since for each t ≥ 0, the equationsin (5.2) can be used to solved for s(t), by the implicit function theorem, conditions canalways be proposed so that s(t) is locally C1. Then a (uniform) stepsize rule similar toLemma 2.1, a 2-saddle characterization similar to Theorem 2.1 and a convergence resultsimilar to Theorem 4.1 can be proved in the same ways.

LMMforFinding2-Saddles.Letus be apreviously found1-saddle, following the algorithmdescribed in Sect. 3: In Step 1, let P(0, s) where s = (s1, s2) be a preferred initial smoothsurface s.t. P(0, (0, 0)) = ur , P(0, (0, 1)) = u p0 , P(0, (1, 0)) = us . Find a local maximumP(0, s(0)) of J on P(t, s) closest to us ; In Step 3, u(t ′) is the same. Instead of using ur , uk(t ′)and u pk ,t ′ to construct a new preferred smooth curve, this timewe use ur , us, uk(t ′) and u pk ,t ′to construct a newpreferred surface. Then use s(tk) as an initial guess to solve for s(tk+t ′) . . ..Other parts in LMM remain the same.

LMM can be further modified to find higher index saddles in a similar way if needed.

6 Numerical Implementation and Examples

Since those geometric objects are virtual, there are infinitely many ways to implement.

6.1 Using Lines and Planes for M-type Problems

For the M-type functional (1.2) with λ > −λ1, there is only one local minimum at ur = 0with J (0) = 0. However due to its M-type structure, lims→∞ J (sut ) = −∞ along eachdirection ut , we always have J (st ut ) < J (0) = 0 when st > 0 is large. So to find a 1-saddle,we can use a straight line P(t, s) = sut where ‖P ′

s(t, s(t))‖ = ‖ut‖ = 1 is bounded andfind the first local maximum s(t) > 0 of J (sut ) in s.

Once a 1-saddle us is found, to find a 2-saddle, for each direction ut⊥us , we can usethe three points ur , us and ut to construct a plane P(t, s1, s2) where ‖P ′

s(t, s(t))‖2 =‖(P ′

s1(t, s(t)), P′s2(t, s(t)))‖2 = ‖us‖2 + ‖ut‖2 = ‖us‖2 + 1 is bounded and find a local

maximum of J on this plane closest to us . Due to the M-type structure, a point u pt on theplane P(t, s1, s2) with J (u pt ) < J (ur ) can always be easily found. But such an information

123

Page 10: A Local Minimax Method Using Virtual Geometric Objects ...j.zhou/LJZ-Virt-1.pdf · complexity of the method, gives us a great flexibility to choose preferred smooth curves for different

Journal of Scientific Computing (2019) 78:202–225 211

is irrelevant for us to find a local maximum closest to us on the plane P(t, s1, s2). Such astrategy using straight lines and planes leads to the local minimaxmethod successfully devel-oped for finding saddles of many M-type problems [4,10,35]. Also since the term P ′

s (t, s(t))is always bounded in this case, the convergence result Theorem 4.1 can be applied.

6.2 Using Quadratic Curves or Surfaces

6.2.1 Using Quadratic Curves for 1-Saddles

To use quadratic curves for finding 1-saddles, in Step 3 of the algorithm, we let

L(tk, t′, x, y) = ur + x(uk(t

′) − ur ) + y(u pk,t ′ − ur ) (6.1)

be an xy-plane passing through the points ur , uk(t ′), u pk,t ′ and then let x = c1(s), y = c2(s)be the parametrized equations of a quadratic curve in the xy-plane s.t. (c1(0), c2(0)) =(0, 0), (c1(1), c2(1)) = (0, 1) and (c1(s), c2(s)) = (1, 0) for some 0 ≤ s ≤ 1. There aremany ways to do so, e.g., in our numerical computation, we take

(x − 1

2

)2

+(y − 1

2

)2

= 1

2, (6.2)

which leads to

c1(s) = 1√2cos

(3π

4(2s − 1)

)+ 1

2, c2(s) = 1√

2sin

(3π

4(2s − 1)

)+ 1

2(6.3)

and we obtain

P(tk + t ′, s) = ur + c1(s)(uk(t′) − ur ) + c2(s)(u pk ,t ′ − ur ). (6.4)

Thus the term P ′s (t, s(t)) = c′

1(s)(uk(t′)−ur )+c′

2(s)(u pk ,t ′−ur ) is clearly bounded if the twoterms uk(t ′), u pk ,t ′ are selected to be bounded. It is interesting to note that (c1(0), c2(0)) =(0, 0), (c1( 13 ), c2(

13 )) = (1, 0), (c1(1), c2(1)) = (0, 1), i.e., (6.3) remains the same for dif-

ferent points ur , uk(t ′), u pk ,t ′ . A local maximum of the function g(s) = J (P(tk + t ′, s)) for0 ≤ s ≤ 1 can be easily computed by many 1-D numerical optimization methods, e.g., in ournumerical computation, it is done by calling the Matlab subroutine Fminunc with the initialguess s = 1

3 . To consistently trace local maxima of such g, we should always use the s-values(tk) saved in the previous iteration as an initial guess to find a local maximum of a new g.

6.2.2 Using Quadratic Surfaces for 2-Saddles

To use quadratic surfaces for finding 2-saddles, in Step 3 of LMM, we let

L(tk, t′, x, y, z) = ur + x(uk(t

′) − ur ) + y(u pk ,t ′ − ur ) + z(us − ur ) (6.5)

be a 3D-space passing through the points ur , uk(t ′), u pk ,t ′ , us and then let s = (s1, s2), x =c1(s), y = c2(s), z = c3(s) be the parametrized equations of a quadratic surface in xyz-spacepassing through the four points (0,0,0), (1,0,0), (0,1,0) and (0,0,1). There are many ways todo so, e.g., in our numerical computation, we take

(x − 1

2

)2

+(y − 1

2

)2

+(z − 1

2

)2

= 3

4, (6.6)

123

Page 11: A Local Minimax Method Using Virtual Geometric Objects ...j.zhou/LJZ-Virt-1.pdf · complexity of the method, gives us a great flexibility to choose preferred smooth curves for different

212 Journal of Scientific Computing (2019) 78:202–225

which leads to spherical coordinates

c1(s) = 1

2+

√3

2sin(s1) cos(s2), c2(s) = 1

2+

√3

2sin(s1) sin(s2),

c3(s) = 1

2+

√3

2cos(s1) (6.7)

passing through the four points (0, 0, 0), (1, 0, 0), (0, 1, 0), (0, 0, 1) and we obtain

P(tk + t ′, s) = ur + c1(s)(uk(t′) − ur ) + c2(s)(u pk ,t ′ − ur ) + c3(s)(us − ur ). (6.8)

Thus the term P ′s (t, s(t)) = ur + c′

1(s)(uk(t′) − ur ) + c′

2(s)(u pk ,t ′ − ur ) + c′3(s)(us −

ur )(s) is clearly bounded if the two terms uk(t ′), u pk ,t ′ are selected to be bounded. Notethat (6.7) remains the same for different points ur , u pk ,t ′ , uk(t

′), us . A local maximum ofg(s) = J (P(tk + t ′, s)) in s = (s1, s2) can be easily computed by many finite-dimensionalunconstrained optimization methods, e.g., in our numerical computation, it is done by calling

the Matlab subroutine Fminunc with the initial guess s = (s1, s2) = (cos−1(−√3

3 ), −π4 ). To

consistently trace local maxima of such g, we should always use the s-value s(tk) saved inthe previous iteration as an initial guess to find a local maximum of a new g.

Lemma 6.1 For (1.2) with λ < 0, κ = +1 (W-type), r = 1, {uk(t ′)} generated by LMM isbounded.

Proof By LMMwe have uk(t ′) = uk − t ′kdk/Ck where uk = P(tk, s(tk)). Thus we only haveto show that {uk} is bounded. Denote uk = tk uk with ‖uk‖H1

0 (�) = 1. It suffices to provethat tk is bounded. By LMM we have

J (ur ) < J (tk uk) =∫

[t2k2

(|∇uk |2 − λ|uk |2) + t p+1k

p + 1|u|p+1

]dx < J (u0) < 0. (6.9)

If there is a subsequence {uki } s.t.∫�

|uki |2dx → 0, then∫�

|∇uki |2dx → 1. It leads to∫

t2k2

(|∇uki |2 − λ|uki |2)dx > 0,

a contradiction to (6.9), J (tk uk) < 0. Thus∫�

|uk |2dx > δ0 for some δ0 > 0. Then by theHolder inequality

∫�

|uk |p+1dx > δ1 for some δ1 > 0. Since (6.9) leads to

∫�

t p−1k

p + 1|u|p+1dx < −

∫�

1

2(|∇uk |2 − λ|uk |2)dx < +∞,

tk must be bounded. ��

6.3 Using the Nehari Manifold

As in the algorithm description, the geometric objects P(t, s) are virtual, we do not haveto know their expressions and we only need to find the points P(t, s(t)). To further exploresuch an advantage, let us observe the definitions of the quadratic curves (6.1)–(6.4) andquadratic surfaces (6.5)–(6.8) closely, we can see that those geometric objects are defined bythe intersections of two types of geometric objects, where (6.1)/(6.5) defines a 2D-plane/3D-space passing through certain required points, while (6.2)/(6.6) can be viewed as certainconstrained manifolds on which those points stay. Thus to find the point P(t, s(t)), it is not

123

Page 12: A Local Minimax Method Using Virtual Geometric Objects ...j.zhou/LJZ-Virt-1.pdf · complexity of the method, gives us a great flexibility to choose preferred smooth curves for different

Journal of Scientific Computing (2019) 78:202–225 213

necessary to find an explicit expression (6.3)/(6.7), we can simply do 2D/3D-maximizationon (6.1)/(6.5) subject to the constraint (6.2)/(6.6). It is quite natural to think if a special typeof geometric objects can be used to speedup algorithm convergence. We turn to the NeharimanifoldN defined in (6.10), since all nonzero critical points are onN and it actually puts noextra constraints to the problem. Hence we replace (6.2)/(6.6) byN as an auxiliary constraintto define geometric objects whose explicit expressions are not available. TheNehari manifoldis defined by

N = {tvv : tv > 0, ‖v‖ = 1, 〈∇ J (tvv) , v〉 = 0}. (6.10)

Then in Step 3 of the algorithm, we let

P(tk + t ′, s) = L(tk, t′, s) ∩ N , (6.11)

where L(tk, t ′, s) is defined in (6.1)/(6.5) and can be easily expanded to form a (k+1)D-space,whileN remains the same as in (6.10). A local maximum of J on P = L∩N can be found bylow-dimensional equality constrained optimization methods, such as the Matlab subroutineFmincon in L subject toN . Other steps in the algorithm remain the same. Thus this methodcan be easily extended to find a k-saddle.

Remark 6.1 We have shown that the virtual geometric objects P(t, s) are easy to constructfor us to find the points P(t, s(t)) and then unconstrained saddles. Such a flexibility alsoleads us to consider numerical methods for finding equality constrained saddles. This willbe discussed in a subsequent paper [25].

6.4 Numerical Examples

Part 1 Using quadratic curves, surfaces, etc. defined in Sect. 6.2.

6.4.1 Tests on Finite-Dimensional Benchmark Problems

We now test LMM on some benchmark problems that are commonly used by finite-dimensional algorithms in the literature.

Example 6.1 Consider finding 1-saddles of a W-type function

J (x, y) = (1 − x2 − y2)2 + y2/(x2 + y2). (6.12)

J has two local minima ur = (−1, 0) and u p = (1, 0), two 1-saddles (0, 1) and (0,−1), anda local maximum (0, 0). We choose an initial guess (x0, y0)with−1 < x0 < 1 and constructa quadratic curve P(0, s) connecting ur , (x0, y0), u p . Since J is symmetric about the y-axis,P(0, s(0)) = (0, y∗

0 ) for some y∗0 . If y0 > 0 is selected, LMM finds the 1-saddle (0, 1) and

if y0 < 0 is selected, LMM yields another 1-saddle (0,−1). See Fig. 2 and Table 1, whereand below NIt denotes the iteration number.

Example 6.2 Find 1-saddles of a W-type function with a triple-well potential function

J (x, y) = 3e−x2−(y− 13 )2− 3e−x2−(y− 5

3 )2− 5e−(x−1)2−y2− 5e−(x+1)2−y2

+ 0.2x4+ 0.2

(y − 1

3

)4

. (6.13)

123

Page 13: A Local Minimax Method Using Virtual Geometric Objects ...j.zhou/LJZ-Virt-1.pdf · complexity of the method, gives us a great flexibility to choose preferred smooth curves for different

214 Journal of Scientific Computing (2019) 78:202–225

Fig. 2 Contours of the function(6.12) with two local minima(open square) at (−1, 0), (1, 0),two 1-saddles (asterisk) at(0, −1), (0, 1) and one localmaximum (O) at (0, 0)

−2 −1 0 1 2−2

−1

0

1

2

Table 1 Numerical results on(6.12) by LMM in nth iteration

NIt Saddle ‖J ′(·)‖1 (1.2104e−07,0.718750000000) 1.389770507813

2 (8.101e−08,0.968750000000) 0.238403320313

3 (− 2.603e−08,0.998550415039) 0.011571476313

4 (− 2.603e−08,0.999996849578) 2.520325611e−05

5 (− 2.603e−08,0.999999999985) 5.207054057e−09

−2 −1 0 1 2−2

−1

0

1

2

A

B

C

SP1

SP2

−1.5 −1 −0.5 0 0.5 1

0

0.5

1

1.5

2

Fig. 3 (Left) Contours of the function (6.13) with 3 local minima (open square), a local maximum (pluse) andthree 1-saddles (asterisk). (Right) Contours of the Muller potential (6.14) with three local minima A, B, C andtwo 1-saddles SP1 and SP2

Table 2 Numerical results of (6.13) by LMM with the stepsize s = 0.05

Numerical solution ‖J ′(·)‖ ε NIt

1-Saddle 1 (0.000000016,− 0.315828508) 9.623269e−06 1e−05 36

1-Saddle 2 (0.617273601, 1.1027353229) 9.555707e−06 1e−05 27

1-Saddle 3 (− 0.6172852268, 1.1027945717) 3.630596e−04 4e−04 18

J has threeminima at (1.048054984,−0.0420936582), (−1.0480549862,− 0.0420936637),(− 6.0e−08, 1.5370819624), a local maximum at (− 4.94e−07, 0.5191867341), and three1-saddles as shown in Fig. 3(left) and Table 2.

123

Page 14: A Local Minimax Method Using Virtual Geometric Objects ...j.zhou/LJZ-Virt-1.pdf · complexity of the method, gives us a great flexibility to choose preferred smooth curves for different

Journal of Scientific Computing (2019) 78:202–225 215

Example 6.3 We compute 1-saddles of the Muller function

J (x, y) =4∑

i=1

Kie[ai (x−x0i )2+bi (x−x0i )(y−y0i )+ci (y−y0i )2] (6.14)

where the vectors K = (−200, −100, −170, 15), a = (−1, −1, −6.5, 0.7), b = (0, 0,11, 0.6), c = (−10, −10, −6.5, 0.7), x0 = (1, 0, −0.5, −1), y0 = (0, 0.5, 1.5, 1). Threelocal minima A = (−0.55822363677964, 1.44172582715450), B = (0.62349936799644,0.02803774112374), C = (−0.05001028122511, 0.46669409222955) are found by the Mat-lab subroutine Fminunc and then two 1-saddles SP1 = (0.212486571139517, 0.292988327843969) and SP2 = (−0.822001541054890, 0.624312898567439) are found by LMM in 12and 432 iterations, respectively, with ‖J ′(·)‖ < 10−4 as shown in Fig. 3(right).

Remark 6.2 In our numerical computation, if ur = B and u p = C are used, LMM producesthe 1-saddle SP1; if ur = A and u p = C are assigned, then LMM yields the 1-saddle SP2.However, if we set ur = A and u p = B, LMMmay find the 1-saddle SP1 or SP2 dependingon the initial guess selected. By comparison, it is much harder to find 1-saddles of the Mullerfunction than other benchmark problems. Except the rough solutions in [11], we have notseen any more precise results in the literature.

Example 6.4 As the last benchmark test, we consider finding 1-saddles and 2-saddles of a3D M-type function for given r �= 0,

J (x1, x2, x3) = x21 + 2x22 + 3x23 − r2(x21 + x22 + x23 − 1

)2. (6.15)

By a direct computation, J has a local minimum at A = (0, 0, 0), two local maxima at B =(0, 0, ( 3

2r2+ 1)1/2), C = (0, 0,−( 3

2r2+ 1)1/2), two 1-saddles at (±( 1

2r2+ 1)1/2, 0, 0), and

two 2-saddles at (0,±( 1r2

+1)1/2, 0). Let r2 = 0.5. We take ur = (0, 0, 0), u p = (±3, 0, 0)since J (0, 0, 0) = −0.5 > J (±3, 0, 0) = −23.

Finding 1-saddles. Let v0 be an initial point and P(0, s(0)) be the local maximum of Jalong an initial curve P(0, s) connecting ur , v0, u p . If we take the advantage of knowing thelocal maxima at B = (0, 0, 2), C = (0, 0,−2), and directly set v0 = P(0, s(0)) = B or C ,then LMM will be stuck at B or C . Thus we follow Remark 3.1 (2) and use the eigenvectorcorresponding to the second eigenvalue of J ′′(B) or J ′′(C) to stay away from B or C . LMMwill continue and find two 1-saddles us11

, us21shown in Table 3.

Finding 2-saddles.Once 1-saddles us11, us21

are found, we process to find 2-saddles. Let usbe us11

or us21. Choose an initial guess v0 and construct an initial surface P(t, s) connecting

ur , u p, us, v0. Again we may take the advantage of knowing the local maxima at B =(0, 0, ( 3

2r2+ 1)1/2), C = (0, 0,−( 3

2r2+ 1)1/2) and directly set P(0, s(0)) = B or C . Then

LMM will be stuck at B or C . Thus we follow Remark 3.1 (2) and use the eigenvectorcorresponding to the third eigenvalue of J ′′(B) or J ′′(C) to stay away from B or C . LMMwill continue and find two 2-saddles us12

, us22shown in Table 3.

In all the above numerical examples, if LMM with ε = 10−4 is followed by a Newtonmethod for one iteration, we will get ‖J ′(·)‖ < 10−8.

6.4.2 Solving Infinite-Dimensional W/M-type Problems

We start to solve our model problem (1.1) with its energy functional J in (1.2) for 1-saddlesand 2-saddles in H1

0 (�) = W 1,20 (�). The steps in the algorithm description are the same

123

Page 15: A Local Minimax Method Using Virtual Geometric Objects ...j.zhou/LJZ-Virt-1.pdf · complexity of the method, gives us a great flexibility to choose preferred smooth curves for different

216 Journal of Scientific Computing (2019) 78:202–225

Table 3 Numerical data of saddles of (6.15) by LMM with ε = 10−6

Numerical solution ‖J ′(·)‖ NIt

1-Saddle 1 (1.41421356, 0, −0.000000146) 5.84614e−07 18

1-Saddle 2 (−1.41421356, 0, −0.000000172) 6.90673e−07 21

2-Saddle 1 (0.0000000076, 1.73205081, 0) 1.19898e−07 6

2-Saddle 2 (0.0000000076, −1.73205081, 0) 1.19898e−07 6

except a significant difference in evaluating the gradient∇ J (·).When H is finite-dimensional,e.g., H = R

n , one can simply take a partial derivative with respect to each variable of J to get∇ J (x) = J ′(x) = (J ′

x1(x), . . . , J′xn (x)) ∈ R

n ; However when H is infinite-dimensional,

e.g., H = W 1,2(�), since J : W 1,20 (�) → R,

J ′(u) = −�u(x) + λu(x) + κ|x |r |u(x)|p−1u(x) ∈ W−1,2(�).

It cannot be used as a search direction in W 1,2(�). Thus we use the Riesz representationtheorem to find its canonical dual d = ∇ J (u) ∈ W 1,2

0 (�) of J ′(u) by solving

− �d(x) + d(x) = J ′(u), (6.16)

a linear elliptic PDE. It can be solved by many numerical solvers, such as using a finite-difference method (FDM), a finite-element method (FEM) or a boundary element method(BEM), etc. Since our main effort in this paper is to develop new algorithm, its mathematicaljustification and implementation in an infinite-dimensional space, when numerical solvers areavailable for solving a linear sub-problem, we simply apply those solvers handily availablewithout deliberating their CPU time or other computation cost in this paper. In the followingexamples, we choose p = 3 and � = (−1, 1)2 wherein 20,736 triangle elements are gener-ated by the Matlab subroutine Initmesh and (6.16) is solved by calling the Matlab subroutineAssempde. However, in order to clearly see its mesh grids and contours of a solution in onefigure, a coarse mesh is used to redraw the profile and its contours. In our numerical compu-tation, we first use LMM until ε < 10−2, then follow it by a Newton method to accelerateits local convergence [36].

When an initial direction v0 is selected, we use an initial guess u0 = sv0 where the scalars is chosen so that 〈J ′(sv0), v0〉 = 0 or otherwise as indicated.

For the W-type problem.When 0 < μk < −λ < μk+1 where μk is the k-th eigenvalue of−�, then 0 is a k-saddle of J and all other saddles u∗ will have J (u∗) < J (0) = 0. We firstuse a negative gradient method to locate a local minimum ur where the first eigenfunctionof −� is used as an initial guess. When the domain is symmetric about the original, theproblem is even-symmetric. Thus u p = −ur is another local minimum. Then Step 1 of thealgorithm leads to u0 = P(0, s(0)) = 0 a saddle. Since J ′(u0) = 0 ⇒ d0 = ∇ J (u0) = 0,the algorithm is stuck. Thus we follow Remark 3.1 (2) and check the eigenvalue of the linearoperator J ′′(0) = (−� + λI ). This can be easily done either by hand when the domain is asquare/disk or by a numerical solver otherwise. If J ′′(0) has only one negative eigenvalue,then 0 is a 1-saddle. It is done. Otherwise J ′′(0) has at least two negative eigenvalues. Thesecond eigenvalue is doubled. The eigenfunction corresponding to the first eigenvalue givesthe same direction as u p − ur . So it is useless. There will be two linearly independenteigenfunctions v2, v3 corresponding to the second double eigenvalue that are orthogonal tothe direction of u p − ur . Replacing d0, respectively, by the two eigenfunctions v2, v3, the

123

Page 16: A Local Minimax Method Using Virtual Geometric Objects ...j.zhou/LJZ-Virt-1.pdf · complexity of the method, gives us a great flexibility to choose preferred smooth curves for different

Journal of Scientific Computing (2019) 78:202–225 217

Table 4 Numerical data of Case 1 (W-type) by LMM-Newton method using quadratic curves/surfaces

r Minimax method Newton method (ε = 1e−6) Refer Fig. 4

‖d‖ ε ‖J ′(·)‖∞ NIt ‖d‖ ‖J ′(·)‖∞ NIt

A Local Min 0 2.9211e−04 3e−4 0.0010 39 4.8190e−07 1.7290e−06 6 (a)

1-Saddle 1 0 4.9457e−04 5e−4 0.0055 22 4.3536e−07 4.8296e−06 7 (b)

1-Saddle 2 0 8.7338e−04 1e−3 0.0111 24 7.6886e−07 9.7744e−06 7 (c)

A 2-Saddle 0 1.6851e−04 2e−4 0.0018 8 4.0563e−07 4.3677e−06 6 (d)

A Local Min 1 4.6356e−04 5e−4 0.0079 24 4.0804e−07 6.9170e−06 7 (e)

1-Saddle 1 1 1.7927e−04 2e−4 0.0024 25 5.3217e−07 5.7109e−06 6 (f)

1-Saddle 2 1 3.6374e−04 4e−4 0.0032 26 8.7481e−07 7.8076e−06 6 (g)

A 2-Saddle 1 2.5209e−04 3e−4 0.0024 9 6.0699e−07 5.7883e−06 6 (h)

A Local Min 4 0.0010 2e−3 0.0286 175 7.1432e−07 2.2333e−05 8 (i)

1-Saddle 1 4 8.0420e−04 9e−4 0.0115 41 7.0791e−07 1.0105e−05 7 (j)

1-Saddle 2 4 6.5449e−04 7e−4 0.0064 58 5.7614e−07 5.6744e−06 7 (k)

A 2-Saddle 4 6.7800e−04 7e−4 0.0048 12 6.0498e−07 4.2186e−06 7 (l)

Table 5 Numerical data of Case 1 (W-type) by LMM using curves/surfaces on Nr ‖d‖ ε ‖J ′(·)‖∞ J NI t Refer Fig. 4

A Local Min 0 8.3234e−05 1e−4 1.5439e−03 −125.7122 13 (a)

1-Saddle 1 0 0.0009 1e−3 0.0063 − 28.5326 14 (b)

1-Saddle 2 0 0.0019 2e−3 0.0162 − 24.3429 14 (c)

2-Saddle 0 8.0899e−05 1e−4 0.0011 − 0.0303 6 (d)

A Local Min 1 3.8283e−04 4e−4 2.7002e−02 − 295.4945 10 (e)

1-Saddle 1 1 4.8885e−04 5e−4 3.7831e−03 − 48.7282 12 (f)

1-Saddle 2 1 3.1071e−04 4e−4 2.8164e−03 − 41.9083 19 (g)

2-Saddle 1 2.2167e−04 3e−4 2.0852e−03 − 0.0420 9 (h)

A Local Min 4 7.2575e−04 1e−3 1.6138e−02 − 4016.9 19 (i)

1-Saddle 1 4 6.0710e−04 8e−4 1.2526e−02 − 203.9375 32 (j)

1-Saddle 2 4 6.1039e−04 7e−4 1.1783e−02 − 180.1284 35 (k)

2-Saddle 4 3.6015e−04 5e−4 2.8650e−03 − 0.0914 12 (l)

algorithm produces two 1-saddles. Once a 1-saddle us is found, it can be used to find a2-saddle.

Case 1 In (1.2), we set κ = 1, λ = −20 and r = 0, 1, 4, respectively. A local minimum,two 1-saddles and a 2-saddle are found and shown in Tables 4, 5 and Fig. 4.

Case 2 In (1.2), we set κ = 1, λ = −20e|x |2 and r = 0, 1, 4, respectively. A localminimum, two 1-saddles and a 2-saddle are found and shown in Tables 6, 7 and Fig. 5.

For the M-type problem. We set κ = − 1, λ ≥ 0 in (1.2). Thus ur = 0 is the only localminimum and u pt can be easily selected as (1) a moving point along each direction withJ (u pt ) < J (0) = 0, or (2) a fixed point u p with J (u p) < J (0) = 0, or (3) mixing (1) and(2), i.e., in LMM, we fix u pt for several steps and then change it for the next several steps.For (1), the first few saddles can be found by using lines and planes, etc., as described inSect. 6.1.While for (3), we present here numerical 1-saddles and 2-saddles found surprisingly

123

Page 17: A Local Minimax Method Using Virtual Geometric Objects ...j.zhou/LJZ-Virt-1.pdf · complexity of the method, gives us a great flexibility to choose preferred smooth curves for different

218 Journal of Scientific Computing (2019) 78:202–225

Fig. 4 Saddles of (1.2) in W-type Case 1 presented in Tables 4 and 5. a J = − 125.71, ‖u‖∞ =4.4 at (− 0.0005,− 0.0032), r = 0, b J = − 28.533, ‖u‖∞ = 3.44 at (− 0.5002,− 0.0023), r =0, c J = − 24.343, ‖u‖∞ = 3.51 at (− 0.3897,− 0.3953), r = 0, d J = − 0.0303, ‖u‖∞ =0.68 at (0.4978, 0.5000), r = 0, e J = − 295.49, ‖u‖∞ = 9.60 at (− 0.0005,− 0.0032), r = 1,f J = − 48.728, ‖u‖∞ = 4.72 at (− 0.4541,− 0.0038), r = 1, g J = − 41.908, ‖u‖∞ =4.72 at (0.3403, 0.3443), r = 1, h J = − 0.042, ‖u‖∞ = 0.80 at (0.4978, 0.5000), r = 1, iJ = − 4016.9, ‖u‖∞ = 61.3 at (− 0.0000,− 0.0000), r = 4, j J = − 203.94, ‖u‖∞ =11.53 at (0.4077,− 0.0025), r = 4, k J = − 180.13, ‖u‖∞ = 11.4 at (0.2938, 0.2905), r = 4, l J =− 0.0914, ‖u‖∞ =1.18 at (− 0.4955,− 0.4955), r = 4

Table 6 Numerical data of Case 2 (W-type) by LMM-Newton method using quadratic curves/surfaces

r Minimax method Newton method (ε = 1e−6) Refer Fig. 5‖d‖ ε ‖J ′(·)‖∞ NIt ‖d‖ ‖J ′(·)‖∞ NIt

A Local Min 0 7.9501e−05 8e−5 0.0042 91 8.8593e−07 4.5026e−05 5 (a)

1-Saddle 1 0 3.9487e−04 4e−4 0.0176 73 9.4960e−07 4.2338e−05 6 (b)

1-Saddle 2 0 0.0662 7e−2 0.9063 59 3.8354e−07 7.5605e−06 12 (c)

A 2-Saddle 0 0.0225 3e−2 0.6271 67 8.3317e−07 9.2134e−06 10 (d)

A Local Min 1 2.4181e−04 3e−4 0.0114 75 5.8150e−07 2.7377e−05 6 (e)

1-Saddle 1 1 7.3727e−04 8e−4 0.0379 65 6.4900e−07 3.3335e−05 7 (f)

1-Saddle 2 1 0.0618 7e−2 0.8864 56 9.9421e−07 1.9102e−05 11 (g)

A 2-Saddle 1 0.0384 4e−2 0.5264 58 8.7745e−07 5.3654e−06 10 (h)

A Local Min 4 0.0019 2e−3 0.0521 180 6.0471e−07 1.6776e−05 8 (i)

1-Saddle 1 4 0.0579 6e−2 0.7365 63 9.1399e−07 1.1637e−05 11 (j)

1-Saddle 2 4 0.0187 2e−2 0.2899 71 8.0516e−07 1.2514e−05 10 (k)

A 2-Saddle 4 0.0763 8e−2 1.2755 52 9.6832e−07 5.3671e−05 13 (l)

by LMM using quadratic curves and surfaces as described in Sect. 6.2. First an initial guessu0 ∈ H1

0 (�) can be easily selected by solving

− �u0(x) = c(x), (6.17)

123

Page 18: A Local Minimax Method Using Virtual Geometric Objects ...j.zhou/LJZ-Virt-1.pdf · complexity of the method, gives us a great flexibility to choose preferred smooth curves for different

Journal of Scientific Computing (2019) 78:202–225 219

Table 7 Numerical data of Case 2 (W-type) by LMM using curves/surfaces on Nr ‖d‖ ε ‖J ′(·)‖∞ J NI t Refer Fig. 5

A Local Min 0 0.0004 1e−3 0.0147 − 505.8239 18 (a)

1-Saddle 1 0 1.9806e−4 2e−4 0.0107 − 318.1348 34 (b)

1-Saddle 2 0 0.0031 5e−3 0.0647 − 230.4429 41 (c)

2-Saddle 0 0.0175 2e−2 0.2287 − 198.2465 32 (d)

A Local Min 1 1.8581e−04 2e−4 1.0533e−02 − 847.5476 25 (e)

1-Saddle 1 1 4.4257e−04 5e−4 1.8674e−02 − 432.6319 53 (f)

1-Saddle 2 1 0.0083 1e−2 0.1218 − 329.5795 44 (g)

2-Saddle 1 0.0303 0.05 0.3451 − 238.7703 32 (h)

A Local Min 4 0.0008 1e−3 0.0315 − 6753.9851 26 (i)

1-Saddle 1 4 0.0097 0.01 0.1252 − 1219.4057 40 (j)

1-Saddle 2 4 0.0536 0.06 1.4855 − 1051.9 56 (k)

2-Saddle 4 0.0197 0.06 0.8036 − 421.1175 20 (l)

Fig. 5 Saddles of (1.2) in Case 2 presented in Tables 6 and 7. a J = − 505.82, ‖u‖∞ =5.93 at (− 0.6455,− 0.6462), r = 0, b J = − 318.13, ‖u‖∞ = 5.84 at (− 0.6590,− 0.6473), r =0, c J = − 230.44, ‖u‖∞ = 5.91 at (0.6486, 0.6479), r = 0, d J = − 198.25, ‖u‖∞ =5.74 at (− 0.6654,− 0.6604), r = 0, e J = − 847.55, ‖u‖∞ = 10.27 at (− 0.0005,− 0.0032), r =1, f J = − 432.63, ‖u‖∞ = 6.45 at (− 0.5035, 0.0038), r = 1, g J = − 329.58, ‖u‖∞ =6.60 at (− 0.4226,− 0.4269), r = 1, h J = − 238.77, ‖u‖∞ = 6.12 at (− 0.6091,− 0.6119), r =1, i J = − 6754.0, ‖u‖∞ = 67.12 at (− 0.0005,− 0.0032), r = 4, j J = − 1219.4, ‖u‖∞ =16.77 at (− 0.3897, 0.0019), r = 4, k J = − 1051.9, ‖u‖∞ = 16.73 at (− 0.2785,− 0.2755), r = 4, lJ =− 421.12, ‖u‖∞ =9.10 at (− 0.4617,− 0.4583), r = 4

where c(x) = −1, 0, 1, respectively, are used to control the concavity of u0 at x and con-sequently its peak(s) or symmetry. For example, to select its peak location (x1, x2), weset c(x1, x2) = −1 if |(x1, x2) − (x1, x2)| ≤ 0.1 and c(x1, x2) = 0 otherwise. Thenwe set u p = t0u0 where t0 > 0 is selected s.t. J (u p) < J (0) = 0. The initial curveP(0, s) = ur + su p is actually a straight line and P(0, s(0)) = s0u p where s0 > 0 can

123

Page 19: A Local Minimax Method Using Virtual Geometric Objects ...j.zhou/LJZ-Virt-1.pdf · complexity of the method, gives us a great flexibility to choose preferred smooth curves for different

220 Journal of Scientific Computing (2019) 78:202–225

Table 8 Numerical data of Case 3 (M-type) by LMM-Newton method using quadratic curves/surfaces

r Minimax method Newton method (ε = 1e−6) ReferFig. 6‖d‖ ε ‖J ′(·)‖∞ NIt ‖d‖ ‖J ′(·)‖∞ NIt

A 1-Saddle 0 9.7252e−06 1e−5 1.0660e−04 93 4.6886e−10 3.5382e−09 1 (a)

2-Saddle 1 0 9.8132e−06 1e−5 2.0675e−04 193 5.6813e−10 8.4482e−09 1 (b)

2-Saddle 2 0 0.0048 5e−3 0.0895 89 2.3253e−09 9.1912e−08 2 (c)

A 1-Saddle 1 9.7564e−07 1e−6 3.3929e−05 123 (d)

2-Saddle 1 1 9.7928e−06 1e−5 2.7780e−04 137 7.7088e−11 1.5337e−09 1 (e)

2-Saddle 2 1 9.7972e−06 1e−5 2.9769e−04 238 9.4872e−10 1.9698e−08 1 (f)

A 1-Saddle 4 5.7653e−06 6e−6 5.7845e−04 103 3.1841e−11 1.9634e−09 1 (g)

2-Saddle 1 4 9.8790e−06 1e−5 7.1571e−04 107 1.0905e−10 6.3743e−09 1 (h)

2-Saddle 2 4 1.9687e−05 2e−5 0.0011 137 5.7580e−10 3.3462e−08 1 (i)

Table 9 Numerical data of Case 3 (M-type) by LMM using curves/surfaces on Nr ‖d‖ ε ‖J ′(·)‖∞ J NI t Refer Fig. 6

A 1-Saddle 0 9.6443e−06 1e−5 1.0583e−04 13.2091 49 (a)

2-Saddle 1 0 9.7583e−06 1e−5 1.8202e−04 56.1944 90 (b)

2-Saddle 2 0 1.9573e−03 2e−3 8.9010e−02 61.3222 18 (c)

A 1-Saddle 1 9.3523e−07 1e−6 2.1692e−05 36.0204 118 (d)

2-Saddle 1 1 9.7463e−06 1e−5 2.9809e−04 88.2238 114 (e)

2-Saddle 2 1 9.6591e−06 1e−5 2.6570e−04 97.8384 183 (f)

A 1-Saddle 4 4.5802e−06 5e−6 4.5802e−06 68.6989 97 (g)

2-Saddle 1 4 9.2042e−06 1e−3 7.1599e−04 139.9730 94 (h)

2-Saddle 2 4 9.1813e−06 1e−5 6.0878e−04 142.8290 107 (i)

be easily computed by 〈J ′(s0u p), u p〉 = 0. In computing the 1-saddles below us , we set(x1, x2) = (0.6, 0.6).

Once ur , u p and us are obtained, to compute a 2-saddle,wefirst generate an initial guess u0by solving (6.17) and then a normalization. We set c(x1, x2) = −1 if |(x1, x2) − (x1, x2)| ≤0.1, c(x1, x2) = 1 if |(x1, x2) − (x ′

1, x ′2)| ≤ 0.1 and c(x1, x2) = 0 otherwise. In our

numerical computation below, we use (x1, x2) = (0.6, 0.6) and (x ′1, x ′

2) = (−0.6,−0.6)to compute 2-saddle 1 and (x1, x2) = (0.6, 0) and (x ′

1, x ′2) = (−0.6, 0) to compute 2-saddle

2.Case 3 In (1.2), we set κ = −1, λ = 1 and r = 0, 1, 4, respectively. A 1-saddle and two

2-saddles are found by LMM and shown Tables 8, 9 and Fig. 6.Case 4 In (1.2), we set κ = −1, λ = e|x |2 and r = 0, 1, 4, respectively. A 1-saddle and

two 2-saddles are found by LMM and shown in Tables 10, 11 and Fig. 7.Part 2Using the geometric objects constructed by the intersection of a (k+1)D-space and

the Nehari manifold N = {tvv : ‖v‖ = 1, 〈∇ J (tvv) , v〉 = 0} as described in Sect. 6.3,where for the functional (1.2),

〈∇ J (tvv) , v〉 =∫

[−tvv(x)�v(x) + tvλ(x)v2(x) + κ |x |r t pv |v(x)|p+1] dx . (6.18)

123

Page 20: A Local Minimax Method Using Virtual Geometric Objects ...j.zhou/LJZ-Virt-1.pdf · complexity of the method, gives us a great flexibility to choose preferred smooth curves for different

Journal of Scientific Computing (2019) 78:202–225 221

Fig. 6 Saddles of (1.2) in M-type Case 3 presented in Tables 8 and 9. a J = 13.209, ‖u‖∞ =3.72 at (− 0.0005,− 0.0032),r = 0,b J = 56.194, ‖u‖∞ = 5.57 at (0.3972, 0.3990), r = 0,c J =61.322, ‖u‖∞ = 5.76 at (0.5003, 0.0031), r = 0,d J = 36.020, ‖u‖∞ = 6.20 at (0.3972, 0.3990),r =1,e J = 88.224, ‖u‖∞ = 7.00 at (− 0.4964,− 0.5015), r = 1, f J = 97.838, ‖u‖∞ =7.37 at (− 0.5626, 0.4793),r = 1,g J = 68.699, ‖u‖∞ = 8.81 at (0.6835, 0.6838), r = 4,h J =139.97, ‖u‖∞ = 8.91 at (− 0.6958 − 0.6921), r = 4, i J = 142.83, ‖u‖∞ = 9.00 at (0.7064, 0.6983), r =4

Table 10 Numerical data of Case 4 (M-type) by LMM-Newton method using quadratic curves/surfaces

r Minimax method Newton method (ε = 1e−6) ReferFig. 7‖d‖ ε ‖J ′(·)‖∞ NIt ‖d‖ ‖J ′(·)‖∞ NIt

A 1-Saddle 0 1.8566e−05 2e−5 2.1878e−04 87 1.4820e−09 1.2554e−08 1 (a)

2-Saddle 1 0 9.6554e−06 1e−5 2.1932e−04 286 7.0749e−07 4.8347e−06 1 (b)

2-Saddle 2 0 0.0021 3e−3 0.0363 116 6.8862e−07 1.2222e−05 8 (c)

A 1-Saddle 1 1.9405e−05 2e−5 4.9654e−04 149 7.4012e−10 1.2125e−08 1 (d)

2-Saddle 1 1 5.6344e−05 6e−5 8.7060e−04 143 6.4147e−07 2.0887e−05 5 (e)

2-Saddle 2 1 4.9929e−05 5e−5 7.4802e−04 256 8.9621e−07 1.3427e−05 5 (f)

A 1-Saddle 4 9.3046e−06 1e−5 9.6464e−04 129 1.4650e−10 9.0770e−09 1 (g)

2-Saddle 1 4 9.6804e−05 1e−4 0.0096 203 6.3601e−07 6.3072e−05 5 (h)

2-Saddle 2 4 4.7882e−05 5e−5 0.0033 271 8.5941e−07 5.9592e−05 4 (i)

For the W-type Problem. This part is the same as the corresponding part in Sect. 6.4.2.except that we use the constraint 〈∇ J (u) , u/‖u‖〉 = 0 to replace the Eq. (6.2) or (6.6) inStep 3 of the algorithm, i.e., we let

P(tk + t ′, s) = L(tk, t′, s1, s2) ∩ N , (6.19)

123

Page 21: A Local Minimax Method Using Virtual Geometric Objects ...j.zhou/LJZ-Virt-1.pdf · complexity of the method, gives us a great flexibility to choose preferred smooth curves for different

222 Journal of Scientific Computing (2019) 78:202–225

Table 11 Numerical data of Case 4 (M-type) by LMM using curves/surfaces on Nr ‖d‖ ε ‖J ′(·)‖∞ J NI t Refer Fig.7

A 1-Saddle 0 9.4738e−06 1e−5 1.1227e−04 14.1293 49 (a)

2-Saddle 1 0 9.8686e−06 1e−5 2.4812e−04 60.0649 98 (b)

2-Saddle 2 0 9.6887e−04 1e−3 2.8354e−02 64.6892 31 (c)

A 1-Saddle 1 1.9652e−05 2e−5 4.7641e−04 39.4887 103 (d)

2-Saddle 1 1 4.8189e−05 5e−5 1.5092e−03 96.0497 103 (e)

2-Saddle 2 1 4.9700e−05 5e−5 1.5146e−03 106.0977 181 (f)

A 1-Saddle 4 9.3234e−06 1e−5 8.1918e−04 75.5768 106 (g)

2-Saddle 1 4 9.2800e−05 1e−4 6.1961e−03 153.4184 95 (h)

2-Saddle 2 4 4.6558e−05 5e−5 3.2680e−03 156.0572 107 (i)

Fig. 7 Saddles of (1.2) in M-type Case 4 presented in Tables 10 and 11. a J = 64.689, ‖u‖∞ =3.85 at (− 0.0005,− 0.0032),r = 0,b J = 60.065, ‖u‖∞ = 5.72 at (0.3869, 0.3870), r =0,c J = 64.689, ‖u‖∞ = 5.90 at (0.4928, 0.0017), r = 0,d J = 39.489, ‖u‖∞ =6.43 at (0.3869, 0.3870),r = 1,e J = 96.050, ‖u‖∞ = 7.22 at (0.4929, 0.4883), r = 1, f J =106.10, ‖u‖∞ = 7.59 at (− 0.5562, 0.4634),r = 1,g J = 75.578, ‖u‖∞ = 9.12 at (0.6949, 0.6911), r =4,h J = 153.42, ‖u‖∞ = 9.20 at (− 0.6991,− 0.6985), r = 4, i J = 156.060, ‖u‖∞ =9.29 at (− 0.7057, 0.7040), r = 4

where

L(tk, t′, s1, s2) = ur + s1(uk(t

′) − ur ) + s2(u pk ,t ′ − ur ) (6.20)

and u pk ,t ′ = u p = −ur . A local maximum of J on P can be found by a Matlab subroutineFmincon through min −J on L over s = (s1, s2) subject to the constraint N . After we findthe two 1-saddles us1 and us2 , we can pick either one of them, denoted by us to compute a2-saddle, i.e., we let

123

Page 22: A Local Minimax Method Using Virtual Geometric Objects ...j.zhou/LJZ-Virt-1.pdf · complexity of the method, gives us a great flexibility to choose preferred smooth curves for different

Journal of Scientific Computing (2019) 78:202–225 223

P(tk + t ′, s) = L(tk, t′, s1, s2, s3) ∩ N , (6.21)

where

L(tk, t′, s1, s2, s3) = ur + s1(uk(t

′) − ur ) + s2(u pk,t ′ − ur ) + s3(us − ur ) (6.22)

and u pk,t ′ = u p = −ur . A local maximum of J on P can be found by a Matlab subroutineFmincon through min −J on L over s = (s1, s2, s3) subject to the constraint N .

We re-do the Cases 1 and 2 by LMM and document their numerical data in Tables 5 and 7for convergence speed comparison.

For the M-type Problem. Since 0 is the only local minimum of J but not on the Neharimanifold N , it cannot be used. Thus this part is different from the corresponding part inSect. 6.4.2. A preprocessing is required to use the Nehari manifold N . We first use theMatlab subroutine Fmincon to minimize J subject to the constraint N to find a solution urwhich is actually a 1-saddle of J . By the symmetry of the problem, u pk ,t ′ = u p = −ur isanother 1-saddle of J or another local minimum of J on N .

To find a 2-saddle, an initial direction u0 is constructed as the same as in Sect. 6.4.2. bysolving (6.17). We use three points ur , u0, u p to construct an initial 2D-space

L(0, 0, s1, s2) = ur + s1(u0 − ur ) + s2(u p − ur )

and an initial curve is defined by P(0, s) = L(0, 0, s1, s2) ∩ N . The initial local maximumpoint P(0, s(0)) can be found by the Matlab subroutine Fmincon to minimize −J on L overs = (s1, s2) subject to the constraintN . Then LMM starts to run. In Step 3, we use the threepoints ur , u p and uk(t ′) to construct a 2D-space

L(tk, t′, s1, s2) = ur + s1(uk(t

′) − ur ) + s2(us − ur )

and the surface is defined by

P(tk + t ′, s) = L(tk, t′, s1, s2) ∩ N . (6.23)

The point uk+1 = P(tk+1, s(tk+1)) can be computed by the Matlab subroutine Fmincon tominimize −J on L over s = (s1, s2) subject to the constraint N . Other steps are the same.By choosing two different u0, LMM finds two 2-saddles us1 and us2 .

After the 2-saddles us1 and us2 are found, denote one of them by us , to find a 3-saddle, weconstruct an initial direction u0 as the same as in Sect. 6.4.2. by solving (6.17). Then throughthe four points ur , u p, us and u0, we construct an initial 3D-space

L(0, 0, s1, s2, s3) = ur + s1(u0 − ur ) + s2(u p − ur ) + s3(us − ur )

and an initial surface P(0, s) = L(0, 0, s1, s2, s3) ∩ N . The initial local maximum pointP(0, s(0)) can be found by the Matlab subroutine Fmincon to minimize −J on L overs = (s1, s2, s3) subject to the constraint N . Then LMM starts to run. In Step 3, through thefour points ur , u p, us and uk(t ′), we construct a 3D-space

L(tk, t′, s1, s2, s3) = ur + s1(uk(t

′) − ur ) + s2(u p − ur ) + s3(us − ur )

and the surface is defined by

P(tk + t ′, s) = L(tk, t′, s1, s2, s3) ∩ N . (6.24)

The point uk+1 = P(tk+1, s(tk+1)) can be computed by the Matlab subroutine Fmincon tominimize −J on L over s = (s1, s2, s3) subject to the constraint N . Other steps are thesame. LMM finds a 3-saddle. We re-do Cases 3 and 4. Their numerical data are documentedin Tables 9 and 11 for convergence speed comparison.

123

Page 23: A Local Minimax Method Using Virtual Geometric Objects ...j.zhou/LJZ-Virt-1.pdf · complexity of the method, gives us a great flexibility to choose preferred smooth curves for different

224 Journal of Scientific Computing (2019) 78:202–225

Note that the point uk(t ′) in each iteration we used to construct a space in both W-typeand M-type problems has been multiplied by a constant so that it is on the Nehari manifoldN . In the similar way, LMM can be easily extended to find higher index saddles. Due to thepage limit, we omit them here.

Final Remarks. The algorithm, LMM proposed and mathematically verified in this paperis general and flexible. It uses a flexible point u pt on virtual geometric objects such as curves,surfaces, etc., without their explicit expressions or representative points (too expensive totrace such points in computing k-saddles for k > 1), and thus forms its two distinctivefeatures from other methods in the literature. It can be modified to solve different types ofproblems in infinite-dimensional spaces for k-saddles. The results obtained in this paper laida foundation for further study of LMM on using different types of geometric objects, etc.,for computational efficiency, for computing equality constrained saddles or other purposes.Actually it has been modified to successfully find constrained k-saddles. Those results arereported in a subsequent paper [25].

References

1. Cameron, M., Kohn, R., Vanden-Eijnden, E.: The string method as a dynamical system. J. Nonlinear Sci.21, 193–230 (2009)

2. Cances, E., Legoll, F., Marinica, M.-C., Minoukadeh, K., Willaime, F.: Some improvements of theactivation-relaxation technique method for finding transition pathways on potential energy surfaces. J.Chem. Phys. 130, 114711 (2009)

3. Cerjan, C., Miller, W.: On finding transition states. J. Chem. Phys. 75, 2800 (1981)4. Choi, Y.S., McKenna, P.J.: A mountain pass method for the numerical solution of semilinear elliptic

problems. Nonlinear Anal. Theory Methods Appl. 20, 417–437 (1993)5. Du, Q., Zhang, L.: A constrained string method and its numerical analysis. Commun. Math. Sci. 7,

1039–1051 (2009)6. E, W., Vanden-Eijnden, E.: Towards theory of transition paths. J. Stat. Phys. 123, 503–523 (2005)7. E, W., Ren, W., Vanden-Eijnden, E.: Simplified and improved string method for computing the minimum

energy path in barrier-crossing events. J. Chem. Phys. 126, 164103 (2007)8. E, W., Zhou, X.: The gentlest ascent dynamics. Nonlinearity 24, 1831–1842 (2011)9. Henkelman, G., Jónsson, H.: A dimer method for finding saddle points on high dimensional potential

surfaces using only first derivatives. J. Chem. Phys. 111, 7010–7022 (1999)10. Li, Y., Zhou, J.: A minimax method for finding multiple critical points and its applications to nonlinear

PDEs. SIAM Sci. Comput. 23, 840–865 (2001)11. Miller, K., Brown, L.D.: Location of saddle points and minimum energy paths by a constrained simplex

optimization procedure. J. Theor. Chim. Acta 53, 75–93 (1979)12. Olsen, R.A., Kroes, G.J., Henkelman, G., Arnaldsson, A., Jónsson, H.: Comparison ofmethods for finding

saddle points without knowledge of the final states. J. Chem. Phys. 121, 9776–9792 (2004)13. Palmer, J.C., Martelli, F., Liu, Y., Car, R., Panagiotopoulos, A.Z., Debenedetti, P.G.: Metastable liquidliq-

uid transition in a molecular model of water. Nature 510, 385–388 (2014)14. Peters, B., Heyden, A., Bell, A.T., Chakrabortya, A.: A growing string method for determining transition

states: comparison to the nudged elastic band and stringmethods. J. Comput. Chem. Phys. 120, 7877–7886(2004)

15. Principi, E., Minicucci, M., Di Cicco, A., Trapananti, A., De Panfilis, S., Poloni, R.: Metastable phasediagram of Bi probed by single-energy x-ray absorption detection and angular dispersive x-ray diffraction.Phys. Rev. B 74, 064101 (2006)

16. Samanta, A., Weinan, E.: Optimization-based string method for finding minimum energy path. Commun.Comput. Phys. 14, 265–275 (2013)

17. Schlegel, H.B.: Exploring potential energy surfaces for chemical reactions: an overview of some practicalmethods. J. Comput. Chem. 24, 1514–1527 (2003)

18. Yang, G.H., Chen, J.B., Zhao, B., Pan, F.: Structural andmagnetic investigation of metastable alloy phasesin Bi–Co multilayers. J. Alloys Compd. 365, 43–48 (2004)

19. Yoon,W., Perepezko, J.H.: The effect of pressure onmetastable phase formation in the undercooled Bi–Snsystem. J. Mater. Sci. 23, 4300–4306 (1988)

123

Page 24: A Local Minimax Method Using Virtual Geometric Objects ...j.zhou/LJZ-Virt-1.pdf · complexity of the method, gives us a great flexibility to choose preferred smooth curves for different

Journal of Scientific Computing (2019) 78:202–225 225

20. Yoshida, K., Morimoto, C., Yamada, T.: New metastable alloy phase in Mn–Bi system produced bycrystallization of amorphous films. J. Cryst. Growth 58, 229–235 (1982)

21. Zhang, J., Du, Q.: Shrinking dimer dynamics and its applications to saddle point search. SIAM J. Numer.Anal. 50, 1899–1921 (2012)

22. Zhang, J., Du, Q.: Constrained shrinking dimer dynamics for saddle point search with constraints. J.Comput. Phys. 231, 4745–4758 (2012)

23. Grantham, W.: Gradient transformation trajectory following algorithms for determining stationarymin-max saddle points. Advances inDynamicGameTheory:NumericalMethods,Algorithms, andAppli-cations to Ecology and Economics. 9, 639–657 (2007)

24. Horak, J.: Constrainedmountain pass algorithm for the numerical solution of semilinear elliptic problems.Numer. Math. 98, 251–276 (2004)

25. Li, Z.X., Zhou, J.: A local minimax method using virtual geometric objects: part II–for finding equalityconstrained saddles. J. Sci. Comput. https://doi.org/10.1007/s10915-018-0775-x

26. Lin, L., Cheng, X., E,W., Shi, A.-C., Zhang, P.: A numerical method for the study of nucleation of orderedphases. J. Comput. Phys 229, 1797–1809 (2010)

27. Liu, X., Frazer, J., Tang, M.X.: Visualization and genetic algorithms in minimax theory for nonlinearfunctionals. J. Sci. Comput. 18, 49–68 (2003)

28. Sheppard, D., Terrell, R., Henkelmanal, G.: Optimization methods for finding minimum energy paths. J.Chem. Phys. 128, 134106 (2008)

29. Wang, Z.-Q., Zhou, J.: An efficient and stable method for computing multiple saddle points with symme-tries. SIAM J. Numer. Anal. 43, 891–907 (2005)

30. Zhou, J.: A local min-orthogonal method for finding multiple saddle points. J. Math. Anal. Appl. 291,66–81 (2004)

31. Zhou, J.: Instability analysis of saddle points by a local minimax method. Math. Comput. 74, 1391–1411(2005)

32. Wang, C.C., Zhou, J.: An orthogonal subspace minimization method for finding multiple solutions todefocusing Schrodinger equation with symmetries. NMPDE 29, 1778–1800 (2013)

33. Fabian, M., Habala, P., Hajek, P., Santalucia, V.M., Pelant, J., Zizler, V.: Functional analysis and infinite-dimensional geometry. Springer, New York (2001)

34. Ambrosetti, A., Rabinowitz, P.H.: Dual variational methods in critical point theory and applications. J.Funct. Anal. 14, 349–381 (1973)

35. Zhou, J.: Global sequence convergence of a localminimaxmethod for findingmultiple solutions inBanachspaces. Numer. Funct. Anal. Optim. 32, 1365–1380 (2011)

36. Wang, Z.-Q., Zhou, J.: A local minimax-Newton method for finding multiple saddle points with symme-tries. SIAM J. Numer. Anal. 42, 1745–1759 (2004)

123