Home Avoidance of the Lavrentiev gap for one-dimensional non-autonomous functionals with constraints
Article Open Access

Avoidance of the Lavrentiev gap for one-dimensional non-autonomous functionals with constraints

  • Carlo Mariconda ORCID logo EMAIL logo
Published/Copyright: October 2, 2024

Abstract

Consider a positive functional F ( y ) := t T L ( s , y ( s ) , y ( s ) ) 𝑑 s , defined on the space of Sobolev functions W 1 , p ( [ t , T ] ; n ) , where p 1 . This functional is minimized among functions y that may satisfy one or both endpoint conditions. The Lagrangian L is allowed to assume the value + . In numerous applications minimizers may not be explicit or even may not exist. In such circumstances, it is crucial to know that the infimum of F can be approximated using a sequence of Lipschitz functions that meet the given boundary conditions. However, there are instances where this approximation is not feasible, even with polynomial Lagrangians that meet Tonelli’s existence conditions: this situation is referred to as the Lavrentiev phenomenon. Some results are present in the literature if one requires the Lipschitz approximations to preserve just one endpoint constraint or when the Lagrangian is finite valued. The present paper deals with problems with two endpoint constraints and Lagrangians that are allowed to take extended values. As a byproduct, our Lipschitz approximations may preserve given state constraints. The extended-valued case is challenging since the phenomenon may occur even when the Lagrangian is constant on its effective domain, whose topology becomes relevant. Once assumed that the Lagrangian is radial convex on the rays of the last variable, our findings offer new insights, even when the Lagrangian L is real-valued, autonomous, and there are no state constraints.

MSC 2020: 49J45; 49N60

1 Introduction

We consider here a one-dimensional, vectorial functional of the Calculus of Variations

F ( y ) = t T L ( s , y ( s ) , y ( s ) ) 𝑑 s

defined on the space of Sobolev functions W 1 , p ( I , n ) on I := [ t , T ] with values in n , for some p 1 . In this paper the Lagrangian L ( s , y , v ) is Borel, with extended values in [ 0 , + [ { + } . Following the terminology of control theory, we will refer to s as to the time variable, to y as to the state variable and to v as to the velocity variable. It may be desirable, especially when the existence of a minimizer to F is unknown, to approximate the infimum of the energy F through a sequence of values F ( y ν ) along a sequence ( y ν ) ν of Lipschitz functions, that possibly share some desired boundary values and constraints. Though the Lipschitz functions are dense in W 1 , p ( I ; n ) , unless some a priori growth assumptions from above are satisfied, this approximation is not always possible: in this case we say that the Lavrentiev phenomenon occurs. We direct the reader to the references [11, 13] for a comprehensive review of the state of the art on the subject.

Of course, the best way to avoid the phenomenon is to make sure that the minimizers of F exist and are Lipschitz. This path was studied by Leonida Tonelli himself (see [35]); new results appeared in the last decades starting from the works of Francis Clarke and Richard Vinter [22] and Luigi Ambrosio, Oscar Ascenzi and Giuseppe Buttazzo [2] for coercive Lagrangians, under the very weak growth Condition (H) introduced by Francis Clarke in the seminal paper on the subject [21] and more recent works (e.g., [5, 6, 7, 14, 15, 24, 31]).

Conditions ensuring the non-occurrence of the phenomenon, other than Lipschitz regularity of the minimizers, were established by the pioneer paper by Giovanni Alberti and Francesco Serra Cassano in [1] for the autonomous case (i.e., L ( s , y , v ) = L ( y , v ) ), for problems with just one endpoint constraint: a suitable local boundedness condition on the Lagrangian ensures that given y W 1 , p ( [ t , T ] ) with F ( y ) < + , there is a no Lavrentiev gap at y, i.e., there exists a sequence of Lipschitz functions ( y ν ) ν such that:

  1. for all ν , y ν ( t ) = y ( t ) (prescribed initial endpoint),

  2. y ν y in W 1 , p ( [ t , T ] ) (approximation “in norm”),

  3. F ( y ν ) F ( y ) as ν + (approximation “in energy”).

We shall refer as to ( y ν ) ν as to an approximating sequence of Lipschitz functions, both in norm and in energy.

The main road to obtain the non-occurrence of the phenomenon (an exception is [23] where Maria Colombo and Rosario Mingione use some minimality properties to face multidimensional problem) is that to ensure that the sufficient conditions for the non-occurrence of the gap are satisfied for every admissible absolute continuous trajectory y. When the Lagrangian is non-autonomous, the Lavrentiev phenomenon may occur (though it is quite rare if L is coercive, see [37]), even with innocent looking Lagrangians, like in Manià’s problem (see [27])

min F ( y ) := 0 1 ( y 3 - s ) 2 ( y ) 6 𝑑 s subject to  y W 1 , 1 ( I ) , y ( 0 ) = 0 , y ( 1 ) = 1 .

It may happen that one may reach the infimum of the energy F among the functions y satisfying the condition y ( t ) = X n with a sequence ( y ν ) ν of Lipschitz functions satisfying y ν ( t ) = X , but that the same is not feasible if one, in addition, wishes to keep the final endpoint y ν ( T ) equal to a point Y n (see Example 3.3). The difficulty of preserving the boundary condition in such a kind of Lipschitz approximations was noticed also in the multidimensional setting (see [10, 32, 33]).

An extension of [1, Theorem 2.4] for non-autonomous Lagrangians, without requiring further assumptions on the state or velocity variables of Λ, was recently provided by the author in [29]. The same paper establishes a sufficient condition, conjectured by Alberti in a personal communication, for the non-occurrence of the phenomenon with both endpoint conditions. The need of a different set of sufficient conditions to establish the non-occurrence of the Lavrentiev phenomenon between the one endpoint and the two endpoint problem was not considered before.

Two questions, strictly related one to each other, remained to be considered:

  1. The approach by means of the techniques developed in [1], inherited in [29], for the two endpoint problem permits to consider just the case of real-valued Lagrangians. Now, being able to deal with extended-valued Lagrangians is not a purely aesthetical matter, since by adding suitable indicator functions (i.e., functions that take just the values 0 or + ) to the Lagrangian allows to consider problems with state or velocity constraints (e.g., variational problems associated to the orbits around a point mass, like the one described in [25]). However, the case of an extended-valued Lagrangian is by far the most difficult: though the results quoted above lead to expect that some boundedness condition on the Lagrangian should be enough to exclude the Lavrentiev phenomenon, various examples show that it may occur even when the Lagrangian is simply the indicator function of a subset of n × n (Examples 3.5 and 3.6).

  2. The approximation technique used in the proof of the main results in [1, 29], based first on the use of Lusin’s Theorem, does not allow the Lipschitz approximating sequence ( y ν ) ν to preserve, in general, a given state constraint of the form y ( [ t , T ] ) Δ n .

The first question was considered by Clarke in [21] and [28], for Lagrangians that satisfy an extra growth condition, though much weaker than the standard superlinearity: in this case the approximating sequence in energy may be chosen to be equi-Lipschitz. The second problem was considered by Arrigo Cellina, Alessandro Ferriero and Elsa Marchini in [16] for autonomous and just real-valued Lagrangians.

The present research addresses the two problems when the Lagrangian is infinite valued without involving any kind of growth condition. The main core of the paper, Theorem 5.4 and Corollary 5.13, are a set of sufficient conditions for the non-occurrence of the Lavrentiev gap and phenomenon for Lagrangians that might take the value + , in the presence of possible state and velocity constraints, both for the one endpoint and the two endpoint problems.

We consider Lagrangians of the form L ( s , y , v ) = Λ ( s , y , v ) Ψ ( s , y ) , where Λ , Ψ are Borel, and Ψ > 0 is continuous in the first variable: we refer to these properties as to the Basic Assumptions. We assume that Λ satisfies Condition (S + ) in the first variable s, described in Section 2.3. This condition is new and extends Condition (S) of previous literature (see [7, 21, 28]) and allows both Hölder or total variation dependence on the first variable. The class of Lagrangians appears to be more general than the one considered in [28], where Ψ = 1 ; in the autonomous and real-valued it was considered in [16]. Lastly, we require some Structure Assumptions on Λ:

  1. the effective domain Dom ( Λ ) of Λ (i.e., the set where Λ is finite), is a product of the form [ t , T ] × D Λ for some D Λ n × n ,

  2. Λ is radially convex in v, i.e., for all ( s , y , v ) Dom ( Λ ) , the function

    0 < r Λ ( s , y , r v )

    is convex, continuous and subdifferentiable (see Definition 2.5) up to r = 1 ,

  3. Λ ( s , y , r v ) < + for all ( s , y , v ) Dom ( Λ ) and r ] 0 , 1 ] .

Radial convexity in v is a somewhat natural assumption in the Calculus of Variations: it was established in [5, 7] that if y * is a (local) minimizer of F, then for almost every s [ t , T ] , the convex subdifferential ( Λ ( s , y * ( s ) , r y * ( s ) ) ) r = 1 of 0 < r Λ ( s , y * ( s ) , r y * ( s ) ) at r = 1 is nonempty; a fact that is ensured if 0 < r Λ ( s , y * ( s ) , r y * ( s ) ) is convex for a.e. s.

The main achievement concerning the Lavrentiev phenomenon is Corollary 5.13, that for the sake of clarity we report here in a less general form.

Corollary 5.13.

In addition to the Basic Assumptions, Condition (S + ), the Structure Assumptions on Λ suppose that the following hypotheses hold:

  1. Ψ is real-valued and bounded on the compact subsets of [ t , T ] × Δ .

  2. For every compact subset 𝒦 of Δ there is m 𝒦 > 0 such that Ψ ( s , z ) m 𝒦 for all s [ t , T ] , z 𝒦 .

  3. For every compact subset 𝒦 of Δ there is a sphere B r 𝒦 of radius r 𝒦 > 0 such that Λ is bounded on ( [ t , T ] × 𝒦 × B r 𝒦 ) Dom ( Λ ) .

Then:

  1. The Lavrentiev phenomenon does not occur for the problem with one endpoint constraint y ( t ) = X and state constraint y ( [ t , T ] ) Δ .

  2. The Lavrentiev phenomenon does not occur for the problem with two endpoint constraints y ( t ) = X , y ( T ) = Y and state constraint y ( [ t , T ] ) Δ if, in addition:

    1. Dom ( Λ ) = [ t , T ] × D Λ and D Λ is open in n × n ,

    2. the following condition holds:

      1. Λ is bounded on [ t , T ] × A , for every compact subset A of D Λ ( Δ × n ) .

With the exception of the Radial Convexity Assumption, the sufficient conditions in Corollary 5.13 are less restrictive than those in [1, 29] for the unconstrained case and require some kind of local boundedness just inside the effective domain of L:

  1. For problems with just one prescribed endpoint, differently from [1, Theorem 2.4] and [29, Corollary 3.7], we do neither impose that the effective domain of Λ contains a product of intervals nor the local boundedness of Λ on sets whose projection onto the velocity variable v contains a full neighborhood of the origin. Trying to avoid these conditions, following the approach of Alberti and Serra Cassano was performed successfully just for n = 1 in the autonomous case in a recent paper with Raphaël Cerf [18].

  2. We do not impose that Λ is bounded on all bounded subsets of Dom ( Λ ) ; notice that this would force Λ to takes values in . Condition ( B Λ σ ) requires the boundedness of Λ just on the intersection of some compact sets (of dimension less than 2 n ) of [ t , T ] × n × n with Dom ( Λ ) , whereas Condition ( B Λ ) is concerned with the behavior of Λ just on the compact subsets of Dom ( Λ ) ; in particular one avoids the neighborhoods of the boundary of Dom ( Λ ) , where typically Λ may diverge.

  3. For problems with both prescribed endpoints, the requirement that D Λ is open shows that the topology of the domain plays a role in the non-occurrence of the phenomenon. Actually, the topology on n × n might be as well the Euclidean one (as in the above simplified version of the main result), or a finer one induced by an extended-valued distance that coincides with the Euclidean on specific pairs that form a suitable equivalence class: the study of the admissible topologies in connection to the Lavrentiev phenomenon was recently carried on in [8], the main achievements being resumed in Section 4. By taking greater distances than the Euclidean one, one makes easier finding that D Λ is open in n × n , but it is more difficult to check the related version of Condition ( B Λ ), namely the boundedness of Λ on the bigger family of sets that are “well-inside” it.

Once one assumes the Basic Assumptions and radial convexity in the last variable, we show with several examples that the failure of any of the other conditions of Corollary 5.13 may lead to the occurrence of the Lavrentiev phenomenon.

Corollary 5.13 relies on Theorem 5.4, which provides some sufficient conditions for the non-occurrence of the Lavrentiev gap at a specific admissible trajectory y W 1 , p ( [ t , T ] ) that has a finite energy F ( y ) : the proof, inspired by that of [28, Theorem 5.1], consists on reparametrizing y in order to obtain a lower value of the energy by means of the radial convexity assumption. A suitable convex subgradient (called here DBR type subgradient) related to recent nonsmooth versions of the Du Bois–Reymond condition for minimizers – whose existence is by no means required here – and which coincides with Λ ( s , y , v ) - v v Λ ( s , y , v ) in the smooth case, plays a role in the crucial estimates of the reparametrized energy: these are formulated in Lemma 6.2 and proven in [8]. The reparametrization technique employed here is a classical tool in the proof of the Du Bois–Reymond equation and its use in order to obtain regularity of minimizing sequences was used under different settings and assumptions, for instance, in [17] by Arrigo Cellina, Giulia Treu and Sandro Zagatti and in [15, 16, 28].

The proof is constructive and allows to explicitly build a Lipschitz approximating sequence in the sense of Definition 3.1: an example of such a construction is fully described in Example 8.1.

The arguments of the proof have several sources of inspiration: Clarke’s formulation of a very slow growth condition based on the DBR type subgradient in [21] for problems without smoothness assumptions, [16] where a version of Theorem 5.4 is formulated for a real-valued, autonomous Lagrangian under some extra assumptions, e.g., convexity in v and continuity of L (we refer to [19] for a simplified proof of the main result), and [15] for how the authors deal with the extended-valued case in the more comfortable class of Lagrangians that satisfy a growth condition. All the authors exploit in different ways what we call here the DBR type subgradient: Lemma 6.2 shows that some fundamental estimates that concern this kind of subgradient, starting with those obtained in [21] and in [17], continue to hold in a more general setting assuming just boundedness assumptions on the Lagrangian (and not on its derivatives).

The conditions in Theorem 5.4 and Corollary 5.13 where both endpoint constraints y ( t ) = X , y ( T ) = Y are considered and the Lagrangian takes extended values are new in such generality. They extend in many directions the author’s [28, Theorem 5.1, Claim (2)] and [28, Corollary 5.7]; in particular no reference on the topology of the effective domain was considered before. Some simpler particular cases of the main results with Ψ = 1 for problems with just the initial endpoint condition appeared as an announcement of the present paper in [30] in a slightly more general optimal control framework, and as an example of application of the main result in [8].

The paper is organized as follows. In Section 3 we recall some basic facts on the Calculus of Variations and on the Lavrentiev phenomenon. Section 4 is devoted to the definition of the topologies that are involved in the main results for the case of two endpoints and an extended-valued Lagrangian. In Section 5 we state the main results, Theorem 5.4 and Corollary 5.13, we discuss thoroughly some comparisons with respects to the results in the literature and give some examples on the importance of the validity of the various assumptions. Section 6 illustrates the fundamental tools of the proof; we thoroughly show the role of the DBR type subgradient and of the topological properties of the effective domain. Section 7 is devoted to the proof of Theorem 5.4. In Section 8 we provide some examples of occurrence of the Lavrentiev phenomenon when some of the assumptions of the main results are not satisfied. Finally, Section 9 draws some possible extensions. Some open questions are addressed in various parts of the paper.

2 Notation and basic assumptions on the Lagrangian

2.1 Notation

We introduce the main recurring notation:

  1. The Euclidean norm in any k is denoted by | | .

  2. The closed ball of n centered in the origin of radius K 0 is denoted by B K , its boundary (the sphere of radius K) by B K .

  3. If y : I n , we denote by y ( I ) its image, by y its sup-norm and by y p its norm in L p ( I ; n ) .

  4. Lip ( I ; n ) = { y : I n , y  Lipschitz } ; if n = 1 , we simply write Lip ( I ) .

  5. For p 1 , W 1 , p ( I ; n ) = { y : I n : y , y L p ( I ; n ) } ; if n = 1 , we simply write W 1 , p ( I ) . The norm of y in W 1 , p is denoted by y 1 , p .

2.2 Basic assumptions

Let p 1 and n , n 1 . The functional F (sometimes referred as to the “energy”) is defined by

F ( y ) := I L ( s , y ( s ) , y ( s ) ) 𝑑 s for all  y W 1 , p ( I ; n ) ,

where L ( s , y , v ) is of the form

L ( s , y , v ) = Λ ( s , y , v ) Ψ ( s , y ) .

Basic Assumptions.

We assume the following conditions:

  1. I = [ t , T ] is a closed, bounded interval of .

  2. Λ : I × n × n [ 0 , + [ { + } is Lebesgue–Borel measurable in ( s , ( y , v ) ) , i.e., measurable with respect to the σ-algebra generated by the products of Lebesgue measurable subsets of I (for s) and Borel measurable subsets of n × n (for ( y , v ) ): this guarantees that if y , v : I n are measurable then s Λ ( s , y ( s ) , v ( s ) ) is measurable (see [20, Proposition 6.34]).

  3. Ψ : I × n [ 0 , + ] is Borel and continuous in the first variable, i.e., s Ψ ( s , y ) is continuous for every y n .

  4. Δ is a subset of n and 𝒰 is a cone in n , i.e., if u 𝒰 , then λ u 𝒰 whenever λ > 0 .

2.3 Condition (S + )

We consider the following Condition (S + ) on the first variable of Λ, satisfied if Λ does not depend on the first variable s.

Condition (S${}^{+}$).

For every K 0 there exist β ] 0 , 1 ] , ε * > 0 and:

  1. a Lebesgue–Borel measurable nonnegative function h ( s , y , v ) in ( s , ( y , v ) ) such that h ( s , y ( s ) , y ( s ) ) L 1 ( I ) whenever y W 1 , p ( I ) and F ( y ) < + ,

  2. a nonnegative Borel function g ( y , v ) such that g ( y ( s ) , y ( s ) ) L ( I ) whenever y W 1 , p ( I ) ,

  3. a bounded variation function η on I with values in

such that, for a.e. s I , for all s [ s - ε * , s + ε * ] I , y B K Δ , v 𝒰 , ( s , y , v ) Dom ( Λ ) ,

(2.1) | Λ ( s , y , v ) - Λ ( s , y , v ) | | s - s | β h ( s , y , v ) + | η ( s ) - η ( s ) | g ( y , v ) .

Remark 2.1.

Condition (S + ) is satisfied if, for every K 0 , there are κ > 0 , γ L 1 ( I ) , ε * > 0 satisfying, for a.e. s I

(2.2) | Λ ( s , y , v ) - Λ ( s , y , v ) | κ ( Λ ( s , y , v ) + | v | p + γ ( s ) ) | s - s |

whenever s [ s - ε * , s + ε * ] I , y B K Δ , v 𝒰 , ( s , y , v ) Dom ( Λ ) . The above is known as Condition (S). In the smooth setting, Condition (S) ensures the validity of the Du Bois–Reymond (DBR) condition for minimizers. In this more general framework it played a crucial role for Lipschitz regularity under slow growth conditions in [7, 21, 28] and ensured the validity of a generalized version of the DBR condition in [5, 7]. An extension of Condition (S) was considered in [4] where (2.2) was replaced by

(2.3) | Λ ( s , y , v ) - Λ ( s , y , v ) | κ ( Λ ( s , y , v ) + | v | p + γ ( s ) ) | s - s | + | η ( s ) - η ( s ) |

for some nondecreasing function η.

Example 2.2.

Let a : I be Hölder continuous, η : I of bounded variation, and h : n continuous. Then

Λ ( s , y , v ) = η ( s ) h ( y ) + a ( s ) | v | p for all  ( s , y , v ) I × n × n

satisfies Condition (S + ).

2.4 Structure assumptions on Λ

We recall here some important properties of the subgradient of a convex function, that will be used in the proof of the main results (see, for instance, [7, 36]).

Proposition 2.3 (Convex subgradient).

Let ϕ : ] 0 , + [ [ 0 , + ] be convex, finite in ] 0 , 1 ] and has a subgradient Q R of ϕ in 1: there is Q R , called a subgradient of ϕ in 1, such that

ϕ ( r ) - ϕ ( 1 ) Q ( r - 1 ) for all  r > 0 .

The subset of such elements Q R is called the (convex) subdifferential of ϕ at 1, and denoted by ϕ ( 1 ) . Then:

  1. The function μ > 0 Φ ( μ ) := ϕ ( 1 μ ) μ is convex and finite in [ 1 , + [ .

  2. P Φ ( 1 ) if and only if there is Q ϕ ( 1 ) satisfying

    (2.4) P = ϕ ( 1 ) - Q .

    In particular, P is the ordinate of the intersection of the tangent line z = Q ( μ - 1 ) + ϕ ( 1 ) to the graph of ϕ at 1 with the vertical axis.

Remark 2.4.

While it is understood that the subgradient of a convex function exists within the interior of its effective domain, the existence of a subgradient of ϕ at 1, as stated in Proposition 2.3, is not inherently guaranteed and must therefore be explicitly assumed.

The following additional conditions on Λ will be assumed throughout the paper.

Definition 2.5 (Radial Convexity Assumption (RC)).

For a.e. s I , for all ( y , v ) Δ × 𝒰 such that ( s , y , v ) Dom ( Λ ) ,

(2.5) 0 < r Λ ( s , y , r v )  is convex .

Moreover, the function 0 < r Λ ( s , y , r v ) has a nonempty convex subdifferential ( Λ ( s , y , r v ) ) r = 1 at r = 1 .

Definition 2.6 (Assumptions on Dom ( Λ ) ).

When Λ is extended valued, we assume the following conditions on its effective domain, defined by

Dom ( Λ ) := { ( s , y , v ) I × n × n : Λ ( s , y , v ) < + } .

  1. The effective domain of Λ is a product:

    (2.6) Dom Λ = I × D Λ for some  D Λ n × n .

  2. For every y Δ let

    (2.7) D Λ ( y ) := { v n : ( y , v ) Dom ( Λ ) }

    be the y-section of D Λ . We assume that D Λ ( y ) 𝒰 is strictly star-shaped on the variable v with respect to the origin, i.e.,

    (2.8) ( y , r v ) D Λ for all  ( y , v ) D Λ ( Δ × 𝒰 )  and all  r ] 0 , 1 ] .

    Thus if Λ ( s , y , v ) < + and ( y , v ) Δ × 𝒰 , then Λ ( s , y , r v ) < + for every r ] 0 , 1 ] .

Figure 1 illustrates an example of a domain that satisfies the conditions of Definition 2.6.

Remark 2.7.

Condition (2.5) in Definition 2.5 is somewhat natural in the Calculus of Variations. Indeed, it was established in [5, 7] that if y * is a (local) minimizer of F, then for almost every s I , the convex subdifferential ( Λ ( s , y * ( s ) , r y * ( s ) ) ) r = 1 of 0 < r Λ ( s , y * ( s ) , r y * ( s ) ) at r = 1 is nonempty.

Remark 2.8.

  1. Condition (2.6) is fulfilled if Λ is a product of functions of the form Λ ( s , y , v ) = a ( s ) h ( y , v ) .

  2. If Λ is radially convex then (2.8) is fulfilled if

    Λ ( s , y , v ) < + Λ ( s , y , 0 ) < + for all  ( s , y , v ) I × Δ × 𝒰 .

    Notice, however, that the Structure Assumptions do not require ( s , y , 0 ) Dom ( Λ ) for some ( s , y ) I × n .

  3. Let ( s , y , v ) Dom ( Λ ) , y Δ , v 𝒰 . Conditions (1) and (2) in Definition 2.6 imply that the effective domain of 0 < r Λ ( s , y , r v ) = ϕ ( r ) is an interval of the form J y , v = ] 0 , r ( y , v ) [ or J y , v = ] 0 , r ( y , v ) ] for some r ( y , v ) [ 1 , + [ . Therefore, ϕ has a nonempty subdifferential on the interior of J y , v . Condition 3 requires in addition that, if r ( y , v ) = 1 , then this is still true at r = 1 . It is satisfied if, for instance, J y , v is open for all ( y , v ) or if Λ is identically equal to 0 on its effective domain (in which case 0 ( Λ ( s , y , r v ) ) r = 1 ).

Figure 1 
                  An example of a subset 
                        
                           
                              
                                 D
                                 Λ
                              
                           
                           
                           {D_{\Lambda}}
                        
                      of the plane 
                        
                           
                              
                                 (
                                 y
                                 ,
                                 v
                                 )
                              
                           
                           
                           {(y,v)}
                        
                      that satisfies the assumptions of Definition 2.6.
Figure 1

An example of a subset D Λ of the plane ( y , v ) that satisfies the assumptions of Definition 2.6.

Let ( s , y , v ) Dom ( Λ ) , ( y , v ) Δ × 𝒰 . By applying Proposition 2.3 with ϕ ( r ) := Λ ( s , y , r v ) , r > 0 , it turns out that the function Φ defined by

Φ ( μ ) := Λ ( s , y , v μ ) μ for all  μ > 0

is convex on ] 0 , + [ , finite in [ 1 , + [ and has a subgradient P ( s , y , v ) at μ = 1 . When Λ is smooth in the last variable, we get

P ( s , y , v ) = Λ ( s , y , v ) - v v Λ ( s , y , v ) .

The function P plays a special role in the Calculus of Variations: under suitable assumptions it turns out that a minimizer y * of F satisfies the Du Bois–Reymond condition: the function p ( s ) = P ( s , y * ( s ) , y * ( s ) ) is absolutely continuous and p = L s ( s , y * ( s ) , y * ( s ) ) for a.e. s I : this fact is well known in the smooth case, and was established in this general framework in [5, 7] under Assumption (S).

Definition 2.9 (Du Bois–Reymond (DBR) type subgradient).

Let ( s , y , v ) Dom ( Λ ) , ( y , v ) Δ × 𝒰 . Any

P ( s , y , v ) μ [ Λ ( s , z , v μ ) μ ] μ = 1

is a Du Bois–Reymond (DBR) type subgradient of Λ at ( s , y , v ) . It has the property that

(2.9) Λ ( s , y , v μ ) μ - Λ ( s , y , v ) P ( s , y , v ) ( μ - 1 ) for all  μ > 0 .

Remark 2.10.

It follows from Proposition 2.3 that P ( s , y , v ) represents the intersection with the ordinate axis in the half-plane { r v , r > 0 } × { w = Λ ( s , y , r v ) : r > 0 } of the tangent line to r Λ ( s , y , r v ) at r = 1 (see Figure 2).

Figure 2 
                        Interpretation of 
                              
                                 
                                    
                                       P
                                       
                                          (
                                          s
                                          ,
                                          z
                                          ,
                                          u
                                          )
                                       
                                       ∈
                                       
                                          ∂
                                          μ
                                       
                                       
                                          
                                             [
                                             Λ
                                             
                                                (
                                                s
                                                ,
                                                z
                                                ,
                                                
                                                   u
                                                   μ
                                                
                                                )
                                             
                                             μ
                                             ]
                                          
                                          
                                             μ
                                             =
                                             1
                                          
                                       
                                    
                                 
                                 
                                 {P(s,z,u)\in\partial_{\mu}[\Lambda(s,z,\frac{u}{\mu})\mu]_{\mu=1}}
                              
                            when 
                              
                                 
                                    
                                       n
                                       =
                                       1
                                    
                                 
                                 
                                 {n=1}
                              
                           .
Figure 2

Interpretation of P ( s , z , u ) μ [ Λ ( s , z , u μ ) μ ] μ = 1 when n = 1 .

3 Lavrentiev gap at a function and Lavrentiev phenomenon

In this paper we consider different boundary data for the same integral functional.

3.1 The variational problems ( 𝒫 Δ , 𝒰 ), ( 𝒫 X Δ , 𝒰 ), ( 𝒫 X , Y Δ , 𝒰 )

We shall consider different variational problems associated to the functional F, with different endpoint conditions and state constraints. Let X , Y n , Δ n and let, as above, 𝒰 n be a cone. We define:

  1. Free endpoints. We set Γ Δ , 𝒰 := { y W 1 , p ( I ; n ) : y ( I ) Δ , y ( s ) 𝒰  a.e. } . Assuming Γ Δ , 𝒰 , we consider the corresponding variational problem

    (${\mathcal{P}^{\Delta,\mathcal{U}}}$) Minimize  { F ( y ) : y Γ Δ , 𝒰 } .

  2. One prescribed endpoint. If X Δ we set Γ X Δ , 𝒰 := { y Γ Δ , 𝒰 : y ( t ) = X } . Assuming Γ X Δ , 𝒰 we consider the corresponding variational problem

    (${\mathcal{P}_{X}^{\Delta,\mathcal{U}}}$) Minimize  { F ( y ) : y Γ X Δ , 𝒰 } .

  3. Two prescribed endpoints. If X , Y Δ we set Γ X , Y Δ , 𝒰 := { y Γ Δ , 𝒰 : y ( t ) = X , y ( T ) = Y } . Assuming Γ X , Y Δ , 𝒰 we consider the corresponding variational problem

    (${\mathcal{P}_{X,Y}^{\Delta,\mathcal{U}}}$) Minimize  { F ( y ) : y Γ X , Y Δ , 𝒰 } .

For the problems with one endpoint constraint, there is no privilege in considering the initial condition y ( t ) = X instead of the final one y ( T ) = Y : any result obtained here for the above variational problems can be reformulated replacing the initial endpoint condition y ( t ) = X with y ( T ) = Y .

3.2 Lavrentiev gap and phenomenon

Definition 3.1 (Lavrentiev gap at y W 1 , p ( I ; R n ) and Lavrentiev phenomenon).

Let Γ { Γ Δ , 𝒰 , Γ X Δ , 𝒰 , Γ X , Y Δ , 𝒰 } and let y Γ be such that F ( y ) < + (in what follows y will be called an admissible trajectory). We say that the Lavrentiev gap does not occur at y for the variational problem corresponding to Γ if there exists a sequence ( y ν ) ν of functions in Lip ( I , n ) satisfying:

  1. for all ν , y ν Γ ,

  2. y ν y in W 1 , p ( I ; n ) (approximation in norm),

  3. lim ν + F ( y ν ) = F ( y ) (approximation in energy).

We will refer to ( y ν ) ν as to an approximating sequence. We say that the Lavrentiev phenomenon does not occur for the problem related to Γ if

(3.1) inf y W 1 , p ( I ; n ) y Γ F ( y ) = inf y Lip ( I ; n ) y Γ F ( y ) .

Remark 3.2 (Gap and phenomenon).

AA

  1. Once Condition (2) of Definition 3.1 is satisfied, in order to verify Condition (3) it is enough to check that

    lim sup ν + F ( y ν ) F ( y ) .

    Indeed, Condition (2) and Fatou’s Lemma give

    lim inf ν + F ( y ν ) F ( y ) .

  2. The non-occurrence of the Lavrentiev gap along a minimizing sequence (and in particular at every admissible y W 1 , p ( I ; n ) ) implies the non-occurrence of the Lavrentiev phenomenon for the same variational problem.

Concerning the Lavrentiev gap or phenomenon, there are several reasons to distinguish between problems with both endpoint conditions and those with just one endpoint condition. The main issue is that the phenomenon may occur in first case and not in the second, as shown by Example 3.3.

Example 3.3 (Manià, [27]).

Consider the problem of minimizing

(${\mathcal{P}_{0,1}}$) F ( y ) = 0 1 ( y 3 - s ) 2 ( y ) 6 𝑑 s subject to  y W 1 , 1 ( I ) , y ( 0 ) = 0 , y ( 1 ) = 1 .

Then y * ( s ) := s 1 3 is a (non-Lipschitz) minimizer and F ( y * ) = 0 . It turns out (see [12, Section 4.3]) that the Lavrentiev phenomenon occurs, i.e.,

0 = min F = F ( y * ) < inf { F ( y ) : y Lip ( [ 0 , 1 ] ) , y ( 0 ) = 0 , y ( 1 ) = 1 } .

However, as it is noticed in [12], the situation changes drastically if one allows to vary the initial boundary condition along the sequence ( y ν ) ν . Indeed consider the sequence (see Figure 3) ( y ν ) ν obtained by truncating y * at 1 ν ( ν 1 ), as follows:

y ν ( s ) := { ν - 1 3 if  s [ 0 , 1 ν ] , s 1 3 otherwise , for all  ν 1 .

Figure 3 
                        The function 
                              
                                 
                                    
                                       y
                                       ν
                                    
                                 
                                 
                                 {y_{\nu}}
                              
                            in Example 3.3.
Figure 3

The function y ν in Example 3.3.

Then, for every ν 1 , y ν is Lipschitz,

y ν ( 1 ) = y ( 1 ) = 1 , F ( y ν ) F ( y * ) = min F , y ν y *  in  W 1 , 1 ( [ 0 , 1 ] ) .

Therefore, no Lavrentiev phenomenon occurs for the variational problem

(3.2) min F ( y ) = 0 1 ( y 3 - s ) 2 ( y ) 6 𝑑 s subject to  y W 1 , 1 ( I ) , y ( 1 ) = 1 .

3.3 Examples of the occurrence of the gap or phenomenon

The Lavrentiev phenomenon may occur in the autonomous case for Lagrangians that take the value + . In the following examples the Lavrentiev phenomenon occurs because there are no Lipschitz trajectories that have finite energy and satisfy the given endpoint constraints. All of them are based on the fact that y * ( s ) = s is the only absolutely continuous solution to the differential equation 2 y y = 1 on [ 0 , 1 ] .

Example 3.4 (Lavrentiev phenomenon with one endpoint constraint [19]).

Consider the autonomous Lagrangian L ( s , y , v ) = Λ ( s , y , v ) Ψ ( s , y ) with

Λ ( s , y , v ) = { ( v - 1 2 y ) 2 if  y 0 , + if  y = 0 , Ψ ( s , y ) = 1 for all  ( s , y , v ) [ 0 , 1 ] × × .

Notice that:

  1. the effective domain of Λ is I × D Λ , D Λ = { 0 } × ,

  2. v Λ ( s , y , ) is finite valued, convex and continuous for all ( s , y ) [ 0 , 1 ] × ( { 0 } ) .

In particular, Λ satisfies Condition (S) of Remark 2.1 and the Structure Assumptions (with Δ = 𝒰 = ). The function y * ( s ) = s is a minimizer of F in W 1 , 1 ( [ 0 , 1 ] ) , but it turns out (see [19, 18]) that the Lavrentiev phenomenon for the one endpoint problem y ( 0 ) = 0 occurs.

In the following examples, the Lavrentiev phenomenon occurs when the Lagrangian is a constant in its effective domain.

Example 3.5 (Lavrentiev phenomenon with one endpoint constraint).

Let L ( s , y , v ) = Λ ( s , y , v ) Ψ ( s , y ) with

Λ ( s , y , v ) = { 0 if  v = 1 2 y  and  y 0 , + otherwise , Ψ ( s , y ) = 1 for all  ( s , y , v ) [ 0 , 1 ] × × .

Notice that y * ( s ) = s , s [ 0 , 1 ] minimizes F ( y ) among the functions y W 1 , 1 ( [ 0 , 1 ] ) with y ( 0 ) = 0 . We claim that F ( y ) = + whenever y * y W 1 , 1 ( [ 0 , 1 ] ) and y ( 0 ) = 0 . Indeed, let y be an absolutely continuous function such that y ( 0 ) = 0 and F ( y ) < + . Then for a.e. s in [ 0 , 1 ] we have y ( s ) = 1 2 y ( s ) . It follows that, necessarily, y ( s ) = y * ( s ) = s . In particular, if y ( 0 ) = 0 is Lipschitz in [ 0 , 1 ] , then necessarily F ( y ) = + , Thus the Lavrentiev gap occurs at the minimum y * for the one endpoint problem y ( 0 ) = 0 so that the Lavrentiev phenomenon occurs in this case. Regarding the standing assumptions described above (with Δ = 𝒰 = ):

  1. The effective domain of Λ is I × D Λ , D Λ = { ( y , v ) 2 : y 0 , v = 1 2 y } .

  2. For all y 0 the y-section of D Λ given by D Λ ( y ) = { 1 2 y } is not strictly star-shaped with respect to 0; in particular, Λ does not fully satisfy the Structure Assumptions.

Example 3.6 (Lavrentiev phenomenon with two endpoint constraints).

This example is inspired by one of Alberti (see [29]). Consider, for ( s , y , v ) [ 0 , 1 ] × × , the autonomous Lagrangian L ( s , y , v ) = Λ ( s , y , v ) Ψ ( s , y ) with

Λ ( s , y , u ) := { 0 if  y > 0  and  v 1 2 y , + otherwise , Ψ ( s , y ) = 1 .

Figure 4 
                        The set 
                              
                                 
                                    
                                       D
                                       Λ
                                    
                                 
                                 
                                 {D_{\Lambda}}
                              
                            in Example 3.6.
Figure 4

The set D Λ in Example 3.6.

The effective domain of Λ is I × D Λ , with

D Λ = { ( y , v ) × : y > 0 , v 1 2 y } ,

represented in Figure 4. Note that Λ satisfies the Structure Assumptions:

  1. For all y > 0 , the y-section of D Λ is the interval D Λ ( y ) = ] - , 1 2 y ] .

  2. For all s [ 0 , 1 ] , ( y , v ) D Λ , the map 0 < r Λ ( s , y , r v ) = 0 is convex on its effective domain.

Clearly, y * ( s ) = s , s [ 0 , 1 ] minimizes F among the absolutely continuous functions y on [ 0 , 1 ] satisfying y ( 0 ) = 0 , y ( 1 ) = 1 . The Lavrentiev gap occurs at y * for the two endpoint problem; actually, F ( y ) = + for every Lipschitz y : [ 0 , 1 ] satisfying y ( 0 ) = 0 , y ( 1 ) = 1 . Indeed, assume the contrary: let y be such a function and suppose F ( y ) < + . Since F ( y ) < + , it follows that

(3.3) y ( s ) > 0 , y ( s ) 1 2 y ( s ) a.e. on  [ 0 , 1 ] .

Note that it may not happen that y ( s ) y ( s ) = 1 2 a.e. on [ 0 , 1 ] , otherwise y ( s ) = s , which is not Lipschitz. Thus there is a non-negligible subset of [ 0 , 1 ] where y y < 1 2 , whence

(3.4) 0 1 y ( s ) y ( s ) 𝑑 s < 0 1 1 2 𝑑 s = 1 2 .

However, the change of variable ζ = y ( s ) (which is justified, for instance, by the chain rule [26, Theorem 1.74]) gives

0 1 y ( s ) y ( s ) 𝑑 s = 0 1 ζ 𝑑 ζ = 1 2 ,

contradicting (3.4). Therefore F ( y ) = + for every Lipschitz function y satisfying y ( 0 ) = 0 , y ( 1 ) = 0 . Notice that the Lavrentiev phenomenon does not occur if, instead, one considers just the one endpoint constraint y ( 0 ) = 0 : indeed the Lipschitz function

y ¯ ( s ) = s 2 for all  s [ 0 , 1 ]

satisfies ( y ¯ ( s ) , y ¯ ( s ) ) D Λ for all s [ 0 , 1 ] , so that F ( y ¯ ) = 0 , proving that there is a Lipschitz minimizer among the functions that satisfy the given boundary condition.

4 Admissible topologies

This section may be skipped if Λ takes only finite values or for problems with just one end point condition. When dealing with an extended-valued Lagrangian for problem ( 𝒫 X , Y Δ , 𝒰 ) with two endpoint conditions, it is useful to introduce some topologies that are finer than the Euclidean one. We just formulate here the main results that will be used in the paper, and refer for the details and proofs to [8].

4.1 The topology τ 𝒲 induced by an equivalence relation 𝒲 in a metric space

We allow here a metric in metric spaces to take the value + .

Definition 4.1 (The “metric” dist W ).

Let 𝒲 be an equivalence relation on a metric space ( Π , d ) . We set

dist 𝒲 ( ω , ω ) = { d ( ω , ω ) if  ω 𝒲 ω , + otherwise , for all  ω , ω Π .

Proposition 4.2.

Let W be an equivalence relation on a metric space ( Π , d ) . Then dist W is a distance on Π and

(4.1) dist 𝒲 ( ω , ω ) d ( ω , ω ) for all  ω , ω Π .

Definition 4.3 (The topology τ W ).

Let 𝒲 be an equivalence relation on a metric space ( Π , d ) . If ω Π and r > 0 , we shall denote by B 𝒲 ( ω , r [ the open ball of center ω Π and radius r with respect to dist 𝒲 :

B 𝒲 ( ω , r [ := { ω Π : dist 𝒲 ( ω , ω ) < r } = { ω Π : ( ω , ω ) 𝒲 , d ( ω , ω ) < r } .

The topology τ 𝒲 is the topology induced on Π by the metric dist 𝒲 .

The functions 𝒲 τ 𝒲 and 𝒲 dist 𝒲 are decreasing with respect to the natural orders.

Proposition 4.4 (Inclusions between topologies and metrics).

Let W 2 W 1 be equivalence relations in a metric space ( Π , d ) . Then

(4.2) τ 𝒲 2 τ 𝒲 1 , dist 𝒲 2 dist 𝒲 1 .

4.2 Admissible topologies in n × n

In the main result of the paper we will consider topologies on n × n built upon the Euclidean distance and a suitable subclass of equivalence relations.

Definition 4.5 (Admissible topologies).

Let 𝒲 be an equivalence relation on n × n . The topology τ 𝒲 will be called admissible whenever 𝒲 𝒲 0 , where

𝒲 0 := { ( ( y , λ 1 v ) , ( y , λ 2 v ) ) : y n , v n , λ 1 > 0 , λ 2 > 0 } .

We shall denote by 𝒲 e the maximal equivalence relation ( n × n ) × ( n × n ) , in which case τ 𝒲 e = τ e , the Euclidean topology: we denote by dist e the Euclidean distance in n × n .

Remark 4.6.

It follows from Proposition 4.4 that if 𝒲 is an equivalence relation in n × n , then, since 𝒲 0 𝒲 𝒲 e , it turns out that

τ e τ 𝒲 τ 𝒲 0 .

In particular, any topology τ 𝒲 is finer than the Euclidean one.

Example 4.7 (The topology of the y-sections).

Among the admissible topologies on n × n , by taking

𝒲 = 𝒲 1 = { ( ( y , v ) , ( y , v ) ) : y n , v , v n } ,

a set A n × n is open for τ 𝒲 1 if and only if for all y n the sections

A ( y ) = { v n : ( y , v ) A }

are open in n .

The next result motivates the requirement that we consider just equivalence classes on n × n containing 𝒲 0 .

Proposition 4.8.

Let W W 0 be an equivalence relation on R n × R n . Let A R n × R n , ( y , v ) R n × R n with dist W ( ( y , v ) , A ) ρ > 0 . Then, if r 0 ,

dist 𝒲 ( ( y , v + r v ) , A ) ρ - r | v | .

4.3 Well-inside subsets of D Λ

We denote by D c the complement of a subset D of n × n .

Definition 4.9 (The family M W , the notation W ).

Let 𝒲 be an equivalence relation on n × n . We say that a subset A of D is well-inside D Λ for dist 𝒲 , and we write A 𝒲 D Λ , if A is bounded and there is ρ > 0 such that

A { ( y , v ) D : dist 𝒲 ( ( y , v ) , D Λ c ) ρ } .

We shall denote by M 𝒲 the family of sets that are well-inside D Λ for dist 𝒲 .

Remark 4.10.

Let 𝒲 = 𝒲 e . A set A is well-inside D Λ if and only if the distance from A to the boundary of D Λ is strictly positive.

The family of sets that are well-inside D Λ for dist 𝒲 decreases as the equivalence relation set 𝒲 increases.

Proposition 4.11.

Let W 1 W 2 be two equivalence relations in R n × R n . Then M W 2 M W 1 .

Remark 4.12.

It follows from Proposition 4.11 that

(4.3) M 𝒲 e M 𝒲 M 𝒲 0 .

Observe that inequalities in (4.3) might be strict: for instance, if n = 1 and D Λ = ] 0 , 1 [ × ] 0 , 1 [ , then ] 0 , 1 [ × ] 0 , 1 2 ] is well-inside D Λ for τ 𝒲 0 but not for τ e .

Example 4.13.

We consider here n × n endowed with the Euclidean distance.

  1. If 𝒲 = n × n , then τ 𝒲 = τ e . A bounded subset A of n × n belongs to M 𝒲 e if for every ( y , v ) A we have

    | ( y , v ) - ( y , v ) | ρ for all  ( y , v ) D Λ c .

    Therefore A M 𝒲 e if A has a strictly positive Euclidean distance from D Λ c in n × n .

  2. Analogously, if 𝒲 = 𝒲 0 , a bounded subset A of n × n belongs to M 𝒲 0 if for every ( y , λ 1 v ) A with | v | = 1 , λ 1 > 0 , we have

    λ 2 λ 1 + ρ  or  λ 2 λ 1 - ρ for all  ( y , λ 2 v ) D Λ c , λ 2 > 0 .

    Recalling that D Λ ( y ) is strictly star-shaped with respect to 0, it turns out that A M 𝒲 0 if there is ρ > 0 such that, for all ( y , λ v ) A with λ > 0 and | v | = 1 ,

    sup { λ 1 > 0 : ( y , λ 1 v ) A } + ρ inf { λ 2 > 0 : ( y , λ 2 v ) D Λ c } .

    For instance, if n = 1 , the above means that, for all y in the first projection of A, the y-section A ( y ) of A satisfies

    sup ( A ( y ) * + ) + ρ inf ( D Λ c ( y ) * + ) , inf ( A ( y ) * - ) - ρ sup ( D Λ c ( y ) * - ) .

    An example is depicted in Figure 5.

Figure 5 
                        The set A is well-inside 
                              
                                 
                                    
                                       D
                                       Λ
                                    
                                 
                                 
                                 {D_{\Lambda}}
                              
                            for 
                              
                                 
                                    
                                       dist
                                       
                                          𝒲
                                          0
                                       
                                    
                                 
                                 
                                 {\operatorname{dist}_{\mathcal{W}_{0}}}
                              
                           .
Figure 5

The set A is well-inside D Λ for dist 𝒲 0 .

5 Non-occurrence of the Lavrentiev gap and phenomenon

We present here the main results of the paper, on the non-occurrence of the gap or phenomenon. When the projection of the effective domain onto the velocity space v n is bounded, every admissible trajectory is Lipschitz and the non-occurrence of the gap/phenomenon follows trivially. We thus subsume that the projection onto the velocity variable v of D Λ is unbounded.

5.1 Non-occurrence of the Lavrentiev gap

We will require the boundedness of Λ on sets that “enclose” the origin, as does a sphere.

Definition 5.1 (Sets enclosing the origin).

We say that A n encloses the origin if (see Figure 6)

  1. There is r A > 0 such that

    A B r A .

  2. For all x B 1 there is y A such that x = y | y | or, equivalently, every radius from the origin intersects A in at least one point.

Figure 6 
                        A curve enclosing the origin in 
                              
                                 
                                    
                                       ℝ
                                       2
                                    
                                 
                                 
                                 {{\mathbb{R}}^{2}}
                              
                            (Definition 5.1).
Figure 6

A curve enclosing the origin in 2 (Definition 5.1).

Example 5.2.

A subset A of encloses the origin if and only if it contains { a , b } with a < 0 , b > 0 .

5.1.1 Sufficient conditions for the non-occurrence of the gap

Theorem 5.4 makes use of the notion of admissible topology in n × n introduced in Section 4. Some particular case of such topologies are the Euclidean one or the topology whose sets are open if and only if their projection onto the second component n of n × n are open.

Definition 5.3 ( essinf ).

If u is a measurable function on I, we define

essinf { | u ( s ) | : s I } := sup { : | u |  a.e. on  I } .

Theorem 5.4 (Non-occurrence of the Lavrentiev gap).

Let y W 1 , p ( I ; R n ) be such that F ( y ) < + . In addition to the Basic Assumptions, Condition (S + ), the Structure Assumptions on Λ, suppose that:

  1. There is m y , Ψ > 0 such that Ψ ( s , z ) m y , Ψ for all s I , z y ( I ) .

  2. Ψ is bounded on I × y ( I ) .

  3. There is a subset A y of n enclosing the origin (e.g., a sphere) such that Λ is bounded on ( I × y ( I ) × ( A y 𝒰 ) ) Dom ( Λ ) .

Then:

  1. There is no Lavrentiev gap for ( 𝒫 X Δ , 𝒰 ) at y.

  2. There is no Lavrentiev gap for ( 𝒫 X , Y Δ , 𝒰 ) at y if, in addition, there is an equivalence relation 𝒲 𝒲 0 in n × n such that:

    1. Dom ( Λ ) = I × D Λ and D Λ ( Δ × 𝒰 ) is interior to D Λ for the topology τ 𝒲 .

    2. the following condition holds:

      1. There is λ y > essinf { | y ( s ) | : s I } such that Λ is bounded on the subsets of the form I × A where A y ( I ) × ( B λ y 𝒰 ) , A 𝒲 D Λ .

In both cases the Lipschitz approximating sequence ( y ν ) ν to y in energy and norm may be obtained as Lipschitz reparametrizations of y, i.e., y ν = y ψ ν with ψ ν : I I Lipschitz.

Let us show how Theorem 5.4 may be of help.

Example 5.5.

Consider an autonomous Lagrangian of the form

L ( s , y , v ) = { f ( y , v ) if  v 0 , + otherwise,

where f : n × ( n { 0 } ) [ 0 , + [ is Borel, bounded on the compact subsets, radially convex in v and

lim v 0 f ( y , v ) = + for all  y n .

Then Λ fulfills the Assumptions of Claim (1) in Theorem 5.4: thus the Lavrentiev phenomenon does not occur for problem ( 𝒫 X n , n ). However, previous results in the literature could not be applied: indeed, [1, Theorem 2.4] requires that Λ is bounded on a set of the form 𝒦 × B r , for some compact subset 𝒦 of n and r > 0 .

Example 5.6.

Consider an autonomous Lagrangian of the form

L ( s , y , v ) = { f ( y , v ) if  y 0 , | y | 1 , + otherwise,

where f : ( B 1 { 0 } ) × n [ 0 , + [ is Borel, bounded on the compact subsets of ( B 1 { 0 } ) × n , radially convex in v and

lim y 0 f ( y , v ) = + for all  v n .

Then Λ fulfills the Assumptions of Claim (2) of Theorem 5.4, with 𝒲 = 𝒲 0 : thus the Lavrentiev phenomenon does not occur for problem ( 𝒫 X , Y B 1 { 0 } , n ). However, the known results in the literature could not be applied. Indeed:

  1. [29, Theorem 3.1, Corollary 3.7] and [16, Theorem 1] require that Λ is finite valued,

  2. [28, Proposition 4.24, Theorem 5.1] require that Λ goes to infinity at the boundary of Dom ( Λ ) and that Λ is bounded on a rectangle of the form I × B K × B r .

5.1.2 On the assumptions of Theorem 5.4

We comment briefly the assumptions required in Theorem 5.4.

Remark 5.7.

  1. Condition (S + ) is a new and more general than Condition (S) as described in [21, 5, 7, 28, 30] and [29], as well as its recent extension, Condition (BV), introduced in [8]. Actually, in the forthcoming note [9], we show that the results reported in [28, 29, 30] and [8] remain valid under conditions that are more general than (S).

  2. Condition ( P y , Ψ ) is needed just to infer that Λ ( s , y ( s ) , y ( s ) ) L 1 ( I ) , see [29, Remark 3.2]. Hence the conclusions of Theorem 5.4 hold under the more general assumption that Λ ( s , y , y ) L 1 ( I ) .

  3. Condition ( B y , Λ σ ) is satisfied if Λ is bounded on the bounded sets of Dom ( Λ ) .

  4. In Claim (2), Condition (a) is satisfied if D Λ is open for τ 𝒲 , the converse being true if Δ = 𝒰 = n .

  5. Since essinf { | y ( s ) | : s I } y 1 | I | , the requirement that essinf { | y ( s ) | : s I } < λ y in Condition ( B y , Λ 𝒲 ) is satisfied if y 1 < λ y | I | . The value essinf { | y ( s ) | : s I } appears to be a threshold for the study of the Lipschitz regularity of y in [21] when the latter is, instead of being any absolutely continuous function with finite energy F ( y ) , the function y is a minimizer of F.

  6. Condition ( B y , Λ 𝒲 ) is satisfied if Λ is bounded on the subsets of the form I × A where A Δ × 𝒰 , A 𝒲 D Λ .

5.1.3 Choice of the equivalence relation 𝒲

The problem of the choice of a suitable equivalence relation on n × n involves just the problem ( 𝒫 X , Y Δ , 𝒰 ) with two prescribed endpoints in the case where Λ takes the value + .

Remark 5.8.

The choice of the suitable equivalence relation 𝒲 in the application of Claim (2) of Theorem 5.4 depends on the validity (a) and (b). It looks like the two requirements fight one against the other: among the possible choices, 𝒲 = 𝒲 0 is more suitable for (a), whereas 𝒲 = 𝒲 e = n × n is the best option for ( B y , Λ 𝒲 ). More precisely:

  1. Condition (a) in Claim (2) of Theorem 5.4 is more easily verified for 𝒲 = 𝒲 0 . Indeed, it follows from Proposition 4.4 that τ 𝒲 0 is the finer topology among the admissible ones.

  2. Instead, Condition ( B y , Λ 𝒲 ) is more easily verified for 𝒲 = 𝒲 e . Indeed, it follows from Proposition 4.11 that the smallest family of sets that are well-inside D Λ for dist 𝒲 occurs for the greatest choice of the equivalence relation 𝒲 , i.e., 𝒲 = 𝒲 e .

In order to verify the validity of the assumptions for the validity of Claim (2) in Theorem 5.4, one may proceed as follows:

  1. Check whether D Λ ( Δ × 𝒰 ) is interior to D Λ for τ 𝒲 0 and if ( B y , Λ 𝒲 ) holds with 𝒲 = 𝒲 e : if not, there is no way to satisfy the assumptions for Claim (2) of Theorem 5.4.

  2. If the two conditions stated in (i) hold, check whether D Λ ( Δ × 𝒰 ) is interior to D Λ for τ e (in which case choose 𝒲 = 𝒲 e ) or if ( B y , Λ 𝒲 ) holds with 𝒲 = 𝒲 0 (in which case choose 𝒲 = 𝒲 0 ).

  3. If (ii) fails, try to find an intermediate equivalence relation 𝒲 0 𝒲 𝒲 e in n × n in such a way that D Λ ( Δ × 𝒰 ) is still interior to D Λ for τ 𝒲 and ( B y , Λ 𝒲 ) is satisfied. The topology of the y-sections introduced in Example 4.7 might be a good intermediate option.

Example 5.9.

Suppose, for instance, that n = 1 , Δ = 𝒰 = and

D Λ = { ( y , v ) 2 : y ] 0 , + [ ,  0 v < 1 y } .

Then D Λ is not open for the Euclidean topology, but is open for τ 𝒲 0 . Now, in relation with Condition ( B y , Λ 𝒲 ), notice that, accordingly to Example 4.13, a subset A of 2 is well-inside D Λ if and only if A is bounded and there is ρ > 0 such that

A { ( y , v ) 2 : y ] 0 , + [ ,  0 v max { 1 y - ρ , 0 } } .

5.1.4 Counterexamples

Remark 5.10 (Counterexamples).

The Lavrentiev phenomenon may occur if some of the assumptions of Theorem 5.4 are not fulfilled:

  1. As mentioned in [16], the gap may occur if Ψ 0 may vanish: this is the case actually in Manià’s Example 3.3 where Λ ( s , y , v ) = v 6 , Ψ ( s , y ) = ( y 3 - s ) 2 .

  2. The Lavrentiev phenomenon may occur, even for autonomous Lagrangians for the one endpoint problem ( 𝒫 X Δ , 𝒰 ), if just ( B y , Λ σ ) does not hold. This is the case, for instance, of Example 3.4 (taken from [19]). In that case, recalling that y * ( s ) = s :

    1. Δ = 𝒰 = ,

    2. Λ ( s , y , v ) and Ψ = 1 satisfy the Basic and Structure Assumptions,

    3. Λ satisfies ( B y * , Λ 𝒲 e ), since it is continuous on the effective domain and thus bounded on the bounded subsets of the form I × A , where A n × n is relatively compact in D Λ ,

    4. Condition ( B y * , Λ σ ) does not hold. Indeed, y * ( I ) = [ 0 , 1 ] and Λ is unbounded on [ 0 , 1 ] × ] 0 , 1 ] × { a , b } for all a < 0 , b > 0 .

  3. Example 3.5 shows that the assumptions that the y-sections of D Λ are star-shaped with respect to 0 (namely Condition (2.8)) cannot be dropped, in general.

  4. When Λ is extended valued, the requirement that D Λ ( Δ × 𝒰 ) is interior to D Λ for some τ 𝒲 is not merely a technical assumption. In Example 3.6, the Lavrentiev gap at a minimizer (and thus the phenomenon) occurs for the two endpoints problem. Since Λ satisfies the Structure Assumptions and Λ = 0 on Dom ( Λ ) , all the boundedness assumptions of Theorem 5.4 are satisfied with Δ = 𝒰 = . However, the set D Λ is not open for some admissible topology: indeed D Λ it is not even open for the finer topology τ 𝒲 0 .

  5. We ignore whether the gap for constrained/extended-valued problems may occur if all the assumptions of Theorem 5.4 hold true, except the radial convexity one.

5.1.5 Comparison with known results in the literature

Theorem 5.4 is a step forward with respect to the results of the literature on Lavrentiev’s phenomenon.

Remark 5.11.

A first point of distinction is that the case with just one endpoint constraint has not been considered intentionally before as differing from the problem with both endpoint conditions, except in the author’s recent [29] (though without taking into account there the state constraint y Δ ). Moreover:

  1. Theorem 5.4 extends in several directions [16, Theorem 1] by Cellina, Ferriero and Marchini established for real-valued and continuous Lagrangians of the form Λ ( y , v ) Ψ ( s , y ) that are convex in v: we allow here Λ to be possibly discontinuous, non-autonomous, non-convex in v and Λ , Ψ to be extended valued. Moreover, we show here the approximation of the Lipschitz sequence not only in energy, as in [16, Theorem 1], but also in the W 1 , p norm.

  2. A particular case of Theorem 5.4 was formulated in [28, Theorem 5.1, Claim (2)], for a Lagrangian of the form Λ ( s , y , v ) satisfying the Structure Assumptions and Condition (S). With respect to [28, Theorem 5.1], other than the more general Lagrangian and the new Condition (S + ) considered here:

    1. We study problem ( 𝒫 X Δ , 𝒰 ) with just one endpoint condition, requiring for the non-occurrence of the gap much simpler assumptions than those for the two endpoint problem ( 𝒫 X , Y Δ , 𝒰 ).

    2. Assumption h 2 ) of [28, Theorem 5.1], requiring that Λ tends to + approaching Dom ( Λ ) c , is weakened here by the requirement that D Λ ( Δ × 𝒰 ) is interior to D Λ for τ 𝒲 .

    3. Finer topologies than the Euclidean one are not mentioned elsewhere.

    4. The linear growth assumption from below (see ((G${{}_{\Lambda}}$)) in Section 5.2) is no more needed here.

    5. We do not impose anymore, as in [28, Proposition 4.24] that, for all K 0 , the effective domain Dom ( Λ ) of Λ contains a product of the form I × B K × B r K .

    6. We prove the convergence of the Lipschitz approximation not only – as in [28, Theorem 5.1] – in energy, but also in the W 1 , p norm.

  3. [29, Theorem 3.1] and [1, Theorem 2.4] do not consider the state constraint y ( s ) Δ for all s I . When Δ = n , the conclusions of Theorem 5.4 are those of [29, Theorem 3.1] and, in the special case of an autonomous Lagrangian with Λ = Λ ( y , v ) , Ψ = 1 and Problem ( 𝒫 X n ), those of [1, Theorem 2.4], except the fact that the approximating sequence in the quoted papers is not obtained just through a reparametrization of y. Theorem 5.4 and [29, Theorem 3.1] do essentially share a same set of assumptions concerning the function Ψ (more precisely, [29, Theorem 3.1] requires, instead of Condition (B y , Ψ σ ), the stronger assumption (B y , Ψ ) that Ψ is bounded on a neighborhood of I × y ( I ) ). Concerning Λ, [29, Theorem 3.1] involves Condition (S), which is not technical: the celebrated example by Ball and Mizel in [3] exhibits a positive Lagrangian Λ ( s , y , y ) that is a polynomial, superlinear and convex in y (thus satisfying all the assumptions of Claim (2) of Theorem 5.4, with the exception of Condition (S)), for which the Lavrentiev phenomenon occurs for some suitable initial and end boundary data. Actually, the proof of [29, Theorem 3.1] shows that Condition (S) might be replaced by the weaker Condition (S + ), by following the same argument of Step xvi of the proof of Theorem 5.4. The main differences concern the other set of assumptions on Λ:

    1. Here, in addition to the conditions of [29, Theorem 3.1], we require the radial convexity of Λ in the last variable, a geometrical structure of the domain and, for the problem with two endpoint constraints, the topological condition that D Λ ( Δ × 𝒰 ) is interior to D Λ with respect to a suitable topology, possibly finer than the Euclidean one.

    2. The main improvement with respect to the existing literature, even in the unconstrained case, concerns the two endpoint problem where Condition ( B y , Λ σ ) weakens Condition ( B y , Λ ) of [29, Theorem 3.1], requiring that Λ is bounded on a neighborhood of I × y ( I ) × B r y for some r y > 0 . Notice that the latter subsumes, in particular, that the effective domain of Λ contains a neighborhood of the rectangle I × y ( I ) × B r y . Here ( B y , Λ σ ) requires that Λ is bounded on ( I × y ( I ) × A y ) Dom ( Λ ) , for some A y n enclosing the origin (e.g., a sphere), i.e., in a 2 n -dimensional subset of Dom ( Λ ) I × n × n . The methods of [1, 29] need the boundedness of Λ ( s , y , v ) for v in a whole ball: weakening this requirement without adding the convexity assumption in v was the main aim in [18], carried on successfully just in dimension 1 by requiring the boundedness of Λ along the graph of two Lipschitz functions, one positive and the other negative.

    3. For problem ( 𝒫 X , Y n , n ) with both endpoint conditions, instead of Conditions ( B y , Λ σ ) and ( B y , Λ 𝒲 ) [29, Theorem 3.1] requires the stronger Condition ( B y , Λ + ) that there is a neighborhood 𝒪 y of y ( I ) such that, for every r > 0 , Λ is bounded on I × 𝒪 y × B r . This implies, in particular, that the effective domain of Λ contains the unbounded rectangle I × 𝒪 y × n , which excludes in practice to deal with extended-valued Lagrangians. Figure 7 illustrates the comparison of the various assumptions in the case of an autonomous Lagrangian with Ψ 1 , Δ = 𝒰 = .

Figure 7 
                           The effective domain of an autonomous Lagrangian 
                                 
                                    
                                       
                                          
                                             Λ
                                             ⁢
                                             
                                                (
                                                s
                                                ,
                                                y
                                                ,
                                                v
                                                )
                                             
                                          
                                          =
                                          
                                             L
                                             ⁢
                                             
                                                (
                                                y
                                                ,
                                                v
                                                )
                                             
                                          
                                       
                                    
                                    
                                    {\Lambda(s,y,v)=L(y,v)}
                                 
                               (
                                 
                                    
                                       
                                          n
                                          =
                                          1
                                       
                                    
                                    
                                    {n=1}
                                 
                              , 
                                 
                                    
                                       
                                          𝒲
                                          =
                                          
                                             𝒲
                                             e
                                          
                                          =
                                          
                                             ℝ
                                             ×
                                             ℝ
                                          
                                       
                                    
                                    
                                    {\mathcal{W}=\mathcal{W}_{e}={\mathbb{R}}\times{\mathbb{R}}}
                                 
                               so that the topology 
                                 
                                    
                                       
                                          τ
                                          𝒲
                                       
                                    
                                    
                                    {\tau_{\mathcal{W}}}
                                 
                               is the Euclidean one) and the validity of the assumptions of [29, Theorem 3.1] compared with those of Theorem 5.4 for problem (
                                 
                                    
                                       
                                          𝒫
                                          
                                             X
                                             ,
                                             Y
                                          
                                          
                                             ℝ
                                             ,
                                             ℝ
                                          
                                       
                                    
                                    
                                    {\mathcal{P}^{{\mathbb{R}},{\mathbb{R}}}_{X,Y}}
                                 
                              ) (with 
                                 
                                    
                                       
                                          Δ
                                          =
                                          𝒰
                                          =
                                          ℝ
                                       
                                    
                                    
                                    {\Delta=\mathcal{U}={\mathbb{R}}}
                                 
                              ):
(a) Assumption (
                                 
                                    
                                       
                                          B
                                          
                                             y
                                             ,
                                             Λ
                                          
                                          +
                                       
                                    
                                    
                                    {{\rm B}^{+}_{y,\Lambda}}
                                 
                              ) in [29, Theorem 3.1] requires that there is a neighborhood 
                                 
                                    
                                       
                                          𝒪
                                          y
                                       
                                    
                                    
                                    {\mathcal{O}_{y}}
                                 
                               of 
                                 
                                    
                                       
                                          y
                                          ⁢
                                          
                                             (
                                             I
                                             )
                                          
                                       
                                    
                                    
                                    {y(I)}
                                 
                               such that, for all 
                                 
                                    
                                       
                                          r
                                          >
                                          0
                                       
                                    
                                    
                                    {r>0}
                                 
                              , Λ is bounded on 
                                 
                                    
                                       
                                          
                                             𝒪
                                             y
                                          
                                          ×
                                          
                                             B
                                             r
                                          
                                       
                                    
                                    
                                    {\mathcal{O}_{y}\times B_{r}}
                                 
                               (the gray region); in particular 
                                 
                                    
                                       
                                          D
                                          Λ
                                       
                                    
                                    
                                    {D_{\Lambda}}
                                 
                               needs to contain the unbounded strip 
                                 
                                    
                                       
                                          
                                             𝒪
                                             y
                                          
                                          ×
                                          ℝ
                                       
                                    
                                    
                                    {\mathcal{O}_{y}\times{\mathbb{R}}}
                                 
                              . (b) Hypotheses (
                                 
                                    
                                       
                                          B
                                          
                                             y
                                             ,
                                             Λ
                                          
                                          σ
                                       
                                    
                                    
                                    {{\rm B}^{\sigma}_{y,\Lambda}}
                                 
                              )–(
                                 
                                    
                                       
                                          B
                                          
                                             y
                                             ,
                                             Λ
                                          
                                          
                                             ⋐
                                             𝒲
                                          
                                       
                                    
                                    
                                    {{\rm B}^{\Subset_{\mathcal{W}}}_{y,\Lambda}}
                                 
                              ) in Theorem 5.4 require that there are 
                                 
                                    
                                       
                                          
                                             
                                                a
                                                y
                                             
                                             <
                                             0
                                          
                                          ,
                                          
                                             
                                                b
                                                y
                                             
                                             >
                                             0
                                          
                                       
                                    
                                    
                                    {a_{y}<0,b_{y}>0}
                                 
                               such that Λ is bounded on 
                                 
                                    
                                       
                                          
                                             (
                                             
                                                
                                                   
                                                      I
                                                      ×
                                                      y
                                                   
                                                   ⁢
                                                   
                                                      (
                                                      I
                                                      )
                                                   
                                                
                                                ×
                                                
                                                   {
                                                   
                                                      a
                                                      y
                                                   
                                                   ,
                                                   
                                                      b
                                                      y
                                                   
                                                   }
                                                
                                             
                                             )
                                          
                                          ∩
                                          
                                             Dom
                                             ⁡
                                             
                                                (
                                                Λ
                                                )
                                             
                                          
                                       
                                    
                                    
                                    {(I\times y(I)\times\{a_{y},b_{y}\})\cap\operatorname{Dom}(\Lambda)}
                                 
                               (strongly dotted lines) and there is 
                                 
                                    
                                       
                                          
                                             λ
                                             y
                                          
                                          >
                                          
                                             essinf
                                             ⁡
                                             
                                                {
                                                
                                                   
                                                      |
                                                      
                                                         
                                                            y
                                                            ′
                                                         
                                                         ⁢
                                                         
                                                            (
                                                            s
                                                            )
                                                         
                                                      
                                                      |
                                                   
                                                   :
                                                   
                                                      s
                                                      ∈
                                                      I
                                                   
                                                
                                                }
                                             
                                          
                                       
                                    
                                    
                                    {{\lambda_{y}}>\operatorname{essinf}\{|y^{\prime}(s)|:s\in I\}}
                                 
                               such that Λ is bounded on the subsets of the form 
                                 
                                    
                                       
                                          I
                                          ×
                                          A
                                       
                                    
                                    
                                    {I\times A}
                                 
                               where 
                                 
                                    
                                       
                                          A
                                          ⊆
                                          
                                             
                                                y
                                                ⁢
                                                
                                                   (
                                                   I
                                                   )
                                                
                                             
                                             ×
                                             
                                                B
                                                
                                                   λ
                                                   y
                                                
                                             
                                          
                                       
                                    
                                    
                                    {A\subseteq y(I)\times B_{\lambda_{y}}}
                                 
                               is relatively compact in 
                                 
                                    
                                       
                                          D
                                          Λ
                                       
                                    
                                    
                                    {D_{\Lambda}}
                                 
                               (e.g., the interior dotted region up to level 
                                 
                                    
                                       
                                          λ
                                          y
                                       
                                    
                                    
                                    {\lambda_{y}}
                                 
                              ); it is also required that 
                                 
                                    
                                       
                                          Dom
                                          ⁡
                                          
                                             (
                                             Λ
                                             )
                                          
                                       
                                    
                                    
                                    {\operatorname{Dom}(\Lambda)}
                                 
                               satisfies the assumptions of Definition 2.6.
Figure 7 
                           The effective domain of an autonomous Lagrangian 
                                 
                                    
                                       
                                          
                                             Λ
                                             ⁢
                                             
                                                (
                                                s
                                                ,
                                                y
                                                ,
                                                v
                                                )
                                             
                                          
                                          =
                                          
                                             L
                                             ⁢
                                             
                                                (
                                                y
                                                ,
                                                v
                                                )
                                             
                                          
                                       
                                    
                                    
                                    {\Lambda(s,y,v)=L(y,v)}
                                 
                               (
                                 
                                    
                                       
                                          n
                                          =
                                          1
                                       
                                    
                                    
                                    {n=1}
                                 
                              , 
                                 
                                    
                                       
                                          𝒲
                                          =
                                          
                                             𝒲
                                             e
                                          
                                          =
                                          
                                             ℝ
                                             ×
                                             ℝ
                                          
                                       
                                    
                                    
                                    {\mathcal{W}=\mathcal{W}_{e}={\mathbb{R}}\times{\mathbb{R}}}
                                 
                               so that the topology 
                                 
                                    
                                       
                                          τ
                                          𝒲
                                       
                                    
                                    
                                    {\tau_{\mathcal{W}}}
                                 
                               is the Euclidean one) and the validity of the assumptions of [29, Theorem 3.1] compared with those of Theorem 5.4 for problem (
                                 
                                    
                                       
                                          𝒫
                                          
                                             X
                                             ,
                                             Y
                                          
                                          
                                             ℝ
                                             ,
                                             ℝ
                                          
                                       
                                    
                                    
                                    {\mathcal{P}^{{\mathbb{R}},{\mathbb{R}}}_{X,Y}}
                                 
                              ) (with 
                                 
                                    
                                       
                                          Δ
                                          =
                                          𝒰
                                          =
                                          ℝ
                                       
                                    
                                    
                                    {\Delta=\mathcal{U}={\mathbb{R}}}
                                 
                              ):
(a) Assumption (
                                 
                                    
                                       
                                          B
                                          
                                             y
                                             ,
                                             Λ
                                          
                                          +
                                       
                                    
                                    
                                    {{\rm B}^{+}_{y,\Lambda}}
                                 
                              ) in [29, Theorem 3.1] requires that there is a neighborhood 
                                 
                                    
                                       
                                          𝒪
                                          y
                                       
                                    
                                    
                                    {\mathcal{O}_{y}}
                                 
                               of 
                                 
                                    
                                       
                                          y
                                          ⁢
                                          
                                             (
                                             I
                                             )
                                          
                                       
                                    
                                    
                                    {y(I)}
                                 
                               such that, for all 
                                 
                                    
                                       
                                          r
                                          >
                                          0
                                       
                                    
                                    
                                    {r>0}
                                 
                              , Λ is bounded on 
                                 
                                    
                                       
                                          
                                             𝒪
                                             y
                                          
                                          ×
                                          
                                             B
                                             r
                                          
                                       
                                    
                                    
                                    {\mathcal{O}_{y}\times B_{r}}
                                 
                               (the gray region); in particular 
                                 
                                    
                                       
                                          D
                                          Λ
                                       
                                    
                                    
                                    {D_{\Lambda}}
                                 
                               needs to contain the unbounded strip 
                                 
                                    
                                       
                                          
                                             𝒪
                                             y
                                          
                                          ×
                                          ℝ
                                       
                                    
                                    
                                    {\mathcal{O}_{y}\times{\mathbb{R}}}
                                 
                              . (b) Hypotheses (
                                 
                                    
                                       
                                          B
                                          
                                             y
                                             ,
                                             Λ
                                          
                                          σ
                                       
                                    
                                    
                                    {{\rm B}^{\sigma}_{y,\Lambda}}
                                 
                              )–(
                                 
                                    
                                       
                                          B
                                          
                                             y
                                             ,
                                             Λ
                                          
                                          
                                             ⋐
                                             𝒲
                                          
                                       
                                    
                                    
                                    {{\rm B}^{\Subset_{\mathcal{W}}}_{y,\Lambda}}
                                 
                              ) in Theorem 5.4 require that there are 
                                 
                                    
                                       
                                          
                                             
                                                a
                                                y
                                             
                                             <
                                             0
                                          
                                          ,
                                          
                                             
                                                b
                                                y
                                             
                                             >
                                             0
                                          
                                       
                                    
                                    
                                    {a_{y}<0,b_{y}>0}
                                 
                               such that Λ is bounded on 
                                 
                                    
                                       
                                          
                                             (
                                             
                                                
                                                   
                                                      I
                                                      ×
                                                      y
                                                   
                                                   ⁢
                                                   
                                                      (
                                                      I
                                                      )
                                                   
                                                
                                                ×
                                                
                                                   {
                                                   
                                                      a
                                                      y
                                                   
                                                   ,
                                                   
                                                      b
                                                      y
                                                   
                                                   }
                                                
                                             
                                             )
                                          
                                          ∩
                                          
                                             Dom
                                             ⁡
                                             
                                                (
                                                Λ
                                                )
                                             
                                          
                                       
                                    
                                    
                                    {(I\times y(I)\times\{a_{y},b_{y}\})\cap\operatorname{Dom}(\Lambda)}
                                 
                               (strongly dotted lines) and there is 
                                 
                                    
                                       
                                          
                                             λ
                                             y
                                          
                                          >
                                          
                                             essinf
                                             ⁡
                                             
                                                {
                                                
                                                   
                                                      |
                                                      
                                                         
                                                            y
                                                            ′
                                                         
                                                         ⁢
                                                         
                                                            (
                                                            s
                                                            )
                                                         
                                                      
                                                      |
                                                   
                                                   :
                                                   
                                                      s
                                                      ∈
                                                      I
                                                   
                                                
                                                }
                                             
                                          
                                       
                                    
                                    
                                    {{\lambda_{y}}>\operatorname{essinf}\{|y^{\prime}(s)|:s\in I\}}
                                 
                               such that Λ is bounded on the subsets of the form 
                                 
                                    
                                       
                                          I
                                          ×
                                          A
                                       
                                    
                                    
                                    {I\times A}
                                 
                               where 
                                 
                                    
                                       
                                          A
                                          ⊆
                                          
                                             
                                                y
                                                ⁢
                                                
                                                   (
                                                   I
                                                   )
                                                
                                             
                                             ×
                                             
                                                B
                                                
                                                   λ
                                                   y
                                                
                                             
                                          
                                       
                                    
                                    
                                    {A\subseteq y(I)\times B_{\lambda_{y}}}
                                 
                               is relatively compact in 
                                 
                                    
                                       
                                          D
                                          Λ
                                       
                                    
                                    
                                    {D_{\Lambda}}
                                 
                               (e.g., the interior dotted region up to level 
                                 
                                    
                                       
                                          λ
                                          y
                                       
                                    
                                    
                                    {\lambda_{y}}
                                 
                              ); it is also required that 
                                 
                                    
                                       
                                          Dom
                                          ⁡
                                          
                                             (
                                             Λ
                                             )
                                          
                                       
                                    
                                    
                                    {\operatorname{Dom}(\Lambda)}
                                 
                               satisfies the assumptions of Definition 2.6.
Figure 7

The effective domain of an autonomous Lagrangian Λ ( s , y , v ) = L ( y , v ) ( n = 1 , 𝒲 = 𝒲 e = × so that the topology τ 𝒲 is the Euclidean one) and the validity of the assumptions of [29, Theorem 3.1] compared with those of Theorem 5.4 for problem ( 𝒫 X , Y , ) (with Δ = 𝒰 = ): (a) Assumption ( B y , Λ + ) in [29, Theorem 3.1] requires that there is a neighborhood 𝒪 y of y ( I ) such that, for all r > 0 , Λ is bounded on 𝒪 y × B r (the gray region); in particular D Λ needs to contain the unbounded strip 𝒪 y × . (b) Hypotheses ( B y , Λ σ )–( B y , Λ 𝒲 ) in Theorem 5.4 require that there are a y < 0 , b y > 0 such that Λ is bounded on ( I × y ( I ) × { a y , b y } ) Dom ( Λ ) (strongly dotted lines) and there is λ y > essinf { | y ( s ) | : s I } such that Λ is bounded on the subsets of the form I × A where A y ( I ) × B λ y is relatively compact in D Λ (e.g., the interior dotted region up to level λ y ); it is also required that Dom ( Λ ) satisfies the assumptions of Definition 2.6.

5.1.6 The real-valued case

If Λ is real-valued, many of the assumptions of Theorem 5.4 are fulfilled and there is no need to worry about the topologies induced by an equivalence relation 𝒲 : indeed, whatever topology for n × n is involved, D Λ = n × n is open and any of its bounded subsets is well-inside D Λ . The simpler claim that results deserves to be explicit.

Corollary 5.12 (Λ real-valued: Non-occurrence of the Lavrentiev gap).

Let y W 1 , p ( I ; R n ) be such that F ( y ) < + . In addition to the Basic Assumptions, Condition (S + ) and the Radial Convexity Assumption (RC) on Λ suppose that Λ is real-valued and

  1. There is m y , Ψ > 0 such that Ψ ( s , z ) m y , Ψ for all s I , z y ( I ) .

  2. Ψ is bounded on I × y ( I ) .

  3. There is a subset A y of n enclosing the origin (e.g., a sphere) such that Λ is bounded on I × y ( I ) × ( A y 𝒰 ) .

Then:

  1. There is no Lavrentiev gap for ( 𝒫 X Δ , 𝒰 ) at y.

  2. There is no Lavrentiev gap for ( 𝒫 X , Y Δ , 𝒰 ) at y provided that, in addition,

    1. There is λ y > essinf { | y ( s ) | : s I } such that Λ is bounded on I × y ( I ) × ( B λ y 𝒰 ) .

In both cases the approximating sequence for y in energy and norm is of Lipschitz reparametrizations of y.

5.2 Non-occurrence of the Lavrentiev phenomenon

As a consequence of Theorem 5.4, we obtain the following condition ensuring the non-occurrence of the Lavrentiev phenomenon.

Corollary 5.13 (Non-occurrence of the Lavrentiev phenomenon).

In addition to the Basic Assumptions, Condition (S + ), the Structure Assumptions on Λ suppose, moreover, that for every compact subset K of Δ the following hypotheses hold:

  1. Ψ is real-valued and bounded on I × 𝒦 .

  2. There is m 𝒦 > 0 such that inf { Ψ ( s , z ) : ( s , z ) I × 𝒦 } m 𝒦 .

  3. There is a subset A 𝒦 of n enclosing the origin (e.g., a sphere) such that Λ is bounded on ( I × 𝒦 × ( A 𝒦 𝒰 ) ) Dom ( Λ ) .

Then:

  1. The Lavrentiev phenomenon does not occur for ( 𝒫 X Δ , 𝒰 ).

  2. The Lavrentiev phenomenon does not occur for ( 𝒫 X , Y Δ , 𝒰 ) if, in addition, there is an equivalence relation 𝒲 𝒲 0 in n × n such that:

    1. Dom ( Λ ) = I × D Λ and D Λ ( Δ × 𝒰 ) is interior to D Λ for τ 𝒲 ,

    2. the following condition holds:

      1. Λ is bounded on the subsets of the form I × A where A 𝒦 × 𝒰 , A 𝒲 D Λ .

Proof.

We prove Point (2) of the corollary for the two endpoint problem ( 𝒫 X , Y Δ , 𝒰 ), the case of one endpoint being similar. Let ( y j ) j be a minimizing sequence for ( 𝒫 X , Y Δ , 𝒰 ) such that

F ( y j ) inf ( 𝒫 X , Y Δ , 𝒰 ) + 1 j + 1 for all  j .

Fix j . The assumptions imply the validity of the conditions of Theorem 5.4. Indeed, the validity of Hypotheses ( P Ψ ), ( B Ψ σ ), ( B Λ σ ), ( B Λ 𝒲 ) with 𝒦 = y j ( I ) imply, respectively, ( P y j , Ψ ), ( B y j , Ψ ), ( B y j , Λ σ ), ( B y j , Λ 𝒲 ) of Theorem 5.4. The application of Theorem 5.4 yields y ¯ j Lip ( I ; n ) satisfying the desired boundary conditions and constraints and, moreover,

F ( y ¯ j ) F ( y j ) + 1 j + 1 inf ( 𝒫 X , Y Δ , 𝒰 ) + 2 j + 1 .

The conclusion follows. ∎

Remark 5.14.

If 𝒲 = 𝒲 e , in view of Remark 4.10, Condition ( B Λ 𝒲 e ) means that Λ is bounded on I × A , for every bounded subset A of Δ × 𝒰 well-inside D Λ with respect to the Euclidean distance.

Remark 5.15.

Assume, that there are α > 0 and d such that

(G${{}_{\Lambda}}$) Λ ( s , y , v ) α | v | - d for all  ( s , y , v ) I × Δ × 𝒰

and that inf { Ψ ( s , z ) : s I , z Δ } := m Ψ > 0 . Choose λ , R satisfying

λ > λ 0 := inf ( 𝒫 ) + m Ψ d | I | m Ψ α | I | , R > R 0 := | X | + | I | λ 0 ,

where 𝒫 equals, respectively, ( 𝒫 X Δ , 𝒰 ) or ( 𝒫 X , Y Δ , 𝒰 ) . Then Claim (1) (resp. Claim (2)) of Corollary 5.13 still holds if ( B Ψ σ ) is satisfied just for the compact subsets 𝒦 of B R and Conditions ( P Ψ ), ( B Λ σ ), ( B Λ 𝒲 ) are assumed to hold just for the compact subsets 𝒦 of Δ B R and ( B Λ 𝒲 ) holds just for the bounded subsets of the form I × A where A 𝒦 × ( 𝒰 B λ ) , A 𝒲 D Λ . Indeed, let ( y j ) j be a minimizing sequence as in the proof of Corollary 5.13. If Λ satisfies (G Λ ), then for every j we have

y j 1 F ( y j ) + m Ψ d | I | m Ψ α

(see [29, Corollary 3.7]). We may assume that j is big enough in such a way that

(5.1) y j 1 | I | F ( y j ) + m Ψ d | I | m Ψ α | I | inf ( 𝒫 ) + 1 j + 1 + m Ψ d | I | m Ψ α | I | λ 0 + 1 ( j + 1 ) m Ψ α | I | < λ

and

y j | X | + y j 1 | X | + λ 0 | I | + 1 ( j + 1 ) m Ψ α R 0 + 1 ( j + 1 ) m Ψ α < R ,

so that y j ( I ) B R . This property, together with (5.1), ensure the validity of Conditions ( B y j , Ψ ), ( B y j , Λ σ ), ( B y j , Λ 𝒲 ) of Theorem 5.4 (see Remark 5.7). We now proceed as in the proof of Corollary 5.13.

Remark 5.16.

When Δ = 𝒰 = n , the conclusions of Corollary 5.13 look the same as those of [29, Corollary 3.7]. The main difference here is that, for Problem ( 𝒫 X , Y n , n ), we are dealing here with extended-valued Lagrangians. The hypotheses concerning Ψ do overlap, whereas the assumptions on Λ differ quite a lot. Let us recall that, in [29, Corollary 3.7], the radial convexity in the last variable is not needed but it is required instead, for problem ( 𝒫 X n ), that

  1. for all K > 0 , there is r K > 0 such that Λ is bounded on I × B K × B r K ,

implicitly implying that B K × B r K D Λ , and for problem ( 𝒫 X , Y n ), that

  1. Λ is real-valued and bounded on bounded sets.

The proof of [29, Corollary 3.7] shows that, actually, Condition (S) required there might be replaced by the weaker Condition (S + ), see [9].

The next Example 5.17 shows how it might be useful to consider finer topologies than the Euclidean one.

Example 5.17.

Let h : ] 0 , + [ be a map that is not lower semicontinuous, and let Λ ( s , y , v ) be the indicator function of the complement of the epigraph of h, namely

Λ ( s , y , v ) = { 0 , v < h ( y ) , + , otherwise .    for all  ( s , y , v ) I × × .

Then D Λ = { ( y , v ) : v < h ( y ) } is not open in 2 for the Euclidean metric; however, since the y-sections of D Λ are the open intervals ] - , h ( y ) [ , it follows that D Λ is open for the topology of the y-sections τ 𝒲 1 introduced in Example 4.7. Since Λ is bounded on its domain, by choosing 𝒲 = 𝒲 1 , all the assumptions of Corollary 5.13 are satisfied (with Δ = 𝒰 = ) so that there is no Lavrentiev phenomenon for the two endpoints problem.

Many of the assumptions of Corollary 5.13 are satisfied for real-valued, continuous Lagrangians. Moreover, there is no need in this case to consider any admissible topology τ 𝒲 : the resulting simplified claim deserves to be explicit. With respect to [28, Corollary 5.9], Corollary 5.18 deals with a more general class of Lagrangians and with the presence of a state constraint.

Corollary 5.18 (Non-occurrence of the Lavrentiev phenomenon for ( P X , Y Δ , 𝒰 ) – real-valued case).

Let Ψ and Λ be real-valued. Suppose, in addition to the Basic Assumptions, Condition (S + ) and the Radial Convexity Assumption (RC) on Λ, that:

  1. inf Ψ > 0 ,

  2. Ψ , Λ are bounded on bounded sets.

Then the Lavrentiev phenomenon does not occur for ( 𝒫 X , Y Δ , 𝒰 ).

Remark 5.19.

The claim and the assumptions of Corollary 5.18 almost overlap with those of [29, Corollary 3.7]. The extra Radial Convexity Assumption (RC) on Λ, required here, allows to deal with the constraint y ( I ) Δ , not considered in [29].

Open Question 1.

Can one build an example of a Lagrangian and set Δ for which the Lavrentiev phenomenon occurs for the constrained problem y ( I ) Δ and all the assumptions of Corollary 5.18 but the radial convexity (RC) are fulfilled? In that case notice that, owing to [29, Corollary 3.7], there is no Lavrentiev phenomenon in the unconstrained case (i.e., without taking care of the condition y ( I ) Δ ).

6 Preliminary results to the proof of the main result

In this section we formulate the fundamental tools of the proof of Theorem 5.4: Lemma 6.1 shows how radial convexity is used to lower the value of the energy at a given trajectory y by means of a DBR type subgradient (Definition 2.9); Lemma 6.2 illustrates how the conditions in Theorem 5.4 give some crucial bounds of the DBR subgradient. Finally, Lemma 6.5, needed just for the two endpoint problem ( 𝒫 X , Y Δ , 𝒰 ), illustrates the role of the topological requirements on the effective domain of Λ.

6.1 Lowering the energy via convexity

The part of the proof of Theorem 5.4 concerning the estimate of the energy F along the built Lipschitz approximating sequence is based on the following inequalities, both consequences of the Radial Convexity Assumption (RC) that allow, given an admissible trajectory, to lower the value of the functional along one suitable reparametrization.

Lemma 6.1 (Estimates through DBR subgradients).

Assume the validity of the Structure Assumptions. Let, for all ( s , z , v ) Dom ( Λ ) , P ( s , z , v ) be a DBR type subgradient of Λ at ( s , z , v ) . Let ( s , y , u ) Dom ( Λ ) and μ > 0 be such that ( s , y , u μ ) Dom ( Λ ) . Then

(6.1) Λ ( s , y , u μ ) μ Λ ( s , y , u ) + P ( s , y , u μ ) ( μ - 1 ) .

Proof.

Writing that u = u μ 1 μ , the subgradient inequality applied at ( s , y , u μ ) gives

Λ ( s , y , u ) 1 μ - Λ ( s , y , u μ ) = Λ ( s , y , u μ 1 μ ) 1 μ - Λ ( s , y , u μ ) P ( s , y , u μ ) ( 1 μ - 1 ) .

Multiplying both terms of the inequality by μ, we get

Λ ( s , y , u ) - Λ ( s , y , u μ ) μ P ( s , y , u μ ) ( 1 - μ ) ;

the conclusion follows. ∎

6.2 A crucial boundedness property on the DBR type subgradients

Let P ( s , z , v ) be a DBR type subgradient for Λ at ( s , z , v ) Dom ( Λ ) , y Δ , v 𝒰 . The monotonicity of the convex subgradients implies that

sup μ 1 P ( s , z , μ v ) P ( s , z , v ) < + , inf μ 1 P ( s , z , μ v ) P ( s , z , v ) > - .

Lemma 6.2 shows that, under some suitable local boundedness assumptions on Λ, these estimates become somewhat uniform as v varies, respectively, out of a ball and inside a ball of given radii. One difficulty is that P ( s , z , v ) may tend to - when ( s , z , v ) approaches Dom ( Λ ) c : for this purpose we focus as much as possible on sets that are well-inside the domain for a suitable topology.

We refer to [9, Theorem 4.2] for the proof of Lemma 6.2.

Lemma 6.2 (Boundedness of the DBR type subgradients [8, Theorem 4.2]).

Suppose that Λ satisfies the Basic and the Structure Assumptions. Let K Δ be a bounded set and let W W 0 be an equivalence relation on R n × R n . Let, for any ( s , z , v ) Dom ( Λ ) ( I × Δ × U ) , P ( s , z , v ) be a DBR type subgradient of Λ at ( s , z , v ) .

  1. Assume that there exists a set A 𝒦 enclosing the origin such that Λ is bounded on

    ( I × 𝒦 × ( A 𝒦 𝒰 ) ) Dom ( Λ ) .

    Then, for all r > r 𝒦 := sup { | v | : v A 𝒦 𝒰 } ,

    (6.2) sup s I , z 𝒦 , | v | r ( z , v ) D Λ v 𝒰 P ( s , z , v ) < + .

  2. Suppose that there exists λ 𝒦 > 0 such that Λ is bounded on the subsets of the form I × A , where A 𝒦 × ( B λ 𝒦 𝒰 ) , A 𝒲 D Λ . Then, for all 0 < λ < λ 𝒦 ,

    (6.3) - < inf s I , z 𝒦 , | v | λ dist 𝒲 ( ( z , v ) , D Λ c ) ρ v 𝒰 P ( s , z , v ) for all  ρ > 0 .

Remark 6.3.

Taking into account the geometric interpretation of the subdifferentials given in Remark 2.10:

  1. Condition (i) in Lemma 6.2 has a geometrical interpretation: the intersection with the ordinate axis of the tangent lines to r Λ ( s , z , r v ) at r = 1 are bounded below by a constant if ( s , z , v ) Dom ( Λ ) , | v | r and v 𝒰 . In the smooth case Condition (6.2) may be rewritten as

    (6.4) sup s I , z 𝒦 , | v | r ( z , v ) D Λ v 𝒰 Λ ( s , z , v ) - v v Λ ( s , z , v ) < + .

  2. Similarly, Condition (ii) in Lemma 6.2 geometrically means that the intersection with the ordinate axis of the tangent lines to r Λ ( s , z , r v ) at r = 1 are bounded below by a constant when dist 𝒲 ( ( z , v ) , D Λ c ) ρ , z 𝒦 and v 𝒰 with | v | λ . In the smooth case Condition (ii) may be rewritten as

    (6.5) - < inf s I , z 𝒦 , | v | λ dist 𝒲 ( ( z , v ) , D Λ c ) ρ v 𝒰 Λ ( s , z , v ) - v v Λ ( s , z , v ) .

Remark 6.4.

Notice that the assumptions of Theorem 5.4 imply the validity of those of Lemma 6.2. More precisely:

  1. If ( B y , Λ σ ) of Theorem 5.4 holds, then Condition (i) of Lemma 6.2 is satisfied with 𝒦 = y ( I ) and A 𝒦 = A y .

  2. If ( B y , Λ 𝒲 ) of Theorem 5.4 holds, then Condition (ii) of Lemma 6.2 is satisfied with 𝒦 = y ( I ) , λ 𝒦 = λ y .

6.3 Staying well-inside the domain with greater velocities

The following lemma, needed just for the problem with two endpoint conditions in the extended-valued case, explains the requirement in Claim (2) of Theorem 5.4 that D Λ ( Δ × 𝒰 ) is interior to D Λ for an admissible topology. We shall denote by | Ω | the Lebesgue measure of a subset Ω of I = [ t , T ] .

Lemma 6.5 (Staying well-inside the domain with greater velocity).

Let W W 0 be an equivalence relation in R n × R n and assume that D Λ ( Δ × U ) is interior to D Λ for τ W . Let y : I Δ be Borel, u L 1 ( I ; U ) be such that Λ ( s , y ( s ) , u ( s ) ) L 1 ( I ) and let

(6.6) λ > essinf { | u ( s ) | : s I } .

There exist μ ] 0 , 1 [ , ρ > 0 and Ω I with Lebesgue measure | Ω | > 0 such that

(6.7) | u ( s ) | μ < λ , dist 𝒲 ( ( y ( s ) , u ( s ) ) , D Λ c ) ρ , dist 𝒲 ( ( y ( s ) , u ( s ) μ ) , D Λ c ) ρ for a.e.  s Ω .

Proof.

Let, for μ ] 0 , 1 ] , Ω μ := { s I : | u ( s ) | μ < λ } . The choice of λ implies that Ω 1 is non-negligible. For every ρ > 0 let

I ρ := { s I : dist 𝒲 ( ( y ( s ) , u ( s ) ) , D Λ c ) 2 ρ } .

Since, for a.e. s I , ( y ( s ) , u ( s ) ) D Λ ( Δ × 𝒰 ) is interior to D Λ for τ 𝒲 , we have ρ > 0 I ρ = I : the continuity of the measure shows that there is ρ > 0 such that | Ω 1 I ρ | > 0 . Moreover,

μ ] 0 , 1 [ ( Ω μ I ρ ) = Ω 1 I ρ ,

so that there is μ ¯ ] 0 , 1 [ such that | Ω μ I ρ | > 0 for all μ [ μ ¯ , 1 [ : we choose μ sufficiently close to 1 in such a way that λ ( 1 - μ ¯ ) < ρ . Let Ω := Ω μ I ρ . By construction, a.e. on Ω we have

| u ( s ) | μ < λ , dist 𝒲 ( ( y ( s ) , u ( s ) ) , D Λ c ) 2 ρ .

Owing that | u ( s ) | < μ λ a.e. on Ω, we deduce that

| u ( s ) μ - u ( s ) | λ ( 1 - μ ) λ ( 1 - μ ¯ ) ρ a.e.  s Ω .

Since, for all s I , ( ( y ( s ) , u ( s ) μ ) , ( y ( s ) , u ( s ) ) 𝒲 0 𝒲 , we deduce from Proposition 4.8 that

(6.8) dist 𝒲 ( ( y ( s ) , u ( s ) μ ) , D Λ c ) ρ for a.e.  s Ω ,

proving that Ω has the desired properties. ∎

Remark 6.6.

Condition (6.6) in Lemma 6.5 plays a prominent role in establishing the validity of Clarke’s growth condition (H) in [21].

7 Proof of Theorem 5.4

The proof of Theorem 5.4 follows the lines of the proof of [28, Theorem 5.1], which is more focused on the construction not only of a Lipschitz approximating sequence, but a equi-Lipschitz one, with a Lipschitz constant that is uniform with respect to the initial time and datum. We will emphasize the new points which are:

  1. The non-occurrence of the gap for the problem with just one endpoint condition: the previous literature, except [29], did not emphasized a different set of conditions with respect to the two endpoint problem.

  2. Condition ( B y , Λ 𝒲 ) is new: it arises from [8] (see Lemma 6.2); it weakens Condition b) in [28, Proposition 4.24] since the latter requires that I × B K × B λ y Dom ( Λ ) .

  3. The new Condition (S + ), more general than (S) considered in [28].

  4. The presence of the function Ψ.

  5. We build here an approximating sequence ( y ν ) ν not only in energy, but also in W 1 , p ( I ; n ) .

As mentioned in the introduction, we build a sequence of reparametrizations ( φ ν ) ν of I in such a way that the derivative φ ν equals 1 on a big “good set” where | y | is small, a suitable value greater than 1 on a “bad” set where | y | > ν ; in the case of the problem with two endpoint conditions we need to lower φ ν on a small subset of the good set in order that φ ν ( I ) = I . We the use φ ν in order to define y ν , a reparametrization of y. Differently from previous approaches (see [1, 29]), we do not show that the energy of y ν tends to zero on the bad set, but that it is lower than the energy of y on the same set, in virtue of the radial convexity.

We fix ϵ > 0 and prove the existence of an admissible function for the desired problem y ¯ such that

  1. y ¯ is obtained via a reparametrization of y and satisfies the boundary conditions,

  2. y ¯ is bounded so that y ¯ is Lipschitz,

  3. F ( y ¯ ) F ( y ) + ϵ ,

  4. y ¯ - y L p ( I ; n ) ϵ .

7.1 Change of variables and approximations

We shall often make use of the following change of variables formula for Lebesgue integrals.

Proposition 7.1 (Change of variables for Lebesgue integrals [34]).

Let f 0 be measurable and φ : I I be bijective, absolutely continuous with φ > 0 on I. Then, for every A I , f L 1 ( A ) ( f φ ) φ L 1 ( φ - 1 ( A ) ) and

A f ( s ) 𝑑 s = φ - 1 ( A ) f ( φ ( τ ) ) φ ( τ ) 𝑑 τ .

The following approximation argument will be used in the sequel; we refer to [18] for a proof.

Lemma 7.2.

Let f L 1 ( I ) and ( φ ν ) ν be a sequence of bijective, absolutely continuous functions φ ν : I φ ν ( I ) I with a Lipschitz inverse ψ ν , such that:

  1. for all ν, φ ν > 0 on I,

  2. the sequence of Lipschitz constants ( ψ ν ) ν is bounded,

  3. for a.e. t I , φ ν ( t ) t as ν + .

Then f φ ν - f L 1 ( ψ ν ( I ) ) 0 as ν + .

7.2 Proof of Claim (2) of Theorem 5.4

Proof.

We begin by proving the more challenging Claim (2). We consider several steps.

(i) Definition of Ξ ( ν ) . Fix a Borel selection of the DBR type subgradient P ( s , z , v ) μ [ Λ ( s , z , v μ ) μ ] μ = 1 ; the existence of P is stated in [8, Proposition 2.7]. For ν > 0 we define

Ξ ( ν ) := sup s I , z y ( I ) , | v | ν ( z , v ) D Λ v 𝒰 P ( s , z , v ) .

We may assume that Ξ ( ν ) > - for all ν > 0 , otherwise there is ν > 0 such that | y ( s ) | ν a.e. on I and the conclusion of Theorem 5.4 follows trivially. As pointed out in Remark 6.4, Hypothesis ( B y , Λ σ ) ensures the validity of (i) of Lemma 6.2, with 𝒦 = y ( I ) , A 𝒦 = A y . Its application shows that, once chosen ν 0 > r y = sup { | v | : v A y 𝒰 } , we have

(7.1) Ξ ( ν ) for all  ν ν 0 .

(ii). For every ν ν 0 define S ν := { s I : | y ( s ) | > ν } , ε ν := S ν ( | y ( s ) | ν - 1 ) 𝑑 s . Then

(7.2) | S ν | 0 , 0 ε ν y 1 ν 0 as  ν + .

Indeed, ν | S ν | S ν | y ( s ) | 𝑑 s y 1 .

The purpose of Steps (iii)– (vi) is that to define the set Σ ν . They are needed just for the problem with two endpoint constraints; in the extended-valued case Step (iii) makes use of the fact that D Λ ( Δ × U ) is interior to D Λ for τ W .

(iii) Definition of Υ λ . Since D Λ ( Δ × 𝒰 ) is interior to D Λ for τ 𝒲 , owing to Lemma 6.5, for every λ > essinf { | y ( s ) | : s I } there exist a non-negligible subset Ω λ of I, ρ λ > 0 and μ λ ] 0 , 1 [ such that

(7.3) | y ( s ) | μ λ < λ , dist 𝒲 ( ( y ( s ) , y ( s ) ) , D Λ c ) ρ λ for a.e.  s Ω λ ,

and

(7.4) dist 𝒲 ( ( y ( s ) , y ( s ) μ ) , D Λ c ) ρ λ for a.e.  s Ω λ .

For every λ > essinf { | y ( s ) | : s I } we define

Υ λ := inf s I , z y ( I ) , | v | λ dist 𝒲 ( ( z , v ) , D Λ c ) ρ v 𝒰 P ( s , z , v ) .

It follows from (7.3) that Υ λ + .

(iv) Choice of Ω , λ , μ , ρ , definition of Υ. Let λ y be as in ( B y , Λ 𝒲 ) and choose λ ] essinf { | y ( s ) | : s I } , λ y [ . Set

Υ := Υ λ , Ω := Ω λ , ρ := ρ λ .

As pointed out in Remark 6.4, Hypothesis ( B y , Λ 𝒲 ) ensures the validity of (ii) of Lemma 6.2, with 𝒦 = y ( I ) , λ 𝒦 = λ y . Its application yields Υ - , so that

(7.5) Υ .

(v) Choice of Σ ν Ω with | Σ ν | = ε ν 1 - μ . Since, from (7.2), ε ν 0 as ν + , we may choose ν 1 ν 0 big enough in such a way that

(7.6) ε ν 1 - μ | Ω | for all  ν ν 1 .

We may thus select a measurable subset Σ ν of Ω such that | Σ ν | = ε ν 1 - μ .

(vi) Choice of ν 2 . We choose ν 2 max { ν 1 , λ } in such a way that

(7.7) ε ν y 1 ν ε * 2 1 2 for all  ν ν 2 ,

where ε * 1 is as in Condition (S + ), correspondingly to K = y . Notice that S ν Σ ν is negligible if ν λ . Indeed, if y ( s ) is defined and s S ν , then | y ( s ) | > ν , whereas if s Σ ν , then s Ω , whence | y ( s ) | < λ ν .

(vii) Choice of ν 3 = ν 3 ( ϵ ) and definition of Θ , Ξ . As a consequence of (B y , Ψ ),

M Ψ := sup { Ψ ( s , z ) : s I , z y ( I ) } < + .

Let h , g , η , β be as in Condition (S + ), correspondingly to K = y . Since η is BV, there are η 1 , η 2 nondecreasing such that η = η 1 - η 2 : set

(7.8) C η := η 1 + η 2 , Θ := 2 β I h ( τ , y , y ) 𝑑 τ + 4 C η g ( y , y ) .

We further choose ν 3 ν 2 large enough in such a way that

(7.9) ( y 1 ν ) β M Ψ ( Θ + Ξ ( ν 2 ) + + Υ - ) ϵ 2 for all  ν ν 3 ,

where we denote by x + (resp. x - ) the positive (resp. negative) part of a real number x. Notice that, differently from ν 0 , ν 1 , ν 2 , the natural ν 3 depends on ϵ. Since

ν ν 2 Ξ ( ν ) Ξ ( ν 2 ) Ξ ( ν ) + Ξ ( ν 2 ) +

and, recalling from (7.2) that ε ν y 1 ν , (7.9) implies that

(7.10) ε ν β M Ψ ( Θ + Ξ ( ν ) + + Υ - ) ϵ 2 for all  ν ν 3 .

From now on we set Ξ := Ξ ( ν 3 ) .

(viii) The change of variable φ ν . For ν ν 3 we introduce the following absolutely continuous change of variable φ ν : I defined by

(7.11) φ ν ( t ) := t , φ ν ( τ ) := { | y ( τ ) | ν if  τ S ν , μ if  τ Σ ν , 1 otherwise , for a.e.  τ I .

Notice that φ ν is well defined since, from Step (vi), S ν Σ ν is negligible. Clearly, φ is strictly increasing; from Steps (ii) and (v),

t T φ ν ( τ ) 𝑑 τ = S ν | y ( τ ) | ν 𝑑 τ + Σ ν μ 𝑑 τ + | I S ν Σ ν |
= ( ε ν + | S ν | ) + μ | Σ ν | + ( | I | - | S ν | - | Σ ν | )
= ε ν - ( 1 - μ ) | Σ ν | + | I | = | I | .

Therefore the image of φ ν is I and thus φ ν : I I is bijective; let us denote by ψ ν its inverse, which is absolutely continuous and even Lipschitz, since ψ ν 1 μ .

(ix). Set, for all s I ,

(7.12) y ν ( s ) := y ( ψ ν ( s ) ) .

Then y ν W 1 , p ( I ; n ) and, y ν being a reparametrization of y, we have y ν ( I ) y ( I ) Δ . Moreover, since ψ ν ( t ) = t , ψ ν ( T ) = T , it follows that y ν satisfies the desired boundary conditions. We refer to the proof of [28, Theorem 5.1] for further details.

(x) y ν W 1 , ( I ; R n ) , y ν is bounded and y ν U . Indeed, for a.e. s I ,

y ν ( s ) = { ν y ( ψ ν ( s ) ) | y ( ψ ν ( s ) ) | if  s φ ν ( S ν ) , y ( ψ ν ( s ) ) μ if  s φ ν ( Σ ν ) , y ( ψ ν ( s ) ) otherwise.

The fact that 𝒰 is a cone implies that y ν 𝒰 . Since | y ( s ) | ν a.e. out of S ν and | y | μ < λ on Σ ν , we have

(7.13) y ν max { ν , λ } = ν .

Notice that, since ν ν 3 = ν 3 ( ϵ ) , the Lipschitz constant of y ν depends in general from ϵ.

(xi). The following estimate holds:

(7.14) τ I φ ν ( τ ) - τ t T | φ ν ( s ) - 1 | 𝑑 s 2 ε ν ε * .

Indeed,

| φ ν ( τ ) - τ | t τ | φ ν ( s ) - 1 | 𝑑 s S ν ( | y ( s ) | ν - 1 ) 𝑑 s + Σ ν ( 1 - μ ) 𝑑 s
ε ν + ( 1 - μ ) | Σ ν | = 2 ε ν ε * ,

in virtue of Step (v) and (7.7).

In Steps (xii)–(xvii) we compare F ( y ν ) with F ( y ) .

(xii). For all ν ν 3 , a.e. in S ν we have

(7.15) Λ ( φ ν , y , ν y | y | ) | y | ν Λ ( φ ν , y , y ) + ( | y | ν - 1 ) Ξ .

Indeed, let ν ν 3 and s S ν . By applying Lemma 6.1 with μ = | y ( s ) | ν we get

Λ ( φ ν ( s ) , y ( s ) , ν y ( s ) | y ( s ) | ) | y ( s ) | ν Λ ( φ ν ( s ) , y ( s ) , y ( s ) ) + P ( φ ν ( s ) , y ( s ) , ν y ( s ) | y ( s ) | ) ( | y ( s ) | ν - 1 ) .

Since ν y ( s ) | y ( s ) | 𝒰 , y ( s ) y ( I ) , and ( y ( s ) , ν y ( s ) | y ( s ) | ) D Λ we have

P ( φ ν ( s ) , y ( s ) , ν y ( s ) | y ( s ) | ) sup s I , z y ( I ) , | v | ν ( z , v ) D Λ v 𝒰 P ( s , z , v ) = Ξ ( ν ) Ξ ( ν 3 ) ;

inequality (7.15) then follows immediately from the fact that | y ( s ) | ν 1 .

(xiii). This step, needed just for the problem with two endpoint conditions ( 𝒫 X , Y Δ , 𝒰 ), is essential in order to estimate the energy of y ν on Σ ν :

(7.16) Λ ( φ ν , y , y μ ) μ Λ ( φ ν , y , y ) - ( 1 - μ ) Υ a.e. in  Σ ν for all  ν ν 3 .

Indeed, since Σ ν Ω for ν ν 3 , from (7.4), a.e. in Σ ν we have

| y ( s ) | μ < λ , dist 𝒲 ( ( y ( s ) , y ( s ) μ ) , D Λ c ) ρ for all  ν ν 3 .

By applying Lemma 6.1, recalling that μ < 1 , we obtain

Λ ( φ ν , y , y μ ) μ - Λ ( φ ν , y , y ) P ( φ ν , y , y μ ) ( μ - 1 ) ( μ - 1 ) inf s I , z y ( I ) , | v | λ dist 𝒲 ( ( z , v ) , D Λ c ) ρ v 𝒰 P ( s , z , v ) = ( μ - 1 ) Υ ,

whence (7.16).

(xiv). The following estimate of F ( y ν ) holds:

(7.17) F ( y ν ) I Λ ( φ ν , y , y ) Ψ ( φ ν , y ) 𝑑 τ + ε ν M Ψ ( Ξ + + Υ - ) for all  ν ν 3 .

Indeed, we have

(7.18) F ( y ν ) = I Λ ( s , y ν , y ν ) Ψ ( s , y ν ) 𝑑 s .

Taking into account (7.12), the change of variables s = φ ν ( τ ) , τ I yields (in what follows, for the sake of brevity, we omit the independent variables τ , s ):

(7.19) F ( y ν ) = I [ Λ ( φ ν , y , y φ ν ) φ ν ] Ψ ( φ ν , y ) 𝑑 τ = 𝒥 Σ ν + 𝒥 S ν + 𝒥 I ( Σ ν S ν ) ,

where we set

𝒥 Σ ν := Σ ν [ Λ ( φ ν , y , y μ ) μ ] Ψ ( φ ν , y ) 𝑑 τ ,
𝒥 S ν := S ν [ Λ ( φ ν , y , ν y | y | ) | y | ν ] Ψ ( φ ν , y ) 𝑑 τ ,
𝒥 I ( Σ ν S ν ) := I ( Σ ν S ν ) Λ ( φ ν , y , y ) Ψ ( φ ν , y ) 𝑑 τ .

  1. Estimate of J Σ ν . Since, for ν ν 3 , Σ ν Ω and | Σ ν | = ε ν 1 - μ , it is immediate from (7.16) of Step (xiii) that

    (7.20)

    𝒥 Σ ν Σ ν Λ ( φ ν , y , y ) Ψ ( φ ν , y ) 𝑑 τ + ( 1 - μ ) Υ - M Ψ | Σ ν |
    Σ ν Λ ( φ ν , y , y ) Ψ ( φ ν , y ) 𝑑 τ + Υ - M Ψ ε ν .

  2. Estimate of J S ν . We deduce from (7.15) that, a.e. in S ν ,

    (7.21) [ Λ ( φ ν , y , ν y | y | ) | y | ν ] Ψ ( φ ν , y ) Λ ( φ ν , y , y ) Ψ ( φ ν , y ) + ( | y | ν - 1 ) M Ψ Ξ + ,

    whence

    (7.22) 𝒥 S ν S ν Λ ( φ ν , y , y ) Ψ ( φ ν , y ) 𝑑 τ + Ξ + M Ψ ε ν .

Therefore, (7.17) follows now immediately from (7.18), (7.19), (7.20) and (7.22).

(xv) Choice of ν 4 . Since φ ν ( τ ) τ pointwise, from the fact that Λ ( s , y , y ) L 1 ( I ) , the boundedness Condition (B y , Ψ ) and the continuity of Ψ ( , y ) , we may choose ν 4 ν 3 big enough in such a way that

(7.23) | I Λ ( s , y , y ) ( Ψ ( φ ν , y ) - Ψ ( s , y ) ) 𝑑 s | ϵ 2 for all  ν ν 4 .

(xvi). The following estimate holds:

(7.24) I Λ ( φ ν , y , y ) Ψ ( φ ν , y ) 𝑑 τ F ( y ) + ε ν β M Ψ Θ + ϵ 2 for all  ν ν 4 .

Indeed, by adding and subtracting of both Λ ( τ , y , y ) Ψ ( φ ν , y ) and Λ ( τ , y , y ) Ψ ( τ , y ) , a.e. on I we obtain

Λ ( φ ν , y , y ) Ψ ( τ , y ) = Q 1 , ν ( τ ) + Q 2 , ν ( τ ) + Λ ( τ , y , y ) Ψ ( τ , y ) ,

where

Q 1 , ν ( τ ) := Λ ( φ ν , y , y ) Ψ ( φ ν , y ) - Λ ( τ , y , y ) Ψ ( φ ν , y )

and

Q 2 , ν ( τ ) := Λ ( τ , y , y ) Ψ ( φ ν , y ) - Λ ( τ , y , y ) Ψ ( τ , y ) .

It follows from (7.23) that

(7.25) I | Q 2 , ν ( τ ) | 𝑑 τ ϵ 2 for all  ν ν 4 .

Condition (S + ) (corresponding to K := y ) and Step (xi) imply that

I | Q 1 , ν ( τ ) | 𝑑 τ M Ψ I | Λ ( φ ν , y , y ) - Λ ( τ , y , y ) | 𝑑 τ
M Ψ ( 2 ε ν ) β I h ( τ , y , y ) 𝑑 τ + M Ψ g ( y , y ) I | η ( φ ν ) - η ( τ ) | 𝑑 τ .

From [4, Lemma 2.10] we have

I | η ( φ ν ) - η ( τ ) | 𝑑 τ 2 C η φ ν - τ 4 C η ε ν ,

where C η was defined in (7.8) of Step (vii). Since ε ν ε ν β , it follows from the definition of Θ in Step (vii) that

I | Q 1 , ν ( τ ) | 𝑑 τ 2 β M Ψ ε ν β I h ( τ , y , y ) 𝑑 τ + 4 C η g ( y , y ) ε ν M Ψ ε ν β Θ ,

which, together with (7.25), give (7.24).

(xvii) Final estimate of F ( y ν ) . From (7.17) of Step (xiv) and (7.24) of Step (xvi), recalling that ε ν ε ν β , we obtain

(7.26) F ( y ν ) F ( y ) + ε ν β M Ψ ( Θ + Ξ + + Υ - ) + ϵ 2 for all  ν ν 4 .

It follows from (7.10) that F ( y ν ) F ( y ) + ϵ .

(xviii) Proof of the convergence of y ν to y. Since y ν ( t ) = X for all ν, it is enough to prove that ( y ν ) ν converges to y in L p ( I ; n ) . The change of variable s = φ ν ( τ ) gives

(7.27)

I | y ν - y | p 𝑑 s = I | y ( ψ ν ) φ ν ( ψ ν ) - y | p 𝑑 s = I | y φ ν - y ( φ ν ) | p φ ν 𝑑 τ
I ( S ν Σ ν ) * d τ + Σ ν * d τ + S ν * d τ

where in the above * stands for | y φ ν - y ( φ ν ) | p φ ν . It follows from the definition of φ ν in Step (viii) that:

  1. Since φ ν = 1 on I ( S ν Σ ν ) , it follows that

    (7.28) I ( S ν Σ ν ) * d τ = I ( S ν Σ ν ) | y - y ( φ ν ) | p 𝑑 τ I | y - y ( φ ν ) | p 𝑑 τ 0 as  ν + ,

    as a consequence of Lemma 7.2 since, from Step (xi), φ ν ( τ ) - τ 0 .

  2. Recall that φ ν = μ on Σ ν . Therefore

    (7.29) Σ ν * d τ := Σ ν | y μ - y ( φ ν ) | p φ ν 𝑑 τ 2 p ( 1 μ p - 1 Σ ν | y | p 𝑑 τ + φ ν ( Σ ν ) | y | p 𝑑 s ) .

    Since | Σ ν | 0 as ν + then Σ ν | y | p 𝑑 τ 0 as ν + . Moreover, from Step (v), if ν ν 4 ,

    | φ ν ( Σ ν ) | = φ ν ( Σ ν ) 1 𝑑 s = Σ ν φ ν 𝑑 τ = μ | Σ ν | = μ 1 - μ ε ν 0 as  ν + .

    It follows from (7.29) that

    (7.30) Σ ν | y φ ν - y ( φ ν ) | p φ ν 𝑑 τ 0 as  ν + .

  3. In S ν we have φ ν = | y | ν 1 , so that

    (7.31)

    S ν * d τ := S ν | y φ ν - y ( φ ν ) | p φ ν 𝑑 τ
    2 p ( S ν | y φ ν | p φ ν 𝑑 τ + S ν | y ( φ ν ) | p φ ν 𝑑 τ ) 2 p ( S ν | y | p 𝑑 τ + φ ν ( S ν ) | y | p 𝑑 s ) .

    Since y L p ( I ) and, from Step (ii), | S ν | 0 as ν + , we have S ν | y | p 𝑑 τ 0  as  ν + . Moreover,

    | φ ν ( S ν ) | = φ ν ( S ν ) 1 𝑑 s = S ν φ ν 𝑑 τ = S ν | y | ν 𝑑 τ y 1 ν 0 as  ν + .

    It follows from (7.31) that

    (7.32) S ν | y φ ν - y ( φ ν ) | p φ ν 𝑑 τ 0 as  ν + .

We deduce from (7.27), together with (7.28), (7.30) and (7.32) that y ν - y p 0 as ν + , which concludes the proof of Claim (2) of Theorem 5.4. ∎

7.3 Proof of Claim (1) of Theorem 5.4

Proof.

The proof is similar than that of Claim (2), from which we actually skip many steps. We just summarize the main issues. A self contained in the particular case where Ψ 1 can be found in [30]. As in the proof of [29, Theorem 3.1, Claim 1], the change of variables φ ν may map the interval I onto an interval φ ν ( I ) that contains I but may be larger than I, with | φ ν ( I ) | | I | as ν + .

Referring to the proof of Claim (2) of Theorem 5.4, we do not need here to build the sets Υ, Ω and Σ ν : as we pointed out before Step (iii), we thus do not need that D Λ ( Δ × 𝒰 ) is interior to D Λ for some topology τ 𝒲 .

  1. We keep Steps (i) and (ii).

  2. We skip Steps (iii)–(vi). We do not need to choose λ , μ , ρ nor to define the set Σ ν ; we set ν 1 = ν 0 .

  3. In Step (vii) we set Σ ν := , Ω := I , Υ = 0 .

We then modify as follows the steps of the proof of Claim (2):

(viii’) The change of variable φ ν . We introduce the following absolutely continuous change of variable φ ν : I defined by

φ ν ( t ) := t , φ ν ( τ ) := { | y ( τ ) | ν if  τ S ν , 1 otherwise , for a.e.  τ I

As in the proof of Claim (2), φ ν is absolutely continuous, strictly increasing but now, since φ ν 1 on I, φ ν ( I ) is now an interval containing I : we denote by ψ ν the restriction of the inverse of φ ν to I: ψ ν is Lipschitz, moreover ψ ν ( I ) I , ψ ν ( t ) = t and | ψ ν ( I ) | | I | as ν + .

(ix’). Set, for all s I , y ν ( s ) := y ( ψ ν ( s ) ) . Then y ν W 1 , p ( I ; n ) satisfies the boundary condition y ( t ) = X and y ν ( I ) Δ , y ν 𝒰 . Notice that

(7.33) y ν ( s ) = { ν y ( ψ ν ( s ) ) | y ( ψ ν ( s ) ) | if  ψ ν ( s ) S ν , y ( ψ ν ( s ) ) otherwise.

The new fact is that now y ν ( t ) = y ( ψ ν ( t ) ) = y ( t ) = X but y ν ( T ) = y ( ψ ν ( T ) ) = y ( t ν ) for some t ν T , and thus it may happen that y ν ( T ) = y ( t ν ) y ( T ) .

We now proceed as in the proof of Claim (2) up to Step (xii), skipping Step (xiii), not needed here. We need a little more care in the estimates in the last steps, the change of variable being now s = φ ν ( τ ) , with τ ψ ν ( I ) I .

(xiv’). Condition (7.17) becomes now

F ( y ν ) ψ ν ( I ) Λ ( φ ν , y , y ) Ψ ( φ ν , y ) 𝑑 τ + ε ν M Ψ Ξ + .

Similarly, all the integrals where Λ ( φ ν , ) appears are now computed just on ψ ν ( I ) instead of I. In the proof of the claim we set 𝒥 Σ ν := 0 . Since ψ ν ( I ) I , instead of (7.19) we obtain

(7.34) I Λ ( s , y ν , y ν ) Ψ ( s , y ν ) 𝑑 s = ψ ν ( I ) [ Λ ( φ ν , y , y φ ν ) φ ν ] Ψ ( φ ν , y ) 𝑑 τ .

(xvi’). We choose ν 4 as in Step (xv). Instead of (7.24), one gets

(7.35) I Λ ( φ ν , y , y ) Ψ ( φ ν , y ) 𝑑 τ ψ ν ( I ) Λ ( τ , y , y ) Ψ ( τ , y ) 𝑑 τ + ε ν β M Ψ Θ + ϵ 2 for all  ν ν 4 .

(xvii’). Inequality (7.26) is now

(7.36) F ( y ν ) ψ ν ( I ) Λ ( s , y , y ) Ψ ( s , y ) 𝑑 s + ε ν β M Ψ ( Θ + Ξ + ) + ϵ 2 F ( y ) + ε ν β M Ψ ( Θ + Ξ + ) + ϵ 2 for all  ν ν 4 .

(xviii’). Similar arguments apply to the proof of the convergence of y ν to y in L 1 ( I ) , taking into account that (7.27) becomes

(7.37)

I | y ν - y | p 𝑑 s = I | y ( ψ ν ) φ ν ( ψ ν ) - y | p 𝑑 s = ψ ν ( I ) | y φ ν - y ( φ ν ) | p φ ν 𝑑 τ
= ψ ν ( I ) S ν | y φ ν - y ( φ ν ) | p φ ν 𝑑 τ + S ν | y φ ν - y ( φ ν ) | p φ ν 𝑑 τ .

The proof is complete. ∎

Remark 7.3.

In the case of a final endpoint constraint, instead of the initial one, Claim (1) of Theorem 5.4 may be obtained by slightly modifying Step (viii’) of the proof: indeed, it is enough to define

φ ν ( T ) := T , φ ν ( τ ) = { | y ( τ ) | ν if  τ S ν , 1 otherwise , for a.e.  τ I .

Remark 7.4.

For problems with one endpoint constraint, or when Λ is real-valued, the proof of Theorem 5.4 is constructive. Indeed, for problems with one endpoint condition, the approximating functions y ν are defined by y ν := y ψ ν , where φ ν is defined in Step (viii’) and depends just on y and the set S ν := { s I : | y ( s ) | > ν } . For problems with conditions at both endpoints involving real-valued Lagrangians, there is no need of Step (iii). From Step (iv) on, we have

Υ := inf s I , z y ( I ) , | v | λ v 𝒰 P ( s , z , v ) .

It is then enough to define Ω = { s I : | y ( s ) | < λ } , choose a subset Σ ν of Ω as in Step (v), i.e., in such a way that ( 1 - μ ) | Σ ν | = S ν | y ( s ) | ν 𝑑 s . One then defines the reparametrization φ ν as in Step (viii). An explicit approximating sequence is built in Example 8.1.

8 Examples

In Example 3.6, we showed directly that the Lavrentiev gap at a minimizer y * (and, consequently, the Lavrentiev phenomenon) does not occur for the problem with the initial prescribed condition y ( 0 ) = 0 by exhibiting a Lipschitz minimizer satisfying the same initial condition. In Example 8.1 we show how the procedure of the proof of Theorem 5.4 allows to construct a sequence ( y ν ) ν of Lipschitz functions such that, not only F ( y ν ) F ( y * ) , but also y ν y * in the W 1 , p norm.

Example 8.1 (A constructive method).

Consider the Lagrangian considered in Example 3.6. We already noticed that Λ satisfies the Basic and Structure Assumptions. Moreover, Λ is equal to 0 and thus bounded on every subset of Dom ( Λ ) : therefore Assumption 2(a) of Theorem 5.4 is satisfied at the minimizer y * ( s ) = s . We follow the steps of the proof (Section 7.3) to construct a family of Lipschitz approximating sequence (in energy and in norm) that share the initial endpoint constraint y ( 0 ) = 0 . Let ν , ν 1 . Notice that, since y * ( s ) = 1 2 s ,

S ν = [ 0 , 1 4 ν 2 ] .

Following Step (viii’) of the proof of Theorem 5.4, the change of variable φ ν : [ 0 , 1 ] is defined by

φ ν ( 0 ) := 0 , φ ν ( τ ) := { y * ( τ ) ν = 1 2 ν s if  τ [ 0 , 1 4 ν 2 ] , 1  otherwise , for a.e.  τ [ 0 , 1 ]

Therefore we have

φ ν ( τ ) := { τ ν if  τ [ 0 , 1 4 ν 2 ] , 1 4 ν 2 + τ otherwise .

Notice that φ ν ( 1 ) = 1 2 ν 2 + 1 > 1 ; we set

t ν = φ ν - 1 ( 1 ) = 1 - 1 4 ν 2 .

The inverse ψ ν of φ ν , restricted to [ 0 , 1 ] is thus defined by

ψ ν ( s ) := { s 2 ν 2 if  s [ 0 , 1 2 ν 2 ] , s - 1 4 ν 2 if  s [ 1 2 ν 2 , 1 ] .

Then ψ ν ( [ 0 , 1 ] ) = [ 0 , t ν ] . The function y ν = y * ψ ν is thus defined by:

y ν ( s ) := { s ν if  s [ 0 , 1 2 ν 2 ] , s - 1 4 ν 2 if  s [ 1 2 ν 2 , 1 ] .

Figure 8 depicts the graphs of some of these approximations for some values of ν.

Figure 8 
                     The absolutely continuous function 
                           
                              
                                 
                                    
                                       
                                          y
                                          *
                                       
                                       ⁢
                                       
                                          (
                                          s
                                          )
                                       
                                    
                                    :=
                                    
                                       s
                                    
                                 
                              
                              
                              {y_{*}(s):=\sqrt{s}}
                           
                         (above all) and some of its Lipschitz approximations in norm and in energy with prescribed initial datum 
                           
                              
                                 
                                    
                                       y
                                       ⁢
                                       
                                          (
                                          0
                                          )
                                       
                                    
                                    =
                                    0
                                 
                              
                              
                              {y(0)=0}
                           
                         (from below: 
                           
                              
                                 
                                    
                                       y
                                       1
                                    
                                    ,
                                    
                                       y
                                       2
                                    
                                    ,
                                    
                                       y
                                       3
                                    
                                    ,
                                    
                                       y
                                       *
                                    
                                 
                              
                              
                              {y_{1},y_{2},y_{3},y_{*}}
                           
                        ), following the recipe of the proof of Theorem 5.4.
Figure 8

The absolutely continuous function y * ( s ) := s (above all) and some of its Lipschitz approximations in norm and in energy with prescribed initial datum y ( 0 ) = 0 (from below: y 1 , y 2 , y 3 , y * ), following the recipe of the proof of Theorem 5.4.

Notice that

y ν ( s ) = { ν if  s s [ 0 , 1 2 ν 2 ] , 1 2 s - 1 4 ν 2 if  s [ 1 2 ν 2 , 1 ] ,

so that | y ν | ν , y ν ( 0 ) = 0 , y ν ( [ 0 , 1 ] ) [ 0 , 1 ] and ( y ν , y ν ) D Λ , i.e., y ν 1 2 y ν , as expected from the proof of Theorem 5.4. Therefore F ( y ν ) = 0 F ( y ) and it can be easily checked that y ν y * in the W 1 , 1 norm: indeed, since y * y ν on [ 1 2 ν 2 , 1 ] ,

0 1 | y ν - y * | 𝑑 s = 0 1 2 ν 2 | ν - y * ( s ) | 𝑑 s + 1 2 ν 2 1 ( y * ( s ) - y ν ( s ) ) 𝑑 s

and

0 1 2 ν 2 | ν - y * ( s ) | 𝑑 s 0 1 2 ν 2 ν 𝑑 s + y * ( 1 2 ν 2 ) 0

as ν + and

1 2 ν 2 1 ( y * ( s ) - y ν ( s ) ) 𝑑 s = ( y * ( 1 ) - y ν ( 1 ) ) - ( y * ( 1 2 ν 2 ) - y ν ( 1 2 ν 2 ) )
= ( 1 - 1 - 1 4 ν 2 ) - ( 1 2 ν - 1 2 ν ) 0

as ν + .

In the next example we show how the constructive method in the proof of Theorem 5.4 may lead to a sequence ( y ν ) ν that converges to y in W 1 , 1 but such that lim sup ν F ( y ν ) > F ( y ) if some of the assumptions are not fulfilled.

Example 8.2.

Consider Manià’s example, where the Lagrangian

L ( s , y , v ) = Λ ( s , y , v ) Ψ ( s , v ) , Λ ( s , y , v ) = v 6 , Ψ ( s , y ) = ( y 3 - s ) 2 ,

and let y * ( s ) := s 1 3 . The Lavrentiev gap occurs at y * (see [12]) for the two endpoint problem y ( 0 ) = 0 , y ( 1 ) = 1 . We showed in Example 3.3 that there is no gap at y * if one considers, instead, just the endpoint condition y ( 1 ) = 1 . This fact, however, is not a consequence of Theorem 5.4. Indeed,

inf { Ψ ( s , y * ( s ) ) : s [ 0 , 1 ] } = 0

and

Λ ( s , y * ( s ) , y * ( s ) ) = ( y * ( s ) ) 6 = 1 3 6 s - 4 L 1 ( [ 0 , 1 ] ) .

This raises the following:

Open Question 2.

Can the non-occurrence of the gap with the endpoint constraint y ( 1 ) = 1 be established here from a general result, more general than Theorem 5.4?

We show here how, in the proof of Theorem 5.4, the violation of the integrability of Λ ( s , y * ( s ) , y * ( s ) ) affects the construction of the Lipschitz sequence ( y ν ) ν , with prescribed initial datum y ν ( 1 ) = 1 , in such a way that F ( y ν ) - F ( y * ) diverges.

Fix ν 2 . Following Step (viii’) in the case of a prescribed endpoint and taking into account Remark 7.3, we define

φ ν ( 1 ) = 1 , φ ν ( τ ) := { | y * ( τ ) | ν if  y * ( τ ) > ν , 1 otherwise , for all  τ [ 0 , 1 ] .

Thus we obtain, for all τ [ 0 , 1 ] ,

φ ν ( τ ) := { τ 1 3 ν - c ν if  0 τ < τ ν := ( 1 3 ν ) 3 2 , τ if  τ ν τ 1 , c ν := 2 τ ν = 2 3 3 ν 3 2 .

Notice that, as expected,

φ ν ( 0 ) = - c ν < 0 .

The inverse ψ ν of φ ν , restricted to [ 0 , 1 ] is thus defined for all s [ 0 , 1 ] by

ψ ν ( s ) := { ( s + c ν ) 3 ν 3 if  0 s < τ ν s if  τ ν s 1 .

Following Step (ix) we therefore define, for all s [ 0 , 1 ] ,

y ν ( s ) := y * ( ψ ν ( s ) ) = { ( s + c ν ) ν if  0 s < τ ν , y * ( s ) if  τ ν s 1 .

Notice that of course y ν ( 1 ) = 1 ; however, y ν ( 0 ) = ν c ν 0 , as expected. The proof of Theorem 5.4 ensures that y ν is Lipschitz for big values of ν and ( y ν ) ν converges to y * both in W 1 , 1 and in energy. Let us check these facts directly.

  1. Lipschitzianity. For all s [ 0 , 1 ] we have

    y ν ( s ) := { ν if  0 s < τ ν , 1 3 s - 2 3 1 3 τ ν - 2 3 ν if  τ ν s 1 .

  2. Convergence in W 1 , 1 . Since y ν ( 1 ) = y * ( 1 ) , we have

    y ν - y * 1 = 0 τ ν | y ν - y * | 𝑑 s + τ ν 1 | y ν - y * | 𝑑 s = 0 τ ν | y ν - y * | 𝑑 s
    = 0 τ ν | ν - y * | 𝑑 s 0 τ ν y * ( s ) 𝑑 s + ν τ ν τ ν 1 3 + ν ( 1 3 ν ) 3 2 0 as  ν + .

  3. Failure of the convergence in energy. We have

    F ( y ν ) - F ( y * ) = 0 τ ν ( y ν 3 ( s ) - s ) 2 ( y ν ( s ) ) 6 𝑑 s = 0 τ ν ( ( s + 2 τ ν ) 3 ν 3 - s ) 2 ν 6 𝑑 s
    α ν 3 2 as  ν +

    for some positive constant α. Thus F ( y ν ) - F ( y * ) + as ν + .

Figure 9 
                     The functions 
                           
                              
                                 
                                    
                                       y
                                       *
                                    
                                    ,
                                    
                                       y
                                       2
                                    
                                    ,
                                    
                                       y
                                       3
                                    
                                    ,
                                    
                                       y
                                       10
                                    
                                 
                              
                              
                              {y_{*},y_{2},y_{3},y_{10}}
                           
                         in Example 8.2.
Figure 9

The functions y * , y 2 , y 3 , y 10 in Example 8.2.

9 Further developments and questions

Remark 9.1.

  1. The proof of Theorem 5.4 relies on the fact that, for some λ > essinf { | y ( s ) | : s I } , ρ > 0 and P ( s , z , v ) μ [ Λ ( s , z , v μ ) μ ] μ = 1 ,

    (9.1) - < inf s I , z y ( I ) , | v | λ dist 𝒲 ( ( z , v ) , D Λ c ) ρ v 𝒰 P ( s , z , v ) ,

    and there is r > 0 such that

    (9.2) sup s I , z y ( I ) , | v | r ( z , v ) D Λ v 𝒰 P ( s , z , v ) < + .

Open Question 3.

Properties (9.1)–(9.2) are named Growth Condition (M t B ) in [28] where it is compared with other growth conditions. Lemma 6.2 shows that Hypotheses ( B y , Λ σ ) and ( B y , Λ 𝒲 ) provide a sufficient condition for the validity of (9.1)–(9.2). Can they be weakened?

  1. The conclusions of the paper may be easily extended to Lagrangians of the form i = 1 M Λ i ( s , z , v ) Ψ i ( s , z ) , assuming that each pair ( Ψ i , Λ i ) satisfies the assumptions for ( Ψ , Λ ) .

  2. The proof of Theorem 5.4 shows that, in the boundedness conditions for Ψ and Λ one can replace I × y ( I ) with a neighbourhood of the graph of y in I × y ( I ) .

  3. The main results can be generalized easily to an optimal control problem of the form

    J ( y , u ) := I Λ ( s , y ( s ) , u ( s ) ) Ψ ( s , y ( s ) ) 𝑑 s

    subject to

    (D) { y W 1 , p ( I ; n ) , y = b ( y ) u  a.e. in  I , y ( t ) = x , u ( s ) 𝒰  a.e.  s I , y ( s ) Δ  for all  s I ,

    where

    1. the control u : I m is in L p ( I ; m ) ,

    2. b : n L ( n , m ) (the space of linear functions from n to m ) is a Borel measurable function, bounded on bounded sets,

    3. the control set 𝒰 m is a cone.

    In this context the Lavrentiev gap does not occur at ( y , u ) if there exists a sequence ( y ν , u ν ) ν pairs in Lip ( I , n ) × L ( I ; m ) satisfying (D) and the given boundary conditions, and moreover:

    1. lim ν + F ( y ν , u ν ) = F ( y , u ) (approximation in energy),

    2. ( y ν , u ν ) ( y , u ) in L ( I ; n ) × L p ( I ; m ) as ν + .

    The proof can be obtained following the lines of the proof of Theorem 5.4 with the obvious modifications (see the proof of [28, Theorem 5.1]); the details, in the case where Ψ 1 , can be found in [30].


Dedicated to Francis Clarke on his 75th birthday



Communicated by Frank Duzaar


Funding statement: This research received partial funding from the University of Padua under the grant SID 2018 “Controllability, stabilizability and infimum gaps for control systems”, protocol BIRD 187147. Additionally, it was supported by the European Union – NextGenerationEU under the National Recovery and Resilience Plan (NRRP), Mission 4 Component 2 Investment 1.1 – Call PRIN 2022 No. 104 of February 2, 2022 by the Italian Ministry of University and Research; Project 2022238YY5 (subject area: PE – Physical Sciences and Engineering) titled “Optimal control problems: Analysis, approximation and applications”. Furthermore, this project was successfully completed with contributions from the UMI Group TAA “Approximation Theory and Applications”.

Acknowledgements

I am deeply thankful to Piernicola Bettiol and Raphaël Cerf for their gracious hospitality during my visits to the Université de Bretagne Occidentale (Brest) and the École Normale Supérieure de Paris, respectively. The welcoming and intellectually stimulating environments at these institutions greatly enhanced my ability to work on and refine this paper. My gratitude also extends to Pierre Bousquet and Giulia Treu for their insightful questions and keen attention throughout the preparation of our upcoming survey on our recent findings regarding the Lavrentiev phenomenon. I owe a special thank you to Giuseppe De Marco for the valuable discussions on the admissible topologies addressed in Section 4 and to the referees for their thoughtful remarks and suggestions.

References

[1] G. Alberti and F. Serra Cassano, Non-occurrence of gap for one-dimensional autonomous functionals, Calculus of Variations, Homogenization and Continuum Mechanics (Marseille 1993), Ser. Adv. Math. Appl. Sci. 18, World Scientific, River Edge (1994), 1–17. Search in Google Scholar

[2] L. Ambrosio, O. Ascenzi and G. Buttazzo, Lipschitz regularity for minimizers of integral functionals with highly discontinuous integrands, J. Math. Anal. Appl. 142 (1989), no. 2, 301–316. 10.1016/0022-247X(89)90001-2Search in Google Scholar

[3] J. M. Ball and V. J. Mizel, One-dimensional variational problems whose minimizers do not satisfy the Euler–Lagrange equation, Arch. Ration. Mech. Anal. 90 (1985), no. 4, 325–388. 10.1007/BF00276295Search in Google Scholar

[4] J. Bernis, P. Bettiol and C. Mariconda, Value function for nonautonomous problems in the calculus of variations, Math. Control Relat. Fields (2024), to appear. 10.3934/mcrf.2024045Search in Google Scholar

[5] P. Bettiol and C. Mariconda, A new variational inequality in the calculus of variations and Lipschitz regularity of minimizers, J. Differential Equations 268 (2020), no. 5, 2332–2367. 10.1016/j.jde.2019.09.011Search in Google Scholar

[6] P. Bettiol and C. Mariconda, Regularity and necessary conditions for a Bolza optimal control problem, J. Math. Anal. Appl. 489 (2020), no. 1, Article ID 124123. 10.1016/j.jmaa.2020.124123Search in Google Scholar

[7] P. Bettiol and C. Mariconda, A Du Bois–Reymond convex inclusion for nonautonomous problems of the calculus of variations and regularity of minimizers, Appl. Math. Optim. 83 (2021), no. 3, 2083–2107. 10.1007/s00245-019-09620-ySearch in Google Scholar

[8] P. Bettiol and C. Mariconda, Uniform boundedness for the optimal controls of a discontinuous, non-convex Bolza problem, ESAIM Control Optim. Calc. Var. 29 (2023), Paper No. 12. 10.1051/cocv/2022079Search in Google Scholar

[9] P. Bettiol, G. De Marco and C. Mariconda, A useful subdifferential in the Calculus of Variations, J. Nonlinear Anal., to appear. Search in Google Scholar

[10] P. Bousquet, C. Mariconda and G. Treu, On the Lavrentiev phenomenon for multiple integral scalar variational problems, J. Funct. Anal. 266 (2014), no. 9, 5921–5954. 10.1016/j.jfa.2013.12.020Search in Google Scholar

[11] G. Buttazzo and M. Belloni, A survey on old and recent results about the gap phenomenon in the calculus of variations, Recent Developments in Well-Posed Variational Problems, Math. Appl. 331, Kluwer Academic, Dordrecht (1995), 1–27. 10.1007/978-94-015-8472-2_1Search in Google Scholar

[12] G. Buttazzo, M. Giaquinta and S. Hildebrandt, One-Dimensional Variational Problems, Oxford Lecture Ser. Math. Appl. 15, Oxford University, New York, 1998. 10.1093/oso/9780198504658.001.0001Search in Google Scholar

[13] G. Buttazzo and V. J. Mizel, Interpretation of the Lavrentiev phenomenon by relaxation, J. Funct. Anal. 110 (1992), no. 2, 434–460. 10.1016/0022-1236(92)90038-KSearch in Google Scholar

[14] A. Cellina, The classical problem of the calculus of variations in the autonomous case: Relaxation and Lipschitzianity of solutions, Trans. Amer. Math. Soc. 356 (2004), no. 1, 415–426. 10.1090/S0002-9947-03-03347-6Search in Google Scholar

[15] A. Cellina and A. Ferriero, Existence of Lipschitzian solutions to the classical problem of the calculus of variations in the autonomous case, Ann. Inst. H. Poincaré C Anal. Non Linéaire 20 (2003), no. 6, 911–919. 10.1016/s0294-1449(03)00010-6Search in Google Scholar

[16] A. Cellina, A. Ferriero and E. M. Marchini, Reparametrizations and approximate values of integrals of the calculus of variations, J. Differential Equations 193 (2003), no. 2, 374–384. 10.1016/S0022-0396(02)00176-6Search in Google Scholar

[17] A. Cellina, G. Treu and S. Zagatti, On the minimum problem for a class of non-coercive functionals, J. Differential Equations 127 (1996), no. 1, 225–262. 10.1006/jdeq.1996.0069Search in Google Scholar

[18] R. Cerf and C. Mariconda, Occurrence of gap for one-dimensional scalar autonomous functionals with one end point condition, preprint (2022), https://arxiv.org/abs/2209.03820; to appear in Ann. Sc. Norm. Super. Pisa Cl. Sci. (5). 10.2422/2036-2145.202209_007Search in Google Scholar

[19] R. Cerf and C. Mariconda, The Lavrentiev phenomenon, preprint (2024), https://arxiv.org/abs/2404.02901; to appear in Amer. Math. Monthly. Search in Google Scholar

[20] F. Clarke, Functional Analysis, Calculus of Variations and Optimal Control, Grad. Texts in Math. 264, Springer, London, 2013. 10.1007/978-1-4471-4820-3Search in Google Scholar

[21] F. H. Clarke, An indirect method in the calculus of variations, Trans. Amer. Math. Soc. 336 (1993), no. 2, 655–673. 10.1090/S0002-9947-1993-1118823-3Search in Google Scholar

[22] F. H. Clarke and R. B. Vinter, Regularity properties of solutions to the basic problem in the calculus of variations, Trans. Amer. Math. Soc. 289 (1985), no. 1, 73–98. 10.1090/S0002-9947-1985-0779053-3Search in Google Scholar

[23] M. Colombo and G. Mingione, Bounded minimisers of double phase variational integrals, Arch. Ration. Mech. Anal. 218 (2015), no. 1, 219–273. 10.1007/s00205-015-0859-9Search in Google Scholar

[24] G. Dal Maso and H. Frankowska, Autonomous integral functionals with discontinuous nonconvex integrands: Lipschitz regularity of minimizers, DuBois–Reymond necessary conditions, and Hamilton-Jacobi equations, Appl. Math. Optim. 48 (2003), no. 1, 39–66. 10.1007/s00245-003-0768-4Search in Google Scholar

[25] W. B. Gordon, A minimizing property of Keplerian orbits, Amer. J. Math. 99 (1977), no. 5, 961–971. 10.2307/2373993Search in Google Scholar

[26] J. Malý and W. P. Ziemer, Fine Regularity of Solutions of Elliptic Partial Differential Equations, Math. Surveys Monogr. 51, American Mathematical Society, Providence, 1997. 10.1090/surv/051Search in Google Scholar

[27] B. Manià, Sopra un esempio di Lavrentieff, Boll. Unione Mat. Ital. 13 (1934), 147–153. 10.1007/BF02413436Search in Google Scholar

[28] C. Mariconda, Equi-Lipschitz minimizing trajectories for non coercive, discontinuous, non convex Bolza controlled-linear optimal control problems, Trans. Amer. Math. Soc. Ser. B 8 (2021), 899–947. 10.1090/btran/80Search in Google Scholar

[29] C. Mariconda, Non-occurrence of gap for one-dimensional non-autonomous functionals, Calc. Var. Partial Differential Equations 62 (2023), no. 2, Paper No. 55. 10.1007/s00526-022-02391-5Search in Google Scholar

[30] C. Mariconda, Non-occurrence of the Lavrentiev gap for a Bolza type optimal control problem with state constraints and no end cost, Commun. Optim. Theory 12 (2023), 1–18. Search in Google Scholar

[31] C. Mariconda and G. Treu, Lipschitz regularity of the minimizers of autonomous integral functionals with discontinuous non-convex integrands of slow growth, Calc. Var. Partial Differential Equations 29 (2007), no. 1, 99–117. 10.1007/s00526-006-0059-4Search in Google Scholar

[32] C. Mariconda and G. Treu, Non-occurrence of a gap between bounded and Sobolev functions for a class of nonconvex Lagrangians, J. Convex Anal. 27 (2020), no. 4, 1247–1259. Search in Google Scholar

[33] C. Mariconda and G. Treu, Non-occurrence of the Lavrentiev phenomenon for a class of convex nonautonomous Lagrangians, Open Math. 18 (2020), no. 1, 1–9. 10.1515/math-2020-0001Search in Google Scholar

[34] J. Serrin and D. E. Varberg, A general chain rule for derivatives and the change of variables formula for the Lebesgue integral, Amer. Math. Monthly 76 (1969), 514–520. 10.1080/00029890.1969.12000249Search in Google Scholar

[35] L. Tonelli, Fondamenti di calcolo delle variazioni. I., N. Zanichelli, Bologna, 1922. Search in Google Scholar

[36] R. Vinter, Optimal Control, Mod. Birkhäuser Class., Birkhäuser, Boston, 2000. Search in Google Scholar

[37] A. J. Zaslavski, Nonoccurrence of the Lavrentiev phenomenon for many optimal control problems, SIAM J. Control Optim. 45 (2006), no. 3, 1116–1146. 10.1137/050640370Search in Google Scholar

Received: 2023-08-17
Accepted: 2024-08-17
Published Online: 2024-10-02
Published in Print: 2025-01-01

© 2024 Walter de Gruyter GmbH, Berlin/Boston

This work is licensed under the Creative Commons Attribution 4.0 International License.

Downloaded on 5.11.2025 from https://www.degruyterbrill.com/document/doi/10.1515/acv-2023-0096/html
Scroll to top button