Startseite Gradient estimates for an orthotropic nonlinear diffusion equation
Artikel Open Access

Gradient estimates for an orthotropic nonlinear diffusion equation

  • Pierre Bousquet , Lorenzo Brasco , Chiara Leone EMAIL logo und Anna Verde
Veröffentlicht/Copyright: 3. Dezember 2021

Abstract

We consider a quasilinear degenerate parabolic equation driven by the orthotropic p-Laplacian. We prove that local weak solutions are locally Lipschitz continuous in the spatial variable, uniformly in time.

MSC 2010: 35K65; 35B65; 35K92

1 Introduction

1.1 Aim of the paper

Let Ω N be an open bounded set and I an open bounded interval. We study the gradient regularity of local weak solutions to the following parabolic equation:

(1.1) u t = i = 1 N ( | u x i | p - 2 u x i ) x i in  I × Ω .

Evolution equations of this type have been studied since the 1960s, especially by the Soviet school, see for example the paper [27] by Vishik. Equation (1.1) also explicitly appears in the monographs [21], [23, Example 4.A, Chapter III] and [29, Example 30.8], among others.

In this paper, we will focus on the case p 2 . We first observe that (1.1) looks quite similar to the more familiar one

(1.2) u t = Δ p u in  I × Ω ,

which involves the p-Laplace operator

Δ p u = i = 1 N ( | u | p - 2 u x i ) x i .

Indeed, both parabolic equations are particular instances of equations of the type

u t = div F ( u )

with F : N a convex function which satisfies the structural conditions

F ( z ) , z 1 C | z | p and | F ( z ) | C | z | p - 1 for every  z N .

Then the basic regularity theory equally applies to both (1.1) and (1.2). The standard reference in the field is DiBenedetto’s monograph [13], where one can find boundedness results for the solution u (see [13, Chapter V]), Hölder continuous estimates for u (see [13, Chapter III]), as well as Harnack inequality for positive solutions (see [13, Chapter VI]). At a technical level, there is no distinction to be made between (1.1) and (1.2).

In contrast, when coming to the regularity of u (i.e. boundedness and continuity), the situation becomes fairly more complicated. Let us start from (1.2). DiBenedetto and Friedman [14] have proved that the gradients of solutions to this equation are bounded. This is the starting point to obtain the continuity of the gradients for any

p > 2 N N + 2 .

We refer again to DiBenedetto’s book for a comprehensive collection of results on the subject, notably to [13, Chapter VIII]. Since then, there has been a growing literature concerning the regularity for nonlinear, possibly degenerate or singular, parabolic equations (or systems), the main model of which is given by the evolutionary p-Laplacian equation (1.2). Without any attempt to completeness, we can just mention some classical references [15, 12, 28, 8, 9], up to the most recent contributions on the subject, given by [2, 19, 18], among others.

However, none of these results apply to our equation (1.1). Indeed, all of them rely on the fact that the loss of ellipticity of the operator div F is restricted to a single point, since the Hessian D 2 F behaves as in the model case (1.2)

D 2 F ( z ) ξ , ξ 1 C | z | p - 2 | ξ | 2 ,

where the elliptic character is lost only for z = 0 . Such a property dramatically breaks down for our equation (1.1). Indeed, in this case, the function F has the following orthotropic structure:

(1.3) F ( z ) = 1 p i = 1 N | z i | p for every  z N .

The Hessian matrix of F now degenerates on an unbounded set, namely the set of those z N such that one component z i is 0. As a consequence, the aforementioned references do not provide any regularity results for the gradients of the solutions.

The main goal of the present paper is to prove the L bound on u for our equation (1.1), thus extending the result by DiBenedetto and Friedman to this more degenerate setting. In order to do this, we will need to adapt to the parabolic setting the machinery that we developed in [3, 4, 5, 6] and [7], for degenerate equations with orthotropic structure. Indeed, the operator

i = 1 N ( | u x i | p - 2 u x i ) x i ,

that we called orthotropic p-Laplacian, is the prominent example of this kind of equations. We also refer to [11] for an approach to this operator, based on viscosity techniques.

1.2 Main result

In this paper, we establish the following regularity result which can be seen as the parabolic counterpart of our previous result [5, Theorem 1.1] for the elliptic case. In the statement below, the notation u refers to the spatial variables, i.e. u = ( u x 1 , , u x N ) .

Main Theorem.

Let p > 2 and let u L loc p ( I ; W loc 1 , p ( Ω ) ) be a local weak solution of (1.1). Then u L loc ( I × Ω ) . More precisely, for every parabolic cube

𝒬 τ , R ( t 0 , x 0 ) := ( t 0 - τ , t 0 ) × ( x 0 - R , x 0 + R ) N I × Ω ,

and every 0 < σ < 1 , we have

(1.4) u L ( 𝒬 σ τ , σ R ( t 0 , x 0 ) ) C 1 ( 1 - σ ) N + 2 2 ( τ R 2 ) 1 2 ( 𝒬 τ , R ( t 0 , x 0 ) | u | p 𝑑 t 𝑑 x ) 1 2 + C ( ( 1 - σ ) R 2 τ ) 1 p - 2

for a constant C = C ( N , p ) > 0 .

Remark 1.1 (Scalings).

We observe that equation (1.1) is invariant with respect to the “horizontal” and “vertical” scale changes

u λ , μ ( t , x ) = μ u ( μ p - 2 λ p t , λ x )

for every λ , μ > 0 . Then it is easily seen that the a priori estimate (1.4) is invariant with respect to these scale changes. We point out that such estimate is the exact analogue of that for the evolutionary p-Laplacian, see [13, Theorem 5.1, Chapter VIII]. Occasionally, in the paper we will work with anisotropic parabolic cubes of the type

Q R ( t 0 , x 0 ) = ( t 0 - R p , t 0 ) × ( x 0 - R , x 0 + R ) N .

The choice of cubes of this type could be loosely justified by a dimensional analysis of the equation. Indeed, by considering the quantity u as dimensionless and using the family of scalings

( t , x ) ( λ p t , λ x ) for every  λ > 0 ,

we get the relation

time ( length ) p .

However, as is well known, estimates on cubes of the type Q R are too restrictive when looking at C 0 , α estimates for u . Indeed, in light of the so-called intrinsic geometry, it is much more important to work with local estimates on cubes 𝒬 τ , R , where the time scale τ is adapted to the solution itself: roughly speaking, we can take

τ R 2 | u | p - 2 .

This explains the importance of having (1.4) with two independent scales R and τ. We refer to [13, Chapter VIII] for a description of the method of intrinsic scalings, where these heuristics are clarified.

Remark 1.2 (Case 1 < p 2 ).

When p = 2 , the orthotropic parabolic equation (1.1) boils down to the standard heat equation, for which solutions are well known to be smooth. For this reason, in our statement we restrict our attention to the case p > 2 . However, we point out that by making the choice τ = R 2 and taking the limit as p goes to 2 in (1.4), we formally end up with the classical gradient estimate for solutions of the heat equation

u L ( 𝒬 σ R 2 , σ R ( t 0 , x 0 ) ) C ( 1 - σ ) N + 2 2 ( 𝒬 R 2 , R ( t 0 , x 0 ) | u | 2 𝑑 t 𝑑 x ) 1 2 .

In light of the previous remark, in the case p = 2 the relation

time ( length ) 2

is now the natural one.

As for the singular case p < 2 , this is somehow simpler than its degenerate counterpart. In this case, the local Lipschitz regularity of solutions to (1.1) can be directly inferred from [22, Theorem 1], under the restriction

p > 2 N N + 2 .

Indeed, the result of [22] covers (among others) the case of parabolic equations of the form

u t = div F ( u ) ,

under the following assumptions on the convex function F:

| F ( z ) | C | z | p - 1    and    D 2 F ( z ) ξ , ξ 1 C | z | p - 2 | ξ | 2 for every  ξ N , z N { 0 } .

It is not difficult to see that the orthotropic function (1.3) for p < 2 matches both requirements. Indeed, observe that

D 2 F ( z ) ξ , ξ = ( p - 1 ) i = 1 N | z i | p - 2 | ξ | 2 ( p - 1 ) | z | p - 2 | ξ | 2 ,

thanks to the fact that p - 2 < 0 . Thus in the subquadratic case, the orthotropic structure helps more than it hurts, in a sense.

Remark 1.3 (Anisotropic diffusion).

We conclude this part by observing that, more generally, one could consider the following parabolic equation:

u t = i = 1 N ( | u x i | p i - 2 u x i ) x i in  Ω × I ,

which still has an orthotropic structure. Now we have a whole set of exponents 1 < p 1 p 2 p N , one for each coordinate direction. We cite the paper [24], where some global Lipschitz regularity results are proven for solutions of the relevant Cauchy–Dirichlet problem, under appropriate regularity assumptions on the data. We point out that in light of their global nature, for p 1 = = p N = p > 2 such results are not comparable to ours. We also refer to [10] for a sophisticated Harnack inequality for positive local weak solutions, as well as for some further references on the problem. Finally, the very recent paper [16] contains a thorough study of the Cauchy problem in the case p i < 2 , together with some regularity results.

However, as for the counterpart of our Main Theorem for local solutions of this equation, this is still an open problem, to the best of our knowledge.

1.3 Technical aspects of the proof

The core of the proof of the Main Theorem is an a priori Lipschitz estimate for smooth solutions of the orthotropic parabolic equation, see Proposition 4.1 below. More precisely, we introduce the regularized problem

( u ε ) t = div F ε ( u ε ) ,

where F ε is a smooth uniformly convex approximation of the orthotropic function (1.3). By the classical regularity theory, the maps u ε are regular enough to justify all the calculations below. The goal is to establish a local uniform Lipschitz estimate on u ε , which does not depend on the regularization parameter ε. Finally, we let ε go to 0 and prove that the family u ε converges to the original solution u. This allows to obtain the Lipschitz estimate for u itself.

In the subsequent part of this subsection, we emphasize the main difficulties to get such a Lipschitz estimate on u ε . In order to simplify the presentation, we drop the index ε both for F ε and u ε . The strategy is apparently quite classical: we rely on a Moser iterative scheme of reverse Hölder’s inequalities, resulting from the interplay between Caccioppoli estimates and the Sobolev embeddings.

To be more specific, we first differentiate the equation with respect to a spatial variable x j , so as to get the equation solved by the j-th component of the gradient. This is given by

(1.5) I × Ω ( u x j ) t φ 𝑑 t 𝑑 x + I × Ω ( F ( u ) ) x j , φ 𝑑 t 𝑑 x = 0 for every  φ C 0 ( I × Ω ) .

More generally, the composition of the component u x j with a non-negative convex function h is a subsolution of this equation. Accordingly, the map h ( u x j ) satisfies the Caccioppoli inequality which is naturally attached to (1.5) (see Lemma 3.1 below). If I = ( T 0 , T 1 ) and τ ( T 0 , T 1 ) , this reads as follows:

(1.6)

χ ( τ ) { τ } × Ω h 2 ( u x j ) η 2 𝑑 x + ( T 0 , τ ) × Ω D 2 F ( u ) h ( u x j ) , h ( u x j ) χ η 2 𝑑 t 𝑑 x
( T 0 , τ ) × Ω χ η 2 h 2 ( u x j ) 𝑑 t 𝑑 x + ( T 0 , τ ) × Ω D 2 F ( u ) η , η h 2 ( u x j ) χ 𝑑 t 𝑑 x .

Here, the maps χ C 0 ( I ) and η C 0 ( Ω ) are non-negative cut-off functions in the time and space variables, respectively. We have used the following expedient notation: given f L 1 ( I × Ω ) ,

{ τ } × Ω f 𝑑 x := Ω f ( τ , x ) 𝑑 x for a.e.  τ I .

When F is a uniformly elliptic integrand, in the sense that

1 C | ξ | 2 D 2 F ( u ) ξ , ξ C | ξ | 2 for every  ξ N ,

one can easily obtain from (1.6) a crucial “unnatural” feature of the subsolution h ( u x j ) ; that is, a sort of reverse Poincaré inequality where the Sobolev norm of h ( u x j ) is controlled by the L 2 norm of the subsolution itself. In conjunction with the Sobolev inequality, this is the cornerstone which eventually leads to the classical version of the Moser iterative scheme. It should be noticed that this strategy still works even in the degenerate case, provided the Hessian behaves like

1 C | u | p - 2 | ξ | 2 D 2 F ( u ) ξ , ξ C | u | p - 2 | ξ | 2 for every  ξ N ,

as for the evolutionary p-Laplace equation (1.2). It is sufficient to use the “absorption of degeneracy” trick, where the degenerate weight | u | p - 2 is recombined with the subsolution h ( u x j ) by means of simple algebraic manipulations. This still permits to infer from (1.6) a control on the Sobolev norm of a suitable convex function of u x j . This is nowadays a standard technique in the field; for the elliptic case, it goes back to the pioneering works by Ural’tseva [26] and Uhlenbeck [25].

As we explained above, due to the severe degeneracy of D 2 F in our orthotropic situation, it is not possible to follow the same path. In order to rely on such an absorption trick, we have to go through a tour de force and to introduce a new family of weird Caccioppoli inequalities (see Lemma 3.2 below). These are the parabolic counterparts of a corresponding estimate introduced in the elliptic setting in [3] and then fruitfully exploited in [5].

The crucial idea is to mix together the components of the gradient with respect to 2 orthogonal directions. This compensates the lack of ellipticity of D 2 F and allows to rely on the Sobolev embeddings in the iterative scheme. We do not detail these Caccioppoli-type estimates here, but instead explain the main additional difficulties with respect to the elliptic framework.

Let us come back for one instant to the standard Caccioppoli inequality (1.6). It follows from (1.5) by taking φ = h h ( u x j ) χ η 2 . In particular, the parabolic term is given by

I × Ω ( u x j ) t φ 𝑑 t 𝑑 x = 1 2 I × Ω ( h 2 ( u x j ) ) t χ η 2 𝑑 t 𝑑 x .

Then an integration by parts yields

I × Ω ( u x j ) t φ 𝑑 t 𝑑 x = - 1 2 I × Ω h 2 ( u x j ) χ η 2 𝑑 t 𝑑 x .

The latter yields the “time slice” term on the left-hand side of (1.6). This way, the time derivative is transferred to χ and one can handle the factor h 2 ( u x j ) as in the elliptic framework.

For the weird Caccioppoli inequalities, the test function is now φ = u x j Φ ( u x j 2 ) Ψ ( u x k 2 ) χ η 2 , for some 1 j , k N . The corresponding parabolic term becomes

I × Ω ( u x j ) t φ 𝑑 t 𝑑 x = 1 2 I × Ω ( Φ ( u x j 2 ) ) t Ψ ( u x k 2 ) χ η 2 𝑑 t 𝑑 x .

In contrast to the previous situation, we cannot perform an integration by parts to get rid of the time derivative on Φ ( u x j 2 ) , since it would affect the factor Ψ ( u x k 2 ) . In order to overcome this difficulty, which does not arise in the elliptic setting, we need a new approach, aimed at “symmetrizing” the above quantity containing u x j and u x k .

Basically, we merge together two weird Caccioppoli inequalities, where the spatial variables x j and x k play symmetric roles. More specifically, we insert into (1.5) the test functions

φ = u x j Φ ( u x j 2 ) Ψ ( u x k 2 ) χ η 2 , φ ~ = u x k Ψ ( u x k 2 ) Φ ( u x j 2 ) χ η 2 ,

and then add the two resulting inequalities. The parabolic term is now replaced by the following quantity:

1 2 I × Ω ( Φ ( u x j 2 ) Ψ ( u x k 2 ) ) t χ η 2 𝑑 t 𝑑 x .

This allows to integrate by parts and transfer the time derivative on the test function. It turns out that by a suitable adaptation of the arguments that we used in the elliptic case, one can incorporate this new term in the iterative Moser scheme. This finally leads to the desired local L estimate on u .

1.4 Plan of the paper

The paper is organized as follows: after collecting the basic terminology and some preliminaries on Steklov averages in Section 2, we present in Section 3 the proofs of the new Caccioppoli inequalities in the parabolic setting. We detail the iterative Moser scheme in Section 4 and finally establish the Main Theorem in Section 5, by transferring to the original solution u the a priori estimates obtained on the approximating solutions u ε .

2 Preliminaries

2.1 Local solutions

Let Ω N be an open bounded set and I an open bounded interval. Fix p > 2 and take 𝒜 : N N a continuous function such that

𝒜 ( z ) - 𝒜 ( w ) , z - w 0 for every  z , w N

and

𝒜 ( z ) , z 1 C | z | p and | 𝒜 ( z ) | C | z | p - 1 for every  z N .

We say that u L loc p ( I ; W loc 1 , p ( Ω ) ) is a local weak solution of the quasilinear diffusion equation

(2.1) u t = div 𝒜 ( u ) in  I × Ω

if for every φ C 0 ( I × Ω ) we have

- I × Ω u φ t 𝑑 t 𝑑 x + I × Ω 𝒜 ( u ) , φ 𝑑 t 𝑑 x = 0 .

2.2 Steklov averages

Throughout the paper, we denote by T 0 < T 1 the endpoints of the time interval I. Let v L loc 1 ( I × Ω ) . For every 0 < σ < T 1 - T 0 , we define its so-called

  1. forward Steklov average

    v σ + ( t , x ) = t t + σ v ( τ , x ) 𝑑 τ for every  ( t , x ) ( T 0 , T 1 - σ ) × Ω ,

  2. backward Steklov average

    v σ - ( t , x ) = t - σ t v ( τ , x ) 𝑑 τ for every  ( t , x ) ( T 0 + σ , T 1 ) × Ω .

We shall use some standard properties of the Steklov averages. Let 0 < σ < T 1 - T 0 and ψ L ( I × Ω ) such that ψ is compactly supported in ( T 0 , T 1 - σ ) × Ω . We extend ψ by 0 on ( I ) × Ω , so that ψ σ - is well defined on I × Ω and compactly supported therein. By the Fubini theorem, we have

(2.2) ( T 0 , T 1 - σ ) × Ω v σ + ψ 𝑑 t 𝑑 x = I × Ω v ψ σ - 𝑑 t 𝑑 x .

Moreover, if v L loc q ( I × Ω ) for some 1 q < , then v σ + converges to v in L loc q ( I × Ω ) , as σ goes to 0, see e.g. [13, Chapter I, Lemma 3.2].

Finally, we can derive from (2.2) the following regularity properties of the Steklov averages:

Lemma 2.1.

Let v L loc 1 ( I × Ω ) . Then for every 0 < σ < T 1 - T 0 ,

  1. the map v σ + belongs to W loc 1 , 1 ( ( T 0 , T 1 - σ ) ; L loc 1 ( Ω ) ) and

    (2.3) ( v σ + ) t ( t , x ) = v ( t + σ , x ) - v ( t , x ) σ for a.e.  ( t , x ) ( T 0 , T 1 - σ ) × Ω ,

  2. if one further assumes that v L loc 1 ( I ; W loc 1 , 1 ( Ω ) ) , then ( v σ + ) L loc 1 ( ( T 0 , T 1 - σ ) × Ω ) and

    (2.4) ( v σ + ) = ( v ) σ + .

Proof.

Fix 0 < σ < T 1 - T 0 . Let ψ C 0 ( ( T 0 , T 1 - σ ) × Ω ) . Then by (2.2),

( T 0 , T 1 - σ ) × Ω v σ + ψ t 𝑑 t 𝑑 x = I × Ω v ( ψ t ) σ - 𝑑 t 𝑑 x = I × Ω v ( t , x ) ψ ( t , x ) - ψ ( t - σ , x ) σ 𝑑 t 𝑑 x .

By an obvious change of variables, this yields

( T 0 , T 1 - σ ) × Ω v σ + ψ t 𝑑 t 𝑑 x = - ( T 0 , T 1 - σ ) × Ω v ( t + σ , x ) - v ( t , x ) σ ψ ( t , x ) 𝑑 t 𝑑 x ,

which gives the desired identity (2.3).

In order to prove (2.4), we rely again on (2.2), this time tested with ψ x j in place of ψ, for some 1 j N ,

( T 0 , T 1 - σ ) × Ω v σ + ψ x j 𝑑 t 𝑑 x = I × Ω v ( ψ x j ) σ - 𝑑 t 𝑑 x = I × Ω v ( ψ σ - ) x j 𝑑 t 𝑑 x .

In the last equality, we have derived under the integral sign the smooth function ψ. Hence, by integrating by parts the last integral and using (2.2) again, one gets

I × Ω v σ + ψ x j 𝑑 t 𝑑 x = - ( T 0 , T 1 - σ ) × Ω ( v x j ) σ + ψ 𝑑 t 𝑑 x ,

from which (2.4) follows. ∎

3 Energy estimates for a regularized equation

3.1 An approximating equation

We denote by

G ( ξ ) = 1 p ( 1 + | ξ | 2 ) p 2 for every  ξ N ,

and for every ε ( 0 , 1 ) , we consider the convex function

(3.1) F ε ( ξ ) = 1 p i = 1 N | ξ i | p + ε G ( ξ ) for every  ξ N .

We consider a local weak solution u ε L loc p ( I ; W loc 1 , p ( Ω ) ) of equation (2.1) with the choice

𝒜 ( z ) = F ε ( z ) .

This means that u ε verifies

(3.2) - I × Ω u ε φ t 𝑑 t 𝑑 x + I × Ω F ε ( u ε ) , φ 𝑑 t 𝑑 x = 0

for every φ C 0 ( I × Ω ) . Observe that the map F ε belongs to C 2 ( N ) and satisfies

ε ( 1 + | ξ | 2 ) p - 2 2 | ζ | 2 D 2 F ε ( ξ ) ζ , ζ ( 1 + ε ) ( p - 1 ) ( 1 + | ξ | 2 ) p - 2 2 | ζ | 2 for every  ξ , ζ N .

Hence, one can rely on the classical regularity theory for quasilinear parabolic equations, see e.g. [13, Theorem 5.1, Chapter VIII] and [1, Lemma 3.1], to get

(3.3) u ε L loc ( I × Ω ) and u ε L loc 2 ( I ; W loc 2 , 2 ( Ω ) ) .

In the following computations, we delete the index ε both for u and F.

3.2 An equation for the spatial gradient

In order to establish a Lipschitz bound on our solution u, we need to differentiate (3.2) with respect to the spatial variables x j , 1 j N .

Fix 0 < σ < T 1 - T 0 . Let ψ C 0 ( ( T 0 , T 1 - σ ) × Ω ) . As already observed, the backward Steklov average

φ ( t , x ) = ψ σ - ( t , x ) for  ( t , x ) I × Ω

is compactly supported in I × Ω . We can thus insert it into (3.2):

- I × Ω u ( ψ σ - ) t 𝑑 t 𝑑 x + I × Ω F ( u ) , ψ σ - 𝑑 t 𝑑 x = 0 .

Since ( ψ σ - ) t = ( ψ t ) σ - , equation (2.2) implies that

- I × Ω u ( ψ σ - ) t 𝑑 t 𝑑 x = - ( T 0 , T 1 - σ ) × Ω u σ + ψ t 𝑑 t 𝑑 x = ( T 0 , T 1 - σ ) × Ω ( u σ + ) t ψ 𝑑 t 𝑑 x .

One thus gets

(3.4) ( T 0 , T 1 - σ ) × Ω ( u σ + ) t ψ 𝑑 t 𝑑 x + I × Ω F ( u ) , ψ σ - 𝑑 t 𝑑 x = 0

for every ψ C 0 ( ( T 0 , T 1 - σ ) × Ω ) .

Let j { 1 , , N } and φ C 0 ( ( T 0 , T 1 - σ ) × Ω ) . The map

( u σ + ) t ( t , x ) = u ( t + σ , x ) - u ( t , x ) σ

belongs to L loc p ( ( T 0 , T 1 - σ ) ; W loc 1 , p ( Ω ) ) and ( ( u σ + ) t ) x j = ( ( u x j ) σ + ) t . By derivation under the integral sign, one also has

( ( φ x j ) σ - ) = ( ( φ σ - ) ) x j .

We insert ψ = φ x j in equation (3.4). An integration by parts in the spatial variable leads to

( T 0 , T 1 - σ ) × Ω ( ( u x j ) σ + ) t φ 𝑑 t 𝑑 x + I × Ω ( F ( u ) ) x j , φ σ - 𝑑 t 𝑑 x = 0 .

Finally, using (2.2) in the second term, one gets

(3.5) ( T 0 , T 1 - σ ) × Ω ( ( u x j ) σ + ) t φ 𝑑 t 𝑑 x + ( T 0 , T 1 - σ ) × Ω ( ( F ( u ) ) x j ) σ + , φ 𝑑 t 𝑑 x = 0 .

We observe that, since F C 2 ( N ) and u L loc ( I × Ω ) L loc 2 ( I ; W loc 1 , 2 ( Ω ) ) , one has

F ( u ) L loc 2 ( ( 0 , T ) ; W loc 1 , 2 ( Ω ) ) .

We can thus appeal to a density argument to get that (3.5) remains true for every φ L 2 ( I ; W 1 , 2 ( Ω ) ) , with compact support in ( T 0 , T 1 - σ ) × Ω .

3.3 Caccioppoli-type inequalities

As explained in the introduction, the first technical tool in the proof of the Lipschitz bound of u is the following Caccioppoli inequality which provides a W 1 , 2 estimate on h ( u x j ) , where h is any smooth convex function.

Lemma 3.1 (Standard Caccioppoli inequality).

Let η C 0 ( Ω ) and χ C 0 ( ( T 0 , T 1 ] ) be two non-negative functions, with χ non-decreasing. Let h : R R be a C 1 convex non-negative function. Then, for almost every τ I and every j = 1 , , N , we have

(3.6)

χ ( τ ) { τ } × Ω h 2 ( u x j ) η 2 𝑑 x + ( T 0 , τ ) × Ω D 2 F ( u ) h ( u x j ) , h ( u x j ) χ η 2 𝑑 t 𝑑 x
( T 0 , τ ) × Ω χ η 2 h 2 ( u x j ) 𝑑 t 𝑑 x + 4 ( T 0 , τ ) × Ω D 2 F ( u ) η , η h 2 ( u x j ) χ 𝑑 t 𝑑 x .

Proof.

We first assume that h is a C 2 convex non-negative function. Let ζ C 0 ( I ) and η C 0 ( Ω ) . There exists 0 < σ 1 < 1 2 ( T 1 - T 0 ) such that ζ is compactly supported in ( T 0 + σ 1 , T 1 - σ 1 ) . Given 0 < σ < σ 1 , Lemma 2.1 and (3.3) imply that ( u x j ) σ + W loc 1 , 1 ( ( T 0 , T 1 - σ ) ; L loc 1 ( Ω ) ) L loc ( ( T 0 , T 1 - σ ) × Ω ) . Hence, the map h 2 ( ( u x j ) σ + ) belongs to W loc 1 , 1 ( ( T 0 , T 1 - σ ) ; L loc 1 ( Ω ) ) and we have

(3.7) 1 2 ( h 2 ( ( u x j ) σ + ) ) t = ( h h ) ( ( u x j ) σ + ) ( ( u x j ) σ + ) t .

We insert into (3.5) the test function

φ = ( h h ) ( ( u x j ) σ + ) ζ η 2 ,

which has compact support in ( T 0 + σ 1 , T 1 - σ 1 ) × Ω and belongs to L ( ( T 0 + σ 1 , T 1 - σ 1 ) × Ω ) L 2 ( ( T 0 + σ 1 , T 1 - σ 1 ) ; W 1 , 2 ( Ω ) ) . By (3.7),

( ( u x j ) σ + ) t φ = 1 2 ( h 2 ( ( u x j ) σ + ) ) t ζ η 2 .

We use the above identity to infer

1 2 ( T 0 + σ 1 , T 1 - σ 1 ) × Ω ( h 2 ( ( u x j ) σ + ) ) t ζ η 2 𝑑 t 𝑑 x + ( T 0 + σ 1 , T 1 - σ 1 ) × Ω ( ( F ( u ) ) x j ) σ + , φ 𝑑 t 𝑑 x = 0 .

We then perform an integration by parts with respect to the time variable in the first term

- 1 2 ( T 0 + σ 1 , T 1 - σ 1 ) × Ω h 2 ( ( u x j ) σ + ) ζ η 2 𝑑 t 𝑑 x + ( T 0 + σ 1 , T 1 - σ 1 ) × Ω ( ( F ( u ) ) x j ) σ + , φ 𝑑 t 𝑑 x = 0 .

We now want to take the limit as σ goes to 0. Let Ω 1 Ω such that η is compactly supported in Ω 1 . Since u x j L loc 2 ( I × Ω ) , we have

lim σ 0 + ( u x j ) σ + - u x j L 2 ( ( T 0 + σ 1 , T 1 - σ 1 ) × Ω 1 ) = 0 .

Moreover, we know that u x j L loc ( I × Ω ) which guarantees that there exists C 1 > 0 such that for every σ ( 0 , σ 1 ) ,

| ( u x j ) σ + | C 1 a.e. on  ( T 0 + σ 1 , T 1 - σ 1 ) × Ω 1 .

It then follows from the Dominated Convergence Theorem that

(3.8) lim σ 0 + - 1 2 ( T 0 + σ 1 , T 1 - σ 1 ) × Ω h 2 ( ( u x j ) σ + ) ζ η 2 𝑑 t 𝑑 x = - 1 2 I × Ω h 2 ( u x j ) ζ η 2 𝑑 t 𝑑 x .

Next, by recalling the choice of φ above, we have

φ = ( h h ) ( ( u x j ) σ + ) ( ( u x j ) σ + ) ζ η 2 + ( h h ) ( ( u x j ) σ + ) ζ ( η 2 ) .

By Lemma 2.1, we know that

( ( u x j ) σ + ) = ( u x j ) σ + .

This implies that ( ( u x j ) σ + ) converges to u x j in L 2 ( ( T 0 + σ 1 , T 1 - σ 1 ) × Ω ) . Hence, a similar argument to the one leading to (3.8) implies that

lim σ 0 + φ - ( ( h h ) ( u x j ) ζ η 2 ) L 2 ( ( T 0 + σ 1 , T 1 - σ 1 ) × Ω ) = 0 .

Finally, by using that ( F ( u ) ) x j L loc 2 ( I × Ω ) , we can infer that

lim σ 0 + ( ( F ( u ) ) x j ) σ + - ( F ( u ) ) x j L 2 ( ( T 0 + σ 1 , T 1 - σ 1 ) × Ω 1 ) = 0 .

It follows that

lim σ 0 + ( T 0 + σ 1 , T 1 - σ 1 ) × Ω ( ( F ( u ) ) x j ) σ + , φ 𝑑 t 𝑑 x = I × Ω ( F ( u ) ) x j , ( ( h h ) ( u x j ) ζ η 2 ) 𝑑 t 𝑑 x .

Up to now, we have thus proved

(3.9) - 1 2 I × Ω h 2 ( u x j ) ζ η 2 𝑑 t 𝑑 x + I × Ω ( F ( u ) ) x j , ( ( h h ) ( u x j ) ζ η 2 ) 𝑑 t 𝑑 x = 0 .

We now choose ζ as follows. Let χ C 0 ( ( T 0 , T 1 ] ) be as in the statement. Given τ I and δ > 0 such that T 0 < τ < τ + δ < T 1 , we define

χ ~ δ ( t ) := { 1 if  t τ , 1 - t - τ δ if  τ < t < τ + δ , 0 if  t τ + δ .

We then insert

(3.10) ζ ( t ) = χ ~ δ ( t ) χ ( t )

into (3.9). Then, for almost every τ I , we can let δ go to 0 and obtain

(3.11) χ ( τ ) 2 { τ } × Ω h 2 ( u x j ) η 2 𝑑 x + ( T 0 , τ ) × Ω ( F ( u ) ) x j , ( ( h h ) ( u x j ) χ η 2 ) 𝑑 t 𝑑 x = 1 2 ( T 0 , τ ) × Ω χ η 2 h 2 ( u x j ) 𝑑 t 𝑑 x .

Since χ does not depend on the spatial variable, we have

( F ( u ) ) x j , ( ( h h ) ( u x j ) χ η 2 ) = D 2 F ( u ) h ( u x j ) , h ( u x j ) χ η 2
+ D 2 F ( u ) u x j , u x j h ′′ ( u x j ) h ( u x j ) χ η 2
+ 2 D 2 F ( u ) u x j , η ( h h ) ( u x j ) χ η .

Since the second term is non-negative, by dropping it, we get from (3.11)

χ ( τ ) 2 { τ } × Ω h 2 ( u x j ) η 2 𝑑 x + ( T 0 , τ ) × Ω D 2 F ( u ) h ( u x j ) , h ( u x j ) χ η 2 𝑑 t 𝑑 x
1 2 ( T 0 , τ ) × Ω χ η 2 h 2 ( u x j ) 𝑑 t 𝑑 x - 2 ( T 0 , τ ) × Ω D 2 F ( u ) u x j , η ( h h ) ( u x j ) χ η 𝑑 t 𝑑 x .

In order to estimate the last term, we use the Cauchy–Schwarz inequality

| D 2 F ( u ) u x j , η | ( D 2 F ( u ) u x j , u x j ) 1 2 ( D 2 F ( u ) η , η ) 1 2 .

A further application of Young inequality leads to

| - 2 ( T 0 , τ ) × Ω D 2 F ( u ) u x j , η ( h h ) ( u x j ) χ η d t d x |
1 2 ( T 0 , τ ) × Ω D 2 F ( u ) u x j , u x j h ( u x j ) 2 χ η 2 𝑑 t 𝑑 x + 2 ( T 0 , τ ) × Ω D 2 F ( u ) η , η h 2 ( u x j ) χ 𝑑 t 𝑑 x .

In this way, the integral containing u x j can be absorbed on the left-hand side. Let us finally observe that we can remove the C 2 assumption on the function h, by a standard approximation argument. ∎

We next establish the key tool for the proof of our main result, namely a Caccioppoli-type inequality, where two different partial derivatives u x j and u x k come into play.

Lemma 3.2 (Weird Caccioppoli inequality).

Let η C 0 ( Ω ) and χ C 0 ( ( T 0 , T 1 ] ) be two non-negative functions, with χ non-decreasing. Let Φ : R + R and Ψ : R + R be two C 1 non-decreasing and non-negative convex functions. Then, for almost every τ I , every k , j = 1 , , N and every θ [ 0 , 1 ] , we have

χ ( τ ) { τ } × Ω Φ ( u x j 2 ) Ψ ( u x k 2 ) η 2 𝑑 x + ( T 0 , τ ) × Ω D 2 F ( u ) u x j , u x j Φ ( u x j 2 ) Ψ ( u x k 2 ) χ η 2 𝑑 t 𝑑 x
( T 0 , τ ) × Ω χ η 2 Φ ( u x j 2 ) Ψ ( u x k 2 ) 𝑑 t 𝑑 x
+ 4 ( T 0 , τ ) × Ω D 2 F ( u ) η , η ( u x j 2 Φ ( u x j 2 ) Ψ ( u x k 2 ) + u x k 2 Ψ ( u x k 2 ) Φ ( u x j 2 ) ) χ 𝑑 t 𝑑 x
+ 8 ( ( T 0 , τ ) × Ω D 2 F ( u ) u x j , u x j u x j 2 Φ ( u x j 2 ) 2 Ψ ( u x k 2 ) θ χ η 2 𝑑 t 𝑑 x ) 1 2
× ( 1 4 ( T 0 , τ ) × Ω χ η 2 | u x k | 2 θ Ψ ( u x k 2 ) 2 - θ d t d x
+ ( T 0 , τ ) × Ω D 2 F ( u ) η , η | u x k | 2 θ Ψ ( u x k 2 ) 2 - θ χ d t d x ) 1 2 .

Proof.

It is convenient to divide the proof into two steps.

Step 1: An identity involving u x j and u x k . We first assume that Φ and Ψ are two C 2 non-decreasing and non-negative convex functions. We fix k , j { 1 , , N } . Given 0 < σ 1 < 1 2 ( T 1 - T 0 ) and ζ C 0 ( T 0 + σ 1 , T 1 - σ 1 ) , we consider (3.5) with the index j and for every 0 < σ < σ 1 , we insert the test function

φ = ( u σ + ) x j Φ ( ( ( u σ + ) x j ) 2 ) Ψ ( ( ( u σ + ) x k ) 2 ) ζ η 2 .

Symmetrically, we consider (3.5) with the index k and insert the test function

φ ~ = ( u σ + ) x k Ψ ( ( ( u σ + ) x k ) 2 ) Φ ( ( ( u σ + ) x j ) 2 ) ζ η 2 .

The functions φ and φ ~ are compactly supported in ( T 0 + σ 1 , T 1 - σ 1 ) × Ω and belong to L ( ( T 0 + σ 1 , T 1 - σ 1 ) × Ω ) L 2 ( ( T 0 + σ 1 , T 1 - σ 1 ) ; W 1 , 2 ( Ω ) ) . Thus they are admissible test functions. We observe that

( ( u σ + ) x j ) t φ = 1 2 ( Φ ( ( ( u σ + ) x j ) 2 ) ) t Ψ ( ( ( u σ + ) x k ) 2 ) ζ η 2 ,

and similarly

( ( u σ + ) x k ) t φ ~ = 1 2 ( Ψ ( ( ( u σ + ) x k ) 2 ) ) t Φ ( ( ( u σ + ) x j ) 2 ) ζ η 2 .

Thus we obtain

(3.12) ( ( u σ + ) x j ) t φ + ( ( u σ + ) x k ) t φ ~ = 1 2 [ Φ ( ( ( u σ + ) x j ) 2 ) Ψ ( ( ( u σ + ) x k ) 2 ) ] t ζ η 2 .

By summing the two equations obtained from (3.5) as described above, and using the identity (3.12), we get

1 2 ( T 0 + σ 1 , T 1 - σ 1 ) × Ω [ Φ ( ( ( u σ + ) x j ) 2 ) Ψ ( ( ( u σ + ) x k ) 2 ) ] t ζ η 2 𝑑 t 𝑑 x + ( T 0 + σ 1 , T 1 - σ 1 ) × Ω ( ( F ( u ) ) x j ) σ + , φ 𝑑 t 𝑑 x
+ ( T 0 + σ 1 , T 1 - σ 1 ) × Ω ( ( F ( u ) ) x k ) σ + , φ ~ 𝑑 t 𝑑 x = 0 .

We then perform an integration by parts in the time variable in the first integral, which gives

- 1 2 ( T 0 + σ 1 , T 1 - σ 1 ) × Ω Φ ( ( ( u σ + ) x j ) 2 ) Ψ ( ( ( u σ + ) x k ) 2 ) ζ η 2 𝑑 t 𝑑 x + ( T 0 + σ 1 , T 1 - σ 1 ) × Ω ( ( F ( u ) ) x j ) σ + , φ 𝑑 t 𝑑 x
+ ( T 0 + σ 1 , T 1 - σ 1 ) × Ω ( ( F ( u ) ) x k ) σ + , φ ~ 𝑑 t 𝑑 x = 0 .

Finally, we let σ go to 0. In the same vein as in the proof of (3.9), the Dominated Convergence Theorem implies that

(3.13) - 1 2 I × Ω Φ ( ( u x j ) 2 ) Ψ ( ( u x k ) 2 ) ζ η 2 𝑑 t 𝑑 x + I × Ω ( F ( u ) ) x j , φ 0 𝑑 t 𝑑 x + I × Ω ( F ( u ) ) x k , φ ~ 0 𝑑 t 𝑑 x = 0 ,

where

φ 0 = u x j Φ ( u x j 2 ) Ψ ( u x k 2 ) ζ η 2 and φ ~ 0 = u x k Ψ ( u x k 2 ) Φ ( u x j 2 ) ζ η 2 .

We now choose ζ as in (3.10) and insert it in (3.13). By letting δ go to 0, we obtain for almost every τ I

(3.14)

χ ( τ ) 2 { τ } × Ω Φ ( u x j 2 ) Ψ ( u x k 2 ) η 2 𝑑 x + ( T 0 , τ ) × Ω ( F ( u ) ) x j , ψ 0 𝑑 t 𝑑 x + ( T 0 , τ ) × Ω ( F ( u ) ) x k , ψ ~ 0 𝑑 t 𝑑 x
= 1 2 ( T 0 , τ ) × Ω χ η 2 Φ ( u x j 2 ) Ψ ( u x k 2 ) 𝑑 t 𝑑 x ,

where ψ 0 and ψ ~ 0 are defined as φ 0 and φ ~ 0 , except that ζ is now replaced by χ.

Step 2: Completion of the proof. We first observe that

( F ( u ) ) x j , ψ 0 = D 2 F ( u ) u x j , ψ 0 , ( F ( u ) ) x k , ψ ~ 0 = D 2 F ( u ) u x k , ψ ~ 0 .

Taking into account the definition of ψ 0 and ψ ~ 0 , we thus get from (3.14)

(3.15)

χ ( τ ) 2 { τ } × Ω Φ ( u x j 2 ) Ψ ( u x k 2 ) η 2 𝑑 x + ( T 0 , τ ) × Ω D 2 F ( u ) u x j , u x j ( Φ ( u x j 2 ) + 2 u x j 2 Φ ′′ ( u x j 2 ) ) Ψ ( u x k 2 ) χ η 2 𝑑 t 𝑑 x
+ ( T 0 , τ ) × Ω D 2 F ( u ) u x k , u x k ( Ψ ( u x k 2 ) + 2 u x k 2 Ψ ′′ ( u x k 2 ) ) Φ ( u x j 2 ) χ η 2 𝑑 t 𝑑 x
= 1 2 ( T 0 , τ ) × Ω χ η 2 Φ ( u x j 2 ) Ψ ( u x k 2 ) 𝑑 t 𝑑 x
- 4 ( T 0 , τ ) × Ω D 2 F ( u ) u x j , u x k u x j Φ ( u x j 2 ) u x k Ψ ( u x k 2 ) χ η 2 𝑑 t 𝑑 x
- 2 ( T 0 , τ ) × Ω D 2 F ( u ) u x j , η u x j Φ ( u x j 2 ) Ψ ( u x k 2 ) χ η 𝑑 t 𝑑 x
- 2 ( T 0 , τ ) × Ω D 2 F ( u ) u x k , η u x k Φ ( u x j 2 ) Ψ ( u x k 2 ) χ η 𝑑 t 𝑑 x .

We first estimate the two last terms. We use the Cauchy–Schwarz inequality

| D 2 F ( u ) u x j , η | ( D 2 F ( u ) u x j , u x j ) 1 2 ( D 2 F ( u ) η , η ) 1 2 .

By the Young inequality, this implies

| - 2 ( T 0 , τ ) × Ω D 2 F ( u ) u x j , η u x j Φ ( u x j 2 ) Ψ ( u x k 2 ) χ η d t d x |
1 2 ( T 0 , τ ) × Ω D 2 F ( u ) u x j , u x j Φ ( u x j 2 ) Ψ ( u x k 2 ) χ η 2 𝑑 t 𝑑 x
+ 2 ( T 0 , τ ) × Ω D 2 F ( u ) η , η u x j 2 Φ ( u x j 2 ) Ψ ( u x k 2 ) χ 𝑑 t 𝑑 x .

A similar inequality holds true for the last term in (3.15). Hence,

(3.16)

χ ( τ ) 2 { τ } × Ω Φ ( u x j 2 ) Ψ ( u x k 2 ) η 2 𝑑 x
+ ( T 0 , τ ) × Ω D 2 F ( u ) u x j , u x j ( 1 2 Φ ( u x j 2 ) + 2 u x j 2 Φ ′′ ( u x j 2 ) ) Ψ ( u x k 2 ) χ η 2 𝑑 t 𝑑 x
+ ( T 0 , τ ) × Ω D 2 F ( u ) u x k , u x k ( 1 2 Ψ ( u x k 2 ) + 2 u x k 2 Ψ ′′ ( u x k 2 ) ) Φ ( u x j 2 ) χ η 2 𝑑 t 𝑑 x
1 2 ( T 0 , τ ) × Ω χ η 2 Φ ( u x j 2 ) Ψ ( u x k 2 ) 𝑑 t 𝑑 x
- 4 ( T 0 , τ ) × Ω D 2 F ( u ) u x j , u x k u x j Φ ( u x j 2 ) u x k Ψ ( u x k 2 ) χ η 2 𝑑 t 𝑑 x
+ 2 ( T 0 , τ ) × Ω D 2 F ( u ) η , η ( u x j 2 Φ ( u x j 2 ) Ψ ( u x k 2 ) + u x k 2 Ψ ( u x k 2 ) Φ ( u x j 2 ) ) χ 𝑑 t 𝑑 x .

On the left-hand side of (3.16), in the second term, we drop 2 u x j 2 Φ ′′ ( u x j 2 ) which is non-negative. We also drop the whole last term of the left-hand side for the same reason. This yields

(3.17)

χ ( τ ) 2 { τ } × Ω Φ ( u x j 2 ) Ψ ( u x k 2 ) η 2 𝑑 x + 1 2 ( T 0 , τ ) × Ω D 2 F ( u ) u x j , u x j Φ ( u x j 2 ) Ψ ( u x k 2 ) χ η 2 𝑑 t 𝑑 x
1 2 ( T 0 , τ ) × Ω χ η 2 Φ ( u x j 2 ) Ψ ( u x k 2 ) 𝑑 t 𝑑 x
- 4 ( T 0 , τ ) × Ω D 2 F ( u ) u x j , u x k u x j Φ ( u x j 2 ) u x k Ψ ( u x k 2 ) χ η 2 𝑑 t 𝑑 x
+ 2 ( T 0 , τ ) × Ω D 2 F ( u ) η , η ( u x j 2 Φ ( u x j 2 ) Ψ ( u x k 2 ) + u x k 2 Ψ ( u x k 2 ) Φ ( u x j 2 ) ) χ 𝑑 t 𝑑 x .

We next estimate the second term of the right-hand side of (3.17), that we denote by

𝒜 = ( T 0 , τ ) × Ω D 2 F ( u ) u x j , u x k u x j Φ ( u x j 2 ) u x k Ψ ( u x k 2 ) χ η 2 𝑑 t 𝑑 x .

We first use the Cauchy–Schwarz inequality to get

| D 2 F ( u ) u x j , u x k | ( D 2 F ( u ) u x j , u x j ) 1 2 ( D 2 F ( u ) u x k , u x k ) 1 2 .

We then introduce a parameter θ [ 0 , 1 ] . By writing Ψ ( u x k 2 ) = Ψ ( u x k 2 ) θ 2 Ψ ( u x k 2 ) 1 - θ 2 , one gets by using the Cauchy–Schwarz inequality again

(3.18)

𝒜 ( ( T 0 , τ ) × Ω D 2 F ( u ) u x j , u x j u x j 2 Φ ( u x j 2 ) 2 Ψ ( u x k 2 ) θ χ η 2 𝑑 t 𝑑 x ) 1 2
× ( ( T 0 , τ ) × Ω D 2 F ( u ) u x k , u x k Ψ ( u x k 2 ) 2 - θ u x k 2 χ η 2 𝑑 t 𝑑 x ) 1 2 .

We define

G ( t ) = 0 t 2 Ψ ( s ) 1 - θ 2 𝑑 s .

Then G is a C 1 non-negative convex function. Moreover, by its definition

G ( u x k ) = 2 u x k Ψ ( u x k 2 ) 1 - θ 2 u x k .

Hence, by the standard Caccioppoli inequality (3.6) with h = G and k in place of j, this yields

( T 0 , τ ) × Ω D 2 F ( u ) u x k , u x k Ψ ( u x k 2 ) 2 - θ u x k 2 χ η 2 𝑑 t 𝑑 x
= 1 4 ( T 0 , τ ) × Ω D 2 F ( u ) G ( u x k ) , G ( u x k ) χ η 2 𝑑 t 𝑑 x
1 4 ( T 0 , τ ) × Ω χ η 2 G 2 ( u x k ) 𝑑 t 𝑑 x + ( T 0 , τ ) × Ω D 2 F ( u ) η , η G 2 ( u x k ) χ 𝑑 t 𝑑 x .

By using the Jensen inequality with the concave function y y 1 - θ 2 , we obtain

0 G ( u x k ) | u x k | θ Ψ ( u x k 2 ) 1 - θ 2 .

This implies

( T 0 , τ ) × Ω D 2 F ( u ) u x k , u x k Ψ ( u x k 2 ) 2 - θ u x k 2 χ η 2 𝑑 t 𝑑 x
1 4 ( T 0 , τ ) × Ω χ η 2 | u x k | 2 θ Ψ ( u x k 2 ) 2 - θ 𝑑 t 𝑑 x + ( T 0 , τ ) × Ω D 2 F ( u ) η , η | u x k | 2 θ Ψ ( u x k 2 ) 2 - θ χ 𝑑 t 𝑑 x .

Coming back to (3.18), it follows that

𝒜 ( ( T 0 , τ ) × Ω D 2 F ( u ) u x j , u x j u x j 2 Φ ( u x j 2 ) 2 Ψ ( u x k 2 ) θ χ η 2 𝑑 t 𝑑 x ) 1 2
× ( 1 4 ( T 0 , τ ) × Ω χ η 2 | u x k | 2 θ Ψ ( u x k 2 ) 2 - θ 𝑑 t 𝑑 x + ( T 0 , τ ) × Ω D 2 F ( u ) η , η | u x k | 2 θ Ψ ( u x k 2 ) 2 - θ χ 𝑑 t 𝑑 x ) 1 2 .

Together with (3.17), this yields the desired inequality. Finally, the C 2 assumption on Φ and Ψ can be removed by a standard approximation argument. ∎

4 Uniform Lipschitz estimate for the regularized equation

This section is devoted to the proof of the following uniform estimate. For simplicity, we will work with anisotropic parabolic cubes of the form

Q R ( t 0 , x 0 ) = ( t 0 - R p , t 0 ) × ( x 0 - R , x 0 + R ) N .

As in the previous section, we drop the index ε and simply write u and F, in place of u ε and F ε .

Proposition 4.1.

There exist constants α = α ( N ) > 2 and C = C ( N , p ) > 0 such that for every ε > 0 and for every Q r ( x 0 , t 0 ) Q R ( t 0 , x 0 ) I × Ω with R 1 , one has

(4.1) u ε L ( Q r ( x 0 , t 0 ) ) C ( R - r ) α p [ ( Q R ( t 0 , x 0 ) | u ε | p 𝑑 t 𝑑 x ) 1 2 + 1 ] .

Proof.

We will limit ourselves for simplicity to the case N 3 . This allows to use the Sobolev inequality valid for every f W 0 1 , 2 ( Ω )

f L 2 * ( Ω ) C N f L 2 ( Ω ) with  2 * = 2 N N - 2 .

Here C N is a constant which depends only on N. The case N = 2 follows with minor modifications and we omit the details.

The proof is quite involved and for ease of readability, we divide it into several steps.

Step 1: The choices of Φ and Ψ. We apply Lemma 3.2 with the following choices:

Φ ( t ) = t s and Ψ ( t ) = t m for  t 0 ,

with 1 s m . We also take

θ = { m - s m - 1 [ 0 , 1 ] if  m > 1 , 1 if  m = 1 .

This gives

χ ( τ ) { τ } × Ω | u x j | 2 s | u x k | 2 m η 2 𝑑 x + s ( T 0 , τ ) × Ω D 2 F ( u ) u x j , u x j | u x j | 2 s - 2 | u x k | 2 m χ η 2 𝑑 t 𝑑 x
( T 0 , τ ) × Ω χ η 2 | u x j | 2 s | u x k | 2 m 𝑑 t 𝑑 x + 4 ( T 0 , τ ) × Ω D 2 F ( u ) η , η ( s | u x j | 2 s | u x k | 2 m + m | u x k | 2 m | u x j | 2 s ) χ 𝑑 t 𝑑 x
+ 8 s m θ 2 ( ( T 0 , τ ) × Ω D 2 F ( u ) u x j , u x j | u x j | 4 s - 2 | u x k | 2 m - 2 s χ η 2 𝑑 t 𝑑 x ) 1 2
× ( 1 4 ( T 0 , τ ) × Ω χ η 2 | u x k | 2 ( s + m ) 𝑑 t 𝑑 x + ( T 0 , τ ) × Ω D 2 F ( u ) η , η | u x k | 2 ( s + m ) χ 𝑑 t 𝑑 x ) 1 2 .

On the product of the two last integrals, we use the Young inequality in the form

a b a 2 + b 2 4 .

This gives

χ ( τ ) { τ } × Ω | u x j | 2 s | u x k | 2 m η 2 𝑑 x + s ( T 0 , τ ) × Ω D 2 F ( u ) u x j , u x j | u x j | 2 s - 2 | u x k | 2 m χ η 2 𝑑 t 𝑑 x
( T 0 , τ ) × Ω χ η 2 | u x j | 2 s | u x k | 2 m 𝑑 t 𝑑 x + 4 ( s + m ) ( T 0 , τ ) × Ω D 2 F ( u ) η , η | u x j | 2 s | u x k | 2 m χ 𝑑 t 𝑑 x
    + 16 s 2 m θ ( 1 4 ( T 0 , τ ) × Ω χ η 2 | u x k | 2 ( s + m ) 𝑑 t 𝑑 x + ( T 0 , τ ) × Ω D 2 F ( u ) η , η | u x k | 2 ( s + m ) χ 𝑑 t 𝑑 x )
    + ( T 0 , τ ) × Ω D 2 F ( u ) u x j , u x j | u x j | 4 s - 2 | u x k | 2 m - 2 s χ η 2 𝑑 t 𝑑 x .

By the Young inequality again, we can estimate

| u x j | 2 s | u x k | 2 m | u x j | 2 m + 2 s + | u x k | 2 m + 2 s .

Using that s 1 on the left-hand side and m θ m on the right-hand side, we thus obtain

χ ( τ ) { τ } × Ω | u x j | 2 s | u x k | 2 m η 2 𝑑 x + ( T 0 , τ ) × Ω D 2 F ( u ) u x j , u x j | u x j | 2 s - 2 | u x k | 2 m χ η 2 𝑑 t 𝑑 x
( T 0 , τ ) × Ω χ η 2 ( | u x j | 2 ( s + m ) + | u x k | 2 ( s + m ) ) 𝑑 t 𝑑 x
    + 4 ( s + m ) ( T 0 , τ ) × Ω D 2 F ( u ) η , η ( | u x j | 2 ( s + m ) + | u x k | 2 ( s + m ) ) χ 𝑑 t 𝑑 x
    + 16 s 2 m ( 1 4 ( T 0 , τ ) × Ω χ η 2 | u x k | 2 ( s + m ) 𝑑 t 𝑑 x + ( T 0 , τ ) × Ω D 2 F ( u ) η , η | u x k | 2 ( s + m ) χ 𝑑 t 𝑑 x )
    + ( T 0 , τ ) × Ω D 2 F ( u ) u x j , u x j | u x j | 4 s - 2 | u x k | 2 m - 2 s χ η 2 𝑑 t 𝑑 x .

This finally implies

(4.2)

χ ( τ ) { τ } × Ω | u x j | 2 s | u x k | 2 m η 2 𝑑 x + ( T 0 , τ ) × Ω D 2 F ( u ) u x j , u x j | u x j | 2 s - 2 | u x k | 2 m χ η 2 𝑑 t 𝑑 x
16 ( s + m + s 2 m ) ( T 0 , τ ) × Ω ( χ η 2 + χ D 2 F ( u ) η , η ) ( | u x j | 2 ( s + m ) + | u x k | 2 ( s + m ) ) 𝑑 t 𝑑 x
+ ( T 0 , τ ) × Ω D 2 F ( u ) u x j , u x j | u x j | 4 s - 2 | u x k | 2 m - 2 s χ η 2 𝑑 t 𝑑 x .

Step 2: The staircase. Let 0 { 0 } and set q = 2 0 - 1 . We define the two families of indices

s = 2 and m = q + 1 - 2 for  { 0 , , 0 } .

We observe that by construction, for every 0 0 - 1 ,

s + m = q + 1 , 4 s - 2 = 2 s + 1 - 2 and 2 m - 2 s = 2 m + 1 .

We also use that s + m + s 2 m 2 ( q + 1 ) 3 . Then the above inequality (4.2) written for s = s and m = m with 0 0 - 1 gives

χ ( τ ) { τ } × Ω | u x j | 2 s | u x k | 2 m η 2 𝑑 x + ( T 0 , τ ) × Ω D 2 F ( u ) u x j , u x j | u x j | 2 s - 2 | u x k | 2 m χ η 2 𝑑 t 𝑑 x
32 ( q + 1 ) 3 ( T 0 , τ ) × Ω ( χ η 2 + χ D 2 F ( u ) η , η ) ( | u x j | 2 ( q + 1 ) + | u x k | 2 ( q + 1 ) ) 𝑑 t 𝑑 x
+ ( T 0 , τ ) × Ω D 2 F ( u ) u x j , u x j | u x j | 2 s + 1 - 2 | u x k | 2 m + 1 χ η 2 𝑑 t 𝑑 x .

Observe that we used that 2 s + 2 m = 2 ( q + 1 ) on the first term on the right-hand side. By summing from = 0 up to = 0 - 1 , one gets

χ ( τ ) = 0 0 - 1 { τ } × Ω | u x j | 2 s | u x k | 2 m η 2 𝑑 x + ( T 0 , τ ) × Ω D 2 F ( u ) u x j , u x j | u x k | 2 q χ η 2 𝑑 t 𝑑 x
C q 3 0 ( T 0 , τ ) × Ω ( χ η 2 + χ D 2 F ( u ) η , η ) ( | u x j | 2 ( q + 1 ) + | u x k | 2 ( q + 1 ) ) 𝑑 t 𝑑 x
    + ( T 0 , τ ) × Ω D 2 F ( u ) u x j , u x j | u x j | 2 q χ η 2 𝑑 t 𝑑 x .

For the last term, we apply Lemma 3.1 with the choice

h ( t ) = | t | q + 1 q + 1 , t .

We thus get

χ ( τ ) = 0 0 - 1 { τ } × Ω | u x j | 2 s | u x k | 2 m η 2 𝑑 x + ( T 0 , τ ) × Ω D 2 F ( u ) u x j , u x j | u x k | 2 q χ η 2 𝑑 t 𝑑 x
C q 4 ( T 0 , τ ) × Ω ( χ η 2 + χ D 2 F ( u ) η , η ) ( | u x j | 2 ( q + 1 ) + | u x k | 2 ( q + 1 ) ) 𝑑 t 𝑑 x
+ 1 ( q + 1 ) 2 ( T 0 , τ ) × Ω χ η 2 | u x j | 2 ( q + 1 ) 𝑑 t 𝑑 x + 4 ( q + 1 ) 2 ( T 0 , τ ) × Ω D 2 F ( u ) η , η | u x j | 2 ( q + 1 ) χ 𝑑 t 𝑑 x .

This in turn implies that

(4.3)

χ ( τ ) = 0 0 - 1 { τ } × Ω | u x j | 2 s | u x k | 2 m η 2 𝑑 x + ( T 0 , τ ) × Ω D 2 F ( u ) u x j , u x j | u x k | 2 q χ η 2 𝑑 t 𝑑 x
C q 4 ( T 0 , τ ) × Ω ( χ η 2 + χ D 2 F ( u ) η , η ) ( | u x j | 2 ( q + 1 ) + | u x k | 2 ( q + 1 ) ) 𝑑 t 𝑑 x ,

possibly for a different constant C > 0 .

Step 3: Weak ellipticity and boundedness of D 2 F . We now use the explicit expression of F. We recall that

F ( ξ ) = 1 p i = 1 N | ξ i | p + ε G ( ξ ) for every  ξ N ,

where

G ( ξ ) = 1 p ( 1 + | ξ | 2 ) p 2 for every  ξ N .

It follows that for every ξ , λ N , we have

( p - 1 ) i = 1 N | ξ i | p - 2 λ i 2 D 2 F ( ξ ) λ , λ C ( | ξ | p - 2 + 1 ) | λ | 2 .

By inserting these estimates into (4.3), one gets

χ ( τ ) = 0 0 - 1 { τ } × Ω | u x j | 2 s | u x k | 2 m η 2 𝑑 x + ( p - 1 ) i = 1 N ( T 0 , τ ) × Ω | u x i | p - 2 u x i x j 2 | u x k | 2 q χ η 2 𝑑 t 𝑑 x
C q 4 ( T 0 , τ ) × Ω ( χ η 2 + χ ( | u | p - 2 + 1 ) | η | 2 ) ( | u x j | 2 ( q + 1 ) + | u x k | 2 ( q + 1 ) ) 𝑑 t 𝑑 x .

We consider the second term on the left-hand side: observe that by keeping in the sum only the term with i = k and dropping the others, we get

i = 1 N ( T 0 , τ ) × Ω | u x i | p - 2 u x i x j 2 | u x k | 2 q χ η 2 𝑑 t 𝑑 x ( T 0 , τ ) × Ω | u x k | p - 2 u x k x j 2 | u x k | 2 q χ η 2 𝑑 t 𝑑 x
= 1 ( q + p 2 ) 2 ( T 0 , τ ) × Ω | ( | u x k | q + p - 2 2 u x k ) x j | 2 χ η 2 𝑑 t 𝑑 x .

When we sum over j = 1 , , N the resulting estimate, we thus get

χ ( τ ) = 0 0 - 1 { τ } × Ω j = 1 N | u x j | 2 s | u x k | 2 m η 2 d x + p - 1 ( q + p 2 ) 2 ( T 0 , τ ) × Ω | ( | u x k | q + p - 2 2 u x k ) | 2 χ η 2 𝑑 t 𝑑 x
C q 4 ( T 0 , τ ) × Ω ( χ η 2 + χ ( | u | p - 2 + 1 ) | η | 2 ) ( j = 1 N | u x j | 2 ( q + 1 ) + N | u x k | 2 ( q + 1 ) ) 𝑑 t 𝑑 x ,

which in turn implies

χ ( τ ) = 0 0 - 1 { τ } × Ω j = 1 N | u x j | 2 s | u x k | 2 m η 2 d x + ( T 0 , τ ) × Ω | ( | u x k | q + p - 2 2 u x k ) | 2 χ η 2 𝑑 t 𝑑 x
C q 6 ( T 0 , τ ) × Ω ( χ η 2 + χ ( | u | p - 2 + 1 ) | η | 2 ) | u | 2 ( q + 1 ) 𝑑 t 𝑑 x ,

up to redefine the constant C > 0 . We now add the term

( T 0 , τ ) × Ω | u x k | 2 q + p χ | η | 2 𝑑 t 𝑑 x

on both sides of the above inequality. With some algebraic manipulations, this gives

χ ( τ ) = 0 0 - 1 { τ } × Ω j = 1 N | u x j | 2 s | u x k | 2 m η 2 d x + ( T 0 , τ ) × Ω | ( | u x k | q + p - 2 2 u x k η ) | 2 χ 𝑑 t 𝑑 x
C q 6 ( T 0 , τ ) × Ω ( χ η 2 + χ ( | u | p - 2 + 1 ) | η | 2 ) | u | 2 ( q + 1 ) 𝑑 t 𝑑 x + ( T 0 , τ ) × Ω | u x k | 2 q + p χ | η | 2 𝑑 t 𝑑 x
C q 6 ( T 0 , τ ) × Ω ( χ η 2 + χ ( | u | p - 2 + 1 ) | η | 2 ) | u | 2 ( q + 1 ) 𝑑 t 𝑑 x .

By using the Sobolev inequality in the spatial variable for the second term of the left-hand side, one obtains

χ ( τ ) = 0 0 - 1 { τ } × Ω j = 1 N | u x j | 2 s | u x k | 2 m η 2 d x + T 0 τ χ ( Ω ( | u x k | 2 q + p η 2 ) 2 * 2 𝑑 x ) 2 2 * 𝑑 t
C q 6 ( T 0 , τ ) × Ω ( χ η 2 + χ ( | u | p - 2 + 1 ) | η | 2 ) | u | 2 ( q + 1 ) 𝑑 t 𝑑 x .

We now finally sum over k = 1 , , N and use the Minkowski inequality for the second term of the left-hand side. This gives

χ ( τ ) = 0 0 - 1 { τ } × Ω j = 1 N | u x j | 2 s k = 1 N | u x k | 2 m η 2 d x + T 0 τ χ ( Ω ( k = 1 N | u x k | 2 q + p η 2 ) 2 * 2 𝑑 x ) 2 2 * 𝑑 t
C q 6 ( T 0 , τ ) × Ω ( χ η 2 + χ ( | u | p - 2 + 1 ) | η | 2 ) | u | 2 ( q + 1 ) 𝑑 t 𝑑 x .

We introduce the expedient function

𝒰 = max 1 k N | u x k | .

We observe that for every s 0 , one has

𝒰 s k = 1 N | u x k | s N 𝒰 s .

In particular for s = 2 , we get 𝒰 | u | N 𝒰 . Hence, we get

(4.4)

χ ( τ ) { τ } × Ω 𝒰 2 ( q + 1 ) η 2 𝑑 x + T 0 τ χ ( Ω 𝒰 ( 2 q + p ) 2 * 2 η 2 * 𝑑 x ) 2 2 * 𝑑 t
C q 6 ( T 0 , τ ) × Ω ( χ η 2 + χ ( 𝒰 p - 2 + 1 ) | η | 2 ) 𝒰 2 ( q + 1 ) 𝑑 t 𝑑 x
C q 6 ( T 0 , τ ) × Ω ( χ η 2 + χ | η | 2 ) ( 1 + 𝒰 2 q + p ) 𝑑 t 𝑑 x .

Step 4: Choice of the cut-off functions. Let ( t 0 , x 0 ) I × Ω and 0 < r < R 1 such that the cube

Q R ( x 0 ) = ( x 0 - R , x 0 + R ) N

is compactly contained in Ω. We further require that

( t 0 - R p , t 0 ) I ,

so that we must have T 0 < t 0 < T 1 and R p < t 0 - T 0 . Let χ : [ T 0 , T 1 ] be a non-decreasing Lipschitz function such that

χ ( t ) = 0  on  [ T 0 , t 0 - R p ] , χ ( t ) = 1  on  [ t 0 - r p , t 0 ] and | χ ( t ) | C ( R - r ) p .

Let η C 0 ( Q R ( x 0 ) ) be such that

0 η 1 , η = 1  on  Q r ( x 0 ) and | η | C R - r .

We recall the notation for the anisotropic parabolic cube

Q ρ ( t 0 , x 0 ) = ( t 0 - ρ p , t 0 ) × Q ρ ( x 0 ) .

With such a choice of χ and η, we use (4.4) twice:

  1. firstly, by dropping the second term on the left-hand side, and taking the supremum in τ over the interval ( t 0 - r p , t 0 ) ,

  2. secondly, by dropping the first term on the left-hand side and taking τ = t 0 .

By summing up the two resulting contributions, this gives

(4.5) sup τ ( t 0 - r p , t 0 ) { τ } × Q r ( x 0 ) 𝒰 2 ( q + 1 ) 𝑑 x + t 0 - r p t 0 ( Q r ( x 0 ) 𝒰 ( 2 q + p ) 2 * 2 𝑑 x ) 2 2 * 𝑑 t C q 6 ( R - r ) p Q R ( t 0 , x 0 ) ( 1 + 𝒰 2 q + p ) 𝑑 t 𝑑 x .

Observe that we also used that ( R - r ) p ( R - r ) 2 , since R 1 and p > 2 . By the Hölder inequality, one has

Q r ( t 0 , x 0 ) 𝒰 2 q + p + 4 ( q + 1 ) N 𝑑 t 𝑑 x t 0 - r p t 0 ( Q r ( x 0 ) 𝒰 ( 2 q + p ) 2 * 2 𝑑 x ) 2 2 * ( Q r ( x 0 ) 𝒰 2 ( q + 1 ) 𝑑 x ) 2 N 𝑑 t
( sup τ ( t 0 - r p , t 0 ) { τ } × Q r ( x 0 ) 𝒰 2 ( q + 1 ) 𝑑 x ) 2 N t 0 - r p t 0 ( Q r ( x 0 ) 𝒰 ( 2 q + p ) 2 * 2 𝑑 x ) 2 2 * 𝑑 t .

Using (4.5), this implies

(4.6) Q r ( t 0 , x 0 ) 𝒰 2 q + p + 4 ( q + 1 ) N 𝑑 t 𝑑 x ( C q 6 ( R - r ) p Q R ( t 0 , x 0 ) ( 1 + 𝒰 2 q + p ) 𝑑 t 𝑑 x ) 2 N + 1 .

Step 5: The local L estimate on u . Take q = q j = 2 j + 1 - 1 with j . We set

γ j := 2 q j + p = 2 j + 2 - 2 + p , δ j := 2 q j + p + 4 N ( q j + 1 ) = 2 j + 2 - 2 + p + 4 N  2 j + 1

and

τ j = δ j - γ j δ j - γ j - 1 γ j - 1 γ j .

We observe that γ j - 1 < γ j < δ j and τ j ( 0 , 1 ) is defined in a such a way that

1 γ j = τ j γ j - 1 + 1 - τ j δ j .

Estimate (4.6) can be rewritten as

Q r ( t 0 , x 0 ) 𝒰 δ j 𝑑 t 𝑑 x ( C q j 6 ( R - r ) p Q R ( t 0 , x 0 ) ( 1 + 𝒰 γ j ) 𝑑 t 𝑑 x ) 2 N + 1 .

By interpolation in L p spaces, we obtain

Q r ( t 0 , x 0 ) 𝒰 γ j 𝑑 t 𝑑 x ( Q r ( t 0 , x 0 ) 𝒰 γ j - 1 𝑑 t 𝑑 x ) τ j γ j γ j - 1 ( Q r ( t 0 , x 0 ) 𝒰 δ j 𝑑 t 𝑑 x ) ( 1 - τ j ) γ j δ j .

By lengthy but elementary computations, we see that

( 1 - τ j ) γ j δ j = N N + 4 and τ j γ j γ j - 1 = 4 N + 4 ,

thus the combination of the two previous inequalities leads to

Q r ( t 0 , x 0 ) 𝒰 γ j 𝑑 t 𝑑 x ( Q r ( t 0 , x 0 ) 𝒰 γ j - 1 𝑑 t 𝑑 x ) 4 N + 4 ( C q j 6 ( R - r ) p Q R ( t 0 , x 0 ) ( 1 + 𝒰 γ j ) 𝑑 t 𝑑 x ) N + 2 N + 4 .

We now use the Young inequality

( Q R ( t 0 , x 0 ) 𝒰 γ j - 1 𝑑 t 𝑑 x ) 4 N + 4 ( C q j 6 ( R - r ) p Q R ( t 0 , x 0 ) ( 1 + 𝒰 γ j ) 𝑑 t 𝑑 x ) N + 2 N + 4
N + 2 N + 4 Q R ( t 0 , x 0 ) ( 1 + 𝒰 γ j ) 𝑑 t 𝑑 x + 2 N + 4 ( C q j 6 ( R - r ) p ) N + 2 2 ( Q R ( t 0 , x 0 ) 𝒰 γ j - 1 𝑑 t 𝑑 x ) 2 .

Thus we have proved the estimate

(4.7) Q r ( t 0 , x 0 ) 𝒰 γ j 𝑑 t 𝑑 x N + 2 N + 4 Q R ( t 0 , x 0 ) ( 1 + 𝒰 γ j ) 𝑑 t 𝑑 x + 2 N + 4 ( C q j 6 ( R - r ) p ) N + 2 2 ( Q R ( t 0 , x 0 ) 𝒰 γ j - 1 𝑑 t 𝑑 x ) 2 .

Next, we observe that

| Q r ( t 0 , x 0 ) | | Q R ( t 0 , x 0 ) | 2 C ( r R ) N + p 1 R N + p C ( R - r ) N + p C ( R - r ) p N + 2 2 ,

where the last inequality relies on the two facts: R 1 and p > 2 . Hence, using that q j 1 , one gets

| Q r ( t 0 , x 0 ) | C ( q j 6 ( R - r ) p ) N + 2 2 | Q R ( t 0 , x 0 ) | 2
C ( q j 6 ( R - r ) p ) N + 2 2 ( Q R ( t 0 , x 0 ) ( 1 + 𝒰 γ j - 1 ) 𝑑 t 𝑑 x ) 2 .

By summing | Q r ( t 0 , x 0 ) | on both sides of (4.7), this implies

Q r ( t 0 , x 0 ) ( 1 + 𝒰 γ j ) 𝑑 t 𝑑 x N + 2 N + 4 Q R ( t 0 , x 0 ) ( 1 + 𝒰 γ j ) 𝑑 t 𝑑 x + C ( q j 6 ( R - r ) p ) N + 2 2 ( Q R ( t 0 , x 0 ) ( 1 + 𝒰 γ j - 1 ) 𝑑 t 𝑑 x ) 2 .

We can now appeal to [17, Lemma 6.1] and absorb the term on the right-hand side containing 1 + 𝒰 γ j , in a standard way. By using the definition of q j , this leads to

(4.8) Q r ( t 0 , x 0 ) ( 1 + 𝒰 γ j ) 𝑑 t 𝑑 x C 2 3 ( N + 2 ) j ( R - r ) p N + 2 2 ( Q R ( t 0 , x 0 ) ( 1 + 𝒰 γ j - 1 ) 𝑑 t 𝑑 x ) 2 .

We want to iterate the previous estimate on a sequence of shrinking parabolic cylinders. We fix two radii 0 < r < R 1 , then we consider the sequence

R j = r + R - r 2 j - 1 , j { 0 } ,

and we apply (4.8) with R j + 1 < R j in place of r < R . We introduce the notation

Y j = Q R j ( t 0 , x 0 ) ( 1 + 𝒰 γ j - 1 ) 𝑑 t 𝑑 x .

Thus we get

Y j + 1 C  2 2 p ( N + 2 ) j ( R - r ) - p 2 ( N + 2 ) Y j 2 .

By iterating the previous estimate starting from j = 1 , we obtain for every n { 0 } ,

Y n + 1 ( C  2 2 p ( N + 2 ) ( R - r ) - p 2 ( N + 2 ) ) j = 0 n - 1 ( n - j )  2 j Y 1 2 n

possibly for a different constant C > 1 . We now take the power 2 - n on both sides

Y n + 1 2 - n ( C  2 2 p ( N + 2 ) ( R - r ) - p 2 ( N + 2 ) ) j = 0 n - 1 ( n - j )  2 j - n Y 1 C ( R - r ) - p ( N + 2 ) Y 1 ,

where the last inequality relies on the fact that

j = 0 n - 1 ( n - j )  2 j - n j = 1 j 2 j = 2 .

We thus get

(4.9)

𝒰 L ( Q r ( t 0 , x 0 ) ) = lim n ( Q R n + 1 ( t 0 , x 0 ) 𝒰 γ n 𝑑 t 𝑑 x ) 1 γ n lim sup n + ( Y n + 1 2 - n ) 2 n γ n
lim sup n + ( C ( R - r ) - p ( N + 2 ) Y 1 ) 2 n γ n
C ( R - r ) - p N + 2 4 ( Q R ( t 0 , x 0 ) ( 1 + 𝒰 p + 2 ) 𝑑 t 𝑑 x ) 1 4 .

Here, we have also used that γ 0 = p + 2 and γ n 2 n + 2 , for n going to . Finally, in order to remove the dependence on the L p + 2 norm of the gradient, we use a standard interpolation trick. We write

( Q R ( t 0 , x 0 ) ( 1 + 𝒰 p + 2 ) 𝑑 t 𝑑 x ) 1 4 ( 𝒰 L ( Q R ( t 0 , x 0 ) ) 2 Q R ( t 0 , x 0 ) 𝒰 p 𝑑 t 𝑑 x + | Q R ( t 0 , x 0 ) | ) 1 4
𝒰 L ( Q R ( t 0 , x 0 ) ) 1 2 ( Q R ( t 0 , x 0 ) 𝒰 p 𝑑 t 𝑑 x ) 1 4 + C R N + p 4 .

Inserting this estimate into (4.9), we get

𝒰 L ( Q r ( t 0 , x 0 ) ) C ( R - r ) p N + 2 4 [ 𝒰 L ( Q R ( t 0 , x 0 ) ) 1 2 ( Q R ( t 0 , x 0 ) 𝒰 p 𝑑 t 𝑑 x ) 1 4 + R N + p 4 ]
1 2 𝒰 L ( Q R ( t 0 , x 0 ) ) + C ( R - r ) p N + 2 2 ( Q R ( t 0 , x 0 ) 𝒰 p 𝑑 t 𝑑 x ) 1 2 + C R N + p 4 ( R - r ) p N + 2 4
1 2 𝒰 L ( Q R ( t 0 , x 0 ) ) + C ( R - r ) p N + 2 2 [ ( Q R ( t 0 , x 0 ) 𝒰 p 𝑑 t 𝑑 x ) 1 2 + 1 ] ,

where in the last line, we have used that R 1 . By [17, Lemma 6.1] again, we get

𝒰 L ( Q r ( t 0 , x 0 ) ) C ( R - r ) p N + 2 2 [ ( Q R ( t 0 , x 0 ) 𝒰 p 𝑑 t 𝑑 x ) 1 2 + 1 ] .

Since 𝒰 | u | N 𝒰 , this completes the proof. ∎

Remark 4.2.

An inspection of the proof reveals that the exponent α in (4.1) can be taken to be

α = { N + 2 2 if  N 3 , any number  > 2 if  N = 2 .

In the case N = 2 , the constant C blows-up as α tends to 2.

5 Proof of the Main Theorem

We take u L loc p ( I ; W loc 1 , p ( Ω ) ) to be a local weak solution of equation (1.1), i.e. it satisfies

(5.1) - I × Ω u φ t 𝑑 t 𝑑 x + I × Ω F 0 ( u ) , φ 𝑑 t 𝑑 x = 0

for every φ C 0 ( I × Ω ) . We recall that F 0 indicates the convex function

F 0 ( ξ ) = 1 p i = 1 N | ξ i | p for every  ξ N .

Our program is as follows:

  1. we approximate u by solutions u ε of the regularized equation,

  2. we transfer the Lipschitz estimate of Proposition 4.1 from u ε to u,

  3. we use a scaling argument to “rectify” the local estimate and obtain (1.4).

We start by recalling that u has the following additional properties: for every subinterval J I and every open set 𝒪 Ω , we have

(5.2) u t L p ( J ; W - 1 , p ( 𝒪 ) ) and u C ( J ¯ ; L 2 ( 𝒪 ) ) .

Here W - 1 , p ( 𝒪 ) is the topological dual space of W 0 1 , p ( 𝒪 ) and the latter is the completion of C 0 ( 𝒪 ) with respect to the L p norm of the gradient.

We briefly recall the argument to get (5.2), for completeness. Fix J and 𝒪 as above, by using equation (5.1) and the fact that F 0 ( u ) L loc p ( I × Ω ) , we get

| J × 𝒪 u ψ t 𝑑 t 𝑑 x | = | J × 𝒪 F 0 ( u ) , ψ 𝑑 t 𝑑 x |
C ψ L p ( J × 𝒪 ) = ψ L p ( J ; W 0 1 , p ( 𝒪 ) ) for every  ψ C 0 ( J × 𝒪 ) .

By density, we can extend the linear functional

Λ : ψ J × 𝒪 u ψ t 𝑑 t 𝑑 x

to the whole space L p ( J ; W 0 1 , p ( 𝒪 ) ) . This implies that (see for example [23, Theorem 1.5, Chapter III])

Λ ( L p ( J ; W 0 1 , p ( 𝒪 ) ) ) * = L p ( J ; W - 1 , p ( 𝒪 ) ) .

By the definition of Λ and of weak derivative, we get the first property in (5.2).

The second property in (5.2) follows by recalling that for every open set E N , we have (see [23, Proposition 1.2, Chapter III])

𝒲 p ( J × E ) := { φ L p ( J ; W 0 1 , p ( E ) ) : φ t L p ( J ; W - 1 , p ( E ) ) } C ( J ¯ ; L 2 ( E ) ) .

Indeed, it is sufficient to take 𝒪 𝒪 Ω and use the previous inclusion for the function θ u , where θ C 0 ( 𝒪 ) is such that θ 1 on 𝒪 . Thanks to the first property in (5.2) (used with 𝒪 in place of 𝒪 ) and the properties of θ, we have that

θ u 𝒲 p ( J × 𝒪 ) C ( J ¯ ; L 2 ( 𝒪 ) ) .

Since θ 1 on 𝒪 , the last fact entails that u C ( J ¯ ; L 2 ( 𝒪 ) ) , as claimed. In light of (5.2) and since we are only interested in a local result, it is not restrictive to assume from the beginning that

u L p ( I ; W 1 , p ( Ω ) ) C ( I ¯ ; L 2 ( Ω ) ) and u t L p ( I ; W - 1 , p ( Ω ) ) .

Part 1: Convergence of the approximation scheme. We remember the definition (3.1) of F ε for every ε 0 . We then consider the approximating initial boundary value problem parametrized by ε > 0

{ v t = div F ε ( v ) in  I × Ω , v = u on  I × Ω , v ( T 0 , ) = u ( T 0 , ) in  Ω .

By [23, Propositon 4.1, Chapter III], there exists a unique weak solution u ε L p ( I ; W 1 , p ( Ω ) ) to this problem such that

( u ε ) t L p ( I ; W - 1 , p ( Ω ) ) and thus  u ε C ( I ¯ ; L 2 ( Ω ) ) .

The boundary value condition is taken in the sense that

u ε - u L p ( I ; W 0 1 , p ( Ω ) ) ,

and the initial condition is taken in the L 2 sense, which is feasible thanks to the continuity properties of both u ε and u. The function u ε verifies

(5.3) - I × Ω u ε φ t 𝑑 t 𝑑 x + I × Ω F ε ( u ε ) , φ 𝑑 t 𝑑 x = 0

for every φ C 0 ( I × Ω ) . An integration by parts in (5.3) leads to

I ( ( u ε ) t , φ ) ( W - 1 , p , W 0 1 , p ) 𝑑 t + I × Ω F ε ( u ε ) , φ 𝑑 t 𝑑 x = 0 .

By density, the above identity remains true for every φ L p ( I ; W 0 1 , p ( Ω ) ) . Then the choice φ = u ε - u yields

I ( ( u ε ) t , u ε - u ) ( W - 1 , p , W 0 1 , p ) 𝑑 t + I × Ω F ε ( u ε ) , u ε - u 𝑑 t 𝑑 x = 0 .

By recalling the expression (3.1) of F ε , the previous integral identity can be rewritten as

I ( ( u ε ) t , u ε - u ) ( W - 1 , p , W 0 1 , p ) 𝑑 t + I × Ω F 0 ( u ε ) , u ε - u 𝑑 t 𝑑 x + ε I × Ω G ( u ε ) , u ε - u 𝑑 t 𝑑 x = 0 .

Starting from (5.1), we have similarly

I ( u t , u ε - u ) ( W - 1 , p , W 0 1 , p ) 𝑑 t + I × Ω F 0 ( u ) , u ε - u 𝑑 t 𝑑 x = 0 .

Upon subtracting the two identities above, we get

(5.4)

I ( ( u ε ) t - u t , u ε - u ) ( W - 1 , p , W 0 1 , p ) 𝑑 t + I × Ω F 0 ( u ε ) - F 0 ( u ) , u ε - u 𝑑 t 𝑑 x
+ ε I × Ω G ( u ε ) , u ε - u 𝑑 t 𝑑 x = 0 .

For the first term, we rely on

I ( ( u ε ) t - u t , u ε - u ) ( W - 1 , p , W 0 1 , p ) 𝑑 t = 1 2 { T 1 } × Ω | u ε - u | 2 𝑑 x ,

which follows from the fact that

t 1 2 Ω | u ε ( t , x ) - u ( t , x ) | 2 𝑑 x

is absolutely continuous on I, with derivative given exactly by

( ( u ε ) t - u t , u ε - u ) ( W - 1 , p , W 0 1 , p ) for a.e.  t I ,

see [23, Proposition 1.2].

For the second term in (5.4), we can use that for every ξ , ζ N , we have

F 0 ( ξ ) - F 0 ( ζ ) , ξ - ζ = i = 1 N ( | ξ i | p - 2 ξ i - | ζ i | p - 2 ζ i ) ( ξ i - ζ i )
2 2 - p i = 1 N | ξ i - ζ i | p 1 C | ξ - ζ | p

for some C = C ( N , p ) > 0 . The first inequality can be found in [20, Section 10]. On the other hand, the convexity of G implies that G is a monotone map and thus

G ( u ε ) , u ε - u G ( u ) , u ε - u .

By using these two pointwise estimates in (5.4), we thus get

1 2 { T 1 } × Ω | u ε - u | 2 𝑑 x + 1 C I × Ω | u ε - u | p 𝑑 t 𝑑 x ε I × Ω | G ( u ) , u ε - u | 𝑑 t 𝑑 x .

By the Cauchy–Schwarz inequality and the inequality | G ( ξ ) | ( 1 + | ξ | ) p - 1 , the right-hand side is not larger than

ε I × Ω ( 1 + | u | ) p - 1 | u ε - u | 𝑑 t 𝑑 x ,

which, by the Young inequality, in turn can be bounded from above by

p - 1 p ε I × Ω | u ε - u | p 𝑑 t 𝑑 x + ε p I × Ω ( 1 + | u | ) p 𝑑 t 𝑑 x .

Then for every ε > 0 , we have

1 2 { T 1 } × Ω | u ε - u | 2 𝑑 x + ( 1 C - p - 1 p ε ) I × Ω | u ε - u | p 𝑑 t 𝑑 x ε p I × Ω ( 1 + | u | ) p 𝑑 t 𝑑 x .

This estimate guarantees that the family { u ε } ε > 0 is strongly convergent to u in L p ( I × Ω ) . This is now sufficient to pass to the limit in the uniform Lipschitz estimate (4.1), which will be still valid for u, as well. By a covering argument, this in turn implies

u L loc ( I × Ω ) ,

as claimed.

Part 2: Scale invariant estimate. We finally focus on obtaining estimate (1.4). We fix

𝒬 τ , R ( t 0 , x 0 ) = ( t 0 - τ , t 0 ) × ( x 0 - R , x 0 + R ) N I × Ω

as in the statement of the Main Theorem. By recalling Remark 1.1, we know that for every R > 0 and μ > 0 the function

U R ( t , x ) = μ u ( t 0 + μ p - 2 R p t , x 0 + R x )

is still a local weak solution of our parabolic equation, this time in the rescaled set

I ~ × Ω ~ := ( T 0 - t 0 μ p - 2 R p , T 1 - t 0 μ p - 2 R p ) × Ω - x 0 R .

We consider the compactly contained cube 𝒬 τ μ p - 2 R p , 1 ( 0 , 0 ) I ~ × Ω ~ and make the choice

μ p - 2 R p = τ that is, μ = ( τ R p ) 1 p - 2 ,

so that

𝒬 τ μ p - 2 R p , 1 ( 0 , 0 ) = 𝒬 1 , 1 ( 0 , 0 ) = Q 1 ( 0 , 0 ) .

From Part 1, we know that we can use the a priori estimate (4.1) for U R on the anisotropic parabolic cubes

Q 1 2 ( 0 , 0 ) = ( - 1 2 p , 0 ) × ( - 1 2 , 1 2 ) N and Q 1 ( 0 , 0 ) = ( - 1 , 0 ) × ( - 1 , 1 ) N .

This gives

U R L ( Q 1 2 ( 0 , 0 ) ) C ( Q 1 ( 0 , 0 ) | U R | p 𝑑 t 𝑑 x ) 1 2 + C .

If we scale back, we get

(5.5) u L ( 𝒬 τ 2 p , R 2 ( t 0 , x 0 ) ) C ( τ R 2 ) 1 2 ( 𝒬 τ , R ( t 0 , x 0 ) | u | p 𝑑 t 𝑑 x ) 1 2 + C ( R 2 τ ) 1 p - 2 .

This already proves the claimed a priori estimate (1.4) for 0 < σ 1 2 p .

For any σ ( 0 , 1 ) , we can take ( t 1 , x 1 ) 𝒬 σ τ , σ R ( t 0 , x 0 ) such that

u L ( 𝒬 σ τ , σ R ( t 0 , x 0 ) ) u L ( 𝒬 1 - σ 2 p τ , 1 - σ 2 R ( t 1 , x 1 ) ) .

Observe that for every ( s , y ) 𝒬 σ τ , σ R ( t 0 , x 0 ) , we have

𝒬 1 - σ 2 p τ , 1 - σ 2 R ( s , y ) 𝒬 ( 1 - σ ) τ , ( 1 - σ ) R ( s , y ) 𝒬 τ , R ( t 0 , x 0 ) .

By applying (5.5) on this parabolic cylinder, we get

u L ( 𝒬 σ τ , σ R ( t 0 , x 0 ) ) u L ( 𝒬 1 - σ 2 p τ , 1 - σ 2 R ( t 1 , x 1 ) )
C 1 - σ ( τ R 2 ) 1 2 ( 𝒬 ( 1 - σ ) τ , ( 1 - σ ) R ( t 1 , x 1 ) | u | p 𝑑 t 𝑑 x ) 1 2 + C ( 1 - σ ) 1 p - 2 ( R 2 τ ) 1 p - 2 .

By observing that

𝒬 ( 1 - σ ) τ , ( 1 - σ ) R ( t 1 , x 1 ) | u | p 𝑑 t 𝑑 x = 1 ( 1 - σ ) N + 1 ( 2 R ) N τ 𝒬 ( 1 - σ ) τ , ( 1 - σ ) R ( t 1 , x 1 ) | u | p 𝑑 t 𝑑 x
1 ( 1 - σ ) N + 1 ( 2 R ) N τ 𝒬 τ , R ( t 0 , x 0 ) | u | p 𝑑 t 𝑑 x
= 1 ( 1 - σ ) N + 1 𝒬 τ , R ( t 0 , x 0 ) | u | p 𝑑 t 𝑑 x ,

we eventually reach the desired conclusion.


Dedicated to the memory of Emmanuele DiBenedetto, with esteem and admiration



Communicated by Jan Kristensen


Funding statement: The paper has been partially written during a visit of Pierre Bousquet and Lorenzo Brasco to Napoli and of Chiara Leone to Ferrara. Both visits have been funded by the Gruppo Nazionale per l’Analisi Matematica, la Probabilità e le loro Applicazioni (GNAMPA) through the project “Regolarità per operatori degeneri con crescite generali ”. Hosting institutions are gratefully acknowledged. Chiara Leone and Anna Verde are members of the Gruppo Nazionale per l’Analisi Matematica, la Probabilità e le loro Applicazioni (GNAMPA) of the Istituto Nazionale di Alta Matematica (INdAM).

Acknowledgements

We thank Alkis S. Tersenov for pointing out the reference [24].

References

[1] V. Bögelein, F. Duzaar and P. Marcellini, Parabolic equations with p , q -growth, J. Math. Pures Appl. (9) 100 (2013), no. 4, 535–563. 10.1016/j.matpur.2013.01.012Suche in Google Scholar

[2] V. Bögelein, F. Duzaar and G. Mingione, The regularity of general parabolic systems with degenerate diffusion, Mem. Amer. Math. Soc. 221 (2013), 1–143. 10.1090/S0065-9266-2012-00664-2Suche in Google Scholar

[3] P. Bousquet and L. Brasco, C 1 regularity of orthotropic p-harmonic functions in the plane, Anal. PDE 11 (2018), no. 4, 813–854. 10.2140/apde.2018.11.813Suche in Google Scholar

[4] P. Bousquet, L. Brasco and V. Julin, Lipschitz regularity for local minimizers of some widely degenerate problems, Ann. Sc. Norm. Super. Pisa Cl. Sci. (5) 16 (2016), no. 4, 1235–1274. 10.2422/2036-2145.201410_006Suche in Google Scholar

[5] P. Bousquet, L. Brasco, C. Leone and A. Verde, On the Lipschitz character of orthotropic p-harmonic functions, Calc. Var. Partial Differential Equations 57 (2018), no. 3, Paper No. 88. 10.1007/s00526-018-1349-3Suche in Google Scholar

[6] L. Brasco and G. Carlier, On certain anisotropic elliptic equations arising in congested optimal transport: Local gradient bounds, Adv. Calc. Var. 7 (2014), no. 3, 379–407. 10.1515/acv-2013-0007Suche in Google Scholar

[7] L. Brasco, C. Leone, G. Pisante and A. Verde, Sobolev and Lipschitz regularity for local minimizers of widely degenerate anisotropic functionals, Nonlinear Anal. 153 (2017), 169–199. 10.1016/j.na.2016.06.006Suche in Google Scholar

[8] Y. Z. Chen and E. DiBenedetto, On the local behavior of solutions of singular parabolic equations, Arch. Ration. Mech. Anal. 103 (1988), no. 4, 319–345. 10.1007/BF00251444Suche in Google Scholar

[9] H. J. Choe, Hölder regularity for the gradient of solutions of certain singular parabolic systems, Comm. Partial Differential Equations 16 (1991), no. 11, 1709–1732. 10.1080/03605309108820820Suche in Google Scholar

[10] S. Ciani, S. Mosconi and V. Vespri, Parabolic Harnack estimates for anisotropic slow diffusion, preprint (2020), https://arxiv.org/abs/2012.09685. Suche in Google Scholar

[11] F. Demengel, Lipschitz interior regularity for the viscosity and weak solutions of the pseudo p-Laplacian equation, Adv. Differential Equations 21 (2016), no. 3–4, 373–400. 10.57262/ade/1455805262Suche in Google Scholar

[12] E. DiBenedetto, On the local behaviour of solutions of degenerate parabolic equations with measurable coefficients, Ann. Sc. Norm. Super. Pisa Cl. Sci. (4) 13 (1986), no. 3, 487–535. Suche in Google Scholar

[13] E. DiBenedetto, Degenerate Parabolic Equations, Universitext, Springer, New York, 1993. 10.1007/978-1-4612-0895-2Suche in Google Scholar

[14] E. DiBenedetto and A. Friedman, Regularity of solutions of nonlinear degenerate parabolic systems, J. Reine Angew. Math. 349 (1984), 83–128. 10.1515/crll.1984.349.83Suche in Google Scholar

[15] E. DiBenedetto and A. Friedman, Hölder estimates for nonlinear degenerate parabolic systems, J. Reine Angew. Math. 357 (1985), 1–22. 10.1515/crll.1985.357.1Suche in Google Scholar

[16] F. Feo, J. L. Vázquez and B. Volzone, Anisotropic p-Laplacian evolution of fast diffusion type, Adv. Nonlinear Stud. 21 (2021), no. 3, 523–555. 10.1515/ans-2021-2136Suche in Google Scholar

[17] E. Giusti, Direct Methods in the Calculus of Variations, World Scientific, River Edge, 2003. 10.1142/5002Suche in Google Scholar

[18] T. Kuusi and G. Mingione, Potential estimates and gradient boundedness for nonlinear parabolic systems, Rev. Mat. Iberoam. 28 (2012), no. 2, 535–576. 10.4171/rmi/684Suche in Google Scholar

[19] T. Kuusi and G. Mingione, Riesz potentials and nonlinear parabolic equations, Arch. Ration. Mech. Anal. 212 (2014), no. 3, 727–780. 10.1007/s00205-013-0695-8Suche in Google Scholar

[20] P. Lindqvist, Notes on the p-Laplace Equation, Report. University of Jyväskylä 102, University of Jyväskylä, Jyväskylä, 2006. Suche in Google Scholar

[21] J.-L. Lions, Quelques méthodes de résolution des problèmes aux limites non linéaires, Dunod, Paris, 1969. Suche in Google Scholar

[22] G. Pisante and A. Verde, Regularity results for non smooth parabolic problems, Adv. Differential Equations 13 (2008), no. 3–4, 367–398. 10.57262/ade/1355867354Suche in Google Scholar

[23] R. E. Showalter, Monotone Operators in Banach Space and Nonlinear Partial Differential Equations, Math. Surveys Monogr. 49, American Mathematical Society, Providence, 1997. Suche in Google Scholar

[24] A. S. Tersenov and A. S. Tersenov, Existence of Lipschitz continuous solutions to the Cauchy–Dirichlet problem for anisotropic parabolic equations, J. Funct. Anal. 272 (2017), no. 10, 3965–3986. 10.1016/j.jfa.2017.02.014Suche in Google Scholar

[25] K. Uhlenbeck, Regularity for a class of non-linear elliptic systems, Acta Math. 138 (1977), no. 3–4, 219–240. 10.1007/BF02392316Suche in Google Scholar

[26] N. N. Ural’ceva, Degenerate quasilinear elliptic systems, Zap. Naučn. Sem. Leningrad. Otdel. Mat. Inst. Steklov. (LOMI) 7 (1968), 184–222. Suche in Google Scholar

[27] M. I. Višik, Solubility of boundary-value problems for quasi-linear parabolic equations of higher orders, Mat. Sb. (N. S.) 59(101) (1962), 289–325. Suche in Google Scholar

[28] M. Wiegner, On C α -regularity of the gradient of solutions of degenerate parabolic systems, Ann. Mat. Pura Appl. (4) 145 (1986), 385–405. 10.1007/BF01790549Suche in Google Scholar

[29] E. Zeidler, Nonlinear Functional Analysis and its Applications. II/A – Linear Monotone Operators, Springer, New York, 1989. Suche in Google Scholar

Received: 2021-06-11
Accepted: 2021-09-17
Published Online: 2021-12-03
Published in Print: 2023-07-01

© 2021 Walter de Gruyter GmbH, Berlin/Boston

This work is licensed under the Creative Commons Attribution 4.0 International License.

Heruntergeladen am 19.9.2025 von https://www.degruyterbrill.com/document/doi/10.1515/acv-2021-0052/html
Button zum nach oben scrollen