Home Mathematics Improvements of algebraic flux-correction schemes based on Bernstein finite elements
Article Open Access

Improvements of algebraic flux-correction schemes based on Bernstein finite elements

  • Hennes Hajduk EMAIL logo
Published/Copyright: June 26, 2025

Abstract

In Galerkin finite element schemes, the discrete first derivative operator for each spatial dimension is a square matrix that is skew-symmetric under restrictive assumptions for certain types of discretizations and boundary conditions. In most settings, however, this desirable property is violated, often only for a few pairs of nodes. These exceptions can invalidate certain design principles based on the skew-symmetry assumption made for these operators. This paper demonstrates that algebraic manipulations can be performed to make the discrete gradient operators of Bernstein polynomial-based finite element methods skew symmetric. Interest in such discretizations has recently been increasing because they represent natural extensions of second-order algebraic flux correction schemes to higher-order spaces. We employ the new operators in the context of such property-preserving methods, mostly based on discontinuous Galerkin discretizations of arbitrary order. Additional theoretical results for the schemes under investigation are derived, including local and global entropy inequalities, among others. Moreover, a discussion on the optimality of CFL-like time step restrictions arising in explicit Runge–Kutta methods shows that our new approach is superior to earlier representatives of operators employed in similar contexts. These techniques use the monolithic convex limiting paradigm and are applied to the compressible Euler equations.

MSC 2010 Classification: 65M60

1 Introduction

Hyperbolic problems are commonly solved numerically using high-resolution schemes. These encompass stabilization and limiting techniques designed for the purpose of obtaining physics-conforming approximations are, for instance, residual distribution schemes, slope and a posteriori limiters, entropy-based semi-discrete approaches, smoothness indicators, and/or weighted essentially non-oscillatory (WENO) techniques, see e.g., [1], [2], [3], [4], [5], [6]. For an overview of property-preservation, we refer to the book [7]. Most of the techniques mentioned here work by blending a certain high-order target scheme with a low-order counterpart to be used in the vicinity of steep gradients [8]. The main question is how to choose the numerical viscosity locally to preserve accuracy in smooth regions but avoid formation of spurious numerical solutions. These features include violations of nonnegativity constraints, Gibbs phenomena, and/or nonentropic results.

In this work, we focus on algebraic flux correction (AFC) schemes, e.g., [9], [10], [11], [12], [13], [14] because they are failsafe, in the sense that it is possible to guarantee the validity of various desirable constraints, e.g., the nonnegativity of certain quantities. Moreover, they are prone to numerical analysis, e.g., [7], [15], [16], [17] and can also be designed in a way that a multitude of desirable properties are guaranteed. Usual properties include discrete maximum principles [14], nonnegativity constraints [13], entropy stability conditions [18], and well-balancedness [12]. The latter reference presents a scheme that possesses all of these features at once. As such, it is unique among the families of high-resolution schemes. Because of such progress, additional improvements of AFC schemes are called for, and this work is aimed at that purpose.

The first work on algebraic flux correction schemes [19] has spawned many further developments and theoretical analyses [10], [11], [13], [14], [15], [17], [20], [21]. For instance, AFC techniques based on high-order Bernstein polynomials are presented in refs. [10], [14] for discontinuous Galerkin (DG) methods and classical finite elements, respectively. Combining the theoretical frameworks developed in the seminal papers by Hoff [22] and Harten et al. [23], Guermond and Popov [24] propose a new way of analyzing the low-order method, which provides sufficient numerical dissipation to guarantee the failsafe property of the scheme. In ref. [11] and related works, this approach is used in the predictor stage of the proposed convex limiting techniques. Kuzmin’s monolithic convex limiting (MCL) paradigm [13] is no representative of such FCT approaches. Instead, it operates on the semi-discrete level of spatial discretizations, allowing for steady-state calculations [13], [25], [26], implicit time stepping [26], [27], [28], and semi-discrete entropy stabilization [18], [28], [29] based on Tadmor’s entropy stability theory [30], [31]. MCL techniques have since been further developed, e.g., [18], [21], [25], [32] and analyzed [7], [17]. Following the work of Lohmann et al. [14] on high-order FCT schemes for finite elements based on Bernstein basis functions, Kuzmin and Quezada de Luna [32] first developed a high-order counterpart of MCL and later introduced entropy limiting based on Tadmor’s criterion into these algorithms [18], [29], [30], [31]. Subsequently, we extended the approaches in refs. [13], [32] to Bernstein-DG discretizations [25]. Rueda-Ramírez et al. [21] use MCL techniques in combination with Legendre–Gauss–Lobatto (LGL) bases for DG. These authors were the first who enforced Tadmor’s entropy stability criterion [30], [31] in the AFC-DG context to stabilize entropy-producing terms arising from volume integrals. In ref. [21], numerical fluxes across interior boundaries are of local Lax–Friedrichs (LLF) type. In contrast, here and in ref. [28], Ch. 6], an arbitrary flux is blended with such an LLF counterpart and entropy limiting can also be performed for blended interfacial fluxes.

The present article addresses the following issues: The employed discrete gradient operators in AFC schemes are generally not skew-symmetric as a result of the type of discretization and/or due to using other than periodic boundary conditions. Nevertheless, the assumption of skew-symmetry is often made for theoretical purposes [27], Rem. 4] and in the design of flux limiters [33]. The concept of summation-by-parts, often invoked in the LGL-DG context [20], [21], [34], [35], yields skew-symmetric discrete gradients seemingly for free, which suggests that, in this sense, LGL schemes are superior to their Bernstein polynomial-based counterparts [14], [25], [32]. However, the latter produce significantly less restrictive CFL conditions because the Bernstein nodes are uniformly distributed within the elements as opposed to the LGL nodes. Thus, it is worthwhile to adapt the LGL-summation-by-parts concept to high-order convex limiting schemes based on the Bernstein basis. A main aspect of this work is dedicated to this effort. Bernstein basis functions possess many further desirable properties (see [14], [25], [32] and the references therein) such as nonnegativity and boundedness of local approximations by the min- and max-values of nodal values within the elements, which can be exploited for mathematical purposes. We also demonstrate that a supposedly entropy-stable AFC limiter can actually produce entropy instead of dissipating it if the non-skew-symmetric version is used instead of our new one. Additionally, this work presents novel theoretical results for monolithic flux correction schemes applied to hyperbolic problems. In this regard, the present paper can be seen as a follow-up to ref. [25], again focusing on DG but second-order continuous Galerkin schemes are also discussed.

2 Preliminaries

2.1 Model problem

Let Ω R d , d ∈ {1, 2, 3}, x ∈ Ω, t ⩾ 0, and let u ( x , t ) R M , M N , be the solution to the conservation law

(1) u t + f ( u ) = 0 in  Ω × ( 0 , ) ,

where ∇⋅ is the matrix-divergence operator applied to the inviscid flux f = f ( u ) R M × d . To formulate a well-posed initial-boundary value problem, (1) has to be equipped with the initial condition u(⋅, 0) = u 0 and suitable data u 0 = u 0 ( x ) R M as well as appropriate boundary conditions. The latter are specified for the test problems in Section 6. For many specific applications (e.g., scalar problems, shallow water equations, gas dynamics), the exact solution to (1) is known to satisfy certain admissibility constraints that can be described by constraining u to lie in a some convex set A R M . For instance, in the scalar case, M = 1, A is an interval bounded by the extrema of initial and boundary conditions [36], Ch. 6]. Suppose there exists a convex function U : A R and a corresponding flux F : A R d such that F ′(u) = U′(u) f ′(u), then (U, F ) is called an entropy pair for the conservation law (1). By the chain rule, a conservation law for the entropy U can be derived under the assumption that u and U are smooth. In general however, weak solutions to the nonlinear problem (1) can develop discontinuities in finite time. Using, for instance, the concept of vanishing viscosity solutions [36], Sect. 6.3], one can show well-posedness of weak entropy solutions for the scalar case, if a weak form of the entropy inequality

(2) U ( u ) t + F ( u ) 0 in  Ω × ( 0 , )

holds for all (U, F ) that are entropy pairs for (1). If discrete versions of (2) are satisfied, ideally in a localized manner, the occurrence of nonphysical weak solutions can be averted in numerical approximations, which motivates the use of entropy-stable approximations, e.g., [3], [4], [7], [12], [21].

2.2 Generic monolithic convex limiting discretization

We now summarize the concept of monolithic convex limiting (MCL), first proposed by Kuzmin [13], see also [7], [18], [21], [25], [32]. This algebraic flux correction (AFC) paradigm operates on the semi-discrete level and is capable of enforcing a variety of constraints. A generic MCL semi-discretization can be written as

(3) m i d u i d t = j N i \ { i } 2 d i j u ̄ i j * u i .

Here the index i ∈ {1, …, N} refers to a node, N is the number of unknowns for each variable, and thus the total number of unknowns is MN. Note that in the system case all components of the solution vector u are interpolated on the same set of nodes, which are associated with a certain node x i Ω ̄ , i.e., we do not consider staggered approaches. Similarly, m i > 0 is associated with this node and typically represents an entry of a diagonalized mass matrix. The set N i { 1 , , N } is the nodal stencil of x i , i.e., the set of nodes x j that directly interact with x i . In piecewise linear continuous finite element methods, N i is the set of mesh vertices that are nearest neighbors to node x i in the sense that they are endpoints of a certain mesh edge. Entities featuring two distinct indices such as the symmetric artificial diffusion coefficient d ij = d ji ⩾ 0 refer to terms depending on the two nodal values u i and u j . In the MCL framework, the flux-corrected bar states u ̄ i j * [13] are algebraically enforced to satisfy user-defined constraints of convex nature. Upon temporal discretization of (3) with a strong-stability preserving (SSP) Runge–Kutta (RK) method [37], [38], [39], Shu–Osher updates can be written as a convex combination of forward Euler steps

(4) u i F E = 1 τ m i j N i \ { i } 2 d i j u i + τ m i j N i \ { i } 2 d i j u ̄ i j * ,

where τ > 0 is the time step. Suppose that u i A i for a certain convex subset A i A of the largest admissible set. If u ̄ i j * A i can be guaranteed for j N i \ { i } and the CFL-like time step restriction

(5) τ min i { 1 , , N } m i j N i \ { i } 2 d i j

holds, then u i F E is a convex combination of the solution at the previous iteration u i and the flux-corrected bar states u ̄ i j * . In conclusion, by convexity of A i , we then have u i F E A i as well. Recent works, e.g., [33], [40], also demonstrate how AFC and MCL can be performed using non-SSP time discretizations, including implicit methods and applications to viscous problems. Although they appear promising, these issues shall not be addressed here since our focus lies on the spatial discretization. The particular choice of the sets A i and how to enforce u ̄ i j * A i for all j N i \ { i } sets different MCL techniques apart from each other. Before discussing high-order DG discretizations, we give a brief example of arguably the simplest MCL variant.

2.3 Case study: MCL based on classical finite elements for the Euler equations

We only summarize the steady case here. More information can be found in ref. [7] and the references therein. Let I be the d × d-identity matrix and let ρ = ρ ( x , t ) R , v = v ( x , t ) R d , p = p ( x , t ) R denote density, velocity, and pressure of an ideal polytropic gas. We consider the compressible Euler equations

(6a) ρ t + ( ρ v ) = 0 ,

(6b) ( ρ v ) t + ( ρ v v + p I ) = 0 ,

(6c) ( ρ E ) t + ( ρ E + p ) v = 0 ,

where u = [ ρ , ρ v , ρ E ] is the vector of conserved unknowns (density, momentum, and total energy), and

(7) p = ( γ 1 ) ρ E ρ | v | 2 2 = ( γ 1 ) ρ e ,

where γ > 1 is the adiabatic constant and e is the specific internal energy. The (physically motivated) largest admissible set for (6) and (7) is A = { u = [ ρ , ρ v , ρ E ] R d + 2 : ρ 0 , p 0 } . Note that due to (7), nonnegativity of pressure is equivalent to the physically motivated constraint that internal energy e may not become negative. The usual entropy pair for (6) and (7) is given by (U, F ) with U = −ρlog( γ ), F = v U [41].

Let us discretize (1) using a conforming simplicial mesh and local P 1 -polynomials with corresponding Lagrange basis functions φ i C ( Ω ̄ ) , i ∈ {1, …, N}, defined implicitly by the interpolation property φ i ( x j ) = δ ij for all i, j ∈ {1, …, N}. We then set m i = ∫Ω φ i  d x , c ij = ∫Ω φ i φ j  d x and

(8) λ i j = λ ( u i , u j , c i j / | c i j | ) : = max v i c i j / c i j + γ p i / ρ i , v j c i j / c i j + γ p j / ρ j .

For a unit vector n ij = c ij /| c ij |, (8) corresponds to the largest absolute eigenvalue of the Jacobian of f (u i ) or f (u j ) projected onto n ij . This estimate for the exact wave speed does not produce states outside of the set A [42], App.], which is why we use (8) to define the diffusion coefficients as follows [24], [43], [44]:

(9) d i j = max { λ i j | c i j | , λ j i | c j i | } i , j { 1 , , N } , i j .

Next, we define the low-order bar states u ̄ i j = u i + u j 2 + ( f i f j ) c i j 2 d i j [23], [24] and their MCL counterparts [13]:

(10) u ̄ i j * = u ̄ i j + f i j * 2 d i j ,

where f i := f (u i ) and f i j * = f j i * is a flux-corrected counterpart of the target flux f ij , which for steady-state calculations can simply be set to f ij = d ij (u i u j ). The low-order counterpart u ̄ i j of u ̄ i j * initially appeared in Harten et al. [23]. Guermond and Popov [24] were the first to use it in the AFC context to analyze the low-order method that is the first-order predictor in FCT-type convex limiting schemes and, similarly, the low-order version of MCL approaches. In either case, the limited antidiffusive fluxes f i j * are set to zero.

In general, the least-restrictive constraint that should be enforced for any hyperbolic problem is the admissibility of solutions, i.e., u A . For the compressible Euler equations, we need to ensure that densities remain nonnegative, which can be achieved by enforcing positivity of the first component of u ̄ i j * = ρ ̄ i j * , ( ρ v ) ̄ i j * , ( ρ E ) ̄ i j * . Using definition (10) and skew symmetry of f i j * , this task is accomplished by setting the first component of f i j * to max 2 d i j ρ ̄ i j , min { f i j ρ , 2 d i j ρ ̄ j i } , where ρ ̄ i j and f i j ρ correspond to the first component of the low-order bar state u ̄ i j and the unlimited flux f ij , respectively. Next, we can enforce additional constraints such as nonnegativity of pressure and Tadmor’s entropy stability condition [30], [31] by further reducing the magnitude of f i j * . These issues are described in detail in refs. [13], [18], [27], respectively, see also [7]. Additionally, the quality of numerical solutions can be improved by enforcing discrete maximum principles in addition to nonnegativity, see, e.g., [7], [10], [16], [21], [28], [40]. All of these tasks (and more, for instance, well-balancedness in the case of the shallow water equations [12]) can be guaranteed simultaneously.

Since nonnegativity of pressure is a nonnegotiable constraint (as is nonnegativity of density), we briefly summarize the typical MCL pressure fix [13], [21], [25]. For a pair of nodes ij let f i j * be the prelimited antidiffusive flux. Setting f i j * * = α i j f i j * , where the correction factor α ij = α ji ∈ [0, 1] is defined via [13]:

(11) α i j = Q i j R i j    if  R i j > Q i j , 1    otherwise ,

where w ̄ i j = 2 d i j u ̄ i j = w ̄ i j ρ , w ̄ i j ρ v , w ̄ i j ρ E , f i j * = f i j ρ , * , f i j ρ v , * , f i j ρ E , * , and

Q i j = Q j i = min w ̄ i j ρ w ̄ i j ρ E 1 2 | w ̄ i j ρ v | 2 , w ̄ j i ρ w ̄ j i ρ E 1 2 | w ̄ j i ρ v | 2 , R i j = R j i = max | w ̄ i j ρ v | , | w ̄ j i ρ v | | f i j ρ v , * | + max w ̄ i j ρ , w ̄ j i ρ | f i j ρ E , * | + max w ̄ i j ρ E , w ̄ j i ρ E | f i j ρ , * | + max 0 , 1 2 | f i j ρ v , * | 2 f i j ρ , * f i j ρ E , *

guarantees that the pressure of the bar state u ̄ i j * * = u ̄ i j + f i j * * / ( 2 d i j ) remains nonnegative. The derivation of (11) can be found in refs. [13], [28]. The following result is an interesting observation made in the process of this work.

Lemma 1

(pressure fix yields nonnegative density). Let u i , u j A be arbitrary, u ̄ i j * * = u ̄ i j + f i j * * / ( 2 d i j ) , where d ij is given by (9), f i j * * = α i j f i j * , and let α ij be defined by (11). Then the first component of u ̄ i j * * is nonnegative.

Proof.

For α ij given by [13], Eq. (92)] or (11), the pressures of the states u ̄ i j * * = u ̄ i j + α i j f i j * / ( 2 d i j ) and u ̄ j i * * = u ̄ j i α i j f i j * / ( 2 d i j ) are nonnegative [13]. The density and total energy components of the low-order bar states u ̄ i j , u ̄ j i are also nonnegative [24], Lem. 2.1], see also [23]. Thus, only one of the two terms w ̄ i j ρ + α i j f i j ρ , * or w ̄ j i ρ α i j f i j ρ , * can potentially become negative. Let us first consider the case f i j ρ , * < 0 , where

α i j f i j ρ , * Q i j R i j | f i j ρ , * | w ̄ i j ρ w ̄ i j ρ E 1 2 | w ̄ i j ρ v | 2 w ̄ i j ρ E | f i j ρ , * | | f i j ρ , * | = w ̄ i j ρ | w ̄ i j ρ v | 2 2 w ̄ i j ρ E w ̄ i j ρ

since w ̄ i j ρ E 0 . Thus, the first component of w ̄ i j * * = 2 d i j u ̄ i j * * , w ̄ i j ρ + α i j f i j ρ 0 . The case f i j ρ > 0 is similar. □

3 Skew-symmetry of discrete gradient operators

3.1 General considerations and second-order continuous Galerkin schemes

Let us once more consider a P 1 or Q 1 continuous Galerkin discretization as described in Section 2.3. If the domain has only periodic boundaries, then each component C k of the discrete gradient operator C = ( C 1 , , C d ) = ( c i j ) i , j = 1 N is a skew-symmetric N × N matrix due to the divergence theorem. Otherwise,

c i j = Ω φ i φ j d x = Ω φ i φ j d x + Ω φ i φ j n d s = : c j i + b i j ,

where n = n ( x ) R d denotes the unit outward normal to ∂Ω. In this setting, each matrix C k , k ∈ {1, …, d}, is almost skew-symmetric, with only a few pairs of entries where b ij ≠ 0 being exceptions to this rule. For flux-correction schemes, the skew symmetry of C is often desirable to design new algorithms [33], simplify computations (note that c ij = − c ji implies λ ij = λ ji in (8) (the computation of λ ij is a significant cost factor in AFC schemes for systems), and to prove certain theoretical results such as entropy stability [18], see also Section 4.3.2. Therefore, it is unfortunate that a few boundary nodes invalidate skew symmetry.

We now explain how the issue raised at the beginning of this section can be resolved starting in the already introduced CG setting. Let us take a step back and remind ourselves how the matrices C arise in the first place. The strong form of the CG discretization of the conservation law (1) contains the integral

(12) Ω φ i f ( u h ) d x ,

which can generally only be approximated via quadrature. The group finite element formulation [45], [46]:

(13) f h : = j = 1 N f j φ j f ( u h )

interpolates the inviscid flux f (u h ) in the finite element space. On simplicial meshes with periodic boundary conditions, this approach can be interpreted as nodal quadrature [46]. Inserting (13) into (12), we obtain

(14) Ω φ i f ( u ) d x j = 1 N f j Ω φ i φ j d x = j = 1 N f j c i j = j N i \ { i } ( f j f i ) c i j ,

which is how the discrete gradient operators arise in AFC schemes. The last equality in (14) holds because the row sums of each C k are zero due to the partition-of-unity property j = 1 N φ j ( x ) = 1 for any  x Ω ̄ . Replacing c ij in (14) by some c ̃ i j R d such that c ̃ i j = 0 if j N i is equivalent to adding

(15) j N i \ { i } ( f j f i ) ( c ̃ i j c i j )

to (14). If for all k ∈ {1, …, d} the row sums of each C ̃ k in C ̃ = ( C ̃ 1 , , C ̃ d ) = ( c ̃ i j ) i , j = 1 N sum to zero, and the columns of the matrices C k and C ̃ k add up to the same values c j k stored in c j = c j 1 , , c j d , then

i = 1 N j N i \ { i } ( f j f i ) ( c ̃ i j c i j ) = j = 1 N f j i = 1 N c ̃ i j i = 1 N c i j = j = 1 N f j ( c j c j ) = 0 .

Thus, the proposed modification does not lead to a change in the global mass balance. Note that by writing the correction term (15) using a sum over only indices ji, we may choose the diagonal entries c ̃ i i arbitrarily. This possibility is what will allow us to define the skew-symmetric discrete gradients below.

Remark 1.

In this paper, we are referring to a matrix A = ( a i j ) i , j = 1 N as skew symmetric, if a ij + a ji = 0 for all i, j ∈ {1, …, N}, ij, which does not imply zero diagonal entries. Their values are unimportant since their use can be avoided in AFC schemes via formulations such as (15).

An additional constraint that is desirable in the context of high-order AFC schemes is the sparsity of C ̃ [10], [14], [20], [21], [25], [32], [47]. By that we refer to the property that every nonzero offdiagonal entry corresponds to two distinct nodes that are closest neighbors (in a certain metric) in the submesh obtained by using each Bernstein node as a degree of freedom of a continuous subcell P 1 or Q 1 (depending on the element geometry) discretization [13], [14], [25]. The corresponding stencil N ̃ i will be further specified below. Each pair of nodes (i, j) with j N ̃ i \ { i } corresponds to one antidiffusive flux that needs to be limited and incorporated. Thus, the fewer elements in the stencils N ̃ i the better in terms of computational resources.

Thus, we may replace the discrete gradient operator C by matrices C ̃ of the same size satisfying

(16a) c ̃ i j = c ̃ j i i , j { 1 , , N } , i j ,

(16b) j = 1 N c ̃ i j = 0 ,

(16c) i = 1 N ( c i j c ̃ i j ) = 0 j { 1 , N } ,

(16d) c ̃ i j k 0 i = j  or  i  and  j  are closest neighbors in the stencil  N ̃ i .

The remainder of this section is dedicated to constructing such matrices C ̃ . For second-order continuous Galerkin schemes with weakly-enforced boundary conditions, C satisfies (16b)–(16d). Thus finding a C ̃ that also fulfills (16a) is simple (which is why no originality is claimed here, although we found no works using these matrices in a similar context). Given the original operators C and symmetric boundary matrices B = ( B 1 , , B d ) = ( b i j ) i , j = 1 N , we only have to adjust a few entries of C , which can be done as follows:

c ̃ i j = c i j 1 2 b i j if  i j , c i i + 1 2 k N i \ { i } b i k otherwise.

Proving the validity of (16) for this operator is straightforward. So far, we have focused on global matrices. Below, we discuss elementwise operators because these become relevant in the high-order and/or DG cases.

3.2 Skew-symmetric discrete gradient operators for Bernstein polynomials

In Section 3.1, we established that the group finite element formulation [45], [46] produces the discrete gradients

(17) C = ( C 1 , , C d ) = ( c i j ) i , j = 1 N , c i j = Ω φ i φ j d x ,

where { φ i } i = 1 N are the Lagrange basis functions of P 1 or Q 1 continuous finite element spaces. Let us now consider elementwise operators C for a certain reference element K R d and the Bernstein basis. These matrices are obtained by replacing the number of rows and columns N and the integration domain Ω in (17) by the local number of unknowns and the reference element K, respectively. With some abuse of notation, we avoid making this distinction, to avoid redefining all operators or carry additional indices. For each of the below element types, we construct matrices C ̃ = ( C ̃ 1 , , C ̃ d ) = ( c ̃ i j ) i , j = 1 N , satisfying conditions (16).

3.2.1 1D elements

Denote the 1D reference element by K = [0, 1]. The Bernstein polynomials of degree p N associated with the nodes x k = k/p are φ k p ( x ) = p k ( 1 x ) p k x k , k ∈ {0, …, p}. The derivative operator C = C 1 = ( c i j ) i , j = 0 p , c i j = 0 1 φ i p ( x ) ( φ j p ) ( x ) d x is a dense matrix with zero row sums, and its column sums read

(18) i = 0 p 0 1 φ i p ( x ) φ j p ( x ) x d x = 0 1 φ j p ( x ) x d x = φ j p ( 1 ) φ j p ( 0 ) = 1 if  j = 0 , 1 if  j = p , 0 otherwise.

Lemma 2.

The tridiagonal matrix (originally proposed by Pazner [20], Eq. (21)] for an LGL-FCT scheme)

(19) C ̃ = ( c ̃ i , j ) i , j = 0 p , c ̃ 1,1 = c ̃ l , l 1 = 0.5 , c ̃ p , p = c ̃ l 1 , l = 0.5

for l ∈ {1, …, p} and all other entries set to zero, satisfies conditions (16).

Proof.

Condition (16c) follows from (18). The rest is obvious, see also Pazner [20]. □

Remark 2.

Applying the theory by Lohmann et al. [14], App. B] to the case of discrete gradients, we obtain alternative sparsified operators given by [14], Eq. (B.4)]. Note that this matrix satisfies all of the conditions (16) except for (16a). As we quantify in Sections 5 and 6, the use of [14], Eq. (B.4)] requires significantly smaller time steps for large p than its skew-symmetric counterpart defined in Lemma 2.

Lemma 3.

Let { ψ k } k = 0 p be the piecewise linear continuous Lagrange basis interpolating the Bernstein nodes { k / p } k = 0 p of the reference element [0, 1]. Then the following matrix coincides with (19):

(20) C ̄ = ( c ̄ i j ) i , j = 0 p , c ̄ i j = 0 1 ψ i ( x ) ψ j ( x ) x d x .

Proof.

For ij, the integral in (20) reduces to a single subcell connecting nodes x i and x j if they are closest neighbors. The subcell length is the reciprocal of the constant slope of ψ j p and so it remains to integrate ψ i p , which yields precisely c ̃ i j = ± 1 / 2 . The case i = j follows from c ̄ i i = 1 2 0 1 ψ i ( x ) 2 x d x . □

At first, it seems natural to extend the definition of subcell P 1 / Q 1 Lagrange polynomials to the multidimensional case and use an analog of (20) to define sparse discrete gradients. However, similar to the case of dense discrete gradients C , these matrices are not fully skew-symmetric because of entries corresponding to pairs of nodes that are neighbors on the same boundary segment of the element. Thus, we have to find other matrices to achieve skew symmetry, which we focus on next.

3.2.2 Box elements

Let us now discuss the case of Q p elements, i.e., quadrilaterals in 2D, hexahedra in 3D, or higher-dimensional analogues. For generality, we allow varying polynomial degrees p l in each spatial direction l ∈ {1, …, d}. The Bernstein basis functions { φ i } i = 1 N are simply tensor-products of the one-dimensional Bernstein polynomials and N = l = 1 d ( p l + 1 ) . We choose a lexicographical ordering of basis functions and corresponding node indices i = (i 1, …, i d ) within the reference element, where to obtain the element-global index i we first iterate over all basis functions with respect to the first spatial variable, i.e., going through all i 1 ∈ {0, …, p 1} before incrementing the subsequent one i 2 once and so forth. This common approach enables us to mimic the 1D case by using Kronecker products A B R ( k l ) × ( k l ) of matrices A R k × k , B R l × l . By I l R l × l , we denote the identity matrix and by 1 l R l the vector of ones.

Lemma 4.

Let C ̃ p k R ( p k + 1 ) × ( p k + 1 ) be given by (19) for p k = p. Then the matrices C ̃ = ( c ̃ i j ) i , j = 1 N ,

C ̃ = ( C ̃ 1 , , C ̃ d ) , C ̃ k = 1 l = 1 l k d ( p l + 1 ) I Π l = 1 k 1 ( p l + 1 ) C ̃ p k I Π l = k + 1 d ( p l + 1 ) R N × N ,

satisfy conditions (16) for multidimensional box elements (quadrilaterals, hexahedra, etc.).

Proof.

If c ̃ i j k 0 and i = (i 1, …, i d ) ≠ j = (j 1, …, j d ), then i l = j l for all l ∈ {1, …, d}\{k} and i k = j k ± 1. This property follows from the definition of C ̃ with the lexicographical numbering of nodes and implies sparsity (16d). Similarly, considering such a pair of nodes, skew symmetry (16a) can be shown by definition of the 1D matrices C ̃ p k . Furthermore, let A, B, C, D be matrices such that products AC and BD can be formed, then the Kronecker product satisfies the relationship (AB)(CD) = (AC) ⊗ (BD). Thus,

C ̃ k ( 1 p 1 + 1 1 p d + 1 ) = 1 l = 1 l k d ( p l + 1 ) 1 ( p 1 + 1 ) ( p k 1 + 1 ) 0 p k + 1 1 ( p k + 1 + 1 ) ( p d + 1 ) = 0 N

because C ̃ p k 1 p k + 1 = 0 p k + 1 , where 0 l R l is the zero vector. Hence, (16b) holds. Finally, to show (16c), we recall the definition c j k = K φ j x k d x = K φ j n k d s , where K R d is the reference element and n k is the kth component of the normal to ∂K. Note that for fixed j ∈ {1, …, N}, k ∈ {1, …, d}, ∂K in this integral can be replaced by a single boundary face Γ, on which φ j is equal to a Bernstein polynomial in (d − 1) variables. Since Γ is a box element in R d 1 with volume |Γ| = 1 and all Bernstein polynomials possess equal mass, that is ∫Γ φ i  ds = |Γ|/N k , where N k = l = 1 l k d ( p l + 1 ) is the number of nodes on Γ, it follows that

i = 1 N c i j k = i = 1 N K φ i φ j x k d x = c j k = 1 l = 1 l k d ( p l + 1 ) × 1 if  j k = 0 , 0 if  j k { 1 , , p 1 } , 1 if  j k = p .

This is equal to the corresponding row sum of C ̃ k because

( 1 p 1 + 1 1 p d + 1 ) C ̃ k = 1 l = 1 l k d ( p l + 1 ) 1 ( p 1 + 1 ) ( p k 1 + 1 ) 1 p k + 1 C ̃ p k 1 ( p k + 1 + 1 ) ( p d + 1 )

and 1 p k + 1 C ̃ p k = [ 1,0 , , 0,1 ] , see (19). Thus, all properties in (16) hold. □

3.2.3 Simplices

We now move on to the reference simplex K = conv { 0 d , e 1 , , e d } R d , where conv{} denotes the convex hull and e i is the ith unit vector in R d . The barycentric coordinates are Λ 0 ( x ) = 1 i = 1 d x i , Λ i ( x ) = x i for i ∈ {1, …, d}, and the Bernstein nodes shall be numbered using a multi-index notation i = (i 0, …, i d ) similar to the case of box elements. The set of multi-indices representing nodes in K is

J = J ( p , d ) = ( i 0 , , i d ) { 0 , , p } d + 1 : j = 0 d i j = N : = p + d p

and the (isotropic) pth-degree Bernstein basis functions read φ k p ( x ) = p ! l = 0 d k l ! l = 0 d Λ l ( x ) k l , k J .

3.2.3.1 The P 1 triangle

Before studying the general simplex case, let us briefly focus on the P 1 -triangle in 2D. By (16a)–(16c), all matrices satisfying (16) can be expressed as

C 1 = 1 4 1 a 1 a a 1 a 1 a 1 1 a 0 , C 2 = 1 4 1 b 1 b b 0 b b 1 b 1 , a , b R .

The magnitude of nonzero off-diagonal entries in these matrices affects the time step restriction of convex limiting schemes due to (9) and (5). Thus, all nonzero off-diagonal entries of C 1 and C 2 should generally have the same absolute value if possible (see also Section 5 for further discussions on this issue). By this argument, it makes sense to only choose a, b ∈ {0, 0.5, 1}. Figure 1 shows various choices including three (Figure 1b–d) out of the possible nine combinations of parameters a and b in addition to the usual, non-skew-symmetric matrices C (Figure 1a) along with a variant, where a, b ∉ {0, 0.5, 1} (Figure 1e).

Figure 1: 
Five different options for how to choose 





C

̃




$\tilde {\boldsymbol{C}}$



 on triangles, red represents entries in C
1, blue represents entries in C
2.
Figure 1:

Five different options for how to choose C ̃ on triangles, red represents entries in C 1, blue represents entries in C 2.

The case a = 1, b = 0 leads to dimensional decoupling and inhibits optimal convergence rates, see the numerical examples in Section 6.2. Setting a = 0, b = 1 corresponds to an unusual gradient, where coupling occurs in a direction that is opposite to the one in the previous case. This scheme exhibits optimal convergence behavior but requires significantly more time steps than the choice a = b = 0.5, which seems to be optimal in terms of CFL conditions. The gradient in Figure 1e corresponds to a discrete gradient that satisfies conditions (16) but uses off-diagonal entries of varying magnitude. As a result, the resulting CFL condition is much more restrictive, however, this method also exhibits optimal convergence rates. We have not experimented with gradients other than the ones shown in Figure 1 because the two matrices C 1, C 2 corresponding to any other choice of a, b ∈ {0, 0.5, 1} would be dimensionally inconsistent to each other.

In conclusion, uniqueness of simplicial discrete gradients satisfying conditions (16) does not hold. Note that for box elements the operators specified earlier are only unique because of their tensor-product structure (not possible here). Based on the summarized results obtained with each of the schemes in Figure 1 (detailed in Section 6.2), it makes sense to generalize the gradient in Figure 1d. In the remainder of this section, we demonstrate that this approach is feasible for general high-order simplices in arbitrary space dimensions.

3.2.3.2 General simplices

The realization that the gradient in Figure 1d appears to be optimal for P 1 -triangles implies the following structure for arbitrary p , d N : Conditions (16a)–(16c) fully determine the values of diagonal entries since

c j k = i = 1 N c ̃ i j k = i = 1 N c ̃ i j k + i = 1 N c ̃ j i k = i = 1 N c ̃ i j k + c ̃ j i k = 2 c ̃ j j k + i = 1 i j N ( c ̃ i j k c ̃ i j k ) = 2 c ̃ j j k c ̃ j j k = c j k 2 .

In any matrix row i, the offdiagonal entry in the jth column is nonzero only if the multiindices i and j correspond to nodes that are nearest neighbors. The magnitude of offdiagonal entries should be constant, and their sign should be determined by the spatial direction. If the node x j lies in positive direction w.r.t. any Cartesian coordinate, then c ̃ i j k > 0 and c ̃ i j k < 0 otherwise for all k ∈ {1, …, d}, i.e., for all matrices. For closest neighbors other than these nodes (diagonal neighbors), the sign of c ̃ i j k depends on whether the diagonal direction is positive w.r.t. the coordinate x k . If d > 2, there are diagonal neighbors with the same multiindex component for the x k direction. The corresponding indices are set to zero in C ̃ k .

To formalize these considerations mathematically, we require some notation. For consistency with previous works [25], [32], we define connectivity on simplices as follows.

Definition 1.

Let i , j J . If ∃k, l ∈ {0, …, p}, kl, such that j = i + e k − e l , where e k,l are the kth and lth unit vectors in R d + 1 , respectively, then we say j N ̃ i , i.e., j is in the local stencil N ̃ i .

In addition, let

(21) s = 1 2 ( d 1 ) ! p + d 1 p = p ! 2 ( p + d 1 ) !

denote half the mass of a pth-degree Bernstein polynomial on the (d − 1)-dimensional reference simplex. Furthermore, let c j k = K φ j n k d s be as before, and for i , j J , define

(22) c ̃ i j k : = s / d if  ( j N ̃ i ) [ ( j 0 = i 0 + 1 ) ( j k = i k 1 ) ] , s / d if  ( j N ̃ i ) [ ( j 0 = i 0 1 ) ( j k = i k + 1 ) ] , s if  ( i = j ) ( i 0 > 0 ) ( i k = 0 ) , s if  ( i = j ) ( i 0 = 0 ) ( i k > 0 ) , 0 otherwise.

Lemma 5.

The matrices C ̃ k = ( c ̃ i j k ) i , j J = ( c ̃ ( i 0 , , i d ) ( j 0 , , j d ) k ) i , j J given by (22) satisfy conditions (16).

Proof.

By Definition 1, i N ̃ i for all i J . Therefore, all cases in (22) are exclusive, and c ̃ i j k is well defined. Sparsity (16d) is built into Definition 1, and skew symmetry (16a) is proved by observing that for ij,

c ̃ j i k = s / d if  ( i N ̃ j ) [ ( i 0 = j 0 + 1 ) ( i k = j k 1 ) ] , s / d if  ( i N ̃ j ) [ ( i 0 = j 0 1 ) ( i k = j k + 1 ) ] , 0 otherwise, = s / d if  ( j N ̃ i ) [ ( j 0 = i 0 + 1 ) ( j k = i k 1 ) ] s / d if  ( j N ̃ i ) [ ( j 0 = i 0 1 ) ( j k = i k + 1 ) ] 0 otherwise = c ̃ i j k .

We show (16c) by proving that

(23) c ̃ j j k = 1 2 c j k j { 1 , , N } , k { 1 , , d } ,

which, together with (16a) and (16b) (to be proven last), implies (16c). To better illustrate how (23) can be shown, let us consider the example sketched in Figure 2. The red nodes are precisely those corresponding to the nonzero diagonal entries in (22). These are set in accordance with (23), as are the entries for black and blue nodes: Diagonal entries for the former are zero because these nodes are either within the element interior or lie on a boundary where the respective component of the normal appearing in c j k is zero. The blue nodes are degrees of freedom for which the Bernstein polynomial is nonzero on more than one boundary segment, with the corresponding normal component being nonzero on precisely two (also for d > 2) of these, which depends on k. These two integrals cancel, which is consistent with definition (22).

Let us now rigorously prove (23) for (22) by formalizing these exemplified considerations in all required cases: If j 0 j k > 0 (black nodes), then c j k = 0 and (22) is consistent with (23) because c ̃ j j k = 0 . For j 0 > 0 = j k (red nodes not opposite the origin), the integral c j k = K φ j n k d s can be reduced to a (d − 1)-dimensional reference simplex, on which φ j is a Bernstein polynomial in d − 1 variables. The component n k of the normal to this boundary is always −1, thus, c j k = 2 s by (21), and c ̃ j j k = s . If j 0 = 0 and j k > 0 (red nodes opposite the origin), the situation is exactly reversed, and by similar arguments (including transformation to the reference simplex), we obtain c j k = 2 s = 2 c ̃ j j k . In the case j 0 = j k = 0 (blue nodes), contributions to the integral c j k arise from precisely two boundaries, one of which is always opposite the origin. These two integrals are clearly of opposite sign. Invoking transformation rules, one can show that their magnitudes are equal. Hence, c j k = 0 , which is consistent with (22) because c ̃ j j k = 0 . Thus, we have shown (23).

It remains to prove (16b), where, for clarity, we also distinguish between all relevant cases based on the row multiindex i J : For i 0 = p, we have c ̃ i i k = s , precisely d positive, and no negative entries in the row. Similarly, for i k = p, c ̃ i i k = s , and the other d nonzero row entries are negative. For any other vertex, i.e., ∃l ∈ {1, …, d}\{k} with i l = p (for d = 2, these are blue nodes in Figure 2), we have c ̃ i i k = 0 . The nodes with j 0 = 1, j l = p − 1 and j k = 1, j l = p − 1 each contribute one negative and one positive row entry, of which there are no more nonzero ones. We have now dealt with all corners of the simplex. At this stage, it makes sense to restrict ourselves to the case d > 1 (in 1D, it can easily be shown that (22) coincides with (18)). We define q := |{l ∈ {1, …, d}\{k} : i l > 0}|, where |⋅| denotes the cardinality of a set. For i 0 = i k = 0, we have c ̃ i i k = 0 and there are precisely q negative and q positive entries in row i, which correspond to j 0 = i 0 + 1 and j k = i k + 1, respectively. For i 0 = 0 < i k < p, we have c ̃ i i k = s . The negative row entries correspond to either j = i + e0 − e k (one entry), j 0 = i 0 + 1 (q additional entries), or j k = i k − 1 (d − 1 additional (diagonal) neighbors). Moreover, there are precisely q positive entries in that row, which correspond to j k = i k + 1. Thus, the row sum is s s d ( 1 + q + d 1 ) + s d q = 0 . The case i k = 0 < i 0 < p is similar, and we obtain s s d q + s d ( 1 + d 1 + q ) = 0 for the row sum. Finally, for 0 < i 0,k < p, we have c ̃ i i k = 0 and the row sum is s d ( 1 + q + d 1 ) + s d ( 1 + d 1 + q ) = 0 . We have thus shown (16b), which together with (23) concludes the proof of (16c) as well as that of Lemma 5. □

Figure 2: 
Bernstein nodes of the 




P


4




${\mathbb{P}}_{4}$



 triangle and corresponding multiindices.
Figure 2:

Bernstein nodes of the P 4 triangle and corresponding multiindices.

3.2.4 Prisms

Finally, we study prismatic cells (also called wedges) in R 3 . These are tensor products of a triangular element and an interval, which is exploited here. For simplicity, we make the assumption that the spatial dimensions are numbered such that the reference element is K =△ × [0, 1], where △ = conv{(0, 0) , (1, 0) , (0, 1) }. We allow the Bernstein basis functions to be of different orders p x y , p z N on the triangle and the interval, respectively. The Bernstein basis functions are now tensor products of the triangular Bernstein polynomials and the 1D basis. We define n = p x y + 2 p x y as the number of nodes on △.

Lemma 6.

Let C ̃ p z be given by (19) for p = p z and C ̃ x , C ̃ y R n × n be given by (22) for p = p xy and let

C ̃ 1 = 1 p z + 1 I p z + 1 C ̃ x , C ̃ 2 = 1 p z + 1 I p z + 1 C ̃ y , C ̃ 3 = 1 ( p x y + 1 ) ( p x y + 2 ) C ̃ p z I n .

Then the matrices C ̃ = ( C ̃ 1 , C ̃ 2 , C ̃ 3 ) satisfy conditions (16).

Proof.

The proof (omitted for brevity) follows those for quadrilaterals and simplices, see Lemmas 4 and 5. □

3.3 Implementation aspects

The GitHub repository [48] was published together with the first DG-MCL paper [25]. It provides codes for computing the discrete gradient operators on simplices following [32]. This repository has been updated to include both the old and new sparsification approaches for all element geometries considered in this work.

Having discussed various geometry reference elements, let us briefly summarize the required modifications to obtain local discrete gradients on actual mesh elements. Let C ̂ = ( C ̂ 1 , , C ̂ d ) , C ̂ k = ( c ̂ i j ) i , j = 1 N for k ∈ {1, …, d} be the reference element matrices and let adj(J) denote the adjugate of the Jacobian matrix J to the transformation for mapping the reference element to physical cells. Then all entries of the corresponding discrete gradient operators on physical cells are obtained via R d c i j = adj ( J ) [ c ̂ i j 1 , , c ̂ i j d ] . Here we assumed linearity of the transformation to factor out adj(J). Whether the discrete gradients defined in this manner are suitable for nonlinear mappings too is yet to be determined. We refer to ref. [21], App. B] for a discussion regarding curvature and nonlinear transformations in the very similar LGL-AFC context.

4 Theoretical results for Bernstein-DG-AFC schemes

4.1 Notation

The superscript e ∈ {1, …, E} generally denotes the element index, where E is the number of mesh cells. The global solution u h is obtained from local contributions u h e via u h = e = 1 E u h e . A single subscript index refers to local nodes within the element. Two subscripts indicate interplay between distinct degrees of freedom, e.g., volumetric diffusion coefficients d i j e (here i, j are local indices of nodes within K e ). If subscripts are separated by a comma, this indicates coupling across element boundaries, e.g., interfacial diffusion coefficients d i , k e . Let F i e denote the set of (interior) boundary segments Γ k e that the node x i e belongs to. For fixed k N , there exists a unique node x i e at the same location as x i e K e . To ease readability, these quantities are denoted using a hat symbol, e.g., the degrees of freedom associated with the same location as x i e but located within the neighbor element that shares face Γ k e with K e is u ̂ i , k e (not u i e ).

The semi-discrete AFC discretization for the Bernstein-DG discretization reads

(24) m i e d u i e d t = j N i e d i j e u j e u i e ( f ( u j e ) f ( u i e ) ) c ̃ i j e + f i j e , * + k F i e d i , k e u ̂ i , k e u i e ( f ( u ̂ i , k e ) f ( u i e ) ) c i , k e + f i , k e , * ,

where

(25) m i e = K e φ i e d x , d i j e = max | c i j e | λ i j e , | c j i e | λ j i e , c i , k e = 1 2 Γ k e φ i e n k e d s , d i , k e = | c i , k e | λ i , k e ,

and n k e is the unit normal to Γ k e pointing outside of K e . The volumetric and interfacial wave speeds λ i j e and λ i , k e in the directions c ̃ i j e / | c ̃ i j e | and c i , k e / | c i , k e | depend on u i e and u j e or u i e and u ̂ i , k e , respectively. In this section, we generally only assume (16b) and (16c) for entries of the discrete gradient operator. The volumetric and interfacial limited antidiffusive fluxes f i j e , * and f i , k e , * are obtained from their unlimited counterparts f i j e and f i , k e by enforcing various constraints via MCL. These aspects [7], [25] need not be addressed here.

4.2 General results

For completeness, we reformulate an already established result [25], Lem. 1] in the context of quadrature.

Lemma 7

(recovery of the target scheme). If no limiting is performed, i.e., if f i j e , * = f i j e and f i , k e , * = f i , k e for all unlimited antidiffusive fluxes defined as in ref. [25], then (24) is equivalent to the DG target scheme

K e φ i e u h e t d x j = 1 l e ω j e f ( u h e ( q j e ) ) φ i e ( q j e ) + k F i e j = 1 l k e ω k , j e φ i e ( q i e ) f n k e ( u h e ( q i e ) , u ̂ h e ( q i e ) ) = 0

for all e ∈ {1, …, E}, i ∈ {1, …, N}, where ω j e , q j e are the l e quadrature weights and points used to approximate the nonlinear volume integral, and ω k , j e , x k , j e are the l k e quadrature weights and points used to approximate the boundary integral containing the target numerical flux f n k e u h e , u ̂ h e in direction n k e .

Proof.

The claim reformulates [25], Lem. 1], which assumes exact integration, using quadrature rules. □

As discussed in ref. [25], the similar structure of volumetric and interfacial terms in (24) allows for a combination of the two by adjusting the definition of local stencils and extending definitions of volumetric terms to include their interfacial counterparts. In this paper, we simply write sums over indices ji (with some abuse of notation) to indicate that both types of couplings are considered. Using this convention, the forward Euler time discretization reads

u i e , FE = u i e + τ m i e j i d i j e u j e u i e f j e f i e c i j e + f i j e , * = u i e + τ m i e j i 2 d i j e u ̄ i j e , * u i e ,

where u i e and u i e , FE are the old and new nodal values. Next, we adapt [24], Thm. 4.7] to our setting.

Proposition 1

(fully discrete local entropy inequalities, [24]). Consider an explicit SSP time discretization of (24). The low-order method, in which f i j e , * = f i , k e , * = 0 , satisfies the local discrete entropy inequalities

(26) U i e , F E U i e + τ m i e j i d i j e U j e U i e F j e F i e c ̃ i j e

w.r.t. all entropy pairs (U, F ) consistent with system (1) if the CFL condition (5) holds.

Proof.

Our low-order method fits in the framework of [24], Thm. 4.7] and the proof directly carries over. □

Remark 3.

Inequalities such as (26) cannot be derived for flux-limiters because (26) is based on properties that hold for u ̄ i j but not for u ̄ i j * in the MCL case nor for FCT-like alternatives as in refs. [11], [44]. Thus, results akin to Proposition 1 are of limited practical relevance except for the analysis of first-order schemes. For comments on which entropy conditions should be enforced, we refer to refs. [7], [27] and the references therein.

Proposition 2

(local conservation property). Let α i , k e [ 0,1 ] m be the vectors of effective correction factors, with which each component of the interfacial antidiffusive flux f i , k e is multiplied to obtain f i , k e , * for (24), then

d d t K e u h e d x = i = 1 N k F i e Γ k e φ i e 1 m α i , k e f n k e LLF u i e , u i , k e + α i , k e f n k e u h e , u h , k e d s ,

where f n LLF ( u , v ) = 1 2 [ ( f ( u ) + f ( v ) ) n + λ n ( u , v ) ( u v ) ] is the local Lax–Friedrichs (LLF) flux and ◦ denotes componentwise multiplication of vectors of the same size.

Proof.

Exploiting d i j e = d j i e , (16b) and (16c), skew-symmetry of antidiffusive fluxes, as well as the definitions of c i , k e and of f i , k e , see [25], Eq. (4.3)], we sum (24) over all local nodes, which yields the claim. □

The following two results were implied in ref. [25] but have not been formulated as such.

Proposition 3

(preservation of global bounds, [13], [24]). Consider an SSP time discretization of (24) satisfying the CFL condition (5). Let u i e , u ̄ i j e , * A for all i ∈ {1, …, N}, ji, and all element indices e, where A is the largest admissible set. Then the solution at the next time step is also in A .

Proof.

Again, it suffices to consider the forward Euler case, in which the updated solution u i e , FE is a convex combination of u i e and the u ̄ i j e , * under the CFL condition (5). The claim follows because A is convex. □

Proposition 3 implies that constraints such as global lower and upper bounds hold for numerical solutions to scalar conservation laws. For the Euler equations, limiting of u ̄ i j e , * w.r.t. density and pressure (see Section 2.3) can be used to guarantee nonnegativity of these two quantities (naturally only up to machine precision). For shock-capturing purposes, limiting w.r.t. local bounds may also be desirable.

Corollary 1

(preservation of local bounds, [13], [24]). Consider an SSP time discretization of (24) satisfying the CFL condition (5). Let u i e , u ̄ i j e , * A i for all ji in the DG stencil (volumetric and interfacial neighbors, the former depending on the sparsity of C ̃ e ), where A i A is convex. Then u i e , F E A i .

Proof.

The arguments used in the proof of Proposition 3 directly carry over to this setting. □

4.3 Results exploiting skew-symmetric discrete gradients

4.3.1 Two issues regarding sequential limiters

In the process of algebraic flux correction, it is not uncommon to perform more than one limiting step. For the Euler equations (and similar systems), we adopt the sequential approach [13], [21], [25], [49] of first adjusting density, followed by limiting of velocity components and specific total energy (the latter are limited independently of each other). Additionally, a pressure correction should always be performed and an entropy fix may also be desirable. These two steps each multiply all components of the antidiffusive flux by the same correction factors. However, due to the sequential approach used to limit specific unknowns rather than conserved ones [49], these synchronized steps may violate the maximum principles enforced beforehand for specific unknowns. This issue may not be of critical importance because such constraints are used for shock-capturing purposes rather than for enforcing nonnegotiable solution properties. Nevertheless, it would be desirable to show that this issue can be avoided. In ref. [28], Lem. 3.16], a sufficient condition for such compatibility of sequential limiters with subsequent synchronized limiters is given, which reads

(27) ρ ̄ i j e φ i e , min ( ρ φ ) ̄ i j e ρ ̄ i j e φ i e , max j i .

Here ρ is the main unknown (e.g., density), φ the specific unknown (e.g., velocity), and (ρφ) the conserved unknown (e.g., momentum) with corresponding bar states ρ ̄ i j e ,

(28) φ ̄ i j e = ( ρ φ ) ̄ i j e + ( ρ φ ) ̄ j i e ρ ̄ i j e + ρ ̄ j i e ,

and ( ρ φ ) ̄ i j e , respectively [13]. If the discrete gradient is skew symmetric, we have u ̄ i j e = u ̄ j i e , hence (28) simplifies to φ ̄ i j e = ( ρ φ ) ̄ i j e / ρ ̄ i j e and (27) becomes φ i e , min φ ̄ i j e φ i e , max j i (since ρ⩾0). This simplified compatibility condition is nothing else than a design criterion for the definition of local bounds φ i e , min and φ i e , max . It is automatically satisfied for the canonical choice [13], Eq. (78)] φ i min = min j φ ̄ i j , φ i max = max j φ ̄ i j , while for nonskew-symmetric discrete gradients, the validity of (27) is more difficult to guarantee.

A related issue is that if all correction factors of the sequential limiter are set to zero, the semi-discrete low-order method for product type variables (such as momentum or total energy) actually reads

(29) m i e d ( ρ φ ) i e d t = j i 2 d i j e ρ ̄ i j e φ ̄ i j e ( ρ φ ) i e

instead of

(30) m i e d ( ρ φ ) i e d t = j i 2 d i j e ( ρ φ ) ̄ i j e ( ρ φ ) i e .

If u ̄ i j e = u ̄ j i e , (29) and (30) are equivalent but the symmetry of bar states generally requires skew symmetry of discrete gradients. The low-order method (29) is less theoretically justified than (30). For instance, the fully discrete entropy inequalities derived in Proposition 1 and [24], Thm. 4.7] can only be shown for (30). Skew symmetry of volumetric discrete gradients is therefore desirable for sequential limiters.

4.3.2 Discrete entropy stability

As discussed in Remark 3, high-resolution schemes cannot be expected to satisfy fully discrete local entropy inequalities w.r.t. all admissible entropy pairs as the low-order method does (see Proposition 1). A different concept to achieve entropy stability is to make use of Tadmor’s semi-discrete theory [30], [31]. Kuzmin and Quezada de Luna [18] derived Tadmor’s condition for AFC schemes and proposed a limiter that enforces local semi-discrete entropy inequalities. Further results on these techniques can be found in refs. [7], [27], [29]. Rueda-Ramírez et al. [21] were the first to use this limiter in the AFC-DG context, but no theoretical results are derived therein. Since their LGL framework uses skew-symmetric discrete gradients, one can derive results similar to the next two statements, which apply to our Bernstein-DG methods.

Proposition 4

(local semi-discrete entropy inequalities, [18], [30]). Let ψ (u) = v(u) f (u) − F (u), where v = U′, be the entropy potential and let Tadmor’s condition [18], [30] for volumetric and interfacial fluxes:

(31a) v i e v j e 2 d i j e u j e u i e f j e + f i e c ̃ i j e + f i j e , * ψ j e ψ i e c ̃ i j e j N ̃ i e ,

(31b) v i e v ̂ i , k e 2 d i , k e u ̂ i , k e u i e f ̂ i , k e + f i e c i , k e + f i , k e , * ψ ̂ i , k e ψ i e c i , k e k F i e

hold. Then scheme (24) satisfies the inequality:

(32) m i e d U i e d t j N i \ { i } G i j e + F i e F j e c ̃ i j e + k F i e G i , k e + F i e F ̂ i , k e c i , k e ,

where

G i j e = v i e + v j e 2 d i j e u j e u i e + f i j e , * + v i e v j e 2 f i e f j e c ̃ i j e , G i , k e = v i e + v ̂ i , k e 2 d i , k e u ̂ i , k e u i e + f i , k e , * + v i e v ̂ i , k e 2 f i e f ̂ i , k e c i , k e .

Proof.

Following [7], [18], [30], [31], we multiply (24) by v i e , which yields the left-hand side of (32). On the right, v i e is split into symmetric and antisymmetric parts. After exploiting Tadmor’s condition, the rest of the proof follows from algebraic manipulations. We refer to ref. [27], Sect. 4.1] for a proof in the CG context that can directly be adapted to handle both the volumetric and interfacial terms appearing in (24). □

Corollary 2

(elementwise entropy inequalities, [18]). Let the assumptions of Proposition 4 hold for all i ∈ {1, …, N}, and, additionally, let c ̃ i j e = c ̃ j i e for all i, j ∈ {1, …, N}, ji, then (24) satisfies the inequality

(33) d d t K e U h e d x + i = 1 N k F i e R i , k e u h e , u ̂ h , k e 0 , R i , k e u h e , u ̂ h , k e : = F i e + F ̂ i , k e c i , k e G i , k e .

Proof.

Summing (32) over i ∈ {1, …, N}, exploiting (16a)–(16c) (thus G i j e = G j i e ), and (25) we obtain:

i = 1 N m i e d U i e d t i = 1 N j N i \ { i } G i j e + F i e F j e c ̃ i j e + i = 1 N k F i e G i , k e + F i e F i , k e c i , k e = j = 1 N F j e K e φ j e d x + 1 2 i = 1 N F i e k F i e Γ k e φ i e n k e d s + i = 1 N k F i e G i , k e F i , k e c i , k e = i = 1 N F i e k F i e c i , k e + i = 1 N k F i e G i , k e F i , k e c i , k e = i = 1 N k F i e R i , k e u h e , u h , k e .

Extracting the time derivative, we find that the left-hand side of this inequality is equal to d d t K e U h e d x . □

Corollary 3

(global entropy inequality, [18]). Let u ̂ h be the input for the Riemann solver at every domain boundary segment. Under the assumptions of Corollary 2, scheme (24) satisfies the global entropy inequality:

(34) d d t Ω U h d x + e = 1 E i = 1 x i e Ω N k F i e R i , k e ( u h , u ̂ h ) 0 .

Proof.

Summing (33) over all elements, we rewrite the sum of fluxes as a sum over faces, which we split into interior and boundary faces. As is common in the DG setting, all interior fluxes cancel, yielding (34). □

Tadmor’s conditions (31) is enforced as in refs. [18], [27]. In the DG context, we compute the entropy-adjusted fluxes for both, volumetric and interfacial contributions w.r.t. the same entropy pairs.

5 CFL-like time step restrictions for AFC schemes with SSP-RK

The fact that the SSP update (4) is a convex combination of admissible states is essential for many algebraic limiting techniques, e.g., [11], [13], [20], [21], [24], [25], which imposes the CFL time step restriction (5). In simple settings, the right-hand side of (5) can be evaluated, allowing for an a priori comparison of different baseline discretizations in terms of their efficiency. We consider three different DG discretizations of the 1D advection equation u t + v u x = 0 with constant velocity v R \ { 0 } on a periodic domain. Let the 1D mesh be tessellated using intervals of uniform length h > 0. For the new Bernstein sparsification (19), we have

(35) m i e = h p + 1 , d i j e = | v | max | c i j e | , | c j i e | = | v | 2 , d i , k e = | v | | c i , k e | = | v | 2 .

Due to sparsity (16d), every node has exactly two neighbors. Interior nodes within the element possess two neighbor nodes within the same element, while nodes on the element boundaries have exactly one neighbor in that element and one outside of it. Thus, j N ̃ i \ { i } d i j = | v | ( 1 / 2 + 1 / 2 ) = | v | , and (5) reduces to

(36) τ h 2 ( p + 1 ) | v | ,

which is only slightly more restrictive than the common CFL condition τ h ( 2 p + 1 ) | v | [50], Eq. (2.35)] for a (p + 1)-order DG discretization combined with a (p + 1)-order Runge–Kutta method.

For the non-skew-symmetric Bernstein sparsification [14], [25], [32], the value of

(37) j N ̃ i \ { i } d i j e = | v | max | c ̃ i , i 1 e | , | c ̃ i 1 , i e | + max | c ̃ i , i + 1 e | , | c ̃ i + 1 , i e |

varies while m i e and d i , k e remain unchanged compared to (35). Using [14], Eq. (B.4)], it is possible to evaluate (37) for each i ∈ {0, …, p}. To avoid tedious calculations, we only show that for p > 1, this scheme requires time steps τ smaller than (36). Indeed, for i = 0, we have

d i , 1 e + j N ̃ i \ { i } d i j e = | v | 2 + | v | max | c 0,1 e | , | c 1,0 e | = | v | 1 2 + 1 p + 1 max { p , | 1 | } = | v | ( p + 1 ) + 2 p 2 ( p + 1 ) ,

which is larger than |v| if p > 1. Thus, the use of (19) instead of our old approach [14], [25], [32] leads to less restrictive CFL conditions for Bernstein elements.

Finally, we study the case of LGL discretizations [20], [21], in which m i e = h ω i , | c i j e | = 1 2 j N ̃ i \ { i } , | c i k e | 1 2 , where ω i are the LGL quadrature weights. For p > 2, the LGL nodes are not uniformly distributed within the elements but are concentrated around the element boundaries. Generally, the weights of the two boundary nodes for the unit interval [0, 1] are given by ω 0 = ω p = 1 p ( p + 1 ) . Thus, the largest possible time step satisfying the CFL condition (5) is bounded by h 2 p ( p + 1 ) | v | , which is more restrictive than the optimal Bernstein-CFL (36) by a factor of p. This unfortunate drawback of LGL discretizations could potentially be cured by replacing elementwise discrete gradient operators [20], [21] (which are identical to the ones we use here for the Bernstein ansatz) with matrices that account for variations in the LGL quadrature weights.

In conclusion, for p > 1, our new sparsification can use somewhat larger time steps (depending on p) than the previous one [14], [25], [32], while improvements over time steps for LGL are significant. For p = 1, Bernstein polynomials are simply the Lagrange basis functions, which makes the scheme equivalent to the LGL space discretization. In 1D and on box elements, the old and new sparsifications are also identical.

Remark 4.

The comparisons made here can be easily adapted to discretizations of nonlinear systems in multiple space dimensions, where the same conclusions can be drawn. For instance, the multidimensional version of the optimal Bernstein-CFL (36) reads

(38) τ m i 2 j i d i j = [ h / ( p + 1 ) ] d 2 j i 1 2 | v n i j | [ h / ( p + 1 ) ] d 1 = h ( p + 1 ) j i | v n i j | ,

where the n i j R d are either the volumetric or interfacial integrals in the stencil.

Remark 5.

The CFL restrictions derived here apply to low-order and flux-limited approximations. CFL numbers w.r.t. the target scheme may be more restrictive, see e.g., [51], [52] for further analysis on this topic. Violations of CFL conditions typically lead to severe blowups, in which case the limiter would adapt the unstable target solution to satisfy all constraints that are being enforced. In this manner, one can, for instance, guarantee nonnegativity. However, it is usually preferable to limit an already stable target scheme rather than to rely on limiting for that purpose as well. The numerical example in Section 6.1 provides an illustration of how our limiters work if applied to such an unstable baseline discretization. CFL conditions of standard DG methods combined with appropriate time stepping schemes can be relaxed by modifying the numerical fluxes [53]. While this may lead to a loss of the superconvergence property, all other advantages of standard DG remain valid. In practice, we have not observed any cases, where the DG target discretization combined with SSP-RK schemes produces more restrictive CFL conditions than the limiter does.

6 Numerical experiments

We now apply the proposed AFC schemes to common test problems for the compressible Euler equations, where γ = 1.4 by default. The application to other conservation laws is straightforward. Time discretization is performed using the SSP Runge–Kutta paradigm [37], [38], [39]. The methods are implemented in the open-source C++ library mfem [54], [55] and snapshots are visualized with the accompanying GLVis toolbox [56].

To distinguish between different variants of MCL schemes, we use the following conventions: If all antidiffusive fluxes are set to zero, the low order method is employed. The sequential limiter enforcing discrete maximum principles following [25], shall be labelled ‘seq’. All MCL-type schemes enforcing global bounds are referred to by a tuple or triple of quantities that are being limited, e.g., (ρ, p) first enforces nonnegativity of density followed by the pressure fix, while (ρ, p, U) subsequently also enforces Tadmor’s entropy stability conditions (31). Often, we append a suffix indicating which numerical flux (LLF, HLL, etc.) is used. To distinguish between the nonskew-symmetric and skew-symmetric sparsifications, we simply use the terminology old versus new. Unless stated otherwise, we use adaptive SSP time stepping of the same order as the polynomial space, e.g., SSP2 for p = 1. The time step is set equal to ν ∈ (0, 1] times the right-hand side of (5), where the constant ν is chosen to be 0.5 in all tests for the Euler equations.

6.1 1D blast wave

We begin our experiments with a challenging 1D test problem studied first in ref. [57]. The spatial domain Ω = (0, 1) is equipped with reflecting wall boundaries, and the initial flow configuration is as follows:

ρ 0 1 , v 0 0 , p 0 ( x ) = 1,000 if  x < 0.1 , 0.01 if  0.1 < x < 0.9 , 100 if  0.9 < x .

The solution exhibits strong shocks due to the vastly different pressure values, which makes this benchmark an extreme test for preservation of positive internal energies. For small times, the exact solution can be obtained by solving two Riemann problems [58], Ch. 4] but at the end time t = 0.038, interactions with the domain boundaries and shock collisions have occurred. Therefore, only reference solutions are available.

Since these do not contain any highly oscillatory features, the most economical choice in terms of accuracy versus computational cost is to use second-order schemes on rather fine meshes. We use our P 1 -DG discretization to solve this problem on a grid consisting of 500 uniformly spaced elements. As our MCL approach is capable of using any numerical flux and performing limiting to guarantee various constraints, we begin by using mean value fluxes f n ( u , v ) = 1 2 ( f ( u ) + f ( v ) ) n and ensure only nonnegativity of density and pressure. Since limiting is carried out w.r.t. global bounds only, the result in the left panel of Figure 3 is extremely poor. Performing entropy limiting subsequently to the other steps significantly enhances the solution quality, as seen in the red curve in the right panel of Figure 3 but some spurious oscillations remain.

Figure 3: 
Density profiles of the 1D blast wave solved on 500 




P


1




${\mathbb{P}}_{1}$



 elements.
Figure 3:

Density profiles of the 1D blast wave solved on 500 P 1 elements.

A more reasonable approach than using mean value fluxes is to employ the local Lax–Friedrichs Riemann solver f n ( u , v ) = 1 2 [ ( f ( u ) + f ( v ) ) n + λ ( u v ) ] . We combine this scheme with (ρ, p)-limiting for the volume terms (the interfacial LLF fluxes do not require limiting to ensure any properties in the 1D case [25]). Comparing this result with the (ρ, p, U)-limited profile, we observe less smearing of the contact discontinuity at x ≈ 0.6, which occurs if entropy limiting is enabled. We investigate this issue further in the following section. At this stage, it is safe to say that entropy limiting in our context can drastically improve unstable schemes by adding sufficient amounts of dissipation. The question is, are these amounts also necessary?

6.2 2D isentropic vortex

Let us now numerically assess the convergence order of schemes by considering an example with a smooth solution [59], Sect. 5.1]. Here, the doubly-periodic domain is Ω = (−5, 5)2 and at every time t = 10k, k N , the solution coincides with the initial condition, which reads

ρ 0 ( x , y ) = θ 0 ( x , y ) 1 / ( γ 1 ) , v 0 ( x , y ) = 1 1 + ε 2 π e 0.5 ( 1 ( x 2 + y 2 ) ) y x , p 0 ( x , y ) = θ 0 ( x , y ) γ / ( γ 1 ) , θ 0 ( x , y ) = 1 ( γ 1 ) ε 2 8 γ π 2 e 1 ( x 2 + y 2 ) , ε = 5 .

We use structured, uniform meshes and perform the tests that were alluded to in Section 3. In addition, the convergence rate with entropy limiting enabled is checked for the Q 1 space. Other than entropy stability, only nonnegativity of density and pressure are enforced. Since the solution is smooth, we do not rely on limiters enforcing local bounds (these would deteriorate optimal rates). In Tables 13, we present the L1(Ω) errors in density at the final time t = 10 and the corresponding experimental orders of convergence (EOC).

Table 1:

Isentropic vortex, Q 1 -HLL on square cells.

10 h (ρ, p, U) EOC (ρ, p) EOC
32 1.76E-02 9.32E-04
64 1.23E-02 0.52 1.58E-04 2.56
128 7.56E-03 0.70 2.90E-05 2.44
256 4.29E-03 0.82 6.28E-06 2.21
Avg. EOC 0.68 2.40
Table 2:

Isentropic vortex, Q 2 -(ρ, p) on square cells.

10 h New-LLF EOC New-HLL EOC Old-HLL EOC
32 5.07E-05 2.60E-05 2.60E-05
64 7.84E-06 2.69 2.35E-06 3.47 2.35E-06 3.47
128 1.22E-06 2.68 2.70E-07 3.12 2.70E-07 3.12
256 1.79E-07 2.78 3.31E-08 3.03 3.31E-08 3.03
Avg. EOC 2.72 3.21 3.21
Table 3:

Isentropic vortex, (ρ, p)-HLL on triangles with different gradients, compare Figure 1.

200 h P 1 , Figure 1a EOC P 1 , Figure 1b EOC P 1 , Figure 1d EOC P 2 , Figure 1a EOC P 2 , Figure 1d EOC
32 1.07E-03 3.24E-03 9.78E-04 4.08E-05 4.08E-05
64 1.66E-04 2.69 1.13E-03 1.52 1.66E-04 2.56 3.60E-06 3.50 3.60E-06 3.50
128 3.08E-05 2.43 4.16E-04 1.44 3.08E-05 2.43 3.96E-07 3.18 3.96E-07 3.18
256 6.70E-06 2.20 1.88E-04 1.15 6.70E-06 2.20 4.85E-08 3.03 4.85E-08 3.03
Avg. EOC 2.44 1.37 2.40 3.24 3.24

First, we observe that the entropy fix leads to a catastrophic reduction of the convergence order. This serves as an explanation for our earlier results, in particular, it explains why the entropy fix is capable of fixing the oscillatory profile on the left of Figure 3. It is still shocking that the order is reduced by that much. Since the solution to this test is smooth, its exact entropy integrated over the domain remains constant in time. Numerical schemes may fail to mirror this behavior due to quadrature errors, numerical entropy production, or dissipation. It seems that Tadmor’s entropy fix designed to make discretizations entropy conservative (not dissipative) does not work well in the DG case even for Q 1 spaces (where Bernstein polynomials are simply the nodal Lagrange basis functions). In principle, an unnecessarily high rate of entropy dissipation can be attributed to too diffusive numerical fluxes such as LLF or HLL. However, the culprit here is clearly the volumetric entropy limiting, as we obtain second-order rates by disabling this fix. A comparison with the CG case and similar entropy-limited AFC schemes is in order to gain further understanding of this issue and how to resolve it. Second, we notice that LLF fluxes are too diffusive to exhibit third-order convergence for Q 2 discretizations [28], Sect. 6.3]. This problem can be cured using more accurate numerical fluxes such as HLL. Finally, we confirm the results regarding the different gradients. The old and new sparsification approaches are almost identical in terms of error values and convergence orders. In particular, the fact that three leading digits in the error values are identical for third-order schemes is striking given that the new sparsification requires noticeably fewer time steps, cf. Table 5. Out of the five triangular gradients in Figure 1 only (b) does not converge with optimal accuracy (rates for Figures 1c and 1e not printed for brevity) but the gradient in Figure 1d generally requires the fewest time steps, cf. Tables 4 and 5.

Table 4:

Isentropic vortex: number of time steps required by P 1 -HLL on triangles with different gradients, compare Figure 1.

200 h Figure 1a (old) Figure 1b Figure 1c Figure 1d (new) Figure 1e
32 1771 2,121 2,552 1742 4,130
64 3,557 4,254 5,080 3,453 8,272
128 7,068 8,496 10,119 6,866 16,535
256 14,126 16,966 20,193 13,688 33,062
Table 5:

Isentropic vortex: number of time steps required by P 2 -HLL on triangles with different gradients, compare Figure 1, and Q 2 -HLL on square cells with old and new sparsification approaches.

200 h P 2 , Figure 1a (old) P 2 , Figure 1b P 2 , Figure 1c P 2 , Figure 1d (new) 10 h Q 2 (old) Q 2 (new)
32 2,833 2,856 4,613 2,497 32 2,800 2,100
64 5,659 5,657 9,216 4,954 64 5,557 4,168
128 11,304 11,298 18,419 9,863 128 11,076 8,307
256 22,595 22,587 36,820 19,681 256 22,119 16,589

6.3 Modified Sod shock tube

Sod’s shock tube is a common test case that we solved with Bernstein-DG-MCL and the old sparsification in ref. [25]. A popular variant of this benchmark [58], Sect. 8.5.1] slightly changes the classical setup such that a sonic point appears in the exact solution. Many numerical schemes fail in the proximity of this location, where one of the eigenvalues of the flux changes its sign, by producing a nonphysical entropy shock (see Figure 4a). The setup of this test is as follows: the domain Ω = (0, 1) has an inlet boundary on the left and an outlet on the right. The inlet boundary profile is identical to u L for all times, where

u 0 ( x ) = u L if  x < 0.25 , u R if  x > 0.25 , u L = ( 1,0.75 , 2.78125 ) , u R = ( 0.125 , 0,0.25 ) ,

are the initial conditions. This Riemann problem admits an exact entropy solution that can be determined with arbitrary accuracy by solving a nonlinear equation for a single pressure state [58], Ch. 4]. It consists of a left rarefaction wave, a right shock wave, and contains a contact discontinuity in between.

Figure 4: 
Modified Sod shock tube: solution profiles (a)–(b) and entropy evolutions (c)–(d). DG solutions use adaptive SSP3.
Figure 4:

Modified Sod shock tube: solution profiles (a)–(b) and entropy evolutions (c)–(d). DG solutions use adaptive SSP3.

First, we solve this problem up to t = 0.2 with continuous P 1 elements and Roe [60] target fluxes (see [27] for details) employing uniform meshes of increased resolution and fixed time steps determined by the most restrictive CFL condition. The profiles in Figure 4a illustrate the appearance of the entropy shock.

Next, we use a coarser uniform mesh with only 16 uniform DG elements and polynomial degree p = 7. The result of the (ρ, p, U)-limited solution is displayed in Figure 4b. These profiles are obtained with the new sparsification approach but the old one looks quite similar. The availability of the exact solution allows us to compute the exact entropy evolution for this test. Luckily, in this example, the entropy flux F (u) at both domain boundaries is zero. Thus, entropy is only changed via dissipation at nonsmooth solution features. We see from Figure 4c and d that entropy decays linearly. Since we use entropy fixes, one would hope that this property holds true for all discrete entropy evolutions as well but we see that this is only the case for the new sparsification. Indeed, all three considered Riemann solvers produce entropy with the old sparsification at the beginning of the simulation. Since the old sparsification does not use skew-symmetric discrete gradients, this result does not contradict Corollary 3. Besides provable semi-discrete entropy stability, the new sparsification also has the advantage of needing significantly fewer time steps than the old one.

A few further remarks are in order. First, the red curve in Figure 4d is actually not monotone. This is also not in contradiction to Corollary 3 because our entropy fixes guarantee semi-discrete entropy stability rather than in the fully discrete setting. It is known that explicit time stepping schemes may result in entropy production proportional to a certain power of the time step [61]. Time step reductions can be used to make the red curve in Figure 4d monotonically decreasing. However, this approach does not remove the wiggles observed in Figure 4c, which are therefore indeed due to the spatial discretization, which fails to dissipate entropy even though an entropy limiter is used. Second, despite these small entropy productions, we do not actually observe the formation of entropy shocks. This is consistent with our previous assessment [27] that entropy limiting does not seem to be necessary for this system. Instead, preservation of nonnegotiable constraints and shock-capturing should be implemented. Finally, if we use the sequential approach in combination with either sparsification, all entropy evolutions become monotone. This observation shows that enforcing constraints such as maximum principles via AFC leads to dissipation of entropy even though entropy does not play a role in the sequential limiter. Since sufficient entropy dissipation cannot be guaranteed by this scheme, our above arguments regarding entropy evolutions remain valid and we recommend using the new sparsification rather than the old one.

6.4 High-Mach number astrophysical jet

Finally, we apply our schemes to a hypersonic test problem, in which the Mach number is more than 2,000 [62]. The spatial domain is Ω = (0, 1)2, the adiabatic constant is γ = 5/3, and the initial conditions for the primitive unknowns are ρ 0 ≡ 0.5, v 0 ≡ 0, p 0 ≡ 0.4127. A jet enters at the left boundary {0} × [0, 1], where the inflow profile is set according to

( ρ , v , p ) ( 0 , y , t ) = ( 5 , [ 800,0 ] , 0.4127 ) if  | y 0.5 | 0.05 , ( 0.5 , [ 0,0 ] , 0.4127 ) otherwise, y [ 0,1 ] .

For the end time t = 10−3, the other three boundaries can be set somewhat arbitrarily (we use free outflow).

We solve this problem numerically on a uniform quadrilateral mesh with 5122 cells and an unstructured triangular mesh with 543,744 elements. Generally, we compare results of the (ρ, p)-limiter with the sequential approach using Q p and P p spaces with p ∈ {1, 2} and contrast the old and new sparsification approaches (not for Q 1 , as old and new are identical for these). The results are shown in Figures 58; Figure 5 additionally displays the low order solution. Note that the color bar (which is the same for all plots) is logarithmic. The number of required time steps (t.s.) is shown in the figure captions for each snapshot. These were computed using HLL fluxes [23] with wave speed estimates

s = min { v L n a L , v R n a R } , s + = max { v L n + a L , v R n + a R } ,

where L and R denote the two input states of the flux, n is the normal, and a = γ p / ρ the sound speed.

Figure 5: 
Density profiles of the astrophysical jet on a uniform quadrilateral mesh, 




Q


1




${\mathbb{Q}}_{1}$



 solutions.
Figure 5:

Density profiles of the astrophysical jet on a uniform quadrilateral mesh, Q 1 solutions.

Figure 6: 
Density profiles of the astrophysical jet on a uniform quadrilateral mesh, 




Q


2




${\mathbb{Q}}_{2}$



 solutions.
Figure 6:

Density profiles of the astrophysical jet on a uniform quadrilateral mesh, Q 2 solutions.

Figure 7: 
Density profiles of the astrophysical jet on an unstructured triangular mesh, 




P


1




${\mathbb{P}}_{1}$



 solutions.
Figure 7:

Density profiles of the astrophysical jet on an unstructured triangular mesh, P 1 solutions.

Figure 8: 
Density profiles of the astrophysical jet on an unstructured triangular mesh, 




P


2




${\mathbb{P}}_{2}$



 solutions.
Figure 8:

Density profiles of the astrophysical jet on an unstructured triangular mesh, P 2 solutions.

Overall, all numerical approximations agree well with each other and no unsurprising results are observed. The two sparsification approaches produce essentially indistinguishable results but the reduced number of time steps makes us favor the new technique. As before, sequential limiting usually requires fewer time steps but in this example, there are some exceptions ( P 1 with either sparsification, P 2 with the new one). The sequential limiter introduces some additional streaks along the transition between the head of the jet and the background. It seems that these features are spurious and can be attributed to a choice of bounds for numerical admissibility that are not well-suited for this example. On the other hand, only enforcing nonnegativity of density and pressure seems to work quite well here despite the severity of this test case. Naturally, there exist examples where the oscillations arising because the numerical viscosity along shocks is too low will reverse the situation. Finding a framework that introduces precisely the right amount of diffusion in high-order methods, which also converge with optimal orders of accuracy is an open problem.

7 Conclusions

A new, skew-symmetric variant of discrete gradient operators, commonly used in finite element methods, and in particular algebraic flux correction schemes, was proposed and analyzed. For continuous and discontinuous Galerkin discretizations using Bernstein basis functions, the feasibility of the novel techniques was demonstrated in various geometries. We used the new operators in the context of our monolithic convex limiting scheme and compared it with existing techniques, based on non-skew-symmetric discrete gradients and LGL approaches. In the context of semi-discrete entropy stabilizations via MCL, the necessity of having skew-symmetric off-diagonal entries was demonstrated numerically. Moreover, additional theoretical results for the new and old versions of this approach were derived. Finally, we showed the optimality of the novel technique in terms of explicit time step restrictions. Numerical tests demonstrated that the new version of the scheme performs well in terms of shock capturing and also preserves optimal orders of accuracy.

The discrete gradients developed in this work are by no means restricted to flux-correction schemes nor monolithic limiting. In fact, they can readily be employed in the context of FCT algorithms for both continuous and discontinuous ansatz spaces using elementwise operators. Potential future studies include their use for other types of equations, such as incompressible flows [63]. Moreover, we plan to further develop our flux-correction techniques by adding additional features to the described methodology. These include smoothness indicators [47], flux limiting using general Runge–Kutta time discretizations [33], as well as the inclusion of diffusive [40] and source [12] terms. These topics shall be addressed in future publications.


Corresponding author: Hennes Hajduk, Department of Geosciences, University of Oslo, Oslo, Norway, E-mail: 

Acknowledgments

The author would also like to thank the two anonymous reviewers whose comments helped to significantly improve this manuscript.

  1. Research ethics: Not applicable.

  2. Informed consent: Not applicable.

  3. Author contributions: The author has accepted responsibility for the entire content of this manuscript and approved its submission.

  4. Use of Large Language Models, AI and Machine Learning Tools: None declared.

  5. Conflict of interest: The author states no conflict of interest.

  6. Research funding: This work was partially supported by The Rough Ocean Project, funded by the Research Council of Norway under the Klimaforsk-programme, Project 302743.

  7. Data availability: Not applicable.

References

[1] R. Abgrall and J. Trefilík, “An example of high order residual distribution scheme using non-Lagrange elements,” J. Sci. Comput., vol. 45, pp. 3–25, 2010, https://doi.org/10.1007/s10915-010-9405-y.Search in Google Scholar

[2] T. J. Barth and D. C. Jespersen, “The design and application of upwind schemes on unstructured meshes,” in 27th Aerospace Sciences Meeting, American Institute of Aeronautics and Astronautics, 1989.Search in Google Scholar

[3] E. Gaburro, P. Öffner, M. Ricchiuto, and D. Torlo, “High order entropy preserving ADER-DG schemes,” Appl. Math. Comput., vol. 440, p. 127644, 2023.Search in Google Scholar

[4] Y. Lin and J. Chan, “High order entropy stable discontinuous Galerkin spectral element methods through subcell limiting,” J. Comput. Phys., vol. 498, p. 112677, 2024, https://doi.org/10.1016/j.jcp.2023.112677.Search in Google Scholar

[5] S. Faghih-Naini and V. Aizinger, “p-adaptive discontinuous Galerkin method for the shallow water equations with a parameter-free error indicator,” Int. J. Geomath., vol. 13, p. 18, 2022, https://doi.org/10.1007/s13137-022-00208-3.Search in Google Scholar

[6] J. Vedral, “Dissipative weno stabilization of high-order discontinuous Galerkin methods for hyperbolic problems,” Preprint, arXiv: 2309.12019 [math.NA], 2023.Search in Google Scholar

[7] D. Kuzmin and H. Hajduk, Property-preserving Numerical Schemes for Conservation Laws, London, UK, World Scientific Europe, 2023.Search in Google Scholar

[8] L. Micalizzi, D. Torlo, and W. Boscheri, “Efficient iterative arbitrary high-order methods: an adaptive bridge between low and high order,” Commun. Appl. Math. Comput., vol. 7, pp. 40–77, 2023, https://doi.org/10.1007/s42967-023-00290-w.Search in Google Scholar PubMed PubMed Central

[9] D. Kuzmin, R. Löhner, and S. Turek, Flux-Corrected Transport, 2nd ed. Springer, 2012.Search in Google Scholar

[10] R. Anderson, et al.., “High-order local maximum principle preserving (MPP) discontinuous Galerkin finite element method for the transport equation,” J. Comput. Phys., vol. 334, pp. 102–124, 2017, https://doi.org/10.1016/j.jcp.2016.12.031.Search in Google Scholar

[11] J.-L. Guermond, M. Nazarov, B. Popov, and I. Tomas, “Second-order invariant domain preserving approximation of the Euler equations using convex limiting,” SIAM J. Sci. Comput., vol. 40, pp. A3211–A3239, 2018, https://doi.org/10.1137/17m1149961.Search in Google Scholar

[12] H. Hajduk and D. Kuzmin, “Bound-preserving and entropy-stable algebraic flux correction schemes for the shallow water equations with topography,” in Eleventh International Conference on Computational Fluid Dynamics, ICCFD11 Proceedings, 2022 [Online]. Available at: https://www.iccfd.org/iccfd11/assets/pdf/papers/ICCFD11_Paper-3003.pdf.Search in Google Scholar

[13] D. Kuzmin, “Monolithic convex limiting for continuous finite element discretizations of hyperbolic conservation laws,” Comput. Methods Appl. Mech. Eng., vol. 361, p. 112804, 2020, https://doi.org/10.1016/j.cma.2019.112804.Search in Google Scholar

[14] C. Lohmann, D. Kuzmin, J. N. Shadid, and S. Mabuza, “Flux-corrected transport algorithms for continuous Galerkin methods based on high order Bernstein finite elements,” J. Comput. Phys., vol. 344, pp. 151–186, 2017, https://doi.org/10.1016/j.jcp.2017.04.059.Search in Google Scholar

[15] G. R. Barrenechea, V. John, and P. Knobloch, “Finite element methods respecting the discrete maximum principle for convection–diffusion equations,” SIAM Rev., vol. 66, pp. 3–88, 2024, https://doi.org/10.1137/22m1488934.Search in Google Scholar

[16] C. Lohmann, Physics-Compatible Finite Element Methods for Scalar and Tensorial Advection Problems, Springer Spektrum, 2019.Search in Google Scholar

[17] H. Hajduk and A. Rupp, “Analysis of algebraic flux correction for semi-discrete advection problems,” BIT Numer. Math., vol. 63, p. 8, 2023, https://doi.org/10.1007/s10543-023-00957-z.Search in Google Scholar

[18] D. Kuzmin and M. Quezada de Luna, “Algebraic entropy fixes and convex limiting for continuous finite element discretizations of scalar hyperbolic conservation laws,” Comput. Methods Appl. Mech. Eng., vol. 372, p. 113370, 2020, https://doi.org/10.1016/j.cma.2020.113370.Search in Google Scholar

[19] D. Kuzmin and S. Turek, “Flux correction tools for finite elements,” J. Comput. Phys., vol. 175, no. 2, pp. 525–558, 2002. https://doi.org/10.1006/jcph.2001.6955.Search in Google Scholar

[20] W. Pazner, “Sparse invariant domain preserving discontinuous Galerkin methods with subcell convex limiting,” Comput. Methods Appl. Mech. Eng., vol. 382, p. 113876, 2021, https://doi.org/10.1016/j.cma.2021.113876.Search in Google Scholar

[21] A. M. Rueda-Ramírez, B. Bolm, D. Kuzmin, and G. J. Gassner, “Monolithic convex limiting for Legendre–Gauss–Lobatto discontinuous Galerkin spectral-element methods,” Commun. Appl. Math. Comput., vol. 6, pp. 1860–1898, 2024, https://doi.org/10.1007/s42967-023-00321-6.Search in Google Scholar

[22] D. Hoff, “A finite difference scheme for a system of two conservation laws with artificial viscosity,” Math. Comput., vol. 33, pp. 1171–1193, 1979, https://doi.org/10.2307/2006454.Search in Google Scholar

[23] A. Harten, P. D. Lax, and B. van Leer, “On upstream differencing and Godunov-type schemes for hyperbolic conservation laws,” SIAM Rev., vol. 25, no. 1, pp. 35–61, 1983. https://doi.org/10.1137/1025002.Search in Google Scholar

[24] J.-L. Guermond and B. Popov, “Invariant domains and first-order continuous finite element approximation for hyperbolic systems,” SIAM J. Numer. Anal., vol. 54, no. 4, pp. 2466–2489, 2016. https://doi.org/10.1137/16m1074291.Search in Google Scholar

[25] H. Hajduk, “Monolithic convex limiting in discontinuous Galerkin discretizations of hyperbolic conservation laws,” Comput. Math. Appl., vol. 87, pp. 120–138, 2021, https://doi.org/10.1016/j.camwa.2021.02.012.Search in Google Scholar

[26] P. Moujaes and D. Kuzmin, “Monolithic convex limiting and implicit pseudo-time stepping for calculating steady-state solutions of the Euler equations,” Preprint, arXiv: 2407.03746 [math.NA], 2024.Search in Google Scholar

[27] D. Kuzmin, H. Hajduk, and A. Rupp, “Limiter-based entropy stabilization of semi-discrete and fully discrete schemes for nonlinear hyperbolic problems,” Comput. Methods Appl. Mech. Eng., vol. 389, p. 114428, 2022, https://doi.org/10.1016/j.cma.2021.114428.Search in Google Scholar

[28] H. Hajduk, “Algebraically constrained finite element methods for hyperbolic problems with applications in geophysics and gas dynamics,” Ph.D. dissertation, TU Dortmund University, 2022.Search in Google Scholar

[29] D. Kuzmin and M. Quezada de Luna, “Entropy conservation property and entropy stabilization of high-order continuous Galerkin approximations to scalar conservation laws,” Comput. Fluids, vol. 213, p. 104742, 2020, https://doi.org/10.1016/j.compfluid.2020.104742.Search in Google Scholar

[30] E. Tadmor, “The numerical viscosity of entropy stable schemes for systems of conservation laws, I,” Math. Comput., vol. 49, pp. 91–103, 1987, https://doi.org/10.2307/2008251.Search in Google Scholar

[31] E. Tadmor, “Entropy stability theory for difference approximations of nonlinear conservation laws and related time-dependent problems,” Acta Numer., vol. 12, pp. 451–512, 2003, https://doi.org/10.1017/s0962492902000156.Search in Google Scholar

[32] D. Kuzmin and M. Quezada de Luna, “Subcell flux limiting for high-order Bernstein finite element discretizations of scalar hyperbolic conservation laws,” J. Comput. Phys., vol. 411, p. 109411, 2020, https://doi.org/10.1016/j.jcp.2020.109411.Search in Google Scholar

[33] D. Kuzmin, M. Quezada de Luna, D. I. Ketcheson, and J. Grüll, “Bound-preserving flux limiting for high-order explicit Runge–Kutta time discretizations of hyperbolic conservation laws,” J. Sci. Comput., vol. 91, 2022, Art. no. 21, https://doi.org/10.1007/s10915-022-01784-0.Search in Google Scholar

[34] T. C. Fisher and M. H. Carpenter, “High-order entropy stable finite difference schemes for nonlinear conservation laws: finite domains,” J. Comput. Phys., vol. 252, pp. 518–557, 2013, https://doi.org/10.1016/j.jcp.2013.06.014.Search in Google Scholar

[35] G. J. Gassner, “A skew-symmetric discontinuous Galerkin spectral element discretization and its relation to sbp-sat finite difference methods,” SIAM J. Sci. Comput., vol. 35, no. 3, pp. A1233–A1253, 2013. https://doi.org/10.1137/120890144.Search in Google Scholar

[36] C. M. Dafermos, Hyperbolic Conservation Laws in Continuum Physics, Springer, 2000.Search in Google Scholar

[37] C.-W. Shu and S. Osher, “Efficient implementation of essentially non-oscillatory shock-capturing schemes,” J. Comput. Phys., vol. 77, no. 2, pp. 439–471, 1988. https://doi.org/10.1016/0021-9991(88)90177-5.Search in Google Scholar

[38] S. Gottlieb, C.-W. Shu, and E. Tadmor, “Strong stability-preserving high-order time discretization methods,” SIAM Rev., vol. 43, no. 1, pp. 89–112, 2001. https://doi.org/10.1137/s003614450036757x.Search in Google Scholar

[39] S. Gottlieb, D. I. Ketcheson, and C.-W. Shu, Strong Stability Preserving Runge–Kutta and Multistep Time Discretizations, World Scientific, 2011.Search in Google Scholar

[40] M. Quezada de Luna and D. I. Ketcheson, “Maximum principle preserving space and time flux limiting for diagonally implicit Runge–Kutta discretizations of scalar convection–diffusion equations,” J. Sci. Comput., vol. 92, p. 102, 2022, https://doi.org/10.1007/s10915-022-01922-8.Search in Google Scholar

[41] A. Harten, “On the symmetric form of systems of conservation laws with entropy,” J. Comput. Phys., vol. 49, no. 1, pp. 151–164, 1983. https://doi.org/10.1016/0021-9991(83)90118-3.Search in Google Scholar

[42] X. Zhang, “On positivity-preserving high order discontinuous Galerkin schemes for compressible Navier–Stokes equations,” J. Comput. Phys., vol. 328, pp. 301–343, 2017, https://doi.org/10.1016/j.jcp.2016.10.002.Search in Google Scholar

[43] R. Abgrall, “Essentially non-oscillatory residual distribution schemes for hyperbolic problems,” J. Comput. Phys., vol. 214, no. 2, pp. 773–808, 2006. https://doi.org/10.1016/j.jcp.2005.10.034.Search in Google Scholar

[44] D. Kuzmin, M. Möller, J. N. Shadid, and M. Shashkov, “Failsafe flux limiting and constrained data projections for equations of gas dynamics,” J. Comput. Phys., vol. 229, no. 23, pp. 8766–8779, 2010.Search in Google Scholar

[45] C. A. J. Fletcher, “The group finite element formulation,” Comput. Methods Appl. Mech. Eng., vol. 37, no. 2, pp. 225–244, 1983. https://doi.org/10.1016/0045-7825(83)90122-6.Search in Google Scholar

[46] G. R. Barrenechea and P. Knobloch, “Analysis of a group finite element formulation,” Appl. Numer. Math., vol. 118, pp. 238–248, 2017, https://doi.org/10.1016/j.apnum.2017.03.008.Search in Google Scholar

[47] A. M. Rueda-Ramírez, W. Pazner, and G. J. Gassner, “Subcell limiting strategies for discontinuous Galerkin spectral element methods,” Comput. Fluids, vol. 247, p. 105627, 2022, https://doi.org/10.1016/j.compfluid.2022.105627.Search in Google Scholar

[48] H. Hajduk, “Preconditioned gradient matrix on the reference simplex,” 2020. Available at: https://github.com/HennesHajduk/PrecMatSimplex.Search in Google Scholar

[49] V. Dobrev, T. Kolev, D. Kuzmin, R. Rieben, and V. Tomov, “Sequential limiting in continuous and discontinuous Galerkin methods for the Euler equations,” J. Comput. Phys., vol. 356, pp. 372–390, 2018, https://doi.org/10.1016/j.jcp.2017.12.012.Search in Google Scholar

[50] B. Cockburn and C.-W. Shu, “TVB Runge–Kutta local projection discontinuous Galerkin finite element method for conservation laws, II. General framework,” Math. Comput., vol. 52, pp. 411–435, 1989, https://doi.org/10.1090/s0025-5718-1989-0983311-4.Search in Google Scholar

[51] N. Chalmers and L. Krivodonova, “A robust CFL condition for the discontinuous Galerkin method on triangular meshes,” J. Comput. Phys., vol. 403, p. 109095, 2020, https://doi.org/10.1016/j.jcp.2019.109095.Search in Google Scholar

[52] L. Krivodonova and R. Qin, “An analysis of the spectrum of the discontinuous Galerkin method,” Appl. Numer. Math., vol. 64, pp. 1–18, 2013, https://doi.org/10.1016/j.apnum.2012.07.008.Search in Google Scholar

[53] N. Chalmers, L. Krivodonova, and R. Qin, “Relaxing the CFL number of the discontinuous Galerkin method,” SIAM J. Sci. Comput., vol. 36, no. 4, pp. A2047–A2075, 2014. https://doi.org/10.1137/130927504.Search in Google Scholar

[54] R. Anderson, et al.., “MFEM: a modular finite element methods library,” Comput. Math. Appl., vol. 81, pp. 42–74, 2021, https://doi.org/10.1016/j.camwa.2020.06.009.Search in Google Scholar

[55] “MFEM: modular finite element methods [software],” Available at: https://mfem.org.Search in Google Scholar

[56] “GLVis: OpenGL finite element visualization tool,” Available at: https://glvis.org.Search in Google Scholar

[57] P. Woodward and P. Colella, “The numerical simulation of two-dimensional fluid flow with strong shocks,” J. Comput. Phys., vol. 54, no. 1, pp. 115–173, 1984. https://doi.org/10.1016/0021-9991(84)90142-6.Search in Google Scholar

[58] E. F. Toro, Riemann Solvers and Numerical Methods for Fluid Dynamics, 3rd ed. Springer, 2009.Search in Google Scholar

[59] C.-W. Shu, “Essentially non-oscillatory and weighted essentially non-oscillatory schemes for hyperbolic conservation laws,” in Advanced Numerical Approximation of Nonlinear Hyperbolic Equations, ser. Lecture Notes in Mathematics, Springer, 1998, pp. 325–432.Search in Google Scholar

[60] P. L. Roe, “Approximate Riemann solvers, parameter vectors, and difference schemes,” J. Comput. Phys., vol. 43, no. 2, pp. 357–372, 1981. https://doi.org/10.1016/0021-9991(81)90128-5.Search in Google Scholar

[61] C. Lozano, “Entropy production by explicit Runge–Kutta schemes,” J. Sci. Comput., vol. 76, pp. 521–564, 2018, https://doi.org/10.1007/s10915-017-0627-0.Search in Google Scholar

[62] Y. Ha, C. L. Gardner, A. Gelb, and C.-W. Shu, “Numerical simulation of high Mach number astrophysical jets with radiative cooling,” J. Sci. Comput., vol. 24, pp. 29–44, 2005, https://doi.org/10.1007/s10915-004-4786-4.Search in Google Scholar

[63] H. Hajduk, D. Kuzmin, G. Lube, and P. Öffner, “Locally energy-stable finite element schemes for incompressible flow problems: design and analysis for equal-order interpolations,” Comput. Fluids, vol. 294, pp. 106622, 2025. https://doi.org/10.1016/j.compfluid.2025.106622.Search in Google Scholar

Received: 2024-07-08
Accepted: 2024-12-09
Published Online: 2025-06-26
Published in Print: 2025-12-17

© 2025 the author(s), published by De Gruyter, Berlin/Boston

This work is licensed under the Creative Commons Attribution 4.0 International License.

Downloaded on 25.1.2026 from https://www.degruyterbrill.com/document/doi/10.1515/jnma-2024-0098/html
Scroll to top button