Home Life Sciences Modeling and analysis of ensemble average solvation energy and solute–solvent interfacial fluctuations
Article Open Access

Modeling and analysis of ensemble average solvation energy and solute–solvent interfacial fluctuations

  • Yuanzhen Shao EMAIL logo , Zhan Chen and Shan Zhao
Published/Copyright: December 31, 2024

Abstract

Variational implicit solvation models (VISMs) have gained extensive popularity in the molecular-level solvation analysis of biological systems due to their cost-effectiveness and satisfactory accuracy. Central in the construction of VISM is an interface separating the solute and the solvent. However, traditional sharp-interface VISMs fall short in adequately representing the inherent randomness of the solute–solvent interface, a consequence of thermodynamic fluctuations within the solute–solvent system. Given that experimentally observable quantities are ensemble averaged, the computation of the ensemble average solvation energy (EASE)–the averaged solvation energy across all thermodynamic microscopic states–emerges as a key metric for reflecting thermodynamic fluctuations during solvation processes. This study introduces a novel approach to calculating the EASE. We devise two diffuse-interface VISMs: one within the classic Poisson–Boltzmann (PB) framework and another within the framework of size-modified PB theory, accounting for the finite-size effects. The construction of these models relies on a new diffuse interface definition u ( x ) , which represents the probability of a point x found in the solute phase among all microstates. Drawing upon principles of statistical mechanics and geometric measure theory, we rigorously demonstrate that the proposed models effectively capture EASE during the solvation process. Moreover, preliminary analyses indicate that the size-modified EASE functional surpasses its counterpart based on the classic PB theory across various analytic aspects. Our work is the first step toward calculating EASE through the utilization of diffuse-interface VISM.

1 Introduction

In the quantitative analysis of biological processes, the complex interactions between the solute and solvent are typically described by solvation energies (or closely related quantities): the free energy of transferring the solute (e.g., biomolecules, such as proteins, DNA, RNA) from the vacuum to a solvent environment of interest (e.g., water at a certain ionic strength). The process conceptually can be decomposed into two basic processes: a “nonpolar” process of inserting the uncharged solute into solvent and a “polar” process of charging the solute in the vacuum and a solvent environment [15].

The free energy change in the “nonpolar” process is called the nonpolar solvation energy. It represents the contributions from the surface tension, mechanical work, and attractive solvent–solute dispersion interactions. There are many nonpolar solvation models available. The most commonly used one is the scaled particle theory [49,54] which includes the energy of surface tension effect and the mechanical work of immersing a particle into the solvent. The works by Gallicchio et al. [30,32], Levy et al. [42], Gallicchio and Levy [31], and Wagoner and Baker [57] have showed the importance of nonpolar solvent models with attractive solute–solvent dispersion term.

The difference of energies associated with the “polar” process is called polar solvation energy. It describes the solvent’s effect on the solute charging process by calculating a difference in charging free energies in the vacuum and a solvent environment [15]. The origin of polar solvation energy is electrostatic interactions, which are ubiquitous in nature for any system of charged or polar molecules.

There are two major approaches for solvation energy calculations: explicit solvent models and implicit solvent models [46]. Explicit models, treating both the solute and the solvent as individual molecules, is too computationally expensive for large solute–solvent systems, such as the solvation of macromolecules in ionic environments. In contrast, implicit models, by averaging the effect of solvent phase as continuum media [1,2,8,9,12,27,45], are much more efficient and thus are able to handle much larger systems [2,17,29,34,35,40,48,56].

An inevitable prerequisite for describing the solvation energy in implicit solvent models is an interface separating the discrete solute and the continuum solvent domains. All of the physical properties related to solvation processes, including biomolecular surface area, biomolecular cavitation volume, p K a value, and electrostatic free energy, are very sensitive to the interface definition [22,55,57]. There are a number of different surface definitions, which include the van der Waals surface, the solvent excluded surface, and the solvent accessible surface. These surface definitions have found many successful applications in biomolecular modeling [20,23,39,53]. However, these predetermined interfaces are ad hoc partitions and thus either nonnegligibly overestimate or underestimate the solvation free energies [57]. Moreover, none of them takes into account the minimization of interfacial free energies during the equilibrium solvation.

Variational implicit solvation models (VISMs) stand out as a successful approach to compute the disposition of an interface separating the solute and the solvent [4,13,14,18,21,61,64]. In a VISM, the desired interface profile is obtained by optimizing a solvation energy functional coupling the discrete description of the solute and the continuum description of the solvent in addition to polar and nonpolar interactions.

However, traditional sharp-interface VISMs have limitations in capturing the inherent randomness of the solute–solvent boundary. This randomness arises from factors such as atomic vibrations and thermodynamic fluctuations. In practical applications, ions, solutes, and solvent molecules are not rigid entities; they undergo small or significant conformational changes. The disposition of the solute–solvent interface is influenced by both the structure of the solute molecule and the surrounding solvent configuration. Due to thermodynamic fluctuations, when a fixed solute molecular structure being considered, solvent molecules and ions can still adopt different configurations [3], equivalent to forming different microstates in the language of statistical mechanics. This highlights that the position of the solute–solvent interface is not unique, and a single, fixed configuration does not capture the full range of possible solute–solvent interactions.

On the other hand, experimentally observable quantities are ensemble averaged. Utilizing the energy computed from a single microstate to predict the averaged energy from all microstates is inherently prone to inaccuracy. Indeed, studies show that disregarding thermodynamic fluctuations during the solvation process can cause severe errors in predictions of solvation free energies [47]. Therefore, it is of imminent practical importance to develop a solvation model capable of calculating ensemble average solvation energy (EASE) with thermodynamic fluctuations being taken into account.

According to statistical mechanics, the ensemble average of a quantity is the weighted average (by means of the Boltzmann weight) of its values among all microstates. However, in practice, it is a challenging task to directly compute the EASE of a solute–solvent system by means of this definition. If one only samples a handful of random realizations, it is likely that one can only capture an outlaw behavior rather than the meaningful average behavior of the stochastic regime. In contrast, a tremendous sampling of random realizations, although delivers a more accurate approximation of EASE, is computationally too expensive, sometimes even unaffordable.

To address this dilemma, we provide an alternative approach to the computation of EASE in Section 3. More precisely, we show that instead of computing and averaging the solvation energies among all microstates, one should characterize the “mean behavior” of the solute–solvent interfaces. This gives rise to a novel “interface” profile u : Ω [ 0 , 1 ] such that u ( x ) represents the probability of a point x Ω found in the solute phase among all microstates.

On the basis of this insight, we rigorously demonstrate that the EASE of a solute–solvent system can be effectively captured by using a VISM defined in terms of this “interface” profile u . This modeling paradigm is highly flexible, enabling us to incorporate various considerations into EASE modeling.

As an illustrative example, we develop two EASE functionals, one within the framework of classic Poisson-Boltzmann (PB) theory and the other within the size-modified PB theory framework to account for finite size effects. These proposed models have the potential to significantly expedite the computation of EASE. They achieve this by using a single diffuse-interface profile instead of numerous sharp interfaces in various microstates, making the computation more efficient and computationally accessible.

The proposed models introduced in this work fall within the category of diffuse-interface VISM [14,21,24,44,5860,62,63]. This family of models replaces the traditional sharp solute–solvent interface with a diffuse-interface profile, denoted as u : Ω R . Our work provides a novel and mathematically solid interpretation of the diffuse-interface profile. On the basis of this interpretation, for the first time in the literature, we establish the connection between EASE and the energy predicted by diffuse-interface VISMs. This clarification establishes a link between the sharp-interface and diffuse-interface VISM models, demonstrating that the diffuse-interface model indeed computes “mean” energies consistent with sharp-interface models.

Furthermore, our work provides an alternative approach to compute the EASE. Compared to the routine calculation of the EASE, which typically involves performing explicit solvent simulations, such as molecular dynamics (MD), to obtain thousands of solute–solvent states and then performing energy calculations for each state, the proposed approach has the potential to improve the efficiency of EASE calculations. By utilizing a single diffuse-interface profile instead of numerous sharp interfaces across various microstates, the proposed method offers computational efficiency and accessibility.

The rest of this article is organized as follows. Section 2 provides an introduction to one of the most widely used formulations of sharp-interface VISMs. In Section 3, we present the development of two EASE functionals within the frameworks of classic PB theory and the size-modified PB theory, taking into account finite size effects. Section 4 comprises preliminary analyses of the proposed EASE models, with a particular emphasis on the size-modified EASE functional. In Appendix A, we gather relevant information and facts about a class of strictly convex functionals related to the polar portion of EASE. To help readers gain deeper insight into the improvements of our EASE models over existing VISMs, Appendix B offers a brief comparative analysis between our newly proposed models and one of the most widely employed diffuse-interface VISMs, the geometric flow-based VISM (GFBVISM) [14,24,5860,62]. Appendix C provides an introduction to the basic concepts from geometric measure theory used in this article. Finally, in Section 5, we draw conclusions and summarize the key findings and implications of our research.

2 Ensemble-averaged solvation energy

List of notations: Given any open sets U and Ω , U Ω means that U ¯ Ω and U ¯ stands for the closure of U . We denote by N and N 1 the N -dimensional Lebesgue measure and the ( N 1 ) -dimensional Hausdorff measure, respectively. For any 1 p , p is the Hölder conjugate of p . The phrase l.s.c is the abbreviation of lower semi-continuous. χ E always denotes the characteristic function of a set E .

For any two Banach spaces X and Y , the notation ( X , Y ) stands for the set of all bounded linear operators from X to Y and ( X ) ( X , X ) .

2.1 Background

We consider a solute–solvent system with a fixed biomolecular structure contained in a bounded Lipschitz domain Ω R 3 by using a (statistical) grand canonical ensemble. The temperature, chemical potentials, and volume of the system are kept constant. Suppose that there are N c ion species in Ω and the system contains N c + 2 types of particles, i.e., the solute and solvent molecules and N c ion species. We assume that the solute atoms are centered at x 1 , , x N m Ω .

Define a probability space ( S , , P ) , where the sample space S = { S α : α A } denotes the set of all possible states of the system with A being the index set of the states. is a σ -algebra of S and P is the probability measure. Because a biomolecular structure is fixed, each state S α is uniquely determined by a function:

C α L 1 ( Ω ; R N c + 1 ) : C α ( x ) = ( C α , 0 ( x ) , C α , 1 ( x ) , , C α , N c ( x ) ) ,

where C α , j ( x ) is the (number) concentration of the jth ion species at x for j = 1 , , N c , or the (number) concentration of solvent molecules for j = 0 . In state S α , a point x Ω is in the solute phase if C α ( x ) = ( 0 , , 0 ) , otherwise in the solvent phase. A point that is not occupied by any particle is typically considered to be in the vacuum. However, due to the similar dielectric constants of the solute and the vacuum, it is reasonable to assume that such points can be treated as being in the solute phase.

It is important to note that different states can result in the same solute and solvent phases. For instance, consider the scenario where a solvent molecule and an ion are interchanged, which are originally located at different positions, within a given state. Allowing for a slight abuse of notation, we refer to the subset of states within S that share the same solute and solvent phases as a microstate of the solute–solvent system. Assume that the system undergoes K microstates: 1 , , K ; and each k occurs with a probability p k = P ( k ) . For notational brevity, we put K = { 1 , 2 , , K } . This leads to a decomposition S = k K k ; and thus k K p k = 1 .

We define several random variables that will be extensively used in this article.

  • X : S K , where X = k K k χ k . Thus, X ( S α ) indicates the microstate that S α belongs to.

  • C j : S L 1 ( Ω ) , where C j ( S α ) = C α , j with j { 0 , 1 , , N c } . Thus, when j { 1 , , N c } , C j ( S α ) ( x ) is the (number) concentration of the jth ion species at x Ω in S α ; C 0 ( S α ) ( x ) is the (number) concentration of the solvent molecule at x Ω in S α .

Let Z be a Banach space. The ensemble average of a random variable Y : S Z (in S ) is defined as follows:

(2.1) Y E ( Y ) = E [ E ( Y X = k ) ] = k K p k E ( Y X = k ) .

Similarly, if one considers k as a (statistic) ensemble, the ensemble average of Y in k is defined by

Y k E ( Y X = k ) .

Throughout the remainder of this article, unless explicitly stated otherwise, the ensemble average of a quantity Y always refers to the object defined by (2.1).

2.2 A sharp interface VISM

In this section, we will introduce the solvation energy defined in a microstate k with a fixed sharp solute–solvent interface. Assume that in microstate k , the solute phase is represented by a Caccioppoli subset D k Ω , or equivalently, U k = Ω \ D ¯ k is the solvent phase. Then the solute–solvent interface is given by D k . See Figure 1(a) for an illustration.

Figure 1 
                  (a) Illustration of a microstate 
                        
                           
                           
                              
                                 
                                    ℳ
                                 
                                 
                                    k
                                 
                              
                           
                           {{\mathcal{ {\mathcal M} }}}_{k}
                        
                     : 
                        
                           
                           
                              
                                 
                                    D
                                 
                                 
                                    k
                                 
                              
                           
                           {D}_{k}
                        
                      is the solute region; 
                        
                           
                           
                              
                                 
                                    U
                                 
                                 
                                    k
                                 
                              
                           
                           {U}_{k}
                        
                      is the solvent region; and 
                        
                           
                           
                              ∂
                              
                                 
                                    D
                                 
                                 
                                    k
                                 
                              
                           
                           \partial {D}_{k}
                        
                      is the solute–solvent interface. (b) Domain decomposition for a grand canonical ensemble: 
                        
                           
                           
                              
                                 
                                    Ω
                                 
                                 
                                    i
                                 
                              
                           
                           {\Omega }_{{\rm{i}}}
                        
                      is the region occupied by the solute in all microstates; 
                        
                           
                           
                              
                                 
                                    Ω
                                 
                                 
                                    e
                                 
                              
                           
                           {\Omega }_{{\rm{e}}}
                        
                      denotes the region occupied by the solvent in all microstates; and 
                        
                           
                           
                              
                                 
                                    Ω
                                 
                                 
                                    t
                                 
                              
                           
                           {\Omega }_{{\rm{t}}}
                        
                      denotes the transition region where 
                        
                           
                           
                              0
                              ≤
                              u
                              ≤
                              1
                           
                           0\le u\le 1
                        
                     .
Figure 1

(a) Illustration of a microstate k : D k is the solute region; U k is the solvent region; and D k is the solute–solvent interface. (b) Domain decomposition for a grand canonical ensemble: Ω i is the region occupied by the solute in all microstates; Ω e denotes the region occupied by the solvent in all microstates; and Ω t denotes the transition region where 0 u 1 .

In this article, solute atoms are treated as hard spheres. More precisely, we consider the atom centered at x i , i = 1 , , N m , as a sphere of radius σ i > 0 , i.e., B ( x i , σ i ) , for which solvent molecules and ions cannot enter. On the other hand, the solvent and ion concentrations are the same as their bulk concentrations in regions sufficiently far away from the solute. As a consequence, such regions cannot be contained in the solute phase in any microstate. On the basis of these observations, we may assume that there are two open subsets, Ω i and Ω e , of Ω with i = 1 N m B ¯ ( x i , σ i ) Ω i Ω \ Ω ¯ e and Ω Ω e such that

(2.2) Ω i D k Ω \ Ω ¯ e for all k K .

For instance, one can take Ω i to be the region enclosed by a smoothed solvent excluded surface and Ω e to be region outside a smoothed solvent accessible surface. Let Ω t Ω \ ( Ω ¯ i Ω ¯ e ) be the transition region. By Assumption (2.3), in all microstates,

  • ions are located outside Ω i , and

  • the solute–solvent interfaces are located inside Ω ¯ t .

In addition, we define Σ 1 = Ω i and Σ 0 = Ω e \ Ω . We assume that Σ 0 , Σ 1 are C 2 and Ω t is connected so that Σ 0 Σ 1 = . See Figure 1(b) for an illustration.

The hydrophobicity of the amino acids in the biomolecule varies from position to position. To account for this phenomenon, we introduce a positive variable surface tension function θ C 1 ( Ω ¯ t ) . Without loss of generality, we may extend θ to be a function in C 1 ( Ω ¯ ) , still denoted by θ , such that

θ 0 θ ( x ) θ 1 , x Ω

for some constants 0 < θ 0 < θ 1 . By [10, Lemma 1],

(2.3) Ω θ ( x ) d D f ( x ) = sup Ω f ( x ) ( θ ( x ) ϕ ( x ) ) d x : ϕ C c 1 ( Ω ; R 3 ) , ϕ 1 , f B V ( Ω ) .

See Definition C.1 for the definition of B V -functions. It is clear that, for any f B V ( Ω ) with f 0 in Ω e and f 1 in Ω i , the value of Ω θ ( x ) d D f ( x ) is independent of how θ is extended outside Ω ¯ t .

As proposed in [24,25], the solvation free energy in microstate k predicted by a sharp-interface VISM is the value of the following energy functional evaluated at χ D k :

( χ D k ) = I np ( χ D k ) + I p ( χ D k , ψ ) ,

where ψ : Ω R is the electrostatic potential. The two components I np and I p are termed the nonpolar and polar portion of the solvation energy, respectively. For every fixed D k , ψ = ψ χ D k is chosen to maximize I p ( χ D k , ψ ) among all ψ taking a predetermined Dirichlet boundary value ψ D on Ω .

The nonpolar solvation energy consists of three parts:

I np ( χ D k ) = Ω θ d D χ D k + P h Vol ( D k ) + U k ρ s U vdW ( x ) d x .

When θ γ for some constant γ > 0 , the first term reduces to γ Per ( D k ; Ω ) with Per ( D k ; Ω ) being the perimeter (in Ω ) of the biomolecule region D k . This term measures the disruption of intermolecular and/or intramolecular bonds that occurs when a surface is created. In addition, Vol ( D k ) represents the volume of D k ; P h is the (constant) hydrodynamic pressure. Therefore, P h Vol ( D k ) is the mechanical work of creating the biomolecular size vacuum in the solvent. In the last integral, ρ s is the (constant) solvent bulk density; and U vdW represents the Lennard-Jones potential [57]; as such U vdW C ( Ω ¯ \ { x 1 , , x N m } ) .

Currently, one of the most widely used polar solvation models is based on the PB theory [1,28,29,35,40]. In the framework of the classical PB theory, the polar energy is expressed as follows:

(2.4) I p ( χ D k , ψ ) = D k ρ ( x ) ψ ( x ) ε p 8 π ψ ( x ) 2 d x U k ε s 8 π ψ ( x ) 2 + β 1 j = 1 N c c j ( e β q j ψ ( x ) 1 ) d x ,

where ρ is an L -approximation of the solute partial charges supported in i = 1 N m B ( x i , σ i ) . ε p and ε s are the dielectric constants of the solute and the solvent, respectively. Usually, ε p 1 for the protein and ε s 80 for the water. q j is the charge of ion species j , j = 1 , 2 , , N c , and β = 1 k B T , where k B is the Boltzmann constant, T is the (constant) absolute temperature. c j is the (constant) bulk concentration of the jth ionic species. For notational brevity, we define

(2.5) B ( s ) = β 1 j = 1 N c c j ( e β s q j 1 ) , s R .

In addition, we assume the charge neutrality condition

(2.6) j = 1 N c c j q j = 0 .

It is important to observe that B ( 0 ) = 0 and, by (2.7), B ( 0 ) = 0 and B ( ± ) = ± . Further, B ( s ) > 0 . We thus conclude that B ( 0 ) = min s R B ( s ) and B is strictly convex.

To sum up, in microstate k , the solvation energy is given by

(2.7) ( χ D K ) = Ω θ d D χ D k + P h Vol ( D k ) + U k ρ s U vdW ( x ) d x + D k ρ ( x ) ψ ( x ) ε p 8 π ψ ( x ) 2 d x U k ε s 8 π ψ ( x ) 2 + B ( ψ ( x ) ) d x ,

where ψ = ψ χ D k solves the PB equation:

(2.8) ( ( ε p χ D k + ε s χ U k ) ψ ) 4 π χ U k B ( ψ ) + 4 π ρ = 0 in Ω

in the admissible set

(2.9) A = { ψ H 1 ( Ω ) : ψ Ω = ψ D } for some ψ D W 1 , ( Ω ) .

The boundary data ψ D is fixed throughout this article.

Remark 2.1

Now, we will present a concise discussion regarding the expression for polar energy expression (2.5). This discussion will serve as a crucial foundation for the derivation of EASE. To begin, let us review the definitions of the random variables X and C j defined in Section 2.1.

A pivotal element of the PB theory is an electrostatic potential ψ , which is identical across all states within a fixed (statistical) ensemble under consideration. Consider a fixed microstate k as a (statistical) ensemble. In k , the electrostatic potential ψ satisfies the fundamental equation of electrostatics, the Poisson equation:

(2.10) [ ( ε χ D k + ε s χ U k ) ψ ] = 4 π ρ + j = 1 N c q j c j k ,

where c j k is the (number) concentration of the jth ion speices in k , i.e.,

(2.11) c j k = C j k = E ( C j X = k ) ;

and thus, c j k 0 in D k . However, since the probability distribution of C j is unknown, it is impossible to directly compute c j k by means of (2.12). Indeed, the expression of c j k should be derived from the Helmholtz free energy, cf. [28,36,43].

To find the Helmholtz free energy H k in k , we consider the random variable E ψ : S R , where

(2.12) E ψ ( S α ) = D k ρ ψ ε p 8 π ψ 2 d x + U k j = 1 N c ( q j C α , j ψ C α , j μ j ) ε s 8 π ψ 2 d x

is the internal energy in state S α with μ j being the chemical potential of the jth ion species, which is uniform everywhere in a grand canonical ensemble. Then the ensemble average of E ψ in k is given by

(2.13) E ψ k = E ( E ψ X = k ) = D k ρ ψ ε p 8 π ψ 2 d x + U k j = 1 N c ( q j c j k ψ c j k μ j ) ε s 8 π ψ 2 d x .

Meanwhile, within the framework of classic PB theory, the entropy S k in k is given by

(2.14) T S k = β 1 U k j = 1 N c c j k [ ln ( v c j k ) 1 ] d x ,

where 1 v is a reference concentration. It is crucial to highlight that, in statistic mechanics, entropy is not a random variable of ( S , , P ) and should not be obtained by ensemble averaging its counterparts in all states.

The Helmholtz free energy H k is given by H k = E ψ k T S k . Equating the first variation of H k with respect to c j k to zero gives

μ j = β 1 ln ( v c j k ( x ) ) + q j ψ ( x ) in U k .

This leads to the relation c j k ( x ) = χ U k ( x ) v 1 e β μ j e β q j ψ ( x ) . When x is sufficient far away from the solute, we have ψ ( x ) = 0 and c j k ( x ) = c j . This implies that c j = v 1 e β μ j . Hence, we obtain the relation

(2.15) c j k ( x ) = χ U k ( x ) c j e β q j ψ ( x ) .

This is exactly the Boltzmann distribution. It is important to emphasize that the term χ U k in Equation (2.16) arises from the fact that, in H k , the domain of integration for any integral involving c j k is restricted to U k . This restriction is necessary because c j k 0 within the domain D k . By using the characteristic function χ U k , we ensure that the integration is performed only over the relevant region U k where c j k is nonzero. Plugging (2.16) into the expression of H k yields

(2.16) H k = D k ρ ( x ) ψ ( x ) ε p 8 π ψ ( x ) 2 d x U k ε s 8 π ψ ( x ) 2 + β 1 j = 1 N c c j e β q j ψ ( x ) d x .

Since the polar energy equals zero when ψ vanishes everywhere, following [52], a constant term

(2.17) β 1 U k j = 1 N c c j d x

should be added to (2.17), which yields the polar energy expression (2.5) in k . Note that j = 1 N c q j c j k = χ U k B ( ψ ) , see (2.6). Plugging this expression into (2.11) shows that ψ solves (2.9). By Lemma A.1, ψ maximizes I p ( χ D k , ) in A .

To summarize, within the framework of the classic PB theory, the following points are crucial in the derivation of EASE in a fixed ensemble:

  • Universal electrostatic potential: It is necessary to use a universal electrostatic potential, denoted as ψ , which applies across all microstates, when deriving the Boltzmann distribution. This potential is independent of specific microstates and is used consistently in the calculations.

  • Entropy is not a random variable: In the context of EASE, it is important to note that one component of the solvation energy, namely, entropy, is not considered a random variable of the system ( S , , P ) . Therefore, when calculating the entropy within EASE, it should not be obtained by averaging its counterpart in the individual microstates.

By considering these factors, EASE can be derived within a fixed ensemble, incorporating an appropriate electrostatic potential and accounting for the nonrandom nature of entropy.

3 Modeling of EASE

If one tries to directly use (2.1) and compute EASE, an obvious barrier is how to calculate the probability p k . Although, according to statistical mechanics, p k can be obtained by means of the Boltzmann weight, an accurate approximation of the Boltzmann weight requires a sufficiently large sampling of microstates, which creates a significant computational burden. Indeed, in the routine calculation of the EASE, the computation of p k may involve energy calculations for thousands of solute–solvent states.

The core of our modeling paradigm is, instead of ensemble averaging the outputs of the solvation energy functional (2.8), one should ensemble average the inputs, i.e., χ D k , to obtain a diffuse-interface profile. Then the EASE can be computed by using this one profile instead of thousands of solute–solvent states. This leads to the following definition:

u = k K p k χ D k .

As such, u ( x ) represents the probability of x Ω found in the solute phase among all microstates. Figure 1(b) shows the range of u in different regions. The definition of u enforces the following physical constraints:

(3.1) u ( x ) [ 0 , 1 ] for a.a. x Ω

and

(3.2) u = 1 a.e. in Ω i and u = 0 a.e. in Ω e .

Then the admissible set for u is defined as follows:

X = { u B V ( Ω ) : u satisfies Constraints (3.1) and (3.2) } .

To construct the EASE functional, we introduce the following functions defined on K :

f D : K B V ( Ω ) , f D ( k ) = χ D k f U : K B V ( Ω ) , f U ( k ) = χ U k f s : K R , f s ( k ) = Ω θ d D χ D k f i : K ( L 1 ( Ω ) , R ) = L ( Ω ) , f i ( k ) v = Ω χ D k v d x = D k v d x , v L 1 ( Ω ) f e : K ( L 1 ( Ω ) , R ) = L ( Ω ) , f e ( k ) v = Ω χ U k v d x = U k v d x , v L 1 ( Ω ) .

The ensemble averages of all bulk and interfacial energy components can be derived from the above functions. For example,

  • the ensemble average of interfacial energy, i.e., Ω θ d D χ D k , is given by f s ( X ) ;

  • the ensemble average of the term P h Vol ( D k ) is given by f i ( X ) P h ;

  • the ensemble average of the term U k ρ s U vdW d x , is given by f e ( X ) ( ρ s U vdW ) .

Proposition 3.1

Assume that v L 1 ( Ω ) is an energy density function. Then the ensemble average of the energy stored in the solute and solvent phases can be computed as follows:

f i ( X ) v = Ω u v d x and f e ( X ) v = Ω ( 1 u ) v d x .

Proof

The proof is straightforward. Indeed, we will only check the equality for f i ( X ) v . Consider the random variable f i ( X ) v : S R . Then

f i ( X ) v = E [ E ( f i ( X ) v X = k ) ] = k K p k D k v d x = Ω k K p k χ D k v d x = Ω u v d x .

Following the discussion in Remark 2.1, let ψ : Ω R be the universal electrostatic potential ψ , which is identical among all S α . To formulate the Poisson equation, we notice that the dielectric function ε p f D ( k ) + ε s f U ( k ) = ε p χ D k + ε s χ U k in (2.11) should be replaced by the ensemble averaged one:

ε ( u ) E ( ε p f D ( X ) + ε s f U ( X ) ) = u ε p + ( 1 u ) ε s .

Hence, ψ A solves

(3.3) [ ε ( u ) ψ ] = 4 π ρ + j = 1 N c q j c j in Ω ,

where c j is the ion concentration of the jth ion species in the ensemble under consideration, i.e.,

(3.4) c j = E ( C j ) = E [ E ( C j X = k ) ] = k K p k c j k = k K p k c j k χ U k .

Note that the function c j 0 in the set { u = 1 } { x Ω : u ( x ) = 1 } . By a similar argument to Remark 2.1 and Proposition 3.1, the ensemble average of E ψ , cf. (2.13) and (2.14), equals

(3.5) E ψ = k K p k D k ρ ψ ε p 8 π ψ 2 d x + U k j = 1 N c ( q j c j k ψ c j k μ j ) ε s 8 π ψ 2 d x = Ω ρ ψ + j = 1 N c ( q j c j ψ c j μ j ) ε ( u ) 8 π ψ 2 d x .

In the following two subsections, we will derive two functionals for the EASE within the frameworks of the classic PB theory and the size-modified PB theory.

3.1 Classic PB theory

In this subsection, we will first consider the classic PB theory, in which ions and solvent molecules are assumed to be point-like and have no correlation between their concentrations.

3.1.1 Ensemble average polar energy

In the classic PB theory framework, the entropy can be formulated as follows:

T S = β 1 { u < 1 } j = 1 N c c j [ ln ( v c j ) 1 ] d x .

The domain of integration { u < 1 } = { x Ω : u ( x ) < 1 } is due to the fact that c j vanishes identically in { u = 1 } . By applying a similar argument leading to (2.16) to the Helmholtz energy H = E ψ S T , one can obtain the Boltzmann distribution:

(3.6) c j = χ { u < 1 } c j e β q j ψ .

After plugging (3.6) into H , as in (2.18), the constant term

(3.7) β 1 Ω χ { u < 1 } j = 1 N c c j d x

needs to be added to H to adjust the reference state of the zero energy in the ensemble under consideration. Finally, we arrive at the ensemble average polar energy

I p ( u , ψ ) = H + β 1 Ω χ { u < 1 } j = 1 N c c j d x = Ω ρ ψ ε ( u ) 8 π ψ 2 χ { u < 1 } B ( ψ ) d x .

On the other hand, replacing c j in (3.3) by (3.6) shows that ψ A solves the generalized PB equation:

(3.8) [ ε ( u ) ψ ] 4 π χ { u < 1 } B ( ψ ) + 4 π ρ = 0 in Ω .

Lemma A.1 shows that ψ maximizes I p ( u , ) in A for any fixed u X .

3.1.2 Ensemble average interfacial energies

Following the discussions in Section 3.1.1, the EASE in the framework of the classic PB theory can be represented by

f s ( X ) + L ( u ) , where L ( u ) = Ω [ P h u + ρ s ( 1 u ) U vdW ] d x + I p ( u , ψ u )

with ψ u maximizing I p ( u , ) in A . It only remains to compute the ensemble averaged interfacial energy

f s ( X ) = E [ E ( f s ( X ) X = k ) ] = k K p k Ω θ d D χ D k .

Proposition 3.3 below shows that the integral Ω θ d D u can be used to approximate the ensemble averaged interfacial energy. Before proving Proposition 3.3, we will need the following lemma.

Lemma 3.2

Let { D k } k K be a family of Caccioppoli sets satisfying Ω i D k Ω \ Ω ¯ e and p k [ 0 , 1 ] with k K p k = 1 . Then for each ε > 0 , there exists another family { D ˜ k } k K of Caccioppoli sets satisfying Ω i D ˜ k Ω \ Ω ¯ e and

(3.9) 2 ( * D ˜ k * D ˜ j ) = 0 , k , j K , k j ,

with * D being the reduced boundary of a Caccioppoli set D, cf. Definition C.3. Moreover,

k K p k Ω θ d D χ D k + L ( u ) k K p k Ω θ d D χ D ˜ k L ( u ˜ ) < ε ,

where u = k K p k χ D k and u ˜ = k K p k χ D ˜ k .

Proof

In view of Lemma A.2, it suffices to construct a family of Caccioppoli sets { D k , n } n = 1 satisfying Assumption (3.9) and Ω i D k , n Ω \ Ω ¯ e , such that

(3.10) lim n χ D k χ D k , n L 1 = 0 , lim n Ω θ d D χ D k , n = Ω θ d D χ D k .

Indeed, letting u n = k K p k χ D k , n , (3.10) implies that u n u and χ { u n < 1 } χ { u < 1 } in L 1 ( Ω ) . Then it follows from Lemma A.2 and the dominated convergence theorem that L ( u n , ψ u n ) L ( u , ψ u ) as n .

Because Σ i , i = 0 , 1 , are C 2 , there exists some a > 0 such that Σ i has a tubular neighborhood B a ( Σ i ) of width a > 0 , cf. [33, Exercise 2.11] and [41, Remark 3.1]. For sufficiently small r ( 0 , a ) , let

D k r = ( D k B r ( Σ 1 ) ) \ B r ( Σ 0 ) .

In view of the relation Per ( S T ; Ω ) + Per ( S T ; Ω ) Per ( S ; Ω ) + Per ( T ; Ω ) for any Caccioppoli sets S , T Ω , D k r are again Caccioppoli sets. Recall that Per ( S ; Ω ) is the perimeter of S in Ω . According to Lemma 4.6, we have

lim r 0 + χ D k χ D k r L 1 = 0 , lim r 0 + Ω θ d χ D k r = Ω θ d χ D k .

Note that the proof of Lemma 4.6 is independent of other results in this article. Therefore, without loss of generality, we may assume that for some sufficiently small r > 0

Ω i B ( Σ 1 , r ) D k Ω \ ( Ω ¯ e B r ( Σ 0 ) ) .

Claim 1: For every k , there exists a sequence { f k , j } j = 1 C ( Ω ¯ ) such that 0 f k , j 1 a.e. in Ω and

lim j χ D k f k , j L 1 = 0 , lim j Ω θ d D f k , j = Ω θ d D χ D k .

Proof of Claim 1

We consider χ D k as an element in B V ( R 3 ) . Choose a sequence ε j 0 + and define f k , j η ε j χ D k , where η ε j is the standard Friedrichs mollifying kernel. Then lim j χ D k f k , j L 1 = 0 .

[10, Corollary 1] implies that Ω θ d D χ D k ( x ) liminf j Ω θ d D f k , j ( x ) . Note that f k , j X C ( Ω ¯ ) for sufficiently large j . For any ϕ C 0 1 ( Ω ; R 3 ) with ϕ 1 , it follows from [10, Lemma 1] that

Ω f k , j ( θ ϕ ) d x = Ω ( η ε j χ D k ) ( θ ϕ ) d x = Ω χ D k [ η ε j ( θ ϕ ) ] d x = Ω χ D k θ η ε j ( θ ϕ ) θ d x ( η ε j θ ) θ L Ω θ d D χ D k ( x ) .

Here, ( η ε j θ ) ( x ) = Ω η ε j ( x y ) θ ( y ) d y for x Ω . Taking supremum over all such ϕ , we derive that

Ω θ d D f k , j ( x ) ( η ε j θ ) θ L Ω θ d D χ D k ( x ) .

By the uniform continuity of θ , it is not a hard task to verify that lim j ( η ε j θ ) θ L ( Ω ) = 1 . Therefore, the above inequality implies that limsup j Ω θ d D f k , j ( x ) Ω θ d D χ D k ( x ) .

By the Sard’s theorem, there exists some S ( 0 , 1 ) with 1 ( ( 0 , 1 ) \ S ) = 0 such that, for all t S , the super-level set E k , j t = { f k , j > t } has a smooth boundary. The coarea formula implies that

Ω θ d D χ D k = lim j Ω θ d D f k , j = lim j 0 1 Ω θ d D χ E k , j t d t 0 1 liminf j Ω θ d D χ E k , j t d t .

Therefore, for some t S ,

liminf j Ω θ d D χ E k , j t Ω θ d D χ D k .

Pick the subsequence { j n } n = 1 such that

liminf j Ω θ d D χ E k , j t = lim n Ω θ d D χ E k , j n t .

On the other hand, we can infer from the Chebyshev’s theorem that

3 ( E k , j n t \ D k ) 1 t f k , j n χ D k L 1 , 3 ( D k \ E k , j n t ) 1 1 t f k , j n χ D k L 1 .

This implies that

lim n χ E k , j n t χ D k L 1 = 0 .

Therefore, it follows from [10, Corollary 1] that D ˜ k , n = E k , j n t satisfies (3.10) with D k , n replaced by D ˜ k , n .

Assume that 2 ( χ D ˜ k , n χ D ˜ j , n ) > 0 for some k , j K and k j . Since D ˜ k , n is C 2 , there exists some a > 0 such that D ˜ k , n has a tubular neighborhood B a ( D ˜ k , n ) of width a > 0 . Denote by ν the outward unit normal of D ˜ k , n pointing into Ω \ D ˜ k , n . Then the map defined by

Λ : D ˜ k , n × ( a , a ) B a ( D ˜ k , n ) : ( x , r ) x + r ν ( x ) ,

is a C 1 -diffeomorphism. Let Γ r Λ ( D ˜ k , n , r ) . Then, for every i K , there are at most countably many r ( a , a ) such that 2 ( Γ r χ D ˜ i , n ) > 0 . Hence, we can find r ( a , a ) sufficiently close to 0 such that

2 ( Γ r χ D ˜ i , n ) = 0 , i K

and Γ r Ω t and

χ D k , n χ D ˜ k , n L 1 + Ω θ d D χ D k , n Ω θ d D χ D ˜ k , n < 1 n ,

where D k , n is the region enclosed by Γ r . Modifying all D ˜ k , n , k K , in such a way yields a family of smooth sets { D k , n } k K satisfying Ω i D k , n Ω \ Ω ¯ e and Assumptions (3.9) and (3.10).□

The aforementioned lemma elucidates that within a given ensemble, slight adjustments can be made to the solute regions { D k } k K in order to attain (3.9), while ensuring that the alterations in the associated solvation energy remain “infinitesimally” negligible. Consequently, it is permissible to consistently presume that { D k } k K adheres to (3.9).

Proposition 3.3

Assume that { D k } k K satisfies (3.9). Then

Ω θ d D u = k K p k Ω θ d D χ D k .

Proof

By the De Giorgi’s structure theorem, cf. [26, Theorem 5.7.2], χ D k are 2-rectifiable, i.e., there exist Borel sets F k and C 1 -functions g k , n : U k , n R 3 , U k , n R 2 compact, such that D k ( F k ) = 0 and

* D k = F k n = 1 g k , n ( U k , n ) .

Pick arbitrary ε > 0 . For every k K , we can find N k = N k ( ε ) such that

D k ( Ω n = N k + 1 g k , n ( U k , n ) ) ε K θ 1 p k .

Recall K = K . Therefore, we obtain compact sets

G k , ε n = 1 N k ( ε ) g k , n ( U k , n ) , k K , and G ε k K G k , ε .

By the definition of the reduced boundary, the unit normal ν exists everywhere on G ε . By the Urysohn’s Lemma, there exists a continuous function ϕ : R 3 R 3 such that ϕ G ε = ν . By restricting ϕ on Ω ¯ and applying Stone-Weierstrass, we can find a smooth function ϕ ε : Ω ¯ : R 3 such that

ϕ ε ϕ L ( G ε ) < ε .

Then, in a neighborhood H Ω of G ε , we have ϕ ε > 1 2 . Pick h C 0 ( Ω ; [ 0 , 1 ] ) such that h 1 in H . Setting ψ ε ( x ) = h ( x ) ϕ ε ( x ) ϕ ε ( x ) , it is an easy task to check that

1 ψ ε ( x ) ν ( x ) 1 ε 1 + ε , x G ε .

By [10, Lemma 1], we can estimate

Ω θ d D u Ω ( θ ψ ε ) d D u = k K p k Ω θ ψ ε D χ D k d x k K p k G k , ε θ ψ ε D χ D k d x ε k K p k ( 1 ε ) 1 + ε G k , ε θ d D χ D k ε = k K p k ( 1 ε ) 1 + ε Ω θ d D χ D k Ω \ G k , ε θ d D χ D k ε k K p k ( 1 ε ) 1 + ε Ω θ d D χ D k 2 ε 1 + ε .

Since ε is arbitrary, we have

Ω θ d D u k K p k Ω θ d D χ D k .

The inverse inequality is obvious. We have thus proved the assertion.□

3.1.3 Total energy functional

Based on Propositions 3.1 and 3.3, the ensemble average nonpolar energy can be formulated as follows:

I np ( u ) = Ω θ d D u + Ω [ P h u + ρ s ( 1 u ) U vdW ] d x .

Therefore, the EASE functional is given by

(3.11) ( u ) = Ω θ d D u + Ω [ P h u + ρ s ( 1 u ) U vdW ] d x + Ω ρ ψ ε ( u ) 8 π ψ 2 χ { u < 1 } B ( ψ ) d x

with admissible set X and ψ = ψ u A is determined via the generalized PB equation (3.8).

3.2 Size-modified PB theory

In the classic PB theory, ionic solvents are assumed to be ideal: all ions and solvent molecules are point-like, and there is no correlation between the (number) concentrations of these particles. However, the ideal ionic solvent assumption overlooks the crucial finite size effect of mobile ions and solvent molecules. Numerical simulations have demonstrated that the point-like ion assumption in the classic PB theory can lead to nonphysically large ion concentrations near charged surfaces [6].

We will follow the idea in the pioneering work [5] by Bikerman to derive the size-modified polar energy formulation. Assume that each mobile ion and solvent molecule occupies a finite volume v . Building upon the definition of C 0 introduced in Section 2.1, we define

c 0 k = E ( C 0 X = k ) and c 0 = E ( C 0 ) = k K p k c 0 k .

It is assumed that inside the solvent phase, apart from the occupancy of ions, the remaining space is filled with solvent molecules. In this context, the following relation holds:

(3.12) v j = 0 N c c j k ( x ) = χ U k ( x ) , x Ω .

This relation enforces a maximum concentration of v 1 for each ion species. (3.12) implies that

(3.13) c 0 = k K p k c 0 k = k K p k v 1 j = 1 N c c j k χ U k = v 1 ( 1 u ) j = 1 N c c j ,

Following [5], we can include the solvent entropic contribution and then the size-modified entropy S m d can be written as follows:

T S m d = β 1 { u < 1 } j = 0 N c c j [ ln ( v c j ) 1 ] d x = β 1 { u < 1 } j = 1 N c c j [ ln ( v c j ) 1 ] d x + β 1 { u < 1 } v 1 ( 1 u ) j = 1 N c c j ln ( 1 u ) v j = 1 N c c j 1 d x .

See also [6,7,37,38] for some related works.

Following the discussion in Remark 2.1, the variation of H m d = E ψ T S m d , cf. (3.5), with respect to c j , j = 1 , , N c , gives

μ j = q j ψ + β 1 ln ( v c j ) ln ( 1 u ) v i = 1 N c c i χ { u < 1 } .

This leads to the relation

c j ( 1 u ) v i = 1 N c c i = χ { u < 1 } v 1 e β μ j e β q j ψ .

When x is sufficient far away from the solute, we have u ( x ) = 0 , ψ ( x ) = 0 and c j ( x ) = c j with j { 0 , 1 , , N c } , where c 0 is the (constant) bulk concentration of the solvent molecule. This implies that c j = c 0 e β μ j , where we have used the relation (3.13).

(3.14) c j ( 1 u ) v i = 1 N c c i = χ { u < 1 } h j e β q j ψ , where h j = c j v c 0 .

By summing over j = 1 , , N c , one can find out that

j = 1 N c c j = ( 1 u ) χ { u < 1 } j = 1 N c h j e β q j ψ 1 + χ { u < 1 } v j = 1 N c h j e β q j ψ = ( 1 u ) j = 1 N c c j e β q j ψ v c 0 + v j = 1 N c c j e β q j ψ .

Here, we have utilized the relation ( 1 u ) χ { u < 1 } = 1 u . Note that the aforementioned equality implies that

1 u v j = 1 N c c j = ( 1 u ) v c 0 v c 0 + v j = 1 N c c j e β q j ψ .

Plugging this expression into (3.14) yields

(3.15) c j = ( 1 u ) c j e β q j ψ v c 0 + v i = 1 N c c i e β q i ψ .

Using (3.15), the size-modified Helmholtz free energy H m d can be reformulated as follows:

H m d = Ω ρ ψ ε ( u ) 8 π ψ 2 + β 1 v 1 ( 1 u ) ln ( 1 u ) v c 0 v c 0 + v j = 1 N c c j e β q j ψ 1 d x .

A constant term (with respect to fixed u )

(3.16) β 1 v 1 ( 1 u ) ln v c 0 + v j = 1 N c c j β 1 v 1 ( 1 u ) [ ln ( ( 1 u ) v c 0 ) 1 ]

needs to be added to H m d to adjust the reference state of the zero energy in the grand canonical ensemble under consideration. We finally arrive at the following size-modified polar energy functional:

I p , m d ( u , ψ ) = Ω ρ ψ ε ( u ) 8 π ψ 2 ( 1 u ) B m d ( ψ ) d x ,

where the function B m d : R R is defined by

B m d ( s ) = β 1 v 1 ln v c 0 + v j = 1 N c c j e β q j s v c 0 + v j = 1 N c c j .

Direct computations show

B m d ( s ) = j = 1 N c c j q j e β q j s v c 0 + v j = 1 N c c j e β q j s

and

B m d ( s ) = v β c 0 j = 1 N c c j q j 2 e β q j s + v β j = 1 N c c j q j 2 e β q j s j = 1 N c c j e β q j s v β j = 1 N c c j q j e β q j s 2 v c 0 + v j = 1 N c c j e β q j s 2 .

The Cauchy-Schwartz inequality shows that B m d ( s ) > 0 for all s R . Therefore, B m d is strictly convex. It follows from (2.7) that B m d ( 0 ) = 0 , and thus, 0 = B m d ( 0 ) = min s R B m d ( s ) . Plugging (3.15) into (3.3) gives the size-modified PB equation

(3.17) [ ε ( u ) ψ ] 4 π ( 1 u ) B m d ( ψ ) + 4 π ρ = 0 in Ω .

Lemma A.1 shows that, for any fixed u X , ψ A solves (3.17) iff it maximizes I p , m d ( u , ) .

Define L m d : X R by

L m d ( u ) = Ω [ P h u + ρ s ( 1 u ) U vdW ] d x + I p , m d ( u , ψ u ) ,

where ψ u maximizes I p , m d ( u , ) in A . Following the proof of Lemma 3.2, we can prove a similar result for L m d .

Lemma 3.4

Let { D k } k K be a family of Caccioppoli sets satisfying Ω i D k Ω \ Ω ¯ e and p k [ 0 , 1 ] with k K p k = 1 . Then for each ε > 0 , there exists another family { D ˜ k } k K of Caccioppoli sets satisfying Ω i D ˜ k Ω \ Ω ¯ e and

2 ( * D ˜ k * D ˜ j ) = 0 , k , j K , k j .

Moreover,

k K p k Ω θ d D χ D k + L m d ( u ) k K p k Ω θ d D χ D ˜ k L m d ( u ˜ ) < ε ,

where u = k K p k χ D k and u ˜ = k K p k χ D ˜ k .

By virtue of Lemma 3.4 and Proposition 3.3, the size-modified EASE functional can be expressed as follows:

(3.18) m d ( u ) = Ω θ d D u + Ω P h u + ( 1 u ) ( ρ s U vdW B m d ( ψ ) ) + ρ ψ ε ( u ) 8 π ψ 2 d x

with admissible set X and ψ = ψ u A solves the size-modified PB equation (3.17).

Remark 3.5

The main difference between (3.11) and (3.18) (or between (3.8) and (3.17)) is the expressions of Boltzmann distributions. At each x Ω , in the classic PB theory, the concentration of the jth ion species is represented by (3.6), which can tend to + as long as u ( x ) < 1 . In contrast, (3.15) enforces the constraint j = 1 N c c j ( 1 u ) v 1 . Hence, the concentration of each ion species cannot exceed v 1 .

4 Analysis of EASE

We will perform some preliminary analysis for the EASE models developed in the previous section. The following example shows that is not l.s.c. with respect to convergence in W 1 , 1 ( Ω ) .

Example 4.1

We take ψ D 1 , Ω i = B ( 0 , 1 ) , Ω t = { x R 3 : 1 < x < 2 } and Ω e = { x R 3 : 2 < x < 3 } . Choose u X C ( Ω ) such that u ( x ) = 1 for x 5 3 and u ( x ) < 1 elsewhere. We can construct a sequence u n X W 1 , 1 ( Ω ) such that u n u in W 1 , 1 ( Ω ) satisfying

  • u n ( x ) < 1 for all x > 1 ,

  • u n ( x ) = n 1 n for 4 3 x 5 3 .

For instance, we can define u n ( x ) = n + 3 n 3 n x for 1 x 4 3 and u n ( x ) = n 1 n u ( x ) for x > 4 3 . Direct computations show that

lim n I np ( u n ) = I np ( u ) .

However, Lemma A.2 shows that

(4.1) lim n I p ( u n , ψ u n ) = Ω ρ ψ * ε ( u ) 8 π ψ * 2 χ { x > 1 } B ( ψ * ) d x I p ( u , ψ * ) I p ( u , ψ u ) ,

where ψ u A is the solution of (3.8) and ψ u n is the solution with u replaced by u n . In addition, ψ * is the solution of

[ ε ( u ) ψ ] 4 π χ { x > 1 } B ( ψ ) + 4 π ρ = 0 in Ω .

Note that equalities hold in (4.1) iff ψ u = ψ * and ψ * ( x ) = 0 for a.a. 1 < x 5 3 . If the second condition holds, then ψ * solves

[ ε ( u ) ψ ] 4 π B ( ψ ) = 0 in { 1 < x < 3 } , ψ = 0 on { x = 1 } , ψ = 1 on Ω .

Then it follows from the elliptic unique continuation theorem, c.f. [19, Theorem 1.4], that ψ * 0 , which contradicts with the fact that ψ * = 1 on Ω .

The aforementioned analysis reveals that the absence of lower semicontinuity in arises from the discontinuous term χ { u < 1 } . Minimizing nonlower semicontinuous functionals poses a significant challenge. Particularly, the computational and analytical complexities associated with the discontinuous term further compound the difficulty. Consequently, the minimization of remains an open problem.

In contrast, the discontinuous term χ { u < 1 } is obviated in m d through the relation ( 1 u ) χ { u < 1 } = 1 u . This elimination allows us to establish the lower semicontinuity of m d , as demonstrated in the proof of Theorem 4.2 below. This paves the way for establishing additional analytic properties of m d . Consequently, our focus in the remainder of this article will center on the analysis of the size-modified EASE functional m d . The analyses in the following two subsections underscore the advantages of incorporating finite size effects in EASE modeling from an analytical perspective.

4.1 Existence of minimizer

Theorem 4.2

m d has a global minimizer in X .

Proof

First, we will show that m d is convex. Indeed, given any u 0 , u 1 X , let u t = t u 0 + ( 1 t ) u 1 for t [ 0 , 1 ] . We have u t X for all t [ 0 , 1 ] . For any v X , let ψ v be the solution of (3.17) in A with u replaced by v . Note that, for any fixed ψ A , both I np ( ) and I p , m d ( , ψ ) are convex. Then it follows from Lemma A.1 that

m d ( u t ) = I np ( u t ) + I p , m d ( u t , ψ u t ) t [ I np ( u 0 ) + I p , m d ( u 0 , ψ u t ) ] + ( 1 t ) [ I np ( u 1 ) + I p , m d ( u 1 , ψ u t ) ] t [ I np ( u 0 ) + I p , m d ( u 0 , ψ u 0 ) ] + ( 1 t ) [ I np ( u 1 ) + I p , m d ( u 1 , ψ u 1 ) ] = t m d ( u 0 ) + ( 1 t ) m d ( u 1 ) .

The lower semicontinuity of m d in X with respect to convergence in L 1 ( Ω ) is a direct conclusion from the dominated convergence theorem, [10, Corollary 1], and Lemma A.2. Now the existence of a minimizer of m d in X can be immediately obtained by the direct method of calculus of variation.□

Remark 4.3

  1. The assertion of Theorem 4.2 remains valid if we merely assume that Σ 0 and Σ 1 are Lipschitz continuous.

  2. Since the functional m d is not strictly convex, global minimizer of m d is generally not unique. However, the primary focus in the applications of VISM is on the computed solvation energy min u X m d rather than the specific surface profile u . In the study by Shao et al. [51], we proposed a p -Laplacian-based computational model to capture the solvation energy effectively.

4.2 Continuous dependence on the constrains

Let E = min u X m d ( u ) . Since E depends on Constraint (3.2) and thus on Σ 0 and Σ 1 , we will justify the robustness of (3.18) by demonstrating that E depends continuously on Σ j with j = 0 , 1 in a proper topology. Without loss of generality, we may assume that Σ j are connected. The general case can be proved in a similar manner.

For k = 1 , 2 , let ℳℋ k be the set of pairs ( Σ 0 , Σ 1 ) of compact, connected and oriented C k -hypersurfaces without boundary contained in the open set Ω , which induces a decomposition ( Ω i , Ω t , Ω e ) as shown in Figure 1(b). The orientation of Σ 0 ( Σ 1 , resp.) is so taken that its outward unit normal vector field ν Σ 0 ( ν Σ 1 , resp.) points into Ω t . To summarize, the open sets ( Ω i , Ω t , Ω e ) satisfy the following properties.

  • ( Ω i , Ω t , Ω e ) are connected so that Σ 0 Σ 1 = .

  • i = 1 N m B ¯ ( x i , σ i ) Ω i Ω \ Ω ¯ e .

  • Ω Ω e .

Theorem 4.2 and Remark 4.3 show that E can be considered a functional defined on ℳℋ 1 . In the rest of this section, we denote m d , X , and ( Ω i , Ω t , Ω e ) by m d , Γ , X Γ , and ( Ω i , Γ , Ω t , Γ , Ω e , Γ ) to indicate their dependence on Γ = ( Σ 0 , Σ 1 ) ℳℋ 1 . In Theorem 4.4, it will be shown that E is indeed continuous while restricted to ℳℋ 2 . To this end, we will first demonstrate that ℳℋ k are metric spaces.

The normal bundle of a compact, connected and oriented C 1 -hypersurface Σ without boundary is given by

N 1 Σ = { ( q , ν Σ ( q ) ) : q Σ } R 3 × R 3 ,

where ν Σ ( q ) , is the outward unit normal of Σ at q Σ . If in addition, Σ is C 2 , its second normal bundle can be defined as follows:

N 2 Σ = { ( q , ν Σ ( q ) , Σ ν Σ ( q ) ) : q Σ } R 3 × R 3 × R 6 ,

where Σ is the surface gradient of Σ . Recall that the Hausdorff metric on compact subsets K R n is defined by

d ( K 1 , K 2 ) = max { sup x K 1 d ( x , K 2 ) , sup x K 2 d ( x , K 1 ) } .

We can equip ℳℋ k with the metric

d ℳℋ k ( ( Σ 0 , Σ 1 ) , ( Σ ˜ 0 , Σ ˜ 1 ) ) = d ( N k Σ 0 , N k Σ ˜ 0 ) + d ( N k Σ 1 , N k Σ ˜ 1 ) , ( Σ 0 , Σ 1 ) , ( Σ ˜ 0 , Σ ˜ 1 ) ℳℋ k .

Following [50, Chapter 2], one can show that ℳℋ 2 equipped with d ℳℋ 2 is a Banach manifold. The following theorem is the main result of this section.

Theorem 4.4

E C ( ℳℋ 2 ) .

We will split the proof of Theorem 4.4 into several steps. Pick arbitrary Γ = ( Σ 0 , Σ 1 ) ℳℋ 2 . To prove the continuity of E at Γ , we choose an arbitrary sequence Γ n ( Σ 0 , n , Σ 1 , n ) ℳℋ 1 . In the sequel, we will first prove two propositions concerning the convergence lim n E ( Γ n ) = E ( Γ ) under the condition

(4.2) lim n d ℳℋ 1 ( Γ n , Γ ) = 0

when Γ n converges to Γ from the interior and exterior of Γ (with respect to the orientations of ( Σ 0 , Σ 1 ) ). In the rest of this section, we will denote by u min ( u min ; n , resp.) a global minimizer of m d , Γ ( m d , Γ n resp.) in X Γ ( X Γ n resp.).

Proposition 4.5

Assume that Γ = ( Σ 0 , Σ 1 ) ℳℋ 2 and Γ n = ( Σ 0 , n , Σ 1 , n ) ℳℋ 1 satisfy (4.2). If

Ω i , Γ n Ω i , Γ and Ω e , Γ n Ω e , Γ ,

then lim n E ( Γ n ) = E ( Γ ) .

Proof

Observe that u min X Γ n for all n . Thus,

E ( Γ n ) = m d , Γ n ( u min ; n ) m d , Γ n ( u min ) = m d , Γ ( u min ) = E ( Γ ) .

It follows from Lemma A.1 that there exists some constant M > 0 such that

Ω ρ s U vdW ( 1 u min ; n ) d x + I p , m d ( u min ; n , ψ u min ; n ) M ,

where ψ u min ; n A is the solution of (3.17) with u replaced by u min ; n . This implies that

θ 0 Ω d D u min ; n + P h u min ; n L 1 M E ( Γ ) .

Therefore, u min ; n B V is uniformly bounded. [26, Theorem 5.2.3.4] implies that there exists a subsequence, not relabeled, and some u B V ( Ω ) such that u min ; n u in L 1 ( Ω ) . Note that (4.2) implies that

lim n χ Ω i , Γ n χ Ω i , Γ L 1 = lim n χ Ω e , Γ n χ Ω e , Γ L 1 = 0 ,

and thus, u X Γ . From [10, Corollary 1], Lemma A.2 and the dominated convergence theorem, we infer that

E ( Γ ) m d , Γ ( u ) liminf n E ( Γ n ) limsup n E ( Γ n ) E ( Γ ) .

This proves the assertion.□

To prove the convergence lim n E ( Γ n ) = E ( Γ ) in case

Ω i , Γ n Ω i , Γ and Ω e , Γ n Ω e , Γ ,

we will need some preparations. Because Σ i are C 2 , following the proof of Lemma 3.2, the map

Λ i : Σ i × ( a , a ) B a ( Σ i ) : ( q , r ) q + r ν Σ i ( q ) , i = 0 , 1 ,

is a C 1 -diffeomorphism for some a > 0 , where B a ( Σ i ) is the tubular neighborhood of Σ i with width a and ν Σ i is the outward unit normal of Σ i . By the inverse function theorem, there exist two maps P i C 1 ( B a ( Σ i ) , Σ i ) and d i C 1 ( B a ( Σ i ) , ( a , a ) ) , where P i is the nearest point projection onto Σ i and d i is the signed distance to Σ i with d i ( x ) > 0 for x B a ( Σ i ) Ω t . Note that G i r Λ i ( Σ i , r ) ℳℋ 1 . Its orientation is chosen to be compatible with that of Σ i , so that its outward unit normal

ν i , r ( x ) = ν Σ i ( P i ( x ) ) , x G i r .

See [50, Section 2.2.2]. Then it is clear that lim r 0 d ( G i r , Σ i ) = 0 . Without loss of generality, we may assume that a > 0 is so small that for any r ( a , a ) , G 0 r G 1 r = .

Lemma 4.6

For every f X Γ , we define

f r ( x ) = 1 , x Ω i B r ( Σ 1 ) 0 , x Ω e B r ( Σ 0 ) f ( x ) , elsewhere

for r ( 0 , a ) . Then f r f in L 1 ( Ω ) and Ω θ d D f r ( x ) Ω θ d D f ( x ) as r 0 + .

Proof

The proof for f r f in L 1 ( Ω ) is straightforward. So we will only show the second part. [10, Corollary 1] implies that Ω θ d D f ( x ) liminf r 0 + Ω θ d D f r ( x ) . In the rest of the proof, it is assumed that i { 0 , 1 } . Put U i r B r ( Σ i ) Ω t . For any ϕ C c 1 ( Ω ; R 3 ) with ϕ 1 and u X Γ , the trace theorem of B V -functions, cf. [26, Theorem 5.3.1] implies that

U i r u ( θ ϕ ) d x + U i r ( θ ϕ ) d [ D u ] = G i r ( θ ϕ ) ν i , r T r u d 2 Σ i ( θ ϕ ) ν Σ i T r u d 2 .

Here, T r u is the trace of u U i r on U i r ; and [ D u ] is the vector-valued measure for the gradient of u , cf. Appendix C. Pushing r 0 + yields that

(4.3) lim r 0 + G i r ( θ ϕ ) ν i , r T r u d 2 = Σ i ( θ ϕ ) ν Σ i T r u d 2 .

Direct computations show

Ω ( θ ϕ ) d [ D f r ] = i = 0 , 1 G i r ( θ ϕ ) d [ D f r ] + Ω t \ ( U 0 r U 1 r ) ( θ ϕ ) d [ D f ] = Ω ( θ ϕ ) d [ D f ] i = 0 , 1 U i r ( θ ϕ ) d [ D f ] i = 0 , 1 Σ i ( θ ϕ ) d [ D f ] i = 0 , 1 G i r ( θ ϕ ) d [ D ( f f r ) ] Ω θ d D f ( x ) i = 0 , 1 U i r ( θ ϕ ) d [ D f ] i = 0 , 1 Σ i ( θ ϕ ) d [ D f ] i = 0 , 1 G i r ( θ ϕ ) d [ D ( f f r ) ] .

It follows from (4.3) that

i = 0 , 1 Σ i ( θ ϕ ) d [ D f ] i = 0 , 1 G i r ( θ ϕ ) d [ D ( f f r ) ] = i = 0 , 1 Σ i ( θ ϕ ) ν Σ i ( i T r f ) d 2 i = 0 , 1 G i r ( θ ϕ ) ν i , r ( i T r f ) d 2 0

as r 0 + . See the proof of [26, Theorem 5.4.1]. We clearly have lim r 0 + U i r ( θ ϕ ) d [ D f ] = 0 . Therefore, we can infer from (2.4) and [26, Theorem 5.3.1] that

limsup r 0 + Ω θ d D f r ( x ) Ω θ d D f ( x ) .

This proves the assertion□

Proposition 4.7

Assume that Γ = ( Σ 0 , Σ 1 ) ℳℋ 2 and Γ n = ( Σ 0 , n , Σ 1 , n ) ℳℋ 1 satisfy (4.2). If

Ω i , Γ n Ω i , Γ and Ω e , Γ n Ω e , Γ ,

then lim n E ( Γ n ) = E ( Γ ) .

Proof

Let r n = d ℳℋ 1 ( Γ n , Γ ) . For sufficiently large n N , we define

u n ( x ) = 1 , x Ω i B r n ( Σ 1 ) , 0 , x Ω e B r n ( Σ 0 ) , u min ( x ) , elsewhere .

Since u n X Γ n and u min ; n X Γ , we have

(4.4) E ( Γ ) m d , Γ ( u min ; n ) = m d , Γ n ( u min ; n ) = E ( Γ n ) m d , Γ n ( u n ) = m d , Γ ( u n ) .

Lemma 4.6 implies that as n

u n u min in L 1 ( Ω ) and Ω θ d D u n Ω θ d D u min .

Using the dominated convergence theorem and Lemma A.2 yields lim n m d , Γ ( u n ) = m d , Γ ( u min ) = E ( Γ ) . Then the asserted statement follows by pushing n in (4.4).□

Proof of Theorem 4.4

Now we assume that Γ n ℳℋ 2 and

Γ n ( Σ 0 , n , Σ 1 , n ) Γ in ℳℋ 2 .

Let r n = d ℳℋ 2 ( Γ n , Γ ) . Then G ± r n = ( G 0 ± r n , G 1 ± r n ) ℳℋ 1 . We denote by u n ± a global minimizer of m d , G ± r n in X G ± r n . Because u min ; n X G r n and u n + X Γ n , we infer that

E ( G r n ) m d , G r n ( u min ; n ) = m d , Γ n ( u min ; n ) = E ( Γ n ) m d , Γ n ( u n + ) = m d , G + r n ( u n + ) = E ( G + r n ) .

Propositions 4.5 and 4.7 show that

lim n E ( G r n ) = lim n E ( G + r n ) = E ( Γ ) .

Therefore, lim n E ( Γ n ) = E ( Γ ) .

5 Conclusion

VISM have been successful in the computation of solvation energies. However, traditional sharp-interface VISMs do not account for the inherent randomness of the solute–solvent interface, stemming from thermodynamic fluctuations. It has been demonstrated that neglecting these fluctuations can lead to substantial errors in predicting solvation free energies [47].

In this work, a new approach to calculating EASE is developed using diffuse-interface VISM. Grounded in principles of statistical mechanics and geometric measure theory, the method effectively captures EASE during the solvation process by employing a novel diffuse-interface profile u ( x ) , which represents the probability of finding a point x in the solute phase across all microstates within the grand canonical ensemble. To showcase the versatility of our modeling paradigm, we formulate two EASE functionals: one within the classic PB framework and another within the framework of size-modified PB theory, accounting for finite-size effects of mobile ions and solvent molecules.

In the routine calculation of the EASE, one needs to carry out MD simulations to obtain thousands of solute–solvent configures (snapshots) and perform energy calculations for each snapshot. By meticulously modeling the impact of conformational changes in the solvent medium, the proposed model can reproduce EASE with just one diffuse-interface configuration. Compared with ensemble-averaging energies from thousands of snapshots, the alternative approach provided by our EASE models has the potential to improve the efficiency of EASE calculations. The recent work [11] provides some partial support for the effectiveness of the proposed approach, where it is shown that ensemble average polar solvation energy can be approximated by using one Gaussian-based smooth dielectric function.

Preliminary analyses of the proposed EASE models indicate that the size-modified EASE functional outperforms the EASE functional in the classic PB theory in various analytical aspects. More precisely, we show that the size-modified EASE functional enjoys lower semicontinuity, a property that the classic EASE functional fails to have. This guarantees the existence of a global minimizer of the size-modified EASE functional. Further, the predicted solvation energy by the size-modified EASE functional depends continuously on the model constraints. Motivated by these observations, we plan to conduct numerical implementations and further theoretical analyses of the size-modified EASE functional in the future work.

  1. Funding information: This research was partially supported by the National Science Foundation under grants DMS-2306991 (Shao, Zhao), DMS-1818748 and DMS-2306992 (Chen), DMS-2110914 (Zhao), and a CARSCA grant from the University of Alabama (Shao).

  2. Author contributions: All authors contribute equally to the preparation of this manuscript.

  3. Conflict of interest: The authors declare that they have no known competing financial interests that could have appeared to influence the work reported in this article. Although Dr. Shan Zhao is an Editor-in-Chief of the Computational and Mathematical Biophysics, he had no involvement in the peer reviewing process or the final decision of this manuscript.

  4. Ethical approval: This research does not require ethical approval.

  5. Data availability statement: Data sharing is not applicable to this article as no datasets were generated or analysed during the current study.

Appendix A class of strictly convex functionals

In this section, we collect some results concerning a class of strictly convex functionals associated with the polar solvation energy . These results can be proved by following the proofs of [16, Propositions 3.1 and 3.2] line by line. We will thus omit their proofs.

Throughout this appendix, we assume that F C ( R ) is a strictly convex function with F ( 0 ) = 0 and F ( 0 ) = 0 .

Lemma A.1

Assume that a , b , c L ( Ω ) satisfy

(A1) 0 < L 0 a L 1 , 0 b L 2 , c L 3

for some constants L i . Then the functional

G ( ψ ) = Ω a 2 ψ 2 + b F ( ψ ) c ψ d x

has a unique minimizer ψ A for every ψ D W 1 , ( Ω ) , c.f. (2.10), or equivalently, ψ weakly solves

( a ψ ) b F ( ψ ) + c = 0 in Ω ; ψ = ψ D on Ω .

Moreover, for some constant C depending only on L i and ψ D

ψ H 1 + ψ L C .

Lemma A.2

Let a n , b n , c L ( Ω ) satisfy (A1), n = 0 , 1 , . Assume that ψ n is the unique minimizer of

G n ( ψ ) = Ω a n 2 ψ 2 + b n F ( ψ ) c ψ d x

in A for some ψ D W 1 , ( Ω ) . If a n a 0 and b n b 0 in L 1 ( Ω ) as n , then

ψ n ψ 0 in H 1 ( Ω ) , and lim n G n ( ψ n ) = G 0 ( ψ 0 ) .

B A comparison with GFBVISM

In this section, we will conduct a comparative analysis of the proposed models (3.11) and (3.18) with a closely related VISM known as the GFBVISM, which is rooted in classic PB theory. GFBVISM [14] is defined as follows:

(B1) ( 2 ) ( u ) = I np ( u ) + I p ( 2 ) ( u , ψ ) ,

where

(B2) I p ( 2 ) ( u , ψ ) = Ω ρ ψ ε ( u ) 8 π ψ 2 ( 1 u ) B ( ψ ) d x ,

and ψ A solves

(B3) [ ε ( u ) ψ ] 4 π ( 1 u ) B ( ψ ) + 4 π ρ = 0 in Ω .

Despite certain similarities between (3.11) and (3.18) and GFBVISM, there exist fundamental distinctions between them.

First, (B1) was introduced in an ad hoc way to create a transition region between the solute and solvent, lacking an explanation of the physical meanings of the transition parameter u and the predicted energy.

Second, it has come to attention that the original formulation of GFBVISM [14] does not incorporate Constraints (3.1) and (3.2). This absence of constraints introduces the possibility of nonphysical minimizers within the GFBVISM model. For instance, it may allow for trivial values of u or even negative values, which are not physically meaningful in the given context. Therefore, including these constraints becomes crucial to ensure the model’s solutions align with physical principles.

Third and most importantly, the proposed models correct and improve the derivation of the ensemble average polar energy. Due to Proposition 3.1, one may guess that I p ( 2 ) approximates the ensemble average polar energy. However, Formulation (B2) is questionable in the sense that it is derived from an erroneous “entropy” formulation. To see this, on a heuristic level, one can ensemble average the entropy and obtain

T S = T k K p k S k = β 1 k K p k Ω j = 1 N c c j k [ ln ( v c j k ) 1 ] d x and H E T S ,

where S k is defined in (2.15). By taking the first variations of H with respect to all c j k , a similar argument to Remark 2.1 gives (2.16). It follows from (3.4) that

(B4) c j ( x ) = k K p k c j k ( x ) = k K p k χ U k ( x ) c j k ( x ) = ( 1 u ( x ) ) c j e β q j ψ ( x ) .

See (2.15) for the definition of S k . Plugging (B4) into (2.11) yields (B3). Using the expression (B4) of c j in H and adding a constant term β 1 j = 1 N c c j Ω ( 1 u ) d x to adjust the state of zero energy as in (3.16) give the polar energy formulation (B2). From a statistical mechanics perspective, it is indeed important to note that the choice of the “entropy” S and “Helmholtz free energy” H in the previous derivation is incorrect. Consequently, (3.11) corrects the polar energy formulation within the classical PB theory framework, albeit introducing a discontinuous term χ { u < 1 } . It is crucial to adhere to the correct statistical mechanics principles when formulating the ensemble averages. Moreover, (3.18) further refines the EASE formulation by accounting for the finite size effects and eliminate the discontinuous term χ { u < 1 } .

C Geometric measure theorem

In this section, we will introduce some concepts from geometric measure theory used in this article. Throughout, we assume that U is an open set of R N . The main reference is [14].

Definition C.1

A function f L 1 ( U ) has bounded variation in U if

D f ( U ) sup U f ϕ d x : ϕ C c 1 ( U ; R N ) , ϕ 1 < .

We denote by B V ( U ) the space of all functions of bounded variation in U . B V ( U ) is a Banach space equipped with the norm

f B V ( U ) f L 1 ( U ) + D f ( U ) .

An N -measurable subset E U is a Caccioppoli subset of U if χ E B V ( U ) . For each V U , we write

E ( V ) sup E ϕ d x : ϕ C c 1 ( V ; R N ) , ϕ 1 .

The following result is called the structure theorem for B V -functions.

Theorem C.2

Let f B V ( U ) . Then there exist a Radon measure μ on U and a μ -measurable function σ : U R N such that

  1. σ ( x ) = 1 μ -a.e., and

  2. for every ϕ C c 1 ( U ; R N )

    U f ϕ d x = U ϕ σ d μ .

The measure μ is denoted by D f . If f = χ E for a Caccioppoli subset E of U, we write ν E = σ .

The (vector-valued) measurable with density function σ with respect to μ is denoted by [ D f ] .

Definition C.3

Let E R N be a Caccioppoli set and x R N . We say x E , the reduced boundary of E , if

  1. D χ E ( B ( x , r ) ) > 0 for all r > 0 , and

  2. lim r 0 + 1 N ( B ( x , r ) ) B ( x , r ) ν E d E = ν E ( x ) , and

  3. ν E ( x ) = 1 .

References

[1] Baker, N. A. (2005). Improving implicit solvent simulations: a Poisson-centric view. Current Opinion in Structural Biology, 15(2), 137–43. DOI: https://doi.org/10.1016/j.sbi.2005.02.001.10.1016/j.sbi.2005.02.001Search in Google Scholar PubMed

[2] Baker, N. A., Sept, D., Joseph, S., Holst, M. J., & McCammon, J. A. (2001). Electrostatics of nanosystems: Application to microtubules and the ribosome. Proceedings of the National Academy of Sciences, 98(18), 10037–10041. DOI: https://doi.org/10.1073/pnas.18134239.10.1073/pnas.181342398Search in Google Scholar PubMed PubMed Central

[3] Ball, P. (2003). How to keep dry in water. Nature, 423(6935), 25–26. DOI: https://doi.org/10.1038/423025a.10.1038/423025aSearch in Google Scholar PubMed

[4] Bates, P. W., Wei, G. W., & Zhao, S. (2008). Minimal molecular surfaces and their applications. Journal of Computational Chemistry, 29(3), 380–91. DOI: https://doi.org/10.1002/jcc.20796.10.1002/jcc.20796Search in Google Scholar PubMed

[5] Bikerman, J. (1942). XXXIX. Structure and capacity of electrical double layer. The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science, 33(220), 384–397. DOI: https://doi.org/10.1080/14786444208520813.10.1080/14786444208520813Search in Google Scholar

[6] Borukhov, I., Andelman, D., & Orland, H. (1997). Steric effects in electrolytes: A modified Poisson-Boltzmann equation. Physical Review Letters, 79, 435–438. DOI: https://doi.org/10.1103/PhysRevLett.79.435.10.1103/PhysRevLett.79.435Search in Google Scholar

[7] Borukhov, I., Andelman, D., & Orland, H. (2000). Adsorption of large ions from an electrolyte solution: a modified Poisson-Boltzmann equation. Electrochimica Acta, 46(2), 221–229. DOI: https://doi.org/10.1016/S0013-4686(00)00576-4.10.1016/S0013-4686(00)00576-4Search in Google Scholar

[8] Boschitsch, A. H., & Fenley, M. O. (2004). Hybrid boundary element and finite difference method for solving the nonlinear Poisson-Boltzmann equation. Journal of Computational Chemistry, 25(7), 935–955. DOI: https://doi.org/10.1002/jcc.20000.10.1002/jcc.20000Search in Google Scholar PubMed

[9] Botello-Smith, W. M., Liu, X., Cai, Q., Li, Z., Zhao, H., & Luo, R. (2013). Numerical Poisson-Boltzmann model for continuum membrane systems. Chemical Physics Letters, 555, 274–281. DOI: https://doi.org/10.1016/j.cplett.2012.10.081.10.1016/j.cplett.2012.10.081Search in Google Scholar PubMed PubMed Central

[10] Carlier, G., & Comte, M. (2007). On a weighted total variation minimization problem. Journal of Functional Analysis, 250(1), 214–226. DOI: https://doi.org/10.1016/j.jfa.2007.05.022.10.1016/j.jfa.2007.05.022Search in Google Scholar

[11] Chakravorty, A., Jia, Z., Li, L., Zhao, S., & Alexov, E. (2018). Reproducing the ensemble average polar solvation energy of a protein from a single structure: Gaussian-based smooth dielectric function for macromolecular modeling. Journal of Chemical Theory and Computation, 14(2), 1020–1032. DOI: https://doi.org/10.1021/acs.jctc.7b00756.10.1021/acs.jctc.7b00756Search in Google Scholar PubMed PubMed Central

[12] Che, J., Dzubiella, J., Li, B., & McCammon, J. A. (2008). Electrostatic free energy and its variations in implicit solvent models. The Journal of Physical Chemistry B, 112(10), 3058–3069. DOI: https://doi.org/10.1021/jp7101012.10.1021/jp7101012Search in Google Scholar PubMed

[13] Chen, C. J., Saxena, R., & Wei, G. W. (2010). A multiscale model for virus capsid dynamics. International Journal of Biomedical Imaging, 308627. DOI: 10.1155/2010/308627.10.1155/2010/308627Search in Google Scholar PubMed PubMed Central

[14] Chen, Z., Baker, N. A., & Wei, G. W. (2010). Differential geometry based solvation model I: Eulerian formulation. Journal of Computational Physics, 229(22), 8231–8258. DOI: https://doi.org/10.1016/j.jcp.2010.06.036.10.1016/j.jcp.2010.06.036Search in Google Scholar PubMed PubMed Central

[15] Chen, Z., Baker, N. A., & Wei, G. W. (2010). Differential geometry based solvation models II: Lagrangian formulation. Journal of Mathematical Biology, 63, 1139–1200. DOI: https://doi.org/10.1007/s00285-011-0402-z.10.1007/s00285-011-0402-zSearch in Google Scholar PubMed PubMed Central

[16] Chen, Z., & Shao, Y. (2023). A new approach to constrained total variation solvation models and the study of solute–solvent interface profiles. Computers & Mathematics with Applications, 130, 119–136. DOI: https://doi.org/10.48550/arXiv.2203.11285.10.1016/j.camwa.2022.12.002Search in Google Scholar

[17] Chen, Z., Zhao, S., Chun, J., Thomas, D. G., Baker, N. A., Bates, P. W., & Wei, G. W. (2012). Variational approach for nonpolar solvation analysis. The Journal of Chemical Physics, 137(8), 084101. DOI: https://doi.org/10.1063/1.4745084.10.1063/1.4745084Search in Google Scholar PubMed PubMed Central

[18] Cheng, L. T., Dzubiella, J., McCammon, A. J., & Li, B. (2007). Application of the level-set method to the implicit solvation of nonpolar molecules. Journal of Chemical Physics, 127(8), 084503. DOI: https://doi.org/10.48550/arXiv.0809.0181.10.1063/1.2757169Search in Google Scholar PubMed

[19] Choulli, M. Uniqueness of continuation for semilinear elliptic equations. arXiv:2208.08378. Search in Google Scholar

[20] Crowley, P. B., & Golovin, A. (2005). Cation-pi interactions in protein-protein interfaces. Proteins: Structure, Function, and Bioinformatics, 59(2), 231–239. DOI: https://doi.org/10.1002/prot.20417.10.1002/prot.20417Search in Google Scholar PubMed

[21] Dai, S., Li, B., & Lu, J. (2018). Convergence of phase-field free energy and boundary force for molecular solvation. Archive for Rational Mechanics and Analysis, 227(1), 105–147. DOI: https://doi.org/10.1007/s00205-017-1158-4.10.1007/s00205-017-1158-4Search in Google Scholar

[22] Dong, F., & Zhou, H. X. (2006). Electrostatic contribution to the binding stability of protein-protein complexes. Proteins, 65(1), 87–102. DOI: https://doi.org/10.1002/prot.21070.10.1002/prot.21070Search in Google Scholar PubMed

[23] Dragan, A. I., Read, C. M., Makeyeva, E. N., Milgotina, E. I., Churchill, M. E., Crane-Robinson, C., & Privalov, P. L. (2004). DNA binding and bending by HMG boxes: Energetic determinants of specificity. Journal of Molecular Biology, 343(2), 371–393. DOI: https://doi.org/10.1016/j.jmb.2004.08.035.10.1016/j.jmb.2004.08.035Search in Google Scholar PubMed

[24] Dzubiella, J., Swanson, J. M. J., & McCammon, J. A. (2006). Coupling hydrophobicity, dispersion, and electrostatics in continuum solvent models. Physical Review Letters, 96, 087802. DOI: https://doi.org/10.1103/PhysRevLett.96.087802.10.1103/PhysRevLett.96.087802Search in Google Scholar PubMed

[25] Dzubiella, J., Swanson, J. M. J., & McCammon, J. A. (2006). Coupling nonpolar and polar solvation free energies in implicit solvent models. The Journal of Chemical Physics, 124(8), 084905. DOI: https://doi.org/10.1063/1.2171192.10.1063/1.2171192Search in Google Scholar PubMed

[26] Evans, L. C., & Gariepy, R. F. (1992). Measure theory and fine properties of functions. Studies in Advanced Mathematics, CRC Press, Boca Raton, FL. Search in Google Scholar

[27] Feig, M., & Brooks III, C. L. (2004). Recent advances in the development and application of implicit solvent models in biomolecule simulations. Current Opinion in Structural Biology, 14, 217–224. DOI: https://doi.org/10.1016/j.sbi.2004.03.009.10.1016/j.sbi.2004.03.009Search in Google Scholar PubMed

[28] Fogolari, F., & Briggs, J. M. (1997). On the variational approach to Poisson-Boltzmann free energies. Chemical Physics Letters, 281(1), 135–139. DOI: https://doi.org/10.1016/S0009-2614(97)01193-7.10.1016/S0009-2614(97)01193-7Search in Google Scholar

[29] Fogolari, F., Brigo, A., & Molinari, H. (2002). The Poisson-Boltzmann equation for biomolecular electrostatics: a tool for structural biology. Journal of Molecular Recognition, 15(6), 377–92. DOI: https://doi.org/10.1002/jmr.577.10.1002/jmr.577Search in Google Scholar PubMed

[30] Gallicchio, E., Kubo, M. M., & Levy, R. M. (2000). Enthalpy-entropy and cavity decomposition of alkane hydration free energies: Numerical results and implications for theories of hydrophobic solvation. Journal of Physical Chemistry B, 104(26), 6271–85. DOI: https://doi.org/10.1021/jp0006274.10.1021/jp0006274Search in Google Scholar

[31] Gallicchio, E., & Levy, R. M. (2004). AGBNP: An analytic implicit solvent model suitable for molecular dynamics simulations and high-resolution modeling. Journal of Computational Chemistry, 25(4), 479–499. DOI: https://doi.org/10.1002/jcc.10400.10.1002/jcc.10400Search in Google Scholar PubMed

[32] Gallicchio, E., Zhang, L. Y., & Levy, R. M. (2002). The SGB/NP hydration free energy model based on the surface generalized Born solvent reaction field and novel nonpolar hydration free energy estimators. Journal of Computational Chemistry, 23(5), 517–29. DOI: https://doi.org/10.1002/jcc.10045.10.1002/jcc.10045Search in Google Scholar PubMed

[33] Gilbarg, D., & Trudinger, N. S. (1983). Elliptic partial differential equations of second order. Volume 224 of Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences], Springer-Verlag, Berlin, second edition. Search in Google Scholar

[34] Grant, J. A., Pickup, B. T., Sykes, M. T., Kitchen, C. A., & Nicholls, A. (2007). The Gaussian Generalized Born model: application to small molecules. Physical Chemistry Chemical Physics, 9, 4913–22. DOI: https://doi.org/10.1039/B707574J.10.1039/b707574jSearch in Google Scholar PubMed

[35] Grochowski, P., & Trylska, J. (2007). Continuum molecular electrostatics, salt effects and counterion binding. a review of the Poisson-Boltzmann theory and its modifications. Biopolymers, 89(2), 93–113. DOI: https://doi.org/10.1002/bip.20877.10.1002/bip.20877Search in Google Scholar PubMed

[36] Horng, T.-L. (2020). Review and modification of entropy modeling for steric effects in the Poisson-Boltzmann equation. Entropy, 22(6), Paper 632, 15. DOI: https://doi.org/10.3390/e22060632.10.3390/e22060632Search in Google Scholar PubMed PubMed Central

[37] Kilic, M. S., Bazant, M. Z., & Ajdari, A. (2007). Steric effects in the dynamics of electrolytes at large applied voltages. i. double-layer charging. Physical Review E, 75, 021502. DOI: https://doi.org/10.1103/PhysRevE.75.021502.10.1103/PhysRevE.75.021502Search in Google Scholar PubMed

[38] Kilic, M. S., Bazant, M. Z., & Ajdari, A. (2007). Steric effects in the dynamics of electrolytes at large applied voltages. ii. modified poisson-nernst-planck equations. Physical Review E, 75, 021503. DOI: https://doi.org/10.1103/PhysRevE.75.021503.10.1103/PhysRevE.75.021503Search in Google Scholar PubMed

[39] Kuhn, L. A., Siani, M. A., Pique, M. E., Fisher, C. L., Getzoff, E. D., & Tainer, J. A. (1992). The interdependence of protein surface topography and bound water molecules revealed by surface accessibility and fractal density measures. Journal of Molecular Biology, 228(1), 13–22. DOI: https://doi.org/10.1016/0022-2836(92)90487-5.10.1016/0022-2836(92)90487-5Search in Google Scholar PubMed

[40] Lamm, G. (2003). The Poisson-Boltzmann equation. In: Lipkowitz, K. B., Larter, R., & Cundari, T. R., editors, Reviews in Computational Chemistry, (pp. 147–366). John Wiley and Sons, Inc., Hoboken, N.J. 10.1002/0471466638.ch4Search in Google Scholar

[41] LeCrone, J., Shao, Y., & Simonett, G. (2020). The surface diffusion and the Willmore flow for uniformly regular hypersurfaces. Discrete Contin. Dyn. Syst. Ser. S, 13(12), 3503–3524. DOI: https://doi.org/10.48550/arXiv.1901.00208.10.3934/dcdss.2020242Search in Google Scholar

[42] Levy, R. M., Zhang, L. Y., Gallicchio, E., & Felts, A. K. (2003). On the nonpolar hydration free energy of proteins: surface area and continuum solvent models for the solute–solvent interaction energy. Journal of the American Chemical Society, 125(31), 9523–9530. DOI: https://doi.org/10.1021/ja029833a.10.1021/ja029833aSearch in Google Scholar PubMed

[43] Li, B. (2009). Minimization of electrostatic free energy and the Poisson-Boltzmann equation for molecular solvation with implicit solvent. SIAM Journal on Mathematical Analysis, 40(6), 2536–2566. DOI: https://doi.org/10.1137/080712350.10.1137/080712350Search in Google Scholar

[44] Li, B., & Liu, Y. (2015). Diffused solute–solvent interface with Poisson-Boltzmann electrostatics: free-energy variation and sharp-interface limit. SIAM Journal on Applied Mathematics, 75(5), 2072–2092. DOI: https://doi.org/10.1137/15M100701X.10.1137/15M100701XSearch in Google Scholar PubMed PubMed Central

[45] Li, C., Li, L., Zhang, J., & Alexov, E. (2012). Highly efficient and exact method for parallelization of grid-based algorithms and its implementation in delphi. Journal of Computational Chemistry, 33(24), 1960–1966. DOI: https://doi.org/10.1002/jcc.23033.10.1002/jcc.23033Search in Google Scholar PubMed PubMed Central

[46] Li, L., Li, C., Zhang, Z., & Alexov, E. (2013). On the dielectric constant of proteins: Smooth dielectric function for macromolecular modeling and its implementation in delphi. Journal of Chemical Theory and Computation, 9(4), 2126–2136. DOI: https://doi.org/10.1021/ct400065j.10.1021/ct400065jSearch in Google Scholar PubMed PubMed Central

[47] Mobley, D. L., Dill, K. A., & Chodera, J. D. (2008). Treating entropy and conformational changes in implicit solvent simulations of small molecules. The Journal of Physical Chemistry B, 112(3), 938–946. DOI: https://doi.org/10.1021/jp0764384.10.1021/jp0764384Search in Google Scholar PubMed PubMed Central

[48] Mongan, J., Simmerling, C., McCammon, J. A., Case, D. A., & Onufriev, A. (2007). Generalized Born model with a simple, robust molecular volume correction. Journal of Chemical Theory and Computation, 3(1), 159–69. DOI: https://doi.org/10.1021/ct600085e.10.1021/ct600085eSearch in Google Scholar PubMed PubMed Central

[49] Pierotti, R. A. (1976). A scaled particle theory of aqueous and nonaqeous solutions. Chemical Reviews, 76(6), 717–726. DOI: https://doi.org/10.1021/cr60304a002.10.1021/cr60304a002Search in Google Scholar

[50] Prüss, J., & Simonett, G. (2016). Moving interfaces and quasilinear parabolic evolution equations, volume 105 of Monographs in Mathematics, Birkhäuser/Springer, Cham. 10.1007/978-3-319-27698-4Search in Google Scholar

[51] Shao, Y., Chen, Z., & Shan, Z. (2024). A p-laplacian approach and its analysis for the calculation of ensemble average solvation energy. Communications in Information and Systems, 24(3), 275–311. DOI: https://doi.org/10.4310/CIS.241021231643.10.4310/CIS.241021231643Search in Google Scholar

[52] Sharp, K. A., & Honig, B. (1990). Calculating total electrostatic energies with the nonlinear Poisson-Boltzmann equation. The Journal of Physical Chemistry, 94(19), 7684–7692. DOI: https://doi.org/10.1021/j100382a068.10.1021/j100382a068Search in Google Scholar

[53] Spolar, R. S., Ha, J. H., & Record Jr, M. T. (1989). Hydrophobic effect in protein folding and other noncovalent processes involving proteins. Proceedings of the National Academy of Sciences of the United States of America, 86(21), 8382–8385. DOI: https://doi.org/10.1073/pnas.86.21.8382.10.1073/pnas.86.21.8382Search in Google Scholar PubMed PubMed Central

[54] Stillinger, F. H. (1973). Structure in aqueous solutions of nonpolar solutes from the standpoint of scaled-particle theory. Journal of Solution Chemistry, 2, 141–158. DOI: https://doi.org/10.1007/BF00651970.10.1007/BF00651970Search in Google Scholar

[55] Swanson, J. M. J., Mongan, J., & McCammon, J. A. (2005). Limitations of atom-centered dielectric functions in implicit solvent models. Journal of Physical Chemistry B, 109(31), 14769–72. DOI: https://doi.org/10.1021/jp052883s.10.1021/jp052883sSearch in Google Scholar PubMed

[56] Tjong, H., & Zhou, H. X. (2007). GBr6NL: A generalized Born method for accurately reproducing solvation energy of the nonlinear Poisson-Boltzmann equation. Journal of Chemical Physics, 126, 195102. DOI: https://doi.org/10.1063/1.2735322.10.1063/1.2735322Search in Google Scholar PubMed

[57] Wagoner, J. A., & Baker, N. A. (2006). Assessing implicit models for nonpolar mean solvation forces: the importance of dispersion and volume terms. Proceedings of the National Academy of Sciences of the United States of America, 103(22), 8331–6. DOI: https://doi.org/10.1073/pnas.0600118103.10.1073/pnas.0600118103Search in Google Scholar PubMed PubMed Central

[58] Wang, B., & Wei, G. (2015). Parameter optimization in differential geometry based solvation models. The Journal of Chemical Physics, 143(13), 134119. DOI: http://dx.doi.org/10.1063/1.4932342.10.1063/1.4932342Search in Google Scholar PubMed PubMed Central

[59] Wei, G.-W. (2010). Differential geometry based multiscale models. Bulletin of Mathematical Biology, 72(6), 1562–1622. DOI: https://doi.org/10.1007/s11538-010-9511-x.10.1007/s11538-010-9511-xSearch in Google Scholar PubMed PubMed Central

[60] Wei, G. W., & Baker, N. A. (2016). Differential geometry-based solvation and electrolyte transport models for biomolecular modeling: a review. In: Many-Body Effects and Electrostatics in Biomolecules, (pp. 435–480). United States: Jenny Stanford Publishing. 10.1201/b21343-16Search in Google Scholar

[61] Wei, G. W., Sun, Y. H., Zhou, Y. C., & Feig, M. (2005). Molecular multiresolution surfaces. arXiv:math-ph/0511001v1, pp. 1–11. Search in Google Scholar

[62] Zhao, S. (2011). Pseudo-time-coupled nonlinear models for biomolecular surface representation and solvation analysis. International Journal for Numerical Methods in Biomedical Engineering, 27(12), 1964–1981. DOI: https://doi.org/10.1002/cnm.1450.10.1002/cnm.1450Search in Google Scholar

[63] Zhao, Y., Kwan, Y.-Y., Che, J., Li, B., & McCammon, J. A. (2013). Phase-field approach to implicit solvation of biomolecules with coulomb-field approximation. The Journal of Chemical Physics, 139(2), 024111. DOI: https://doi.org/10.1063/1.4812839.10.1063/1.4812839Search in Google Scholar PubMed PubMed Central

[64] Zhou, Y. (2020). On curvature driven rotational diffusion of proteins on membrane surfaces. SIAM Journal on Applied Mathematics, 80(1), 359–381. DOI: https://doi.org/10.1137/19M1241714.10.1137/19M1241714Search in Google Scholar

Received: 2023-12-08
Revised: 2024-06-18
Accepted: 2024-11-25
Published Online: 2024-12-31

© 2024 the author(s), published by De Gruyter

This work is licensed under the Creative Commons Attribution 4.0 International License.

Articles in the same Issue

  1. Special Issue: Recent Trends in Mathematical Biology – Theory, Methods, and Applications
  2. Editorial for the Special Issue: Recent trends in mathematical Biology – Theory, methods, and applications
  3. Behavior of solutions of a discrete population model with mutualistic interaction
  4. Influence of media campaigns efforts to control spread of COVID-19 pandemic with vaccination: A modeling study
  5. Optimal control and bifurcation analysis of SEIHR model for COVID-19 with vaccination strategies and mask efficiency
  6. A mathematical study of the adrenocorticotropic hormone as a regulator of human gene expression in adrenal glands
  7. On building machine learning models for medical dataset with correlated features
  8. Analysis and numerical simulation of fractional-order blood alcohol model with singular and non-singular kernels
  9. Stability and bifurcation analysis of a nested multi-scale model for COVID-19 viral infection
  10. Augmenting heart disease prediction with explainable AI: A study of classification models
  11. Plankton interaction model: Effect of prey refuge and harvesting
  12. Modelling the leadership role of police in controlling COVID-19
  13. Robust H filter-based functional observer design for descriptor systems: An application to cardiovascular system monitoring
  14. Regular Articles
  15. Mathematical modelling of COVID-19 dynamics using SVEAIQHR model
  16. Optimal control of susceptible mature pest concerning disease-induced pest-natural enemy system with cost-effectiveness
  17. Correlated dynamics of immune network and sl(3, R) symmetry algebra
  18. Variational multiscale stabilized FEM for cardiovascular flows in complex arterial vessels under magnetic forces
  19. Assessing the impact of information-induced self-protection on Zika transmission: A mathematical modeling approach
  20. An analysis of hybrid impulsive prey-predator-mutualist system on nonuniform time domains
  21. Modelling the adverse impacts of urbanization on human health
  22. Markov modeling on dynamic state space for genetic disorders and infectious diseases with mutations: Probabilistic framework, parameter estimation, and applications
  23. In silico analysis: Fulleropyrrolidine derivatives against HIV-PR mutants and SARS-CoV-2 Mpro
  24. Tangleoids with quantum field theories in biosystems
  25. Analytic solution of a fractional-order hepatitis model using Laplace Adomian decomposition method and optimal control analysis
  26. Effect of awareness and saturated treatment on the transmission of infectious diseases
  27. Development of Aβ and anti-Aβ dynamics models for Alzheimer’s disease
  28. Compartmental modeling approach for prediction of unreported cases of COVID-19 with awareness through effective testing program
  29. COVID-19 transmission dynamics in close-contacts facilities: Optimizing control strategies
  30. Modeling and analysis of ensemble average solvation energy and solute–solvent interfacial fluctuations
  31. Application of fluid dynamics in modeling the spatial spread of infectious diseases with low mortality rate: A study using MUSCL scheme
Downloaded on 20.12.2025 from https://www.degruyterbrill.com/document/doi/10.1515/cmb-2024-0017/html
Scroll to top button