Home Ionic surfactants critical micelle concentration prediction in water/organic solvent mixtures by artificial neural network
Article Publicly Available

Ionic surfactants critical micelle concentration prediction in water/organic solvent mixtures by artificial neural network

  • Anton Soria-Lopez

    Anton Soria-Lopez is a PhD student in the Agri-environmental and Food Research Group at the Campus of Ourense (University of Vigo). His research interests focus on the application of Artificial Neural Networks to chemical and biological problems.

    , María García-Martí

    María García-Martí is a postdoctoral researcher in the Agri-Environmental and Food Research Group at the University of Ourense. Her research interests focus on food chemistry and environmental research.

    , Enrique Barreiro

    Enrique Barreiro is a member of the School of Computer Science Engineering at Ourense (University of Vigo). His research interest is focused on big data.

    and Juan C. Mejuto

    Juan C. Mejuto is currently a full professor in the Department of Physical Chemistry at the University of Vigo. He is the head of the Agri-environmental and Food Research Group at the Campus of Ourense. His research interests include (i) physical organic and physical inorganic chemistry, (ii) reactivity mechanisms in homogeneous and micro-heterogeneous media, (iii) stability of self-assembling aggregates and (iv) supramolecular chemistry.

    EMAIL logo
Published/Copyright: September 26, 2024
Become an author with De Gruyter Brill

Abstract

Critical micellar concentration (CMC) is a key physicochemical property of surfactants used to study their behaviour. This property is affected by factors such as temperature, pressure, pH, the type of organic solvent/water mixture, the chemical structure of the surfactants and the presence of electrolytes. Most of the existing studies in the literature have predicted the CMC under fixed conditions based on the chemical parameters of the surfactant. In this study, a machine learning approach using artificial neural network (ANN) models was used to estimate the CMC of some ionic surfactants. These models considered variables defining both the organic solvent-water mixture (T, molecular weight, molar fraction and log P) and the chemical structure of the surfactant (number of atoms of each element). A database consisting of a total of 258 CMC values for 10 ionic surfactants was collected from the literature. The ANN architecture consisting of an input layer with 12 neurons, an intermediate layer with 25 neurons and one neuron in the output layer is proposed. According to the results, the normalized ANN models provided the best statistical adjustments for the CMC prediction. These ANN models could be a promising method for CMC estimation.

1 Introduction

Surfactants are organic compounds of an amphiphilic nature consisting of two different polarity zones with a unique structure. They have a hydrophilic group, which has affinity for the polar phase and is located at the head of the compound, and a hydrophobic group, located at the tail, which has an affinity for the non-polar phase. 1 , 2 These compounds can be produced chemically (synthetic surfactants) or being based on biological materials (biosurfactants). 3 According to Hussain et al., 4 surfactants can be classified into four groups based on the charge of the head group: nonionic, zwitterionic, anionic and cationic, playing an important role in transitions and aggregation structures depending on that. 5 Thanks to these characteristics and their propensity to self-assemble allow them to form micelles. 6 Surfactants have been widely used to reduce the surface tension between the two immiscible phases. 7 , 8 Their applications are related to different purposes such a pharmaceutical and cosmetic uses, 9 , 10 detergents, 11 , 12 wetting agents 13 and food additives 14 among others.

Nowadays, self-assembly is an important phenomenon in daily life and science where molecules are organized into stable aggregates. An example of this phenomenon of molecular aggregation in biochemical systems is the formation of micelles. 15 Micelles are aggregates of oriented surfactants in an aqueous solution, where the interior is hydrophobic and the exterior is hydrophilic part. 16 According to Schork, 17 the micelles contain about 50–100 monomeric units, and they can improve the solubility of hydrophobic substances. 18 The critical micelle concentration (CMC) is the minimum concentration at which the micelle begins to form. 19 Each surfactant molecule presents a certain CMC value that depends on factors including temperature and electrolyte concentration. 20 , 21 When the concentration of surfactants is higher than the CMC, they tend to aggregate forming thermodynamically stable micelles. 22 The study of CMC is based on drastic changes in macroscopic parameters. 23 For this reason, the physicochemical methods are used for measuring and determination of CMC, such as surface tension, conductivity, osmotic pressure, density and light dispersion. 24 , 25 Finally, the kinetics of micellization have an important role in many applications. Therefore, understanding the physicochemical properties of the surfactant’s solution is crucial from both a technological and environmental perspective. 26

The temperature over the CMC of surfactants in water does not follow a linear trend. The CMC initially decreases with increasing temperature until reaching a minimum, and then increases at higher temperatures. It is known that higher temperatures lead to the formation of micelles more easily due to the lower hydration of the hydrophilic group of the surfactant, but also to the destruction of micelles due to the rupture of the structured water molecules that surround the hydrophobic part of the surfactant. 27 Furthermore, the addition of salt to anionic surfactant solutions generally conduct to a reduction in surface tension, being more significant at higher salt concentrations. This effect can be explained by electrostatic interactions that facilitate the migration of surfactant monomers towards the interface. 27 , 28 According to Niraula et al., 29 when adding an electrolyte, micelles will generally form at a concentration lower than the CMC of the pure surfactant. Physical-chemical properties such as the degree of the ionization, reaction rates and phase separation are affected when incorporating the additive to a set of surfactants. 30 On the other hand, the presence of alcohols in water leads to a decomposition of the water structure and the reduction of its dielectric constant, which contributes to the increase of CMC. 31

Because the self-aggregation of this surfactant in solution is controlled by CMC, accurately determining CMC values is of great scientific interest. One way to reliably obtain these values is experimental measurements. However, this method is considered challenging, particularly at high temperatures and pressures, due to the time-consuming and expensive nature of these procedures. 27 For this reason, the application of prediction methods and mathematical models can be a good practical alternative in this area. 27 In this sense, QSPR models using different linear regression techniques such as multiple linear regression (MLR) and partial least squares (PLS) that are widely used for CMC predictions. 32 , 33 , 34 , 35

On the other hand, algorithms based on machine learning (ML), very useful today, are also used for the study of surfactants due to their potential to overcome the challenges associated with experimental and simulation approaches. 36 In fact, using an machine learning (ML) model within QSPR (ML-QSPR) has the advantage that it allows finding those most revealing molecular descriptors for the target variables based solely on the analysis of the relationships between the input (descriptors) and the output (target variables). 37 Among ML models, artificial neural network (ANN) model can be applied to QSPR studies as a nonlinear regression model. 35 , 38 , 39 , 40 Although ANNs provide robust predictable capabilities compared to alternative methods, their application on surfactants represents an innovative approach. 41

According to Abooali et al., 27 most studies that use QSPR models to estimate CMC are developed from chemical descriptors at constant room temperature (20–25 °C) and use pure water as a solvent without salinity. However, as has been shown, the temperature, the presence of additives and the solvent composition influence the CMC. Furthermore, CMC is significantly affected by other factors such as the chemical structure of the surfactant, pressure, and pH of the solution. 42 For these reasons, it is interesting to approach this research considering these multiple variables.

The aim of this study is to develop ANN models to estimate the CMC for 10 ionic surfactants (six cationic and four anionic). The experimental data were collected from various scientific publications in the literature. The effective parameters on CMC such as temperature, composition of the surfactant chemical formula and the nature of the solution were incorporated as variables in prediction models. Furthermore, the alcohol properties such as log P (log K WO), molecular weight and molar fraction were also included. Indeed, log P is the ratio of that substance’s concentrations in a two-phase mixture made up of two immiscible solvents in equilibrium – n-octanol and water. It calculates the differential solubility of a solute in these two solvents. It enables the establishment of a substance’s hydrophobicity scale.

2 Materials and methods

2.1 Experimental dataset

The database used in this research was compiled from bibliographic sources. The CMC values of different ionic surfactants in mixtures of organic solvents and pure water, measured by conductivity were collected at different temperatures. All experimental data were then organized with the Microsoft Excel 2016 program. The total data obtained is composed of 258 experimental cases. Table 1 shows the references used in this research to obtain the CMC values (in mmol L−1) for each ionic surfactant. In total, 10 ionic surfactants have been chosen (six cationic and four anionic). The cationic surfactants studied include benzyldodecyldimethyl ammonium bromide (BDAB), cetylpyridinium chloride (CPyCl), cetyltrimethylammonium bromide (CTAB), dodecylpyridinium chloride (DPC), dodecyltrimethyl ammonium bromide (DTAB), and tetradecyltrimethyl ammonium bromide (TTAB). The anionic ones studied include sodium deoxycholate (SDC), sodium dodecylbenzenesulfonate (SDBS), sodium dodecyl sulfate (SDS), and sodium N-lauroylsarcosinate (SDDS).

Table 1:

Ionic surfactants used in this study and the mixed solvent analyzed.

Surfactant Ref.
Name Solvent mixture Molecular formula
Anionic
Sodium deoxycholate (SDC) Ethanol/water C 24 H 39 Na O 4 43
Sodium dodecylbenzene sulphonate (SDBS) Ethylene glycol/water C 18 H 29 Na O 3 S 44
Sodium dodecylsulfate (SDS) Methanol/water C 12 H 25 Na O 4 S 21 , 29 , 45 , 46 , 47
Ethanol/water
Ethylene glycol/water
Isopropanol/water
Sodium N-lauroylsarcosinate (SDDS) Isopropanol/water C 15 H 28 NNa O 3 48
Cationic
Benzyldodecyldimethyl ammonium bromide (BDAB) Methanol/water C 21 H 38 NBr 49
Ethylene glycol/water
Cetylpyridinium chloride (CPyCl) Methanol/water C 21 H 38 NCl 31 , 50 , 51
Ethanol/water
Isopropanol/water
Cetyltrimethylammonium bromide (CTAB) Ethanol/water C 19 H 42 NBr 45 , 46 , 47 , 52 , 53
Isopropanol/water
Ethylene glycol/water
Docecylpyridinium chloride (DPC) Ethanol/water C 17 H 30 NCl 54
Dodecyltrimethyl ammonium bromide (DTAB) Methanol/water C 15 H 34 NBr 55 , 56
Ethanol/water
Tetradecyltrimethyl ammonium bromide (TTAB) Ethylene glycol/water C 17 H 38 NBr 57

A description of the database variables used for the development of prediction models are summarized in Table 2. A total of 12 input variables have been used to predict CMC (output variable). The input variables can be divided into two descriptors: those that define the surfactant and those that define the solvent (organic solvent-water mixture). Regarding surfactant, the number of atoms of each element that the chemical formula of each surfactant is composed of have been included: number of carbons, hydrogen, bromine, chlorine, nitrogen, sodium, oxygen and sulphur. Regarding the solvent, the molar fraction, molecular weight (in g mol−1), octanol/water partition coefficient defined as log P (K WO) and temperature (in K) were included. Finally, the output variable is the logarithmic value of CMC (in mol L−1).

Table 2:

Description of variables used in this study to performance prediction models.

Variable Description Units
Inputs
Atoms of C Atoms of carbon contained in the chemical formula of the surfactant Number
Atoms of H Atoms of hydrogen contained in the chemical formula of the surfactant Number
Atoms of Br Atoms of bromine contained in the chemical formula of the surfactant Number
Atoms of Cl Atoms of chlorine contained in the chemical formula of the surfactant Number
Atoms of N Atoms of nitrogen contained in the chemical formula of the surfactant Number
Atoms of Na Atoms of sodium contained in the chemical formula of the surfactant Number
Atoms of O Atoms of oxygen contained in the chemical formula of the surfactant Number
Atoms of S Atoms of sulphur contained in the chemical formula of the surfactant Number
Molar fraction Molar fraction of solvent Dimensionless
Molecular weight Molecular weight of the organic solvent g mol−1
logP Octanol-water partition coefficient of solvent Dimensionless
Temperature Solvent temperature when CMC value was measured K
Output
Log CMC The critical micellar concentration on a logarithmic scale mol L−1 (M)

The organic solvents dissolved in pure water used in this research were: acetone and the following alcohols namely methanol, ethanol, isopropanol (propan-2-ol) and ethylene glycol (ethane-1,2-diol). To ensure uniformity of units, all mass and volume fractions of solvents taken from bibliography were converted to molar fraction. The temperature of the mixture of organic solvent in water varied in a range between 298.15 K and 323.15 K. The molecular weight and log P (K WO) of each organic solvent were obtained from the open chemical database PubChem. 58 Finally, the CMC values (in mmol L−1) were transformed to the logarithmic form of log CMC (in mol L−1) to ensure the linear distribution.

Finally, these 258 experimental cases were divided into three groups before implementing the models: training (T), validation (V), and testing (Z). T group is used to train the model, V group is used to select the best model generated from the training group, and Z group is used to assess the model’s performance with data not used in training (model generalization ability). For this research, the database was randomly divided as follows: 70 % for training (T), 20 % for validation (V), and 10 % for testing (Z).

2.2 Artificial neural networks

Artificial neural network (ANN) is a class of machine learning model that mimics the structure and function of biological neural networks present in the human brain. 59 This prediction algorithm is widely used for complex pattern modelling and regression problems. 60 Furthermore, ANN can find and learn hidden patterns between input data and target sets. 61

There are different classes of ANNs according to its structure, where the multiplayer perceptron (MLP) is the simplest and used for different fields of prediction and classifications. 62 The MPL is a fully connected multilayer neural network composed of neurons located in three areas of the network: an input layer, one or more hidden/intermediate layers, and an output layer. 63 , 64 The input layer receives information and is responsible for transferring it to the hidden layer, which then transfers this information to the output layer. 65 This transmission between different layers is controlled by the propagation function. 66

The operation of ANN can be described as follows: random weights are assigned to neurons located in the different interconnected layers to solve mathematical operations. 67 These neurons recognize the relationship between target values and input data sets. 68 The input information received by these neurons is structured in different layers to convert them into a single output. 69 This output is then sent to the other layer to perform the same sequence and generate the output. 70 The output of each neuron is produced from a series of operations performed by the activation function. 66 The main activation functions are linear, logarithmic and tan sigmoid. 70

According to Gue et al., 71 the backpropagation algorithm is the most used learning algorithm. It minimizes the least squares error between the network output signal with the signal propagating forward and the desired response. 64 This algorithm corrects the weights between different neurons according to the error output. This process continues until the desired value is achieved. 66

In this research, we used a multilayer neural network algorithm with one hidden layer to collect data from different training cases. The ANN models were constructed using various topologies, with the number of hidden neurons varied between 1 to 2n+1 (where n is the number of input variables). According to number of input variables used in this study, the number of hidden neurons varied between one and 25 (Figure 1). The backpropagation algorithm was used as a learning algorithm. In addition, the sigmoid function was used as an activation function.

Figure 1: 
Scheme of ANN with a topology of 12-25-1 (12, 25 and one neurons in the input, hidden and output layer, respectively) to model CMC. n is the number of neurons.
Figure 1:

Scheme of ANN with a topology of 12-25-1 (12, 25 and one neurons in the input, hidden and output layer, respectively) to model CMC. n is the number of neurons.

For all ANN models developed, the input variables included molar fraction, molecular weight, log P, temperature and number of atoms of each element present in the chemical formula of each ionic surfactant. Finally, log CMC was the output variable (the variable to forecast).

The hyper-parameters used to develop the ANN models included training cycles (ranging between 1 and 524,288) in 19 steps, in linear (designed without subscript) or logarithm scale (designed with a subscript L) and decay (true or false). The hyper-parameters were determined using the trial-and-error procedure. Using these two hyper-parameter combinations, six different ANN approaches were developed (ANN, ANN L , ANN R , ANN R-L , ANN Z , and ANN Z-L ). Four ANN models were normalized to a determined range because the data showed different scales and units. The main aim of the normalization process is to prevent any of the variables from having a disproportionate effect on the training of the model. Two normalization methods, that were used to input and output variables, were applied: range transformation, designed with a subscript R, varying between −1 and 1 (ANN R ) and Z-transformation, designed with a subscript Z (ANN R ). These methods were applied to all data groups in the following order: first to the training data and then to the validation and test data. The results were then de-normalized to compare between them. This transformation is necessary since it allows the predictions obtained to be interpreted in their original context.

2.3 Metrics

Once the models were developed, three statistical parameters were used to evaluate their performance defined as the measurement of forecasting capability. Root mean square error (RMSE), mean absolute percentage error (MAPE) and the linear squared correlation coefficient (R 2) were the parameters used in this research. All of them were calculated for training, validation and testing groups. According to 72 , 73 RMSE (1) MAPE (2) and R 2 (3) equations are the following:

(1) RMSE = 1 N i = 1 N ( y real y pred ) 2

(2) MAPE = 100 % N i = 1 N | y pred y real y real |

(3) R 2 = 1 ( y real y pred ) 2 ( y real y real ) 2

where y real are experimental data, y pred are the data predicted by the model, y real is mean value of y real and N are the total amount of data. Lower values of RMSE and MAPE (closer to value 0) and higher values of R 2 (closer to value 1), lead to better prediction models. 74

2.4 Software

The different machine learning models were developed using the RapidMiner Studio Educational 10.2.000 version (RapidMiner GmbH). The computational equipment used were an Intel® Core(TM) i9-10900K CPU at 3.70 GHz processor with 64 GB RAM installed and with Windows 11 Pro. Figure 1 was made with Microsoft PowerPoint 2016 (Microsoft) and the scatter plots (Figure 2) was made with SigmaPlot 14.0 (Systat Software Inc.).

Figure 2: 
Scatter plots showing the real and predicted values of log CMC real (axis x), and log CMC predicted (axis y) for the ANN
Z
 model. The red dashed line corresponds to the line with slope one.
Figure 2:

Scatter plots showing the real and predicted values of log CMC real (axis x), and log CMC predicted (axis y) for the ANN Z model. The red dashed line corresponds to the line with slope one.

3 Results and discussion

In this research, six different models were developed using ANN algorithm to predict the CMC value for ionic surfactants. To evaluate the performance of each ANN model, three statistical parameters were utilized. Table 3 shows the six ANN models alongside their corresponding statistical parameters values.

Table 3:

Adjustments to the real and predicted values obtained by the six ANN models developed. RMSE is root mean square error (M), MAPE is mean absolute percentage error (%). and R 2 is the linear squared correlation coefficient.

Model Training Validation Testing
RMSE MAPE R 2 RMSE MAPE R 2 RMSE MAPE R 2
ANN 0.249 16.5 0.737 0.226 13.5 0.866 0.164 11.3 0.844
ANN L 0.257 17.3 0.718 0.228 13.8 0.865 0.182 12.2 0.802
ANN R 0.028 1.6 0.997 0.046 3.0 0.993 0.064 2.9 0.977
ANN R-L 0.029 1.7 0.997 0.044 3.0 0.995 0.061 3.0 0.981
ANN Z 0.025 1.5 0.997 0.040 2.7 0.995 0.069 3.5 0.970
ANN Z-L 0.030 2.0 0.996 0.050 3.0 0.992 0.064 3.1 0.979

According to Table 3, the best adjustments for the training and validation phases were observed in the ANN models that were normalized on both linear and logarithmic scales. Significant differences in all statistical parameter values can be observed between the unnormalized and normalized ANN models. This discrepancy is due to the normalization process, which ensures that data measured on different scales do not disproportionately influence the model, especially since the variables are measured in different units. The four normalized models showed similar RMSE values in the validation phase (between 0.040 M and 0.050 M, being the lowest value for ANN Z model). In terms of MAPE, the values for these four models showed very close values, ranging from 2.7 % and 3.0 % (being the lowest value for ANNZ model), and the linear square correlation coefficient (R 2) values also varied slightly between 0.992 and 0.995. Regarding to the training phase, the RMSE values also showed low oscillations. RMSE values varied between 0.025 M and 0.030 M, being the lowest value for the ANN Z model. In the case of MAPE, the ANN Z model presented the lowest value (1.5 %), while ANN Z-L showed the highest value (2.0 %). Finally, the R 2 parameter exhibited similar values in all models (around 0.997).

Hence, it is demonstrated that the normalized models showed excellent fits for both the training and validation phases. That is, these models trained well with the internal data. However, it is crucial to also verify this performance with external data, i.e. data that was not used to train the model (testing phase) to avoid overfitting problems.

Regarding the test phase, the models exhibited slightly worse adjustments of RMSE, MAPE and R 2 compared to those to the training and validation phases (Table 3). Specifically, RMSE values ranged between 0.061 M (for ANN R-L model) and 0.069 M (for ANN Z model), while MAPE values ranged between 2.9 % (for ANN R model) and 3.5 % (for ANN Z model). Finally, R 2 values were below 0.982 for the four normalized models.

A good model should be able to perform good adjustments in the training and validation phases, but also in the testing phase. Based on the results in Table 3, all normalized ANN models presented good adjustments in the three phases, without problems of model overfitting and with high model generalization capacity.

The best ANN model from the four normalized models was then selected for further study of its performance. The lowest RMSE value in the validation phase was used as the criterion for selecting the best model. According to this, the ANN Z model was the best model with an RMSE value of 0.040 M.

Figure 2 shows scatter plots (predicted vs. observed values) of the ANNZ model across the three phases.

Figure 2A illustrates the dispersion between the CMC values predicted by the ANN Z model compared to those observed for the internal data (training and validation phase). Generally, the points are close to the pointed red dashed line (line with slope 1), although it should be noted that there are some points that present significant deviations from the line. For example, the training phase (−1.31, −1.49) and validation phase (−1.90, −1.78) cases, both located in the lower middle part of the graph, presented the highest absolute error values (0.174 M and 0.120 M, respectively). Regarding the training case, the observed value was −1.31 M, while the model predicted a value of −1.49 M, leading to an absolute percentage error of 13.2 % (overestimation). Regarding the validation case, it showed an absolute percentage error of 6.3 % (underestimation). On the other hand, it should be noted that there are also cases, especially in the lower part of the graph, which are not so deviated from the slope line 1 (low absolute errors), but present significant values of absolute percentage errors. For example, the validation cases (−0.733, −0.819) and (−0.848, −0.941), showed absolute percentage errors of 11.8 % and 11.0 %, respectively. If those cases that contain absolute percentage errors greater than 10 % were excluded, the MAPE value is reduced to 1.58 %.

Figure 2B shows the dispersion for the external data (test phase). Generally, it can be observed that the points are close to the dashed red line with slope 1. However, there is an important case, very deviated from the straight line, which showed important errors. This test case (−1.82, −2.09) is in the low part of the graph, in which it presents the highest values of absolute error and absolute percentage error. Particularly, the model predicted a CMC value of −2.09 M versus −1.82 M, leading to an overestimation of 14.4 %, while the absolute error was 0.263 M. If this case was excluded, the MAPE value is reduced to 3.0 %. It is worth mentioning another case, that is not so easily visible, but which presents a relatively high absolute error. This is the case (−1.95, −1.83), which is located in the lower part of the graph, in which the absolute error is 12.9 M.

There are numerous studies in the literature that have developed quantitative structure-property relationship (QSPR) models to establish relationships between molecular structures and the critical micellar concentration surfactants. Notably, most of these studies focus on a single class of surfactants, particularly anionic. For example, Yuan et al. 75 used a QSPR model with multiple linear regression to predict the CMC of 37 anionic surfactants in aqueous solution at 40 °C. In their study, the octanol/water partition coefficient (log P), the dipole moment of the molecule and the area of the molecule were the selected descriptors, achieving an R2 greater than 0.980. Roberts 76 used QSPR analysis to predict the CMC of anionic surfactants, including ether sulphates and ester sulfonates, using the hydrophobic parameter ðh for the hydrophobic domain of the surfactant. This study achieved good results, with an R 2 of 0.976 for 133 compounds. Another study carried out by Li et al. 77 estimated the CMC of 98 anionic surfactants at 40 °C, using three descriptors: total atom number of the hydrophobic-hydrophilic segment, the dipole moment of the segment, and the maximum net atomic charge on the carbon atom in the segment. Their QSPR model, developed using multiple linear regression techniques, showed an R 2 of 0.980, demonstrating excellent robustness. Roy et al. 78 developed QSPR models using multiple linear regression to model the CMC of 37 anionic surfactants measured at 25 °C in water without additives. This study used extended topochemical atom (ETA) indices along with computed hydrophobicity descriptors. According to the results, the ETA models provided better adjustments (R 2 = 0.923) compared to models without ETA topological descriptors along with hydrophobicity terms (R 2 = 0.885). All these results suggest that the use of the QSPR models is appropriate for predicting the CMC of anionic surfactants.

Machine learning models can also be incorporated in QSPR models. According to Thacker et al., 36 the use of machine learning models for CMC prediction can overcome the existing limits with experimental and simulation approaches. Several studies in the literature have employed machine learning models for this purpose. For example, Rahal et al. 35 developed QSPR models using three regression methods to predict the CMC of 50 anionic surfactants. According to the results, the best adjustments were obtained using multilayer perceptron-artificial neural networks (MLP/ANN). Brozos et al. 79 used the graph neural network (GNN) algorithm together with the QSPR model to estimate, among other properties, the CMCs of different surfactants. According to the results, the GNN predicted quite accurate results for CMC. 79 In another study, Qin et al. 80 used graph convolutional neural network (GCN) to predict CMC from the molecular structure of surfactant. The results concluded that GCNs are suitable and effective for high-throughput screening of surfactants. 80

All of these studies mentioned used mathematical correlations to predict experimental CMC values based on chemical descriptors under fixed temperatures, pure water conditions and without additives. However, as mentioned in this research, physical-chemical properties such as temperature, pH, pressure, salinity of the solution and the type of organic solvent water mixture strongly influence the CMC. To date, there have been few studies where these properties were used as input variables for developing CMC prediction models. In fact, the estimation of CMC at different temperatures is considered novel and innovative. Abooali & Soleimani 27 used different nonlinear models employing two machine learning approaches, stochastic gradient boosting trees (SGB) and genetic programming (GP), to model the CMC of 111 anionic surfactants. Their study used five chemical factors (Lop, CIC2, EEig12x, BEHp2, and G3s) and three physical parameters (temperature, with a range between 273.15 K and 363.15 K, pH and salinity). The results showed that the stochastic gradient boosting (SGB) model had better statistical parameter values (RMSE = 0.019 and R 2 = 0.999) compared to the genetic programming (GP) model (RMSE = 0.166 and R 2 = 0.954). Another study conducted by Boukelkal et al. 38 used QSPR methodology to establish a relationship between the molecular structure of 593 surfactants of different classes (anionic, cationic, nonionic, zwitterionic and Gemini) and the negative logarithmic value of CMC. They used different machine learning models to build QSPR models with 14 molecular descriptors along with temperature, varying from 10 °C to 60 °C. The authors concluded that the support vector regression model with Dragonfly hyperparameter optimization (SVR-DA) provided the best adjustments for predicting pCMC values (RMSE = 0.205 and R 2 = 0.974). 38 Finally, our study is another example in which physical-chemical properties are used as modelling variables for estimating CMC. However, comparing the current study with those existing in the literature is challenging due to the differences in the size of the data set, in the division of the data, in the variables used for the development of the models, and in the modelling methods used to predict CMCs. Despite these differences, ANNZ model (RMSE = 0.035 and R 2 = 0.995), together with the SGB model, showed the best adjustments for the global phase. Therefore, we can conclude that our ANN models have the potential to provide excellent prediction capabilities.

4 Conclusions

CMC estimation is one of the most interesting aspects for both academic and industrial applications involving surfactants. Machine learning-based approaches, such as those performed in this study, have demonstrated to be useful as a cost-effective and time-efficient alternative to traditional laboratory measurements. Our ANN models for predicting the CMC of 10 ionic surfactants were developed using physico-chemical properties of the organic solvent-water mixture and chemical properties of the surfactant. The research concluded that the normalized ANN models were highly accurate based on three statistical parameters. Particularly, the ANNZ model presented the best adjustments, with an RMSE of 0.040 M (R 2 = 0.995) for the validation phase and of 0.069 M (R 2 = 0.970) in the test case. The mean absolute percent errors were 2.7 % and 3.5 %, respectively. In this sense, this implemented algorithm is reliable and effective for predicting CMC values of ionic surfactants. Overall, ANN models show excellent potential as tools for accurate and efficient CMC prediction.

However, we must add that although ANNs are a powerful tool in machine learning and have been shown to be effective in predicting CMC values both in the present work and in previous works by our research group, 66 there are alternatives that may be more appropriate in certain contexts or for specific types of problems. This is the case of support vector machines (SVM) – a type of supervised model whose objective is to find a hyperplane that maximizes the separation between data classes and that can be used for both classification and regression – or random forests (RF) – which are built as a combination of several decision trees and the prediction is made by majority vote of the individual trees. We must point out that these alternatives have been successfully applied by our research group for the prediction and modelling of various issues. 81 , 82 , 83 , 84 A future line of research would be to check the validity of these alternatives applied to the prediction of CMC values. In fact, some authors have already successfully incorporated some of these techniques for modelling the behaviour of different surfactants. 85


Corresponding author: Juan C. Mejuto, Departamento de Química Física, Facultade de Ciencias, Universidade de Vigo, Ourense, 32004, Spain, E-mail:

About the authors

Anton Soria-Lopez

Anton Soria-Lopez is a PhD student in the Agri-environmental and Food Research Group at the Campus of Ourense (University of Vigo). His research interests focus on the application of Artificial Neural Networks to chemical and biological problems.

María García-Martí

María García-Martí is a postdoctoral researcher in the Agri-Environmental and Food Research Group at the University of Ourense. Her research interests focus on food chemistry and environmental research.

Enrique Barreiro

Enrique Barreiro is a member of the School of Computer Science Engineering at Ourense (University of Vigo). His research interest is focused on big data.

Juan C. Mejuto

Juan C. Mejuto is currently a full professor in the Department of Physical Chemistry at the University of Vigo. He is the head of the Agri-environmental and Food Research Group at the Campus of Ourense. His research interests include (i) physical organic and physical inorganic chemistry, (ii) reactivity mechanisms in homogeneous and micro-heterogeneous media, (iii) stability of self-assembling aggregates and (iv) supramolecular chemistry.

Acknowledgments

The authors would like to thank RapidMiner Inc. for the Educational and the free license of RapidMiner Studio software (version 10.2.000).

  1. Research ethics: All authors have read and agreed to the published version of the manuscript.

  2. Author contributions: Conceptualization, J.C.M. and E.B. Methodology, A.S-L. and M.G-A. Validation, A.S-L. and M.G-A. Formal analysis, A.S-L. and M.G-A. Investigation, A.S-L., E.B. and M.G-A. Writing – original draft preparation, A.S-L., E.B. and M.G-A. Writing – review and editing, E.B. and J.C.M. Visualization, A.S-L. Supervision, J.C.M. and E.B. Project administration, J.C.M. Funding acquisition, J.C.M.

  3. Use of Large Language Models, AI and Machine Learning Tools: No Large Language Models, AI or Machine Learning Tools were used in this manuscript. The software used for the implementation of ANNs was acknowledged in the manuscript.

  4. Conflict of interest: The authors declare no conflict of interest.

  5. Research funding: This research received no external funding. This research was supported by an FPU grant from the Spanish Ministry of Science and Innovation (MCINN) to Anton Soria-Lopez (FPU2020/06140).

  6. Data availability: The data presented in this study are available on request from the corresponding authors. The data are not publicly available due to privacy restrictions.

References

1. Poša, M. The Gibbs-Helmholtz Equation and the Enthalpy–Entropy Compensation (EEC) Phenomenon in the Formation of Micelles in an Aqueous Solution of Surfactants and the Cloud Point Effect. J. Mol. Liq. 2024, 396, 124109. https://doi.org/10.1016/j.molliq.2024.124109.Search in Google Scholar

2. Alam, Md. S.; Siddiq, A. M.; Natarajan, D.; Kiran, M. S.; Baskar, G. Physicochemical Properties and Bioactivity Studies of Synthesized Counterion Coupled (COCO) Gemini Surfactant, 1,6-Bis(N,N-hexadecyldimethylammonium) Adipate. J. Mol. Liq. 2019, 273, 16–26. https://doi.org/10.1016/j.molliq.2018.09.082.Search in Google Scholar

3. Aguirre-Ramírez, M.; Silva-Jiménez, H.; Banat, I. M.; Díaz De Rienzo, M. A. Surfactants: Physicochemical Interactions with Biological Macromolecules. Biotechnol. Lett. 2021, 43 (3), 523–535; https://doi.org/10.1007/s10529-020-03054-1.Search in Google Scholar PubMed PubMed Central

4. Hussain, S. M. S.; Kamal, M. S.; Fogang, L. T. Synthesis and Physicochemical Investigation of Betaine Type Polyoxyethylene Zwitterionic Surfactants Containing Different Ionic Headgroups. J. Mol. Struct. 2019, 1178, 83–88; https://doi.org/10.1016/j.molstruc.2018.09.094.Search in Google Scholar

5. Yang, J.; Huang, H.; Zheng, J.; Huang, Y.; Xie, H.; Gao, F. Effect of Head Group of Surfactant on the Self-assembly Structures and Aggregation Transitions in a Mixture of Cationic Surfactant and Anionic Surfactant-like Ionic Liquid. J. Mol. Liq. 2020, 308, 112995. https://doi.org/10.1016/j.molliq.2020.112995.Search in Google Scholar

6. Shaban, S. M.; Kang, J.; Kim, D.-H. Surfactants: Recent Advances and Their Applications. Compos. Commun. 2020, 22, 100537; https://doi.org/10.1016/j.coco.2020.100537.Search in Google Scholar

7. Aguirre-Ramírez, M.; Silva-Jiménez, H.; Banat, I. M.; Díaz De Rienzo, M. A. Surfactants: Physicochemical Interactions with Biological Macromolecules. Biotechnol. Lett. 2021, 43 (3), 523–535; https://doi.org/10.1007/s10529-020-03054-1.Search in Google Scholar

8. Otzen, D. E. Biosurfactants and Surfactants Interacting with Membranes and Proteins: Same but Different? Biochim. Biophys. Acta (BBA) Biomembr. 2017, 1859 (4), 639–649. https://doi.org/10.1016/j.bbamem.2016.09.024.Search in Google Scholar PubMed

9. Ohadi, M.; Shahravan, A.; Dehghannoudeh, N.; Eslaminejad, T.; Banat, I. M.; Dehghannoudeh, G. Potential Use of Microbial Surfactant in Microemulsion Drug Delivery System: A Systematic Review. Drug Des. Dev. Ther. 2020, 541–550; https://doi.org/10.2147/dddt.s232325.Search in Google Scholar PubMed PubMed Central

10. Adu, S. A.; Naughton, P. J.; Marchant, R.; Banat, I. M. Microbial Biosurfactants in Cosmetic and Personal Skincare Pharmaceutical Formulations. Pharmaceutics 2020, 12 (11), 1099; https://doi.org/10.3390/pharmaceutics12111099.Search in Google Scholar PubMed PubMed Central

11. Cheng, K. C.; Khoo, Z. S.; Lo, N. W.; Tan, W. J.; Chemmangattuvalappil, N. G. Design and Performance Optimisation of Detergent Product Containing Binary Mixture of Anionic-Nonionic Surfactants. Heliyon 2020, 6 (5); https://doi.org/10.1016/j.heliyon.2020.e03861.Search in Google Scholar PubMed PubMed Central

12. Hordyjewicz‐Baran, Z.; Wasilewski, T.; Zarębska, M.; Seweryn, A.; Zajszły‐Turko, E.; Stanek‐Wandzel, N.; Chrobak, J. Application of Aggregation Behavior of Nonionic Surfactants to Develop a Smart Detergent for Washing Fruits with Emphasis on Pesticide Residues Removal. J. Surfactants Deterg. 2024, 27 (1), 57–69; https://doi.org/10.1002/jsde.12679.Search in Google Scholar

13. Kovalchuk, N. M.; Simmons, M. J. H. Surfactant-mediated wetting and spreading: recent advances and applications. Curr. Opin. Colloid Interface Sci. 2021, 51, 101375; https://doi.org/10.1016/j.cocis.2020.07.004.Search in Google Scholar

14. Ribeiro, B. G.; Guerra, J. M. C.; Sarubbo, L. A. Biosurfactants: Production and Application Prospects in the Food Industry. Biotechnol. Prog. 2020, 36 (5), e3030; https://doi.org/10.1002/btpr.3030.Search in Google Scholar PubMed

15. Ghosh, S.; Ray, A.; Pramanik, N. Self-assembly of Surfactants: An Overview on General Aspects of Amphiphiles. Biophys. Chem. 2020, 265, 106429; https://doi.org/10.1016/j.bpc.2020.106429.Search in Google Scholar PubMed

16. Poša, M. The Gibbs-Helmholtz Equation and the Enthalpy–Entropy Compensation (EEC) Phenomenon in the Formation of Micelles in an Aqueous Solution of Surfactants and the Cloud Point Effect. J. Mol. Liq. 2024, 396, 124109. https://doi.org/10.1016/j.molliq.2024.124109.Search in Google Scholar

17. Schork, F. J. Monomer Transport in Emulsion Polymerization. Can. J. Chem. Eng. 2022, 100 (4), 645–653. https://doi.org/10.1002/cjce.24075.Search in Google Scholar

18. Alam, A.; Anis-Ul-Haque, K. M.; Khan, J. M.; Kumar, D.; Irfan, M.; Rana, S.; Hoque, M. A.; Kabir, S. E. Assessment of the Assembly Behaviour and Physicochemical Parameters for the Tetradecyltrimethylammonium Bromide and Promazine Hydrochloride Mixture: Impact of Monohydroxy Organic Compounds. Colloid Polym. Sci. 2024, 302 (5), 721–734; https://doi.org/10.1007/s00396-024-05223-4.Search in Google Scholar

19. Kumar, N.; Mandal, A. Thermodynamic and Physicochemical Properties Evaluation for Formation and Characterization of Oil-in-Water Nanoemulsion. J. Mol. Liq. 2018, 266, 147–159. https://doi.org/10.1016/j.molliq.2018.06.069.Search in Google Scholar

20. Tadros, T. F. Applied Surfactants: Principles and Applications; WILEY-VCH Verlag GmbH & Co. KGaA: Weinheim, 2005.Search in Google Scholar

21. El-Dossoki, F. I.; Gomaa, E. A.; Hamza, O. K. Solvation Thermodynamic Parameters for Sodium Dodecyl Sulfate (SDS) and Sodium Lauryl Ether Sulfate (SLES) Surfactants in Aqueous and Alcoholic-Aqueous Solvents. SN Appl. Sci. 2019, 1 (8), 1–17; https://doi.org/10.1007/s42452-019-0974-6.Search in Google Scholar

22. Chirani, M. R.; Kowsari, E.; Teymourian, T.; Ramakrishna, S. Environmental Impact of Increased Soap Consumption during COVID-19 Pandemic: Biodegradable Soap Production and Sustainable Packaging. Sci. Total Environ. 2021, 796, 149013. https://doi.org/10.1016/j.scitotenv.2021.149013.Search in Google Scholar PubMed PubMed Central

23. Astray, G.; Iglesias-Otero, M. A.; Moldes, O. A.; Mejuto, J. C. Predicting Critical Micelle Concentration Values of Non-ionic Surfactants by Using Artificial Neural Networks. Tenside Surfactants Deterg. 2013, 50(2), 118–124. https://doi.org/10.3139/113.110242.Search in Google Scholar

24. Scholz, N.; Behnke, T.; Resch-Genger, U. Determination of the Critical Micelle Concentration of Neutral and Ionic Surfactants with Fluorometry, Conductometry, and Surface Tension—A Method Comparison. J. Fluoresc. 2018, 28 (1), 465–476; https://doi.org/10.1007/s10895-018-2209-4.Search in Google Scholar PubMed

25. Tadros, T. F. Applied Surfactants: Principles and Applications; WILEY-VCH Verlag GmbH & Co. KGaA: Weinheim, 2005.10.1002/3527604812Search in Google Scholar

26. Rahal, S.; Hadidi, N.; Hamadache, M. In Silico Prediction of Critical Micelle Concentration (CMC) of Classic and Extended Anionic Surfactants from Their Molecular Structural Descriptors. Arabian J. Sci. Eng. 2020, 45 (9), 7445–7454; https://doi.org/10.1007/s13369-020-04598-0.Search in Google Scholar

27. Abooali, D.; Soleimani, R. Structure-based Modeling of Critical Micelle Concentration (CMC) of Anionic Surfactants in Brine Using Intelligent Methods. Sci. Rep. 2023, 13 (1), 13361; https://doi.org/10.1038/s41598-023-40466-1.Search in Google Scholar PubMed PubMed Central

28. Rafique, A. S.; Khodaparast, S.; Poulos, A. S.; Sharratt, W. N.; Robles, E. S. J.; Cabral, J. T. Micellar Structure and Transformations in Sodium Alkylbenzenesulfonate (NaLAS) Aqueous Solutions: Effects of Concentration, Temperature, and Salt. Soft Matter 2020, 16 (33), 7835–7844; https://doi.org/10.1039/D0SM00982B.Search in Google Scholar PubMed

29. Niraula, T. P.; Chatterjee, S. K.; Bhattarai, A. Micellization of Sodium Dodecyl Sulphate in Presence and Absence of Alkali Metal Halides at Different Temperatures in Water and Methanol-Water Mixtures. J. Mol. Liq. 2018, 250, 287–294. https://doi.org/10.1016/j.molliq.2017.12.014.Search in Google Scholar

30. Rub, M. A.; Azum, N.; Asiri, A. M. Interaction of Cationic Amphiphilic Drug Nortriptyline Hydrochloride with TX-100 in Aqueous and Urea Solutions and the Studies of Physicochemical Parameters of the Mixed Micelles. J. Mol. Liq. 2016, 218, 595–603. https://doi.org/10.1016/j.molliq.2016.02.049.Search in Google Scholar

31. Bhattarai, A.; Shah, S. K.; Yadav, A. K. Effect of Solvent Composition on the Critical Micelle Concentration of Cetylpyridinium Chloride in Ethanol-Water Mixed Solvent Media. Nepal J. Sci. Technol. 2013, 13 (1), 89–93; https://doi.org/10.3126/njst.v13i1.7446.Search in Google Scholar

32. Anoune, N.; Nouiri, M.; Berrah, Y.; Gauvrit, J.-Y.; Lanteri, P. Critical Micelle Concentrations of Different Classes of Surfactants: A Quantitative Structure Property Relationship Study. J. Surfactants Deterg. 2002, 5, 45–53; https://doi.org/10.1007/s11743-002-0204-2.Search in Google Scholar

33. Roy, K.; Kabir, H. QSPR with Extended Topochemical Atom (ETA) Indices: Exploring Effects of Hydrophobicity, Branching and Electronic Parameters on logCMC Values of Anionic Surfactants. Chem. Eng. Sci. 2013, 87, 141–151. https://doi.org/10.1016/j.ces.2012.10.002.Search in Google Scholar

34. Gaudin, T.; Rotureau, P.; Pezron, I.; Fayet, G. New QSPR Models to Predict the Critical Micelle Concentration of Sugar-Based Surfactants. Ind. Eng. Chem. Res. 2016, 55 (45), 11716–11726; https://doi.org/10.1021/acs.iecr.6b02890.Search in Google Scholar

35. Rahal, S.; Hadidi, N.; Hamadache, M. In Silico Prediction of Critical Micelle Concentration (CMC) of Classic and Extended Anionic Surfactants from Their Molecular Structural Descriptors. Arabian J. Sci. Eng. 2020, 45 (9), 7445–7454; https://doi.org/10.1007/s13369-020-04598-0.Search in Google Scholar

36. Thacker, J. C. R.; Bray, D. J.; Warren, P. B.; Anderson, R. L. Can Machine Learning Predict the Phase Behavior of Surfactants? J. Phys. Chem. B 2023, 127 (16), 3711–3727; https://doi.org/10.1021/acs.jpcb.2c08232.Search in Google Scholar PubMed PubMed Central

37. Seddon, D.; Müller, E. A.; Cabral, J. T. Machine Learning Hybrid Approach for the Prediction of Surface Tension Profiles of Hydrocarbon Surfactants in Aqueous Solution. J. Colloid Interface Sci. 2022, 625, 328–339; https://doi.org/10.1016/j.jcis.2022.06.034.Search in Google Scholar PubMed

38. Boukelkal, N.; Rahal, S.; Rebhi, R.; Hamadache, M. QSPR for the Prediction of Critical Micelle Concentration of Different Classes of Surfactants Using Machine Learning Algorithms. J. Mol. Graph. Model. 2024, 129, 108757. https://doi.org/10.1016/j.jmgm.2024.108757.Search in Google Scholar PubMed

39. Katritzky, A. R.; Pacureanu, L. M.; Slavov, S. H.; Dobchev, D. A.; Karelson, M. QSPR Study of Critical Micelle Concentrations of Nonionic Surfactants. Ind. Eng. Chem. Res. 2008, 47 (23), 9687–9695; https://doi.org/10.1021/ie800954k.Search in Google Scholar

40. Katritzky, A. R.; Pacureanu, L. M.; Slavov, S. H.; Dobchev, D. A.; Shah, D. O.; Karelson, M. QSPR Study of the First and Second Critical Micelle Concentrations of Cationic Surfactants. Comput. Chem. Eng. 2009, 33, 321–332. https://doi.org/10.1016/j.compchemeng.2008.09.011.Search in Google Scholar

41. Belhaj, A. F.; Elraies, K. A.; Alnarabiji, M. S.; Abdul Kareem, F. A.; Shuhli, J. A.; Mahmood, S. M.; Belhaj, H. Experimental Investigation, Binary Modelling and Artificial Neural Network Prediction of Surfactant Adsorption for Enhanced Oil Recovery Application. Chem. Eng. J. 2021, 406, 127081. https://doi.org/10.1016/j.cej.2020.127081.Search in Google Scholar PubMed PubMed Central

42. Katritzky, A. R.; Pacureanu, L.; Dobchev, D.; Karelson, M. QSPR Study of Critical Micelle Concentration of Anionic Surfactants Using Computational Molecular Descriptors. J. Chem. Inf. Model. 2007, 47 (3), 782–793; https://doi.org/10.1021/ci600462d.Search in Google Scholar PubMed

43. Bhattarai, A.; Shah, S. K.; Yadav, A. K.; Adhikari, J. Effect of Solvent Composition on the Critical Micelle Concentration of Sodium Deoxycholate in Ethanol-Water Mixed Solvent Media. Bibechana 2013, 9, 63–68; https://doi.org/10.3126/bibechana.v9i0.7176.Search in Google Scholar

44. Khandelwal, M.; J.S, A.; Rai, B.; Sarasan, G. Thermodynamic Study of Micellization of SDBS in Aqueous and in Binary Solvent Systems of Ethylene Glycol. Int. J. Eng. Res. Technol. 2020, 9 (06), 581–586; https://doi.org/10.17577/ijertv9is060363.Search in Google Scholar

45. Ghimire, Y.; Amatya, S.; Shah, S. K.; Bhattarai, A. Thermodynamic Properties and Contact Angles of CTAB and SDS in Acetone–Water Mixtures at Different Temperatures. SN Appl. Sci. 2020, 2 (7), 1–19; https://doi.org/10.1007/s42452-020-3036-1.Search in Google Scholar

46. Bakshi, M. S. Micelle Formation by Anionic and Cationic Surfactantsin Binary Aqueous Solvents. J. Chem. Soc. Faraday Trans. 1993, 89 (24), 4323–4326; https://doi.org/10.1039/ft9938904323.Search in Google Scholar

47. Rauniyar, B. S.; Bhattarai, A. Study of Conductivity, Contact Angle and Surface Free Energy of Anionic (SDS, AOT) and Cationic (CTAB) Surfactants in Water and Isopropanol Mixture. J. Mol. Liq. 2021, 323; https://doi.org/10.1016/j.molliq.2020.114604.Search in Google Scholar

48. Mandal, B.; Ghosh, S.; Moulik, S. P. Interaction between a Bio-Tolerable Amino-Acid Based Amphiphile (N-Dodecanoylsarcosinate, SDDS) and Modified Cationic Polymers, Hydroxyethylcelluloses (JR 400, and LM 200) in Isopropanol-Water Medium. Colloids Surf. A Physicochem. Eng. Asp. 2019, 566 (January), 156–165; https://doi.org/10.1016/j.colsurfa.2019.01.002.Search in Google Scholar

49. Ghosh, K. K.; Baghel, V. Micellar Properties of Benzyldimethyldodecylammonium Bromide in Aquo-organic Solvent Media. Indian J. Chem. A 2008, 47 (8), 1230–1233.Search in Google Scholar

50. Devi, Y. G.; Gurung, J.; Pulikkal, A. K. Micellar Solution Behavior of Cetylpyridinium Surfactants in 2-Propanol-Water Mixed Media at Different Temperatures. J. Chem. Eng. Data 2021, 66 (1), 368–378; https://doi.org/10.1021/acs.jced.0c00734.Search in Google Scholar

51. Bhattarai, A.; Yadav, A. K.; Sah, S. K.; Deo, A. Influence of Methanol and Dimethyl Sulfoxide and Temperature on the Micellization of Cetylpyridinium Chloride. J. Mol. Liq. 2017, 242, 831–837. https://doi.org/10.1016/j.molliq.2017.07.085.Search in Google Scholar

52. Akbaş, H.; Kartal, Ç. Conductometric Studies of Hexadecyltrimethylammonium Bromide in Aqueous Solutions of Ethanol and Ethylene Glycol. Colloid J. 2006, 68 (2), 125–130; https://doi.org/10.1134/S1061933X06020013.Search in Google Scholar

53. Bhattarai, P.; Niraula, T. P.; Bhattarai, A. Thermodynamic Properties of Cetyltrimethylammonium Bromide in Ethanol-Water Media With/without the Presence of the Divalent Salt. J. Oleo Sci. 2021, 70 (3), 363–374; https://doi.org/10.5650/jos.ess20207.Search in Google Scholar PubMed

54. Akbaş, H.; Batıgöç, Ç. Micellization of Dodecylpyridinium Chloride in Water-Ethanol Solutions. Colloid J. 2008, 70 (2), 127–133; https://doi.org/10.1134/s1061933x08020014.Search in Google Scholar

55. Acharya, S.; Niraula, T. P.; Bhattarai, A. Conductivity Study of DTAB in Water and Ethanol-Water Mixture in the Presence and Absence of ZnSO4. Baghdad Sci. J. 2020, 17 (4), 1207–1215; https://doi.org/10.21123/bsj.2020.17.4.1207.Search in Google Scholar

56. Shah, S. K.; Chatterjee, S. K.; Bhattarai, A. The Effect of Methanol on the Micellar Properties of Dodecyltrimethylammonium Bromide (DTAB) in Aqueous Medium at Different Temperatures. J. Surfactants Deterg. 2016, 19 (1), 201–207; https://doi.org/10.1007/s11743-015-1755-x.Search in Google Scholar

57. Rodríguez, A.; del Mar Graciani, M.; Fernández, G.; Moyá, M. L. Effects of Glycols on the Thermodynamic and Micellar Properties of TTAB in Water. J. Colloid Interface Sci. 2009, 338 (1), 207–215; https://doi.org/10.1016/j.jcis.2009.06.005.Search in Google Scholar PubMed

58. PubChem. National Library of Medicine; National Center for Biotechnology Information. https://pubchem.ncbi.nlm.nih.gov/ (accessed 2024-09-07).Search in Google Scholar

59. Zhan, Y.; Zhu, J. Response Surface Methodology and Artificial Neural Network-Genetic Algorithm for Modeling and Optimization of Bioenergy Production from Biochar-Improved Anaerobic Digestion. Appl. Energy 2024, 355, 122336. https://doi.org/10.1016/j.apenergy.2023.122336.Search in Google Scholar

60. Khan, J.; Lee, E.; Kim, K. A Higher Prediction Accuracy-Based Alpha–Beta Filter Algorithm Using the Feedforward Artificial Neural Network. CAAI Trans. Intell. Technol. 2023, 8 (4), 1124–1139. https://doi.org/10.1049/cit2.12148.Search in Google Scholar

61. Moosavi, V.; Vafakhah, M.; Shirmohammadi, B.; Behnia, N. A Wavelet-ANFIS Hybrid Model for Groundwater Level Forecasting for Different Prediction Periods. Water Resour. Manag. 2013, 27 (5), 1301–1321; https://doi.org/10.1007/s11269-012-0239-2.Search in Google Scholar

62. Dragović, S. Artificial Neural Network Modeling in Environmental Radioactivity Studies – A Review. Sci. Total Environ. 2022, 847, 157526; https://doi.org/10.1016/j.scitotenv.2022.157526.Search in Google Scholar PubMed

63. Rosenblatt, F. The Perceptron: A Probabilistic Model for Information Storage and Organization in the Brain. Psychol. Rev. 1958, 65 (6), 386–408. https://doi.org/10.1037/h0042519.Search in Google Scholar PubMed

64. Dragović, S. Artificial Neural Network Modeling in Environmental Radioactivity Studies – A Review. Sci. Total Environ. 2022, 847, 157526; https://doi.org/10.1016/j.scitotenv.2022.157526.Search in Google Scholar PubMed

65. Shah, A.; Shah, M.; Pandya, A.; Sushra, R.; Sushra, R.; Mehta, M.; Patel, K.; Patel, K. A Comprehensive Study on Skin Cancer Detection Using Artificial Neural Network (ANN) and Convolutional Neural Network (CNN). Clin. eHealth 2023, 6, 76–84. https://doi.org/10.1016/j.ceh.2023.08.002.Search in Google Scholar

66. Astray, G.; Iglesias-Otero, M. A.; Moldes, O. A.; Mejuto, J. C. Predicting Critical Micelle Concentration Values of Non-ionic Surfactants by Using Artificial Neural Networks. Tenside Surfactants Deterg. 2013, 50(2), 118–124. https://doi.org/10.3139/113.110242.Search in Google Scholar

67. Scabini, L. F. S.; Bruno, O. M. Structure and Performance of Fully Connected Neural Networks: Emerging Complex Network Properties. Phys. A Stat. Mech. Its Appl. 2023, 615, 128585. https://doi.org/10.1016/j.physa.2023.128585.Search in Google Scholar

68. Ikram, R. M. A.; Ewees, A. A.; Parmar, K. S.; Yaseen, Z. M.; Shahid, S.; Kisi, O. The Viability of Extended Marine Predators Algorithm-Based Artificial Neural Networks for Streamflow Prediction. Appl. Soft Comput. 2022, 131, 109739. https://doi.org/10.1016/j.asoc.2022.109739.Search in Google Scholar

69. Ikram, R. M. A.; Ewees, A. A.; Parmar, K. S.; Yaseen, Z. M.; Shahid, S.; Kisi, O. The Viability of Extended Marine Predators Algorithm-Based Artificial Neural Networks for Streamflow Prediction. Appl. Soft Comput. 2022, 131, 109739. https://doi.org/10.1016/j.asoc.2022.109739.Search in Google Scholar

70. Khan, J.; Lee, E.; Kim, K. A Higher Prediction Accuracy–Based Alpha–Beta Filter Algorithm Using the Feedforward Artificial Neural Network. CAAI Trans. Intell. Technol. 2023, 8 (4), 1124–1139. https://doi.org/10.1049/cit2.12148.Search in Google Scholar

71. Gue, I. H. V.; Ubando, A. T.; Tseng, M. L.; Tan, R. R. Artificial Neural Networks for Sustainable Development: A Critical Review. Clean Technol. Environ. Policy 2020, 22, 1449–1465; https://doi.org/10.1007/s10098-020-01883-2.Search in Google Scholar

72. Wang, J.; Wang, Y.; Li, H.; Yang, H.; Li, Z. Ensemble Forecasting System Based on Decomposition-Selection-Optimization for Point and Interval Carbon Price Prediction. Appl. Math. Model. 2023, 113, 262–286. https://doi.org/10.1016/j.apm.2022.09.004.Search in Google Scholar

73. Yang, H.; Zhang, Y.; Li, G. Air Quality Index Prediction Using a New Hybrid Model Considering Multiple Influencing Factors: A Case Study in China. Atmos. Pollut. Res. 2023, 14 (3), 101677. https://doi.org/10.1016/j.apr.2023.101677.Search in Google Scholar

74. Chicco, D.; Warrens, M. J.; Jurman, G. The Coefficient of Determination R-Squared Is More Informative Than SMAPE, MAE, MAPE, MSE and RMSE in Regression Analysis Evaluation. PeerJ Comput. Sci. 2021, 7, e623. https://doi.org/10.7717/peerj-cs.623.Search in Google Scholar PubMed PubMed Central

75. Yuan, S.; Cai, Z.; Xu, G.; Jiang, Y. Quantitative Structure–Property Relationships of Surfactants: Critical Micelle Concentration of Anionic Surfactants. J. Dispersion Sci. Technol. 2002, 23 (4), 465–472; https://doi.org/10.1081/DIS-120014014.Search in Google Scholar

76. Roberts, D. W. Application of Octanol/Water Partition Coefficients in Surfactant Science: A Quantitative Structure−Property Relationship for Micellization of Anionic Surfactants. Langmuir 2002, 18 (2), 345–352; https://doi.org/10.1021/la0108050.Search in Google Scholar

77. Li, X.; Zhang, G.; Dong, J.; Zhou, X.; Yan, X.; Luo, M. Estimation of Critical Micelle Concentration of Anionic Surfactants with QSPR Approach. J. Mol. Struct. THEOCHEM 2004, 710 (1), 119–126. https://doi.org/10.1016/j.theochem.2004.08.039.Search in Google Scholar

78. Roy, K.; Kabir, H. QSPR with Extended Topochemical Atom (ETA) Indices: Exploring Effects of Hydrophobicity, Branching and Electronic Parameters on logCMC Values of Anionic Surfactants. Chem. Eng. Sci. 2013, 87, 141–151. https://doi.org/10.1016/j.ces.2012.10.002.Search in Google Scholar

79. Brozos, C.; Rittig, J. G.; Bhattacharya, S.; Akanny, E.; Kohlmann, C.; Mitsos, A. Graph Neural Networks for Surfactant Multi-property Prediction. Colloids Surf. A Physicochem. Eng. Asp. 2024, 694, 134133. https://doi.org/10.1016/j.colsurfa.2024.134133.Search in Google Scholar

80. Qin, S.; Jin, T.; Van Lehn, R. C.; Zavala, V. M. Predicting Critical Micelle Concentrations for Surfactants Using Graph Convolutional Neural Networks. J. Phys. Chem. B 2021, 125, 10610–10620; https://doi.org/10.1021/acs.jpcb.1c05264.Search in Google Scholar PubMed

81. Rodríguez-Fernández, R.; Fernández-Gómez, A.; Mejuto, J. C.; Astray, G. Machine Learning Models to Classify Shiitake Mushrooms (Lentinula edodes) According to Their Geographical Origin Labeling. Foods 2024, 13, 2656. https://doi.org/10.3390/foods13172656.Search in Google Scholar PubMed PubMed Central

82. Rodríguez-Fernández, R.; Fernández-Gómez, Á.; Mejuto, J. C.; Astray, G. Modelling Polyphenol Extraction through Ultrasound-Assisted Extraction by Machine Learning in Olea europaea Leaves. Foods 2023, 12, 4483. https://doi.org/10.3390/foods12244483.Search in Google Scholar PubMed PubMed Central

83. Soria-Lopez, A.; Sobrido-Pouso, C.; Mejuto, J.C.; Astray, G. Assessment of Different Machine Learning Methods for Reservoir Outflow Forecasting. Water 2023, 15, 3380. https://doi.org/10.3390/w15193380.Search in Google Scholar

84. Astray, G.; Soria-Lopez, A.; Barreiro, E.; Mejuto, J.C.; Cid-Samamed, A. Machine Learning to Predict the Adsorption Capacity of Microplastics. Nanomaterials 2023, 13, 1061. https://doi.org/10.3390/nano13061061.Search in Google Scholar PubMed PubMed Central

85. Laidi, M.; Ek Hadj, A.A.; Si-Moussa, C.; Benkortebi, O.; Hentabli, M.; Hanini, S. CMC of Diverse Gemini Surfactants Modeling Using a Hybrid Approach Combining SVR-DA. Chem. Ind. Chem. Eng. Q. 2021, 27 (3), 299–312. https://doi.org/10.2298/ciceq200907048l.Search in Google Scholar

Received: 2024-08-26
Accepted: 2024-09-11
Published Online: 2024-09-26
Published in Print: 2024-11-26

© 2024 Walter de Gruyter GmbH, Berlin/Boston

Downloaded on 14.9.2025 from https://www.degruyterbrill.com/document/doi/10.1515/tsd-2024-2623/html
Scroll to top button