Professional Documents
Culture Documents
Laidi 2020
Laidi 2020
Since surfactants are considered as important versus the conventional surfactants. Gemini surfact-
interfacial active compounds, they are usually emp- ants contain two hydrophobic tails and two hydrophilic
loyed in different industrial fields, namely petroleum, heads connected by a spacer at or near the two
detergent, chemical and environmental sectors [1]. heads (Figure 1) [2]. These chemical structure modi-
Different types of surfactants have been proposed in fications grant them excellent properties, namely low
literature by trying various chemical structures to CMC, high aqueous solubility and viscosity, unique
enhance their properties; among these surfactants, micelle structure and aggregation behaviour and high
Gemini surfactants (dimeric surfactants) have gained yield in decreasing the surface tension of water sol-
significant attention due to their attractive properties utions [3-12]. As the CMC is an important parameter
to characterize surfactants, their measurement is usu-
Correspondence: M. Laidi, Laboratory of Biomaterials and Trans-
ally difficult and costly. For that reason, many studies
port Phenomena (LBMPT), University of Medea, Medea, Algeria. have been conducted to find an easy method to det-
E-mail: maamarw@yahoo.fr ermine the CMC values of Gemini surfactants based
Paper received: 7 September, 2020
Paper revised: 23 October, 2020
on chemometrics techniques [13-21].
Paper accepted: 3 December, 2020 Recently, quantitative structure-property relat-
https://doi.org/10.2298/CICEQ200907048L ionship (QSPR) method was used in many fields to
299
M. LAIDI et al.: CMC OF DIVERSE GEMINI SURFACTANTS MODELING… Chem. Ind. Chem. Eng. Q. 27 (3) 299−312 (2021)
extract and link chemicals properties to their mole- evaluated for CMC estimation yet and compare its
cular structures [22-30]. In QSPR modeling, different performance with ANN, OLS, PLS and KNN model.
computational techniques have been used, such as
multiple linear regression (OLS or MLR), PLS, ANN, QSPR CALCULATION AND FILTERING
SVR and ANFIS [31-43]. METHODOLOGY
Table 1. Structures and CMC values of 211 Gemini surfactants according to Figure 1 (1–64)
No. Tail 1 Tail 2 Ion -1 Ion-2 Spacer Exp. pCMC No. Tail 1 Tail 2 Ion-1 or Ion-2 Spacer Exp. pCMC
- + +
1 C8H17 C8H17 PO4 CH3)2N C2H4 3.0000 33 C16H32 C16H32 (CH3)2N (CH2)6 3.6990
2 C8H17 C10H21 C2H4 3.9586 34 C16H32 C16H32 (CH2)3 3.9208
3 C10H21 C8H17 C2H4 4.0000 35 C16H32 C16H32 (CH2)5 1.8601
4 C8H17 C12H25 C2H4 4.8539 36 C16H32 C16H32 (CH2)8 1.5817
5 C12H25 C8H17 C2H4 4.7447 37 C16H32 C16H32 (CH2)10 1.5800
6 C10H21 C10H21 C2H4 4.8861 38 C16H32 C16H32 (CH2)12 3.0809
t
7 C8H17 C14H29 C2H4 5.2147 39 C10H21 C10H21 (CH2)4 3.2007
t
8 C14H29 C8H17 C2H4 5.0969 40 C14H29 C14H29 (CH2)4 3.4318
t
9 C10H21 C12H25 C2H4 4.8861 41 C10H21 C10H21 (CH2)2 3.6990
t
10 C10H21 C14H29 C2H4 5.2757 42 C10H21 C10H21 (CH2)3 3.9208
11 C14H29 C10H21 C2H4 5.2596 43 C10H21 C10H21 (CH2)4 1.8601
12 C12H25 C12H25 C2H4 5.0458 44 C10H21 C10H21 (CH2)6 2.0362
13 C8H17 C18H36 C2H4 5.3188 45 C10H21 C10H21 (CH2)8 2.1249
14 C18H36 C8H17 C2H4 5.2291 46 C10H21 C10H21 (CH2)10 2.3768
15 C10H21 C16H32 C2H4 5.3010 47 C10H21 C10H21 (CH2)12 2.6576
16 C14H29 C12H25 C2H4 5.1549 48 C6H13 C6H13 (CH2)5 0.7959
t
17 C12H25 C14H29 C2H4 5.3665 49 C9H19 C9H19 (CH2)5 1.6021
+ + t
18 C12H25 C12H25 (CH3)2N (CH3)2N (CH2)2 3.0969 50 C10H21 C10H21 (CH2)5 2.0506
t
19 C12H25 C12H25 (CH2)3 3.0269 51 C11H23 C11H23 (CH2)5 2.5229
t
20 C12H25 C12H25 (CH2)4 2.9318 52 C12H25 C12H25 (CH2)5 2.9586
21 C12H25 C12H25 (CH2)6 2.9872 53 C13H27 C13H27 (CH2)5 3.3872
22 C12H25 C12H25 (CH2)8 3.0809 54 C14H29 C14H29 (CH2)5 3.7447
23 C12H25 C12H25 (CH2)10 3.2007 55 C13H27 C11H23 (CH2)6 3.0555
24 C12H25 C12H25 (CH2)12 3.4318 56 C14H29 C10H21 (CH2)6 3.0757
25 C12H25 C12H25 (CH2)14 3.6990 57 C16H32 C8H17 (CH2)6 3.1249
26 C12H25 C12H25 (CH2)16 3.9208 58 C18H37 C6H13 (CH2)6 3.2291
300
M. LAIDI et al.: CMC OF DIVERSE GEMINI SURFACTANTS MODELING… Chem. Ind. Chem. Eng. Q. 27 (3) 299−312 (2021)
No. Tail 1 Tail 2 Ion -1 Ion-2 Spacer Exp. pCMC No. Tail 1 Tail 2 Ion-1 or Ion-2 Spacer Exp. pCMC
t
27 C8H17 C8H17 (CH2)3 1.8601 59 C8H17 C8H17 (C2H4)2N(CH3) 1.5086
t
28 C8H17 C8H17 (CH2)4 1.5817 60 C9H19 C9H19 (C2H4)2N(CH3) 1.7447
t
29 C8H17 C8H17 (CH2)5 1.5800 61 C10H21 C10H21 (C2H4)2N(CH3) 2.0757
t
30 C8H17 C8H17 (CH2)6 3.0809 62 C11H23 C11H23 (C2H4)2N(CH3) 2.5229
31 C16H32 C16H32 (CH2)2 3.2007 63 C12H25 C12H25 (C2H4)2N(CH3) 2.9586
32 C16H32 C16H32 (CH2)4 3.4318 64 C13H27 C13H27 (C2H4)2N(CH3) 3.4815
Table 1. Structures and CMC values of 211 Gemini surfactants according to Figure 1 (65–100)
No. Tail 1 Tail 2 Ion-1 or Ion-2 Spacer Exp. pCMC No. Tail 1 Tail 2 Ion-1 or Ion-2 Spacer Exp. pCMC
+
65 C14H29 C14H29 (CH3)2N (C2H4)2N(CH3) 3.9208 88 C8H17 C8H17 CH2C≡CCH2 1.3206
+
66 C15H31 C15H31 (CH3)2N (C2H4)2N(CH3) 4.3665
+
67 C16H33 C16H33 (CH3)2N (C2H4)2N(CH3) 4.9208
+ t
68 C8H17 C8H17 (CH3)2N (C2H4)2O 1.3565 89 C10H21 C10H21 CH2C≡CCH2 2.2757
t +
69 C9H19 C9H19 (CH3)2N (C2H4)2O 1.6576
t +
70 C10H21 C10H21 (CH3)2N (C2H4)2O 2.1135
+ t
71 C11H23 C11H23 (CH3)2N (C2H4)2O 2.5850 90 C10H21 C10H21 CH2C≡CCH2 2.2248
+
72 C12H25 C12H25 (CH3)2N (C2H4)2O 3.0000
+
73 C13H27 C13H27 (CH3)2N (C2H4)2O 3.4202
+
74 C14H29 C14H29 (CH3)2N (C2H4)2O 3.7212 91 C14H29 C14H29 CH2C≡CCH2 3.8861
+
75 C16H33 C16H33 (CH3)2N (C2H4)2O 3.1938
+
76 C6H13 C6H13 (CH3)2N (C2H4)2S 0.8861
+ +
77 C8H17 C8H17 (CH3)2N (C2H4)2S 1.4318 92 C8H17 C8H17 (CH3)2N 1.6459
+
78 C10H21 C10H21 (CH3)2N (C2H4)2S 2.2076
t +
79 C12H25 C12H25 (CH3)2N (C2H4)2S 3.0177
t + +
80 C14H29 C14H29 (CH3)2N (C2H4)2S 4.0000 93 C8H17 C8H17 (CH3)2N 1.6289
+
81 C16H33 C16H33 (CH3)2N (C2H4)2S 5.0132
+
82 C18H37 C18H37 (CH3)2N (C2H4)2S 6.0000
+ +
83 C10H21 C10H21 (CH3)2N CH2C≡CCH2 2.8570 94 C8H17 C8H17 (CH3)2N 1.6108
+
84 C8H17 C8H17 (CH3)2N CH2C≡CCH2 1.5935
+
85 C12H25 C12H25 CH2C≡CCH2 3.0506 95 C10H21 C10H21 (CH3)2N (CH2CH(OH))2 2.4318
+
96 C12H25 C12H25 (CH3)2N (CH2CH(OH))2 3.1549
+
97 C14H29 C14H29 (CH3)2N (CH2CH(OH))2 4.0706
+
86 C12H25 C12H25 CH2C≡CCH2 3.0410 98 C16H33 C16H33 (CH3)2N (CH2CH(OH))2 4.3010
t
99 C10H21 C10H21 (CH2)4 2.5702
t
87 C8H17 C8H17 CH2C≡CCH2 1.4634 100 C12H25 C12H25 (CH2)4 3.2518
Table 1. Structures and CMC values of 211 Gemini surfactants according to Figure 1 (101–136)
No. Tail 1 Tail 2 Ion-1 Ion-2 Spacer Exp. pCMC No. Tail 1 Tail 2 Ion -1 Ion-2 Spacer Exp. pCMC
+ +
101 C10H21 C10H21 (CH2)6 2.6990 127 C13H27 C13H27 (CH3)2N (CH3)2N (CH2)2 3.3979
128 C12H25 C12H25 CH2COO(CH2)2– 3.2218
102 C12H25 C12H25 3.2676 –OCOCH2
301
M. LAIDI et al.: CMC OF DIVERSE GEMINI SURFACTANTS MODELING… Chem. Ind. Chem. Eng. Q. 27 (3) 299−312 (2021)
No. Tail 1 Tail 2 Ion-1 Ion-2 Spacer Exp. pCMC No. Tail 1 Tail 2 Ion -1 Ion-2 Spacer Exp. pCMC
+ + +
107 C12H25 C12H25 (CH3)2N (CH3)2N (C2H4O)2C2H4 3.0362 131 C12H25 C12H25 (C3H7) (C2H5) N CH2COO(CH2)–2 3.0362
–OCOCH2
+ +
108 C12H25 C12H25 (CH3)2N (CH3)2N (C2H4O)3C2H4 3.0655 132 C12H25 C12H25 C3H7 3.0809
t + +
109 C12H25 C12H25 (CH3)2N (CH3)2N (C2H4O)4C2H4 3.0458
t + +
110 C12H25 C12H25 (CH3)2N (CH3)2N (C2H4O)5C2H4 2.9101
+ +
111 E-C12O E-C12O (CH3)2N (CH3)2N (CH2)3 3.2596 133 C12H25 C12H25 CH2S(CH2)2SCH2 3.3979
+ +
112 E-C12O E-C12O (CH3)2N (CH3)2N (CH2)4 3.2218
+ +
113 E-C12O E-C12O (CH3)2N (CH3)2N (CH2)6 3.2403
+ +
114 E-C14O E-C14O (CH3)2N (CH3)2N (CH2)3 3.5800
+ +
115 E-C14O E-C14O (CH3)2N (CH3)2N (CH2)4 3.3799
+ +
116 E-C14O E-C14O (CH3)2N (CH3)2N (CH2)6 3.4802 134 C12H25 C12H25 CH2S(CH2)3SCH2 3.4559
+ +
117 E-C16O E-C16O (CH3)2N (CH3)2N (CH2)3 4.5406
+ +
118 E-C16O E-C16O (CH3)2N (CH3)2N (CH2)4 4.3507
t + +
119 E-C16O E-C16O (CH3)2N (CH3)2N (CH2)6 4.4101
t + +
120 C14H29 C14H29 (CH3)2N (CH3)2N (CH2)6 3.8013 135 C12H25 C12H25 CH2S(CH2)4SCH2 3.5086
+ +
121 C8H17 C8H17 CH3)2N CH3)2N CH2COO(CH2)2– 1.9574
–OCOCH2
+ +
122 C10H21 C10H21 CH3)2N CH3)2N 2.6326
+ +
123 C12H25 C12H25 CH3)2N CH3)2N 2.9747
+ +
124 C14H29 C14H29 (CH3)2N (CH3)2N 3.8239 136 C14H29 C14H29 CH2S(CH2)2SCH2 4.0000
+ +
125 C16H33 C17H35 (CH3)2N (CH3)2N 4.1549
+ +
126 C18H37 C18H37 (CH3)2N (CH3)2N 4.6990
Table 1. Structures and CMC values of 211 Gemini surfactants according to Figure 1 (137–193)
No. Tail 1 Tail 2 Ion-1 or Ion-2 Spacer Exp. pCMC No. Tail 1 Tail 2 Ion-1 or Ion-2 Spacer Exp. pCMC
137 C14H29 C14H29 CH2S(CH2)3– 4.0706 165 C8H17 C8H17 3.3760 2.7900
–SCH2
138 C14H29 C14H29 CH2S(CH2)4– 4.1487 166 C10H11CH– C10H11CH– C8H16 2.7990
–SCH2 –(OH)CH2 –(OH)CH2
t
139 C16H3 C16H33 CH2S(CH2)2– 4.6576 167 C10H11CH– C10H11CH– C6H12 2.8040
–SCH2 –(OH)CH2 –(OH)CH2
t
140 C16H3 C16H33 CH2S(CH2)3– 4.6778 168 C9H19 C9H19 CH2O(CH2)2– 2.8100
–SCH2 –OCH2
t
141 C16H3 C16H33 CH2S(CH2)4– 4.6990 169 C8H17 C8H17 2.8300
–SCH2
t
142 CH3 CH3 CH2S(CH2)3– 2.1800 170 C10H11CH– C10H11CH– C5H10 2.8330
–SCH2 –(OH)CH2 –(OH)CH2
143 C2H5 C2H5 CH2S(CH2)2– 2.2300 171 C10H11CH– C10H11CH– C4H8 2.8540
–SCH2 –(OH)CH2 –(OH)CH2
144 C2H5 C2H5 CH2S(CH2)3– 2.2600 172 C10H11CH– C10H11CH– C3H6 2.8630
–SCH2 –(OH)CH2 –(OH)CH2
145 C2H5 C2H5 CH2S(CH2)4– 2.3000 173 C8H17 C8H17 CH2O(CH2)6– 2.8700
–SCH2 –OCH2
146 C2H5 C2H5 CH2S(CH2)5– 2.3300 174 C9H19 C9H19 CH2O(CH2)4– 2.8800
–SCH2 –OCH2H2
147 C8H17 C8H17 CH2CH– 2.3600 175 C9H19 C9H19 CH2O(CH2)5– 2.9100
–(OH)CH2 –OCH2
148 C2H5 C2H5 CH2S(CH2)6– 2.3700 176 C10H21 C10H21 CH2O(CH2)2– 2.9200
–SCH2 –OCH2
t
149 C4H9 C4H9 CH2S(CH2)2– 2.4100 177 C10H21 C10H21 CH2O(CH2)3– 2.9500
–SCH2 –OCH2
302
M. LAIDI et al.: CMC OF DIVERSE GEMINI SURFACTANTS MODELING… Chem. Ind. Chem. Eng. Q. 27 (3) 299−312 (2021)
No. Tail 1 Tail 2 Ion-1 or Ion-2 Spacer Exp. pCMC No. Tail 1 Tail 2 Ion-1 or Ion-2 Spacer Exp. pCMC
t
150 C3H7 C3H7 CH2S(CH2)5– 2.4100 178 C9H19 C9H19 CH2O(CH2)6– 2.9500
–SCH2 –OCH2
t
151 C4H9 C4H9 CH2S(CH2)5– 2.4900 179 C11H23 C11H23 CH2O(CH2)3– 3.0700
–SCH2 –OCH2
t
152 C5H11 C5H11 CH2S(CH2)4– 2.5300 180 C10H21 C10H21 CH2O(CH2)5– 3.0700
–SCH2 –OCH2
153 C4H9 C4H9 CH2S(CH2)6– 2.5400 181 C11H23 C11H23 CH2O(CH2)4– 3.1300
–SCH2 –OCH2
154 C5H11 C5H11 CH2S(CH2)5– 2.5800 182 C12H25 C12H25 C6H12 3.1310
–SCH2
155 C6H13 C6H13 CH2S(CH2)2– 2.5800 183 C12H25 C12H25 C4H8 3.1370
–SCH2
156 C10H21 C10H21 CH2CH– 2.5970 184 C12H25 C12H25 CH2O(CH2)2– 3.1500
–(OH)CH2 –OCH2
157 C5H11 C5H11 CH2S(CH2)6– 2.6200 185 C11H23 C11H23 CH2O(CH2)5– 3.1600
–SCH2 –OCH2
158 C6H13 C6H13 CH2S(CH2)4– 2.6400 186 C12H25 C12H25 CH2O(CH2)3– 3.1800
–SCH2 –OCH2
t
159 C7H15 C7H15 CH2S(CH2)2– 2.6500 187 C12H25 C12H25 (CH2)2 3.1940
–SCH2
t
160 C6H13 C6H13 CH2S(CH2)5– 2.6700 188 C11H23 C11H23 CH2O(CH2)6– 3.2000
–SCH2 –OCH2
t
161 C7H15 C7H15 CH2S(CH2)4– 2.7100 189 C12H25 C12H25 CH2O(CH2)4– 3.2200
–SCH2 –OCH2
t
162 C6H13 C6H13 CH2S(CH2)6– 2.7100 190 C12H25 C12H25 CH2O(CH2)5– 3.2500
–SCH2 –OCH2
163 C8H17 C8H17 CH2S(CH2)2– 2.7200 191 C12H25 C12H25 CH2O(CH2)6– 3.2900
–SCH2 –OCH2
164 C8H17 C8H17 CH2S(CH2)3– 2.7500 192 C14H29 C14H29 CH2O(CH2)2– 3.3400
–SCH2 –OCH2
193 C16H33OCH2– C16H33OCH2– (CH2)2 3.3760
–CH(OH)CH2 –CH(OH)CH2
Table 1. Structures and CMC values of 211 Gemini surfactants according to Figure 1 (194–211)
No. Tail 1 Tail 2 Ion-1 or Ion-2 Spacer Exp. pCMC No. Tail 1 Tail 2 Ion-1 or Ion-2 Spacer Exp. pCMC
t
194 C14H29 C14H29 CH2O(CH2)4OCH2 3.4000 203 C6H13 C6H13 CH2O(CH2)3OCH2 2.6100
t
195 C14H29 C14H29 CH2O(CH2)6OCH2 3.4700 204 C7H15 C7H15 CH2O(CH2)3OCH2 2.6800
t
196 C14H29 C14H29 CH2O(CH2)2OCH2 3.5200 205 C7H15 C7H15 CH2O(CH2)5OCH2 2.7500
t
197 C14H29 C14H29 CH2O(CH2)3OCH2 3.5800 206 C7H15 C7H15 CH2O(CH2)6OCH2 2.7900
t
198 C14H29 C14H29 CH2O(CH2)6OCH2 3.6500 207 C9H19 C9H19 CH2O(CH2)3OCH2 2.8400
t t
199 CH3 CH3 CH2O(CH2)2OCH2 2.1500 208 C10H21 C10H21 CH2O(CH2)4OCH2 3.0000
t t
200 CH3 CH3 CH2O(CH2)5OCH2 2.2500 209 C11H23 C11H23 CH2O(CH2)2OCH2 3.0400
t t
201 C4H9 C4H9 CH2O(CH2)4OCH2 2.4500 210 C10H21 C10H21 CH2O(CH2)6OCH2 3.0700
t t
202 C3H7 C3H7 CH2O(CH2)6OCH2 2.4500 211 C14H29 C14H29 (CH2)4 3.9210
Notes: 1-141, 166-167,170-172, 182-183, 187, 195, 211 di-bromide Gemini surfactants, the rest are di-chloride; CmHn (No. 103-106 Gemini surfactants)
means branched alkyl group; E-CmO (No. 111-119 Gemini surfactants) represents esterquat Gemini surfactants with the formula: [H2m-1Cm-
+ + - t
1COOCH2CH2(CH3)2N (CH2)nN (CH3)2CH2CH2OOCCm-1H2m-1]2Br ; the chemicals were in the test set, and others were in the training set
2. The optimized 2D-structures of the studied 3. To avoid overfitting, the number of variables
surfactants were after that uploaded to alvaDesc soft- (descriptors) must be reduced based on the following
ware to compute 5305 descriptors for each surfactant. steps and using alvaDesc software.
a. Constant and near constant descriptor values
were removed;
303
M. LAIDI et al.: CMC OF DIVERSE GEMINI SURFACTANTS MODELING… Chem. Ind. Chem. Eng. Q. 27 (3) 299−312 (2021)
Figure 2. Example of the drawn structures of the investigated Gemini surfactants by HyperChem software (compound number 100).
b. All missing values or at least one missing chemical formula of each compound are available
value were removed; from the author upon request).
c. Every descriptor with a relative standard devi-
Modeling techniques
ation <0.001 was removed;
d. Descriptors with pair correlation ≥0.95 were alvaDesc is a tool for the calculation of mole-
removed. cular descriptors. It calculates almost 5305 1, 2, 3-
4. Relevant descriptors were screened after -dimensional descriptors divided into 30 logical blocks
performing stepwise MLR with F value method. 20 (3885-2D and 1420-3D) as represented in Figure 3.
descriptors were selected among 5305 with stepwise alvaModel is a software tool to create quantitat-
regression. For each model, we have used the same ive structure-property relationship (QSPR) models
type and number of descriptors, as well as the same using descriptors previously calculated in alvaDesc.
composition of the training and test sets. A set of the This study deals with five different modelling tech-
selected descriptors is summarized in Table 2 with niques cited below.
their signification and types. The datasets with linear − OLS: an ordinary least squares model;
− KNN: a K-nearest neighbours regression model;
304
M. LAIDI et al.: CMC OF DIVERSE GEMINI SURFACTANTS MODELING… Chem. Ind. Chem. Eng. Q. 27 (3) 299−312 (2021)
Table 3. Statistical quality of all developed models for the global dataset
soft-computing techniques
Problem Data collection ANN-MLP, SVR- DA Performance comparison Best model
identification and processing Linear computing techniques of developed models SVR- DA
OLS, PLS, KNN
305
M. LAIDI et al.: CMC OF DIVERSE GEMINI SURFACTANTS MODELING… Chem. Ind. Chem. Eng. Q. 27 (3) 299−312 (2021)
Figure 5. Scatter plot of the global data adjusted by the optimal model between predicted and experimental pCMC values for the global
dataset: a) OLS; b) PLS; c) ANN; d) SVR.
represents different statistical indicators for the best The OLS model was done by alvaModel 1.0.4
OLS-QSPR model; more details are available from software and can be written in linear way versus des-
the authors upon request. criptors and non-standardized coefficients:
306
M. LAIDI et al.: CMC OF DIVERSE GEMINI SURFACTANTS MODELING… Chem. Ind. Chem. Eng. Q. 27 (3) 299−312 (2021)
between the experimental and predicted values and The PLS-QSPR model can be written in linear
does not confirm the good predictive ability of PLS- way versus descriptors and non-standardized coeffici-
-QSPR model. ents:
Table 4. Optimized parameters of the ANN model with its statistical quality
307
M. LAIDI et al.: CMC OF DIVERSE GEMINI SURFACTANTS MODELING… Chem. Ind. Chem. Eng. Q. 27 (3) 299−312 (2021)
308
M. LAIDI et al.: CMC OF DIVERSE GEMINI SURFACTANTS MODELING… Chem. Ind. Chem. Eng. Q. 27 (3) 299−312 (2021)
where k is the number of forecaster variables inc- The obtained deviation against the experimental
luded in the model. values can be explained by the fact that the selected
Figure 8a shows the applicability domain repre- descriptors for SVR-DA QSPR analysis could not
sented by Williams plot of the developed SVR-DA capture some relevant structural features that may be
QSPR model where 199 compounds in both the train- present in these compounds. The obtained model can
ing and the test set were in the domain, and 12 com- be used to estimate CMC of new Gemini surfactants if
pounds were considered as outliers. For the Euclid- these are located later within the AD of the model.
ean distance, all the compounds in both the training
Sensitivity analysis
set and the test set were located in the acceptable
region. In this section, a sensitivity analysis of the sel-
ected descriptors on the CMC has been undertaken
Figure 8. Applicability domains for the developed SVR-DA QSPR model based on Williams plot (a) and relevance factor plot of the
descriptors to the pCMC of Gemini surfactants (b).
309
M. LAIDI et al.: CMC OF DIVERSE GEMINI SURFACTANTS MODELING… Chem. Ind. Chem. Eng. Q. 27 (3) 299−312 (2021)
using the relevance factor (r) method to determine the mized using dragonfly algorithm (SVR-DA) and using
existing dependency between inputs and the output. the calculated descriptors was capable to accurately
The relevance factor can be calculated using Eq. (7) predict the non-linear behaviour of critical micelle con-
and given in the range of [-1, 1]: centration of Gemini surfactants, more than ANN,
OLS, PLS and KNN models for the global dataset. In
i =1( X k ,i − X k )(Yi −Y )
n
addition, the applicability domain of the five models
rk = (7)
and for all Gemini surfactants has been conducted
i =1( X k ,i − X k ) i =1(Yi −Y )
n 2 n 2
310
M. LAIDI et al.: CMC OF DIVERSE GEMINI SURFACTANTS MODELING… Chem. Ind. Chem. Eng. Q. 27 (3) 299−312 (2021)
[18] K. Roy, H. Kabir, Chem. Eng. Sci. 87 (2013) 141–151 [42] S.K. Lahiri, K.C. Ghanta, Asia-Pacific J. Chem. Eng. 5
[19] B. Bhhatarai, P. Gramatica, Environ. Sci. Technol. 45 (2010) 763–777
(2011) 8120–8128 [43] J. Ding, H. Wang, K. Dai, Y. Zi, Z. Shi, Chem. Ind. Chem.
[20] X. Li, G. Zhang, J. Dong, X. Zhou, X. Yan, M. Luo, J. Mol. Eng. Q. 20 (2014) 29–38
Struc: THEOCHEM 710 (2004) 119–126 [44] Z. Kardanpour, B. Hemmateenejad, T. Khayamian, Anal.
[21] M. Poša, Steroids 76 (2011) 85–93 Chim. Acta 531 (2005) 285–291
[22] L. Jiao, X. Wang, S. Bing, Z. Xue, H. Li, Quím. Nova 38 [45] J. Wu, H. Gao, D. Shi, Y. Yang, Y. Zhang, W. Zhu, J. Mol.
(2015) 510–517 Liquids 299 (2020) 112248
[23] H.A. Martens, P. Dardenne, Chemomet. Intell. Lab. Sys. [46] M.J. Rosen, L. Liu, JAOCS, J. Am. Oil Chem. Soc. 73
44 (1998) 99–121 (1996) 885–890
[24] N. Pal, N. Saxena, A. Mandal, Colloids Surfaces, A 533 [47] E. Alami, G. Beinert, P. Marie, R. Zana, Langmuir 9
(2017) 20–32 (1993) 1465–1467
[25] T. Gaudin, P. Rotureau, I. Pezron, G. Fayet, Ind. Eng. [48] M. Dreja, K. Heine, B. Tieke, G. Junkers, J. Colloid
Chem.Res. 55 (2016) 11716–11726 Interface Sci. 191 (1997) 131–140
[26] L. Jiao, X. Wang, S. Bing, Z. Xue, H. Li, RSC Adv. 5 [49] G. Savelli, L. Brinchi, R. Germani, Surfact. Sci. Ser.
(2015) 6617–6624 (2001) 175–246
[27] L. Jiao, X. Zhang, Y. Qin, X. Wang, H. Li, Chemomet. [50] F.M. Menger, J.S. Keiper, Angew. Chem. Int.Ed. 39
Intell. Lab. Sys. 157 (2016) 202–207 (2000) 1906–1920
[28] K. Parikh, S. Singh, A. Desai, S. Kumar, J. Mol. Liquids [51] M. Dreja, W. Pyckhout-Hintzen, H. Mays, B. Tieke,
278 (2019) 290–298 Langmuir 15 (1999) 391–399
[29] L. Jiao, Z. Xue, G. Wang, X. Wang, H. Li, Chemomet. [52] M. Dreja, S. Gramberg, B. Tieke, Chem. Commun. (1998)
Intell. Lab. Sys. 137 (2014) 91–96 1371–1372
[30] A.S. Kulkarni, A.J. Kasabe, M.S. Bhatia, N.M. Bhatia, V.L. [53] A. Kumar, E. Alami, K. Holmberg, V. Seredyuk, F.M.
Gaikwad, AAPS PharmSciTech 20 (2019) Menger, Colloids Surfaces, A 228 (2003) 197–207
[31] H. Benimam, C.S. Moussa, M. Hentabli, S. Hanini, M. [54] Kabir-ud-Din, P.A. Koya, J. Chem. Eng. Data 55 (2010)
Laidi, J. Chem. Eng. Data 65 (2020) 3161–3172 1921–1929
[32] C.B. Carvalho, E.P. Carvalho, M.A.S.S. Ravagnani, [55] M. Dreja, W. Pyckhout-Hintzen, B. Tieke, Macro-
Chem. Ind. Chem. Eng. Q. 26 (2020) 125–134 molecules 31 (1998) 272–280
[33] M. Beigi, M. Torki-Harchegani, M. Mahmoodi-Eshkaftaki, [56] X. Jiang, L. Zhou, Y. Li, Z. Chen, X. Hu, Langmuir 23
Chem. Ind. Chem. Eng. Q. 23 (2017) 251–258 (2007) 11404–11408
[34] A.A. El Hadj, C. Si-Moussa, S. Hanini, M. Laidi, Chem. [57] M.M. Khalaf, A.H. Tantawy, K.A. Soliman, H.M. Abd
Ind. Chem. Eng. Q. 19 (2013) 449–460 El-Lateef, J. Mol. Struc. 1203 (2020) 127442
[35] P.S. Milić, K.M. Rajković, P.M. Milićević, S.M. Milić, T.P. [58] M. Dreja, B. Tieke, Langmuir 14 (1998) 800–807
Brdarić, V.M. Pavelkić, Chem. Ind. Chem. Eng. Q. 19 [59] A.M. Asemu, N.G. Habtu, M.A. Delele, B. Subramanyam,
(2013) 141–152 S. Alavi, J. Food Proc. Eng. 43 (2020) 1–19
[36] H. Golmohammadi, A. Rashidi, S.J. Safdari, Chem. Ind. [60] E. Setiawan, Mudasir, K. Wijaya, IOP Conf. Ser. 742
Chem. Eng. Q. 19 (2013) 321–331 (2020)
[37] J.B. Savković-Stevanović, D.R. Francescetti, Chem. Ind. [61] R. Soleimani, A.H. Saeedi Dehaghani, N.A. Shoushtari,
Chem. Eng. Q. 6 (2000) 361–371. P. Yaghoubi, A. Bahadori, Kor. J. Chem. Eng. 35 (2018)
[38] M. Esfahanian, M. Nikzad, G. Najafpour, A. Asghar 1556–1569
Ghoreyshi, Chem. Ind. Chem. Eng. Q.19 (2013) 241–252 [62] A. Golbraikh, A. Tropsha, Mol. Divers. 5 (2000) 231–243
[39] A. Ghaderi, S. Abbasi, A. Motevali, S. Minaei Chem. Ind. [63] R. Todeschini, D. Ballabio, F. Grisoni, J. Chem. Inform.
Chem. Eng. Q. 18 (2012) 283–293 Model. 56 (2016) 1905–1913
[40] M.S. Leite, M.A. Santos, E.M.F. Costa, A. Balieiro, Á.S. [64] I.E. Frank, R. Todeschini, The data analysis handbook,
Lima, O.L. Sanchez, C.M.F. Soares, Chem. Ind. Chem. Elsevier, Amsterdam, 1994
Eng. Q. 25 (2019) 369–382 [65] C. Cortes, V. Vapnik, Machine Learning 20 (1995) 273–
[41] A.A. Amooey, M. Ahangarian, F. Rezazadeh, Chem. Ind. –297.
Chem. Eng. Q. 20 (2014) 565–569
311
M. LAIDI et al.: CMC OF DIVERSE GEMINI SURFACTANTS MODELING… Chem. Ind. Chem. Eng. Q. 27 (3) 299−312 (2021)
312