Benimam July 2019

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 19

Neural Computing and Applications

https://doi.org/10.1007/s00521-019-04356-w (0123456789().,-volV)(0123456789().,-volV)

ORIGINAL ARTICLE

Modeling the activity coefficient at infinite dilution of water in ionic


liquids using artificial neural networks and support vector machines
Hania Benimam1 • Cherif Si-Moussa1 • Maamar Laidi1 • Salah Hanini1

Received: 23 May 2018 / Accepted: 18 July 2019


 Springer-Verlag London Ltd., part of Springer Nature 2019

Abstract
The activity coefficient at infinite dilution of water in ionic liquids is a thermodynamic property of a paramount importance
in separation processes. However, accurate modeling of this parameter remains a challenging task due to the highly
nonlinear behavior of the water/ionic liquid systems. Also, available models consider a large number of inputs that are
usually difficult to access and require complicated use techniques. Therefore, the main objective of this paper is to use
artificial intelligence techniques to propose models (based on a reduced number of inputs that are easily accessible, and to
improve the accuracy of the correlative performance for activity coefficient at infinite dilution of water in ILs). The present
work features the application of artificial neural networks, support vector machine and least square support vector machine,
among data-driven methods, for modeling the activity coefficient at infinite dilution of water in 53 ionic liquids. Overall,
the models proposed are able to accurately correlate 318 experimental data points gathered from the literature. According
to the results, the ANN is more powerful and effective computational learning machine than the two remaining ones. The
correlation coefficients R2 and deviations expressed as an average absolute relative deviation for the neural network model
are estimated to be 0.99997 and 0.56%, respectively. Furthermore, the neural network model’s interpolation and extrap-
olation capabilities are demonstrated, and its accuracy is compared to other proposed models in the literature based on
multi-linear regression, least squares support vector machine and another feedforward neural network. This work also
includes a graphical user interface for the proposed model, as well as an inputs’ sensitivity analysis.

Keywords Activity coefficient  Ionic liquids  Artificial neural networks  Support vector machine  Least square support
vector machine  Infinite dilution

1 Introduction processes with no negative impact on the atmosphere in


contrast to volatile organic solvents. They are normally
Ionic liquids (ILs) are a new class of organic-based salts composed of a large organic cations and organic or inor-
that present numerous advantageous physicochemical ganic anions. When the melting point is at room temper-
properties compared to volatile organic solvents. They ature, the ILs are referred to as room-temperature ionic
have attracted a considerable interest during the last two liquids (RTILs) [1–4].
decades within the academic and industrial fields due to The remarkable physicochemical properties of ILs such
their growing potential applications. Literature stated that as low vapor pressure, non-flammability, high solvating
ILs may help in improving the capability of chemical power [5] for organic, inorganic and organometallic com-
pounds and high thermal stability [6] have made them the
subject of exploration in numerous applications. Further-
Electronic supplementary material The online version of this
article (https://doi.org/10.1007/s00521-019-04356-w) con-
more, non-aqueous high polarity, viscosity, high thermal
tains supplementary material, which is available to autho- conductivity and solubility of ILs can be finely tuned by
rized users. selecting the combination of cations or anions among a
huge diversity, allowing the design of an appropriate IL for
& Maamar Laidi
laidi.maamar@univ-medea.dz a particular application [7–9], such as reaction media
[10–12], active pharmaceutical ingredients [13], solvents
1
Biomaterials and Transport Phenomena Laboratory of poorly soluble drugs, their precursors, and even as drug
(LBMPT), University Yahia Fares of Médéa, Médéa, Algeria

123
Neural Computing and Applications

delivery vehicles [14, 15], catalysts [12] and separation gas liquid chromatography (GLC). Their study was fol-
media [16]. Hence, the subsequent numbers of commer- lowed by a succession of works established by the same
cially or not available ILs are annually and exponentially authors based on the same principle but different ionic
increasing [17, 18]. The knowledge of their physical and liquids [33–35]. Other collaborations aiming the estimation
chemical properties and phase equilibrium is fundamental of activity coefficient at infinite dilution of organic and
for their applications. inorganic solutes and water in different ionic liquids often
It has been roughly estimated that about 106 possible by GLC were conducted using almost the same protocol
ionic liquids could be theoretically produced by the com- [36–40]. However, due to the massive number of possible
bination of cations and anions from the current chemical ILs, conducting time-consuming and expensive experi-
database. In addition, the binary and ternary mixture of ments for the estimation of infinite-dilution activity coef-
ionic liquids could produce 1012 and 1018 ionic liquid ficient (IDAC) in ILs has became impractical, the reason
possibilities, respectively [19]. This makes the experi- why a lot of researches have been devoted to develop
mental characterization of all possible ionic liquids for the theoretical and computational modeling of this property.
selection of an optimum ionic liquid for specific purpose Detailed review of the previous works on modeling the
unfeasible. Therefore, predictive models that enable a infinite-dilution activity coefficient of different solutes in
priori estimation of the properties of ionic liquids are ILs is given in Sect. 2. Paduszynski [41] whose work will
necessary [20]. be discussed in detail in the following section summarized
In order to use ILs as solvents in industry, namely the main research achievements in modeling activity
selection of solvents for liquid–liquid extraction, extractive coefficient at infinite dilution. The author argues that
distillation, separation of the azeotropic mixtures and despite the fact that conductor-like screening model for real
separation processes, or design new ones, the knowledge of solvents (COSMO-RS) computation method allows a
interactions of ILs with different compounds is needed. deeper insight into interactions between ILs and solutes, it
The solute–solvent complex and dynamic interactions in necessitates a commercial package and it is quite compli-
solution are controlled by the nature and interplay of the cated in use and requires user’s quantum chemical exper-
cation–anion pair containing the IL. Thus, these interac- tise and experience which makes its implementation in
tions are the outcome of various energetic and geometric research quite a difficult and challenging task. Hence,
factors leading to uniquely organized, hydrogen-bonded quantitative structure–property relationship (QSPR) mod-
and self-segregated solvent nanostructures [21]. Conse- els seem capable of overcoming these problems; therefore,
quently, a profound study linking the chemical structure of the author proposed three QSPR models based on three
the ILs with their physicochemical and thermodynamic machine learning algorithms for activity coefficient esti-
properties is indispensable in order to embrace these mation [41]. However, the models suffered from a large
compounds as new promising solvents. number of input variables that needed methods and tech-
For instance, in extraction and separation processes, the niques to reduce their number, and in many cases, the
choice of a solvent is interrelated to two fundamental inputs retained are not easily accessible. Nonetheless, the
properties which are its selectivity and capacity. These two quite high deviations that the models capitulate (average
can be calculated using the activity coefficient at infinite- absolute relative deviation AARD of more than 20%) is a
dilution values by simple expressions: selectivity Sij = major reason to look for other models with improved
c! ! ! !
i /cj and capacity kj = 1/cj [22]. performance and accuracy, and also preferably using
The most broadly used methods for probing the inter- fewer, simply calculated and significant input variables.
action between an ionic liquid and water use either In this present work, an attempt will be made to estimate
experimental techniques, such as infrared (IR) and nuclear activity coefficient at infinite dilution of water in ionic
magnetic resonance (NMR) spectroscopy [23–28] or the- liquids using three methods, a single artificial neural net-
oretical calculations [27, 29–31]. work (ANN) predictive model, a support vector machine
Given the importance of the activity coefficient at infi- and a least square support vector machine. The same set of
nite dilution of organic or inorganic solutes in ILs, in the inputs is considered for the three types of models which
selection and design of separation processes, a lot of includes the temperature of the water/ionic liquid system T,
researches have been focused in this area. One of the the molecular weight of the ionic liquid M and four pure
earliest researches was carried out by Domańska and component properties of the ionic liquid: critical temper-
Marciniak in 2007 [32] where they measured the activity ature Tc, critical pressure Pc, critical volume Vc and
coefficients at infinite dilution c!, for 29 solutes, alkanes, acentric factor x. The properties of the ILs are chosen as
alkenes, alkynes, cycloalkanes, aromatic hydrocarbons, inputs in the purpose of differentiating between the diverse
alcohols and water in the ionic liquid 1-ethyl-3- solvents (ILs) and describing all the systems water/ionic
methylimidazolium trifluoroacetate ([EMIM] [TFA]) using liquid in one system. To the best of our knowledge, this

123
Neural Computing and Applications

will be the first attempt to model the IDAC of water in ILs molecules. The interactions between the solutes (in this
based on physical properties of ILs using three powerful case water) and the solvent (IL) are obtained in the form of
machine learning techniques. The number of inputs con- pairwise interacting surface segments based on the geom-
sidered in this work is the least among other models pre- etry and surface charge density of the solvent molecules.
viously proposed in the literature. An input sensitivity Statistical thermodynamics model is subsequently applied
analysis will play a double role, that of classification of in order to calculate a thermodynamic property of fluids
inputs parameters in terms of effect on the output and and liquid mixtures, namely activity coefficient [48]. In-
confirmation of the appropriate choice of inputs. In sum- depth description of the COSMO-RS theory and the
mary, the aim of the study is to evaluate the capability of activity coefficient term is described in the literature
the different machine learning techniques for modeling the [51–55]. The first COSMO-RS model investigating the
IDAC in the purpose of correlation and/or prediction, and activity coefficient at infinite dilution of solutes in ILs was
to compare the proposed models with three literature proposed by Diedenhofen et al. [56]. The study included 38
models: a multi-linear regression (MLR) model based on solutes in three different ionic liquids. The results had
the COSMO-RS sigma profiles, least squares support proven the accuracy and capability of COSMO-RS method
vector machine (LSSVM) model and feedforward artificial in predicting the infinite-dilution activity coefficients in
neural network (FFANN) model. various ionic liquids. Other researchers like Freire et al.
[57] used COSMO-RS method based on quantum chem-
istry calculations to predict binary liquid–liquid equilibria
2 Modeling the activity coefficient LLE and VLE measurements in several alcohols–ILs sys-
at infinite dilution of water in ILs tems. Their model predictions were reasonably in good
agreement with experimental data. Banerjee and Khanna
IDAC can be defined as when the concentration of the [58], Kurnia et al. [59], Gonfa et al. [60], Gonfa et al. [61]
solute tends to zero or in the range from 10-7 to 10-4. The and Matheswaran et al. [42] all employed COSMO-RS for
importance of this parameter resides in describing the the prediction of activity coefficients at infinite dilution in
behavior of liquid mixtures, fitting Gibbs free energy ILs. The results had proven the capability of the method in
model parameters (like Margules, Wilson, van Laar, non- predicting qualitatively the IDAC in ionic liquids, but had
random two-liquid (NRTL), universal quasichemical often failed in providing accurate quantitative predictions.
(UNIQUAC)), and development of predictive methods Recently, Paduszynski [62] gave an overview of the
such as the universal quasichemical functional group COSMO-RS performance in predicting the IDAC of dif-
activity coefficient (Dortmund) (UNIFAC (Do)) model ferent solutes in ILs. The approach was able to provide
[42]. quantitative and/or qualitative predictions of the largest
The experimental methods used for the measurement of data set available yet with 41868 data points related to 150
the infinite-dilution activity coefficients are GLC [43], molecular solutes and 233 ILs. However, the IDAC cal-
dilutor technique [44] and vapor–liquid equilibria VLE culation by this approach yielded an overall AARD of
measurements [45, 46]. However, due to the unlimited approximately 30% which is still far from adequate.
possible combinations of cations and anions, the experi- Although the COSMO-RS models have become very
mental determination of activity coefficient at infinite popular due to their logarithmic simplicity, numerical sta-
dilution of water in ILs is time-consuming and costly. The bility and their accuracy despite the outlying charge errors,
opportunity to select new ILs may also be missed [47]. their major limitation results from the fact that the quantum
Hence, the use of predictive models is very helpful. Shiflett chemical calculations are performed only in the reference
and Maginn [48] gave an in-depth elucidation of the fluid state of a conductor, and not for real solvent polarity [63].
phase equilibrium of gas/IL systems and highlighted the To overcome some of the limitations of COSMO-RS
potential use of predictive computational modeling. A which treats molecules in an implicit manner, atomistic
review of an example application of computational intel- force field methods are a reliable alternative for estimation
ligence in hydrology field is presented by Ardabili et al. of not just infinite-dilution activity coefficient in ILs but
[49], and another application in agricultural science is also to obtain full isotherms. Based on atomistic simula-
provided by Zhang et al. [50]. tions and taking into account the energetics, also known as
The first type of predictive computational modeling is a force fields, this method uses molecular dynamics or Monte
hybrid quantum/statistical thermodynamics method, and Carlo methods to propose a statistical modeling of all
the most commonly used approach in this kind of modeling molecular aspects [48]. However, in the particular case of
is the COSMO-RS. In this method, quantum mechanical water (as a solute), this method has failed to represent
calculations are carried out on a solute in a conducting experimental absorption isotherms. This is expected since
continuum without a precise modeling of the solvent

123
Neural Computing and Applications

the current force fields lack polarizability treatment of the (alkanes, alkenes, alkynes, cycloalkanes, aromatics, and
solute and solvent [64]. alcohols) in 16 common imidazolium-based ionic liquids,
Correlative methods, usually referred to as QSPR at different temperatures. The predicted and experimental
models, are another computational modeling approach that c!
i were in good agreement, and the root mean square error
has shown high accuracy and robustness, especially when (RMSE) was equal to 0.128 for test data [74].
applying advanced neural networks and other machine Another excellent methodology which has recently
learning techniques. The first QSPR models devoted to the attracted extensive attention is support vector machines
estimation of infinite-dilution activity coefficients in ILs (SVM). This method was developed by Vapnik in 1990s
were proposed by Eike et al. [65]. Briefly, the study and is based on statistical learning theory (SLT) imple-
involved 38 solutes in three ionic liquids and correlated ments structure risk minimization (SRM) theory [77, 78]. It
experimental IDAC with four descriptors for each IL at was primarily employed for nonlinear classification and
four temperatures. However, for each solute/IL system, the regression analysis [79, 80]. In this approach, consider a
estimation of each descriptor had to be repeated at each training data set of N points (Xi, Yi), with i = 1,…,N. X
temperature. In addition, the models needed to be validated represents the model inputs and Y its output. An SVM
both internally and externally to appraise the stability and model takes the form:
predictive ability of the model [66, 67]. The complexity y ð x Þ ¼ x T ;ð x Þ þ b ð1Þ
and inconvenience of the calculation process affected the
implementation of the QSPR models despite the satisfac- where ;ðÞ: Rn ! Rm a nonlinear function that maps input
tory results. finite-dimensional space into a higher-dimensional space
ANNs are a computational model inspired in the which is implicitly created, x is a weight vector and b is
behavior of natural neurons intended for modeling complex the bias [81, 82].
nonlinear relationships [68]. Their most important advan- The SVM problems are solved using quadratic pro-
tages are: (i) their ability to find traceability of the entrees gramming techniques, but these are known for their fol-
and (ii) their flexibility to test the interpolation, extrapo- lowing disadvantages [83]:
lation and prediction of the model [69]. The quantity and
• They are practically difficult to employ.
placement of neurons in different layers is referred to as its
• They are time-consuming.
architecture. The neurons receive data related to a given
• They require a large memory and a considerable central
property, solubility, for instance, and some independent
processing unit time (CPU time).
variables related to the dependent main variable (temper-
ature at equilibrium, for instance). The input and output The least squares support vector machine (LSSVM), a
variables are weighed by weights and shifted by a bias modified adaptation of the SVM was later developed. It
factor specific to each neuron. Through a process of opti- replaces the convex quadratic programming and inequality
mization using traditional backpropagation algorithms or constraints of the original SVM by solving a linear set of
evolutionary algorithms such as particle swarm optimiza- equations and using equality constraints instead. In the
tion (PSO) [70, 71], firefly algorithm [72], ant colony LSSVM method, a kernel function maps the finite input
optimization, artificial bee colony optimization, intelligent space into a higher-dimensional attribute space [82, 83].
water drop (IWD) approach [71], sequential genetic-based The LSSVM model can be expressed in Eq. (2) as:
negative correlation learning algorithm (SGNCL) [73] and X
N
genetic algorithms, the network learns the relationship yð xÞ ¼ ak K ðx; xk Þ þ b ð2Þ
between the variables and stores the values of the weights k¼1

and biases that give the lowest error between calculated where a and b are the solutions of the following equation:
and experimental data of the dependent variable (activity     
coefficient in this work) [68, 69, 74, 75]. A recent research a o IT 0
¼ T ð3Þ
paper on swarm intelligence optimization approaches for b I X þ c1 I y
ANN optimization is proposed by Kaur et al. [71] and
where a ¼ ½a1 ; . . .; aN T are Lagrange multipliers and b is
reviews all the optimizations techniques listed above. Some
the bias that are the solutions of Eq. (3);
of the earliest applications of artificial neural networks-
based QSPR models for the prediction of IDAC in mixtures I ¼ ½1; . . .; 1T ;
were proposed by Ajmani et al. [76], and the constructed
models were satisfactory. Another attempt at predicting the y ¼ ½y1 ; y2 ; . . .; yn T ;
IDAC for organic solutes in ionic liquids by artificial X is a square matrix; the element of the i row and j column
neural network was published in 2011 [74]. The authors is described as
investigated 24 structurally different organic compounds

123
Neural Computing and Applications

   
Xij ¼ ;ðxi ÞT ; xj ¼ K xi ; xj ð4Þ 3 Results and discussion
where K is the kernel function and c is the regularization 3.1 ANN modeling
parameter.
The LSSVM adapts as a lost function the square of the The data used in the ANN modeling are gathered from the
error n2k between calculated and experimental data, while literature expressing the activity coefficient at infinite
the SVM chooses simply the error nk between the two dilution of water in 53 ILs at six different temperatures
[82, 83]. with a total of 318 data points. The data set can be accessed
Accordingly, the characteristics of the two analogous in Supporting Information 1. The selection of appropriate
methods make the LSSVM a better computational learning inputs is preliminary step in modeling. The inputs have to
machine because it reduces significantly the complexity of be significant, which means they have a theoretical effect
the optimization algorithm, and this decreases the compu- on the target (output). In this study, the temperature is
tational burden and accelerates the computation time [84]. considered as an input given the fact that the activity
The advantages of the two methods reside in: (i) the coefficient is a thermodynamic property, so by definition it
regularization parameter, which results in avoiding over- is dependent on temperature. Also, since at infinite dilution,
fitting; (ii) the use of kernel trick and kernel engineering the molecule of solute is considered fully encircled by
that allow an explicit knowledge about the problem; (iii) in molecules of solvent (in our case IL) and only molecular-
SVM, the problem is solved by a convex optimization, level interactions between the solute and solvent are taking
which circumvents the local minima problem often place, the 3D structure of the IL is a crucial factor to
encountered with other computational learning machines activity coefficient at infinite dilution [86]; therefore, the
[85]. critical volume of the IL and its molecular weight are
Lately, in silico calculation of infinite-dilution activity considered as inputs in the neural network. On the other
coefficients of molecular solutes including water in ionic hand, since phase equilibrium between solute/solvent can
liquids was the subject of a more recent research project be expressed by the iso-fugacity criterion which can be
elaborated by Paduszynski [41]. The author gave a critical approximated by the modified Raoult’s law for vapor liquid
review of existing QSPR models and proposed new QSPR equilibrium:
models based on three machine learning algorithms.
yi P ¼ xi ci Psat
i ð5Þ
Stepwise multiple linear regression (SWMLR)-based,
FFANN-based and LSSVM-based models were compared where yi and xi are the molar fraction of component i in
in terms of RMSE and AARD between calculated and vapor and liquid phase, respectively. P is the pressure, ci is
experimental c1 . Significant large amount of data were the activity coefficient, and Psat
i is the saturated pressure of
employed in the work, namely 188 ILs and 128 solutes for component i at temperature T.
a total data points exceeding 34,000. The FFANN and Equation (5) can be written as:
LSSVM gave considerably lower errors than the SWMLR, yi P yi P
while it was demonstrated that LSSVM generated slightly ci ¼ ¼ : ð6Þ
xi Psat
i xi Psat
i
better results compared to FFANN [41].
In the aim of improving the modeling of the activity Since Psat
i is a function of temperature, Eq. 6 shows that ci
coefficient at infinite dilution of water in ionic liquids, our is a function of temperature and pressure.
present work employs FFANN, SVM and LSSVM to On the other hand, the relationship between the tem-
develop correlative/predictive models for IDAC calcula- perature, pressure, volume and compressibility factor can
tion based on properties of ionic liquids. For this purpose, be expressed by the following equation of state:
the independent variables selected as inputs for the mod- PV  ZRT ¼ 0 ð7Þ
eling are operating conditions which are the temperature
and the molecular weight and four pure component prop- where Z is compressibility factor and is a function of
erties of the ILs which are Tc, Pc, Vc and x. These lasts reduced temperature Tr, reduced pressure Pr and acentric
parameters were the subject of numerous researches using factor x.
ANN, and the results of those researches were promising Z ¼ f ðPr ; Tr ; xÞ ð8Þ
[68].
where Pr ¼ PPc and Tr ¼ TTc
where Tc and Pc are the critical temperature and critical
pressure of the component, respectively.
From the combination of Eqs. (7), (6) and (8), it can be
seen that activity coefficient ci is theoretically dependent

123
Neural Computing and Applications

on critical properties (Tc, Pc, Vc) and acentric factor (x) of activation functions in each layer, results in identifying a
the component; which justifies the choice of Tc, Pc and x final optimized network with a given type and architecture.
as additional inputs in the model. It is worth mentioning that in the case of using a nonlinear
Some of the properties of ionic liquids are calculated by function and considering a sufficient number of neurons,
Valderrama et al. [87] using group contribution method adopting more hidden layers is not recommended as this
based on the effect of functional groups present in the IL. may result in over-fitting [89], the reason why a one hidden
Similarly, for ILs with a molecular weight exceeding layer is considered for our neural network. Accordingly,
500 g/mol, the extension of a group contribution method the general structure of the ANN used in our modeling of
proposed by Valderrama et al. [88] is used to determine the the activity coefficient at infinite dilution of water in ionic
properties of ionic liquids with high molecular mass, liquids is depicted in Fig. 1, while its more detailed type
whereas for the ionic liquids whose properties were not and architecture are illustrated in Table 2.
calculated before, the same group contribution method is Levenberg–Marquardt backpropagation algorithm
applied to determine the properties. The experimental data locates the minimum of a multivariate function which
are divided into a training set, a test set and a validation set. represents the sum of squares of nonlinear real-valued
Each data set should be well distributed throughout the functions. The performance function is reduced in an
model space. Accordingly, two-thirds of the whole data are iterative way during the algorithm. To avoid the conver-
for training set (total 212 data points), and the remaining gence of the backpropagation training algorithm to a local
one-third of the whole data are equally divided into test minima instead of a global minima, optimization using
data set (53 data points) and validation (53 data points). ‘‘trainlm’’ function is repeated 50 times, with different sets
The validation and test data are referred to as a validation of initial weights and bias, and the best results in terms of
data (106 data points), and the neural network statistical error are saved.
parameters for validation set include the validation and test A Matlab code documenting the connection weights and
sets combined. The ranges of variables used are given in biases of the optimal network Logsig (6,16) and Linear
Table 1. (16,1) is provided in Supporting Information 4.
The ANN modeling is performed using MATLAB In order to assess the predictive ability of the optimized
(version 7.5). Before training, it is often useful to scale ANN model, the validation agreement vector and the val-
inputs and targets, so that they will fall within a specified idation agreement plot of the predicted versus experimental
range as this may help avoiding numerical overflows and dependent variable from the validation data set only are
accelerates the convergence of the algorithm. Thus, the analyzed. Using the Matlab function ‘‘postreg,’’ the plot of
present input and output parameters are normalized the predicted versus experimental validation data set and
between - 1 and 1 using Eq. (5) which can be expressed in the parameters of the linear regression are directly
Matlab as ‘‘mapminmax’’ and normalizes each parameter obtained.
(xi) of inputs and output into (xin) under the following The total data agreement scheme is shown in Fig. 2a
expression: which illustrates the ANN model’s calculated outputs
2fxi  minðxi Þg versus the experimental data with an agreement vector, [a
xin ¼ 1 ð9Þ (slope), b (y intercept), R (correlation coeffi-
fmaxðxi Þ  minðxi Þg
cient)] = [1.0004, 0.0011, 0.99997]. This reveals an ideal
where xin is the normalized variable. The normalization of modeling of the whole data by the optimized network.
inputs and output is made between - 1 and 1. However, Fig. 2b points up the validation agreement plot
After selection of T, M, Tc, Pc, Vc and x as appropriate which shows the predicted output values following exactly
inputs and the activity coefficient in the form of ‘‘lnc!
i ’’ as the trend of the experimental data values with an agreement
the output, the optimal ANN is selected through a trial-and- vector approaching the ideal, [a (slope), b (y intercept),
error procedure; screening different types of neural net- R (correlation coefficient)] = [1.0013,0.0034,0.99992]
works, numbers of layers, numbers of neurons in each which demonstrates the predictive ability and accuracy of
hidden layer, and using different training algorithms and our model.

Table 1 The variables ranges of the ANN used in this work


Variable T (K) Tc (K) Pc (Mpa) Vc (cm3/mol) M (g/mol) x c!
i

Ranges of variable (min– 318.15–368.15 767.19–1410.89 1.38–7.10 568.8–1476.7 169.3–605.2 0.1603–2.9211 17 9 10-3–
max) 9.36

123
Neural Computing and Applications

The statistical parameters of the NN model for the


training set, validation set and total data (training, valida-
T tion and test) are the correlation coefficient R2 and the root
mean square error RMSE (square root of the average sum
Tc
of squares), calculated for the predicted activity coefficient
Pc at infinite dilution of water in ionic liquids by Eq. (10), and
ln i
the average absolute relative deviation AARD is expressed
Vc in Eq. (11):
vffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
M u N  2
u 1 X 1Exp
RMSE ¼ t ci  c1cali ð10Þ
N i¼1

Fig. 1 Schematic illustration of the optimized ANN model

Table 2 Structure of the optimized ANN model


Type of Training algorithm Input Hidden layer Output layer
network layer
No. of No. of Activation function No. of Activation function
neurons neurons neurons

FFBP NN Levenberg–Marquardt 6 16 Logarithmic sigmoid transfer 1 Linear transfer function.


Matlab backpropagation function. Matlab function ‘‘logsig’’ Matlab function ‘‘purelin’’
function Matlab function
‘‘newff’’ ‘‘trainlm’’

Linear Fit: ln cal=(1)ln exp+(0.0011), R=0.99997 Linear Fit: ln cal=(1)ln exp+(0.0034), R=0.99992
3 3

Data Points Data Points


2 Best Linear Fit Best Linear Fit
2
ln cal = ln exp ln cal = ln exp

1
1

0
ln cal

cal

-1
ln

-1

-2

-2
-3

-3
-4

-5 -4
-5 -4 -3 -2 -1 0 1 2 3 -4 -3 -2 -1 0 1 2 3

ln exp ln
exp

(a) (b)

Fig. 2 Predicted versus experimental data agreement plot: a for the total data, b for the validation data

123
Neural Computing and Applications

100 X
N
c1Exp  c1cal data points for each ionic liquid. The new data points used
i i
AARDð%Þ ¼ ð11Þ for interpolation and extrapolation test belong to three ILs
N i¼1 c1Exp
i previously implicated in the study.
Table 3 summarizes the model statistical parameters for The results depicted in Fig. 4 show that the interpolated
the data sets concerning the training, validation and test. and extrapolated data fit perfectly in the trend of IDAC as a
The correlation coefficient R2 for the training set is 1, function of temperature and reveal a good agreement
which indicates that the variation in the training data and between the IDAC calculated using the NN model and the
the correlation are in ideal agreement. The test data cor- new data points; therefore, our model is also capable of
relation coefficient represents a comparison between the effectively estimating IDAC of water in 53 ILs at any given
experimental data and NN predicted results inside the temperature within the range [318.15 K–368.15 K] as this
range of experimental data in order to show the interpo- represents the interval of temperature where the model
lating ability of the ANN model. For the validation data set, fitting capability is actually tested.
the regression coefficient R2 is 0.99992, which shows
excellent agreement between experimental literature data 3.2 SVM modeling
and the NN predicted results within the ranges of inde-
pendent variables for which the model has been designed. In this paper, the SVM and LSSVM are employed for
For visual representation of the relative deviation nonlinear modeling of the IDAC of water in ionic liquids
between NN calculated and experimental IDAC, the reader using the same literature data used to develop the ANN
is referred to Fig. 3. As can be seen, the majority of the model. The SVM and LSSVM models generate a nonlinear
calculated points are aligned near zero relative deviation, relationship between the inputs, which are T, Tc ; Pc ; Vc ; x
except for few points. Overall, more than 99% of the data and M, and the output ln c1 . For that reason, a kernel
points relative deviations fall within the range ± 10%, and function is required for implicit mapping of training pat-
the couple points outside this range are most likely terns into feature space. A kernel can be any mathematical
explained by the fact that c!i is equal to 1 which makes ln
function that satisfies Mercer theorem. The choice of the
c!i zero and results in a high RD when dividing the dif- kernel and kernel/regularization parameters can be auto-
ference between calculated and experimental ln c! i by mated by optimization of model selection criterion [89];
zero. The high correlating performance of ANN in mod- however, this can be a tricky task and may result in severe
eling the activity coefficient at infinite dilution and the fact over-fitting the model selection criterion (typically cross-
that the ANN model allows obtaining IDAC directly validation based) when more than few hyper-parameter are
through operating temperature T, critical temperature Tc, to be tuned [85]. Different kernel functions are reported in
critical pressure Pc, critical volume Vc, acentric factor x the literature [90–94], and since there is no guarantee for a
and molecular weight M of the ionic liquid, without getting kernel to work better than the others on a specific data
involved in complicated molecular-level interactions, (other than measuring similarity), it is more practical to test
reveal the proficiency of ANN in modeling activity coef- the performance of different kernels in our modeling.
ficient at infinite dilution and call for testing its perfor- Therefore, in the present SVM modeling, the kernel func-
mance for expanded ranges of input variables. tions existing in the tool box R 2018a of Matlab are radial
In order to widen the spectrum of use/application of the basis function (RBF), polynomial, linear and Gaussian that
ANN model, the ability of the model for interpolation and are tested, and the function that gave the best results in
extrapolation is studied. The NN’s model interpolation terms of deviation and correlation coefficient is the kernel
ability is tested, not only using the test data, but using 15 used in our SVM modeling.
new data points at temperatures situated inside the ranges The data previously used to develop the ANN model are
of the variable but were not included in elaboration or exploited using the SVM method, partitioned equally
validation of the ANN model, whereas the extrapolation between training and testing using the SVM method. The
ability of the ANN model is assessed by withdrawing the inputs and output of the SVM model are normalized to a
temperature outside the ranges of the variable in a new two scale of - 1 to 1 by the mean of ‘‘premnmx’’ function, and

Table 3 Statistical parameters


Data set Training Validation (including test) Total
of ANN model developed in this
work Number of points 212 106 318
R2 1 0.99992 0.99997
RMSE 4.8614 9 10-6 0.0314 0.0181
AARD(%) 0.2983 1.0719 0.5562

123
Neural Computing and Applications

Fig. 3 Relative deviation 30


scatter plot for the NN model
20

Relative Deviation (%)


10

-10

-20

-30
50 100 150 200 250 300

N (data points)

11
[C4C1im][SCN] EXP
10 [C4C1im][SCN] CAL
[C2C1im][SCN] EXP
9
[C2C1im][SCN] CAL
[C4C1pip][SCN] EXP
8
[C4C1pip][SCN] CAL

6
IDAC

1
310 320 330 340 350 360 370 380
Temperature (K)

Fig. 4 Interpolation and extrapolation test results

later unnormalized using ‘‘postmnmx’’ function. The regularization parameter c are obtained through a trial-and-
mathematical expressions of the kernel functions used are error test [93]. Other methods of optimization of parame-
as follows: ters based on optimal design are discussed in depth by
  n o Stehlik et al. [95] and could be used as well. To perform
RBF : k Xi ; Xj ¼ exp Xi  Xj2 =2r2 the SVM and LSSVM modeling, the same software
   d MATLAB (version 7.5) used for ANN modeling is used.
Poly ðpolynomialÞ : k Xi ; Xj ¼ axT þ c The correlation coefficient R2 and AARD are the chosen
 
Linear : k Xi ; Xj ¼ xT y þ c statistical parameters of the models for performance
n o evaluation.
  1
Gaussian : k Xi ; Xj ¼ pffiffiffiffiffiffi exp Xi  Xj2 =2r2 Different permutations of meta-parameters, i.e., penalty
r 2p parameter C and loss function e, and different combina-
where d and r2 are user-defined kernel parameters, r2 is the tions of kernel parameters d and r2 are adopted on a trial-
width of RBF function and d is the degree of polynomial and-error basis. The combinations of parameters with dif-
function. In SVM and LSSVM modeling, the constant e in ferent kernel functions that gave the best performance were
the loss function and penalty parameter C must be defined recorded and set as optimal kernel parameters. An
by the user. In the work, optimal values of d and r2 and

123
Neural Computing and Applications

illustration of the hyper-parameters optimization results is coefficient R2, AARD and RMSE are 0.9950, 6.70% and
given in Fig. 5. 0.31, respectively.
Optimal values of kernel, penalty and loss function In the SVM model based on Gaussian kernel, optimal
parameter defining the best performance of the SVM values of hyper-parameters r2, e and C correspond to
model, along with the model’s statistical parameters used values of 2.44, 0.0020 and 195, respectively. The number
to assess the performance of the models, are summarized in of support vectors value of 195 is the same for all kernel
Table 4. functions employed in this study, except for the polynomial
Based on the tests’ results presented in Table 4, it can be of degree 3 in which it was found to be 158. This indicates
seen that the linear and the polynomial of degree 2, 2.5, 3 that all the training patterns were used by each kernel to
performed poorly in SVM modeling using our data. The build the optimal regression function that is later used to
fair correlation coefficient and high error that these func- make predictions of test data. Even though a model’s large
tions yield suggest their inaccuracy in SVM modeling the number of support vectors may be a sign of over-fitting of
activity coefficient using the present data set. Nonetheless, test data, the generalization performance of the optimal
despite the improved modeling results achieved with the model is demonstrated by the small difference in correla-
RBF kernel, the polynomial of degree 3.2 and Gaussian tion coefficients and errors between training and testing
kernel, the last function outperformed the two remaining indicating the good generalization performance of the
functions and is found to be the more capable of modeling model. Thus, the results confirm the proper setting of
our data by means of SVM technique; therefore, the hyper-parameters in this work
Gaussian kernel is implemented in the present SVM
modeling. 3.3 LSSVM modeling
As shown in Fig. 6, the SVM model exhibited accept-
able efficiency in correlating the global data, confirmed by In order to develop a third model for activity coefficient at
a correlation coefficient value of 0.9976 and a significant infinite dilution of water in ILs, the LSSVM learning
low average absolute relative deviation AARD of 3.44%. technique is employed. The data splitting is the same for
Also, the predictive ability of the SVM model is demon- LSSVM and SVM modeling (see previous section). The
strated by testing 159 data points which follow closely the inputs and output are normalized to a scale of - 1 to 1
trend of the experimental IDAC, except for few points for using Matlab function ‘‘mapminmax.’’‘ As mentioned
which the deviations between experimental and model previously in this article, the choice of the kernel function
predicted values are noteworthy. The correlation is crucial for model performance. Thus, many functions

AARD
228,0

205,5

183,0
200
160,5

4,4 138,0
150
4,2 115,5

4,0 93,00
AARD

3,8 100 70,50


AARD

3,6 48,00
3,4 50 25,50
3,2
4,000
138
,00
185 0
190 0 0
195
200
,0
2

205 50
2
2,

0
5
si
4

gm

210
2,

sigm
C
a

a
6
2,

215
8

10
2,

220
00
10
0
3,

Fig. 5 3-D optimization of kernel parameter r2 and C as a function of the error AARD (zoom of the optimal combination presented to the left of
the figure)

123
Neural Computing and Applications

Table 4 Models performance using optimal values of kernel, penalty and loss function parameters
Kernel function C e d r2 (Sigma) Quantity of support vectors Cross-validation error R2 (global) AARD (%) (global)

Linear 11 0.5914 – – 159 0.5672 0.8620 97.405


Gaussian 195 0.0020 – 2.44 159 9.9724 9 10-04 0.9976 3.44
Polynomial 76 0.211 2 – 159 0.1856 0.9547 35.81
Polynomial 1 0.241 2.5 – 159 0.1795 0.9614 37.13
Polynomial 26 0.6036 3 – 158 0.6811 0.8258 58.10
Polynomial 6 0.0151 3.2 – 159 0.0163 0.9961 8.14
RBF 500 0.0010 – 2.7 159 5.1677 9 10-04 0.9959 3.76
C Penalty parameter, e loss function, d kernel parameters, r2 width of RBF function, R2 correlation coefficients, AARD (%) average absolute
relative deviation, RBF Radial Basic Function

ln vs. ln , R=0.99761 ln vs. ln , R=0.99501


CalGlobal ExpGlobal CalTest ExpTest
3 3

Data Points Data Points


Best Linear Fit Best Linear Fit
2 2
ln CalGlobal = ln ExpGlobal ln CalTest = ln ExpTest

1
1

0
0
CalGlobal

CalTest

-1
ln

-1
ln

-2

-2
-3

-3
-4

-5 -4
-5 -4 -3 -2 -1 0 1 2 3 -4 -3 -2 -1 0 1 2 3
ln ln
ExpGlobal ExpTest

(a) (b)

Fig. 6 SVM predicted versus experimental data agreement plot: a for the global data, b for the test data

existing in the toolbox of LSSVM [96, 97] are tested in The criteria adopted as a measure of models performance
addition to exponential RBF and Cauchy kernel. The are correlation coefficient R2 and AARD for both global
mathematical expressions of the these lasts are as follows: and test data. The optimal values correspond to the best
   x  y performance of the LSSVM model. The results of testing
Exponential RBF : k Xi ; Xj ¼ exp  different kernels and kernel parameters are given in
2r2
  1 Table 5.
Cauchy : k Xi ; Xj ¼ 2 The two parameters r2 and c are optimized through the
1 þ xy
r2 first part of the training phase (tuning), and the optimal
The function that gives the best results in terms of values of the parameters are then used to perform the rest
correlation performance and least deviation is set as the of the computational calculations.
optimal function for the LSSVM modeling. The assessment Based on the results obtained using the trial-and-error
of the optimal kernel parameters d, r2 and regularization test, it may be observed that all kernel functions have
parameter c, along with the penalty parameter C and loss demonstrated a good degree of accuracy in predicting the
function’s constant e, is performed by a trial-and-error test. activity coefficient at infinite dilution of water in ILs,
except for linear kernel which yielded the highest error in

123
Neural Computing and Applications

Table 5 Evaluation of different LSSVM models using different kernels and optimal values of hyper-parameters
RBF Linear Polynomial (degree 4) Cauchy ERBF

R2 (total data) 0.9995 0.89336 0.99903 0.9995 0.99956


R2 (test data) 0.9991 0.89605 0.99825 0.99907 0.99919
AARD (global) (%) 2.36 62.9297 3.4199 2.0999 2.0388
AARD (test) (%) 3.94% 60.9036 6.5406 3.7905 3.6279
Kernel function’s parameters c = 3.07 9 107 c = 3.2367 c = 383.6916 c = 1.0221 9 107 c = 1.2833 9 107
2 2 2
r = 26.80 r = 165.5983 r = 11.5885 r2 = 12.7718

terms of AARD value of 62.93% and 60.90% between 3.4 Comparison between literature and this
global and test, respectively. Overall, the ERBF predicted work proposed models
values are much less scattered compared to other kernel
functions with a high correlation coefficient value of The performance of the models developed in this work is
0.99956 for training and test (global) and 0.99919 for compared to the performance of other literature models
testing. Also, the marginal loss between the R2 for training based on three types of learning algorithms which are
and testing in the case of ERBF is insignificant suggesting LSSVM, ANN and MLR. For this purpose, a LSSVM
the high generalization performance of the model in terms model that was previously proposed by Paduszynski [41]
of R2. Nevertheless, the AARD value of 2.04% for global aiming the prediction of IDAC of various solvents,
and 3.63% for testing demonstrates the high accuracy of including water, in any ionic liquid, along with a FFANN
the LSSVM model using the ERBF as a kernel function model developed in the same paper [41], is evaluated in
compared to the other models based on different kernels. terms of correlating and predicting accuracy. Both models
From Fig. 7a, b, it can be seen that the LSSVM predictions are tested for the same set of 53 ILs considered in this
of the ln c1 are in good agreement with the experimental work, but the inputs considered in both models are different
data. from the inputs selected in this work. Both LSSVM and
FFANN consider 72 inputs, which summarize group

ln vs. ln , R=0.99954 ln vs. ln , R=0.99911


CalGlobal ExpGlobal CalTest ExpTest
3 3

Data Points Datta Poin


nts
Best Linear Fit Best Linear Fit
2
ln CalGlobal =ln
n ExpGlobal 2 ln CalTest = ln ExpTestt

1
1

0
CalGlobal

0
CalTest

-1
ln
ln

-1

-2

-2
-3

-3
-4

-5 -4
-5 -4 -3 -2 -1 0 1 2 3 -4 -3 -2 -1 0 1 2 3
ln ln
ExpGlobal ExpTest

(a) (b)

Fig. 7 LSSVM predicted versus experimental data agreement plot: a for the global data, b for the test data

123
Neural Computing and Applications

contribution parameters and temperature, one output, involved in complicated thermodynamic computations.
which is the IDAC, and one hidden layer with 19 neurons The results show that the NN model developed in this work
for the FFANN model. Also, a previous MLR model pro- is more accurate in predicting the IDAC of water in 53 ILs
posed by Gonfa et al. [61] based on the same experimental than the LSSVM and FFANN techniques recently proposed
data, but using eight sigma profile molecular descriptors in the literature.
along with temperature as independent variables, was The literature MLR displayed clearly a weaker perfor-
brought to comparison. mance in terms of accuracy compared to the NN proposed
The applicable input data for the 53 ILs allowing the use in this paper. The RMSE of the proposed NN model is
of the final models and the reproduction of the results are more than ten times smaller than the MLR one which
provided in the Supporting Information 2. demonstrates the stability of the neural network model. The
As illustrated in Table 6, the ANN model proposed in comparative study carried out illuminated the effect that
the present work is capable of correlating far more accu- the input variables have over model’s performance.
rately the IDAC of water in the 53 ILs, while the proposed
LSSVM model generates a lower average absolute relative 3.5 Graphical user interface of the ANN model
deviation compared to the literature FFANN, literature
MLR, literature LSSVM and this work SVM models; the In order to simplify the complexity of the work and pro-
NN model proposed in this work predominates all models gramming performed in this paper, a graphical interface
with an AARD of 0.56%, more than forty times smaller (also called user interface) is developed to improve the
than this work’s LSSVM one. usability of the developed ANN model. It is a tool that
Accordingly, the parity plots demonstrated in Fig. 8 allows a direct effortless utilization of the NN model to
show the scatter of calculated versus experimental c1 i for calculate the IDAC of water in 53 ionic liquids. The user
three models chosen as example for comparison, namely graphical interface created in this work is depicted in
this work ANN model (Fig. 8a), literature LSSVM Fig. 9.
(Fig. 7b) and literature FFANN (Fig. 8c). As seen, the data
points land exactly along the diagonal in the case of the 3.6 Input variables’ sensitivity study
present NN model, which proves the accuracy of the pre-
sent modeling. However, the large scatters observed for the One of the main objectives of the sensitivity analysis is
two remaining techniques LSSVM and FFANN reveal a revealing the input parameters that hold the biggest effect
much lower fitting capability of the two models. on the output variable. Nevertheless, this study allows
Nonetheless, the fact that the literature FFANN model identifying the inputs that have a weak influence on the
considers a large number of inputs, namely functional output and permits discarding these parameters that add
groups (66) and Abraham solvation parameters (5), in unnecessary encumber in the modeling and focus instead
addition to temperature leads to more computing com- on the more significant parameters [98]. In this context,
plexity and a high-dimensional weights’ matrix (19 regression techniques are the most widely used, and in this
rows 9 72 columns) and bias column vector (19 rows), work, standardized regression coefficients method is used
and that for the hidden layer only. However, the ANN to carry out the sensitivity analysis. During this process,
proposed in this study outperforms the other models and experimental data of inputs and output are fed into the
accurately predicts infinite-dilution activity coefficient regression algorithm which builds a linear model between
through operating temperature (T), IL’s critical temperature each input and the output, whereby the resulting regression
(Tc), critical pressure (Pc), critical volume (Vc), acentric coefficients for each factor describe the sensitivity measure
factor (x) and molecular weight (M), without getting of that factor on the output [98].

Table 6 This work and


Model type Total data points R2 AARD (%) RMSE
literature models’ statistical
parameters comparison ANN (this work) 318 0.99997 0.56 0.0181
LSSVM (this work) 318 0.9995 2.36 0.0110
SVM (this work) 318 0.99761 3.44 0.2214
LSSVM [41] 318 0.9015 22.69 0.6971
FFANN [41] 318 0.9207 40.61 0.6214
MLR [61] 318 0.9618 – 0.2806
R2 Correlation coefficients, AARD (%) average absolute relative deviation, RBF Radial Basic Function

123
Neural Computing and Applications

vs. , R=0.99965 vs. , R=0.90148


Cal Exp cal Exp
10 10
Data Points
Data Points
Best Linear Fit
9 9 Best Linear Fit
=
Cal Exp
Cal
= Exp
8 8

7 7

6 6
Cal

cal
5 5

4 4

3 3

2 2

1 1

0 0
0 2 4 6 8 10 0 2 4 6 8 10

Exp Exp

(a) (b)

vs. , R=0.92073
Cal Exp
10
Data Points
Best Linear Fit
9
cal
= Exp

6
Cal

0
0 2 4 6 8 10

Exp

(c)

Fig. 8 Parity plots of the calculated versus experimental c1


i for the same data set. a NN in this work; b LSSVM technique [41]; c FFANN model
[41]

Our models have six inputs: T, Tc, Pc, Vc, x and M, and c1 = f ðT; Tc ; Pc ; Vc ; x; M Þ: The Monte Carlo sample in
one output ln c1 . The models describe ln this case can be represented as:

123
Neural Computing and Applications

1ðiÞ
Matlab function postreg ðln cCal ; ln c1ðiÞ Þ has to exceed
0.7, confirming that the regression model is able to corre-
late the given data. Figure 10 shows the regression model
performance for our case of study where Rlnc1 was calcu-
lated and found to be 0.8398, which ascertains the feasi-
bility of this method for the present sensitivity analysis. In
this case, the effect or sensitivity of each input on the
output is simply quantified and may be prearranged by the
absolute value of its bj .
The pie chart in Fig. 11 is a representation of the relative
importance of the six input variables on the output variable
calculated using the standardized regression coefficients
method. The results demonstrate that all the input variables
have a variable effect on the activity coefficient. The
molecular weight of the ionic liquid holds the highest
contribution with a relative importance of 57.69%, while
the rest of the inputs have less significant contributions,
Fig. 9 The user graphical interface elaborated in this work specifically 11.93%, 9.57%, 8.46%, 7.94% and 4.42%
8 9 corresponding to T, Pc ; x, Vc and Tc , respectively.
> ð1Þ ð1Þ ð1 Þ
>
> T ð1 Þ Tc Pc Vc xð1Þ M ð1 Þ > >
>
>
> >
ð2 Þ >
< T ð2 Þ Tc
ð2Þ
Pc
ð2Þ
Vc
ð2 Þ
xð2Þ M =
M¼ .. >; while
>
>
.. .. .. .. ..
. >
4 Conclusion
>
> . . . . . >
>
>
: ð318Þ ð318Þ ð318Þ ð318Þ >
;
T Tc Pc Vc xð318Þ M ð318Þ
In this work, three machine learning techniques are applied
lnc1ð1Þ to activity coefficient at infinite dilution of water in ionic
lnc1ð2Þ liquids-related literature data to develop and validate models
ln c1 ¼ ..
. capable of predicting this property. Three models based on
lnc 1ð318Þ ANNs, SVM and LSSVM are developed with the aim of
quantifying numerically the relationship that connects the
For example, in order to measure the sensitivity of the input variables to the output. For that reason, six indepen-
input T, the mean and standard deviation MT and STDT , dent variables are chosen, namely the temperature of the
respectively, of the factor T are calculated. Similarly, Mlnc1
and STDlnc1 are the corresponding quantities for the output Outputs vs. Targets, R=0.83981
10
ði Þ
ðiÞ xj Mxj
ln c1 : We consider the two entities x~j = STD and Y ¼
xj
1ð i Þ
lnc lnc M 1 8
STDlnc1 for i = 1,2,…,318 and j = 1,2,…,6 and look for
the regression model in Eq. (12)
1ðiÞ 6
lncCal  Mlnc1 X 6
ðiÞ
¼ bj x~j ð12Þ
STDlnc1
Exp

j¼1
4
ln

where bjs are the standardized regression coefficients and


are determined by least square error eðiÞ , with
2
1ðiÞ
eðiÞ ¼ lncCal  lnc1ðiÞ ð13Þ
Accordingly, bj values obtained in the present work are, 0

respectively, -0.1855, 0.0687, -0.1487, -0.1234, 0.1315


and 0.8968 corresponding to T, Tc ; Pc ; Vc ; x and M: How-
-2
ever, to be able to use those results as a measure of the 0 2 4 6 8 10

sensitivity of the inputs on the output, the regression ln


Cal
1ðiÞ
coefficient Rlnc1 of the fit lncCal ¼ f ðlnc1ðiÞ Þ using the
Fig. 10 Regression model performance plot

123
Neural Computing and Applications

the most pertinent parameter on the IDAC, followed by the


temperature, the critical pressure, the acentric factor and
critical volume of the IL, whereas the critical temperature
of the IL is found to be the least influential input, given its
relative importance is less than 5%. Despite the high
accuracy of the models developed in this work, in partic-
ular the ANN model, the modeling techniques used are all
correlative which require large sets of data and usually do
not expand well outside the range of data used to establish
models. Therefore, suggested improvements of this work
consist of broadening the research data, testing a larger
number of ILs and different solutes, and using other
learning methods such as evolutionary and hybrid algo-
rithms in order to develop other models and correlations
Fig. 11 Relative importance of input variables on the estimation of capable of prediction. Also, in light of the good results
IDAC of water in ionic liquids obtained using the input parameters in this work, it is
hoped, in the future, that researchers and engineers who
system water/ionic liquid, the critical temperature, critical apply AI methods for modeling will consider parameters
pressure, critical volume, acentric factor and molecular that are significant and translucently elected to represent
weight of the ionic liquid, whereas the activity coefficient at the phenomenon studied.
infinite dilution of water in ionic liquids in the form of ln c1
i
is set as the output dependent variable. Among the models Acknowledgements The authors gratefully acknowledge the Algerian
proposed in this work, the feedforward backpropagation Ministry of Higher Education and Scientific Research (CNEPRU
neural network model has shown the best performance. The Projects: Nos. J0102620110007 and J0102620140015) and the
University Yahia Fares of Medea.
optimal neural network consists of an input layer with 6
neurons, one hidden layer with 16 neurons and an output
Compliance with ethical standards
layer with one neuron. This model exhibits excellent cor-
relating/predicting ability elucidated by an overall average Conflict of interest Authors declare that there is no conflict of interest
absolute relative deviation of 0.56% and a correlation to declare.
coefficient of 0.99997, while the SVM and LSSVM models
yielded a higher deviation between experimental and model
calculated results. Moreover, the ANN model demonstrates
an excellent interpolation and extrapolation capability of the References
IDAC of water in 53 ionic liquids at any temperature within
1. Pham TPT, Cho CW, Yun YS (2010) Environmental fate and
the range [313.15 K–373.15 K]. To illustrate the superiority toxicity of ionic liquids: a review. Water Res 44:352–372. https://
of the proposed ANN model, the correlative performance of doi.org/10.1016/j.watres.2009.09.030
the proposed models is compared to three literature models 2. Petkovic M, Seddon KR, Rebelo LPN, Silva Pereira C (2011)
based on three methods: MLR, LSSVM and FFNN. The Ionic liquids: a pathway to environmental acceptability. Chem
Soc Rev 40:1383–1403. https://doi.org/10.1039/C004968A
comparison reveals that the proposed models have more 3. Carvalho PJ, Álvarez VH, Machado JJB et al (2009) High pres-
accurate correlation ability than the three literature models, sure phase behavior of carbon dioxide in 1-alkyl-3-methylimi-
which demonstrates that the choice of input variables is a dazolium bis(trifluoromethylsulfonyl)imide ionic liquids.
crucial step in a modeling process. In addition, including a J Supercrit Fluids 48:99–107. https://doi.org/10.1016/j.supflu.
2008.10.012
relatively high number of unnecessary inputs not only 4. Song HN, Lee B, Lim JS (2010) Measurement of CO2 solubility
complicates the calculation process, but also it may add a in ionic liquids: [BMP][TfO] and [P14,6,6,6][Tf 2 N] by mea-
factor of inaccuracy to it. suring bubble-point pressure. J Chem Eng Data 55:891–896.
A graphical user interface related to the best proposed https://doi.org/10.1021/je9005085
5. Bogel-Lukasik R, Matkowska D, Zakrzewska ME et al (2010)
model (ANN) has been elaborated in order to provide a The phase envelopes of alternative solvents (ionic liquid, CO2)
prediction of the IDAC of water in 53 ILs at any given and building blocks of biomass origin (lactic acid, propionic
temperature within the range studied. Finally, the input acid). Fluid Phase Equilib 295:177–185. https://doi.org/10.1016/j.
sensitivity analysis is elaborated using standardized fluid.2010.05.013
6. Domańska U, Bogel-Lukasik R (2005) Physicochemical proper-
regression coefficients method, appropriately. The results ties and solubility of alkyl-(2-hydroxyethyl)-dimethylammonium
represent a quantification of the contribution of each input bromide. J Phys Chem B 109:12124–12132. https://doi.org/10.
parameter on the output and reveal the molecular weight as 1021/jp058015c

123
Neural Computing and Applications

7. Lourenço C, Melo CI, Bogel-Łukasik R, Bogel-Łukasik E (2012) Chem Chem Phys 3:5192–5200. https://doi.org/10.1039/
Solubility advantage of pyrazine-2-carboxamide: application of b106900d
alternative solvents on the way to the future pharmaceutical 24. Huang J-F, Chen P-Y, Sun IW, Wang SP (2001) NMR evidence
development. J Chem Eng Data 57:1525–1533. https://doi.org/10. of hydrogen bonding in 1-ethyl-3-methylimidazolium-tetrafluo-
1021/je300044x roborate room temperature ionic liquid. Inorg Chim Acta
8. Alevizou EI, Voutsas EC (2013) Solubilities of p-coumaric and 320:7–11. https://doi.org/10.1016/S0020-1693(01)00477-7
caffeic acid in ionic liquids and organic solvents. J Chem Ther- 25. Mele A, Tran CD, De Paoli Lacerda SH (2003) The structure of a
modyn 62:69–78. https://doi.org/10.1016/j.jct.2013.02.013 room-temperature ionic liquid with and without trace amounts of
9. Neves CMSS, Batista MLS, Cláudio AFM et al (2010) Ther- water: the role of C-H…O and C–H…F interactions in 1-n-butyl-
mophysical properties and water saturation of [PF 6]-based ionic 3-methylimidazolium tetrafluoroborate. Angew Chemie
liquids. J Chem Eng Data 55:5065–5073. https://doi.org/10.1021/ 115:4500–4502. https://doi.org/10.1002/ange.200351783
je100638g 26. Dwan J, Durant D, Ghandi K (2008) Nuclear magnetic resonance
10. Pârvulescu VI, Hardacre C (2007) Catalysis in ionic liquids. spectroscopic studies of the trihexyl (tetradecyl) phosphonium
Chem Rev 107:2615–2665. https://doi.org/10.1021/cr050948h chloride ionic liquid mixtures with water. Cent Eur J Chem
11. Melo CI, Bogel-Łukasik R, Bogel-Łukasik E (2012) Combination 6:347–358. https://doi.org/10.2478/s11532-008-0034-3
of supercritical carbon dioxide and ionic liquid in a novel 27. Zhang QG, Wang NN, Yu ZW (2010) The hydrogen bonding
assembly of carvacrol. J Supercrit Fluids 61:191–198. https://doi. interactions between the ionic liquid 1-ethyl-3-methylimida-
org/10.1016/J.SUPFLU.2011.10.005 zolium ethyl sulfate and water. J Phys Chem B 114:4747–4754.
12. Bogel-Lukasik E, Santos S, Bogel-Lukasik R, Nunes Da Ponte M https://doi.org/10.1021/jp1009498
(2010) Selectivity enhancement in the catalytic heterogeneous 28. Fumino K, Wulf A, Ludwig R (2009) The potential role of
hydrogenation of limonene in supercritical carbon dioxide by an hydrogen bonding in aprotic and protic ionic liquids. Phys Chem
ionic liquid. J Supercrit Fluids 54:210–217. https://doi.org/10. Chem Phys 11:8790. https://doi.org/10.1039/b905634c
1016/j.supflu.2010.04.011 29. Hanke CG, Lynden-Bell RM (2003) A simulation study of water-
13. Hough WL, Smiglak M, Rodrı́guez H et al (2007) The third dialkylimidazolium ionic liquid mixtures. J Phys Chem B
evolution of ionic liquids: active pharmaceutical ingredients. 107:10873–10878. https://doi.org/10.1021/jp034221d
New J Chem 31:1429. https://doi.org/10.1039/b706677p 30. Katayanagi H, Nishikawa K, Shimozaki H et al (2004) Mixing
14. Smith K, Bridson R, Leeke G (2011) Solubilities of pharma- schemes in ionic liquid–H2O systems: a thermodynamic study.
ceuticals compounds in ionic liquids. J Chem Eng Data J Phys Chem B 108:19451–19457. https://doi.org/10.1021/
56(5):2039–2043 jp0477607
15. Lee T, Kuo CS, Chen YH (2006) Solubility, polymorphism, 31. Jiang W, Wang Y, Voth GA (2007) Molecular dynamics simu-
crystallinity, and crystal habit of acetaminophen and ibuprofen by lation of nanostructural organization in ionic liquid/water mix-
initial solvent screening. Pharm Technol 30:72–92 tures. J Phys Chem B 111:4812–4818. https://doi.org/10.1021/
16. Arce A, Earle MJ, Katdare SP et al (2007) Phase equilibria of jp0671421
mixtures of mutually immiscible ionic liquids. Fluid Phase 32. Domańska U, Marciniak A (2007) Activity coefficients at infinite
Equilib 261:427–433. https://doi.org/10.1016/j.fluid.2007.06.017 dilution measurements for organic Solutes and water in the ionic
17. Luis P, Ortiz I, Aldaco R, Irabien A (2007) A novel group con- liquid 1-ethyl-3-methylimidazolium Trifluoroacetate. J Phys
tribution method in the development of a QSAR for predicting Chem B 111:11984–11988. https://doi.org/10.1021/jp075079?
the toxicity (Vibrio fischeri EC50) of ionic liquids. Ecotoxicol 33. Domańska U, Marciniak A (2008) Measurements of activity
Environ Saf 67:423–429. https://doi.org/10.1016/j.ecoenv.2006. coefficients at infinite dilution of aromatic and aliphatic hydro-
06.010 carbons, alcohols, and water in the new ionic liquid
18. Torrecilla JS, Garcı́a J, Rojo E, Rodrı́guez F (2009) Estimation of [EMIM][SCN] using GLC. J Chem Thermodyn 40:860–866.
toxicity of ionic liquids in Leukemia Rat Cell Line and Acetyl- https://doi.org/10.1016/j.jct.2008.01.004
cholinesterase enzyme by principal component analysis, neural 34. Domańska U, Marciniak A (2010) Physicochemical properties
networks and multiple lineal regressions. J Hazard Mater and activity coefficients at infinite dilution for organic solutes and
164:182–194. https://doi.org/10.1016/j.jhazmat.2008.08.022 water in the ionic liquid 1-decyl-3-methylimidazolium tetra-
19. Seddon K (1999) Ionic liquids: designer solvents. In: Boghosian cyanoborate. J Phys Chem B 114:16542–16547. https://doi.org/
S, Dracopoulos V, Kontoyannis CG, Voyiatzis GA (eds) Sym- 10.1021/jp109469s
posium TIGP. Institute of Chemical Engineering and High 35. Domańska U, Marciniak A (2008) Activity coefficients at infinite
Temperature Chemical Processes, Patras, pp 131–135 dilution measurements for organic solutes and water in the ionic
20. Mai NL, Koo YM (2014) Quantitative prediction of lipase liquid 1-butyl-3-methylimidazolium trifluoromethanesulfonate.
reaction in ionic liquids by QSAR using COSMO-RS molecular J Phys Chem B 112:11100–11105. https://doi.org/10.1021/
descriptors. Biochem Eng J 87:33–40. https://doi.org/10.1016/j. jp804107y
bej.2014.03.010 36. Domańska U, Laskowska M (2009) Measurements of activity
21. Domańska U, Królikowska M, Acree WE, Baker GA (2011) coefficients at infinite dilution of aliphatic and aromatic hydro-
Activity coefficients at infinite dilution measurements for organic carbons, alcohols, thiophene, tetrahydrofuran, MTBE, and water
solutes and water in the ionic liquid 1-ethyl-3-methylimidazolium in ionic liquid [BMIM][SCN] using GLC. J Chem Thermodyn
tetracyanoborate. J Chem Thermodyn 43:1050–1057. https://doi. 41:645–650. https://doi.org/10.1016/j.jct.2008.12.018
org/10.1016/j.jct.2011.02.012 37. Domańska U, Paduszyński K (2010) Measurements of activity
22. Wlazło M, Marciniak A (2012) Activity coefficients at infinite coefficients at infinite dilution of organic solutes and water in
dilution and physicochemical properties for organic solutes 1-propyl-1-methylpiperidinium bis{(trifluoromethyl)sul-
and water in the ionic liquid 4-(2-methoxyethyl)-4-methyl- fonyl}imide ionic liquid using g.l.c. J Chem Thermodyn
morpholinium trifluorotris(perfluoroethyl)phosphate. J Chem 42:1361–1366. https://doi.org/10.1016/j.jct.2010.05.017
Thermodyn 54:366–372. https://doi.org/10.1016/j.jct.2012.05. 38. Domańska U, Królikowska M (2010) Measurements of activity
017 coefficients at infinite dilution in solvent mixtures with thio-
23. Cammarata L, Kazarian SG, Salter PA, Welton T (2001) cyanate-based ionic liquids using GLC technique. J Phys Chem B
Molecular states of water in room temperature ionic liquids. Phys 114:8460–8466. https://doi.org/10.1021/jp103496d

123
Neural Computing and Applications

39. Domańska U, Królikowska M (2011) Measurements of activity 55. Diedenhofen M, Klamt A (2010) COSMO-RS as a tool for
coefficients at infinite dilution for organic solutes and water in the property prediction of IL mixtures: a review. Fluid Phase Equilib
ionic liquid 1-butyl-1-methylpiperidinium thiocyanate. J Chem 294:31–38. https://doi.org/10.1016/j.fluid.2010.02.002
Eng Data 56:124–129. https://doi.org/10.1021/je101008y 56. Diedenhofen M, Eckert F, Klamt A (2003) Prediction of infinite
40. Domańska U, Lukoshko EV (2014) Thermodynamics and activity dilution activity coefficients of organic compounds in ionic liq-
coefficients at infinite dilution for organic solutes and water in the uids using COSMO-RS. J Chem Eng Data 48:475–479. https://
ionic liquid 1-butyl-1-methylmorpholinium tricyanomethanide. doi.org/10.1021/je025626e
J Chem Thermodyn 68:53–59. https://doi.org/10.1016/j.jct.2013. 57. Freire MG, Ventura SPM, Santos LMNBF et al (2008) Evalua-
08.030 tion of COSMO-RS for the prediction of LLE and VLE of water
41. Paduszyński K (2016) In silico calculation of infinite dilution and ionic liquids binary systems. Fluid Phase Equilib 268:74–84.
activity coefficients of molecular solutes in ionic liquids: critical https://doi.org/10.1016/j.fluid.2008.04.009
review of current methods and new models based on three 58. Banerjee T, Khanna A (2006) Infinite dilution activity coeffi-
machine learning algorithms. J Chem Inf Model 56:1420–1437. cients for trihexyltetradecyl phosphonium ionic liquids: mea-
https://doi.org/10.1021/acs.jcim.6b00166 surements and COSMO-RS prediction. J Chem Eng Data
42. Matheswaran P, Wilfred CD, Kurnia KA, Ramli A (2016) 51:2170–2177. https://doi.org/10.1021/je0602925
Overview of activity coeffient of thiophene at infinite dilution in 59. Kurnia KA, Pinho SP, Coutinho JAP (2014) Evaluation of the
ionic liquids and their modeling using COSMO-RS. Ind Eng conductor-like screening model for real solvents for the predic-
Chem Res 55:788–797. https://doi.org/10.1021/acs.iecr.5b04152 tion of the water activity coefficient at infinite dilution in ionic
43. Knoop C, Tiegs D, Gmehling J (1989) Measurement liquids. Ind Eng Chem Res 53:12466–12475. https://doi.org/10.
of.gamma.infin. using gas-liquid chromatography. 3. Results for 1021/ie5021415
the stationary phases 10-nonadecanone, N-formylmorpholine, 60. Gonfa G, Bustam MA, Sharif AM et al (2015) Tuning ionic
1-pentanol, m-xylene, and toluene. J Chem Eng Data liquids for natural gas dehydration using COSMO-RS method-
34:240–247. https://doi.org/10.1021/je00056a027 ology. J Nat Gas Sci Eng 27:1141–1148. https://doi.org/10.1016/
44. Krummen M, Gruber D, Gmehling J (2000) Measurement of j.jngse.2015.09.062
activity coefficients at infinite dilution in solvent mixtures using 61. Gonfa G, Bustam MA, Shariff AM et al (2016) Quantitative
the dilutor technique. Ind Eng Chem Res 39:2114–2123. https:// structure–activity relationships (QSARs) for estimation of activ-
doi.org/10.1021/ie990830p ity coefficient at infinite dilution of water in ionic liquids for
45. Vasiltsova TV, Verevkin SP, Bich E et al (2005) Thermodynamic natural gas dehydration. J Taiwan Inst Chem Eng 66:222–229.
properties of mixtures containing ionic liquids. activity coeffi- https://doi.org/10.1016/j.jtice.2016.06.027
cients of ethers and alcohols in 1-methyl-3-ethyl-imidazolium 62. Paduszyński K (2017) An overview of the performance of the
bis(trifluoromethyl-sulfonyl) imide using the transpiration COSMO-RS approach in predicting the activity coefficients of
method. J Chem Eng Data 50:142–148. https://doi.org/10.1021/ molecular solutes in ionic liquids and derived properties at infi-
je0497799 nite dilution. Phys Chem Chem Phys 19:11835–11850. https://
46. Kato R, Krummen M, Gmehling J (2004) Measurement and doi.org/10.1039/C7CP00226B
correlation of vapor-liquid equilibria and excess enthalpies of 63. Klamt A (2011) The COSMO and COSMO-RS solvation models.
binary systems containing ionic liquids and hydrocarbons. Fluid Wiley Interdiscip Rev Comput Mol Sci 1:699–709. https://doi.
Phase Equilib 224:47–54. https://doi.org/10.1016/j.fluid.2004.05. org/10.1002/wcms.56
009 64. Marin-Rimoldi E, Shah JK, Maginn EJ (2015) Monte Carlo
47. Gonfa G, Bustam MA, Murugesan T et al (2012) COSMO-RS simulations of water solubility in ionic liquids: a force field
based screening ionic liquids for separation of benzene and assessment. Fluid Phase Equilib 407:117–125. https://doi.org/10.
cyclohexane. Int J Chem Environ Eng 3:244–254 1016/j.fluid.2015.07.007
48. Shiflett MB, Maginn EJ (2017) The solubility of gases in ionic 65. Eike DM, Brennecke JF, Maginn EJ (2004) Predicting infinite-
liquids. AIChE J 63:4722–4737. https://doi.org/10.1002/aic. dilution activity coefficients of organic solutes in ionic liquids.
15957 Ind Eng Chem Res 43:1039–1048. https://doi.org/10.1021/
49. Ardabili SF, Najafi B, Shamshirband S et al (2018) Computa- ie034152p
tional intelligence approach for modeling hydrogen production: a 66. Gramatica P (2007) Principles of QSAR models validation:
review. Eng Appl Comput Fluid Mech 12:438–458. https://doi. internal and external. QSAR Comb Sci 26:694–701. https://doi.
org/10.1080/19942060.2018.1452296 org/10.1002/qsar.200610151
50. Zhang S, Chau K (2009) Dimension reduction using semi-su- 67. Tropsha A, Gramatica P, Gombar V (2003) The importance of
pervised locally linear embedding for plant leaf classification, being earnest: validation is the absolute essential for successful
948–955 application and interpretation of QSPR models. QSAR Comb Sci
51. Klamt A (2005) COSMO-RS: from quantum chemistry to fluid 22:69–77. https://doi.org/10.1002/qsar.200390007
phase thermodynamics and drug design, 1st edn. Elsevier Sci- 68. Si-Moussa C, Hanini S, Derriche R et al (2008) Prediction of
ence, Amsterdam high-pressure vapor liquid equilibrium of six Binary systems,
52. Klamt A, Eckert F (2000) COSMO-RS: a novel and efficient carbon dioxide with six esters, using an artificial neural network
method for the a priori prediction of thermophysical data of model. Braz J Chem Eng 25:183–199. https://doi.org/10.1590/
liquids. Fluid Phase Equilib 172:43–72. https://doi.org/10.1016/ S0104-66322008000100019
S0378-3812(00)00357-5 69. Khaouane L, Si-Moussa C, Hanini S, Benkortbi O (2012) Opti-
53. Klamt A, Jonas V, Bürger T, Lohrenz JCW (1998) Refinement mization of culture conditions for the production of pleuromutilin
and parametrization of COSMO-RS. J Phys Chem A from pleurotus mutilus using a hybrid method based on central
102:5074–5085. https://doi.org/10.1021/jp980017s composite design, neural network, and particle swarm optimiza-
54. Klamt A, Schüürmann G (1993) COSMO: a new approach to tion. Biotechnol Bioprocess Eng 17:1048–1054. https://doi.org/
dielectric screening in solvents with explicit expressions for the 10.1007/s12257-012-0254-4
screening energy and its gradient. J Chem Soc 2(5):799–805. 70. Taormina R, Chau KW, Sivakumar B (2015) Neural network
https://doi.org/10.1039/P29930000799 river forecasting through baseflow separation and binary-coded

123
Neural Computing and Applications

swarm optimization. J Hydrol 529:1788–1797. https://doi.org/10. 85. Cawley GC, Talbot NLC (2010) On over-fitting in model selec-
1016/j.jhydrol.2015.08.008 tion and subsequent selection bias in performance evaluation.
71. Kaur J, Kalra A, Sharma D (2018) Comparative survey of swarm J Mach Learn Res 11:2079–2107
intelligence optimization approaches for ANN optimization. Adv 86. Alessi P, Fermeglia M, Kikic I (1991) Significance of dilute
Intell Syst Comput 624:305–314. https://doi.org/10.1007/978- regions. Fluid Phase Equilib 70:239–250. https://doi.org/10.1016/
981-10-5903-2_32 0378-3812(91)85037-U
72. Moazenzadeh R, Mohammadi B, Shamshirband S, Chau K 87. Valderrama JO, Forero LA, Rojas RE (2012) Critical properties
(2018) Coupling a firefly algorithm with support vector regres- and normal boiling temperature of ionic liquids: update and a new
sion to predict evaporation in northern Iran. Eng Appl Comput consistency test. Ind Eng Chem Res 51:7838–7844. https://doi.
Fluid Mech 12:584–597. https://doi.org/10.1080/19942060.2018. org/10.1021/ie202934g
1482476 88. Valderrama JO, Forero LA, Rojas RE (2015) Extension of a
73. Kazemi SMR, Minaei Bidgoli B, Shamshirband S et al (2018) group contribution method to estimate the critical properties of
Novel genetic-based negative correlation learning for estimating ionic liquids of high molecular mass. Ind Eng Chem Res
soil temperature. Eng Appl Comput Fluid Mech 12:506–516. 54:3480–3487. https://doi.org/10.1021/acs.iecr.5b00260
https://doi.org/10.1080/19942060.2018.1463871 89. Yahyaee A, Nazifi M, Kianpour M, Heidar KT (2018) experi-
74. Nami F, Deyhimi F (2011) Prediction of activity coefficients at mental investigation and modeling of activity coefficient at infi-
infinite dilution for organic solutes in ionic liquids by artificial nite dilution of solutes using dicationic solvent based on
neural network. J Chem Thermodyn 43:22–27. https://doi.org/10. pyrrolidinium as a new stationary phase in gas chromatography.
1016/j.jct.2010.07.011 Am J Anal Chem 09:257–271. https://doi.org/10.4236/ajac.2018.
75. Wu CL, Chau KW (2011) Rainfall-runoff modeling using artifi- 94020
cial neural network coupled with singular spectrum analysis. 90. Samui P, Sitharam TG (2008) Least-square support vector
J Hydrol 399:394–409. https://doi.org/10.1016/j.jhydrol.2011.01. machine applied to settlement of shallow foundations on cohe-
017 sionless soils. Int J Numer Anal Methods Geomech
76. Ajmani S, Rogers SC, Barley MH et al (2008) Characterization of 32:2033–2043. https://doi.org/10.1002/nag.731
mixtures part 1: prediction of infinite-dilution activity coefficients 91. Vapnik V, Vapnik V, Golowich SE, Smola A (1996) Support
using neural network-based QSPR models. QSAR Comb Sci vector method for function approximation, regression estimation,
27:1346–1361. https://doi.org/10.1002/qsar.200860022 and signal processing. Adv Neural Inf Process Syst 9(9):281–287
77. Vapnik AV (1998) Statistical learning theory. Wiley, New York 92. Smola AJ, Schölkopf B (2004) A tutorial on support vector
78. Vapnik AV (1999) The nature of statistical learning theory, 2nd regression. Stat Comput 14:199–222. https://doi.org/10.1023/B:
edn. Springer, New York STCO.0000035301.49549.88
79. Vapnik AV (1995) The nature of statistical learning theory. 93. Samui P, Sitharam TG, Kurup PU et al (2008) OCR prediction
Springer, New York using support vector machine based on piezocone data. J Geotech
80. Pourbasheer E, Riahi S, Ganjali MR, Norouzi P (2009) Appli- Geoenviron Eng 134(6):894–898
cation of genetic algorithm-support vector machine (GA-SVM) 94. Al-Anazi AF, Gates ID (2012) Support vector regression to
for prediction of BK-channels activity. Eur J Med Chem predict porosity and permeability: effect of sample size. Comput
44:5023–5028 Geosci 39:64–76. https://doi.org/10.1016/j.cageo.2011.06.011
81. Vapnik V, Golowich SE, Smola AJ (1997) Support vector 95. Stehlı́k M, Hermann P, Torres S et al (2017) On dynamics
method for function approximation, regression estimation and underlying variance of mass balance estimation in Chilean gla-
signal processing. In: Mozer MC, Jordan MI, Petsche T (eds) ciers. Ecol Complex 31:149–164. https://doi.org/10.1016/j.eco
Advances in neural information processing systems, vol 9. MIT com.2017.06.008
Press, Cambridge, pp 281–287 96. LSSVM toolbox. https://www.esat.kuleuven.be/sista/lssvmlab.
82. Suykens JAK, Vandewalle J (1999) Least squares support vector Accessed 15 Jun 2018
machine classifiers. Neural Process Lett 9:293–300. https://doi. 97. Brabanter K De, Karsmakers P, Ojeda F, et al (2010) LS-SVMlab
org/10.1023/A:1018628609742 toolbox user’s guide
83. Wang X, Chen J, Liu C, Pan F (2009) Hybrid modeling of 98. Andrea S, Stefano T, Francesca C, Marco R (2004) Sensitivity
penicillin fermentation process based on least square support analysis in practice: a guide to assessing scientific models.
vector machine. Chem Eng Res Des 88:415–420. https://doi.org/ Chichester, England. https://doi.org/10.1002/0470870958
10.1016/j.cherd.2009.08.010
84. Ghaslani D, Eshaghi Gorji Z, Ebrahimpoor Gorji A, Riahi S Publisher’s Note Springer Nature remains neutral with regard to
(2017) Descriptive and predictive models for Henry’s law con- jurisdictional claims in published maps and institutional affiliations.
stant of CO2 in ionic liquids: a QSPR study. Chem Eng Res Des
120:15–25. https://doi.org/10.1016/j.cherd.2016.12.020

123

You might also like