Download as pdf or txt
Download as pdf or txt
You are on page 1of 11

Quantification of human sperm concentration using machine

learning-based spectrophotometry
Ali Lesani1, Somaieh Kazemnejad2, Mahdi Moghimi Zand1,*, Mojtaba Azadi3, Hassan Jafari4,
Mohammad R.K. Mofrad5, Reza Nosrati6,*
1 Small Medical Devices, BioMEMS and LoC Lab, School of Mechanical Engineering, College of Engineering, University of Tehran, Iran
2 Reproductive Biotechnology Research Center, Avicenna Research Institute, ACECR, Tehran, Iran
3 School of Engineering, San Francisco State University, US
4 Department of Biostatistics and Health Informatics, Institute of Psychology Psychiatry and Neuroscience, King's College London, United Kingdom
5 Departments of Mechanical & Bioengineering, UC Berkeley, Berkeley, US
6 Department of Mechanical & Aerospace Engineering, Monash University, Clayton, Victoria 3800, Australia

* Corresponding authors
M. Moghimi Zand. Small Medical Devices, BioMEMS and LoC Lab, School of Mechanical Engineering, College of Engineering, University of Tehran, Iran
Email: ahdimoghimi@ut.ac.ir
R. Nosrati. Department of Mechanical and Aerospace Engineering, Monash University, VIC 3800, Australia.
Email: Reza.Nosrati@monash.edu

ABSTRACT
Spectrophotometry is an indirect non-invasive and quantitative method for specifying materials with unknown contents based
on absorption behavior. This paper presents the first application of artificial neural network in spectrophotometry for
quantification of human sperm concentration. A well-trained full spectrum neural network (FSNN) model is developed by
examining the absorption response of sperm samples from 41 human subjects to different light spectra (wavelength from 390 to
1100 nm). It is shown that this FSNN accurately estimates sperm concentration based on the full absorption spectrum with over
93% prediction accuracy, and provides 100% agreement with clinical assessments in differentiating the samples of healthy donor
from patient samples. We suggest the machine learning-based spectrophotometry approach with the trained FSNN model as a
rapid, low-cost, and powerful technique to quantify sperm concentration. The performance of this technique is superior to
available spectrophotometry methods currently used for semen analysis and will provide novel research and clinical opportunities
for tackling male infertility.

KEYWORDS
Semen analysis, Sperm concentration, Spectrophotometry, Artificial Neural Network

parameters [5, 12], and can be used as the first stand-alone


step to diagnose male infertility. Moreover, sperm
1 | INTRODUCTION concentration for each individual changes on a daily basis [13],
and for a reasonable judgement on the male fertility health,
Infertility is on the rise worldwide [1, 2], affecting one in six three cycles of semen analyses are to be conducted over the
couples with almost half of the cases caused by male infertility course of 2 to 3 weeks, each at least seven days apart [14].
factors [3, 4]. Male infertility is commonly attributed to low Additionally, follow-up treatments need even more frequent
sperm count (oligospermia), low sperm vitality analysis of semen to track subsequent changes in sperm
(necrospermia), poor sperm motility (asthenospermia), and concentration [14, 15].
abnormal sperm morphology (teratospermia) [5]. Other Conventional methods for sperm concentration analysis
quality metrics such as semen volume and DNA integrity are includes hemocytometry, computer-aided semen analysis
also shown to be correlated with male infertility, however, (CASA), microfluidic methods, and spectrophotometry [16].
with less direct association [6]. Despite the rising trend of Hemocytometry is the most traditional and oldest method for
infertility, improvements in conventional diagnostic methods quantifying cell concentration by counting the number of cells
have been infrequent, and new technological development is in a chamber of known volume. Besides being laborious and
required to allow for more accurate semen analysis outside of prone to the operator error, this method is the third most
fertility clinics or for at-home testing [2, 7]. imprecise method for estimating sperm concentration [17].
Semen analysis to quantify sperm concentration, vitality, CASA is the gold standard method for semen analysis in
motility, and morphology is crucial to infertility diagnosis [2, clinics. In CASA, captured image sequences from at least 100
8-10]. Current clinical semen analysis methods demonstrate sperm in the sample are processed in real-time using an
about 89.6% sensitivity, with the ability to accurately advanced computer-based image processing unit to quantify
diagnose 9 out of 10 infertile men [5]. Routinely, the first step sperm concentration and motility [18]. CASA system is
in clinics is to quantify sperm concentration, motility and expensive, while system-to-system variations between the
morphology as the three critical indicators [11]. In up to 90% image processing units can also influence calculated sperm
of male infertility cases, low sperm concentration exhibits a concentrations [19]. Microfluidic methods [2, 20],
strong positive correlation with other abnormal semen particularly in paper-based formats [3, 21] and in combination
with cell-phone based applications [10] have also provided previously unseen sets of input variables [38]. The size of the
new opportunities for low-cost and accessible semen analysis. training dataset considerably influences the performance of
While promising, lack of standardization and reproducibility the trained network, while increasing the number of input
have prevented adoption and/or commercialization of variables can also improve the prediction accuracy by
developed microfluidic technologies, mainly due to variation decreasing the generalization error [39]. Due to well-
between different cell-phone-based imaging devices and characterized and multi-input nature of spectrophotometry
challenges accessioned with handling delicate volumes of (absorption intensity as a function of wavelength), ANN
fluid. More recently, cell-phone based systems have been used models can be employed to standardize and improve the
as a capturing device to overcome issues associated with the prediction accuracy of the method for sperm concentration
lack of standardization procedures, providing a portable and analysis. An artificial neural network can be used to build a
more standard analyzing system [22]. correlation between absorption spectrum and sperm
Spectrophotometry is a precise method for quantifying concentration to more accurately predict sperm concentration.
cell concentration and is routinely used for estimating sperm Here, using a UV-Visible spectrophotometer, we
concentration in wild and captive animals such as bull [23], introduce a well-trained ANN approach to evaluate human
horse [24], and aquatic species [12, 25]. This method is based sperm concentration, by establishing a correlation between
on measuring the transmission of light through a known the full absorption spectrum and sperm concentration. This
volume of sample, in which light absorption values at specific method is the first application of artificial neural network in
wavelengths are used to quantify sample concentration [12]. spectrophotometry for sperm concentration analysis, and by
According to Beer-Lambert law, the absorption magnitude, A, using the full absorption spectrum rather than only one
is equal to the log ratio of transmitted light intensity, I, to the optimal wavelength, significantly improves over the
initial light intensity, Io, as A=log(I0/I) [26]. Using this method, traditional spectrophotometry-based methods to predict sperm
sperm concentration in unknown samples is estimated based concentration. Due to the inherent nature of
on a previously established linear correlation between spectrophotometry, our method is rapid and low-cost with
absorption intensity (recorded at a set wavelength) and sperm over 93% accuracy in predicting sperm concentration,
concentration [27]. This method has been applied to quantify providing identical clinical outcomes for tested patient and
bull sperm concentration, demonstrating good agreement with donor samples.
that of the hemocytometry method [28]. While
spectrophotometry is rapid, accessible and well-suited to
process a large number of samples, the method is very time 2 | M ate r ial and Me th od s
sensitive as the absorption reading varies significantly with
time [16]. Moreover, the optimal wavelength for the most 2.1 | Sample Preparation
accurate prediction of sperm concentration varies from
species to species and/or setups. For examples, this method The semen samples were collected from 41 male subjects
has been applied to quantify sperm concentration by recording between 30 and 45 years old who were referred to Avicenna
the absorption intensity at 400 nm for ray-finned fish and bull Infertility Clinic for treatment, with ~15% of male partners
[12, 23, 29], at 480 nm for salmonid fish [25], and at 550 nm experiencing infertility issues and the samples covering a
for roosters [30]. This variability reduces the prediction wide range of patient and donor samples. All subjects signed
accuracy of the methods where the most appropriate an informed consent form and the study was approved by the
wavelength is not applied [12], preventing its widespread ethical Committee of Avicenna Research Institute (Tehran,
application, particularly for human semen analysis. Moreover, Iran). The study excluded subjects with a history of round
due to inherent nature of spectrophotometry, the sensitivity of cells (≥ 1 × 106 per ml), radiation, smoking, alcohol
the method is influenced by the concentration of somatic cells consumption, drug abuse, systemic disease, leukocytospermia,
and debris in the raw semen sample that additionally absorb and varicocele. Semen samples were collected after 2–5 days
light and result in overestimating sperm concentration of sexual abstinence, liquefied and prepared by standard
compared to conventional CASA systems. swim-up technique. In brief, 1 ml of the semen specimen was
Machine learning and artificial neural networks (ANNs) gently mixed with 1 ml of Ham’s F10 medium (Sigma, USA)
in particular have enabled rapid and accurate analysis of supplemented with 3% human serum albumin (HSA) (Sigma,
complex biological data for applications in areas such as USA) and centrifuged at 330 g for 10 min. The supernatant
active matter [31, 32], single cell analysis [33], and disease was removed and 2 ml of Ham’s F-10 medium supplemented
diagnostics [34]. With respect to diagnostics, machine with 3% HSA was added to the pellet. Subsequently, a
learning has been used in combination with image processing MedeaLab CASA system (MTG, Germany, Ver. 4.1) with a
techniques for monitoring organ function and predicting MD-30 CCD camera and a sperm motility analyzer (VT
stroke risk [34]. Deep learning, a machine learning technique Sperm 3.1) was used for clinical measurement of sperm
for processing big data, has been used recently in male fertility concentration and progressive motility, according to World
for classification of human sperm based on their Health Organization (WHO) guidelines [40]. Finally, 0.5 ml
morphological characteristics [35], predicting sperm DNA of the washed sample was diluted in 1 ml (3-folds dilution),
integrity from morphological parameters [36], and selection 1.5 ml (4-folds dilution), and 2.5 ml (6-folds dilution) of
of sperms with high DNA integrity [37]. ANNs are supervised Ham’s F10 medium supplemented with 3% HSA for
learning algorithms that can be trained to learn a relationship assessment.
between a set of input variables and a designated dependent
output variable to predict that output variable for new and
2
2.2 | UV-Visible Spectrophotometry each with a two-layer backpropagation network, were
developed using Matlab R2019b software for predicting
Absorption magnitudes versus wavelength were recorded sperm concentration from absorption spectrum (Figure 2), one
using UV-Visible Spectrophotometer (Perkin Elmer, Lambda based on absorption peaks (5 peaks at 390, 502, 615, 929, and
25), as shown in Figure 1a. To calibrate the device, sample 1075 nm) and the other one based on the full absorption
and base cuvettes were both filled with Ham’s F-10 medium spectrum. The choice of backpropagation was because it is the
supplemented with 3% HSA, and the machine was zeroed. most popular training algorithm for adjusting network weights
The recorded absorption spectrum was used as a reference to and biases [42]. Among the 41 tested human semen specimens,
be subtracted from sperm sample spectra. The sample cuvette 26 samples were used in the model development phase (70%
was then loaded with 0.5 mL of washed semen sample. To for training, 15% for testing and 15% for validation), and an
prevent inaccurate readings due to probable and gradual additional 15 samples (previously unseen by the model) were
settlement of semen samples, the sample was remixed by used for external validation, ensuring the generality of the
gently shaking the closed-cap sample cuvette prior to each trained network for prediction and avoid overfitting of the
measurement. The wavelength was swept from 390 nm to data. The optimum number of neurons in each hidden layer
1100 nm (1 nm interval), and the measurement was repeated was also optimized through assessing 400 ANN structures
twice for each sample (sample volume was divided into two (for 5 times per structure for a total of 2000 cases) to achieve
equal parts and tested separately). Each test took about 5 the lowest RMSE for best prediction accuracy (Table S1 and
minutes, and all the experiments were performed at 11 am to Table S2 in the Supplementary Material). It is noteworthy that
minimize the sensitivity of the method to the time-frame. the prediction accuracy was defined as the percentage ratio
Furthermore, linear regressions between sperm absorbance between predicted concentrations from the model and the
and concentration were generated at 5 absorption peaks. clinical values (100 × model value/clinical value), as a clear
metric to demonstrate how closely the estimated
2.3 | Artificial Neural Networks (ANNs) concentration from the model compares with the clinical
measurements. To validate the performance of the model, 13-
ANNs are computational organizations composed of linked fold cross validation was also performed on the dataset, with
neurons that learn a relationship between a set of paired inputs the results demonstrating an average classification accuracy
and outputs to predict output values for another set of of 92% for the dataset.
previously unseen input dataset – well-suited to process big
and complex but related datasets [41]. Two neural networks,

Figure 1. (a) UV-Visible spectrophotometry experimental setup. (b) A representative absorption spectrum for a tested semen specimen,
demonstrating 5 absorption peaks at 390, 502, 615, 929, and 1075 nm. (c) The effect of sample dilution on absorption intensity for a semen
specimen with the initial raw concentration of 70 Million/ml. Absorption pattern is identical and only lower absorption magnitudes were
recorded.
Where Conci is sperm concentration for sample
number i. Purelin (linear, P), Logsig (log-sigmoid, L) and
Tansig (Hyperbolic tangent sigmoid, T) were used as transfer
functions, and Levenberge-Marquardt (LM) and Bayesian
Regularization (BR) were used as the training methods to
characterize the performance of the models (representative
RSME values for SPNN model is shown in Figure S1 in the
Supplementary Material). The optimum number of neurons in
the first and second hidden layers (between 1 to 20) was also
optimized through assessing 400 ANN structures to achieve
the lowest RMSE for best prediction accuracy and avoiding
local minimums, as listed in Table S2 in the Supplementary
Material.

2.4 | Data Analysis

Absorption spectra from a total of 41 semen specimens were


recorded, out of which 26 samples were used for establishing
the calibration curves and training the artificial neural
networks and the other 15 samples were used as unknown
samples for testing the external validity. To validate the
performance of the ANN models, 13-fold cross validation was
also performed on the dataset, with the results demonstrating
an average classification accuracy of 92%. For cross
validation, the dataset was randomly partitioned into 13 parts,
where 12 parts formed the training set and the remaining part
formed the test set. Each 13-fold cross validation procedure
was repeated three times to ensure stability, and the average
value was reported as the classification accuracy. A univariate
linear regression model was also used to measure R-squared
Figure 2. The structure of constructed neural networks for and root of mean square error for the prediction of each model
estimating sperm concentration using (a) SPNN model based on compared to the clinical measures in external samples.
5 absorption peaks, and (b) FSNN model based on the full Furthermore, linear mixed models (LMM) were used to
absorption spectrum for predicting sperm concentration (711
compare the goodness of fit for each linear regression, SPNN,
inputs per sample). Abs and Wav denote absorption and
wavelength, respectively. and FSNN models (Table S2 in the Supplementary Material).
Lower RMSE, Akaike Information Criterion (AIC), Bayesian
A multi-input selected peak neural network (SPNN) information criterion (BIC) and log likelihood indicates a
model was trained by using peak absorptions for each of the better fit. The "lme" function of the NLME package (Pinheiro
26 training samples as the input dataset and their J., Bates D., DebRoy S., Sarkar D. and R Core Team - 2017)
corresponding sperm concentration as the output dataset in R was used.
(Figure 2a). The inputs were the product of peak absorption
wavelengths (Wi=390, 502, 615, 929 and 1075 nm) and the
corresponding normalized absorption values (Abs i). The 3 | RES ULTS AND DIS CUSS ION
maximum absorption value among all tested samples was
used for normalization. Absorption measurement at the peak 3.1 | Absorption-Sperm Concentration Correlation
wavelengths introduces the smallest error due to their
relatively high signal-to-noise ratio [43], improving Figure 1b shows a representative absorption spectrum for a
measurement accuracy especially for relatively small human sample with 70 million per millilitre (M/ml) sperm
absorbance values. A full spectrum neural network (FSNN) concentration, demonstrating 5 absorption peaks at 390, 502,
model was also trained by using the full absorption spectrum 615, 929, and 1075 nm. To characterize the effect of sperm
for each training sample as the input (711 data points per concentration on absorption spectrum, the semen specimen
sample) and the corresponding sperm concentration as the (with 70 M/ml sperm concentration) was also diluted 3-, 4-,
output (Figure 2b). and 6- folds and absorption spectra were recorded (Figure 1c).
Recorded absorption spectra for diluted samples followed the
The performance of SPNN and FSNN models were
same pattern as the original sample but shifted down by
optimized for different combinations of training methods,
diluting the sample. Specifically, the absorption peak
transfer functions, and structure of hidden layers to achieve
positions were almost independent of the applied dilution (i.e.
the lowest root mean square error based on Equation (1).
change in sperm concentration), however, the peak
N 2 absorptions were considerably influenced and shifted down
RMSE =  (Conci, predict − Conci,experiment )
i =1
N (1) for lower sperm concentrations. As shown in Figure 1c, the
change in the peak absorption values was more pronounced at
4
higher wavelengths, particularly for peaks at wavelength Using a simple linear regression approach, absorption
higher than 600 nm. Specifically, by diluting the sample 4- peaks at 390, 502, 615, 929, and 1075 nm for 26 tested human
and 6-folds, the absorption magnitude at wavelength higher samples were used to establish calibration curves for sperm
than 600 nm decreased on average by 6% and 13%, concentration (Figure 4a and Figure S2 in the Supplementary
respectively, but only by 2.5% and 4% at lower wavelengths Material). The results demonstrate relatively weak
(<600 nm). Compared to the sample with 3-folds dilution, the correlations, with the best correlation obtained for the
peaks at 1084 nm decreased considerably by 10% and 28% by calibration curve at 1075 nm (R2=0.49, P≤0.005). Figure 4a
diluting the sample 4- and 6-folds, respectively. As a result of shows the calibration curve for sperm concentration based on
this nonlinear change, the maximum absorption was observed the absorption peak at 1075 nm. Change in the dominant
for the 612 nm peak by diluting the sample 6-folds, instead of absorption peak for high and low sperm concentrations as well
the maximum peak at 1075 nm for samples of higher sperm as the slight shift in the peak position for different sperm
concentrations. The results demonstrate that the absorbed concentrations (Figure 3) contribute to the weak linear
light intensity is correlated with sperm concentration, shifting correlation obtained here. Figure 4b compares the clinically
down for samples with lower concentrations. measured sperm concentrations with estimated values from
To establish a correlation between sperm concentration the linear regression model based on the calibration curve at
and absorption intensity, 26 human samples with sperm 1075 nm. For 15 human samples tested, with concentrations
concentrations ranging from 4 M/ml to 70 M/ml and sperm ranging from 5 to 65 M/ml, the simple regression model
motility ranging from 10% to 65% were tested (Figure 3). For resulted in less than 78% prediction accuracy (R2=0.68) with
samples of different concentrations, 5 absorption peaks were estimated sperm concentrations within 22% of the clinically
observed, shifting by less than 5% to be at 385-399 nm, 490- measured concentrations. The results demonstrate that a
515 nm, 590-618 nm, 925-950 nm, and 1055-1090 nm. All of simple linear correlation between absorption intensity and
the tested samples demonstrated the visible range peak at 590- sperm concentration is not powerful enough to be used as a
618 nm, but with lower peak absorption intensities for lower comprehensive tool for estimating sperm concentration from
sperm concentrations. For samples with sperm concentration an absorption spectrum.
lower than 20 M/ml (Figure 3a), the 490-515 nm peak mostly
disappeared, and the first dominant absorption peak was 3.2 | Artificial Neural Network (ANN) Models to Predict
observed at 590-618 nm. Moreover, the two absorption peaks Sperm Concentration
at 925-950 nm and 1055-1090 nm were also only observed for
samples with concentrations above 40 M/ml, but not for A multi-input selected peak neural network (SPNN) model
samples of lower concentrations. The absorption behaviour and a full spectrum neural network (FSNN) model were
was also compared for samples of different concentrations but developed to estimate sperm concentration from absorption
the same motility of 50% (obtained from clinical spectrum. These ANN models overcome the relatively weak
measurements), as shown in Figure 3b. Samples with fixed prediction accuracy of a simple linear regression model (78%
sperm motility of 50% demonstrated the same behavior, and accuracy) by establishing a weighted correlation graph to
absorption spectra shifted down at lower concentrations. It is account for the non-linear correlation between absorption
noteworthy that, while absorption spectra were influenced by spectrum and sperm concentration. Recorded absorption
sperm concentration, no clear and comprehensive correlation spectra from 26 tested samples were used to train, test and
between any of the individual absorption peaks and sperm validate the algorithms (70% for training, 15% for test and
concentration was observed. 15% for validation). Additionally, the external validity of the
trained algorithms were tested using absorption spectra from

Figure 3. Absorption spectra for human semen specimens of different concentrations. (a) Absorption spectra for 9 semen specimens ranging
in sperm concentration from 4 M/ml to 70 M/ml and in motility from 10% to 65%. Sample with higher sperm concentrations (>30 M/ml)
exhibit higher absorption intensity with distinct peaks at 929 nm and 1075 nm. (b) Absorption spectra for semen specimens of different
sperm concentrations but with fixed motility of 50%. Dashed lines indicate peak positions.
Figure 4. (a) Calibration curve for sperm concentration based on
Figure 5. Sperm concentration from clinical measurements
the absorption peak at 1075 nm. (b) Clinically measured sperm
versus values predicted from (a) the SPNN model based on the
concentration values versus predicted sperm concentration from
absorption peaks, and (b) the FSNN model based on the full
the calibration curve in a.
absorption spectrum. Each data point is the average value of 3
the other 15 previously unseen samples (Table S2 in the independent measurements with error bars as one standard
deviation.
Supplementary Material).
regression model is mainly attributed to the multi-input nature
3.2.1 | Multi input Selected Peak Neural Network (SPNN) of the SPNN model to account for the change in the dominant
model absorption peak for samples of different concentrations
(rather than only using the peak at 1075 nm).
A selected peak neural network (SPNN) model was developed
to estimate sperm concentration based on the five absorption 3.2.2 | Full Spectrum Neural Network (FSNN) model
peaks at 390, 502, 615, 929, and 1075 nm (product of peak
absorptions and corresponding wavelengths were used as the To develop a comprehensive prediction tool, a full spectrum
input, Figure 2a). The SPNN model with Levenberge- neural network (FSNN) model was constructed that estimates
Marquardt (LM) as the training method and with Logsig and sperm concentration based on the full absorption spectrum
Tansig as the transfer functions for the first and second hidden (711 data points per sample, Figure 2b). The minimum RMSE
layers, respectively, resulted in the minimum RMSE of 0.034 of 0.086 was achieved for the FSNN model with Levenberge-
(Figure S1 in the Supplementary Material). This optimal Marquardt (LM) as the training method and with Tansig and
SPNN model maps five input variables (absorption peaks) Logsig as the transfer functions for the first and second hidden
onto a single output (sperm concentration) with 9 neurons in layers respectively. The best FSNN model maps 711 input
first hidden layer and 19 neurons in the second hidden layer variables (absorption values and corresponding wavelengths)
(denoted as 5:9:19:1), listed as structure number 179 in Table onto a single output (sperm concentration) with 12 neurons in
S1 in Supplementary Material. Figure 5a compares the the first hidden layer and 20 neurons in the second hidden
clinically measured sperm concentrations with predicted layer (denoted as 711:12:20:1), listed as structure number 240
values from the SPNN model, for 15 human samples with in Table S1 in Supplementary Material. Figure 5b compares
concentrations ranging from 5 to 65 M/ml (n=15). Results sperm concentrations from clinical measurements with values
from the SPNN model strongly correlated (R2=0.90, P≤0.001) predicted by the FSNN model, for the same 15 human samples
to clinical measurements and were within 14% of the clinical tested with the SPNN model. Sperm concentration from the
values, demonstrating a prediction accuracy of 86%. This 8% FSNN model strongly correlated (R2=0.98, P≤0.0001) with
improvement in prediction accuracy over the simple linear clinical measurements, with predicted values within 7% of the

6
clinical measurements (93% prediction accuracy). The FSNN
model demonstrated over 7% and 15% improvement in
prediction accuracy as compared with the SPNN and simple
regression models, respectively. The FSNN model accurately
estimates sperm concentration by fully accounting for the
non-linear correlation between sperm concentration and
absorption spectrum, including the change in the dominant
absorption peak and the shift in the peak position for samples
of different concentrations. It is noteworthy that, both SPNN
and FSNN models were also trained and evaluated to also
estimate sperm motility from recorded absorption spectra for
clinically tested samples with sperm concentrations ranging
from 4 M/ml to 70 M/ml and sperm motility ranging from
10% to 65% (i.e. ANN models with sperm concentration and
motility as output parameters). However, the models were
incapable of predicting sperm motility and estimated Figure 6. Direct comparison of sperm concentrations from the
concentrations were independent of sperm motility. linear regression, SPNN, and FSNN models against clinically
To evaluate the performance of the developed models to measured values. Dashed line indicates the lower reference limits
estimate sperm concentration, Figure 6 compares sperm of 15 M/ml for human sperm concentration based on the WHO
concentrations from the linear regression, SPNN and FSNN guidelines.
models against clinically measured values, for patient and correlation between absorption spectrum and sperm
donor samples ranging in concentration from 5 to 65 M/ml concentration. Sperm concentration from the FSNN model
(n=15). A lower reference limit of 15 M/ml is defined by strongly correlated (R2=0.98) with clinical measurements and
WHO for sperm concentration (dashed line in Figure 6) [39], were within 7% of the clinical values, demonstrating a
which is used in clinics as a diagnostic threshold for male prediction accuracy of 93% with over 15% improvement over
infertility. While the linear regression and SPNN models the linear regression model. The FSNN model provided 100%
failed to distinguish between donor and patient samples, the agreement with clinical measurements in terms of clinical
FSNN model provided 100% agreement with clinical outcome for patients to accurately distinguish between donor
measurements in terms of clinical outcome for patients. As and patient samples. However, increasing sample
shown in Figure 6, sperm concentration for Sample number 3, representativeness in terms of sperm concentration by
6, 8, 10 and 13 were all below the threshold value of 15 M/ml increasing the sample size, could possibly increase the
based on clinical measurements and the FSNN model, while prediction accuracy of our method and better validate the
the SPNN model only identified Sample number 3 and 13 as model for clinical adoption. Collectively, the FSNN model
patient samples and the linear regression model failed to provides a rapid and powerful tool for quantifying sperm
identify the patient samples. This improvement in prediction concentration, improving on current spectrophotometry
accuracy and distinguishing between donor and patient methods for semen analysis and providing novel opportunities
samples is significant, enabling a spectrophotometry-based for male infertility diagnosis. While increasing the sample
method for semen analysis with clinical outcomes identical to size, can better inform the model for estimating sperm
those of the conventional CASA systems. The FSNN model concentration and further increase the prediction accuracy, the
also demonstrated the best prediction performance based on FSSN model still demonstrates its performance for male
our statistical analysis (Table S2 in the Supplementary infertility diagnosis. Utilizing the novel machine learning-
Material), resulting in lowest RMSE, AIC and BIC compared based spectrophotometric approach is also suggested for
to SPNN and linear regression models. The performance of samples with DNA fragmentation in search for an effective
the FSNN model was tested against three developed Support diagnostic tool.
Vector Regression (SVR) models (see the Supplementary
Material). The FSNN model outperformed the SVR models
by at least 13% in predicting sperm concentration, while the ACKNOWLEDGMENTS
SVR models also fail to distinguish between donor and patient
samples. Taken together, the FSNN model provides a rapid We gratefully acknowledge the support from the Australian
and powerful tool for quantifying sperm concentration using Research Council Discovery Program (DP190100343) and
spectrophotometry. Monash Interdisciplinary Research Grants to R.N. We also
thank Reza Nobakht Hassanlouei for his help with Artificial
4 | CONCLUS ION S Neural Network.

In summary, we demonstrated a machine learning-based AUTHOR CONTRIBUTIONS


spectrophotometry approach for quantifying human sperm
concentration. The full spectrum neural network (FSNN) A.L., M.M.Z., and R.N. designed the research. A.L.
model accurately estimates sperm concentration based on the performed research with assistance from S.K. for analyzing
full absorption spectrum by establishing a weighted sperm concentration and conducting spectrophotometry
correlation graph to fully account for the non-linear experiments and H.J. for statistical analysis. A.L., R.N.
analyzed data. All authors were involved in writing and Holographic imaging of unlabelled sperm cells for semen analysis: a
review. Journal of Biophotonics, 2015. 8(10): p. 779-789.
editing the paper. 19. Amann, R.P., Katz, David F., Andrology lab corner*: Reflections on
casa after 25 years. Journal of andrology, 2004. 25(3): p. 317-325.
20. Segerink, L., Sprenkels, Ad, Braak, Paul, Vermes, Istvan, Van den Berg,
CONFLICT OF INTEREST Albert, On-chip determination of spermatozoa concentration using
electrical impedance measurements. Lab on a chip, 2010. 10: p. 1018-
24.
The authors have no competing interests to declare. 21. Coppola, M.A., Klotz, K.L., Kim, K.-a., Cho, H.Y., Kang, J., Shetty, J.,
Howards, S.S., Flickinger, C.J., Herr, J.C., SpermCheck® Fertility, an
immunodiagnostic home test that detects normozoospermia and severe
REFERENCES oligozoospermia. Human Reproduction, 2010. 25(4): p. 853-861.
22. Ilhan, H.O., Aydin, Nizamettin, Smartphone based sperm counting - an
1. Agarwal, A., Mulgund, Aditi, Hamada, Alaa Chyatte, Michelle Renee, A alternative way to the visual assessment technique in sperm
unique view on male infertility around the globe. Reproductive biology concentration analysis. Multimedia Tools and Applications, 2020. 79(9):
and endocrinology, 2015. 13: p. 37. p. 6409-6435.
2. Nosrati, R., Graham, Percival J., Zhang, Biao, Riordon, Jason, Lagunov, 23. Muhammad Anzar, T.K., and Mary M. Buhr, Comparison of Different
Alexander, Hannam, Thomas G., Escobedo, Carlos, Jarvi, Keith, Sinton, Methods for Assessment of Sperm Concentration and Membrane
David, Microfluidics for sperm analysis and selection. Nature Reviews Integrity with Bull Semen. Journal of Andrology, 2009. 30(6).
Urology, 2017. 14(12): p. 707-730. 24. Rigby, S., Dickson,, Varner, Dickson, Thompson, James, Love, Charles,
3. Nosrati, R., Gong, Max, San Gabriel, Maria, Pedraza, Claudio, Zini, Brinsko, Steven, Blanchard, Terry, Measurement of Sperm
Armand, Sinton, David, Paper-Based Quantification of Male Fertility Concentration in Stallion Ejaculates Using Photometric or Direct Sperm
Potential. Clinical Chemistry, 2016. 62. Enumeration Techniques. Proc. Am. Ass. equine Practnrs, 2001. 47.
4. Khatun, A., Rahman, Md Saidur, Pang, Myung-Geol, Clinical 25. Azadeh Hatef, H.N., Bagher Mojazi Amiri, Sayed Mohammad Hadi
assessment of the male fertility. Obstetrics & gynecology science, 2018. Alavi & Mahmoud Karami, Sperm density, seminal plasma composition
61(2): p. 179-191. and their physiological relationship in the endangered Caspian brown
5. Kumar, N., Singh, Amit Kant, Trends of male factor infertility, an trout (Salmo trutta caspius). Aquaculture Research, 2007. 38: p. 1175-
important cause of infertility: A review of literature. Journal of human 1181.
reproductive sciences, 2015. 8(4): p. 191-196. 26. Wypych, G., PHOTOPHYSICS, in Handbook of Material Weathering
6. Harris, I.D., Fronczak, Carolyn, Roth, Lauren, Meacham, Randall B., (Sixth Edition), G. Wypych, Editor. 2018, ChemTec Publishing. p. 1-26.
Fertility and the aging male. Reviews in urology, 2011. 13(4): p. e184- 27. Dascanio, J., Calibrated Spectrophotometer Evaluation of Sperm
e190. Concentration. 2014. p. 357-359.
7. Nosrati, R., Gong, Max M., San Gabriel, Maria C., Zini, Armand, Sinton, 28. Cardeal, C., Alberton, Luiz, Boscarato, André, Rocha, Mirelly,
David, Paper-based sperm DNA integrity analysis. Analytical Methods, Marchetti, Lucas Macedo, Gustavo Lourenço, Emerson, Martins,
2016. 8(33): p. 6260-6264. Leonardo, Comparative study between the hemocytometric and
8. Dissanayake, D.M.I.H., Keerthirathna, W. L. R., Peiris, L. Dinithi C., spectrophotometric methods for measuring the sperm concentration of
Male Infertility Problem: A Contemporary Review on Present Status and young Nelore bulls. Semina: Ciências Agrárias, Londrina, 2017. 38(6):
Future Perspective. Gender and the Genome, 2019. 3: p. p. 3605-3612.
2470289719868240. 29. T. Muiño-Blanco, I.C., A. Casao, R. Pérez-Pé and José A. Cebrián-Pérez,
9. Santi, D., Spaggiari, Giorgia, Simoni, Manuela, Sperm DNA Understanding the sperm separation principles used in assisted
fragmentation index as a promising predictive tool for male infertility reproductive technology as a turning point for biomedical students.
diagnosis and treatment management; meta-analyses. Reproductive Microscopy: advances in scientific research and education (A. Méndez-
BioMedicine Online, 2018. 37(3): p. 315-326. Vilas, Ed.), 2014.
10. Kanakasabapathy, M.K., Sadasivam, Magesh, Singh, Anupriya, Preston, 30. Froman DP, R.D., Validation of a spectrophotometer-based method for
Collin, Thirumalaraju, Prudhvi, Venkataraman, Maanasa, Bormann, estimating daily sperm production and deferent duct transit. Poultry
Charles L., Draz, Mohamed Shehata, Petrozza, John C., Shafiee, Hadi, Science, 2012. 91(10): p. 2621-7.
An automated smartphone-based diagnostic assay for point-of-care 31. Riordon, J., Sovilj, Dušan, Sanner, Scott, Sinton, David, Young, Edmond
semen analysis. Science Translational Medicine, 2017. 9(382): p. 7863. W. K., Deep Learning with Microfluidics for Biotechnology. Trends in
11. Fisch, H., Declining worldwide sperm counts: disproving a myth. Biotechnology, 2019. 37(3): p. 310-324.
Urologic Clinics of North America, 2008. 35(2): p. 137-146. 32. Cichos, F., Gustavsson, Kristian, Mehlig, Bernhard, Volpe, Giovanni,
12. Eric Leclercq, L.A., Agne`s Bardon-Albaret, Corey R Anderson, Carly Machine learning for active matter. Nature Machine Intelligence, 2020.
R Somerset & Eric A Saillant, Spectrophotometric determination of 2(2): p. 94-103.
sperm concentration and short-term cold-storage of sperm in Atlantic 33. Jones, D.T., Setting the standards for machine learning in biology.
croaker Micropogonias undulatus L. broodstock. Aquaculture Research, Nature Reviews Molecular Cell Biology, 2019. 20(11): p. 659-660.
2012: p. 1-12. 34. Rong, G., Mendez, Arnaldo, Bou Assi, Elie, Zhao, Bo, Sawan,
13. Welliver, C., Benson, Aaron D., Frederick, Luke, Leader, Benjamin, Mohamad, Artificial Intelligence in Healthcare: Review and Prediction
Tirado, Edna, Feustel, Paul, Kontio, James, McAsey, Mary, Köhler, Case Studies. Engineering, 2020.
Tobias S., Analysis of semen parameters during 2 weeks of daily 35. Riordon, J., McCallum, Christopher, Sinton, David, Deep learning for
ejaculation: a first in humans study. Translational andrology and the classification of human sperm. Computers in Biology and Medicine,
urology, 2016. 5(5): p. 749-755. 2019. 111: p. 103342.
14. World Health Organisation (WHO) laboratory manual for the 36. Wang, Y., Riordon, Jason, Kong, Tian, Xu, Yi, Nguyen, Brian, Zhong,
examination of human semen and sperm-cervical mucus interaction. Junjie, You, Jae Bem, Lagunov, Alexander, Hannam, Thomas G., Jarvi,
1999: Cambridge university press. Keith, Sinton, David, Prediction of DNA Integrity from Morphological
15. Opsahl, M., Dixon, N., Robins, E., Cunningham, D., Single vs. multiple Parameters Using a Single-Sperm DNA Fragmentation Index Assay.
semen specimens in screening for male infertility factors. A comparison. Advanced Science, 2019. 6(15): p. 1900712.
The Journal of reproductive medicine, 1996. 41: p. 313-5. 37. McCallum, C., Riordon, Jason, Wang, Yihe, Kong, Tian, You, Jae Bem,
16. Kumar, K.V., Reddy, B Ram, Krishna, K Sai, Comparison of different Sanner, Scott, Lagunov, Alexander, Hannam, Thomas G., Jarvi, Keith,
methods for assessing sperm concentration in infertility workup: A Sinton, David, Deep learning-based selection of human sperm with high
review. International Journal of Medicine and Medical Sciences, 2013. DNA integrity. Communications Biology, 2019. 2(1): p. 250.
5(9): p. 396-400. 38. Xu, C., Jackson, Scott A., Machine learning and complex biological
17. Prathalingam NS, H.W., Revell SG, Fazeli AR, Watson PF, A novel data. Genome Biology, 2019. 20(1): p. 76.
method using fluorometry to evaluate sperm membrane integrity in bulls. 39. Sug, H., The effect of training set size for the performance of neural
Proceedings of the Society of Reproduction and Fertility, 2006. networks of classification. W. Trans. on Comp., 2010. 9(11): p. 1297–
18. Di Caprio, G., Ferrara, Maria Antonietta, Miccio, Lisa, Merola, 1306.
Francesco, Memmolo, Pasquale, Ferraro, Pietro, Coppola, Giuseppe, 40. Cooper TG, N.E., von Eckardstein S, Auger J, Baker HW, Behre HM,
Haugen TB, Kruger T, Wang C, Mbizvo MT, Vogelsong KM., World

8
Health Organization reference values for human semen characteristics.
Hum. Reprod. Update, 2010. 16(3): p. 231-245.
41. Reed, R.D. and R.J. Marks, II, Neural smithing : supervised learning in
feedforward artificial neural networks. 1999: Cambridge (Mass.) : MIT
press.
42. Aggarwal, K.K., et al., Bayesian Regularization in a Neural Network
Model to Estimate Lines of Code Using Function Points. Journal of
Computer Science, 2005. 1(4).
43. CHAPTER 3 - Errors in Spectrophotometry, in Studies in Analytical
Chemistry, L. Sommer, Editor. 1989, Elsevier. p. 78-94.

You might also like