Download as pdf or txt
Download as pdf or txt
You are on page 1of 6

2012 International Conference on Biomedical Engineering (ICoBE),27-28 February 2012,Penang

Automatic Classification of Infant Cry:A Review


J. Saraswathy, M.Hariharan, Sazali Yaacob and Wan Khairunizam
School of Mechatronic Engineering,
Universiti Malaysia Perlis (UniMAP),
02600 Perlis, Malaysia
wathy_87@ymail.com

Abstract — This paper reviews the some of significant works on Automatic Speech Recognition (ASR) process which
infant cry signal analysis proposed in the past two decades and comprises two main stages that are: signal processing and
reviews the recent progress in this field. The cry of baby cannot pattern classification [5-8]. The main aim of this intelligent
be predicted accurately where it is very hard to identify for what system is to discriminate the different types of infant cries and
it cries for. Experienced parents and specialists in the area of identify the pathological cry. The automatic system receives
child care such as pediatrician and pediatric nurse can
the raw cry signal of newborn baby as the input through the
distinguish different sort of cries by just making use their
individual perception on auditory sense. This is totally subjective microphone. In the signal processing step the raw signal is
evaluation and not suitable for clinical use. Non-invasive method first pre processed: normalization, cleaning and filtering. Then
has been widely used in infant cry signal analysis and has shown the smoothed signal is analyzed and by using suitable feature
very promising results. Various feature extraction and extraction techniques the salient feature vectors are derived. In
classification algorithms used in infant cry analysis are briefly the case of higher complexity of features, the features are
described. This review gives an insight on the current state of the normally reduced into lower-dimension space by using
art works in infant cry signal analysis and concludes with suitable feature reduction methods. Then the reduced feature
thoughts about the future directions for better representation or vector undergone classification phase where it is compared
and interpretation of infant cry signals.
with knowledge that the computer has to produce a result [6].
Keywords-acoustic analysis, infant cry, abnormalities, non invasive, III. ACOUSTIC ANALYSIS
signal processing, pattern recognition
Literature works have been emphasized that feature
I. INTRODUCTION extraction plays an important role in the field of automatic
infant cry analysis. The suitable extracted acoustic features
Almost half decades ago, the attention has been made by may lead to an accurate classified output. In recent, some
pioneers on acoustic analysis of infant cry to investigate the simple feature extraction techniques have been used by
relationship between cry signal and physiological or physical researches in the infant acoustic analysis. This part briefly
status of newborn babies. This is because cry is a multimodal describes the methods utilized by pioneers.
behavior which involves limbs movement, facial expressions
which change over time and also conveys a lot of information A. Pitch Analysis
about a baby as shown in figure 1[1, 2]. The cry of baby
cannot be predicted accurately where it is very hard to identify Pitch is the one of the interested and distinguished
for what it cries for. Experienced parents and specialists in the acoustic feature to analysis in infant cry classification area. In
area of child care such as pediatrician and pediatric nurse can neonate cry classification, many researches used pitch based
distinguish different sort of cries by just making use their features and have been shown prominent results.
individual perception on auditory sense. This is totally In 2004, Gyorgy Varallyay et al. [9] proposed three
subjective evaluation and not suitable for clinical use [3]. different methods for the detection of fundamental frequency.
Therefore, by considering this problem, an automatic system In their work, only the discussion about the difference of the
to classify infant cries was firstly has been suggested in 1990’s ratio of dominant frequency and fundamental frequency of
by Cohen and Zmora which successfully classified a normal and deaf cries were reported. Gyorgy Varallyay [1]
preliminary database that consist of hunger and pain cries of reported a work based on time domain analysis mainly from
healthy neonates [4]. Researches have proposed approaches the duration of the signal. In his study, pitch, formant and
for problems of two class domain or more than two classes. other features such as segment density, specific segment
Some of the significant works on the classification of normal length, average pause length and average segment length are
and pathological infant cry is as shown in Table 1 and will be analyzed and compared between normal babies and infants
described briefly in Section III. with hearing disorder whereas the classification part is not
conducted. On following, Dror Lederman in 2010 [10]
II. AUTOMATIC INFANT CRY RECOGNITION approached the estimation of fundamental frequency for four
different types of cries (Normal, Cri-du-Chat, Hydrocephalus
Generally, infant cry automatic classification process
and Cleft Palate) using simple inverse filtering tracking
(Fig.1) is a pattern recognition problem. It is similar to
method (SIFT) which capable of overcoming the under-

978-1-4577-1991-2/12/$26.00 ©2011 IEEE 543


accuracy. O.F. Reyes-Galaviz and his co-
Infant Cry
researchers [16-17] have also described the same
work by using time delay neural network (TDNN)
and achieved an accuracy of 98.67%. Furthermore,
in [18] the normal, hypoacoustic and asphyxia cry
Weight Gender signals have been discriminated by using MFCC
Health Identity coefficients and fuzzy relational neural network
which constructed using triangular membership
function. The accuracy was 88%. The same work
First cry Emotions Preterm Vs
Fullterm
[7] also presented in different way by using
different membership function that is trapezoidal
Pathologies membership function and an improved accuracy
was obtained (98%) compare to the work in [18]
Figure 1: Different levels of information conveyed in infant cry. using triangular membership function. In 2005, O.F.
(identifying diseases mostly highlighted among researchers) [2] Reyes-Galaviz et al. [19] proposed a research by
utilizing MFCCs and time delay neural network to
estimation and over-estimation of the cry classify normal, hypoacoustic and asphyxia
fundamental frequency. In his study, the estimation neonates cry signals. The accuracy was varied from
of fundamental frequency for different cries only 96.08% to 97.39%. In [8] an evolutionary neural
concentrated and analyzed where the pattern network (Evolutionary-TDNN) classified the
recognition step is not carried out. different types of neonate cries with the accuracy
of 96.79%. In 2006, Barajas-Montiel [20] has
B. Mel-Frequency Cepstrum Coefficients conducted two studies of three class recognition
(MFCCs) problem using normal, asphyxia and hypoacoustic
Mel frequency cepstral coefficients are one of cry signal as well as pain, hunger and no-pain-no
the common and well known acoustic features hunger infant cry signals. He used MFCC in signal
which have been explored by many researchers in processing step and fuzzy support vector machine
infant cry classification analysis [7, 8, 11-29]. Mel (FSVM) for recognition phase. The best accuracies
frequency cepstral coefficients are used to encode of the two studies are 94.98% and 97.83%
the speech signals that contain irrelevant respectively.
information with large amounts of storage space
into compressed and useful features [11]. In 2008, Dror Lederman [21] has been
In 1995, M. Petroni et al. [12] has presented a system to discriminate the babies with
successfully classified three different types of cleft palate with palate plate and cleft palate
infant cries using MFCCs and artificial neural without palate plate. He used parallel hidden
networks (ANNs) and achieved an accuracy of markov model (PHMM) as classifier and the
90.4% by using the cry signals of 16 normal babies system obtained accuracy of 90%. In 2009, Azlee
which were recorded under three different stimulus Azlee Zabidi and et al. [22] used MFCC features to
situations (pain, anger, and fear). Dror Lederman analyze the normal and hypothyroidism cry signals
[13] used the MFCC features for characterizing the based on F-ratio analysis. Karen Santiago et al. [23]
normal and cleft palate infant cry signal. The attempted two class domain problem (normal and
continuous density hidden markov model (CD- hyperbilirubinemia) and three class problem
HMM) employed as classifier and resulted (normal, asphyxia and hyperbilirubinemia). They
accuracy of 91%. In other work the same author extracted MFCC in feature extraction method and
[13] differentiated the normal neonates and infants Type 2 fuzzy pattern matching used for recognition
with Respiratory Distress Syndrome (RDS) with phase. In other work, Reyes-Galaviz et al. [24]
the accuracy of 63% by using continuous density have implemented evolutionary neural system for
hidden markov model (CD-HMM). J.O Garcia et al. classifying infant cries into two class problems,
[11, 14-15] have published three papers on normal versus pathological (asphyxia or
classification of normal and deaf cry signals using hypoacoustic) and the best accuracy of their
MFCC features. In their works they implemented proposed system was up to 100%.
the scaled gradient conjugate for recognition
Data Pre- Feature
process and proposed the principal component acquisition Processing Extraction
analysis (PCA) as a feature reduction method. The
best accuracies of their works were varied from
96.80% to 97.43%. In 2004, O. F. Reyes-Galaviz et Classification Feature
al. [16] have implemented an adaptive neuro fuzzy Reduction
inference system (ANFIS) for classifying normal,
hypoacoustic and asphyxia cry signals by using Figure 2: Block Diagram of Infant Cry Automatic Recognition
Phases
MFCCs features. The system yielded 96% of

544
In 2010, R. Sahak and et al. [25] have also applied prediction coefficients and probabilistic neural
a system for the detection of asphyxia cry signals network with the best accuracy of 99%.
based on Orthogonal Least Square feature selection
method and the accuracy of the proposed system D. Others
was 93.16%. They used support vector machine for Besides these features, other acoustic features
classification system. Azlee Zabidi et al. [26] are also proposed by researchers in neonate cry
reported a paper about the classification of classification area [6, 31-34]. In 2009, Mahmoud
asphyxiated infant cries using multi layer Mansouri [6] has proposed a method based on mel-
perceptron neural network with F-Ratio as the frequency spectral entropy cepstrum coefficients
feature selection technique. The system yielded (MFECs) and multi layer perceptron neural
classification accuracy of 93.38%. Moreover, network for the detection of normal and deaf cry
Azlee Zabidi et al. [27] have been proposed a signals. The accuracy obtained was 88.3%. In other
system for identifying asphyxia cry signals based study [31], Yoshifumi Okada has been successfully
on Orthogonal Least Square (OLS) feature classified normal and Ankyloglossis with deviation
selection method. The multi layer perceptron used of the epiglottis and larynx (ADEL) infant cry
as classifier and the accuracy obtained was 94.00%. signals using iterative forward selection method
Ali Messaoud [28] has been classified four and Linear discriminant analysis (LDA). Results
different types of cries using MFCCs and yielded an accuracy of nearly 100%. In 2010, M.
probabilistic neural network. The maximum Hariharan and his co-researchers [32] have applied
accuracy was 71.4%. In 2011, Azlee Zabidi et al. a method based on time frequency analysis, short
[29] presented an analysis on binary particle swarm time fourier transform (STFT) and general
optimization (BPSO) for selection of features in the regression neural network. The proposed method
recognition of infant cries with asphyxia. They carried out using normal and deaf cry signals and
used multi layer perceptron neural network and the the obtained accuracy was 99%. Furthermore, the
best accuracy of their system was 95.07%. work in [32] has been presented for normal versus
asphyxia infant cry signals with addition of a radial
C. Linear Prediction Coefficients (LPC)
basis neural network classifier, probabilistic neural
Linear prediction analysis is a time domain network (PNN) [33]. The accuracy reported as 99%.
analysis which has been used frequently in infant M. Hariharan and et al. [34] have also explored and
cry analysis. Linear prediction analysis attempts to proposed the wavelet packet transform algorithm
predict a speech sample through a linear for the pathological infant cry analysis. They
combination of several previous signal samples [5]. utilized the probabilistic neural network for
This method of linear prediction is known as an classification and reported 99% as the best
appropriate technique to process speech and some accuracy.
works also have been performed based on this IV. CONCLUSION
technique [5, 8, 11, 14, 17, 21, 23, 30].
This paper briefly describes the significant
In 2003, J.O. Garcia and Carlos [5] have works on the infant cry classification analysis.
developed a method based on linear prediction From this, we observed that most of the studies are
technique and scaled conjugate gradient neural investigated using time domain and frequency
networks for the detection of normal and deaf cry domain analysis. Besides that, the time-frequency
signals. The classification accuracies for 314 approaches also have been applied by some
samples and 1036 samples were 91.08% and pioneers to investigate the newborn infant cries by
86.20% respectively. J.O. Garcia and Carlos have using pitch, harmonic analysis and noise analysis
proposed two studies [11, 14] with 314 samples [36-40]. In future we are interested to carry out this
and 506 samples and they obtained 94.3% and acoustic analysis using time-frequency methods but
91.30% of accuracies respectively which are lesser in different manner by utilizing: wavelet transform,
when compared with the accuracies obtained from wavelet packet transform and short time fourier
MFCCs. In [21] linear prediction coefficients also transform based on previous works [32-34] which
extracted in analysis of detection of Cleft Palate are yielded maximum accuracies of above 99%.
with Palate Plate versus Cleft Palate without Palate Moreover, we will attempt to perform this acoustic
Plate. Furthermore, in [23] the linear prediction analysis using different types of pathological infant
features extracted and analyzed together with other cries from different baselines.
acoustic features. The type2 fuzzy pattern matching
(T2-FPM) applied in classification phase. In [8, 17] REFERENCES
the linear prediction coefficients have been utilized
for analysis and the accuracies obtained from the 1. Gyorgy Varallyay, “Future prospects of the infant cry in the
respective works were 86.75% and 93.33%. In medicine”, Periodica Polytechnica Ser.El.Eng.Vol.50,
2010, M. Hariharan and et al. [30] have classified No.1-2, pp.47-62, 2006.
2. Hemant A. Patil “Cry Baby: Using Spectrographic Analysis
three different types of cries using weighted linear to Assess Neonatal Health Status”, Chapter 14, Advances in

545
Table 1: Some of the significant works on the classification of normal and pathological cry signals
First Author Database / Feature Extraction Classifiers Best Accuracy
(Number of babies)
2- class problem
Normal Versus Hypoacoustic
J.O. Garcia 31 Linear Prediction Scaled Conjugate Gradient 91.08 % (314 samples) 86.20
(2003) (Real database) Coefficients % (1036 samples)
[5]
J.O. Garcia 53 Mel-frequency cepstrum Scaled conjugate gradient 96.80 %
(2003) (Real database) coefficients and linear neural network (314 samples)
[11] prediction coefficients
J.O. Garcia 53 Mel-frequency cepstrum Scaled conjugate gradient 97.43 %
(2003) (Real database) coefficients and linear neural networks (506 samples)
[14] prediction coefficients
J.O. Garcia 53 Mel-frequency cepstrum Scaled conjugate gradient 97.43 %
(2003) (Real database) coefficients with PCA neural network
[15]
Gyorgy Varallyay JR. 37 Fundamental frequency - -
(2004) (Real database) using smoothed spectrum
[9] method (SSM)
Gyorgy Varallyay JR. 70 Time domain features - -
(2006) (Real database)
[1]
Mahmoud Mansouri Jam Baby Chillanto database [24] Mel-frequency spectral Multi layer perceptron 88.3 %
(2009) entropy cepstrum artificial neural network
[6] coefficients (MFECs) (MLP)
M. Hariharan Baby Chillanto database Short time fourier General regression neural 99 %
(2011) transform (STFT) network (GRNN)
[32]
O.F. Reyes-Galaviz, et. al Mexican-Cuba database Mel-frequency cepstrum Evolutionary- neural 100 % (from some
(2009) coefficients network system experiments)
[24]
Normal versus Asphyxia
R. Sahak Database from University of Mel-frequency cepstrum Support Vector Machine 93.16%
(2010) Milano-Bicocca coefficients (OLS)
[25]
Azlee Zabidi Baby Chillanto database Mel- frequency cepstrum Multi layer Perceptron 93.38%
(2010) coefficients (F-Ratio)
[26]
Azlee Zabidi Baby Chillanto database with Mel-frequency cepstrum Multi layer perceptron 94%
(2010) database from University of coefficients (OLS) artificial neural network
[27] Milano-Bicocca

Azlee Zabidi Baby Chillanto database Mel- frequency cepstrum Multi layer Perceptron 95.07 %
(2011) coefficients (BPSO)
[29]
M. Hariharan Baby Chillanto database Wavelet Packet Transform Probabilistic Neural 99 %
(2011) Network
[34]
M. Hariharan (in press) Baby Chillanto database Short time fourier Probabilistic Neural 99 %
(2011) transform (STFT) Network and General
[33] regression neural network
O.F. Reyes-Galaviz, et. al Mexican-Cuba database Mel-frequency cepstrum Evolutionary- neural 100 % (from some
(2009) coefficients network system experiments)
[24]
Normal versus Hypothyroidism
Azlee Zabidi Instituto Technologica superior Mel- frequency cepstrum - -
(2009) de Atlixco database coefficients (F-ratio)
[22]
Normal versus Hyperbilirubinemia
Karen Santiago-Sanchez Real database Mel-frequency cepstrum, Type 2 fuzzy pattern 95.56 %
(2009) linear prediction, Intensity matching (LPC-MFCC)
[23] and Cochleograms (T2-FPM)
Normal versus Cleft Palate
Dror Lederman 47 Mel cepstrum and other Continuous Density Hidden 91 %
2002 Real Germany-Israel database derivative of Mel Markov Model (CD-HMN)
[13] cepstrum coefficients
Normal versus Respiratory Distress Syndrome (RDS)
Dror Lederman 40 Mel cepstrum and other Continuous Density Hidden 63%
2002 Real Israel database derivative of Mel Markov Model (CD-HMN)
[13] cepstrum coefficients
Cleft Palate with Palate Plate versus Cleft Palate without Palate Plate
Dror Lederman 7 Linear prediction Parallel Hidden Markov 90 %
(2008) Real Germany database coefficients, Mel cepstrum Model (PHMN)
[21] coefficients and their
derivates
Normal versus Ankyloglossia with deviation of the epiglottis and larynx (ADEL)
Yoshifumi Okada 49 Iterative forward selection Linear discriminant ≈100%

546
(2011) (Database provided by Wang method (IFSM) analysis
[31] [35])

3-class problem
M. Petroni 16 Mel- cepstrum coefficients ANNs (feed-forward neural 90.4 %
(1995) (Anger, fear and pain) and mel- filter band network, recurrent neural
[12] Real database energy coefficients network, time delay neural
network & cascade
correlation neural network)
O. F. Reyes Galaviz, et. al (Normal Mel- cepstrum coefficients Adaptive neuro-fuzzy 96 %
(2004) Hypo acoustic inference system (ANFIS)
[16] Asphyxia)
Baby Chillanto database
O. F. Reyes Galaviz, et. al (Normal Mel-frequency cepstrum Time Delay Neural 98.67%
(2004) Hypo acoustic coefficients and linear network
[17] Asphyxia) prediction coefficients (TDNN)
Baby Chillanto database
Israel Suaste-Rivas, et. al (Normal Mel-frequency cepstrum Fuzzy-relational neural 88 %
(2004) Hypo acoustic network
[18] Asphyxia) (using triangular
Baby Chillanto database membership function)
Israel Suaste-Rivas, et. al (Normal Mel-frequency cepstrum Fuzzy-relational neural 98 %
(2004) Hypo acoustic network
[7] Asphyxia) (using trapezoidal
Baby Chillanto database membership function)
O. F. Reyes Galaviz, et. al (Normal Mel-frequency cepstrum Evolutionary-time delay 96.79 %
(2005) Hypo acoustic coefficients and linear neural network
[8] Asphyxia) prediction coefficients
Baby Chillanto database
O. F. Reyes Galaviz, et. al (Normal Mel-frequency cepstrum Feed forward input delay 96.08 % - 97.39 %
(2005) Hypo acoustic coefficients neural network
[19] Asphyxia)
Baby Chillanto database
S. E. Barajas-Montiel Exp1: normal, deaf and Mel-frequency cepstrum Fuzzy support vector 94.98 %
(2006) Asphyxia coefficients machine (FSVM) (Exp 1)
[20] Exp2: pain, hunger and no-pain- 97.83 %
no hunger (Exp 2)
Baby Chillanto database
Karen Santiago-Sanchez (Normal Mel-frequency cepstrum, Type 2 fuzzy pattern 91.74 %
(2009) Asphyxia linear prediction, Intensity matching (LPC-Cochleogram)
[23] Hyperbilirrubinemia) coefficients and (T2-FPM)
Real database Cochleograms
M.Hariharan (Normal weighted linear prediction Probabilistic neural Above 98%
(2010) Hypo acoustic cepstral coefficients network
[30] Asphyxia)
Baby Chillanto database
4-class problem
Dror Lederman (Normal Fundamental frequency - -
(2010) Cri-du-chat using simple inverse
[10] Hydrocephalus filtering tracking method
Cleft palate) (SIFT)
Real database
Ali Messaoud (pain, Mel frequency cepstrum Probabilistic neural 71.4 %
(2010) discomfort, coefficients network
[28] hunger and
illness)
Real database

Speech Recognition: Mobile Environments, 323 Call Centers pathologies by infant cry recognition” IBERAMIA 2004, LNAI
and Clinics, LLC 2010. 3315, pp. 953–962, 2004.
3. Dror Lederman, Arnon Cohen, Ehud Zmora, Kathleen Wermke, 8. Orion Fausto Reyes Galaviz and Carlos Alberto Reyes Garcia,
Stephanie Hauschildt and Angelika Stellzig-Eisenhauer “Infant cry classification to identify hypo acoustics and
“Automatic classification of the cry of infants with cleft palate”, asphyxia comparing an evolutionary-neural system with a
2002. neural network system”, MICAI 2005, LNAI 3789, pp. 949-958,
4. A. Cohen and E. Zmora, “Automatic classification of infants’ 2005. Springer-Verlag Berlin Heidelberg 2005.
hunger and pain cry”, in Proc. Int. Conf. Digital Signal 9. G. Varallyay Jr, Z. Benyo, A. Illenyi, Z. Farkas, and L. Kovacs,
Processing, Cappellini, V. and Constantinides, A.G., Eds., “Acoustic analysis of the infant cry: classical and new methods”,
Elsevier, Amsterdam, 1984. in 26th Annual International Conference of the IEEE EMBS,
5. J. Orozco and C. Garcia, “Detecting pathologies from infant cry San Francisco, CA, USA, pp. 313–316, 2004.
applying scaled conjugate gradient neural networks”, in 10. Dror Lederman, “Estimation of infant’s cry fundamental
European Symposium on Artificial Neural Networks, Bruges frequency using a modified SIFT algorithm”, arXiv:
(Belgium), pp. 349–354, 2003. 1009.2796v1 [cs. SD] 14 Sep, pp.1-5, 2010.
6. Mahmoud Mansouri Jam and Hamed Sadjedi, “Identification of 11. J.O Garcia and C.A. Reyes García, “A study on the recognition
hearing disorder by multi-band entropy cepstrum extraction of patterns of infant cry for the identification of deafness in just
from infant’s cry”, IEEE, 2009. born babies with neural networks”, LNCS 2905, pp.342-349,
7. Israel Suaste-Rivas, Orion F. Reyes-Galviz, Alejandro Diaz- 2003.
Mendez, and Carlos A. Reyes Garcia, “Implementation of a 12. M. Petroni, A.S. Malowany, C.C. Johnston and B.J. Stevens,
linguistic fuzzy relational neural network for detecting “Classification of infant cry vocalization using artificial neural

547
networks (ANNs)”, IEEE, pp.3475-3478, 1995. 2010), Kuala Lumpur, Malaysia, 30th Nov-2nd Dec 2010,
13. Dror Lederman, Arnon Cohen, Ehud Zmora, Kathleen Wermke, pp.126-129, 2010.
Stephanie Hauschildt and Angelika Stellzig-Eisenhauer, “On the 27. A. Zabidi, Wahidah Mansor, Lee Yoot Khuan, Ihsan Mohd
use of hidden markov models in infants’ cry classification”, Yassin and Rahilah Sahak, “ Classification of infant cries with
2002. asphyxia using multilayer perceptron neural network”, 2nd
14. J.O Garcia and C.A. Reyes García, “Acoustic features analysis International Conference on Computer Engineering and
for recognition of normal and hypoacoustic infant cry based on Applications, IEEE, pp.204-208, 2010.
neural networks”, Lecture Notes in Computer Science, Artificial 28. Ali Messaound and Chakib Tadj, “A cry-based babies
Neural Nets Problem Solving Methods, No.2687, pp.615-622, identification system”, ICISP 2010, LNCS 6134, pp. 192–199,
2003. 2010.
15. J.O Garcia and C.A. Reyes García, “Mel-frequency cepstrum 29. A. Zabidi, Wahidah Mansor, Lee Yoot Khuan, Ihsan Mohd
coefficients extraction extraction from infant cry for Yassin and Rahilah Sahak, “Binary particle swarm optimization
classification of normal and pathological cry with feed-forward for selection of features in the recognition of infants cries with
neural networks”, INAOE, IEEE, 2003. asphyxia”, 7th International Colloquium on signal processing &
16. O.F. Reyes Galaviz, E. Arch Tirado and C.A. Reyes Garcia, its applications (CSPA), IEEE, pp.272-276, 2011.
“Classification of infant crying to identify pathologies in 30. M. Hariharan, Lim Sin Chee and Sazali Yaacob, “Analysis of
recently born babies with ANFIS”, LNCS 3118, pp.408-415, infant cry through weighted linear predictioncepstral
2004. coefficients and probabilistic neural network” J Med Syst, LLC
17. O.F. Reyes-Galaviz and C.A. Reyes-Galaviz, “Infant cry 2010.
classification to identify hypoacoustics and asphyxia with neural 31. Yoshifumi Okada, Kentarou Fukuta and Tomomasa Nagashima,
networks”, MICAI 2004, LNAI 2972, pp.69-78, 2004. “Iterative forward on cross-validation approach and its
18. Israel Suaste-Rivas, Orion F. Reyes-Galviz, Alejandro Diaz- application to infant cry classification” in Proceedings of the
Mendez, and Carlos A. Reyes Garcia, “A fuzzy relational International Multi Conference of Engineers and Computer
neural network for pattern classification”, CIARP 2004, LNCS Scientists 2011 Volume I, IMECS 2011, March 16-18, 2011,
3287, pp.358-365, 2004. Hong Kong.
19. O.F. Reyes-Galaviz, A.Verduzco, E.Arch-Tirado and C.A 32. M. Hariharan, R. Sindhu and Sazali Yaacob, “Normal and
Reyes-Garcia, “Analysis of an infant cry recognizer for the hypoacoustic infant cry signal classification using time-
early identification of pathologies”, Nonlinear Speech Modeling frequency analysis and general regression neural network”,
and Applications, No.3445, pp.404-409, 2005. Computer Methods and Programs in Biomedicine, pp.1-11,
20. Sandra E. Barajas-Montiel and Carlos A. Reyes-Garcia, “Fuzzy 2011
support vector machines for automatic infant cry recognition”, 33. M. Hariharan, J. Saraswathy, R. Sindhu, Wan Khairunizam and
LNCIS 345, pp. 876 – 881, 2006. Springer-Verlag Berlin Sazali Yaacob, “Infant cry classification to identify asphyxia
Heidelberg 2006. using time-frequency analysis and radial basis neural networks”,
21. Dror Lederman, Arnon Cohen, Ehud Zmora, Kathleen Wermke, unpublished.
Stephanie Hauschildt and Angelika Stellzig-Eisenhauer 34. M. Hariharan, Sazali Yaacob & Saidatul Ardeenaawatie Awang,
“Classification of cries of infants with cleft-palate using parallel “Pathological infant cry analysis using wavelet packet transform
hidden markov models”, Med. Biol. Eng Comput, Vol. and probabilistic neural network”, Expert Systems with
46,pp.965-975, 2008. Applications, No.38, Vol.12 pp.15377-15382, 2011.
22. A. Zabidi, Wahidah Mansor, Lee Yoot Khuan, Rahilah Sahak 35. X. Wang, T. Nagashima, K. Fukuta, Y. Okada, M. Sawai, H.
and Farah Yasmin Abd Rahman, “Mel –frequency cepstrum Tanaka and T. Uozumi, “Statistical method for classifying cries
coefficient analysis of infant cry with hypothyroidism,” in 5th of baby based on pattern recognition of power spectrum”,
International Colloquium on signal processing & its applications International Journal of Bioimetrics, No. 2, Vol.2, pp.113-123,
(CSPA), IEEE, pp. 204-208, 2009. 2010.
23. Karen Santiago-Sanchez, Carlos A. Reyes-Garcia, and Pilar 36. D. Escobedo, S. Cano, E. Coello, L. Regueiferos and L.
Gomez-Gil, “Type-2 fuzzy sets applied to pattern matching for Capdevila, “Rising shift of pitch frequency in the infant cry pf
the classification of cries of infants under neurological risk”, some pathologic cases”, 2nd International Conference,
LNCS 5754, pp. 201-210, 2009. Springer-Verlag Berlin MAVEBA 2001, Firenze, Italy, 2001.
Heidelberg 2009. 37. S. Cano, et al. “The Spectral Analysis of Infant Cry: An Initial
24. O.F. Reyes-Galaviz, S. Cano-Ortiz and C. Reyes-Garca, Approximation”, presented at conference of EUROSPEECH' 95
“Evolutionary-neural system to classify infant cry units for (sponsored by ESCA & IEEE), Madrid, 1995.
pathologies identification in recently born babies”, in 8th 38. C. Manfredi, V. Tocchioni, and L. Bocchi, “A robust tool for
Mexican International Conference on Artificial Intelligence, newborn infant cry analysis”, presented at the international
MICAI 2009, Guanajuato, Mexico, pp. 330–335, 2009. Conference of the IEEE Engineering in Medicine and Biology
25. R. Sahak, W. Mansor, Y.K. Lee, A.I Mohd Yassin and A.Zabidi, Society, 2006.
“Orthogonal least square based support vector machine for the 39. M. Petroni, A. Malowany, C. Johnston and B. Stevens,
classification of infant cry with asphyxia”, 3rd international “Identification of Pain from Infant Cry Vocalizations Using
conference on biomedical engineering and informatics (BMEI Artificial Neural Networks (ANNs)”, Applications and Science
2010), IEEE, pp.986-990, 2010. of Artificial Neural Networks. The International Society for
26. A. Zahidi, W. Mansor, L.Y. Khuan, I.M. Yassin and R. Sahak, Optical Engineering, No. 2492, pp.729-738, 1995.
“The effect of f-ratio in the classification of asphyxiated infant 40. G. Várallyay, “The melody of crying”, presented at the
cries using multilayer perceptron neural network”, IEEE EMBS international journal of pediatric otorhinolaryngology, 2007.
conference on biomedical engineering & sciences (IECBES

548

You might also like