Professional Documents
Culture Documents
Comparative Study of Optimization Algorithm in Deep CNN-Based Model For Sign Language Recognition - SpringerLink
Comparative Study of Optimization Algorithm in Deep CNN-Based Model For Sign Language Recognition - SpringerLink
Comparative Study of Optimization Algorithm in Deep CNN-Based Model For Sign Language Recognition - SpringerLink
Conference paper
First Online: 14 September 2021
Part of the
Lecture Notes on Data Engineering and Communications Technologies
book series (LNDECT, volume 75)
Abstract
The fundamental part of the neural network is the learning rate, and the strategy of adopting the learning process in a neural
network is carried out using optimization algorithms or optimizers. This optimization algorithm helps us produce better
results to the model by changing the parameters like bias and weights, i.e., it helps us maximize or minimize the error
function and depends on the learnable parameters. In this paper, we examine how an End-to-End CNN model named
ASLNET recognizes the alphabets of the American sign language using various optimizers such as Stochastic Gradient
Descent (SGD), Root-Mean-Square propagation (RM-Sprop), Adaptive Gradient Algorithm (Adagrad), Adaptive Delta
(Adadelta),Adaptive Moment Estimation (Adam), Adam with Nesterov Momentum (Nadam), LookAhead and Rectified
Adam (RAdam). To avoid the overfitting issues, traditional data augmentation techniques are used to compare our model
with data augmentation and without augmentation with these optimizers. Among these, LookAhead and RAdam are the
most recently developed. The experiment is conducted on 2 NVIDIA TESLA P100 GPUs of batch size 64, and the
investigation was based on benchmark ASL Finger Spelling dataset.
Keywords
Optimization algorithms Deep CNN Finger Spelling dataset Sign language recognition
https://link.springer.com/chapter/10.1007/978-981-16-3728-5_35 1/5
9/15/21, 12:09 PM Comparative Study of Optimization Algorithm in Deep CNN-Based Model for Sign Language Recognition | SpringerLink
References
1. Deng, L., Li, J., Huang, J.-T., Yao, K., Yu, D., Seide, F., Seltzer, M., Zweig, G., He, X., Williams, J., et al.: Recent
advances in deep learning for speech research at microsoft. In: ICASSP 2013 (2013)
Google Scholar (https://scholar.google.com/scholar?
q=Deng%2C%20L.%2C%20Li%2C%20J.%2C%20Huang%2C%20J.-
T.%2C%20Yao%2C%20K.%2C%20Yu%2C%20D.%2C%20Seide%2C%20F.%2C%20Seltzer%2C%20M.%2C%20Zweig
%2C%20G.%2C%20He%2C%20X.%2C%20Williams%2C%20J.%2C%20et%20al.%3A%20Recent%20advances%20in
%20deep%20learning%20for%20speech%20research%20at%20microsoft.%20In%3A%20ICASSP%202013%20%28
2013%29)
2. Hinton, G.E., Salakhutdinov, R.R.: Reducing the dimensionality of data with neural networks. Science 313(5786),
504–507 (2006)
MathSciNet (http://www.ams.org/mathscinet-getitem?mr=2242509)
CrossRef (https://doi.org/10.1126/science.1127647)
Google Scholar (http://scholar.google.com/scholar_lookup?
title=Reducing%20the%20dimensionality%20of%20data%20with%20neural%20networks&author=GE.%20Hinton
&author=RR.%20Salakhutdinov&journal=Science&volume=313&issue=5786&pages=504-
507&publication_year=2006)
3. Hinton, G., Deng, L., Yu, D., Dahl, G.E., Mohamed, A.-R., Jaitly, N., Senior, A., Vanhoucke, V., Nguyen, P., Sainath,
T.N., et al.: Deep neural networks for acoustic modeling in speech recognition: The shared views of four research
groups. Signal Process. Mag. IEEE 29(6), 82–97 (2012)
Google Scholar (https://scholar.google.com/scholar?
q=Hinton%2C%20G.%2C%20Deng%2C%20L.%2C%20Yu%2C%20D.%2C%20Dahl%2C%20G.E.%2C%20Mohamed
%2C%20A.-
R.%2C%20Jaitly%2C%20N.%2C%20Senior%2C%20A.%2C%20Vanhoucke%2C%20V.%2C%20Nguyen%2C%20P.%2
C%20Sainath%2C%20T.N.%2C%20et%20al.%3A%20Deep%20neural%20networks%20for%20acoustic%20modelin
g%20in%20speech%20recognition%3A%20The%20shared%20views%20of%20four%20research%20groups.%20Sig
nal%20Process.%20Mag.%20IEEE%2029%286%29%2C%2082%E2%80%9397%20%282012%29)
4. Graves, A., Mohamed, A.-R., Hinton, G.: Speech recognition with deep recurrent neural networks. In: 2013 IEEE
International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 6645–6649. IEEE (2013)
https://link.springer.com/chapter/10.1007/978-981-16-3728-5_35 2/5
9/15/21, 12:09 PM Comparative Study of Optimization Algorithm in Deep CNN-Based Model for Sign Language Recognition | SpringerLink
https://link.springer.com/chapter/10.1007/978-981-16-3728-5_35 3/5
9/15/21, 12:09 PM Comparative Study of Optimization Algorithm in Deep CNN-Based Model for Sign Language Recognition | SpringerLink
11. Zhang, M.R., Lucas, J., Hinton, G., Ba, J.: Lookahead optimizer: k steps forward, 1 step back (2019). arXiv preprint
arXiv:1907.08610 (http://arxiv.org/abs/1907.08610)
12. Liu, L., Jiang, H., He, P., Chen, W., Liu, X., Gao, J., Han, J.: On the variance of the adaptive learning rate and beyond
(2019). arXiv preprint arXiv:1908.03265 (http://arxiv.org/abs/1908.03265)
13. Pugeault, N., Bowden, R.: Spelling it out: real-time ASL fingerspelling recognition. In: Proceedings of the 1st IEEE
Workshop on Consumer Depth Cameras for Computer Vision, jointly with ICCV'2011 (2011)
Google Scholar (https://scholar.google.com/scholar?
q=Pugeault%2C%20N.%2C%20Bowden%2C%20R.%3A%20Spelling%20it%20out%3A%20real-
time%20ASL%20fingerspelling%20recognition.%20In%3A%20Proceedings%20of%20the%201st%20IEEE%20Work
shop%20on%20Consumer%20Depth%20Cameras%20for%20Computer%20Vision%2C%20jointly%20with%20ICC
V%272011%20%282011%29)
14. Chollet, F.: Keras, 2015. Available: https://keras.io/ (https://keras.io/)
15. Abadi, M., et al.: TensorFlow: large-scale machine learning on heterogeneous distributed systems (2016)
Google Scholar (https://scholar.google.com/scholar?
q=Abadi%2C%20M.%2C%20et%20al.%3A%20TensorFlow%3A%20large-
scale%20machine%20learning%20on%20heterogeneous%20distributed%20systems%20%282016%29)
Copyright information
© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2022
First Online
14 September 2021
DOI
https://doi.org/10.1007/978-981-16-3728-5_35
Publisher Name
Springer, Singapore
Print ISBN
978-981-16-3727-8
https://link.springer.com/chapter/10.1007/978-981-16-3728-5_35 4/5
9/15/21, 12:09 PM Comparative Study of Optimization Algorithm in Deep CNN-Based Model for Sign Language Recognition | SpringerLink
Online ISBN
978-981-16-3728-5
eBook Packages
Engineering
Engineering (R0)
Personalised recommendations
Not logged in
KCG College of Technology KCG Nagar (2000596414) - INDEST-AICTE-Level III (3000168247) - AICTE Electrical & Electronics & Computer Science
Engineering (3000684219)
103.249.82.131
https://link.springer.com/chapter/10.1007/978-981-16-3728-5_35 5/5