Professional Documents
Culture Documents
Survey On Machine Learning Based Scheduling in Cloud Computing
Survey On Machine Learning Based Scheduling in Cloud Computing
ABSTRACT their requests as per SLA (Service Level Agreement) with the
In the modern era, cloud computing gains a lot of attention due to customers.
its various features such as it is simple to use, minimum cost, and Cloud computing allows delivering computing resources by the
mostly low power consumption. Many algorithms and techniques means of internet. Cloud has emerged as a favorable technology
have been proposed for scheduling of virtual machines to provide to satisfy the computational needs of users. Various cloud services
dynamic load balancing, dynamic scalability and reallocation of provide services [26], the cost of which is computed in terms of
resources. Intelligent algorithms are used for the optimization of resource usage in Cloud. The three basic characteristics of Cloud
results and minimizing the makespan scheduling while utilizing are resource scalability, pay per usage and self service.
the resources efficiently based on dynamic environment. This Traditionally, the data of an organization is stored within the
paper reviews various intelligent scheduling algorithms such as organization. When the data of more than one organization is
Genetic Algorithm (GA), Simulated annealing (SA), Tabu Search stored outside with the help of third party outside organization
(TS), Ant Colony Optimization (ACO), Particle Swarm and is, accessed over web. Then, such service is termed as web
Optimization (PSO), Artificial Immune System (AIS), Bacterial service over cloud. When the client requests the data, it can only
Foraging Algorithm (BF), Fish Swarm Optimization Algorithm access the data if he is authorized.
(FS), Cat Swarm Optimization Algorithm (CS), Firefly Algorithm
(FF), Cuckoo Search Algorithm (CS), Artificial Bee Colony When resource requirements need to be met with fewer resources,
(ABC), Bat Algorithm (BA). the virtualization technique comes into play which is highly
energy efficient technology. It allows cloud to be used by multiple
CCS Concepts users at a time and thus, supports on demand delivery.
Computing methodologies~Machine learning algorithms
Cloud provides infrastructure as a service, platform as a service
Keywords and software as a service which can be accessed by the user on
pay per use model. In traditional system, whenever user needs
Machine learning, Scheduling Algorithm, Cloud
some software or platform, he needs to buy it. The temporary
Computing. nature of the user needs leads to wastage of resources and money
but cloud provides a solution to this problem by providing the
1. INTRODUCTION resources on demand. This dynamic nature of requests by users
CLOUD computing is a large scale distributed computing
leads to the problem of dynamic scheduling of requests in Cloud
paradigm, driven by an increasing demand for various levels of
Infrastructure which is to handled by the cloud service providers
pay-per-use computing resources. The basic building blocks of
with well defined strategy. Various scheduling algorithms have
cloud computing model include providing infrastructure, platform
been used in scheduling jobs to attain the overall optimization of
and software as a service. These services are few likely our
resources in Cloud Environment while providing high
telephone service that is we have to pay only that much amount
performance application interface to the user. to the In this paper,
we use (pay per use). E.g. we only need a telephone in our home
analysis of various intelligent techniques.
and needs not to set up whole exchange office personally. It’s the
job of telecom service providers to provide services from their 2. SCHEDULING ALGORITHM
exchange and one has to pay according to the services being used A major concern of process scheduling in cloud is minimum
by him. Similarly, users submit their computing tasks from their outlay of time and energy. The average turnaround time and
devices to their cloud service providers which in turn, service average cost of overall task scheduling should be minimized and
to optimize task scheduling, optimization techniques have been
Permission to make digital or hard copies of all or part of this work for
personal or classroom use is granted without fee provided that copies used [1]. Scheduling algorithms have been categorized into two
are not made or distributed for profit or commercial advantage and that type’s viz. traditional or classical and intelligent optimization
copies bear this notice and the full citation on the first page. Copyrights scheduling algorithm. For load balancing, less execution time,
for components of this work owned by others than ACM must be response time and in many more parameters traditional scheduling
honored. Abstracting with credit is permitted. To copy otherwise, or algorithms like FCFS, shortest job first, round-robin and priority
republish, to post on servers or to redistribute to lists, requires prior etc, are not recommendable over various scenarios. Due to
specific permission and/or a fee. Request permissions from inefficiency of classical optimization algorithms for solving larger
Permissions@acm.org.
problem spaces, a growing trend to meta-heuristic and intelligent
ISMSI '17, March 25-27, 2017, Hong Kong, Hong Kong
© 2017 ACM. ISBN 978-1-4503-4798-3/17/03…$15.00 optimization algorithms has been emerged which deals with the
global optimization of a function. This paper presents a review of
DOI: http://dx.doi.org/10.1145/3059336.3059352
57
various intelligent scheduling algorithms such as Genetic temperature. This algorithm starts by generating an initial solution
Algorithm, Simulated annealing, Tabu Search, Ant Colony and by initializing the temperature parameter which is purely an
Optimization, Particle Swarm Optimization, Artificial Immune assumption [1] which is the major drawback. This algorithm
System, Bacterial Foraging Algorithm, Fish Swarm Optimization usually stuck with local maxima, unwanted allocations are
Algorithm, Cat Swarm Optimization Algorithm, Firefly entrained and the algorithm also depends on the request
Algorithm, Cuckoo Search Algorithm, Artificial Bee Colony and availability and bin capacity. This approach works well with high
Bat Algorithm temperatures.
58
finding by a single particle, namely the individual extremum; the It minimizes flowtime, makespan which are the important
other is the optimal solution finding by whole particle swarm, that scheduling criteria in parallel and distributed computing. [28]
is, namely the global extremum. Each particle in the population
represents a possible solution of the problem to be optimized. Its 3.8 Fish swarm optimization algorithm
objective is to solve the problem by modeling and predicting The artificial fish swarm algorithm (AFSA)[11] is a population-
social behavior of insects. Instead of using random approach, it based meta-heuristic intelligent optimization algorithm inspired
serves greater number of virtual machines. It provides the best from fish swarm behaviors to solve combinatorial problems.
load balance and reduces the throughput and response time. PSO AFSA is a random parallel search algorithm belongs to the family
algorithm is easy to trap into a local optimum. The computational of swarm intelligence. This algorithm adapts the behavior of a
time of the PSO algorithm is shorter than that of other existing group of fish swarm intelligence where the group globally
metaheuristics, the precision of its final solution to the large-sized searches for the food to reach the areas with a higher
complex optimization problems is relatively poor [28]. The PSO concentration. It then combines this intelligence with one of the
algorithm is not robust to solve problems with different methods to get the optimal solution of combinatorial optimization
constraints The merits of parallel distribution, scalability, easy to problems. [13]In an AFSA system, each artificial fish (AF)
realize, strong Robustness, with high flexibility in dynamic adjusts its behavior according to its current state and its
environments, PSO solves many combinational optimization environmental state, making use of the best position encountered
problems successfully. by itself and its neighbors. It is highly flexible and more fault
tolerant as it can be used for scheduling in cloud. It is expected to
3.6 Artificial Immune System give the best results. It is insensitive to initial values, and
The AIS is also a general meta-heuristic algorithm based on possesses good performance such as fast convergence and
principles of the immune system, a [7] computational intelligence robustness. It has gained an increasing study and wide
approach, called artificial immune system (AIS), has been applications such as multi-objective optimization and clustering
developed. Artificial immune systems (AIS) constitute a family of problem.
bio inspired algorithms based on the models known from the
studies of biological immune systems and immunology . 3.9 Cat swarm optimization
Informally, biological immune systems protect the body from An intelligent heuristic scheduling algorithm based on social
dangerous substances presented in the form of pathogens. They behavior of cats, belongs to the family of swarm intelligence is cat
combine the ability to protect from both, general pathogens and swarm optimization.[15] It is based on the seeking and tracking
specific attackers (e.g. different types of viruses, bacteria and so behavior of cats. In seek mode, cats sense for the next best move
on) that cannot be be eliminated by the static (innate) part of the while sitting in a place (without movement) while in tracking
immune system . Artificial immune system algorithm includes the mode the cats chase the target by moving to the next best possible
negative selection, clonal selection, and immune networks. The position with a velocity. The cat swarm optimization adapts this
clonal selection is based on the selection theory describing the behavior of cats with the aim to solve multi objective workflow
basic response of the immune system to an antigen. Negative scheduling. The CSO maps the tasks and virtual machines by
selection is in the biological immune systems used to ensure that taking execution time, data amount and energy consumption as
newly created lymphocytes will be able to adapt to new types of input and then searches for the best task resource mapping with
threats and remain tolerant to body cells at the same time. Immune fairness and offers the best fitness value. The solution obtained
network contributes to the stable memory structure of the immune optimizes the overall energy consumption. It also provides an
system that is able to retain the information about the antigens optimal task to resource scheduling that minimize the cost of
even without their presence. It has been successfully applied to scheduling. It is an improvement over PSO by reducing the
many fields such as clustering, classification, pattern recognition, number of iterations.
computer defense, and optimization. AIS has the advantage of
finding optimal makespan values of different size problems. 3.10 Firefly algorithm
Firefly algorithm (FA) is an intelligent meta-heuristic population
3.7 Bacterial foraging algorithm based algorithm, inspired by the flashing behavior of fireflies. The
Bacterial foraging optimization algorithm is a global optimization firefly method is to dynamically create an optimal schedule based
algorithm based on the population [5]search and efficient for on swarm intelligence, investigating the travel behavior of
global search method for the distributed computing. Bacterial fireflies which go looking for the closest possible maximum
foraging algorithm is a nature inspired optimization techniques alternative. The blinking light in the fireflies is their attribute of
that mimicking the foraging behavior of E. coli bacteria satisfies. attractiveness mainly used to attract mates and to defend
It is a multi-criteria optimization problem. It gives optimized themselves from other predators. In FA, fireflies are considered
solutions to the multi-objective problem where simultaneous as simple agents that move and interact through the search space
multi-criteria are needed to address simultaneously. This method and record the best solution that they have visited. Therefore, FA
is used for locating, handling, and ingesting the food. During can be employed to generate alternative design options in order to
foraging, a bacterium can exhibit two different actions: tumbling effectively support intelligent task scheduling on cloud computing
or swimming. The tumble action modifies the orientation of the to dynamically map the received jobs to the available resources in
bacterium[16]. During swimming means the chemo taxis step, the order to finish job's execution within minimum makespan time
bacterium will move in its current direction. It is based on the and evenly distribute the load. It can be used to solve multi-
nature where the selection process tries to preserve those animals objective problems.[17],[18],[19]
that have ability to forage successfully and tries to exclude those
animals with poor forage. Since the former have the more ability 3.11 Cuckoo search algorithm
to succeed in reproduction process. It has the advantage to Cuckoo search algorithm is a meta-heuristic algorithm that models
maximize the resource utilization and min the resource usage cost. natural behavior of cuckoo species. Cuckoos are the beautiful
birds but their aggressive reproduction strategy is more interesting
59
to us. The cuckoos reproduce by the following rules where one # Algorithm Parameters
Major Features &
egg is laid at a time and dumps it in a nest which is chosen Disadvantages
randomly and in next step the nest which has the better quality Genetic Execution time, Self managing scheme
eggs will be carried further for the next generation. It is assumed Algorithm Global Search solution
that there are fixed number of host nests. This strategy can be Resource utilization,
based on Delivery of accurate results
used for scheduling in cloud where the eggs represent the solution 1 No. of VM, Better resource utilization
Darwin
and algorithm works by replacing weaker solutions with the better theory makespan Independent of no. of VM
solutions. This algorithm gives the optimal solution and Min. makespan and flowtime
[2],[4]
effectively balances the local and global search with the help of
switching parameter. The results obtained are better than particle Simulated Execution time Depends on resource
swarm optimization.[17],[20],[21] annealing Temperature availability and bin capacity
based on Less Average execution time
2 Request availability
3.12 Artificial bee colony Annealing in GA
Artificial bee colony algorithm is a population-based optimization solids Bin capacity Stuck with local maxima
meta-heuristic algorithm based on intelligent behavior of bees, Unwanted allocations
[4]
which has two common types; foraging behavior and reproduction
(mating) behavior. The ABC algorithm is based on the skillful
Tabu Search Execution time Less execution time
based on Do not stuck in local optima
foraging behavior of honey bee swarm. A typical hive may 3 notion of Explore new regions
include 5,000 to 20,000 individual bees. Honey bees assume to movement[6] Premature termination
have different functions within their colony through time. Active
foraging bees go to a food source, check neighbor sources, collect Ant Colony Randomization Reducing the number of
food and return back to the hive. Scout bees examine the area Optimization physical machines
No. of VM
surrounding the hive searching for plentiful new food sources. At 4 (ACO)[9] Global optimal solutions
any time some of the foraging bees become inactive. This strategy Robustness
can be applied in the field of task scheduling is the foraging Falling for local optima
behavior. Mapping of bees foraging behavior to the task Particle Randomization Quick Converge local optima
scheduling problem, can commonly be defined in a way that the Swarm
Convergence Fair distribution
employed bees are associated with allocating tasks on a resource 5 Optimization Minimize makespan
[24] Cost, Makespan Lacking of reliability
and share their information about food sources. It minimize the
Distribution
completion time the search through solution space and
diversifying the search. These characteristics make the artificial Artificial Makespan Optimal makespan
bee colony technique efficient to be applied in cloud task 6 Immune
scheduling domain.[21],[12] System [7]
Bacterial Makespan Reduced makespan
4. CONCLUSIONS & FUTURE WORK 7
Foraging flowtime Minimim flowtime
Traditional approaches of task scheduling suffer from Algorithm
disadvantages like overpricing and slow processing for bulk of [5] [16]
tasks. These scheduling algorithms are mostly based on cost Fish Swarm Makespan High efficiency
reduction factor, deadline factor and even on priority based 8
Optimization Less makespan
scheduling and they suffer from long waiting priority queues. The Algorithm
intelligent scheduling algorithm provides a fair solution to all of [11],[13]
these challenges and provides a highly optimized, dynamic and Cat Swarm Cost Reduced no. of iterations
more reliable scheduling scheme than the traditional. This paper 9
Optimization Resource utilization Optimal resource utilization
overviews whe intelligent scheduling algorithm with meta- Algorithm Fair distribution
[15] No. of iterations less cost
heuristic, swarm based approach. GA, SA and TS have less
execution time. ACO reduces the number of physical machines Firefly Randomization Automatic subdivision of
required while PSO, CS give fair distribution. AI, FS, BF, ABC Algorithm entire population.
parameter
works well with less makespan. CS provides optimal resource [17], [23], Efficiently deal with
10 [18], [19] Light absorption multimodality.
utilization. FF, CS, BF increases convergence speed. CS, PSO has
relatively less cost. Coefficient Increased convergence speed.
60
5. REFERENCES [15] Saurabh Bilgaiyan, Santwana Sagnika, Madhabananda
Das 2014 “Workflow Scheduling in Cloud Computing
Environment Using cat Swarm Optimization” IEEE
[1] Zhi-Hui Zhan, Xiao-Fang Liu, Yue-Jiao Gong, Jun Zhang, International Advance Computing Conference, IACC,
Henry Shu-Hung Chung, Yun Li:, 2015 “Cloud [16] Liji Jacob, V.Jeyakrishanan, P.Sengottuvelan “Resource
Computing Resource Scheduling and a Survey of Its Scheduling in Cloud using Bacterial Foraging
Evolutionary Approaches” ACM Computing Surveys, Vol. Optimization Algorithm” International Journal of
47, No. 4, Article 63, Publication date. Computer Applications (0975 – 8887)
[2] Ipsita Kar, R.N. Ramakant Parida, Himansu Das 2016. [17] Navneet Kaur, Amit Chhabra 2016 “Analytical review of
“Energy Aware Scheduling using Genetic Algorithm in three latest nature inspired algorithms for scheduling in
Cloud Data Centers” International Conference on clouds” International Conference on Electrical,
Electrical, Electronics, and Optimization Techniques Electronics, and Optimization Techniques
(ICEEOT)– (ICEEOT)
[3] Tripti Mandal, Sriyankar Acharyya, 2015 “Optimal Task [18] Adil Yousif, Abdul Hanan Abdullah, Sulaiman Mohd
Scheduling in Cloud Computing Environment: Meta Nor, Mohammed Bakri Bashir 2012 “Optimizing
Heuristic Approaches” Proceedings of International Job Scheduling for computational Grid based on
Conference on Electrical Information and Firefly Algorithm” IEEE conference on sustainable
Communication Technology utilization and development in engineering and
[4] Aleena Xavier T, Rejimoan R 2016 “Survey on Various technology (STUDENT),
Resource Allocation Strategies in Cloud” [19] R. SundarRajan, V. Vasudevan, S. Mithya 2016
International Conference on Circuit, Power and “Workflow Scheduling in cloud Computing
Computing Technologies [ICCPCT] Environment using Firefly Algorithm” International
[5] Mandeep Kaur 2016 “Elitist Multi-Objective Bacterial Conference on Electrical, Electronics, and Optimization
Foraging Evolutionary Algorithm for Multi-Criteria based Techniques (ICEEOT)
Grid Scheduling Problem” International Conference on [20] Khadija Bousselmi, Zaki Brahmi, Mohamed Mohsen
Internet of Things and Applications (IOTA) Gammoudi 2016 “Energy efficient partitioning and
[6] Pan Yi, Hui Ding and Byrav Ramamurthy 2015 “Budget- scheduling approach for Scientific Workflows in the
optimized network aware joint resource allocation in Cloud” IEEE International Conference on Services
Grids/Clouds over optical networks” Computing
[7] Pavel Kr, Jan Platoˇs, V´aclav 2012 “Independent Task [21] Saurabh Bilgaiyan, Santwana Sagnika, Madhabananda
Scheduling by Artificial Immune Systems, Differential Das 2014 “Workflow Scheduling in Cloud
Evolution, and Gnetic Algorithms” Fourth International Computing Environment Using Cat Swarm
Conference on Intelligent Networking and Collaborative Optimization” 978-1-4799-2572- 8/14
Systems. [22] Elaheh Hallaj, Seyyed Reza Kamel Tabbakh 2015 “Study
[8] Liyun Zuo, Lei Shu, Shoubin Dong, Chunsheng Zhu, and Analysis of Task Scheduling Algorithms in Clouds
TakahirHara 2015 “A Multi-Objective Optimization Based on Artificial Bee colony” Second International
Scheduling Method Based on the Ant Colony Algorithm Congress on Technology communication and Knowledge
in Cloud Computing” IEEE.Transactions Access (ICTCK 2015) November, 11-12,
2015.2508940 [23] Mariappan Kadarkarainadar Marichelvam, Thirumoorthy
[9] Xiao-Fang Liu, Zhi-Hui ZhanJeremiah D Deng, Yun Prabaharan, and Xin She Yang 2014 “Discrete Firefly
Li,Tianlong Gu,Jun Zhang “An Energy Efficient Ant Algorithm for the Multi-Objective Hybrid Flowshop
Colony System for Virtual Machine Placement in Cloud Scheduling Problems” IEEE Transactions On
Computing” 2016 IEEE. Transactions on Evolutionary Evolutionary Computation, Vol.18, NO. 2
computation [24] Mohammad Reza Bonyadi and Zbigniew Michalewicz
[10] Jasraj Meena, Malay Kumar, And Manu Vardhan 2016I “Impacts of Coefficients on movement patterns in the
“Cost Effective Genetic Algorithm for Workflow particle swarm optimization algorithm” IEEE
Scheduling in Cloud Constraint” IEEE Transactions on Evolutionary Computation.
/ACCESS.2016.2593903 [25] Shane Strasser, John Sheppard Fellow, Nathan Fortier,
[11] Dyah Pythaloka, Agung Toto Wibowo, Mahmud Dwi Rollie Goodman “Factored Evolutionary Algorithms”
Sulistiyo 2015 “Artificial Fish Swarm Algorithm for IEEE Transactions on Evolutionary Computation.
Job Shop Scheduling Problem” 3rd International [26] TARANDEEP KAUR and INDERVEER CHANA 2015
Conference on Information and Communication “Energy Efficiency techniques in Cloud Computing: A
Technology (ICoICT). Survey and Taxonomy” Computing Surveys, Vol.48, No.
[12] Gamal F. Elhady and Medhat A. Tawfeek 2015 “A 2, Article 22
Comparative Study into Swarm Intelligence Algorithms [27] Neenu George, K. Chandrasekaran, A.Binu 2015 “ An
for Dynamic Tasks Scheduling in Cloud Computing” EEE objective study on improvement of task scheduling
Seventh International Conference on intelligent mechanism using computational intelligence in
Computing and Information Systems (ICICIS'15) cloud computing” IEEE Conference on Computational
[13] GeHongwei, Sun Liang, 2016 “Intelligent Scheduling in Intelligence and Computing Research
Flexible job Shop Environments Based on Artificial Fish [28] Haitao Yuan, Jing Bi, Wei Tan, Bo Hu Li “Impacts of
Swarm Algorithm with Estimation of Distribution” IEEE coefficients on movement patterns in the particle swarm
Congress On Evolutionary Computation (CEC) optimization algorithm” IEEE Transactions on
[14] Nima Jafari Navimipour and Farnaz Sharifi Milani, 2015 Evolutionary Computation .
“Task Scheduling in the Cloud Computing Based on the
Cuckoo Search Algorithm” International Journal of
Modeling and Optimization, Vol. 5, No. 1
61