Економічно ефективний гібридний генетичний алгоритм планування робочого процесу в хмарі
Cloud computing plays a significant role in everyone’s lifestyle by snugly linking communities, information, and trades across the globe. Due to its NP-hard nature, recognizing the optimal solution for workflow scheduling in the cloud is a challenging area. We proposed a hybrid meta-heuristic cost-e...
Saved in:
| Date: | 2022 |
|---|---|
| Main Authors: | , , |
| Format: | Article |
| Language: | English |
| Published: |
The National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute"
2022
|
| Subjects: | |
| Online Access: | https://journal.iasa.kpi.ua/article/view/263013 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Journal Title: | System research and information technologies |
| Download file: | |
Institution
System research and information technologies| _version_ | 1866391924030570496 |
|---|---|
| author | Bothra, Sandeep Kumar Singhal, Sunita Goyal, Hemlata |
| author_facet | Bothra, Sandeep Kumar Singhal, Sunita Goyal, Hemlata |
| author_sort | Bothra, Sandeep Kumar |
| baseUrl_str | http://journal.iasa.kpi.ua/oai |
| collection | OJS |
| datestamp_date | 2022-12-21T22:15:21Z |
| description | Cloud computing plays a significant role in everyone’s lifestyle by snugly linking communities, information, and trades across the globe. Due to its NP-hard nature, recognizing the optimal solution for workflow scheduling in the cloud is a challenging area. We proposed a hybrid meta-heuristic cost-effective load-balanced approach to schedule workflow in a heterogeneous environment. Our model is based on a genetic algorithm integrated with predict earliest finish time (PEFT) to minimize makespan. Instead of assigning the task randomly to a virtual machine, we apply a greedy strategy that assigns the task to the lowest-loaded virtual machine. After completing the mutation operation, we verify the dependency constraint instead of each crossover operation, which yields a better outcome. The proposed model incorporates the virtual machine’s performance variance as well as acquisition delay, which concedes the minimum makespan and computing cost. One of the most astounding aspects of our cost-effective hybrid genetic algorithm (CHGA) is its capacity to anticipate by creating an optimistic cost table (OCT) while maintaining quadratic time complexity. Based on the results of our meticulous experiments on some real-world workflow benchmarks and comprehensive analysis of some recently successful scheduling algorithms, we concluded that the performance of our CHGA is melodious. CHGA is 14.58188%, 11.40224%, 11.75306%, and 9.78841% cheaper than standard Ant Colony Optimization (ACO), Particle Swarm Optimization (PSO), Cost Effective Genetic Algorithm(CEGA), and Cost-Effective Load-balanced Genetic Algorithm (CLGA), respectively. |
| doi_str_mv | 10.20535/SRIT.2308-8893.2022.3.08 |
| first_indexed | 2025-07-17T10:27:56Z |
| format | Article |
| fulltext |
Sandeep Kumar Bothra, Sunita Singhal, Hemlata Goyal, 2022
Системні дослідження та інформаційні технології, 2022, № 3 121
UDC 519-62
DOI: 10.20535/SRIT.2308-8893.2022.3.08
COST EFFECTIVE HYBRID GENETIC ALGORITHM
FOR WORKFLOW SCHEDULING IN CLOUD
SANDEEP KUMAR BOTHRA, SUNITA SINGHAL, HEMLATA GOYAL
Abstract. Cloud computing plays a significant role in everyone’s lifestyle by snugly
linking communities, information, and trades across the globe. Due to its NP-hard
nature, recognizing the optimal solution for workflow scheduling in the cloud is a
challenging area. We proposed a hybrid meta-heuristic cost-effective load-balanced
approach to schedule workflow in a heterogeneous environment. Our model is based
on a genetic algorithm integrated with predict earliest finish time (PEFT) to mini-
mize makespan. Instead of assigning the task randomly to a virtual machine, we ap-
ply a greedy strategy that assigns the task to the lowest-loaded virtual machine. Af-
ter completing the mutation operation, we verify the dependency constraint instead
of each crossover operation, which yields a better outcome. The proposed model in-
corporates the virtual machine’s performance variance as well as acquisition delay,
which concedes the minimum makespan and computing cost. One of the most as-
tounding aspects of our cost-effective hybrid genetic algorithm (CHGA) is its capac-
ity to anticipate by creating an optimistic cost table (OCT) while maintaining quad-
ratic time complexity. Based on the results of our meticulous experiments on some
real-world workflow benchmarks and comprehensive analysis of some recently suc-
cessful scheduling algorithms, we concluded that the performance of our CHGA is
melodious. CHGA is 14.58188%, 11.40224%, 11.75306%, and 9.78841% cheaper
than standard Ant Colony Optimization (ACO), Particle Swarm Optimization
(PSO), Cost Effective Genetic Algorithm(CEGA), and Cost-Effective Load-
balanced Genetic Algorithm (CLGA), respectively.
Keywords: cloud computing, cost effective, genetic algorithm, metaheuristic algo-
rithm, predict earliest finish time, Workflow scheduling.
INTRODUCTION
Cloud computing is a buzzword in the current era, which provides a very elastic
‘pay as you go’ model[1]. Dr. Raj Kumar Buyya says, “A cloud is a kind of paral-
lel and distributed system made up of a number of linked, virtualized computers
that are constantly provided and shown as one or more unified computing re-
sources in accordance with service-level agreements negotiated between the ser-
vice provider and customers” [2]. On the basis of physical location and distribu-
tion, various deployment models are available now. Task scheduling is critical for
maximizing the use of cloud resources as well as providing end users with quality
of service (QoS) [3].
Static scheduling and dynamic scheduling are two different sorts of task
scheduling issues. In the static category, all task characteristics, including the
costs of computation and communication for each activity as well as how those
activities relate to one another, are known in advance. However, the dynamic cat-
egory makes such information unavailable and makes judgments at runtime [4].
Furthermore, static scheduling refers to compile-time scheduling, and dynamic
scheduling refers to scheduling at runtime.
Sandeep Kumar Bothra, Sunita Singhal, Hemlata Goyal
ISSN 1681–6048 System Research & Information Technologies, 2022, № 3 122
Heuristic-based and guided random search-based algorithms are the two
types of static scheduling algorithms that are most frequently used.
Heuristic-based algorithms deliver approximate, frequently excellent results
because of their polynomial time complexity [5]. Similar to heuristic-based
algorithms, guided random search-based algorithms provide approximations, but
the results’ quality may be increased by including more rounds, which raises the
cost of the methods [6].
Dependent tasks are represented as a workflow, which is a set of nodes and
edges, with each node representing a job and each edge representing follow-up
dependence [7]. Workflow is scheduled and executed by the workflow
management system where tasks are scheduled and provisioned to virtual
machines [8]. Various researchers are engaged themselves to resolve the problem
of resource scheduling in cloud. Many tasks have been completed using the
heuristic approach; however it is not very excellent owing to its problem-
dependent aspect, which is that it is unable to provide a globally optimum
solution. As a result, the researcher prefers to use a meta-heuristic technique. Due
to its task-independent character, the meta-heuristic method delivers a global
optimal solution. The researcher’s ultimate objective is to maximize cloud
resource usage while lowering costs for cloud’s end users [9], [10].
The majority of quadratic time complexity list-based scheduling algorithms
just evaluate the current task when allocating a task to a processor. Although it is
a low-cost method, it does not examine what comes before the current job, which
could lead to poor decisions in some cases. Lookahead [11] is an example of an
algorithm that analyses the impact on child nodes, but it raises the time
complexity to the fourth order. As a result, we used the PEFT approach in our
proposed model “cost-effective hybrid genetic algorithm” (CHGA). One of
PEFT’s most amazing features is its ability to predict by making an OCT with
optimistic costs while preserving quadratic time complexity.
Motivation. Following a comprehensive review, we motivated to resolve a
research gap where many parameters, such as virtual machine (VM) performance
variation, booting time, and shutdown time, as well as load balancing across VMs
and minimize execution time in parallel using heuristics approaches, are not
effectively addressed.
Objective. The goal of this research is to arrange the tasks of workflow in
such a way that it reduces not only computation costs but also processing time
while maintaining load balance among virtual machines. Our objective was to
create a hybrid meta-heuristic technique for reducing processing time and expense
while maintaining load balance across virtual machines under time constraint.
During population initialization in genetic algorithm, we employed predict
earliest finish time (PEFT) approach, which is significant for decreasing the
makespan. We also took into account the time it takes for a virtual machine (VM)
to boot up and performance fluctuations, both of which have an effect on
computation time and execution cost. This represents the novelty of our model.
To keep the load balanced among the virtual machines, we employed a greedy
method [10] in our proposed model “cost-effective hybrid genetic algorithm”
(CHGA).
The remaining sections of the paper are structured as follows. The second
section goes through some background information. Sections III and IV contain
problem definitions and details of our proposed model, respectively. The per-
formance review may be found in Section V. Section V consists of two sub-
sections: result analysis and discussion. Finally, Section VI draws the paper
toward its conclusion.
Cost effective hybrid genetic algorithm for workflow scheduling in cloud
Системні дослідження та інформаційні технології, 2022, № 3 123
RELATED WORK
Comprehensive work has been done by us on various meta-heuristic algorithms in
literature [9], some of them are genetic algorithm (GA) [12], ant colony optimiza-
tion (ACO) [13], particle swarm optimization (PSO) [14], artificial bee colony
algorithm (ABC) [15] etc.
The RDPSO (Revised Discrete PSO) technique employed in[16] involves a
greedy adaptive search procedure to establish the swarm particle, followed by the
computation of local best and global best. It focuses on achieving the lowest
execution cost, but load balancing across virtual machines is not provided.
In literature [17] , researchers designed a PSO-based algorithm to minimize
execution cost as well as makespan and compared it with the Best Resource
Selection (BRS) algorithm, but they didn’t take into account dependent tasks in
scheduling approach. This deficiency is removed by [18], where ACO is applied
to the workflow. They used an approach ant strategy: front ant and back ant. Their
study took into account pre-execution time and a pheromone threshold value, but
they did not mimic a different type of scientific workflow.
Researchers in Dynamic Objective-based GA (DOGA) [19] reduced the cost
of workflow execution and reached a result that was comparable to PSO, but they
ignored the booting time factor and the load balancing approach. Authors pro-
vided a GA-based technique in the literature [20], where cost and time span are
both reduced within a user-defined deadline. This paper was not based on real
world workflow, which is accomplished by [21].
A multi-objective PSO approach with a weighted linear transform fitness
function is presented in the literature [22] and they conclude that their proposed
algorithm is better than genetic algorithms, but they consider only makespan and
resource utilization as parameters, not other parameters like execution cost, load
distribution, etc. The outcome of their experiment is not very trustworthy due to
the limited size of their workflow.
A new approach SACO Slave ACO(SACO) [23] proposed a slave-ant
concept where two techniques are used: diversification and reinforcement. These
techniques escape slave ants from long paths. Their experiment didn’t consider
heterogeneous resources or load balance concepts. Multi Objectives ACO(MO-
ACO) [24] addresses this flaw by presenting an approach for scheduling jobs in a
cloud context that considers load balancing with cost and time but ignores
dependent tasks in the cloud.
The Greedy-Ant-based ACO [25] approach uses forward and backward
dependency techniques to build transition probability. To allocate the virtual
machine, they used a greedy strategy. They compared their meta-heuristic model
with a heuristic that has a high level of scarceness in their research.
In the suggested GA [26], VMs are grouped based on their capacity to
shorten the time it takes for a procedure to complete. Before clustering the VM,
they considered cost computing to make this approach more successful. They did
not include the VM termination delay in their study, and they also did not
examine the load balancing idea.
In the literature [27], authors focused on function optimization using improved
genetic algorithms, whereas machine learning concepts are included with GA [28].
In order to reduce makespan and cost, authors presented a HEFT-ACO tech-
nique [29] that is based on the heterogeneous earliest end time (HEFT) and ACO,
but they did not integrate the idea of load balancing across virtual machines.
Sandeep Kumar Bothra, Sunita Singhal, Hemlata Goyal
ISSN 1681–6048 System Research & Information Technologies, 2022, № 3 124
In research [10], authors focused on balancing the load among virtual ma-
chines to increase performance. To achieve this, a greedy seeding strategy was
applied with the genetic algorithm, but there was no efficient heuristic approach
to reduce the makespan and cost.
Following a comprehensive review, we observed a research gap where many
parameters, such as virtual machine (VM) performance variation, booting time,
and shutdown time, as well as load balancing across VMs and minimize execution
time in parallel using heuristics, are not effectively addressed.
Our goal was to develop a hybrid meta-heuristic approach for processing
time and cost reduction in a time-constrained situation while maintaining load
balance across virtual machines. To accomplish this, we used the PEFT strategy
during population initialization, which helps to reduce the makespan. The ability
of PEFT to anticipate by building an optimistic cost table (OCT) while preserving
quadratic time complexity is one of its most amazing features. We also took into
account the time it takes for a VM to boot up and performance fluctuations, both
of which have an effect on computation time and execution cost. To keep the load
balanced among the virtual machines, we employed a greedy method in our pro-
posed model CHGA.
PROBLEM DEFINITION
Minimization of computing costs and makespan of scientific workflow with bal-
ancing the loads among virtual machines is the main motto of our proposed Cost
Effective Hybrid Genetic Algorithm (CHGA), which works under a user-defined
deadline constraint. A simple workflow is depicted in Fig. 1, and its correspond-
ing encoding is represented in Fig. 2.
In a heterogeneous cloud computing environment, variation in the per-
formance of VMs and booting time delays are two main factors that impact the
makespan of the scientific workflow. That’s why we considered both of the
above-mentioned parameters in our proposed model. Schedule is illustrated as
Fig. 1. Example of Workflow
Fig. 2. Encoding of workflow depicted in Fig. 1
Cost effective hybrid genetic algorithm for workflow scheduling in cloud
Системні дослідження та інформаційні технології, 2022, № 3 125
},,,{ TECTETMapVMS SET , Where VMSET a virtual machine pool, and Map
denotes the selection of an appropriate virtual machine to perform a task. Total
Execution Time and Total Execution Cost, respectively, are abbreviated as TET
and TEC. We generate the value 0%–24% randomly as a performance variation,
and the acquisition delay is assumed to be 1 minute for each VM. We defined the
problems to achieve our objectives. If TET violates the deadline constraint, then TEC
is not computed, otherwise it will be computed.
THE PROPOSED HYBRID GENETIC ALGORITHM
Description of the CHGA
We explained CHGA step-by-step here.
Step 1. During population initialization, the chromosome is encoded in the
same way as in the meta-heuristic technique provided by [25]. OrderOfTask,
Task, VM, and VmType are four fields that are used to encode a chromosome.
If the total population is N , then )1( N is initialized using a random tech-
nique and the remaining is using PEFT. PEFT is described in section IV B.
Step 2. During the population initialization, we employed a greedy tech-
nique [10] to balance the load among several virtual machines, as illustrated
through flowchart in this paper. This strategy assigns the new task to those VMi
which have minimum load at that time. Compute Load Li on a VMi:
1
i
j
n
j
i VMC
T
L
.
Step 3. Now compute the fitness of each candidate.
Step 3.1: Calculate the execution and transfer times for all of the individual’s tasks.
Divide the` task’s size
iTSize by the virtual machine’s processing speed
kVMSpeed to find the task’s execution time )( iVM TET
k
)(
k
i
k
VM
T
iVM Speed
Size
TET .
The size of an output data file
iTDataFile and the typical bandwidth β may
be used to compute the communication time
ijETT :
β
i
ij
T
E
DataFile
TT .
If a task is appear as root task or all parent tasks are on the same VM then
communication time is zero.
Step 3.2. Calculate Execution start time
iTST and finish time
pTT F now.
iTST is an estimated time to start the execution. It is equal to acquisition delay if
the task is appear as root node, otherwise
}}},{{{
pippi ETTkT TTFTMaxVMAvailMaxST .
Here )( kVMAvail is the time of VMk when it is ready to execute a new
task and the VM’s performance variation is denoted by PerVar.
pTT F indicates
completion time of parent’s task.
Sandeep Kumar Bothra, Sunita Singhal, Hemlata Goyal
ISSN 1681–6048 System Research & Information Technologies, 2022, № 3 126
PerVar
TET
STVMAvail
iVM
Tk
k
i 1
.
iTFT is the time indicates to finish the execution.
PerVar
TET
STFT
iVM
TT
k
ii 1
.
Step 3.3. Now compute TET and TEC as given below:
} { , If itFTTETDTET ,
ST_VMno ET_VMno
* 1
alTimeInterv
VMtype
VM
n k
n
CTEC .
Equation (1) to Equation (8) are from literature [19], [20].
Step 4. After computing the fitness of the chromosome, the tournament-
based algorithm is used to select the best two individuals for further crossover.
Step 5. A two-point crossover is used as depicted in Fig. 3.
Step 6. Apply mutation operations as illustrated in Fig. 4. Now check the
dependency constraint on it.
If a new individual follows the dependency constraint, then it is accepted,
otherwise it is discarded.
Fig. 3. Crossover Operation
Fig. 4. Mutation Operation
Cost effective hybrid genetic algorithm for workflow scheduling in cloud
Системні дослідження та інформаційні технології, 2022, № 3 127
Step 7. If the fittest solution meets our objectives under the user-defined
constraint, then stop the iteration; otherwise, continue it from step 3 after replac-
ing the least fit candidate with the better new solution.
A picture is worth a thousand words, that’s why we depict our proposed
model CHGA through a block diagram and flowchart, as shown in Fig. 5, a and
Fig. 5, b respectively.
A Glance on PEFT
The PEFT [30] consists of two stages: a task prioritization phase that identifies
priority of task and a VM selection step that determines the optimum VM for exe-
cuting the present job. Both stages are centered on OCT. By computing an OCT
and retaining quadratic time complexity, this algorithm can forecast. Earliest Fin-
Fig. 5. Block Diagram of Proposed model CHGA (a); flowchart of Proposed model CHGA (b)
a
Population initialization
using PEFT
Replace worst candidate
by Best one solution
b
Sandeep Kumar Bothra, Sunita Singhal, Hemlata Goyal
ISSN 1681–6048 System Research & Information Technologies, 2022, № 3 128
ish Time (EFT) of a node n on processor p is sum of earliest start time and com-
putation time of a node n on processor p. Illustrated in Fig. 6.
PERFORMANCE EVALUATION
Baseline Algorithm
In the current era, ACO and PCO are buzzwords. Both meta-heuristic algorithms
are inspired by the natural process of resolving NP-hard problems like optimiza-
tion. That’s why we used them as baseline algorithms as they contributed to solv-
ing the same problem addressed here. Except these we used CEGA [20] and
CLGA [10] as baseline algorithms.
Pheromone-based communication in an ant is to find the best solution. Ini-
tially, all the routes have the same probability of selection, i.e., ‘no bias’ due to
the same or no pheromone. A local update rule is applied when the ant constructs
the route, i.e., solution. Longer pathways vaporise or disintegrate more quickly
than shorter ones do. Shorter pathways therefore accumulate more pheromones
over time. Pheromone’s quantity is responsible for indirect communication,
which is known as ‘stigmergy’ [18]. When all the ants have completed their
routes, then a global update operation is performed. Now the selection of the path
is biased and the best ant is allowed to update the pheromone by the pseudo-
random-proportional rule [18]. We can understand ACO from Fig. 7 and PSO
from Fig. 8.
Particle Swarm Optimization was first introduced by Kennedy and Eberhart
[22]. In this instance, swarm stands for the population, and particle for a potential
solution. Each particle is first assigned a random coordinate. The objective func-
tion, or the distance between the particle’s present position and the food, is used
to evaluate performance. PBEST indicates the local best position of a particle,
whereas refers to the velocity constant. By updating the velocity and position of
the particle, a global optimum solution can be achieved. We keep this process go-
ing until we get our objective or reach our maximum iterations [22]. This is de-
picted in Fig. 8. As baseline algorithms we used ACO, PSO, CEGA [20] and
CLGA [10].
Stop
Fig. 6. PEFT Strategy
Cost effective hybrid genetic algorithm for workflow scheduling in cloud
Системні дослідження та інформаційні технології, 2022, № 3 129
Experimental Setup
We used four types of scientific workflows: Montage, Cybershake, LIGO, and
Epigenomics as benchmarks, where the size of the workflow is 50 nodes, 100
nodes, and 500 nodes approximately.
We have implemented the proposed model CHGA in a JAVA-based robust
environment and concluded the result after executing each type of workflow 30
times. The accuracy of the obtained result varies by about ±5. As mentioned in
Table 1, we considered 5 types of VMs as specification [31]. We assumed 20
kbps average bandwidth as proposed by Amazon Elastic Block Store (EBS) [32].
A thorough analysis of the literature [33],[34] is beneficial in deciding on various
parameters.
There are 3 levels of deadline constraints: hard, crunch, and soft, which are
considered in our experiment.
Deadline
iWETD ()min)1( .
For hard deadline range of : 2.10 .
For crunchy deadline range of : 8.22.1 .
For soft deadline range of : 4.48.2 .
Here α indicates step length, whose value is 0.4.
Fig. 7. ACO Algorithm Fig. 8. PSO Algorithm
Sandeep Kumar Bothra, Sunita Singhal, Hemlata Goyal
ISSN 1681–6048 System Research & Information Technologies, 2022, № 3 130
T a b l e 1 . Configuration of VM in our practical approach
VM Types m1.Small m1.Large m1.Xlarge c1.Medium C1.Xlarge
Processing Capacity
(GFLOPS)
4.4 17.6 35.2 22 88
ECUs (Speed) 1 4 8 5 20
Cores 1 2 4 2 8
Memory (GB) 1.7 7.5 15 1.7 7
Disk(GB) 160 850 1690 350 1690
Cost /Hr. ($) 0.04 0.16 0.32 0.2 0.8
RESULT AND ANALYSIS
Evaluation of Deadline Constraint
Our suggested CHGA is evaluated and compared using baseline algorithms in
order to fulfill our goal within a user-defined deadline, as depicted in Table 2 and
Fig. 9. The hit rate of our proposed CHGA is better than that of other baseline
algorithms, which represents its robustness. The capacity of PEFT to predict the
impact of scheduling the all children task of the current parent task reduced the
makespan of workflow and improved the hit rate of CHGA.
T a b l e 2 . Analysis of Hit Rate based on deadline
Deadline Algorithm Montage Cybershake LIGO Epigenomics
CHGA 96.3002 94.0645 93.0989 92.3004
ACO 53.9809 58.2309 52.0051 57.4506
PSO 69.0989 67.9882 68.0898 69.1216
CEGA 92.3433 88.4844 88.5034 83.4908
Hard
CLGA 95.5022 91.4788 91.4602 88.0223
CHGA 99.8956 99.8002 99.7444 99.8288
ACO 72.0989 73.0899 71.9004 74.0112
PSO 79.0767 80.3503 81.0302 78.9704
CEGA 99.5011 99.6202 99.5002 99.6055
Crunch
CLGA 99.5067 99.7601 99.5676 99.7388
CHGA 99.8876 99.7909 99.7708 99.8092
ACO 78.0998 76.0038 77.0902 78.2312
PSO 83.4534 82.0034 85.7801 86.5709
CEGA 99.6767 99.7022 99.6081 99.5003
Soft
CLGA 99.6878 99.7803 99.7003 99.7099
Cost effective hybrid genetic algorithm for workflow scheduling in cloud
Системні дослідження та інформаційні технології, 2022, № 3 131
Load-Balance Evaluation
Greedy strategy plays an important role in load balance. Finding a virtual machine
with a low load is important before we allocate a task iT to an individual. In order
a
b
c
Fig. 9. Analysis under: Hard Deadline Constraint (a); Crunch Deadline Constraint (b);
Soft Deadline Constraint (c)
Sandeep Kumar Bothra, Sunita Singhal, Hemlata Goyal
ISSN 1681–6048 System Research & Information Technologies, 2022, № 3 132
to manage the load balance, we map the task iT with iVM using the greedy tech-
nique after identifying iVM with minimum load.
The capacity of VMCi can be calculated as given by Equation, where the
number of processing elements is PEnum:
mipsnum PEPEVMC i .
All VMCi are collectively known as Virtual Machine Capacity (VMC), and
m is the total number of sVM :
m
i i1VMCVMC .
Load iL on a iVM is as Equation.
Total load TL is as Equation
m
i iL1TL .
Load capacity per unit is puLC as Equation
VMC
TL
puLC .
Threshold value iTH is as Equation
ipui LC VMCTH .
The threshold value THi is compared with the load of VMi to determine the
status of VMi, i.e., under-loaded, balanced or over-loaded. The result of our ex-
periment shows that with ACO, VM1 is overloaded by +82% and VM3 is under-
loaded by -58%. In contrast, with PSO, VM5 is overloaded by + 69% and VM4 is
under-loaded by -63%, as shown in Fig. 10. Our model CHGA exhibited better
load-balance compare to ACO, PSO, and CEGA, which denotes the robustness of
CHGA. When we used the proposed CHGA, VM4 was overloaded by +26%,
while VM3 was under-loaded by –12%. Illustrated in Fig. 10.
Fig. 10. Comparatively analysis of Load Balance
Cost effective hybrid genetic algorithm for workflow scheduling in cloud
Системні дослідження та інформаційні технології, 2022, № 3 133
Cost and Makespan Evaluation
Our holistic comparison between the baseline and our proposed CHGA is de-
picted in Fig. 11 – 14. The obtained result of our experiment indicates the robust-
ness of our proposed model CHGA. CHGA is 14.58188%, 11.40224%,
11.75306%, and 9.78841% cheaper than standard ACO, PSO, CEGA, and CLGA,
respectively. CHGA’s average makespan is 34.73619%, 31.48127%, 5.71553%,
and 9.73710% lower than standard ACO, PSO, CEGA, and CLGA, respectively.
Fig. 11. Comparatively analysis of Cost. Began
Sandeep Kumar Bothra, Sunita Singhal, Hemlata Goyal
ISSN 1681–6048 System Research & Information Technologies, 2022, № 3 134
Fig. 12. Comparatively analysis of Cost. Continued
Fig. 13. Comparatively analysis of Makespan. Began
Cost effective hybrid genetic algorithm for workflow scheduling in cloud
Системні дослідження та інформаційні технології, 2022, № 3 135
The capacity of PEFT to predict the impact of scheduling the all children
task of the current parent task reduced the makespan of workflow and improved
the execution cost in term of minimization in our proposed model. The obtained
result of our experiment indicates the robustness of our proposed model CHGA.
CHGA is 14.58188%, 11.40224%, 11.75306%, and 9.78841% cheaper than stan-
dard ACO, PSO, CEGA, and CLGA, respectively. CHGA’s average makespan is
34.73619%, 31.48127%, 5.71553%, and 9.73710% lower than standard ACO,
PSO, CEGA, and CLGA, respectively.
DISCUSSION
Because the best schedules take into account both the gain in a sequence of tasks
as well as the immediate gain in processing time, we observed that the best meta-
heuristic schedules could not be achieved if we adhered to the conventional strat-
egy of selecting processors based only on current task execution time, so we used
the PEFT strategy during population initialization, which helps to reduce the
makespan. Its capacity to predict the impact of scheduling all child tasks of the
current parent task This attribute allows one to make the perfect decision when
selecting the perfect virtual machine. We also took into account the time it takes
for a VM to boot up and performance fluctuations, both of which have an influ-
ence on computation time and execution cost. These statements are verified by the
obtained results of our experiments, which indicate the robustness of our pro-
posed model CHGA. CHGA is 14.58188%, 11.40224%, 11.75306%, and
9.78841% cheaper than standard ACO, PSO, CEGA, and CLGA, respectively.
CHGA’s average makespan is 34.73619%, 31.48127%, 5.71553%, and 9.73710%
lower than standard ACO, PSO, CEGA, and CLGA, respectively.
We also applied the greedy strategy during the initialization of the popula-
tion, which plays an important role in load balancing among VMs. When we used
the proposed CHGA, VM4 was overloaded by +26%, while VM3 was under-
loaded by -12%, which shows our model CHGA is better in load-balancing com-
pared to ACO, PSO, and CEGA.
CONCLUSIONS AND FUTURE WORK
To schedule scientific workflow, we introduced our meta-heuristic, cost-effective,
load-balanced hybrid evolutionary method. To balance the load among VMs in a
Fig. 14. Comparatively analysis of Makespan. Continued
Sandeep Kumar Bothra, Sunita Singhal, Hemlata Goyal
ISSN 1681–6048 System Research & Information Technologies, 2022, № 3 136
heterogeneous environment, an effective encoding approach with a greedy strat-
egy is used. We also employed the PEFT technique to make our algorithm more
cost-effective. Under a user-defined deadline, we considered three parameters:
makespan, computation cost, and load balance, and rigorously tested four types of
scientific workflows with varied task sizes. Our experimental results proved that
the proposed CHGA algorithm’s performance is better than the ACO, PSO,
CEGA, and CLGA in minimizing the computing cost and execution time as well
as balancing the load among virtual machines. CHGA is 17.48570%, 15.30489%,
11.75306%, and 9.78841% cheaper than standard ACO, PSO, CEGA, and CLGA,
respectively. CHGA’s average makespan is 34.73619%, 31.48127%, 5.71553%,
and 9.73710% lower than standard ACO, PSO, CEGA, and CLGA, respectively.
In the future, we will consider the dynamic nature of workflow with the latest me-
ta-heuristic algorithms like Cuckoo search, Firefly, Lion, and Jaya, etc.
CONFLICT OF INTEREST
The authors declare that they have no conflict of interest.
REFERENCES
1. S. Ibrahim, B. He, and H. Jin, “Towards pay-as-you-consume cloud computing,”
Proc. 2011 IEEE Int. Conf. Serv. Comput. SCC 2011, pp. 370–377. doi:
10.1109/SCC.2011.38.
2. R. Buyya, C.S. Yeo, S. Venugopal, J. Broberg, and I. Brandic, “Cloud computing
and emerging IT platforms: Vision, hype, and reality for delivering computing as the
5th utility,” Futur. Gener. Comput. Syst., 25(6), pp. 599–616, 2009. doi:
10.1016/j.future.2008.12.001.
3. A.S. Kulik, A.G. Chukhray, and O.V. Havrylenko, “Information technology for cre-
ating intelligent computer programs for training in algorithmic tasks. Part 1: mathe-
matical foundations,” System Research & Information Technologies, no. 4, 2021.
doi: 10.20535/SRIT.2308-8893.2021.4.02
4. S. Singhal and J. Grover, “Hybrid biogeography algorithm for reducing power con-
sumption in cloud computing,” 2017 Int. Conf. Adv. Comput. Commun. Informatics,
ICACCI 2017, pp. 121–124. doi: 10.1109/ICACCI.2017.8125827.
5. S. Singhal and J. Patel, “Load balancing scheduling algorithm for concurrent work-
flow,” Comput. Informatics, 37(2), pp. 311–326, 2018. doi: 10.4149/cai_2018_2_311.
6. G. Dalin and V. Radhamani, “IRIAL-an improved approach for VM migrations in
cloud computing,” International Journal of Advanced Technology and Engineering
Exploration, 2018 July 1, 5(44), pp. 165–171.
7. J. Yu and R. Buyya, “A taxonomy of workflow management systems for Grid comput-
ing,” J. Grid Comput., 3(3–4), pp. 171–200, 2005. doi: 10.1007/s10723-005-9010-8.
8. D. Malhotra, “An adaptive threshold policy for host overload detection in cloud data
centre,” International Journal of Advanced Technology and Engineering Explora-
tion, 2021 Oct 1, 8(83), pp. 1315–1335.
9. S.K. Bothra and S. Singhal, “Nature-inspired metaheuristic scheduling algorithms in
cloud: A systematic review,” Sci. Tech. J. Inf. Technol. Mech. Opt., 21(4),
pp. 463–472, 2021. doi: 10.17586/2226-1494-2021-21-4-463-472.
10. S.K. Bothra, S. Singhal, and H. Goyal, “Deadline-constrained cost-effective load-
balanced improved genetic algorithm for workflow scheduling,” Int. J. Inf. Technol.
Web Eng., 16(4), pp. 1–34, 2021. doi: 10.4018/IJITWE.2021100101.
Cost effective hybrid genetic algorithm for workflow scheduling in cloud
Системні дослідження та інформаційні технології, 2022, № 3 137
11. L.F. Bittencourt, R. Sakellariou, and E.R.M. Madeira, “DAG scheduling using a loo-
kahead variant of the heterogeneous earliest finish time algorithm,” Proc. 18th Eu-
romicro Conf. Parallel, Distrib. Network-Based Process, PDP 2010, pp. 27–34. doi:
10.1109/PDP.2010.56.
12. V.M. Sineglazov, K.D. Riazanovskiy, and O.I. Chumachenko, “Multicriteria condi-
tional optimization based on genetic algorithms,” System Research & Information
Technologies, no. 3, pp. 89–104, 2020. doi: 10.20535/SRIT.2308-8893.2020.3.07
13. Dorigo and C. Blum, “Ant colony optimization theory: A survey,” Theoretical Com-
puter Science, 344, pp. 243–278, 2005. doi: 10.1016/j.tcs.2005.05.020.
14. S.K. Patel and A.K. Sharma, “Improved PSO based job scheduling algorithm for re-
source management in grid computing,” International Journal of Advanced Tech-
nology and Engineering Exploration, 2019 May 1, 6(54), pp. 152–161.
15. D. Karaboga and B. Akay, “A modified Artificial Bee Colony (ABC) algorithm for
constrained optimization problems,” Appl. Soft Comput. J., 11(3), pp. 3021–3031,
2011. doi: 10.1016/j.asoc.2010.12.001.
16. Z. Wu, Z. Ni, and L. Gu, “A Revised Discrete Particle Swarm Optimization for
Cloud Workflow Scheduling,” Proceedings of the 2010 International Conference on
Computational Intelligence and Security, pp. 184–188. doi: 10.1109/CIS.2010.46.
17. S. Pandey, L. Wu, S.M. Guru, and R. Buyya, “A particle swarm optimization-based
heuristic for scheduling workflow applications in cloud computing environments,”
Proc. Int. Conf. Adv. Inf. Netw. Appl. AINA, 2010, pp. 400–407. doi:
10.1109/AINA.2010.31.
18. F. Engineering, “Scheduling Workflow in Cloud Computing Based on Ant Colony
Optimization Algorithm,” 2013 Sixth International Conference on Business Intelli-
gence and Financial Engineering (BIFE), pp. 57–61. doi: 10.1109/BIFE.2013.14.
19. Z. Chen and K. Du, “Deadline Constrained Cloud Computing Resources Scheduling
for Cost Optimization based on Dynamic Objective Genetic Algorithm,” Evolution-
ary Computation (CEC), 2015 IEEE Congress on, IEEE, 2015, pp. 708–714.
20. J. Meena, M. Kumar, and M. Vardhan, “Cost Effective Genetic Algorithm for Work-
flow Scheduling in Cloud Under Deadline Constraint,” IEEE Access, 4, pp. 5065–
5082, 2016. doi: 10.1109/ACCESS.2016.2593903.
21. M. Mollajafari and H.S. Shahhoseini, “A cost-optimized GA-based heuristic for
scheduling time-constrained workflow applications in infrastructure clouds using an
innovative feasibility-assured decoding mechanism,” J. Inf. Sci. Eng., 32(6), pp.
1541–1560, 2016. doi: 10.1688/JISE.2016.32.6.8.
22. Gupta, V. Gajera, P.K. Jana, and I.S. Member, “An Effective Multi-Objective Work-
flowScheduling in Cloud Computing: A PSO based Approach, “Ninth International
Conference on Contemporary Computing”, 2016.
23. Y. Moon, H. Yu, J.M. Gil, and J. Lim, “A slave ants based ant colony optimization
algorithm for task scheduling in cloud computing environments,” Human-centric
Comput. Inf. Sci., 2017. doi: 10.1186/s13673-017-0109-2.
24. A. You, M.A.Y. Be, and I. In, “Task scheduling based on ant colony optimization in
cloud environment,” AIP Conference Proceedings, vol. 1834, iss. 1, 040039, 2017.
doi: 10.1063/1.4981635.
25. B. Xiang, B. Zhang, and L. Zhang, “Greedy Ant: Ant Colony System-Inspired
Workflow Scheduling for Heterogeneous Computing,” IEEE Access, 2017. doi:
10.1109/ACCESS.2017.2715279.
26. S.K. Sonkar and M.U. Kharat, “Load prediction analysis based on virtual machine
execution time using optimal sequencing algorithm in cloud federated environment,”
Int. J. Inf. Technol., 11(2), pp. 265–275, 2019. doi: 10.1007/s41870-019-00282-1.
27. C. Yan, M.X. Li, and W. Liu, “Application of improved genetic algorithm in func-
tion optimization,” J. Inf. Sci. Eng., 35(6), pp. 1299–1309, 2019. doi:
10.6688/JISE.201911_35(6).0008.
Sandeep Kumar Bothra, Sunita Singhal, Hemlata Goyal
ISSN 1681–6048 System Research & Information Technologies, 2022, № 3 138
28. Al-Azzawi, “Evaluation of Genetic Algorithm Optimization in Machine Learning,”
J. Inf. Sci. Eng., 36(2), 2020.
29. A. Belgacem and K. Beghdad-Bey, “Multi-objective workflow scheduling in cloud
computing: trade-off between makespan and cost,” Cluster Computing, 25(1), pp.
579–595, 2022.
30. H. Arabnejad and J.G. Barbosa, “List Scheduling Algorithm for Heterogeneous Sys-
tems by an Optimistic Cost Table,” IEEE Transactions on Parallel and Distributed
Systems, 25(3), pp. 682–694, 2014.
31. Z. Hill and M. Humphrey, “A quantitative analysis of high performance computing
with Amazon’s EC2 infrastructure: The death of the local cluster?” Proc. IEEE/ACM
Int. Work. Grid Comput., pp. 26–33, 2009. doi: 10.1109/GRID.2009.5353067.
32. Amazon Elastic Block Store. Available: https://aws.amazon.com/ebs/. Accessed 22
July, 2020.
33. M. Naderan, “Review methods for breast cancer detection using artificial intelli-
gence and deep learning methods,” System Research & Information Technologies,
no. 1, 2021. doi: 10.20535/SRIT.2308-8893.2021.1.08
34. I.V. Beyko, J.V. Spivak, and O.V. Furtel, “Generalized solutions of optimal control
problems,” System Research & Information Technologies, no. 4, 2020, pp. 104–114.
doi: 10.20535/SRIT.2308-8893.2020.4.08.
Received 15.08.2022
INFORMATION ON THE ARTICLE
Sandeep Kumar Bothra, ORCID: 0000-0003-0555-569X, Manipal University Jaipur,
(Raj.), India, e-mail: bothrajain@gmail.com
Sunita Singhal, ORCID: 0000-0003-2462-8102, Manipal University Jaipur, (Raj.), India,
e-mail: sunita.singhal@jaipur.manipal.edu
Hemlata Goyal, ORCID: 0000-0003-1344-0921, Manipal University Jaipur, (Raj.), In-
dia,e-mail: hemlata.goyal@jaipur.manipal.edu
ЕКОНОМІЧНО ЕФЕКТИВНИЙ ГІБРИДНИЙ ГЕНЕТИЧНИЙ АЛГОРИТМ
ПЛАНУВАННЯ РОБОЧОГО ПРОЦЕСУ В ХМАРІ / Сандіп Кумар Бовра, Суніта
Сінгхал, Хемлата Гоял
Анотація. Хмарні обчислення відіграють значну роль у способі життя кожно-
го, щільно пов’язуючи спільноти, інформацію та торги по всьому світу. Розпі-
знавання оптимального рішення для планування робочих процесів у хмарі є
складною сферою через його NP-жорсткий характер. Запропоновано гібрид-
ний метаевристичний економічно ефективний збалансований за навантажен-
ням підхід до планування робочого процесу в гетерогенному середовищі. Мо-
дель ґрунтується на генетичному алгоритмі, інтегрованому з прогнозом
найбільш раннього часу фінішу (PEFT), щоб мінімізувати makepan. Замість
призначення завдання випадковим чином на віртуальній машині застосовуємо
жадібну стратегію, яка відводить завдання на віртуальну машину з найменш
завантаженим. Після завершення операції мутації перевіряємо обмеження за-
лежності замість кожної операції кросовера, що дає кращий результат. Запро-
понована модель включає в себе дисперсію продуктивності віртуальної маши-
ни, а також затримку придбання, яка поступається мінімальній вартості
makepan і computing. Одним з найбільш приголомшливих аспектів економічно
ефективного гібридного генетичного алгоритму ( CHGA ) є його здатність пе-
редбачати, створюючи оптимістичну таблицю витрат ( OCT ), зберігаючи ква-
дратичну складність часу. На основі результатів ретельних експериментів над
деякими показниками робочого процесу в реальному світі та всебічного аналі-
зу деяких нещодавно успішних алгоритмів планування отримано висновок, що
продуктивність запропонованої CHGA є мелодійною.
Ключові слова: хмарні обчислення, економічно вигідні, генетичний алгоритм,
метагевристичний алгоритм, прогнозування раннього часу оброблення, плану-
вання робочого процесу.
|
| id | journaliasakpiua-article-263013 |
| institution | System research and information technologies |
| keywords_txt_mv | keywords |
| language | English |
| last_indexed | 2025-07-17T10:27:56Z |
| publishDate | 2022 |
| publisher | The National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute" |
| record_format | ojs |
| resource_txt_mv | journaliasakpiua/cd/3f9978b203b68d457ae8bafd8ecf7dcd.pdf |
| spelling | journaliasakpiua-article-2630132022-12-21T22:15:21Z Cost effective hybrid genetic algorithm for workflow scheduling in cloud Економічно ефективний гібридний генетичний алгоритм планування робочого процесу в хмарі Bothra, Sandeep Kumar Singhal, Sunita Goyal, Hemlata хмарні обчислення економічно вигідні генетичний алгоритм метагевристичний алгоритм прогнозування раннього часу оброблення планування робочого процесу cloud computing cost effective genetic algorithm metaheuristic algorithm predict earliest finish time Workflow scheduling Cloud computing plays a significant role in everyone’s lifestyle by snugly linking communities, information, and trades across the globe. Due to its NP-hard nature, recognizing the optimal solution for workflow scheduling in the cloud is a challenging area. We proposed a hybrid meta-heuristic cost-effective load-balanced approach to schedule workflow in a heterogeneous environment. Our model is based on a genetic algorithm integrated with predict earliest finish time (PEFT) to minimize makespan. Instead of assigning the task randomly to a virtual machine, we apply a greedy strategy that assigns the task to the lowest-loaded virtual machine. After completing the mutation operation, we verify the dependency constraint instead of each crossover operation, which yields a better outcome. The proposed model incorporates the virtual machine’s performance variance as well as acquisition delay, which concedes the minimum makespan and computing cost. One of the most astounding aspects of our cost-effective hybrid genetic algorithm (CHGA) is its capacity to anticipate by creating an optimistic cost table (OCT) while maintaining quadratic time complexity. Based on the results of our meticulous experiments on some real-world workflow benchmarks and comprehensive analysis of some recently successful scheduling algorithms, we concluded that the performance of our CHGA is melodious. CHGA is 14.58188%, 11.40224%, 11.75306%, and 9.78841% cheaper than standard Ant Colony Optimization (ACO), Particle Swarm Optimization (PSO), Cost Effective Genetic Algorithm(CEGA), and Cost-Effective Load-balanced Genetic Algorithm (CLGA), respectively. Облачные вычисления играют важную роль в образе жизни каждого человека, тесно связывая сообщества, информацию и сделки по всему миру. Признание оптимального решения для планирования рабочих процессов в облаке является сложной областью из-за его NP-твердого характера. Мы предложили гибридный мета-эвристический рентабельный подход с балансировкой нагрузки для планирования рабочего процесса в гетерогенной среде. Наша модель основана на генетическом алгоритме, интегрированном с прогнозированием самого раннего времени окончания ( PEFT ), чтобы минимизировать время прокачки. Вместо случайного назначения задачи на виртуальной машине мы применяем жадную стратегию, которая выделяет задачу на самую низкую загруженную виртуальную машину. После завершения операции мутации мы проверяем ограничение зависимости вместо каждой операции кроссовера, что дает лучший результат. Предлагаемая модель включает в себя разницу производительности виртуальной машины, а также задержку приобретения, что обеспечивает минимальную стоимость изготовления и вычислительной техники. Одним из наиболее поразительных аспектов нашего экономически эффективного гибридного генетического алгоритма ( CHGA ) является его способность предвидеть путем создания оптимистичной таблицы затрат ( OCT ) при сохранении квадратичной временной сложности. Основываясь на результатах наших тщательных экспериментов по некоторым реальным контрольным показателям рабочего процесса и всестороннему анализу некоторых недавно успешных алгоритмов планирования, мы пришли к выводу, что производительность нашей CHGA мелодична. Хмарні обчислення відіграють значну роль у способі життя кожного, щільно пов’язуючи спільноти, інформацію та торги по всьому світу. Розпізнавання оптимального рішення для планування робочих процесів у хмарі є складною сферою через його NP-жорсткий характер. Запропоновано гібридний метаевристичний економічно ефективний збалансований за навантаженням підхід до планування робочого процесу в гетерогенному середовищі. Модель ґрунтується на генетичному алгоритмі, інтегрованому з прогнозом найбільш раннього часу фінішу (PEFT), щоб мінімізувати makepan. Замість призначення завдання випадковим чином на віртуальній машині застосовуємо жадібну стратегію, яка відводить завдання на віртуальну машину з найменш завантаженим. Після завершення операції мутації перевіряємо обмеження залежності замість кожної операції кросовера, що дає кращий результат. Запропонована модель включає в себе дисперсію продуктивності віртуальної машини, а також затримку придбання, яка поступається мінімальній вартості makepan і computing. Одним з найбільш приголомшливих аспектів економічно ефективного гібридного генетичного алгоритму ( CHGA ) є його здатність передбачати, створюючи оптимістичну таблицю витрат ( OCT ), зберігаючи квадратичну складність часу. На основі результатів ретельних експериментів над деякими показниками робочого процесу в реальному світі та всебічного аналізу деяких нещодавно успішних алгоритмів планування отримано висновок, що продуктивність запропонованої CHGA є мелодійною. The National Technical University of Ukraine "Igor Sikorsky Kyiv Polytechnic Institute" 2022-10-30 Article Article application/pdf https://journal.iasa.kpi.ua/article/view/263013 10.20535/SRIT.2308-8893.2022.3.08 System research and information technologies; No. 3 (2022); 121-138 Системные исследования и информационные технологии; № 3 (2022); 121-138 Системні дослідження та інформаційні технології; № 3 (2022); 121-138 2308-8893 1681-6048 en https://journal.iasa.kpi.ua/article/view/263013/265046 |
| spellingShingle | хмарні обчислення економічно вигідні генетичний алгоритм метагевристичний алгоритм прогнозування раннього часу оброблення планування робочого процесу Bothra, Sandeep Kumar Singhal, Sunita Goyal, Hemlata Економічно ефективний гібридний генетичний алгоритм планування робочого процесу в хмарі |
| title | Економічно ефективний гібридний генетичний алгоритм планування робочого процесу в хмарі |
| title_alt | Cost effective hybrid genetic algorithm for workflow scheduling in cloud |
| title_full | Економічно ефективний гібридний генетичний алгоритм планування робочого процесу в хмарі |
| title_fullStr | Економічно ефективний гібридний генетичний алгоритм планування робочого процесу в хмарі |
| title_full_unstemmed | Економічно ефективний гібридний генетичний алгоритм планування робочого процесу в хмарі |
| title_short | Економічно ефективний гібридний генетичний алгоритм планування робочого процесу в хмарі |
| title_sort | економічно ефективний гібридний генетичний алгоритм планування робочого процесу в хмарі |
| topic | хмарні обчислення економічно вигідні генетичний алгоритм метагевристичний алгоритм прогнозування раннього часу оброблення планування робочого процесу |
| topic_facet | хмарні обчислення економічно вигідні генетичний алгоритм метагевристичний алгоритм прогнозування раннього часу оброблення планування робочого процесу cloud computing cost effective genetic algorithm metaheuristic algorithm predict earliest finish time Workflow scheduling |
| url | https://journal.iasa.kpi.ua/article/view/263013 |
| work_keys_str_mv | AT bothrasandeepkumar costeffectivehybridgeneticalgorithmforworkflowschedulingincloud AT singhalsunita costeffectivehybridgeneticalgorithmforworkflowschedulingincloud AT goyalhemlata costeffectivehybridgeneticalgorithmforworkflowschedulingincloud AT bothrasandeepkumar ekonomíčnoefektivnijgíbridnijgenetičnijalgoritmplanuvannârobočogoprocesuvhmarí AT singhalsunita ekonomíčnoefektivnijgíbridnijgenetičnijalgoritmplanuvannârobočogoprocesuvhmarí AT goyalhemlata ekonomíčnoefektivnijgíbridnijgenetičnijalgoritmplanuvannârobočogoprocesuvhmarí |