Cloud Resource Management and Scheduling
Cloud Resource Management and Scheduling
Contents
Dan C. Marinescu
Functionality.
Performance.
Cost.
Dan C. Marinescu
Motivation
The strategies for resource management for IaaS, PaaS, and SaaS
are different.
Dan C. Marinescu
2.
3.
4.
5.
Dan C. Marinescu
Dan C. Marinescu
Tradeoffs
Dan C. Marinescu
The inputs the offered workload and the policies for admission
control, the capacity allocation, the load balancing, the energy
optimization, and the QoS guarantees in the cloud.
Dan C. Marinescu
external
traffic
Predictive
filter
forecast
Optimal
controller
(k )
s
u* (k)
Queuing
dynamics
(k )
The controller uses the feedback regarding the current state and the estimation
of the future disturbance due to environment to compute the optimal inputs over
a finite horizon. r and s are the weighting factors of the performance index.
Dan C. Marinescu
10
Application 1
Application 1
SLA 1
Application
controller
.
.
SLA n
Application n
Application
controller
VM
VM
Monitor
Decision
Cloud Controller
Actuator
VM
VM
Monitor
Decision
Actuator
Cloud Platform
Dan C. Marinescu
11
Dan C. Marinescu
12
Dan C. Marinescu
13
Design decisions
Dan C. Marinescu
14
Proportional thresholding
Algorithm
Compute the integral value of the high and the low threshold as
averages of the maximum and, respectively, the minimum of the
processor utilization over the process history.
Request additional VMs when the average value of the CPU utilization
over the current time slice exceeds the high threshold.
Release a VM when the average value of the CPU utilization over the
current time slice falls below the low threshold.
Conclusions
Dan C. Marinescu
15
Dan C. Marinescu
16
Communication between
autonomous managers
Performance manager
Control policy
Power manager
Control policy
Power
data
Performance
data
Blade
Blade
Workload
generator
Workload
distribution
Blade
Power
assignment
Power
Blade
Blade
17
Dan C. Marinescu
18
R
e
w
a
r
d
R0
R1
R2
R - response time
Dan C. Marinescu
19
Si2
Si3
Si1
Si6
Si4
vk
vkmax
Si5
rkmax
rk
(b)
(a)
20
Dan C. Marinescu
21
Resource bundling
Dan C. Marinescu
22
Users provide bids for desirable bundles and the price they are
willing to pay.
Dan C. Marinescu
23
u1
Proxy
x1(t)
u2
Proxy
x2(t)
u3
Proxy
x3(t)
Auctioneer
(t ) 0
uU
Proxy
xU(t)
p(t+1)
The schematics of the ASCA algorithm; to allow for a single round auction
users are represented by proxies which place the bids xu(t). The auctioneer
determines if there is an excess demand and, in that case, it raises the price of
resources for which the demand exceeds the supply and requests new bids.
Dan C. Marinescu
24
Dan C. Marinescu
25
Round-robin.
First-Come-First-Serve (FCFS).
Shortest-Job-First (SJF).
Priority algorithms.
Dan C. Marinescu
26
Dan C. Marinescu
27
Quantity
Hard real-time
strict
Hard-requirements
Soft-requirements
Best-effort
Timing
loose
loose
strict
28
R(t)
Fai(tai)=Sai(tai)+Pai
Fai(tai)=Sai(tai)+Pai
Sai(tai)=Rai(tai)
Sai(tai)=Fai-1(tai-1)
Fai-1(tai-1)
Rai(tai)
tai-1
tai
(a)
tai-1
tai
(b)
The transmission of packet i of a flow can only start after the packet is
available and the transmission of the previous packet has finished.
(a) The new packet arrives after the previous has finished.
(b) The new packet arrives before the previous one was finished.
Dan C. Marinescu
29
The root node is the processor and the leaves of this tree are the
threads of each application.
Dan C. Marinescu
30
Server
VM2
(3)
VM1
(1)
A2
(1)
A1
(3)
t1,1
(1)
t1,2
(1)
t1,3
(1)
t2
(1)
A3
(1)
vs1
(1)
vs2
(1)
vs3
(1)
The SFQ tree for scheduling when two virtual machines VM1 and VM2 run
on a powerful server
Dan C. Marinescu
31
The scheduler thread maintains its own scheduler virtual time (SVT)
defined as the minimum actual virtual time of any thread.
The threads are dispatched in the order of their effective virtual time,
policy called the Earliest Virtual Time (EVT).
32
Thread a
0
12
24
36
t
12
24
36
thread b is suspended
48
thread b is reactivated
Thread b
0
12
12
15
Virtual time
36
24
12
Real time
15
18
21
24
36
48
60
Top the virtual startup time and the virtual finish time and function of the real time t
for each activation of threads a and b.
Bottom the virtual time of the scheduler v(t) function of the real time
Dan C. Marinescu
33
390
360
300
270
210
180
120
90
30
11
14
20
18
23
29
32
27
38
36
41
45
The effective virtual time and the real time of the threads a
(solid line) and b (dotted line) with weights wa = 2 wb when
the actual virtual time is incremented in steps of 90 mcu.
Dan C. Marinescu
34
390
360
300
270
210
180
120
90
30
12
14
21
18
23
30
27
32
36
39
41
45
-60
The effective virtual time and the real time of the threads a (solid line), b (dotted line),
and the c with real-time constraints (thick solid line). Thread c wakes up periodically at
times t=9, 18, 27, 36,, is active for 3 units of time and has a time warp of 60 mcu.
Dan C. Marinescu
35
Dan C. Marinescu
36
Dan C. Marinescu
37
S0
S1
S2
S3
Sn
The timing diagram for the Optimal Partitioning Rule; the algorithm requires
worker nodes to complete execution at the same time. The head node, S0,
distributes sequentially the data to individual worker nodes.
Dan C. Marinescu
38
S0
S1
S2
S3
1
2
3
Sn
The timing diagram for the Equal Partitioning Rule; the algorithm assigns an equal
workload to individual worker nodes.
Dan C. Marinescu
39