Green Cloud Report
Green Cloud Report
INTRODUCTION
Cloud computing is used by IT Services companies for the delivery of computing requirements
as a service to a heterogeneous community of end-recipients. The vision of computing utilities
based on a service provisioning model anticipated the massive transformation of the entire
computing industry in the 21st century whereby computing services will be readily available on
demand, like other utility services available in today’s society. Similarly, users need to pay
providers only when they access the computing services. In addition, consumers no longer need
to invest heavily or encounter difficulties in building and maintaining complex IT infrastructure.
In such a model, users access services based on their requirements without regard to where the
services are hosted. This model has been referred to as utility computing, or as Cloud computing.
The latter term denotes the infrastructure as a “Cloud” from which businesses and users can
access applications as services from anywhere in the world on demand. Hence, Cloud computing
can be classified as a new paradigm for the dynamic provisioning of computing services
supported by state-of-the-art data centers that usually employ Virtual Machine (VM)
technologies for consolidation and environment isolation purposes .
Cloud computing delivers infrastructure, platform, and software (applications) as services, which
are made available to consumers as subscription-based services under the pay-as-you-go model.
In industry these services are referred to as
can access and deploy applications from anywhere in the world on demand at competitive costs
depending on their QoS (Quality of Service) requirements .
Clouds are virtualized datacenters and applications offered as services on a subscription basis.
They require high energy usage for its operation. Today, a typical datacenter with 1000 racks
need 10 Megawatt of power to operate, which results in higher operational cost. Thus, for a
datacenter, the energy cost is a significant component of its operating and up-front costs.
According to a report published by the European Union, a decrease in emission volume of 15%–
30% is required before year 2020 to keep the global temperature increase below 2 C. Thus,
energy consumption and carbon emission by Cloud infrastructures has become a key
environmental concern
Green Computing
Green computing is the eco-friendly use of computers and related resources. Such practices
include the implementation of energy-efficient central processing units, servers, peripherals as
well as reduced resource consumption and proper disposal of electronic waste. Green computing
is a study and practice of designing , manufacturing, using, and disposing of computers, servers,
and associated subsystems—such as monitors, printers, storage devices, and networking and
communications systems—efficiently and effectively with minimal or no impact on the
environment." The goals of green computing are similar to green chemistry; reduce the use of
hazardous materials, maximize energy efficiency during the product's lifetime, and promote the
recyclability or biodegradability of defunct products and factory waste. Research continues into
key areas such as making the use of computers as energy-efficient as possible, and designing
algorithms and systems for efficiency-related computer technologies.
There are several approaches to green computing namely
• Algorithmic efficiency
• Resource allocation
• Virtualization
• Power management
while availability dwindles, there is a need to shift focus from optimizing data center resource
management for pure performance to optimizing for energy efficiency while maintaining high
service level performance. According to certain reports, the total estimated energy bill for data
centers in 2010 is $11.5 billion and energy costs in a typical data center double every five years.
Data centers are not only expensive to maintain, but also unfriendly to the environment.
Data centers now drive more in carbon emissions than both Argentina and the Netherlands. High
energy costs and huge carbon footprints are incurred due to massive amounts of electricity
needed to power and cool numerous servers hosted in these data centers. Cloud service providers
need to adopt measures to ensure that their profit margin is not dramatically reduced due to high
energy costs. For instance, Google, Microsoft, and Yahoo are building large data centers in
barren desert land surrounding the Columbia River, USA to exploit cheap and reliable
hydroelectric power.
Lowering the energy usage of data centers is a challenging and complex issue because
computing applications and data are growing so quickly that increasingly larger servers and disks
are needed to process them fast enough within the required time period. Green Cloud computing
is envisioned to achieve not only efficient processing and utilization of computing infrastructure,
but also minimize energy consumption. This is essential for ensuring that the future growth of
Cloud computing is sustainable. Otherwise, Cloud computing with increasingly pervasive front-
end client devices interacting with back-end data centers will cause an enormous escalation of
energy usage.
To address this problem, data center resources need to be managed in an energy-efficient manner
to drive Green Cloud computing. In particular, Cloud resources need to be allocated not only to
satisfy QoS requirements specified by users via Service Level Agreements (SLA), but also to
reduce energy usage.
Using a neural network predictor, the green scheduling algorithms first estimates required
dynamic workload on the servers. Then unnecessary servers are turned off in order to minimize
the number of running servers, thus minimizing the energy use at the points of consumption to
provide benefits to all other levels. Also, several servers are added to help assure service-level
agreement. The bottom line is to protect the environment and to reduce the total cost of
ownership while ensuring quality of service.
VM Migration
The problem of VM allocation can be divided in two: the first part is admission of new requests
for VM provisioning and placing the VMs on hosts, whereas the second part is optimization of
current allocation of VMs.
Optimization of current allocation of VMs is carried out in two steps: at the first step we select
VMs that need to be migrated, at the second step chosen VMs are placed on hosts using MBFD
algorithm. We propose four heuristics for choosing VMs to migrate. The first heuristic, Single
Threshold (ST), is based on the idea of setting upper utilization threshold for hosts and placing
VMs while keeping the total utilization of CPU below this threshold. The aim is to preserve free
resources to prevent SLA violation due to consolidation in cases when utilization by VMs
increases. At each time frame all VMs are reallocated using MBFD algorithm with additional
condition of keeping the upper utilization threshold not violated. The new placement is achieved
by live migration of VMs.
The other three heuristics are based on the idea of setting upper and lower utilization thresholds
for hosts and keeping total utilization of CPU by all VMs between these thresholds. If the
utilization of CPU for a host goes below the lower threshold, all VMs have to be migrated from
this host and the host has to be switched off in order to eliminate the idle power consumption. If
the utilization goes over the upper threshold, some VMs have to be migrated from the host to
reduce utilization in order to prevent potential SLA violation. We propose three policies for
choosing VMs that have to be migrated from the host.
Distributed & Cloud Computing CSC 557 Sphoorthy Asuri Maringanti
• Minimization of Migrations (MM) – migrating the least number of VMs to minimize migration
overhead.
• Highest Potential Growth (HPG) – migrating VMs that have the lowest usage of CPU relatively
to the requested in order to minimize total potential increase of the utilization and SLA violation.
• Random Choice (RC) – choosing the necessary number of VMs by picking them according to a
uniformly distributed random variable.
Experimental Setup
A generic Cloud computing environment is essential to evaluate on a large-scale virtualized data
center infrastructure. It is difficult to conduct large-scale experiments on a real infrastructure,
especially when it is necessary to repeat the experiment with the same conditions. Therefore,
simulations have been chosen as a way to evaluate the proposed heuristics. The CloudSim toolkit
has been chosen as a simulation platform as it is a modern simulation framework aimed at Cloud
computing environments. In contrast to alternative simulation toolkits (e.g. SimGrid, GandSim),
it supports modeling of on-demand virtualization enabled resource and application management.
It has been extended in order to enable power-aware simulations as the core framework does not
provide this capability. Apart from the power consumption modeling and accounting, the ability
to simulate service applications with variable over time workload has been incorporated.
Few assumptions have been made to simplify the model of the system and enable simulation-
driven evaluation. The first assumption is that the overhead of VM migration is considered as
negligible. Modeling the cost of migration of VMs is another research problem and is being
currently investigated. However, it has been shown that application of live migration of VMs can
provide reasonable performance overhead. Moreover, with advancements of virtualization
technologies, the efficiency of VM migration is going to be improved. Another assumption is
that due to unknown types of applications running on VMs, it is not possible to build the exact
model of such a mixed workload. Therefore, rather than simulating particular applications, the
utilization of CPU by a VM is generated as a uniformly distributed random variable. In the
simulations we have defined that SLA violation occurs when a VM cannot get amount of MIPS
that are requested. This can happen in cases when VMs sharing the same host require higher
CPU performance that cannot be provided due to consolidation. To compare efficiency of the
algorithms we use a characteristic called SLA violation percentage, or simply SLA violation,
which is defined as a percentage of SLA violation events relatively to the total number of
measurements.
A simulation of a data center comprises 100 heterogeneous physical nodes. Each node is
modeled to have one CPU core with performance equivalent to 1000, 2000 or 3000 Million
Instructions Per Second (MIPS), 8 Gb of RAM and 1 TB of storage. According to this model, a
host consumes from 175 W with 0% CPU utilization and up to 250 W with 100% CPU
utilization. Each VM requires one CPU core with 250, 500, 750 or 1000 MIPS, 128 MB of RAM
and 1 GB of storage. The users submit requests for provisioning of 290 heterogeneous VMs that
fills the full capacity of the simulated data center. Each VM runs a web-application or any kind
of application with variable workload, which is modeled to create the utilization of CPU
according to a uniformly distributed random variable. The application runs for 150,000 MIPS
that equals to 10 minutes of execution on 250 MIPS CPU with 100% utilization. Initially, VMs
Distributed & Cloud Computing CSC 557 Sphoorthy Asuri Maringanti
are allocated according to the requested characteristics assuming 100% utilization. Each
experiment has been run 10 times and the presented results are built upon the mean values.
Results
For the benchmark experimental results a Non Power Aware (NPA) policy has been used. It does
not apply any power aware optimizations and means that all hosts run at 100% CPU utilization
and use maximum power. The second policy applies DVFS, but does not perform any adaptation
of allocation of VMs in run-time. The NPA policy leads to the total energy consumption of 9.15
KWh, whereas DVFS allows decreasing this value to 4.4 KWh for simulation setup.
Conclusion
Cloud computing is emerging as a significant shift as today's organizations which are
facing extreme data overload and skyrocketing energy costs. Green Cloud architecture, which
can help consolidate workload and achieve significant energy saving for cloud computing
environment, at the same time, guarantees the real-time performance for many performance-
sensitive applications.
In the future, there are still a number of research activities that we plan to carry out, which could
improve the performance of Green Cloud and bring solid value to users to achieve their business
goals and their social responsibility in Green IT Applying green technologies is highly essential
for the sustainable development of cloud computing. Of the various green methodologies
enquired, the DVFS technology is a highly hardware oriented approach and hence less flexible.
Green scheduling algorithms based on neural predictors can lead to a 70% power savings. These
policies also enable us to cut down data Centre energy costs, thus leading to a strong,
competitive cloud computing industry. End users will also benefit from the decreased energy
bills. As a conclusion, Green Cloud effectively saves energy by dynamically adapting to
workload leveraging live VM migrations, at the same time meeting system SLAs.
Distributed & Cloud Computing CSC 557 Sphoorthy Asuri Maringanti
References
1. “Energy efficient management of data center resources for cloud computing: A vision,
architectural elements and Open Challenges” Raj Kumar Buyya,Anton Beloglazov,Jemal
Abawajy Proc. of 9th IEEE International Symposium on Cluster Computing and the Grid
(CCGrid 2009), Rio De Janeiro, Brazil, May 2009.
2. “Performance Evaluation of a green Scheduling algorithm for energy savings in cloud
computing” Trough Vinh Troung Duy, Yukinori Sato, Yashushi Inoguchi IEEE Explore, March
2010
3. www.wikipedia.com/greencomputing.
4. www.ibm.com/developerworks/websphere/zones/hipods