Scalability and Elasticity in Cloud Computing
Scalability and Elasticity in Cloud Computing
In cloud computing, the term cloud scalability refers to the capacity to improve or reduce IT
resources, depending on the requirement changing demand. In other words, we can say that
scalability is employed to satisfy the static growth in the workload.
What is Elasticity?
Elasticity refers to the capability of a cloud to automatically boost or shorten the
infrastructural resources, depending on the requirement so that the workload can be handled
efficiently. This elasticity also helps in reducing the infrastructural expenditure.
Cloud elasticity is a system’s ability to increase (or decrease) its varying capacity-related
needs such as storage, networking, and computing based on specific criteria (think: total load
on the system).
Simply put, elasticity adapts to both the increase and decrease in workload by provisioning
and de-provisioning resources in an autonomous capacity.
Scalability Elasticity
Refers to a software system’s ability to scale up or Refers to the hardware layer, also known
scale out while processing a higher workload on as cloud infrastructure, to increase or
the current or additional hardware resources decrease physical resources without
without interrupting services or impacting physical service interruption
performance
Strengthens the hardware with additional nodes Adjusts the resources to accommodate
and increases the performance of a single dynamic scaling needs – the ability of
computing resource or a group of computer your resources to scale according to
resources specified criteria
The existing resources may increase to meet the The available resources correspond to the
future demands current demands, essential for cloud
environments where you pay-per-use, not
for resources you don’t currently need
Empowers companies to meet the demand for Empowers companies to meet unexpected
services with long-term, strategic needs changes and short-term, tactical needs
1 It is used just to fulfil the sudden requirement in It is used to fulfil the static boost in the
the workload for a short period. workload.
4 It is a short term event that is used to deal with It is a long term event that is used to deal
an unplanned or sudden growth in demand. with an expected growth in demand.
3. Diagonal scaling
Diagonal scaling involves horizontal and vertical scaling. It’s more flexible and cost-
effective as it helps add or remove resources as per existing workload requirements. Adding
and upgrading resources according to the varying system load and demand provides better
throughput and optimizes resources for even better performance.