Planes, Raft, and Pods
A Tour of Distributed Systems Within Kubernetes
@boluptuous
?
“Open-source platform for automating
deployment, scaling, and operations of
application containers across clusters of
hosts, providing container-centric
infrastructure”
- Kubernetes Documentation
???????
Flexible platform for running
containerized apps!
● Autoscaling
● Rolling Deploys
● Secret Management
● Load Balancing
● Auto-Recovery from Failures
How does Kubernetes
leverage distributed
systems?
What is a container?
Cgroups + namespaces
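To make that concrete, here's a minimal, Linux-only Go sketch of the namespace half of the trick: launch a process inside fresh UTS, PID, and mount namespaces. Cgroup limits (memory, CPU) would be applied separately by writing to /sys/fs/cgroup. This is an illustration, not how any real container runtime is structured.

package main

import (
    "os"
    "os/exec"
    "syscall"
)

func main() {
    cmd := exec.Command("/bin/sh")
    cmd.Stdin, cmd.Stdout, cmd.Stderr = os.Stdin, os.Stdout, os.Stderr
    // New UTS (hostname), PID, and mount namespaces isolate the
    // child process from the rest of the host.
    cmd.SysProcAttr = &syscall.SysProcAttr{
        Cloneflags: syscall.CLONE_NEWUTS | syscall.CLONE_NEWPID | syscall.CLONE_NEWNS,
    }
    if err := cmd.Run(); err != nil {
        panic(err)
    }
}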
Pod = 1 or more containers
Deployments manage pods
Kubernetes Architecture!
etcd!
Why etcd?
etcd is designed for “large scale distributed
systems… that never tolerate split brain
behavior and are willing to sacrifice
availability” to achieve it
- etcd Documentation
Simple interface hides
complex problems
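For example, the Go client makes etcd look like a plain key/value map. A hedged sketch (the endpoint and key below are made up, and the import path is the one in use around the time of this talk):

package main

import (
    "context"
    "fmt"
    "time"

    "github.com/coreos/etcd/clientv3"
)

func main() {
    cli, err := clientv3.New(clientv3.Config{
        Endpoints:   []string{"localhost:2379"},
        DialTimeout: 5 * time.Second,
    })
    if err != nil {
        panic(err)
    }
    defer cli.Close()

    ctx, cancel := context.WithTimeout(context.Background(), time.Second)
    defer cancel()

    // Looks like a simple map write...
    if _, err := cli.Put(ctx, "/registry/pods/default/hello", "spec"); err != nil {
        panic(err)
    }
    // ...but under the hood every Put is a Raft proposal that must be
    // replicated to a majority of the cluster before it is acknowledged.
    resp, err := cli.Get(ctx, "/registry/pods/default/hello")
    if err != nil {
        panic(err)
    }
    for _, kv := range resp.Kvs {
        fmt.Printf("%s -> %s\n", kv.Key, kv.Value)
    }
}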
Let’s look at a Not Raft system
New Scenario!
Consensus requires
coordination
Raft = a consensus algorithm
for managing a replicated log
Elected leader is put in
charge of managing the log
Three States!
● Leader
● Follower
● Candidate
One leader per term
Leader sends heartbeat
messages
What happens if a follower
doesn’t get a heartbeat?
Election time!
In the game of Raft leadership elections, you win or you lose.
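A hedged Go sketch of the follower's side of this: wait a randomized timeout for a heartbeat, and if none arrives, increment the term and stand as a candidate. All names here (node, heartbeatCh) are illustrative, not from any real implementation.

package main

import (
    "fmt"
    "math/rand"
    "time"
)

type state int

const (
    follower state = iota
    candidate
    leader
)

type node struct {
    state       state
    term        int
    heartbeatCh chan struct{} // signaled when a heartbeat arrives
}

func (n *node) run() {
    for n.state == follower {
        // Randomized timeouts make it unlikely that two followers
        // start competing elections at exactly the same moment.
        timeout := time.Duration(150+rand.Intn(150)) * time.Millisecond
        select {
        case <-n.heartbeatCh:
            // Leader is alive; reset the timer and keep following.
        case <-time.After(timeout):
            n.term++
            n.state = candidate
            fmt.Printf("no heartbeat: starting election for term %d\n", n.term)
            // A real candidate would now send RequestVote RPCs and
            // become leader on receiving a majority of votes.
        }
    }
}

func main() {
    n := &node{heartbeatCh: make(chan struct{})}
    go n.run()
    time.Sleep(500 * time.Millisecond) // no heartbeats ever arrive
}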
1. Write goes to the leader
2. Leader appends the command to its log
3. Leader tells the other servers via RPC to append it to their logs (followers say no if they're behind)
4. Once a majority have appended, the leader commits
5. Leader tells the other nodes the last committed entry in subsequent messages
6. Nodes commit (this commit rule is sketched below)
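A hedged Go sketch of step 4's commit rule: the leader advances its commit index once a majority of the cluster (leader included) holds an entry. This is illustrative pseudocode, not etcd's actual implementation.

package main

import "fmt"

type entry struct {
    term    int
    command string
}

type leaderState struct {
    log         []entry
    commitIndex int   // highest log index known to be committed
    matchIndex  []int // per follower: highest index replicated there
}

// maybeCommit advances commitIndex to the highest index that a
// majority of the cluster has replicated.
func (l *leaderState) maybeCommit() {
    for idx := len(l.log); idx > l.commitIndex; idx-- {
        replicas := 1 // the leader itself has the entry
        for _, m := range l.matchIndex {
            if m >= idx {
                replicas++
            }
        }
        if replicas > (len(l.matchIndex)+1)/2 {
            l.commitIndex = idx
            // Followers learn the new commitIndex in the next
            // AppendEntries (heartbeat) message and apply up to it.
            break
        }
    }
}

func main() {
    l := &leaderState{
        log:        []entry{{1, "set x=1"}, {1, "set x=2"}},
        matchIndex: []int{2, 1, 0, 0}, // four followers, five nodes total
    }
    l.maybeCommit()
    fmt.Println("committed through index", l.commitIndex) // 1: leader plus two followers have entry 1
}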
Solves problems in our bad system
Consistency and partition-tolerance
are achieved by requiring a
majority of nodes to act
Further Raft Reading
● The Raft Paper
● The Secret Lives of Data
(Raft Visualization)
Controller = a loop that watches cluster
state and makes changes to keep it
in the desired state
Replica set controller makes sure a
given number of pods is running
at all times
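A hedged Go sketch of that reconciliation loop: compare the observed pod count with the desired count, then create or delete pods to close the gap. The helper names (observedPods, desiredReplicas, createPod, deletePod) are placeholders, not real client-go calls.

package main

import (
    "fmt"
    "time"
)

var cluster = []string{"pod-a"} // stand-in for actual cluster state

func observedPods() int    { return len(cluster) }
func desiredReplicas() int { return 3 }
func createPod()           { cluster = append(cluster, fmt.Sprintf("pod-%d", len(cluster))) }
func deletePod()           { cluster = cluster[:len(cluster)-1] }

func main() {
    for i := 0; i < 5; i++ { // a real controller loops forever
        switch got, want := observedPods(), desiredReplicas(); {
        case got < want:
            createPod()
            fmt.Println("scaled up to", observedPods())
        case got > want:
            deletePod()
            fmt.Println("scaled down to", observedPods())
        default:
            fmt.Println("state reconciled")
        }
        time.Sleep(10 * time.Millisecond) // real controllers use watches, not polling
    }
}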
Deployment controller manages the
whole deployment process of your app
Scheduler watches for unscheduled pods and
assigns them to a given node
The Scheduling Algorithm
1. Filter out nodes that aren't desired or aren't a good fit
2. Rank the remaining nodes
3. Pick the top-ranked node
Step 1: Filter Against Predicates
● HostName
● MatchNodeSelector
● PodFitsHostPort
● PodFitsResources
● CheckNodeMemoryPressure
● CheckNodeDiskPressure
Ranking applies a series of
weighted priority functions
that return a score from 0 to
10 (least to most desirable)
Functions are run against
each node, their scores are
summed, and the node with the
highest total is the winner!
Some Ranking Functions
● LeastRequestedPriority
● BalancedResourceAllocation
● SelectorSpreadPriority
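Putting the two phases together, a hedged Go sketch of filter-then-rank scheduling. The predicate and priority functions below are simplified stand-ins for PodFitsResources, LeastRequestedPriority, and BalancedResourceAllocation, not the real implementations.

package main

import "fmt"

type node struct {
    name    string
    freeCPU int // percent of CPU still unrequested
    freeMem int // percent of memory still unrequested
}

// Predicate: does the node have room for a pod requesting 10/10?
func podFitsResources(n node) bool { return n.freeCPU >= 10 && n.freeMem >= 10 }

// Priority: favor the least-requested node, scored 0-10.
func leastRequested(n node) int { return (n.freeCPU + n.freeMem) / 20 }

// Priority: favor balanced CPU/memory usage, scored 0-10.
func balanced(n node) int {
    d := n.freeCPU - n.freeMem
    if d < 0 {
        d = -d
    }
    return 10 - d/10
}

func schedule(nodes []node) (best string, bestScore int) {
    for _, n := range nodes {
        if !podFitsResources(n) { // step 1: filter against predicates
            continue
        }
        score := leastRequested(n) + balanced(n) // step 2: rank (both weights 1)
        if score > bestScore {                   // step 3: pick the winner
            best, bestScore = n.name, score
        }
    }
    return best, bestScore
}

func main() {
    nodes := []node{
        {"node-1", 5, 90}, // filtered out: not enough free CPU
        {"node-2", 50, 60},
        {"node-3", 80, 70},
    }
    name, score := schedule(nodes)
    fmt.Printf("scheduling onto %s (score %d)\n", name, score)
}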
What happens when we
submit a deployment to
Kubernetes?
apiVersion: apps/v1beta1
kind: Deployment
metadata:
  name: hello-world-deployment
spec:
  replicas: 3
  template:
    metadata:
      labels:
        app: hello-world
    spec:
      containers:
      - name: hello-world
        image: tutum/hello-world
        ports:
        - containerPort: 80
How do we submit our deployment?
kubectl! (e.g. kubectl create -f deployment.yaml)
What We Expect
1. We create the deployment
2. The deployment creates a replica set
3. The replica set creates three pods
4. The scheduler schedules those three pods
5. The kubelet runs the scheduled pods
What actually happens...
These are REAL events
involvedObject:
  apiVersion: extensions
  kind: Deployment
  name: hello-world-deployment
  resourceVersion: "1097"
  uid: c8806e58-5371-11e7-9d84-0800278f5909
kind: Event
lastTimestamp: 2017-06-17T15:29:41Z
message: Scaled up replica set hello-world-deployment-3877114392 to 3
reason: ScalingReplicaSet
source:
  component: deployment-controller
type: Normal

involvedObject:
  apiVersion: extensions
  kind: ReplicaSet
  name: hello-world-deployment-3877114392
  namespace: default
  resourceVersion: "1098"
  uid: c8811ca5-5371-11e7-9d84-0800278f5909
kind: Event
lastTimestamp: 2017-06-17T15:29:41Z
message: 'Created pod: hello-world-deployment-3877114392-jk3ps'
reason: SuccessfulCreate
source:
  component: replicaset-controller
type: Normal

involvedObject:
  apiVersion: extensions
  kind: ReplicaSet
  name: hello-world-deployment-3877114392
  namespace: default
  resourceVersion: "1098"
  uid: c8811ca5-5371-11e7-9d84-0800278f5909
kind: Event
lastTimestamp: 2017-06-17T15:29:41Z
message: 'Created pod: hello-world-deployment-3877114392-nt62j'
reason: SuccessfulCreate
source:
  component: replicaset-controller
type: Normal

involvedObject:
  apiVersion: v1
  kind: Pod
  name: hello-world-deployment-3877114392-nt62j
  namespace: default
  resourceVersion: "1102"
  uid: c8833786-5371-11e7-9d84-0800278f5909
kind: Event
lastTimestamp: 2017-06-17T15:29:41Z
message: Successfully assigned hello-world-deployment-3877114392-nt62j to minikube
reason: Scheduled
source:
  component: default-scheduler
type: Normal

involvedObject:
  apiVersion: extensions
  kind: ReplicaSet
  name: hello-world-deployment-3877114392
  namespace: default
  resourceVersion: "1098"
  uid: c8811ca5-5371-11e7-9d84-0800278f5909
kind: Event
lastTimestamp: 2017-06-17T15:29:41Z
message: 'Created pod: hello-world-deployment-3877114392-c71lp'
reason: SuccessfulCreate
source:
  component: replicaset-controller
type: Normal

involvedObject:
  apiVersion: v1
  kind: Pod
  name: hello-world-deployment-3877114392-jk3ps
  namespace: default
  resourceVersion: "1103"
  uid: c88336ab-5371-11e7-9d84-0800278f5909
kind: Event
lastTimestamp: 2017-06-17T15:29:41Z
message: Successfully assigned hello-world-deployment-3877114392-jk3ps to minikube
reason: Scheduled
source:
  component: default-scheduler
type: Normal

involvedObject:
  apiVersion: v1
  kind: Pod
  name: hello-world-deployment-3877114392-c71lp
  namespace: default
  resourceVersion: "1104"
  uid: c8833ea2-5371-11e7-9d84-0800278f5909
kind: Event
lastTimestamp: 2017-06-17T15:29:41Z
message: Successfully assigned hello-world-deployment-3877114392-c71lp to minikube
reason: Scheduled
source:
  component: default-scheduler
type: Normal

involvedObject:
  apiVersion: v1
  fieldPath: spec.containers{hello-world}
  kind: Pod
  name: hello-world-deployment-3877114392-c71lp
  namespace: default
  resourceVersion: "1111"
  uid: c8833ea2-5371-11e7-9d84-0800278f5909
kind: Event
lastTimestamp: 2017-06-17T15:29:42Z
message: pulling image "tutum/hello-world"
reason: Pulling
source:
  component: kubelet
  host: minikube
type: Normal

involvedObject:
  apiVersion: v1
  fieldPath: spec.containers{hello-world}
  kind: Pod
  name: hello-world-deployment-3877114392-jk3ps
  namespace: default
  resourceVersion: "1109"
  uid: c88336ab-5371-11e7-9d84-0800278f5909
kind: Event
lastTimestamp: 2017-06-17T15:29:42Z
message: pulling image "tutum/hello-world"
reason: Pulling
source:
  component: kubelet
  host: minikube
type: Normal

involvedObject:
  apiVersion: v1
  fieldPath: spec.containers{hello-world}
  kind: Pod
  name: hello-world-deployment-3877114392-nt62j
  namespace: default
  resourceVersion: "1106"
  uid: c8833786-5371-11e7-9d84-0800278f5909
kind: Event
lastTimestamp: 2017-06-17T15:29:42Z
message: pulling image "tutum/hello-world"
reason: Pulling
source:
  component: kubelet
  host: minikube
type: Normal

involvedObject:
  apiVersion: v1
  fieldPath: spec.containers{hello-world}
  kind: Pod
  name: hello-world-deployment-3877114392-c71lp
  namespace: default
  resourceVersion: "1111"
  uid: c8833ea2-5371-11e7-9d84-0800278f5909
kind: Event
lastTimestamp: 2017-06-17T15:29:44Z
message: Successfully pulled image "tutum/hello-world"
reason: Pulled
source:
  component: kubelet
  host: minikube
type: Normal

involvedObject:
  apiVersion: v1
  fieldPath: spec.containers{hello-world}
  kind: Pod
  name: hello-world-deployment-3877114392-c71lp
  namespace: default
  resourceVersion: "1111"
  uid: c8833ea2-5371-11e7-9d84-0800278f5909
kind: Event
lastTimestamp: 2017-06-17T15:29:44Z
message: Created container with id c71c2605bcb7ab52e0c4fc7e08545664c628dd8eb5ceea20eff5ccff4afb865d
reason: Created
source:
  component: kubelet
  host: minikube

involvedObject:
  apiVersion: v1
  fieldPath: spec.containers{hello-world}
  kind: Pod
  name: hello-world-deployment-3877114392-c71lp
  namespace: default
  resourceVersion: "1111"
  uid: c8833ea2-5371-11e7-9d84-0800278f5909
kind: Event
lastTimestamp: 2017-06-17T15:29:44Z
message: Started container with id c71c2605bcb7ab52e0c4fc7e08545664c628dd8eb5ceea20eff5ccff4afb865d
reason: Started
source:
  component: kubelet
  host: minikube

involvedObject:
  apiVersion: v1
  fieldPath: spec.containers{hello-world}
  kind: Pod
  name: hello-world-deployment-3877114392-jk3ps
  namespace: default
  resourceVersion: "1109"
  uid: c88336ab-5371-11e7-9d84-0800278f5909
kind: Event
lastTimestamp: 2017-06-17T15:29:45Z
message: Successfully pulled image "tutum/hello-world"
reason: Pulled
source:
  component: kubelet
  host: minikube

involvedObject:
  apiVersion: v1
  fieldPath: spec.containers{hello-world}
  kind: Pod
  name: hello-world-deployment-3877114392-jk3ps
  namespace: default
  resourceVersion: "1109"
  uid: c88336ab-5371-11e7-9d84-0800278f5909
kind: Event
lastTimestamp: 2017-06-17T15:29:45Z
message: Created container with id 26cc7eff24538a09647a8a595d606c1988ca802a74d1930fdcb801aafc624075
reason: Created
source:
  component: kubelet
  host: minikube

involvedObject:
  apiVersion: v1
  fieldPath: spec.containers{hello-world}
  kind: Pod
  name: hello-world-deployment-3877114392-jk3ps
  namespace: default
  resourceVersion: "1109"
  uid: c88336ab-5371-11e7-9d84-0800278f5909
kind: Event
lastTimestamp: 2017-06-17T15:29:45Z
message: Started container with id 26cc7eff24538a09647a8a595d606c1988ca802a74d1930fdcb801aafc624075
reason: Started
source:
  component: kubelet
  host: minikube

involvedObject:
  apiVersion: v1
  fieldPath: spec.containers{hello-world}
  kind: Pod
  name: hello-world-deployment-3877114392-nt62j
  namespace: default
  resourceVersion: "1106"
  uid: c8833786-5371-11e7-9d84-0800278f5909
kind: Event
lastTimestamp: 2017-06-17T15:29:46Z
message: Successfully pulled image "tutum/hello-world"
reason: Pulled
source:
  component: kubelet
  host: minikube

involvedObject:
  apiVersion: v1
  fieldPath: spec.containers{hello-world}
  kind: Pod
  name: hello-world-deployment-3877114392-nt62j
  namespace: default
  resourceVersion: "1106"
  uid: c8833786-5371-11e7-9d84-0800278f5909
kind: Event
lastTimestamp: 2017-06-17T15:29:46Z
message: Created container with id ef8a53303a35c539d7a16f2d633d7f7f6f70d42c6dc7f629da771a49a95a0c25
reason: Created
source:
  component: kubelet
  host: minikube

involvedObject:
  apiVersion: v1
  fieldPath: spec.containers{hello-world}
  kind: Pod
  name: hello-world-deployment-3877114392-nt62j
  namespace: default
  resourceVersion: "1106"
  uid: c8833786-5371-11e7-9d84-0800278f5909
kind: Event
lastTimestamp: 2017-06-17T15:29:46Z
message: Started container with id ef8a53303a35c539d7a16f2d633d7f7f6f70d42c6dc7f629da771a49a95a0c25
reason: Started
source:
  component: kubelet
  host: minikube
We’ve got a running app...
...but no way to talk to it.
We’ll add a service!
kind: Service
apiVersion: v1
metadata:
  name: hello-world-service
spec:
  selector:
    app: hello-world
  ports:
  - protocol: TCP
    port: 80
  type: NodePort
We did it!
Things We’ve Done
● Looked at Kubernetes components
● Showed how Kubernetes handles distributed state
● Dove into how controllers reconcile state and the scheduler places pods
● Traced a deployment through the system
Questions?

Editor's Notes

  • #6: https://kubernetes.io/docs/concepts/overview/what-is-kubernetes/
  • #9: These are pretty cool things, and most talks about Kubernetes concentrate on them. I'd like to do something a little different. Let's all agree that Kubernetes can do these things and that they're cool. I want to peek behind the curtain.
  • #12: https://jvns.ca/blog/2016/10/10/what-even-is-a-container/ Cgroups - specify limits for how much memory and CPU a process can use Namespace - stop process from interfering with other processes Containerized app examples - nginx deployment, microservice, Redis instance
  • #13: 1 or more containers that share a unique IP guaranteed to have all containers running on the same host can share disk volumes (ex. App that logs to file, sidecar to forward logs)
  • #14: Declarative way of managing pods What containers we want to run and how many instances Use replica sets under the hood - make sure # running in cluster is = # desired
  • #15: Etcd - distributed k/v store that serves as Kubernetes database API server - controls access to etcd - interacted with via REST calls Controller manager - background threads that handle routine cluster tasks such as making sure a deployment has enough pods deployed and triggers the scheduling of new ones if needed scheduler - schedules unscheduled pods, whenever there’s an unscheduled pod, the scheduler determines where the appropriate home for it is. These components collectively make up the control plane and run on the master hosts in the cluster. Kubelet - watches for pods that have been assigned to node and runs them, constantly polls API server + local config, cAdvisor - metrics collector Kube proxy - handles some networking stuff
  • #17: https://www.comp.nus.edu.sg/~gilbert/pubs/BrewersConjecture-SigAct.pdf Consistency - always reading the most recently written value Availability - every request to a non-failing node receives a response Partition-tolerance - system can handle the dropping of messages between nodes Etcd is consistent/partition-tolerant If in a 5 node cluster, 3 nodes take a dive, the other 2 stop responding to requests until the quorum is restored
  • #18: If your goal is to have a system for running containerized applications, you need something that plays nice with clustered systems
  • #19: CAP theorem reference - etcd is going to sacrifice availability to preserve consistency Clarification -> Etcd unavailability doesn’t take down k8s, just the ability to mutate
  • #20: Typical CRUD operations through rest calls or command line tool - etcdctl How do the nodes agree on what a value is for a given key? If not done smartly, things can go awry pretty quickly How does etcd agree on the value for a given key while still upholding its consistency guarantee?
  • #21: Method of achieving distributed consensus - having multiple distributed servers agree on what the value is for a given key in a fault tolerant manner Trivial way to solve - the value is always 3
  • #22: Typical CRUD operations through rest calls or command line tool - etcdctl How do the nodes agree on what a value is for a given key? If not done smartly, things can go awry pretty quickly How does etcd agree on the value for a given key while still upholding its consistency guarantee?
  • #23: Imagine a system that does NOT implement the Raft algorithm Three nodes, A, B, C that each store a single value When a client wants to read a value it contacts any node and gets its value When a client wants to write a value, it contacts any node, and that node tells all the other nodes of the new value
  • #26: Now contemplate what happens if three clients update the value in three different ways at the same time. At time t=0, 3 clients write 3 different values to our 3 nodes simultaneously
  • #27: At time t=5, after all updates have been applied, what’s the value, assuming no other writes have occurred? No idea! In fact, it’s possible for each node to have a different value depending on what order messages were received in
  • #31: All of the previous messages have been lost forever After the network recovers, the value at C is still going to be M until someone updates the value.
  • #32: Seemingly simple systems can fail in all sorts of Fun ways when exposed to concurrent operations In order for consensus to be achieved, we're going to require greater coordination between our nodes This is where Raft comes into play
  • #33: Replicated log - series of commands executed in order by a state machine We want each log to have the same commands in the same order so that each machine has the same state
  • #34: Accepts entries from clients, replicates them on other nodes, says when it’s safe to actually apply them If leader fails, a new one is elected
  • #35: Followers just hang out, respond to requests from leaders Leader handles all the requests and does all of the fun coordination Candidate is the state used to elect a new leader
  • #36: Part of dealing with ordered statements is dealing with time Raft uses an incrementing integer timestamp called a term, which is tied to an election Terms last until there's a new leader Serves as a logical clock and aids in detecting obsolete info (like stale leaders)
  • #37: Leaders send heartbeat messages to the follower nodes, saying “Hey I’m still alive”
  • #38: Follower increments term and declares itself as a candidate Sends an RPC request out for votes to all of the other servers
  • #39: Servers vote for the first candidate who asks for their votes, assuming the candidate is at least as up to date as the voter. If the voter has entries that the candidate doesn’t, then the voter isn’t going to vote for that candidate. If the candidate gets a majority of the votes, it wins! It sends out a new set of heartbeat messages to notify others of its victory, and life goes on
  • #41: Simultaneous write scenario solved by requirement that all writes go to leader Leader receives each write, and each write is a new entry with a new index -- can’t occur simultaneously Network partition scenario - followers reject requests if they’re behind and will eventually be caught up also, requests go to the leader and our dead node can’t become leader because it’s behind - effect is minimized
  • #42: Elections require a majority of the nodes to agree to elect a leader All writes require a majority of the nodes to replicate the transaction to be successful If we lose more than half the nodes, then we’re unable to do anything
  • #43: https://raft.github.io/ http://thesecretlivesofdata.com/raft/
  • #45: Kubernetes actually suggests you don't interact with replica sets directly; instead we use the deployment controller to manage them. A replica set knows which pods to manage through labels: we define labels on the pods we create and tell the replica set which labels it should look for so that it knows which pods to manage
  • #46: Provides a declarative way of managing pods and uses that to roll out your desired changes in a friendly way Handles rolling deploys, rolling back your application, and autoscaling it out as well
  • #47: What it’s looking for specifically are pods without a node name Constantly querying the API server (and by extension, etcd) looking for those unscheduled pods
  • #49: HostName - compares the node’s name to the node name specified in the pod spec (if any), excludes every node that doesn’t match, lets us schedule directly to that node, assuming it passes other predicates MatchNodeSelector - variant of HostName, Kubernetes let you provide label selectors to associate resources using labels, this predicate checks that the node selector supplied in the pod spec is valid for a given node Scheduler also checks the resources of each node PodFitsHostPort - whether or not the host port (a hard-coded port specified in the pod spec) is available for that node, if not available -> filtered out PodFitsResources - another resource-aware check, pods can request a given amount of CPU and memory; this predicate checks whether the node is capable of satisfying that request CheckNodeMemoryPressure and CheckNodeDiskPressure won’t schedule onto nodes whose memory usage or disk usage is too high Also volume checks - make sure that we’re not using more than allowed (for cloud providers) or there’s no conflicts on volume claims
  • #51: Ties are broken by randomly picking a winner
  • #52: Most obvious strategy is put our pod on the least used node - LeastRequestedPriority - calculates how much CPU + memory (equally weighted and added together) would be left after scheduling pod to that node - helps with balancing resource usage across cluster BalancedResourceAllocation - attempts to prevent nodes from being largely weighted towards CPU or memory usage - try to avoid nodes with 95% CPU usage and 5% memory usage Checks to see how the pod affects the balance of resource usage, favoring nodes that would have CPU utilization closer to memory utilization after scheduling Last one I want to look at is SelectorSpreadPriority Lose a great chunk of the benefits of having multiple copies of an application if they're all sitting on the same node. If the node goes down, you'll lose every copy of the app that's running on the node (insert metaphor about eggs and baskets) This function minimizes the number of pods from the same service or replica set on the same node, causing our currently-being-scheduled pods to favor nodes without their managed siblings
  • #54: Specify we want three instances We’re going to label it with the tag ‘app:hello-world’ Going to launch the hello-world container, when we hit port 80 it’ll return a hello world!
  • #55: We use kubectl to actually get our app running in Kubernetes Kubectl is how we interact with the cluster - can view resources as well as modify them Gives a wonderful view of what's going on with our cluster We can check the status of our pods, our deployments, and importantly for us right now, use it to construct a timeline of events
  • #58: Though they’ve been formatted to fit the screen
  • #80: A service defines a logical set of pods and a policy by which to access them - load balancer, port exposed on the host, external IP
  • #81: Defines a service that “selects” pods with the app tag with a value of hello-world Exposes the service as a port on the node Can't do a load balancer because that requires a cloud provider, otherwise we could create an Amazon ELB or Google Cloud's equivalent We'll also create this via kubectl