2017-08-29 00:19:25 +03:00
# Bootstrapping the Kubernetes Control Plane
In this lab you will bootstrap the Kubernetes control plane across three compute instances and configure it for high availability. You will also create an external load balancer that exposes the Kubernetes API Servers to remote clients. The following components will be installed on each node: Kubernetes API Server, Scheduler, and Controller Manager.
## Prerequisites
The commands in this lab must be run on each controller instance: `controller-0`, `controller-1`, and `controller-2`. Log in to each controller instance using the `gcloud` command (GCP) or `ssh` (AWS). Example:
<details open>
<summary>GCP</summary>

```
gcloud compute ssh controller-0
```

</details>
<details>
<summary>AWS</summary>

```
VPC_ID="$(aws ec2 describe-vpcs \
  --filters Name=tag-key,Values=kubernetes.io/cluster/kubernetes-the-hard-way \
  --profile kubernetes-the-hard-way \
  --query 'Vpcs[0].VpcId' \
  --output text)"

get_ip() {
  aws ec2 describe-instances \
    --filters \
      Name=vpc-id,Values="$VPC_ID" \
      Name=tag:Name,Values="$1" \
    --profile kubernetes-the-hard-way \
    --query 'Reservations[0].Instances[0].PublicIpAddress' \
    --output text
}
```

```
ssh -i ~/.ssh/kubernetes-the-hard-way "ubuntu@$(get_ip controller-0)"
```

</details>

### Running commands in parallel with tmux
[tmux](https://github.com/tmux/tmux/wiki) can be used to run commands on multiple compute instances at the same time. See the [Running commands in parallel with tmux](01-prerequisites.md#running-commands-in-parallel-with-tmux) section in the Prerequisites lab.
## Provision the Kubernetes Control Plane
Create the Kubernetes configuration directory:
```
sudo mkdir -p /etc/kubernetes/config
```
### Download and Install the Kubernetes Controller Binaries
Download the official Kubernetes release binaries:
```
wget -q --show-progress --https-only --timestamping \
"https://storage.googleapis.com/kubernetes-release/release/v1.10.2/bin/linux/amd64/kube-apiserver" \
"https://storage.googleapis.com/kubernetes-release/release/v1.10.2/bin/linux/amd64/kube-controller-manager" \
"https://storage.googleapis.com/kubernetes-release/release/v1.10.2/bin/linux/amd64/kube-scheduler" \
"https://storage.googleapis.com/kubernetes-release/release/v1.10.2/bin/linux/amd64/kubectl"
```
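The four URLs above differ only in the binary name, so the release version can be kept in one place. A minimal sketch, assuming a hypothetical `binary_urls` helper that is not part of the original lab; the URL layout is copied from the `wget` command above:

```shell
# binary_urls VERSION: print one download URL per control plane binary.
# Hypothetical helper for illustration only.
binary_urls() {
  for bin in kube-apiserver kube-controller-manager kube-scheduler kubectl; do
    printf '%s\n' "https://storage.googleapis.com/kubernetes-release/release/$1/bin/linux/amd64/$bin"
  done
}

binary_urls v1.10.2
```

The list could then be fed to the same download command, for example `binary_urls v1.10.2 | wget -q --show-progress --https-only --timestamping -i -`.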
Install the Kubernetes binaries:
```
{
chmod +x kube-apiserver kube-controller-manager kube-scheduler kubectl
sudo mv kube-apiserver kube-controller-manager kube-scheduler kubectl /usr/local/bin/
}
```
### Configure the Kubernetes API Server
```
{
sudo mkdir -p /var/lib/kubernetes/
sudo mv ca.pem ca-key.pem kubernetes-key.pem kubernetes.pem \
service-account-key.pem service-account.pem \
encryption-config.yaml /var/lib/kubernetes/
}
```
The instance internal IP address will be used to advertise the API Server to members of the cluster. Retrieve the internal IP address for the current compute instance:
<details open>
<summary>GCP</summary>

```
INTERNAL_IP=$(curl -s -H "Metadata-Flavor: Google" \
  http://metadata.google.internal/computeMetadata/v1/instance/network-interfaces/0/ip)
```

</details>
<details>
<summary>AWS</summary>

```
INTERNAL_IP="$(curl -s http://169.254.169.254/latest/meta-data/local-ipv4)"
```

</details>
<p></p>

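If the metadata request fails, `INTERNAL_IP` ends up empty and the API server unit file would be written with a blank `--advertise-address`. A quick sanity check before continuing can catch that. The sketch below assumes a hypothetical `is_ipv4` helper (not part of the original lab) written in plain POSIX shell:

```shell
# is_ipv4 VALUE: succeed only if VALUE looks like a dotted-quad IPv4 address.
# Hypothetical helper for illustration only.
is_ipv4() {
  case "$1" in
    ''|*[!0-9.]*|.*|*.|*..*) return 1 ;;   # empty, stray characters, or empty octets
  esac
  old_ifs=$IFS
  IFS=.
  set -- $1                                # split the address on dots
  IFS=$old_ifs
  [ "$#" -eq 4 ] || return 1
  for octet in "$@"; do
    [ "${#octet}" -le 3 ] && [ "$octet" -le 255 ] || return 1
  done
}

# Example use on a controller, after INTERNAL_IP has been set:
#   is_ipv4 "$INTERNAL_IP" || echo "unexpected INTERNAL_IP: '$INTERNAL_IP'" >&2
```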
Create the `kube-apiserver.service` systemd unit file:
```
cat <<EOF | sudo tee /etc/systemd/system/kube-apiserver.service
[Unit]
Description=Kubernetes API Server
Documentation=https://github.com/kubernetes/kubernetes

[Service]
ExecStart=/usr/local/bin/kube-apiserver \\
  --advertise-address=${INTERNAL_IP} \\
  --allow-privileged=true \\
  --apiserver-count=3 \\
  --audit-log-maxage=30 \\
  --audit-log-maxbackup=3 \\
  --audit-log-maxsize=100 \\
  --audit-log-path=/var/log/audit.log \\
  --authorization-mode=Node,RBAC \\
  --bind-address=0.0.0.0 \\
  --client-ca-file=/var/lib/kubernetes/ca.pem \\
  --enable-admission-plugins=Initializers,NamespaceLifecycle,NodeRestriction,LimitRanger,ServiceAccount,DefaultStorageClass,ResourceQuota \\
  --enable-swagger-ui=true \\
  --etcd-cafile=/var/lib/kubernetes/ca.pem \\
  --etcd-certfile=/var/lib/kubernetes/kubernetes.pem \\
  --etcd-keyfile=/var/lib/kubernetes/kubernetes-key.pem \\
  --etcd-servers=https://10.240.0.10:2379,https://10.240.0.11:2379,https://10.240.0.12:2379 \\
  --event-ttl=1h \\
  --experimental-encryption-provider-config=/var/lib/kubernetes/encryption-config.yaml \\
  --kubelet-certificate-authority=/var/lib/kubernetes/ca.pem \\
  --kubelet-client-certificate=/var/lib/kubernetes/kubernetes.pem \\
  --kubelet-client-key=/var/lib/kubernetes/kubernetes-key.pem \\
  --kubelet-https=true \\
  --runtime-config=api/all \\
  --service-account-key-file=/var/lib/kubernetes/service-account.pem \\
  --service-cluster-ip-range=10.32.0.0/24 \\
  --service-node-port-range=30000-32767 \\
  --tls-cert-file=/var/lib/kubernetes/kubernetes.pem \\
  --tls-private-key-file=/var/lib/kubernetes/kubernetes-key.pem \\
  --v=2
Restart=on-failure
RestartSec=5

[Install]
WantedBy=multi-user.target
EOF
```
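Because the heredoc delimiter `EOF` is unquoted, the shell expands `${INTERNAL_IP}` and turns each `\\` into a single `\` before `tee` writes the file, so the unit file on disk contains a literal IP address and ordinary line continuations. A small sketch of that behavior, using a made-up address and a shortened `ExecStart` purely for illustration:

```shell
# Demo value only; on a controller this comes from the metadata service.
INTERNAL_IP=10.240.0.10

# render_demo is a hypothetical helper that prints what the heredoc produces.
render_demo() {
cat <<EOF
ExecStart=/usr/local/bin/kube-apiserver \\
  --advertise-address=${INTERNAL_IP} \\
  --v=2
EOF
}

render_demo
```

Running `grep advertise-address /etc/systemd/system/kube-apiserver.service` after creating the real unit file is a quick way to confirm the address was substituted.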
### Configure the Kubernetes Controller Manager
Move the `kube-controller-manager` kubeconfig into place:
```
sudo mv kube-controller-manager.kubeconfig /var/lib/kubernetes/
```
Create the `kube-controller-manager.service` systemd unit file:
<details open>
<summary>GCP</summary>

```
cat <<EOF | sudo tee /etc/systemd/system/kube-controller-manager.service
[Unit]
Description=Kubernetes Controller Manager
Documentation=https://github.com/kubernetes/kubernetes

[Service]
ExecStart=/usr/local/bin/kube-controller-manager \\
  --address=0.0.0.0 \\
  --cluster-cidr=10.200.0.0/16 \\
  --cluster-name=kubernetes \\
  --cluster-signing-cert-file=/var/lib/kubernetes/ca.pem \\
  --cluster-signing-key-file=/var/lib/kubernetes/ca-key.pem \\
  --kubeconfig=/var/lib/kubernetes/kube-controller-manager.kubeconfig \\
  --leader-elect=true \\
  --root-ca-file=/var/lib/kubernetes/ca.pem \\
  --service-account-private-key-file=/var/lib/kubernetes/service-account-key.pem \\
  --service-cluster-ip-range=10.32.0.0/24 \\
  --use-service-account-credentials=true \\
  --v=2
Restart=on-failure
RestartSec=5

[Install]
WantedBy=multi-user.target
EOF
```

</details>
<details>
<summary>AWS</summary>

```
cat <<EOF | sudo tee /etc/systemd/system/kube-controller-manager.service
[Unit]
Description=Kubernetes Controller Manager
Documentation=https://github.com/kubernetes/kubernetes

[Service]
ExecStart=/usr/local/bin/kube-controller-manager \\
  --address=0.0.0.0 \\
  --cluster-cidr=10.200.0.0/16 \\
  --cluster-name=kubernetes-the-hard-way \\
  --cluster-signing-cert-file=/var/lib/kubernetes/ca.pem \\
  --cluster-signing-key-file=/var/lib/kubernetes/ca-key.pem \\
  --kubeconfig=/var/lib/kubernetes/kube-controller-manager.kubeconfig \\
  --leader-elect=true \\
  --root-ca-file=/var/lib/kubernetes/ca.pem \\
  --service-account-private-key-file=/var/lib/kubernetes/service-account-key.pem \\
  --service-cluster-ip-range=10.32.0.0/24 \\
  --use-service-account-credentials=true \\
  --v=2
Restart=on-failure
RestartSec=5

[Install]
WantedBy=multi-user.target
EOF
```

</details>

### Configure the Kubernetes Scheduler
Move the `kube-scheduler` kubeconfig into place:
```
sudo mv kube-scheduler.kubeconfig /var/lib/kubernetes/
```
Create the `kube-scheduler.yaml` configuration file:
```
cat <<EOF | sudo tee /etc/kubernetes/config/kube-scheduler.yaml
apiVersion: componentconfig/v1alpha1
kind: KubeSchedulerConfiguration
clientConnection:
  kubeconfig: "/var/lib/kubernetes/kube-scheduler.kubeconfig"
leaderElection:
  leaderElect: true
EOF
```
Create the `kube-scheduler.service` systemd unit file:
```
cat <<EOF | sudo tee /etc/systemd/system/kube-scheduler.service
[Unit]
Description=Kubernetes Scheduler
Documentation=https://github.com/kubernetes/kubernetes

[Service]
ExecStart=/usr/local/bin/kube-scheduler \\
  --config=/etc/kubernetes/config/kube-scheduler.yaml \\
  --v=2
Restart=on-failure
RestartSec=5

[Install]
WantedBy=multi-user.target
EOF
```
### Start the Controller Services
```
{
sudo systemctl daemon-reload
sudo systemctl enable kube-apiserver kube-controller-manager kube-scheduler
sudo systemctl start kube-apiserver kube-controller-manager kube-scheduler
}
```
> Allow up to 10 seconds for the Kubernetes API Server to fully initialize.
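
Rather than waiting a fixed amount of time, a follow-up command can be retried until it succeeds. A minimal sketch, assuming a hypothetical `wait_for` helper that is not part of the original lab:

```shell
# wait_for TRIES CMD [ARGS...]: run CMD about once per second until it
# succeeds, giving up after TRIES attempts. Hypothetical helper.
wait_for() {
  tries=$1
  shift
  while [ "$tries" -gt 0 ]; do
    if "$@" >/dev/null 2>&1; then
      return 0
    fi
    tries=$((tries - 1))
    [ "$tries" -gt 0 ] && sleep 1
  done
  return 1
}

# Example use on a controller:
#   wait_for 10 kubectl get componentstatuses --kubeconfig admin.kubeconfig
```

The helper discards the command's output, so it only answers whether the API server is reachable yet; run the command again afterwards to see the result.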
### Enable HTTP Health Checks
<details open>
<summary>GCP</summary>

A [Google Network Load Balancer](https://cloud.google.com/compute/docs/load-balancing/network) will be used to distribute traffic across the three API servers and allow each API server to terminate TLS connections and validate client certificates. The network load balancer only supports HTTP health checks, which means the HTTPS endpoint exposed by the API server cannot be used. As a workaround, the nginx web server can be used to proxy HTTP health checks. In this section nginx will be installed and configured to accept HTTP health checks on port `80` and proxy the connections to the API server on `https://127.0.0.1:6443/healthz`.

> The `/healthz` API server endpoint does not require authentication by default.

Install a basic web server to handle HTTP health checks:

```
sudo apt-get install -y nginx
```

```
cat > kubernetes.default.svc.cluster.local <<EOF
server {
  listen      80;
  server_name kubernetes.default.svc.cluster.local;

  location /healthz {
     proxy_pass                    https://127.0.0.1:6443/healthz;
     proxy_ssl_trusted_certificate /var/lib/kubernetes/ca.pem;
  }
}
EOF
```

```
{
  sudo mv kubernetes.default.svc.cluster.local \
    /etc/nginx/sites-available/kubernetes.default.svc.cluster.local

  sudo ln -s /etc/nginx/sites-available/kubernetes.default.svc.cluster.local /etc/nginx/sites-enabled/
}
```

```
sudo systemctl restart nginx
```

```
sudo systemctl enable nginx
```

</details>

### Verification
```
kubectl get componentstatuses --kubeconfig admin.kubeconfig
```
```
NAME                 STATUS    MESSAGE              ERROR
controller-manager   Healthy   ok
scheduler            Healthy   ok
etcd-2               Healthy   {"health": "true"}
etcd-0               Healthy   {"health": "true"}
etcd-1               Healthy   {"health": "true"}
```
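When repeating this check on all three controllers, it can help to reduce the table to a single pass/fail result. The sketch below assumes a hypothetical `all_healthy` filter (not part of the original lab) that reads the table from standard input and relies only on the second column being the status, as in the output above:

```shell
# all_healthy: read `kubectl get componentstatuses` output on stdin and
# succeed only if at least one component is listed and every one of them
# reports Healthy. Hypothetical helper.
all_healthy() {
  awk 'NR > 1 { n++; if ($2 != "Healthy") bad = 1 }
       END { exit (bad || n == 0) }'
}

# Example use on a controller:
#   kubectl get componentstatuses --kubeconfig admin.kubeconfig | all_healthy \
#     && echo "control plane healthy"
```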
Test the nginx HTTP health check proxy:
<details open>
<summary>GCP</summary>

```
curl -H "Host: kubernetes.default.svc.cluster.local" -i http://127.0.0.1/healthz
```

> output

```
HTTP/1.1 200 OK
Server: nginx/1.14.0 (Ubuntu)
Date: Mon, 14 May 2018 13:45:39 GMT
Content-Type: text/plain; charset=utf-8
Content-Length: 2
Connection: keep-alive

ok
```

</details>
<details>
<summary>AWS</summary>

```
curl -i \
  --cacert /var/lib/kubernetes/ca.pem \
  -H "Host: kubernetes.default.svc.cluster.local" \
  https://127.0.0.1:6443/healthz
```

> output

```
HTTP/2 200
content-type: text/plain; charset=utf-8
content-length: 2
date: Tue, 31 Jul 2018 15:47:02 GMT

ok
```

</details>
<p></p>

> Remember to run the above commands on each controller node: `controller-0`, `controller-1`, and `controller-2`.
## RBAC for Kubelet Authorization
In this section you will configure RBAC permissions to allow the Kubernetes API Server to access the Kubelet API on each worker node. Access to the Kubelet API is required for retrieving metrics, logs, and executing commands in pods.
> This tutorial sets the Kubelet `--authorization-mode` flag to `Webhook`. Webhook mode uses the [SubjectAccessReview](https://kubernetes.io/docs/admin/authorization/#checking-api-access) API to determine authorization.
<details open>
<summary>GCP</summary>

```
gcloud compute ssh controller-0
```

</details>
<details>
<summary>AWS</summary>

```
ssh -i ~/.ssh/kubernetes-the-hard-way "ubuntu@$(get_ip controller-0)"
```

</details>
<p></p>

Create the `system:kube-apiserver-to-kubelet` [ClusterRole](https://kubernetes.io/docs/admin/authorization/rbac/#role-and-clusterrole) with permissions to access the Kubelet API and perform most common tasks associated with managing pods:
```
cat <<EOF | kubectl apply --kubeconfig admin.kubeconfig -f -
apiVersion: rbac.authorization.k8s.io/v1beta1
kind: ClusterRole
metadata:
  annotations:
    rbac.authorization.kubernetes.io/autoupdate: "true"
  labels:
    kubernetes.io/bootstrapping: rbac-defaults
  name: system:kube-apiserver-to-kubelet
rules:
  - apiGroups:
      - ""
    resources:
      - nodes/proxy
      - nodes/stats
      - nodes/log
      - nodes/spec
      - nodes/metrics
    verbs:
      - "*"
EOF
```
The Kubernetes API Server authenticates to the Kubelet as the `kubernetes` user using the client certificate as defined by the `--kubelet-client-certificate` flag.
Bind the `system:kube-apiserver-to-kubelet` ClusterRole to the `kubernetes` user:
```
cat <<EOF | kubectl apply --kubeconfig admin.kubeconfig -f -
apiVersion: rbac.authorization.k8s.io/v1beta1
kind: ClusterRoleBinding
metadata:
  name: system:kube-apiserver
  namespace: ""
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: ClusterRole
  name: system:kube-apiserver-to-kubelet
subjects:
  - apiGroup: rbac.authorization.k8s.io
    kind: User
    name: kubernetes
EOF
```
## The Kubernetes Frontend Load Balancer
In this section you will provision an external load balancer to front the Kubernetes API Servers. The `kubernetes-the-hard-way` static IP address will be attached to the resulting load balancer.
> The compute instances created in this tutorial will not have permission to complete this section. Run the following commands from the same machine used to create the compute instances.
### Provision a Network Load Balancer
Create the external load balancer network resources:
<details open>
<summary>GCP</summary>

```
{
  KUBERNETES_PUBLIC_ADDRESS=$(gcloud compute addresses describe kubernetes-the-hard-way \
    --region $(gcloud config get-value compute/region) \
    --format 'value(address)')

  gcloud compute http-health-checks create kubernetes \
    --description "Kubernetes Health Check" \
    --host "kubernetes.default.svc.cluster.local" \
    --request-path "/healthz"

  gcloud compute firewall-rules create kubernetes-the-hard-way-allow-health-check \
    --network kubernetes-the-hard-way \
    --source-ranges 209.85.152.0/22,209.85.204.0/22,35.191.0.0/16 \
    --allow tcp

  gcloud compute target-pools create kubernetes-target-pool \
    --http-health-check kubernetes

  gcloud compute target-pools add-instances kubernetes-target-pool \
    --instances controller-0,controller-1,controller-2

  gcloud compute forwarding-rules create kubernetes-forwarding-rule \
    --address ${KUBERNETES_PUBLIC_ADDRESS} \
    --ports 6443 \
    --region $(gcloud config get-value compute/region) \
    --target-pool kubernetes-target-pool
}
```

</details>
<details>
<summary>AWS</summary>

```
get_instance_id() {
  aws ec2 describe-instances \
    --filters \
      Name=vpc-id,Values="$VPC_ID" \
      Name=tag:Name,Values="$1" \
    --profile kubernetes-the-hard-way \
    --query 'Reservations[0].Instances[0].InstanceId' \
    --output text
}

aws elb register-instances-with-load-balancer \
  --load-balancer-name kubernetes-the-hard-way \
  --instances \
    "$(get_instance_id controller-0)" \
    "$(get_instance_id controller-1)" \
    "$(get_instance_id controller-2)" \
  --profile kubernetes-the-hard-way
```

</details>

### Verification
Retrieve the `kubernetes-the-hard-way` static IP address:
<details open>
<summary>GCP</summary>

```
KUBERNETES_PUBLIC_ADDRESS=$(gcloud compute addresses describe kubernetes-the-hard-way \
  --region $(gcloud config get-value compute/region) \
  --format 'value(address)')
```

</details>
<details>
<summary>AWS</summary>

```
KUBERNETES_PUBLIC_ADDRESS="$(aws elb describe-load-balancers \
  --load-balancer-name kubernetes-the-hard-way \
  --profile kubernetes-the-hard-way \
  --query 'LoadBalancerDescriptions[0].DNSName' \
  --output text)"
```

</details>
<p></p>

Make an HTTP request for the Kubernetes version info:
```
curl --cacert ca.pem https://${KUBERNETES_PUBLIC_ADDRESS}:6443/version
```
> output
```
{
  "major": "1",
  "minor": "10",
  "gitVersion": "v1.10.2",
  "gitCommit": "81753b10df112992bf51bbc2c2f85208aad78335",
  "gitTreeState": "clean",
  "buildDate": "2018-04-27T09:10:24Z",
  "goVersion": "go1.9.3",
  "compiler": "gc",
  "platform": "linux/amd64"
}
```
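To confirm that the load balancer is serving the release installed earlier, the `gitVersion` field can be pulled out of the response and compared with the expected value. A minimal sketch, assuming a hypothetical `git_version` filter (not part of the original lab) and the one-field-per-line layout shown above:

```shell
# git_version: read the /version JSON on stdin and print the gitVersion value.
# Hypothetical helper; it assumes each field sits on its own line.
git_version() {
  sed -n 's/.*"gitVersion": *"\([^"]*\)".*/\1/p'
}

# Example use from the machine that created the load balancer:
#   curl -s --cacert ca.pem https://${KUBERNETES_PUBLIC_ADDRESS}:6443/version | git_version
```

If `jq` happens to be installed, `jq -r .gitVersion` would be a more robust way to read the same field.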
Next: [Bootstrapping the Kubernetes Worker Nodes](09-bootstrapping-kubernetes-workers.md)