Kubernetes EFK Stack for AWS

The EFK stack for kubernetes clusters.

Installation

To deploy these configs to an existing cluster, run kubectl apply -f ./configs --recursive.

After the elasticsearch cluster is finally up (ETA 2 minutes or so), delete all of the existing indexes so that the mapping template applies to the new index.

ssh into one of the pods and run:

curl -X DELETE localhost:9200/_all

Kibana

Note that if you're creating a kibana instance, it will need to bundle all of its resources. This can take up to 7 minutes based on how many cpu requests we've allotted the pod. So go grab a coffee or something and come back once kubectl -n kube-system logs $pod -f prints something besides Optimzing and caching bundles...

Elasticsearch runs as root to have permissions on the /data volume.

Everything is running in the kube-system namespace.

Notes

When we reference "elasticsearch nodes", we technically mean "elasticsearch pods" in the kubernetes dialect, but distributed software like elasticsearch has standardized on calling each instance a node as well. So we're going to call both things nodes and rely on context to differentiate.

How it Works

Kubernetes nodes put all docker logs into /var/lib/docker/containers. We run a fluentd-logging container on each node, which has /var/lib/docker/containers mounted and watches the files for changes. When it detects a change, it ships it off to our elasticsearch-logging pod.

Finally, we can query those elasticsearch-logging logs with kibana-logging.

Per-Node `fluentd` containers.

We use a DaemonSet to guarantee that each node receives a fluentd pod.

This fluentd is configured to parse the logs it encounters using various rules defined in td-agent.conf.

Storage

We provide dynamic persistent storage to the elasticsearch nodes by creating them as a PetSet (soon to be renamed to StatefulSet) with a persistentVolumeClaimTemplate field. This field defines a template for a PVC, which dynamically creates EBS volumes based on a StorageClass. In this case we allocate 100Gb gp2 volumes per-elasticsearch-node and attach them to the appropriate kubernetes nodes.

Networking

The only special situation with networking is the use of two services for elasticsearch, elasticsearch-logging and elasticsearch-transport. The (headless) elasticsearch-transport service is used by the elasticsearch-cloud-kubernetes plugin to discover peers. The elasticsearch-logging service is cluster-public and allows clients like Kibana (and you!) to connect to the elasticsearch API.

Elasticsearch Pod

The elasticsearch pod is the most complicated of the definitions. It contains two containers: one elasticsearch instance, and a curator sidecar that uses cron to rotate indices once they're older than DAYS days old (default 7 days).

Elasticsearch is configured to use /data as {path.data}, which is where the EBS volume from above is mounted. The wrapper startup script does two things:

It chowns the /data directory. I'm not sure why this is necessary, since elasticsearch is running as root and should be able to access anything, but it is indeed necessary.
It PUTs the index template to the ES API. ES no longer support reading templates from disk, so we've resorted to this hack to add the template to the indices.

One thing to watch out for is that the template won't take effect on the current indices, so we'll have to delete them on the first (and only the first) deployment (to be more specific, only when a new, fresh EBS volume is used).
To do so, exec into one of the elasticsearch-logging pods and run curl -X DELETE localhost:9200/_all to delete all of the existing indices.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
configs		configs
elasticsearch-curator		elasticsearch-curator
elasticsearch-logging		elasticsearch-logging
fluentd-logging		fluentd-logging
kibana-logging		kibana-logging
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Kubernetes EFK Stack for AWS

Installation

Kibana

Notes

How it Works

Per-Node `fluentd` containers.

Storage

Networking

Elasticsearch Pod

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

Kubernetes EFK Stack for AWS

Installation

Kibana

Notes

How it Works

Per-Node fluentd containers.

Storage

Networking

Elasticsearch Pod

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Per-Node `fluentd` containers.

Packages