etcd is a distributed key-value store that provides strongly consistent reads and writes across a cluster. It is the default backing store for Kubernetes, where it holds the entire cluster state — every Pod, Service, ConfigMap, and Secret — behind the API server.
Internally, etcd replicates data with the Raft consensus algorithm across an odd-sized cluster (typically 3 or 5 members), which lets it tolerate the loss of a minority of members: a 3-node cluster survives 1 failure, a 5-node cluster survives 2. Data lives in a multi-version concurrency control (MVCC) store backed by bbolt's B+ tree; every write creates a new revision rather than overwriting in place, which is what lets clients watch keys and receive ordered change notifications: the mechanism Kubernetes controllers rely on to react to cluster state. Clients talk gRPC, and every write is fsynced to the write-ahead log before it is acknowledged.
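The revision/watch idea can be made concrete with a toy sketch. This is not etcd's implementation (etcd keeps a B-tree index over bbolt and serves watches over gRPC streams); it is a minimal in-memory model, with invented names like `store` and `event`, showing how a single global revision counter gives both versioned reads and ordered notifications:

```go
package main

import (
	"fmt"
	"sync"
)

// event mirrors the shape of a watch notification: the key, its new
// value, and the store-wide revision at which the change happened.
type event struct {
	Key, Value string
	Revision   int64
}

// store is a toy MVCC key-value store: every Put bumps one global
// revision counter and appends to the key's history, so old versions
// remain readable and watchers see changes in revision order.
type store struct {
	mu       sync.Mutex
	rev      int64
	history  map[string][]event
	watchers []chan event
}

func newStore() *store {
	return &store{history: make(map[string][]event)}
}

// Put records a new version of key and notifies all watchers.
func (s *store) Put(key, value string) int64 {
	s.mu.Lock()
	defer s.mu.Unlock()
	s.rev++
	ev := event{Key: key, Value: value, Revision: s.rev}
	s.history[key] = append(s.history[key], ev)
	for _, w := range s.watchers {
		w <- ev // buffered; a real store must not block here
	}
	return s.rev
}

// Get returns the latest value of key at or below revision rev,
// i.e. a read "as of" a point in the store's history.
func (s *store) Get(key string, rev int64) (string, bool) {
	s.mu.Lock()
	defer s.mu.Unlock()
	versions := s.history[key]
	for i := len(versions) - 1; i >= 0; i-- {
		if versions[i].Revision <= rev {
			return versions[i].Value, true
		}
	}
	return "", false
}

// Watch returns a channel that receives every subsequent change.
func (s *store) Watch() <-chan event {
	s.mu.Lock()
	defer s.mu.Unlock()
	ch := make(chan event, 16)
	s.watchers = append(s.watchers, ch)
	return ch
}

func main() {
	s := newStore()
	ch := s.Watch()
	s.Put("pod/web", "pending")
	rev := s.Put("pod/web", "running")

	// Versioned reads: asking at an older revision sees older data.
	old, _ := s.Get("pod/web", rev-1)
	cur, _ := s.Get("pod/web", rev)
	fmt.Println(old, cur) // pending running

	// Watch events arrive in revision order.
	for i := 0; i < 2; i++ {
		ev := <-ch
		fmt.Println(ev.Revision, ev.Key, ev.Value)
	}
}
```

A Kubernetes controller is essentially the consumer side of this loop: list at some revision, then watch from there and reconcile on each event.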
Beyond Kubernetes, etcd is also used by Cilium, M3, OpenStack, CoreDNS, and Rook. Operating it well is mostly about disk I/O: etcd is extremely sensitive to fsync latency, and a slow disk produces the “apply entries took too long” warnings that eventually escalate into leader elections and API server timeouts. The usual production advice is dedicated SSDs, isolation from noisy neighbours, and regular backups via etcdctl snapshot save. It graduated from the CNCF in 2020.