01.The Distributed Consensus Challenge
In high-availability cloud infrastructure, databases cannot run on a single machine. To prevent data loss and support millions of concurrent users, we replicate state across multiple physical server nodes.
However, distributing state introduces the **Split-Brain Problem**. If a network partition occurs (e.g. cutting off Server 1 and Server 2 from Server 3), how do nodes independently agree on which writes are committed, in what exact sequence order, and who holds the authoritative lease?
The Replicated State Machine Invariant:
Consensus algorithms ensure that a cluster of machines can coordinate as a single cohesive unit, surviving node crashes. The system mandates that **if any single node commits a transaction log entry at index $I$, no other node in the cluster can ever commit a different entry at index $I$**.