Dynamo - Highly Available Key-Value Store

source

Dynamo is a highly available, decentralized key-value storage system developed by Amazon to address the extreme reliability and scalability requirements of its e-commerce platform. It represents a fundamental shift in database design philosophy: prioritizing availability and performance over strong consistency.

Core Design Philosophy

Failures as Default

At scale, treating failures as exceptional events is the real exception—component failures become the normal operating mode.

At massive scale, component failures are not exceptional—they are the normal mode of operation. Amazon’s services required an “always-on” experience where customer updates (like adding items to a shopping cart) are never rejected due to server or network failures.

Traditional relational databases proved inadequate for several reasons:

Excess functionality: Complex query features were unnecessary for simple key-value lookups
Limited availability: Replication technologies prioritized strong consistency over availability
Scaling difficulties: Partitioning and load balancing remained challenging

Dynamo was built specifically for services needing simple get(key) and put(key, value) operations on binary objects, trading Eventual Consistency for extreme availability.

When to Use Dynamo

Dynamo is ideal for:

Simple access patterns: Primary-key lookups without complex queries or joins
High availability requirements: Services that must remain writeable during failures
Stateful services: Shopping carts, session management, user preferences
Predictable performance: Systems requiring tight SLA guarantees (99.9th percentile)

Dynamo is not suitable for:

Workloads requiring Strong Consistency or ACID transactions
Multi-item operations or relational queries
Use cases where data conflicts are unacceptable

Core Principles

Always Writeable

Complexity Migration Pattern

Pushing conflict resolution to reads turns availability from a coordination problem into an application design pattern.

The system never rejects writes. Even during network partitions or node failures, customer updates are captured. Complexity of Conflict Resolution Strategies is pushed to the read path, ensuring writes never fail.

Application-Assisted Conflict Resolution

When version conflicts arise, the application can perform semantic reconciliation based on business logic. A shopping cart service, for example, merges conflicting cart versions by taking the union of items rather than picking a single version.

Decentralized and Symmetric

Every node has identical responsibilities—no master nodes or single points of failure. This design philosophy leverages peer-to-peer techniques, making the system simpler to provision and more resilient to failures.

Tunable Trade-offs

Applications control the balance between availability, consistency, performance, and cost through three parameters:

N: Number of replicas
R: Read quorum size
W: Write quorum size

Common configuration: (N=3, R=2, W=2) provides quorum-like guarantees where R + W > N.

Production Insights

Dynamo has powered Amazon’s infrastructure for years, handling tens of millions of requests daily during peak loads without downtime. Key operational learnings include:

Performance optimization: Amazon measures SLAs at the 99.9th percentile (not averages) to ensure good experiences for all customers. Optimizations like client-driven coordination and write buffering reduced 99.9th percentile latencies by over 30ms.

Theory vs Reality Gap

Dynamo’s architecture assumes frequent conflicts, yet 99.94% of operations see a single version—the system optimizes for theoretical worst cases that rarely materialize in practice.

Rare divergence: In practice, version conflicts are uncommon. A 24-hour study of the shopping cart service showed 99.94% of requests encountered exactly one version. Most conflicts stemmed from concurrent writes (often automated bots), not system failures.

Evolution: The partitioning strategy evolved from random token placement to equal-sized partitions (Q/S tokens per node), dramatically improving load balancing, bootstrapping speed, and operational simplicity.

Architectural Overview

Dynamo synthesizes several Distributed Systems techniques:

Consistent Hashing with Virtual Nodes: Enables incremental scalability and uniform load distribution
Replication Strategies in Distributed Systems: N-way replication across preference lists
Vector Clocks: Track causality and detect version conflicts
Quorum Systems: Configurable R/W parameters balance consistency and availability
Sloppy Quorums with Hinted Handoff: Maintain availability during temporary failures
Anti-Entropy Protocols using Merkle Trees: Synchronize replicas in the background
Gossip Protocols: Decentralized membership and failure detection

The power comes not from any single innovation, but from the thoughtful synthesis of proven techniques into a cohesive system optimized for Amazon’s specific requirements.

Key Takeaway

Dynamo demonstrates that an eventually-consistent storage system is a viable and powerful building block for highly available applications in demanding production environments. By embracing eventual consistency and pushing conflict resolution to applications, it achieves the “always writeable” property critical for customer-facing services at scale.

See Dynamo Architecture for detailed exploration of how these techniques interconnect.

Gradual Notes

Recent Writing

Revisited

Space is Not Barrenness

Study the Canon

Recent Notes

Caching Context

Reentrant Code

Zeigarnik Effect

Migrating from Resque to Sidekiq

Dynamo - Highly Available Key-Value Store

Core Design Philosophy

When to Use Dynamo

Core Principles

Always Writeable

Application-Assisted Conflict Resolution

Decentralized and Symmetric

Tunable Trade-offs

Production Insights

Architectural Overview

Key Takeaway

Recent Writing

Revisited

Space is Not Barrenness

Study the Canon

Recent Notes

Caching Context

Reentrant Code

Zeigarnik Effect

Migrating from Resque to Sidekiq

Kubernetes Batch Jobs

Graph View

Table of Contents

Backlinks