What Is a Retry Storm?


Diagram showing a retry storm as a feedback loop of increasing load and tail latency.

Retry storm: when retries multiply load and turn partial failures into outages. Learn how they happen, how to detect them, and how to prevent them.

What Is a Thundering Herd?


Diagram showing a thundering herd as a synchronized wave of clients stampeding a shared bottleneck.

Thundering herd: when many clients do the same work at once and overload a dependency. Understand why it happens, what it looks like, and how to reduce risk.

What Is Backpressure?


Diagram showing backpressure as a signal from a downstream component to an upstream component to slow down.

Backpressure: a system’s way of saying “slow down” before overload turns into timeouts and retries. Understand why it matters and what signals it uses.

What Is Load Shedding?


Diagram showing load shedding as a way to keep the system in a controlled state under stress.

Load shedding rejects work during overload so systems stay usable. Learn why it matters, what it looks like, and how it prevents retry storms.

Fundamentals of Distributed Systems


Master the core concepts of distributed systems that power modern applications. Learn about consistency, fault tolerance, scalability patterns, and architectural principles that separate toy projects from production-ready systems.