The Design and Organization of Data Centers/Redundancy

Everything will eventually fail. Redundancy allows you to minimize the damage of a system failure.

Failure characteristics

edit

Planned vs. unplanned

Total vs. partial

Frequency of failure

Length of partial failure or outage

Types of redundancy

edit

Structure

edit

Ladder

Mesh

N+1

Implementation

edit

Active/active

Active/passive

Human Factors

edit

Notification

Unattended problem resolution

Documentation and problem clarity

Allowance for no-impact maintenance