The Design and Organization of Data Centers/Redundancy
Everything will eventually fail. Redundancy allows you to minimize the damage of a system failure.
Failure characteristics
editPlanned vs. unplanned
Total vs. partial
Frequency of failure
Length of partial failure or outage
Types of redundancy
editStructure
editLadder
Mesh
N+1
Implementation
editActive/active
Active/passive
Human Factors
editNotification
Unattended problem resolution
Documentation and problem clarity
Allowance for no-impact maintenance