Cost
Assessing reliability past the 3rd or 4th decimal place can require an enormous amount of testing
- Is it necessary to do so?
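A rough sketch of why the test count grows so quickly, assuming independent test runs drawn from the operational profile and no observed failures: if the true per-run reliability is R, then n failure-free runs occur with probability R^n, so demonstrating R at confidence C needs n >= ln(1 - C) / ln(R). The function name and the 95% confidence level are illustrative choices, not part of the original notes.

```python
import math


def failure_free_tests_needed(reliability: float, confidence: float) -> int:
    """Smallest n such that n failure-free runs support `reliability`
    at the given `confidence` (assumes independent, representative runs)."""
    if not (0.0 < reliability < 1.0 and 0.0 < confidence < 1.0):
        raise ValueError("reliability and confidence must lie strictly in (0, 1)")
    # Solve reliability**n <= 1 - confidence for n.
    return math.ceil(math.log(1.0 - confidence) / math.log(reliability))


if __name__ == "__main__":
    # Each extra decimal place of reliability costs roughly 10x more tests.
    for r in (0.99, 0.999, 0.9999, 0.99999):
        print(f"R = {r}: {failure_free_tests_needed(r, 0.95)} failure-free runs")
```

Under these assumptions each additional "9" multiplies the required number of failure-free runs by about ten, which is the cost the slide refers to.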
Should all failures be considered equally bad?
- Showstoppers vs. “wrong color”
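One way to make the distinction concrete is to weight failures by severity rather than counting them equally. The sketch below is only an illustration: the severity classes and the weights (100, 10, 1) are hypothetical and would have to be chosen per project.

```python
from collections import Counter

# Hypothetical severity weights; not taken from any standard.
SEVERITY_WEIGHTS = {"showstopper": 100.0, "major": 10.0, "cosmetic": 1.0}


def weighted_failure_score(failures, weights=SEVERITY_WEIGHTS):
    """Sum of severity weights over a list of observed failure severities."""
    counts = Counter(failures)
    return sum(weights[severity] * n for severity, n in counts.items())


if __name__ == "__main__":
    many_cosmetic = ["cosmetic"] * 20   # twenty "wrong color" failures
    one_showstopper = ["showstopper"]   # a single crash
    print(weighted_failure_score(many_cosmetic))
    print(weighted_failure_score(one_showstopper))
```

With these assumed weights, a single showstopper outweighs twenty cosmetic failures, whereas a raw failure count would rank them the other way around.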
Oracles of “correctness” aren’t always easy to come by
- Monitoring phone switches is relatively easy; monitoring shrink-wrapped software isn’t