Fault tolerance
Stories about lineage, recomputation, checkpointing, and speculation.
Planned stories:
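- Lineage: the record of transformations Spark keeps as its “memory”, so it can recompute lost partitions
- Checkpointing: saving data to reliable storage when the lineage has grown too long to replay cheaply
- Speculation: when a straggling task gets a backup copy and the first one to finish wins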
Stories will be added here gradually.
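
Until the stories land, the first two ideas above can be sketched in a few lines. This is a toy model in plain Python, not Spark's API: `ToyDataset`, `lose`, and `lineage_depth` are hypothetical names invented for illustration. It only shows that a dataset which remembers its parent and its transformation can rebuild lost data, and that a checkpoint cuts that chain.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class ToyDataset:
    """Hypothetical stand-in for an RDD: data plus the recipe that produced it."""
    parent: Optional["ToyDataset"] = None
    fn: Optional[Callable[[List[int]], List[int]]] = None
    source: Optional[List[int]] = None  # set for the root or after a checkpoint
    cached: Optional[List[int]] = None  # in-memory copy that can be lost

    def map(self, f: Callable[[int], int]) -> "ToyDataset":
        # Record the transformation instead of running it (the lineage).
        return ToyDataset(parent=self, fn=lambda rows: [f(r) for r in rows])

    def compute(self) -> List[int]:
        if self.cached is not None:
            return self.cached
        if self.source is not None:
            rows = list(self.source)
        else:
            # Replay the recorded step on top of the parent's data.
            rows = self.fn(self.parent.compute())
        self.cached = rows
        return rows

    def lose(self) -> None:
        # Simulate losing the in-memory copy, e.g. a failed executor.
        self.cached = None

    def checkpoint(self) -> None:
        # Materialize the data and drop the lineage behind it.
        self.source = list(self.compute())
        self.parent = None
        self.fn = None

    def lineage_depth(self) -> int:
        return 0 if self.parent is None else 1 + self.parent.lineage_depth()


root = ToyDataset(source=[1, 2, 3])
plus_one = root.map(lambda x: x * 2).map(lambda x: x + 1)

first = plus_one.compute()
plus_one.lose()                       # the computed data is gone...
recomputed = plus_one.compute()       # ...but the lineage rebuilds it

depth_before = plus_one.lineage_depth()
plus_one.checkpoint()                 # lineage is truncated here
depth_after = plus_one.lineage_depth()
```

Speculation is not modeled in this sketch; in Spark it is switched on through the `spark.speculation` configuration property.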