Kubernetes Troubleshooting Labs

Diagnose k8s outages in live environments.

Practice Kubernetes incident response with real cluster failures and system-level debugging.

  • Kubernetes control-plane issues
  • Network and DNS failures
  • Service recovery tasks
  • Live terminal access

Typical k8s courses

  • Guided walkthroughs
  • No incident pressure
  • Theory-heavy
  • Minimal debugging

Deadnodes labs

  • Real cluster failures
  • Live troubleshooting
  • Incident-style tasks
  • Root-cause focus

Real k8s incidents

Work on broken clusters with failing nodes, control-plane issues, and misconfigured services.

  • k3s instability
  • CrashLoopBackOff
  • DNS outages

Built for SRE and DevOps

Scenarios mirror what teams see in production: noisy alerts, broken dependencies, and tough tradeoffs.

  • Incident response
  • Triage speed
  • Service recovery

Measure skill growth

Track progress over time with run history and scores.

  • Run history
  • Scoring
  • Replayable scenarios