
Chaos Engineering

Chaos Engineering Questions

Basics

What is Chaos Engineering?

Wikipedia: "Chaos Engineering is the discipline of experimenting on a system in order to build confidence in the system's capability to withstand turbulent conditions in production."

TechTarget: "Chaos engineering is the process of testing a distributed computing system to ensure that it can withstand unexpected disruptions."

What's a typical Chaos Engineering workflow?

According to Gremlin, there are three steps:

  1. Plan an experiment: design and choose a scenario in which your system might fail to operate properly
  2. Execute the smallest possible experiment that tests your theory
  3. If nothing goes wrong, scale the experiment up and increase the blast radius. If the system breaks, you now understand the failure better and can start fixing it

The process then repeats, either with the same scenario or with a new one.
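
The three steps above can be sketched as a small loop. This is a minimal illustration under stated assumptions, not Gremlin's tooling: `run_chaos_workflow`, `inject_failure`, and `simulated_system` are hypothetical names, the "system" is a stand-in function, and doubling the blast radius after each successful round is a choice made only for this example.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class ExperimentResult:
    blast_radius: int      # how many instances this round targeted
    steady_state_ok: bool  # whether the system still behaved normally


def run_chaos_workflow(
    inject_failure: Callable[[int], bool],
    max_blast_radius: int,
    start_radius: int = 1,
) -> List[ExperimentResult]:
    """Start with the smallest experiment and grow it until something breaks.

    inject_failure is a hypothetical hook: it disrupts `n` instances and
    returns True if the system stayed healthy.
    """
    if start_radius < 1:
        raise ValueError("start_radius must be at least 1")

    results: List[ExperimentResult] = []
    radius = start_radius
    while radius <= max_blast_radius:
        ok = inject_failure(radius)
        results.append(ExperimentResult(radius, ok))
        if not ok:
            # The system broke: stop here, investigate, and fix.
            break
        # Nothing went wrong: widen the blast radius (doubling is an assumption).
        radius *= 2
    return results


def simulated_system(instances_killed: int) -> bool:
    """Stand-in system that tolerates losing at most 3 instances."""
    return instances_killed <= 3


history = run_chaos_workflow(simulated_system, max_blast_radius=10)
for result in history:
    print(result)
```

In this simulated run the rounds with a blast radius of 1 and 2 pass and the round with a blast radius of 4 fails, so the loop stops there. In a real setup the next step would be to find the cause, fix it, and repeat the experiment.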