Continuous Monitoring #1 How are we doing?
Continuous monitoring is not often a real thing. Let's start with How are we doing question:
- Fine.
- How?
- Dunno…
To avoid these awkward answers, just present some data.
- How?
- This is how…
Just for starters, let's focus on tracking the following KPIs:
Establishing what's above, can lead you to get a more & better view, as it's always a matter of working culture. So, continuous monitoring step one.
Lead Time
The lead time KPI metric is mainly on the team's velocity. Average delivery time on user stories, features, bugs resolution, and tasks is clear information on how we are doing:
For example:
Questions set, try to find the answer.
Discuss during Sprint Retro what went wrong, try to identify the cause, and during the next Sprint Planning try to set more granular (thus more achievable in 2 weeks) User Stories. And measure it again in the next period, and adapt accordingly to get it below 10 days.?
Of course, you can measure lead time on every work items you want: Epics, Features, User Stories, Tasks and Bugs.
Sounds simple? It may, but as everything is connected, remember to check other KPIs.
Deployment Frequency
领英推荐
Deployment frequency is a tricky part - it depends on the team's repository structure and the maturity of the deployment process: to what extent delivery process is automated? And what quality mechanics are in place? CI/CD is a good start.
Change Failure Rate & Defect Detection Rate
The other two metrics can reveal a true delivery speed and its quality:
When teams start tracking Change Failure Rate and Defect Detection/Escape Rates in their KPIs, it can lead to a change in their Software Delivery Life Cycle (SDLC) by:
High-performing teams have change failure rates in the 0-15 percent range. How does it look in our case?
Measurement month by month will give feedback to the team about their % rate, and will enable questions:
?The same goes for Change Detection Rate (have to be higher than 0) or Change Escape Rate (in this case should be lower than 100%). This KPIs is simple:
Ideal view: you can catch every abnormal system behavior, and your production is errorless
Summary
The same practices that enable shorter lead times — test automation, trunk-based development, and working in small batches — correlate with a reduction in change failure rates.?
?All these practices make defects much easier to identify and remediate.
By tracking the defect detection rate, teams can identify areas where they need to improve their testing processes.?This can lead to more effective testing and fewer defects in production.
By tracking the change failure rate, teams can identify areas where they need to improve their deployment processes.?This can lead to more stable deployments and fewer incidents.
In the end - being more aware, and more efficient means a lot more time for other activities or at least real free time after work ;)