page icon

SRE Maturity Assessment

Background of the initiative

Since it is physically difficult to embed SRE in all products, we were looking for a way to promote SRE across the board. Also, because there was no data or indicators to get a bird's-eye view of the whole, the organization was unable to allocate resources efficiently and was often behind in risk management. To solve these problems, we developed the SRE maturity assessment.

What is the SRE Maturity Assessment?

This was created based on the capability maturity model integration in order to obtain an overview of the entire business division and digitize it.
In addition, we have created a list of necessary items based on the fault lines of service reliability, and made it as simple as possible to make it easier to evaluate.
図1. 成熟度概要
Figure 1. Maturity Overview

What can an SRE maturity assessment do for you?

By utilizing the SRE maturity assessment, SRE promotion (including enablement) can be carried out across the board.
Also, by knowing where you are currently, it becomes easier to make improvement plans and get closer to the ideal state for the product.

SRE Maturity Assessment Process

The SRE maturity assessment is carried out in four main steps:
  1. preparation
  1. Assessment and Planning
  1. Improvement implementation
  1. Looking back

1. Preparation

When conducting an SRE maturity assessment, we will explain the concept of SRE maturity assessment, the application flow, and the Level 3 guidelines.
  • Lv.3 Guidelines
    • Questions on the best practices for each item
    • Ideal state of each product = Lv.3
    • The ideal state for each product is different, so it is not necessary to satisfy all of them.

2. Assessment and Planning

By referring to Level 3 of each item, we will align the current maturity level of each item with the ideal state. (※ Level 3 of each item will be shared around June 2023)
Once you have aligned your understanding, the final step is to create an improvement plan. First, create a quarterly improvement plan, and then organize the action items and owners. In addition, if monitoring, incident response, and postmortem are at Level 1, we recommend that you prioritize creating an improvement plan.
図3. SRE成熟度評価シート
Figure 3. SRE Maturity Assessment Sheet
図4. SRE成熟度改善計画書
Figure 4. SRE Maturity Improvement Plan

3.Improvement implementation

We will improve the maturity level of each item while utilizing knowledge from other services. We also provide templates that can be used immediately for postmortems and incident response.
図5. ナレッジデータベース
Figure 5. Knowledge Database

4. Review

After implementing improvements, review the improvement plan quarterly or semiannually. First, review quarterly, but if the operational load is high, it is better to do it semiannually.

What the SRE Maturity Assessment Gained

  • It has become possible to get an overview of the entire business division as data.
  • It becomes easier to determine which products and improvements should be prioritized
  • I was able to learn about internal practices that I had not been aware of