Faster Safely > Resources > Accelerate - Science of DevOps

Accelerate - Science of DevOps

Summary of Findings

Included below is a summary of key findings from the research and science behind effective software delivery and DevOps. Links to the original research and book are available in the references.

Four Key Metrics of Software Delivery
Software Delivery Performance Levels
Organizational Performance
Burnout and Deployment Pain
Lean Management
Lean Product Management
Cultural Capabilities
- Westrum Cultural Typologies:
eNPS and Job Satisfaction
Leadership
Deployment pain
- Key signs of deployment pain
- Reducing deployment pain
Software Delivery Performance Levels: Updated 2019
References

Four Key Metrics of Software Delivery

The four characteristics that were found to be the most effective measures of software delivery performance are

Lead Time For Changes: The amount of time it takes to go from code committed to code successfully running in production.
Change Failure Rate: How often deployment failures occur in production that require immediate remedy, especially rollbacks.
Deployment Frequency: How often changes are deployed into production.
Mean Time To Restore (MTTR): Average amount of time it takes to restore a service after a disruption.

Software Delivery Performance Levels

Software delivery performance is categorized into the following levels:

Aspect of Software Delivery Performance	High Performer	Medium Performer	Low Performer
Deployment frequency For the primary application or service you work on, how often does your organization deploy code to production or release it to end users?	On-demand (multiple deploys per day)	Between once per day and once per week	Between once per month and once every six months
Lead time for changes For the primary application or service you work on, what is your lead time for changes (i.e., how long does it take to go from code committed to code successfully running in production)?	Less than one hour	Between one week and one month	Between one week and one month
Time to restore service For the primary application or service you work on, how long does it generally take to restore service when a service incident or a defect that impacts users occurs (e.g., unplanned outage or service impairment)?	Less than one hour	Less than one day	Between one day and one week
Change failure rate For the primary application or service you work on, what percentage of changes to production or released to users result in degraded service (e.g., lead to service impairment or service outage) and subsequently require remediation (e.g., require a hotfix, rollback, fix forward, patch)?	0-15%	0-15%	31-45%

Organizational Performance

High performers are twice as likely to exceed organizational performance goals as low performers: profitability, productivity, market share, number of customers. High performers are twice as likely to exceed noncommercial performance goals as low performers: quantity of products/ services, operating efficiency, customer satisfaction, quality of products/services, achieving organizational/mission goals. In a follow-up survey to the initial 2014 data collection effort, we gathered stock ticker data and performed additional analysis on responses from just over 1,000 respondents across 355 companies who volunteered the organization they worked for. For those who worked for publicly traded companies, we found the following (this analysis was not replicated in later years because our dataset was not large enough):
High performers had 50% higher market capitalization growth over three years compared to low performers.

Burnout and Deployment Pain

Deployment pain is negatively correlated with software delivery performance and Westrum organizational culture. The five factors most highly correlated with burnout are Westrum organizational culture (negative), leaders (negative), organizational investment (negative), organizational performance (negative), and deployment pain (positive).

Lean Management

Our research shows that a lightweight change approval process based on peer review (pair programming or intrateam code review) produces superior IT performance than using external change approval boards (CABs)
Change advisory boards are negatively correlated with software delivery performance. Approval only for high-risk changes was not correlated with software delivery performance. Teams that reported no approval process or used peer review achieved higher software delivery performance. A lightweight change approval process predicts software delivery performance.

Lean Product Management

The ability to take an experimental approach to product development is highly correlated with the technical practices that contribute to continuous delivery. Lean product development capabilities predict Westrum organizational culture, software delivery performance, organizational performance, and less burnout.

Lean methodology extends beyond manufacturing, DevOps etc… it also complements product development.

Cultural Capabilities

Westrum organizational culture predicts software delivery performance, organizational performance, and job satisfaction. Westrum organizational culture is negatively correlated with deployment pain. The more painful code deployments are, the poorer the culture.
Westrum’s generative culture (as outlined below). This measure of organizational culture is based on a typology developed by Ron Westrum, a sociologist who studied safety-critical complex systems in the domains of aviation and healthcare. Our research has found that this measure of culture is predictive of IT performance, organizational performance, and decreasing burnout. Hallmarks of this measure include good information flow, high cooperation and trust, bridging between teams, and conscious inquiry

Westrum Cultural Typologies:

Pathological Power-oriented	Bureaucratic Rule-oriented	Generative Performance-oriented
Low co-operation	Modest co-operation	High co-operation
Messengers shot	Messengers neglected	Messengers trained
Responsibilities shirked	Narrow responsibilities	Risks are shared
Bridging discouraged	Bridging tolerated	Bridging encouraged
Failure leads to scapegoating	Failure leads to justice	Failure leads to inquiry
Novelty crushed	Novelty leads to problems	Novelty implemented

Westrum originally was interested in improving safety outcomes in medical environments by adopting generative cultures and avoiding pathological ones. From my experience in working in healthcare I noticed how impactful it was as well and there is research that supports this. Little did he realize it would also become a huge driver in DevOps and continuous delivery. Of the human factors needed to adopt continuous delivery, its one of the most significant.

Here was his original research paper on organizational culture https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1765804/pdf/v013p0ii22.pdf

eNPS and Job Satisfaction

High performers have better employee loyalty, as measured by employee Net Promoter Score (eNPS). Employees in high-performing organizations were 2.2 times more likely to recommend their organization as a great place to work. eNPS was significantly correlated with:
The extent to which the organization collects customer feedback and uses it to inform the design of products and features The ability of teams to visualize and understand the flow of products or features through development all the way to the customer The extent to which employees identify with their organizations values and goals, and the effort they are willing to put in to make the organization successful
Employees in high-performing teams are 2.2 times more likely to recommend their organization as a great place to work. Employees in high-performing teams are 1.8 times more likely to recommend their team as a great place to work. Job satisfaction predicts organizational performance.

Leadership

We observed significant differences in leadership characteristics among high-, medium-, and low-performing teams.
High-performing teams reported having leaders with the strongest behaviors across all dimensions: vision, inspirational communication, intellectual stimulation, supportive leadership, and personal recognition. Low-performing teams reported the lowest levels of all five leadership characteristics. These differences were all at statistically significant levels.
- Characteristics of transformational leadership are highly correlated with software delivery performance. Transformational leadership is highly correlated with employee Net Promoter Score (eNPS). Teams with the top 10% of reported transformational leadership characteristics were equally or even less likely to be high performers, compared to the entire population of teams represented in survey results. Leadership is predictive of Lean product development capabilities (working in small batches, team experimentation, gathering and implementing customer feedback) and technical practices (test automation, deployment automation, trunk-based development, shift left on security, loosely coupled architecture, empowered teams, continuous integration).

Deployment pain

The fear and anxiety that engineers and technical staff feel when they push code into production can tell us a lot about a team’s software delivery performance. We call this deployment pain, and it is important to measure because it highlights the friction and disconnect that exist between the activities used to develop and test software and the work done to maintain and keep software operational. This is where development meets IT operations, and it is where there is the greatest potential for differences: in environment, in process and methodology, in mindset, and even in the words teams use to describe the work they do.

Key signs of deployment pain

Deployments that happen outside business hours
Deployments that require downtime and are hard to roll back
Software built without deployability in mind
Deployments that require manual changes must be made to production environment
Deployments that require handing off changes to multiple teams, particularly silo’d teams

Reducing deployment pain

Our research shows that improving key technical capabilities reduces deployment pain, teams that:

implement comprehensive test and deployment automation
use continuous integration, including trunk-based development
shift left on security(automate); effectively manage test data
use loosely coupled architectures
can work independently
use version control of everything required to reproduce production environments.

decrease their deployment pain.

Software Delivery Performance Levels: Updated 2019

In the 2019 edition of the “State of DevOps” report a new “Elite” category was added to distinguish the extreme high performers that emerged while grouping the data. The benefits applied to “high performers” from up above still apply here to the “Elite” performers.

Aspect of Software Delivery Performance	Elite Performer	High Performer	Medium Performer	Low Performer
Deployment frequency For the primary application or service you work on, how often does your organization deploy code to production or release it to end users?	On-demand (multiple deploys per day)	Between once per day and once per week	Between once per week and once per month	Between once per month and once every six months
Lead time for changes For the primary application or service you work on, what is your lead time for changes (i.e., how long does it take to go from code committed to code successfully running in production)?	Less than one day	Between one day and one week	Between one week and one month	Between one month and six months
Time to restore service For the primary application or service you work on, how long does it generally take to restore service when a service incident or a defect that impacts users occurs (e.g., unplanned outage or service impairment)?	Less than one hour	Less than one day	Less than one day	Between one week and one month
Change failure rate For the primary application or service you work on, what percentage of changes to production or released to users result in degraded service (e.g., lead to service impairment or service outage) and subsequently require remediation (e.g., require a hotfix, rollback, fix forward, patch)?	0-15%	0-15%	0-15%	46-60%

References

jump to top of page

Accelerate: The Science of Lean Software and DevOps

State of DevOps 2019