Agile · Letter D

DevOps

The cultural and engineering practice of merging software development and operations into a single, continuous flow — automating build, test, deploy, monitoring, and feedback to ship software faster and more reliably.

By Dr. Hassan Eliwa, PhD · Founder of PMMilestone.org and PMMilestone.com · Updated 2026-06-26

Definition

DevOps is the set of cultural philosophies, engineering practices, and tooling that close the historical gap between software development teams and the operations teams who run their software in production. The aim is a single continuous loop: code, build, test, deploy, operate, monitor, learn, and feed the learning back into the next change. DevOps is not a job title or a tool — it is an operating model.

Why It Matters

The DORA "State of DevOps" research, now in its eleventh year, consistently finds that elite-performing teams deploy multiple times a day with change-failure rates below 15% and mean-time-to-restore under one hour. Low-performing teams deploy monthly or less, with change-failure rates above 45% and recovery measured in days. The gap is not talent; it is practice. The same engineers, given a different operating model, move between the two clusters.

Core Practices

  • Continuous integration. Every commit triggers a build and the full automated test suite.
  • Continuous delivery / deployment. Every green build is releasable, and elite teams release automatically.
  • Infrastructure as code. Environments described in version-controlled code (Terraform, Pulumi, CloudFormation), never clicked together in a console.
  • Observability. Logs, metrics, traces, and structured alerts that let on-call engineers diagnose in minutes, not hours.
  • Shared on-call. Developers carry the pager for the code they wrote. Nothing changes behaviour like a 3am alert about your own commit.
  • Blameless post-incident reviews. Failures are system properties, not individual mistakes.

Real-World IT Example

A European retail bank I supported moved its mobile-banking team from monthly releases to continuous deployment over 14 months. Lead time from commit to production fell from 21 days to 47 minutes. Change-failure rate dropped from 38% to 9%. The transformation was 70% cultural — breaking down the dev/ops wall, introducing shared on-call, blameless post-incident reviews — and 30% tooling (modernised CI, infrastructure as code, feature flags). Customer-reported defects per release fell by roughly 80%.

Real-World Construction Analogue

Construction is starting to learn DevOps lessons under the label "digital construction operations". On a major airport expansion, the digital twin team applied DevOps practices — infrastructure as code for the cloud environment, automated tests on the BIM federation pipeline, and continuous integration of model updates — and reduced the time from a model change being authored to being visible in the integrated coordination platform from 36 hours to 28 minutes. The operating principle is identical: short feedback loops, automation, shared responsibility.

Common Mistakes

  • Hiring a "DevOps team" and changing nothing else. You have just renamed your old operations team. The cultural wall is unchanged.
  • Tool-first transformation. Buying Jenkins, Terraform, and Datadog does not produce DevOps any more than buying a violin produces a violinist.
  • Pursuing deployment frequency without quality gates. Faster broken releases are worse than slower working ones.
  • Skipping observability. You cannot recover quickly from what you cannot see.
  • Punishing failure. The fastest way to kill DevOps is to fire someone for an incident. Engineers will stop deploying.
  • Ignoring security. Modern practice — sometimes called DevSecOps — bakes security into the pipeline from day one.

Expert Tips

  • Measure the four DORA metrics weekly: lead time for change, deployment frequency, change-failure rate, mean-time-to-restore. They tell the truth.
  • Invest in test reliability before deployment frequency. Flaky tests destroy the trust the pipeline runs on.
  • Make rollback faster than roll-forward. Confident teams ship more often.
  • Adopt feature flags. They decouple deploy from release and turn rollback into a config change.
  • Treat incidents as a learning asset, not a punishment. The best post-incident review I have seen ran 90 minutes, generated nine action items, and changed how the team structured its services.

Practical Lessons Learned

  • The hardest part is rarely the pipeline; it is the relationship between developers, operations, and security. Tools follow culture, not the other way around.
  • The teams that fail most often start by trying to deploy weekly with no test discipline. They generate enough chaos to discredit the whole effort. Start by stabilising the build and the test suite.
  • Senior leadership endorsement matters. Without an executive sponsor who tolerates short-term disruption for long-term gain, DevOps stalls.

Key Takeaways

  • DevOps is an operating model — culture, practices, and tooling combined — not a team or a product.
  • The DORA metrics (lead time, deployment frequency, change-failure rate, MTTR) are the cleanest measure of DevOps maturity.
  • Elite performers achieve high speed and high reliability simultaneously; the trade-off is a myth.
  • Tooling without cultural change produces no improvement; cultural change without tooling stalls quickly. Both are required.
  • Blameless learning, shared on-call, and observability are the unsung heroes of every successful transformation.

Related Encyclopedia Entries

Related Research Articles, Case Studies & Tools

Frequently Asked Questions

  • Is DevOps the same as SRE?
    Closely related, not identical. SRE (Site Reliability Engineering) is Google's specific implementation of DevOps principles with formalised error budgets and reliability targets.
  • Do we need a DevOps engineer?
    The role exists but the goal is to make DevOps the operating model for the whole engineering organisation, not a siloed team.
  • Can DevOps work in regulated industries?
    Yes. The most regulated banks and healthcare systems now run continuous delivery, with regulators broadly supportive when audit trails are automated.
  • What are the DORA metrics?
    Lead time for change, deployment frequency, change-failure rate, and mean-time-to-restore. The four-metric framework comes from the DORA research programme now run by Google Cloud.
  • Do we have to deploy multiple times per day?
    No. The metric that matters is the ability to deploy safely on demand. Many businesses deliberately bundle releases for product reasons.
  • How does DevOps affect security?
    Modern practice — DevSecOps — embeds security scanning, dependency analysis, and policy-as-code into the pipeline. Security becomes faster, not slower.
  • How long does a DevOps transformation take?
    12–24 months for a meaningful step-change in a 100+ engineer organisation. Smaller teams can move faster.
  • Which calculators on PMMilestone.org apply to DevOps?
    For DevOps, the most relevant tools on the flagship platform are the EVM, SPI and CPI calculators on PMMilestone.org. They reproduce the formulas referenced in this entry against your own project data.
  • What is a common misconception about DevOps?
    That the topic is well-defined across all references. In practice, definitions vary between PMBOK, PRINCE2, AACE and ISO 21500 — this entry uses the definition most aligned with field practice on capital projects, and flags where the standards diverge.
  • Which related encyclopedia entries should I read alongside DevOps?
    Read Earned Value Management, Critical Path Method and the DCMA 14-point assessment next. The full A–Z is available in the PMMilestone Encyclopedia, and quick one-line definitions live in the PM Glossary on the flagship platform.
  • How does Dr. Hassan Eliwa's research treat DevOps?
    Dr. Hassan Eliwa's research focuses on owner-side project controls, schedule integrity and forensic delay analysis on capital construction and power programmes. DevOps is treated through that lens — what a planning or controls engineer is expected to do with it on a live project, not its textbook definition alone. See the full research library at PMMilestone Research Articles.
  • How is DevOps defined on PMMilestone Research & Insights?
    The cultural and engineering practice of merging software development and operations into a single, continuous flow — automating build, test, deploy, monitoring, and feedback to ship software faster and more reliably. For the full treatment, see the definition, principles, applications and related entries above — every encyclopedia entry follows the same research-grade structure.

People also ask

Follow-up questions practitioners search for next — each one points to the calculator, template or reference entry that answers it.

Related Entries

Further reading on PMMilestone.org

Curated companion resources hosted on the flagship platform, PMMilestone.org.

Related Encyclopedia Entries
Career Guides
Tools on PMMilestone.org
Buy me a coffee