Menu
Operating Cadence

The Fragile → Stable → Productive Arc

How engineering teams actually mature (and why 'hire more seniors' isn't the answer)

8 min read·essay

Every engineering team I've walked into falls somewhere on a three-stage arc. Fragile. Stable. Productive. The names are simple. The journey between them is not.

Most founders and engineering leaders intuitively know which stage they're in. They just don't have a framework for what to do about it. So they default to the same playbook every time: "We need to hire more senior engineers." And then they're surprised when the new senior engineers inherit the same structural problems and produce the same results, just at a higher salary.

The problem is almost never the people. The problem is the operating system the people are running on.

Here's how the arc works, what each stage actually looks like, and how to move through it in about six months. I've done this three times now. It's not magic. It's sequence.


The Three-Stage Arc

A horizontal progression with three distinct stages. FRAGILE on the left (red/orange tint), STABLE in the middle (yellow/neutral), PRODUCTIVE on the right (green/teal). An arrow showing direction of travel. Most teams try to jump from Fragile to Productive. It doesn't work. You have to pass through Stable first.


Stage 1: Fragile

You know you're here when:

Everything feels urgent. Priorities change weekly, sometimes daily. Nobody can explain what the team is working on this quarter without pulling up Jira and squinting. Bugs in production are discovered by customers, not by monitoring. Deployments are events that require courage. The retro produces the same three observations it produced last month. People are busy, visibly stressed, and weirdly unclear on whether any of it matters.

The CEO thinks the team is slow. The engineering lead thinks the team is underwater. They're both right.

Here's the thing about Fragile that most people miss: the engineers are often doing good work individually. The code isn't necessarily terrible. The people aren't incompetent. The system around them is what's broken. There's no planning rhythm that survives first contact with the week. There's no shared understanding of what "done" means in business terms. There's no observability, so problems are invisible until they're emergencies. And there's no capacity model, so every request from every stakeholder gets the same answer: "we'll try to fit it in."

"We'll try to fit it in" is the unofficial motto of every Fragile team I've ever met.

What breaks teams out of Fragile:

Not hiring. Not a rewrite. Not switching to the hot new framework. What breaks a team out of Fragile is installing a basic operating cadence. And I do mean basic.

A quarterly plan that fits on one page. Not a 40-page roadmap document. One page. What are we doing this quarter, why does it matter to the business, and how will we know it worked. If the team can't recite the top three priorities from memory, the plan is too complicated.

A weekly rhythm that actually happens. Monday: what are we doing this week and what's blocked. Friday: what did we ship and what did we learn. It sounds infantile. It is the single most effective intervention I've ever seen in Fragile teams. The act of saying "here's what we'll do" and then checking "here's what we did" creates accountability without bureaucracy.

Monitoring before it's an emergency. Stand up basic observability. You don't need a fancy APM suite on day one. You need to know when something breaks before your customers do. New Relic, Datadog, even well-configured CloudWatch. The tool matters less than the habit.

One clear owner per initiative. Not a committee. Not "the team." One person who can say "this is on track" or "this is off track" without convening a meeting. Ownership isn't about blame. It's about someone knowing the state of things at all times.

That's it. Four things. Quarterly plan, weekly rhythm, basic monitoring, clear ownership. I've moved teams from Fragile to Stable in 8-12 weeks with nothing more than these four interventions and the discipline to actually stick with them.


The Fragile Escape Kit

Four items in a clean layout: (1) One-page quarterly plan, (2) Monday/Friday weekly rhythm, (3) Basic observability, (4) One owner per initiative. Four interventions. 8-12 weeks. No new hires required.


Stage 2: Stable

You know you're here when:

Things ship when they're supposed to. Not always, but mostly. The team can explain what they're working on and why. Production incidents still happen, but they're detected by monitoring instead of by angry customer emails. Deployments are routine, not ceremonies. The CEO has stopped asking "what is engineering doing?" every week. The retros actually produce different observations from month to month.

Stable feels good. It feels like the problem is solved. This is where most teams get stuck.

The trap of Stable is that it looks like success. People are calmer. Delivery is predictable. Stakeholders are satisfied. So the urgency to improve disappears. "We're shipping consistently" becomes the ceiling instead of the floor. And the team settles into a rhythm of reliable output that's completely disconnected from business outcomes.

Stable teams ship features. Productive teams ship outcomes. The difference is whether engineering knows (and cares about) what happened after the code went live.

What moves a team from Stable to Productive:

This is where things get interesting, and where AI starts to play a real role.

Connect delivery metrics to business metrics. Stop measuring velocity and start measuring impact. Not "we shipped the integration" but "the integration unblocked three enterprise deals." Not "we improved onboarding" but "day-7 activation went from 45% to 62%." This requires the shared revenue instrumentation I wrote about in Why Your Engineering Team Isn't a Revenue Engine. It's Fix #1 for a reason.

Install RICE-weighted planning with revenue impact. Stable teams plan based on what's in the backlog. Productive teams plan based on what moves the business. Use RICE scoring (Reach, Impact, Confidence, Effort) and weight Impact toward revenue outcomes. The integration that unlocks a $50K enterprise deal should outrank the UI improvement that makes an internal admin page slightly nicer.

Give the team visibility into the pipeline. When your engineers know that three enterprise deals are blocked on SSO support, the SSO ticket stops being "that boring compliance thing" and becomes "the thing that unlocks $150K in pipeline." Context transforms motivation. I've watched engineers voluntarily reprioritize their sprint when they understood the revenue context. Nobody had to tell them to. They just needed to see the map.

Use AI to eliminate toil, not to generate features. This is the Stable-to-Productive AI play. Automated test generation for the parts of the codebase nobody wants to touch. AI-assisted incident triage that gives on-call context instantly instead of requiring 30 minutes of log diving. Intelligent monitoring that learns what "normal" looks like for your system and only alerts when something is genuinely wrong. CI/CD pipeline optimization that cuts build times so your team spends less of their day watching progress bars. None of this is glamorous. All of it gives engineers back hours per week that they can spend on the work that actually matters.

Create feedback loops that close. The most underrated practice in engineering: following up. You shipped the feature. Did anyone use it? Did it move the metric? Did the customer renew? If you don't close the loop, your team is flying blind. Build a monthly "impact review" where you look at what shipped 30-60 days ago and measure what actually happened. The first time your team sees real revenue data connected to their work, something clicks. Permanently.


Stable vs. Productive

Two-column comparison. Left: Stable — Ships on time, Predictable velocity, Features delivered, Measures output. Right: Productive — Ships what matters, Revenue-connected planning, Outcomes delivered, Measures impact. A bridge between them: Connect engineering delivery to business outcomes.


The timeline

Here's the honest timeline based on doing this three times:

Fragile to Stable: 8-12 weeks. This is the fastest phase because the interventions are structural and immediate. Install the cadence, enforce the rhythm, add observability, clarify ownership. The team feels the difference within the first two weeks. Stability takes about two months to become the default rather than the exception.

Stable to Productive: 3-6 months. This is slower because it requires cultural and systemic change, not just process installation. The team needs to internalize that "what happened after we shipped" matters as much as "did we ship." Revenue instrumentation needs to be built. Planning frameworks need to be retooled. Feedback loops need to run through at least two or three cycles before the team trusts them.

Total arc: about 6 months. Give or take. Company size, team distribution, and existing technical debt all affect the timeline. But I've consistently seen teams move through the full arc in two quarters when leadership commits to the sequence and doesn't skip Stable.

That last part is critical. Every team I've seen try to jump straight from Fragile to Productive has failed. You can't install revenue-connected planning on a team that doesn't know what it's delivering this week. Sequence matters. Discipline matters. The exciting stuff comes second.


The test

Ask your engineering team two questions:

"What are we delivering this quarter and why does the business care?" If the answer is clear and consistent across the team, you're at least Stable.

"What was the revenue impact of the last three things we shipped?" If the answer is clear and backed by data, you're Productive.

If both answers are blank stares, you know where to start. And now you know the sequence.

Rakesh Kamath

Rakesh Kamath is a scaling systems operator who helps SaaS companies install the engineering, operational, and financial infrastructure that makes growth durable.

More about working together