status: complete audience: both chapter: 05 last_updated: 2026-04 contributors: [alexwill87, claude-cockpit] lang: en
5.14 -- Measuring ROI
Context
"The agent saves me time." OK, how much? If you don't measure, you don't know. And if you don't know, you can't justify the cost, optimize usage, or decide if it's worth continuing.
Measuring the ROI of an AI agent is concrete. Not philosophy.
The three axes of ROI
1. Time saved
The most tangible. Compare time with and without the agent.
How to measure
For 2 weeks, track:
# ROI Journal — Week of [date]
| Task | Without agent (estimated) | With agent (actual) | Saved |
|---|---|---|---|
| Daily health check | 10 min | 0 min (automated) | 10 min |
| Cockpit deployment | 15 min | 5 min | 10 min |
| Write commit message | 3 min | 30 sec | 2.5 min |
| Debug Docker error | 30 min | 10 min | 20 min |
| Generate API docs | 2h | 30 min | 1h30 |
| Rotate secrets | 20 min | 5 min | 15 min |
**Total week: ~3h30 saved**
Financial calculation
Time saved/week: 3.5 hours
Hourly rate: [your rate] EUR/h
Monthly value of time saved: 3.5h * 4 weeks * rate = X EUR
Agent cost/month: Y EUR (API + infra)
ROI = (X - Y) / Y * 100
Concrete example:
Time saved: 14h/month
Rate: 80 EUR/h
Value: 1120 EUR/month
Agent cost: ~50 EUR/month (API ~30 EUR + VPS ~20 EUR)
ROI: (1120 - 50) / 50 = 2140%
Even with conservative estimates, ROI is generally massive. The problem isn't ROI, it's measuring it to know where to optimize.
2. Improved decisions
Harder to quantify, but real. The agent helps you make better decisions because it:
- Analyzes data you wouldn't have looked at.
- Detects patterns you wouldn't have seen.
- Proposes alternatives you wouldn't have considered.
How to measure
Track decisions where the agent added value:
# Assisted decisions — [month]
1. The agent detected that the SSL certificate would expire in 15 days.
Without agent: I probably would have forgotten. Potential impact: site down.
2. The agent suggested condensing the system prompt (480 -> 150 words).
Result: token cost reduced by 40%.
3. The agent identified a pattern of doc tasks piling up.
Result: I blocked 2h/Thursday for docs. Doc tasks no longer drag on.
No need to quantify each decision. Simply noting them shows the value.
3. Errors prevented
Each error prevented has a hidden cost: correction time, user impact, stress.
How to measure
Track situations where the agent prevented an error:
# Errors prevented — [month]
1. I asked for a push --force on main. The agent refused (boundary prompt).
Cost avoided: potentially hours of recovery + lost commits.
2. The health check detected a disk at 85% before it caused an outage.
Cost avoided: unplanned downtime.
3. The agent detected an inconsistency in the config before deployment.
Cost avoided: deploying a buggy version.
Monthly dashboard
Each month, compile a summary:
# Agent ROI — [Month YYYY]
## Time
- Hours saved: XX h
- Value: XX EUR
- Top 3 automated tasks:
1. [task] — [time saved]
2. [task] — [time saved]
3. [task] — [time saved]
## Cost
- API: XX EUR
- Infra: XX EUR
- Total: XX EUR
## ROI
- Net: XX EUR
- Percentage: XX%
## Decisions
- X notable assisted decisions
## Errors prevented
- X errors prevented
## Actions
- [What we can improve next month]
When ROI is negative
It happens. Signs:
- You spend more time fixing the agent than doing the work yourself.
- API cost exceeds the value of time saved.
- The agent introduces errors you have to debug.
In this case, the problem usually isn't the agent. It's:
- The system prompt: misconfigured, so the agent doesn't do what you want.
- The workflow: not adapted, so unnecessary friction.
- Wrong usage: you're using the agent for tasks where it doesn't add value.
Before giving up, review these three points. If ROI stays negative after optimization, reduce usage to tasks where the agent is clearly useful.
Common mistakes
Never measure. "It's faster, I can feel it." Feelings aren't enough. Measure at least once to have a baseline.
Measure time only. Time saved is most visible but not the only factor. Improved decisions and prevented errors count too.
Compare with zero. "Without the agent, I'd do the same thing manually." Not always. Some tasks (pattern analysis, automated monitoring) simply wouldn't be done without the agent.
Forget the learning curve. In the first month, ROI is negative because you're learning to configure. That's normal. Measure starting month 2.
Steps
- Track your usage for 2 weeks (ROI journal).
- Calculate time saved and its value.
- Calculate monthly cost (API + infra).
- Note assisted decisions and prevented errors.
- Compile the monthly dashboard.
- Identify optimizations for next month.
Checklist
- [ ] An ROI journal exists with at least 2 weeks of data.
- [ ] Time saved is quantified in hours and euros.
- [ ] Monthly cost is known (API + infra).
- [ ] ROI is positive (or reasons for negative are identified).
- [ ] A monthly dashboard is compiled.
Proposer une modification sur GitHub