
The Spec Says What. Output Requirements Say “Good.”
A specification defines the work to be performed. It does not define what a successful result looks like — and that gap is where human and AI deliverables quietly fail. Output Requirements close it: they define the characteristics of a good deliverable and the evidence needed to prove it created value.
There is a particular kind of disappointment every leader has felt. You assign work. It comes back having technically done what you asked — and it is still wrong. Too long, or too shallow, in the wrong format, missing the evidence you needed, optimized for something you didn’t care about. The work matched the instruction. It did not match what “good” actually meant. That gap — between a task completed and a deliverable that creates value — is one of the oldest problems in management. It is about to become one of the most important, because the mixed human-AI workforce makes it both more frequent and more expensive.
The Definition-of-Done Gap.
A specification answers “what work should be performed?” It is necessary and it is not sufficient, because two deliverables can both satisfy the same specification and differ enormously in value. One is a three-page executive brief a board can act on; the other is a thirty-page technical dump that buries the decision. Same task. Same spec. Wildly different outcomes. The difference is everything the specification didn’t say: how detailed, in what tone, in what format, optimized for what, proven by what evidence, measured against what standard.
For human workers, we have always papered over this gap with tacit knowledge. A seasoned analyst “just knows” the CFO wants one page, knows which format the board expects, knows what counts as sufficient evidence. That tacit knowledge is invisible, unevenly distributed, and impossible to scale — but it mostly works, because humans absorb organizational norms over time.
A Model Optimizes Exactly What You Specify.
An AI agent has no tacit knowledge of your organization. It does not “just know” that your executives want brevity, that your industry requires citations, or that a deliverable without a measurable success criterion is worthless to you. It optimizes precisely what you make explicit — and it silently drops everything you leave implicit.
This is the hidden multiplier. A human who receives an underspecified task fills the gap with judgment and organizational instinct, often correctly. An AI fills the same gap with statistical plausibility — a confident, well-formatted output that satisfies the literal task while missing the unstated standard entirely. The more capable the agent, the more convincingly it produces the wrong kind of right answer. Output Requirements are how you convert the tacit definition of “good” into something a digital worker can actually act on — and, not incidentally, something a human worker no longer has to guess at.
What It Looks Like in Practice.
The same specification, with and without output requirements, produces two different working relationships with the contributor.
- Length unknown — could be 2 pages or 40
- Tone unknown — board-ready or raw analysis?
- Format unknown — deck, doc, spreadsheet?
- Evidence optional — claims may be unsupported
- “Done” undefined — done when it says so
- Response style: executive summary
- Tone: consultative, board-appropriate
- Evidence: each driver cited to a data source
- Success criterion: stakeholder-ready
- Risk threshold: Green before escalation
The second contributor — human or AI — can succeed on the first attempt. The first will iterate, guess, and frustrate. Output Requirements are not bureaucracy. They are the difference between delegation that works and delegation that loops.
The Twelve Components, in Four Clusters.
A complete Output Requirements framework has twelve components. Handed over as a flat list they overwhelm; understood in four clusters they become a usable discipline. Each cluster answers a different question about the deliverable.
Cluster One — Shape: what the deliverable is
How the output presents itself
- Response Style — the level of detail and complexity: executive summary, concise, detailed, comprehensive, or technical deep dive. Matches the deliverable to stakeholder need.
- Tone — the communication register: professional, formal, educational, consultative, executive, or conversational. Aligns voice to audience.
- Format — the physical structure: Markdown, HTML, Word, slide outline, JSON, YAML, tables, bullets. Improves usability and interoperability.
Cluster Two — Direction: what to optimize and how to handle it
How trade-offs and artifacts are managed
- Focus Areas — the priorities for optimization: task completion, learning transfer, accuracy, code quality, maintainability, governance, accessibility, cost-effectiveness, strategic alignment. Guides the trade-offs a contributor must make under pressure.
- File Management — artifact handling: naming conventions, save location, version control, archiving, output directory. Ensures consistent, findable document management.
Cluster Three — Proof: how success is demonstrated
How the deliverable proves it is good
- Success Criteria — measurable completion standards: functional requirements met, acceptance testing passed, stakeholder approval, performance thresholds achieved. Establishes a clear definition of done.
- Assumptions — conditions believed true: data availability, stakeholder participation, system compatibility. Surfaces hidden risk before it bites.
- Evidence Requirements — the supporting documentation: data sources, citations, test results, validation reports, expert reviews. Converts assertion into accountability.
Cluster Four — Value: how it connects to the organization
How the deliverable links to measurable value
- Evaluation Measures — assessment methods: quality metrics, KPIs, Kirkpatrick Levels, Phillips ROI, customer satisfaction, performance indicators. Measures effectiveness, not just completion.
- Continuous Improvement — feedback and refinement: lessons learned, root-cause analysis, process improvements, future recommendations. Turns one deliverable into organizational learning.
- Strategic Alignment — the connection to purpose, traced through the full value chain. Ensures the output contributes measurable value rather than merely completing a task.
Risk Thresholds — the Stoplight That Governs Escalation.
The twelfth component deserves its own treatment, because it is the governance mechanism that makes the rest operational. A risk threshold tells a contributor — human or AI — when a deliverable is ready to ship, when it needs revision, and when it must be escalated rather than released. Expressed as a stoplight, it becomes a shared, unambiguous decision rule.
Ready for implementation. The deliverable meets its success criteria and can proceed.
Minor revisions recommended. Functional, but improvement is needed before it represents the organization’s standard.
Significant deficiencies requiring corrective action. Not for release; escalate rather than ship.
The power of the stoplight in a mixed workforce is that it gives an AI agent and a human reviewer the same escalation language. An agent instructed to deliver only Green-rated work, and to escalate anything Red, behaves like a contributor who knows the difference between “done” and “good enough to send up the chain.” That is governance encoded into the deliverable itself.
The Strategic Alignment Chain.
Strategic Alignment is the component that prevents the whole framework from collapsing into busywork. It traces every deliverable back to organizational purpose along a single chain:
From Vision to Continuous Performance Improvement
- Vision → Mission → Strategy → Initiative → Capability → Action → Output → Outcome → Value → ROI → Evaluation → Continuous Performance Improvement
An output requirement that cannot be located somewhere on that chain is a signal worth heeding — it suggests the deliverable may complete a task without contributing measurable value. The chain is the test: if you cannot trace the output to an outcome and the outcome to value, you are generating activity, not results.
Specifications define the work. Output Requirements define success. Evaluation verifies value. Continuous improvement strengthens future performance.
Why This Is the Missing Middle.
Read across the framework and a structure appears that mirrors how well-run organizations have always operated — and that the mixed workforce now makes explicit and mandatory.
Specification
Defines the work to be performed. The job description of the contributor.
Output Requirements
Define what a successful deliverable looks like and the evidence that proves value.
Evaluation
Verifies the value was actually created, and feeds continuous improvement.
Most organizations have specifications, in some form, and most are beginning to build evaluation, under budget pressure. The middle term is the one almost everyone skips — and it is the one that determines whether the work that satisfies the spec actually satisfies the organization. Output Requirements are the missing middle between defining work and verifying value.
What to Do First.
Add output requirements to one recurring deliverable
Pick a report or work product your team produces regularly and define its output requirements once — style, tone, format, success criteria, evidence, risk threshold. Reuse it every cycle. The first definition is the only expensive one.
Give every agent a risk threshold
No AI agent should deliver work without a stoplight rule telling it what to ship, what to revise, and what to escalate. This single instruction converts a confident output machine into a governed contributor.
Make the alignment chain a release gate
Before accepting a deliverable, trace it along the Vision-to-Value chain. If it cannot be located there, you have found activity disguised as results — catch it before it consumes more capacity.
The objective was never to generate outputs. It was to generate value.
A specification gets the work done. Output requirements get it done well. Evaluation proves it mattered. Continuous improvement makes the next one better.
In a workforce that is part human and part digital, the organizations that define success precisely will be the ones whose outputs reliably become outcomes.
