Specs as Source Code | Agentic Engineering

Throwing away prompts after generating code is like checking in compiled binaries while discarding source.

This mental model, articulated by Sean Grove, reframes how we think about specifications, research documents, and plans in agentic systems.

The Core Shift

In traditional programming:

Source code is the truth
Documentation is secondary
Code is machine-readable and executable

In agentic programming:

Specs are the truth
Generated code is secondary (can be regenerated)
Specs are machine-readable and executable (by agents)

Traditional:                    Agentic:

┌──────────────┐               ┌──────────────┐
│ Source Code  │               │ Specification│
└──────┬───────┘               └──────┬───────┘
       │ compile                      │ agent reads
       ▼                              ▼
┌──────────────┐               ┌──────────────┐
│   Binary     │               │ Generated    │
│ (throwaway)  │               │ Code         │
└──────────────┘               │ (throwaway)  │
                               └──────────────┘

When you discard the prompt that generated working code, you've lost the source. You're left maintaining compiled output.

Your Mental Model

Specs are machine-readable, testable, enforceable contracts. Not wishful thinking in a Google Doc—they're the primary programming surface. Agents read specs, not vibes.

In agentic systems, 80-90% of programming is structured communication. The specification IS the program. The code is just one artifact the specification produces.

This changes what you version control, what you review, and what you test.

What This Looks Like in Practice

Specs Become First-Class Artifacts

# Traditional project structure
src/
  main.py          # This is what you maintain
  utils.py
docs/
  notes.md         # Throwaway reference
  plan.txt         # Deleted after implementation

# Agentic project structure
specs/
  architecture.md  # Source of truth - version controlled
  requirements.md  # Tested against implementation
  plan.md          # Executable by plan-build-review
src/
  main.py          # Can be regenerated from specs
  utils.py         # Generated code, not hand-maintained

You Version Control Prompts Like Code

Research documents, planning artifacts, and agent instructions are checked into version control because they're the source code:

git diff specs/authentication-flow.md
 
- Agent should validate JWT tokens
+ Agent should validate JWT tokens and check revocation list
+ See security-requirements.md section 3.2 for revocation protocol

This diff is more important than the code diff it produces. The spec change is the actual change. The code change is a compilation artifact.

You Review and Test Specs

Code review becomes spec review:

# PR: Add user authentication
 
Changes to specs/:
  + authentication-flow.md
  + security-requirements.md
  + error-handling.md
 
Generated implementation in src/:
  + auth.py (generated from specs)
  + tests.py (generated from specs)

The reviewer focuses on whether the spec is correct, complete, and testable. If the spec is right, the code can always be regenerated.

You write tests that validate the spec was followed:

def test_auth_follows_spec():
    """Verify implementation matches authentication-flow.md section 2."""
    spec = load_spec("authentication-flow.md")
    assert implementation.validates_jwt == spec.requires_jwt_validation
    assert implementation.checks_revocation == spec.requires_revocation_check

The Implications

Research Documents ARE Source Code

That document where you researched authentication approaches? That's not a throwaway—it's the source of truth for why the system works the way it does.

When you need to change authentication later, you don't dig through code trying to reverse-engineer the reasoning. You read the research document, update it with new findings, and regenerate.

Plans ARE Source Code

The plan you wrote before implementing a feature isn't scaffolding to discard. It's executable source code for the plan-build-review pattern.

# feature-plan.md
 
## Approach
Use Redis for session storage with 24-hour TTL.
 
## Dependencies
- Redis client library
- Session serialization logic
 
## Implementation Steps
1. Add Redis connection pool
2. Implement session CRUD operations
3. Add TTL configuration

An agent can execute this plan directly. It's not pseudo-code—it's the program.

Documentation IS Executable

When documentation is machine-readable and structured correctly, agents can use it directly:

# api-spec.md
 
## Endpoint: POST /users
**Auth required**: Yes
**Rate limit**: 100/hour
**Parameters**:
- email: string, required, must be valid email
- name: string, required, 1-100 chars

This isn't just human documentation. An agent building a client can read this spec and generate correct implementation. An agent testing the API can verify the implementation matches the spec.

The spec is the source. Everything else derives from it.

When to Apply This Model

Good Fit

Multi-agent systems: When multiple agents need to coordinate, specs become the shared interface. Agents read the same specs humans do.

Long-lived projects: When you'll maintain code for months/years, specs as source code means future agents can understand the system by reading specs, not archaeologically excavating code.

Generated code: When agents generate implementation, the spec is what you maintain. The generated code is disposable.

Complex domains: When the "why" is as important as the "what," specs capture reasoning that code can't express.

Poor Fit

One-off scripts: For throwaway automation, the mental overhead of treating specs as source code isn't worth it. Just write the script.

Exploratory prototypes: When you're still figuring out what to build, heavyweight specs slow you down. Prototype first, spec later.

Stable, finished systems: If the code is done and won't change, maintaining parallel specs is overhead without benefit.

Common Pitfalls

Spec Drift

Problem: Specs and implementation diverge. The spec says one thing, the code does another.

Solution: Test that implementation matches specs. Make spec updates part of your change workflow. If code changes without spec update, the PR is incomplete.

Over-Specification

Problem: Specs become so detailed they're harder to maintain than code.

Solution: Specs should capture intent and constraints, not line-by-line implementation. Leave room for agent judgment.

Vague Specs

Problem: Specs are too high-level to be executable. Agents can't generate correct code from them.

Solution: Think "testable." If you can't test whether the spec was followed, it's too vague. Add concrete examples and constraints.

Connections

To Context as Code: This mental model extends beyond specs to all context artifacts—knowledge bases, expertise files, and system prompts should be treated as source code
To Plan-Build-Review: The plan IS the source code—it's not throwaway scaffolding
To Self-Improving Experts: Expertise files are specs for agent behavior
To Knowledge Evolution: Knowledge bases are specs for how to think about domains
To Prompt Structuring: Structured prompts are executable specifications