Theory gets you started. Practice is where the learning happens.
Practice Areas
| Practice | Focus |
|---|---|
| Debugging Agents | Finding and fixing what went wrong |
| Evaluation | Measuring agent performance systematically |
| Cost and Latency | Managing the economics and speed of agent systems |
| Production Concerns | Running agents reliably at scale |
| Workflow Coordination | Structured metadata for agent coordination |
| Knowledge Evolution | Guidelines for updating knowledge base entries |
Questions to Answer Here
- What's the most important skill for someone building agents that isn't obvious at first?
- What practices do you wish you'd adopted earlier?
- How does the craft of agentic engineering differ from traditional software engineering?
- What does "experience" give you in this field? What can't be shortcut?
Your Development Workflow
How do you typically work when building or improving an agent?
High-Leverage Review
Review research and plans, not code line-by-line.
The most expensive mistakes in agentic systems happen at the conceptual level, not the implementation level. Multi-agent production systems demonstrate this pattern: agents operating from wrong assumptions execute flawed strategies flawlessly, propagating conceptual errors through thousands of generated lines. One misunderstood research conclusion causes more damage than any single code defect.
Traditional code review focuses on implementation details—checking syntax, catching edge cases, spotting potential bugs. That made sense when humans wrote every line. With agents generating code, the leverage point shifts dramatically upstream.
Where Human Expertise Has Maximum Impact
Early conceptual validation, not late implementation review:
- Research quality: Did the agent understand the domain correctly? Are its assumptions sound?
- Plan coherence: Does the proposed approach make sense? Will it solve the actual problem?
- Mental model alignment: Is the agent thinking about this problem the right way?
- Scope appropriateness: Is it solving too much? Too little?
An hour spent reviewing an agent's research findings catches misunderstandings before they propagate through thousands of lines of generated code. The same hour spent on line-by-line code review inspects the symptoms of conceptual errors without preventing them.
The Anti-Pattern
Skimming the plan, then carefully reviewing every generated line of code. This inverts the leverage:
- Accepting flawed thinking at the cheapest intervention point (the plan)
- Catching its consequences at the most expensive intervention point (the implementation)
- Letting the agent invest its context window in the wrong direction
- Wasting expertise on problems that static analysis or tests would catch
The pattern: Invest your review time where agents are weakest—conceptual reasoning, domain understanding, holistic design. Trust them for what they're strong at—consistent implementation of a well-defined plan.
Practical Implementation
- Review the research phase thoroughly: Read what the agent learned. Check its sources. Verify its conclusions.
- Review the plan critically: Does this approach make sense? What could go wrong? What's it not considering?
- Spot-check the implementation: Sample key sections to verify the plan is being followed. Don't read every line; one way to pick a sample is sketched after this list.
- Review by testing: Run it. Does it behave as the plan specified? If not, was the plan wrong or the implementation? A minimal test sketch follows the sampling example below.
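One way to operationalize the spot-check step is to sample rather than read exhaustively. The sketch below is a minimal, hypothetical helper that picks a few changed files from the agent's branch for human review; the base branch name and sample size are illustrative assumptions, not a prescription.

```python
# Minimal sketch of a spot-check sampler: pick a few changed files for human
# review instead of reading every generated line. The base branch and sample
# size are illustrative assumptions.
import random
import subprocess

def sample_changed_files(base: str = "main", k: int = 3) -> list[str]:
    """Return up to k randomly chosen files changed relative to `base`."""
    result = subprocess.run(
        ["git", "diff", "--name-only", base],
        capture_output=True, text=True, check=True,
    )
    changed = [line for line in result.stdout.splitlines() if line]
    return random.sample(changed, min(k, len(changed)))

if __name__ == "__main__":
    for path in sample_changed_files():
        print(f"Spot-check: {path}")
```

Weighting the sample toward files the plan identified as critical is a reasonable refinement.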
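And a minimal sketch of "review by testing": encode the plan's stated behavior as assertions and run them against the agent's output, so deviations surface without reading every line. The `summarizer` module, its `summarize` function, and the three requirements are hypothetical placeholders standing in for whatever the reviewed plan actually specified.

```python
# Minimal sketch of "review by testing": the plan's stated requirements become
# assertions. `summarizer.summarize` is a hypothetical agent-generated module;
# the three requirements below are illustrative, not real plan content.
from summarizer import summarize  # hypothetical module produced by the agent

def test_summary_respects_length_budget():
    # Plan requirement (illustrative): summaries never exceed 500 characters.
    assert len(summarize("word " * 10_000)) <= 500

def test_empty_input_is_handled_gracefully():
    # Plan requirement (illustrative): empty input yields an empty summary.
    assert summarize("") == ""

def test_title_is_preserved():
    # Plan requirement (illustrative): the document title appears verbatim.
    doc = "Quarterly Report\n\n" + "details " * 200
    assert "Quarterly Report" in summarize(doc)
```

A failing test then maps cleanly onto the question above: either the plan was wrong, or the implementation deviated from it.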
Mental model alignment matters more than individual line correctness. Correct thinking makes implementation debuggable. Flawed thinking means perfect code just automates misconceptions faster. This observation holds across production deployments: conceptual validation prevents more failures than implementation review catches.