Bassim Eledath published a practitioner-level framework on March 10, 2026 outlining eight discrete levels of AI-assisted software engineering, arguing that the gap between AI coding capability and team productivity closes not gradually but in distinct jumps. The framework runs from Level 1 (tab-complete tools like GitHub Copilot) through Level 2 (AI-focused IDEs like Cursor), then into Level 3, context engineering (the discipline of crafting high-information-density prompts and rules files like CLAUDE.md), and Level 4, compounding engineering, a plan-delegate-assess-codify loop popularized by Kieran Klaassen that makes each future agent session better than the last. The upper levels introduce <a href="/news/2026-03-15-axe-go-binary-toml-llm-agents-unix-pipes">MCP and skills integration at Level 5</a>, automated test and linter feedback loops ("harness engineering") at Level 6, autonomous background agents running via what Eledath calls the "Ralph Loop" at Level 7, and full <a href="/news/2026-03-14-agentlog-lightweight-kafka-like-event-bus-for-ai-agent-orchestration-via-jsonl">multi-agent orchestration at Level 8, where an orchestrator coordinates parallel background workers</a>.
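The article doesn't reproduce a rules file, but a Level 3 artifact like CLAUDE.md is typically a short markdown document of project facts and codified lessons. A hypothetical sketch (the project commands and conventions below are invented for illustration, not drawn from Eledath's piece):

```markdown
# CLAUDE.md — project rules (hypothetical example)

## Commands
- Build: `make build`
- Test: `make test` (run before every commit)

## Conventions
- Python 3.12; type hints required on public functions
- No new dependencies without approval in the plan step

## Codified lessons (the Level 4 "codify" output)
- Use `httpx`, not `requests`: the async paths depend on it
- Migrations live in `db/migrations/`; never edit one that has shipped
```

The "codified lessons" section is where the compounding-engineering loop deposits what it learns, and, as the critique below notes, where the "what without the why" problem accumulates.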
A central organizing insight in the piece is what Eledath calls the "multiplayer effect": a developer operating at Level 7, raising overnight PRs via background agents, is throttled if a Level 2 colleague controls merge approvals. Team-level adoption matters as much as individual technical progression, and Eledath cites Anthropic's own engineering team shipping a product called Cowork in 10 days as the benchmark for what organizations operating at the top levels can achieve. Companies like Block and Vercel are referenced as practitioners of advanced agentic workflows, with Block having built an internal skills marketplace of over 100 shared agent capabilities.
Hacker News commentary surfaced several substantive critiques. Commenter vidimitrov flagged a meaningful risk in the Level 4 codify step: rules files like CLAUDE.md capture what decisions were made but not why, creating conditions for "confident mistakes" when background agents at Level 7 follow stale rules without understanding the original tradeoffs. That commenter argued git commit history is a more natural home for decision rationale, and that structuring commit history for agent retrieval is "genuinely underexplored territory." A separate skeptical thread questioned the strategic logic of delegating software development to agents at all, asking why any company that builds such a capability wouldn't simply pivot to selling it as a product given the larger addressable market.
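vidimitrov's alternative can be made concrete. A minimal sketch, assuming an invented convention (not from the thread) of a `Rationale:` paragraph in commit messages that an agent greps for before touching a file:

```shell
# Sketch: commit messages as agent-retrievable decision memory.
# The "Rationale:" convention here is hypothetical, for illustration only.
set -e
repo=$(mktemp -d) && cd "$repo"
git init -q
git config user.email dev@example.com
git config user.name Dev

echo 'retry with jittered backoff' > client.py
git add client.py
# First -m is the subject (the "what"); second -m is the body (the "why").
git commit -q \
  -m "Add retry with jittered backoff to API client" \
  -m "Rationale: upstream rate-limits bursty traffic; synchronized
fixed-interval retries amplified a prior outage, so we add jitter."

# Before editing client.py, an agent can retrieve the why, not just the what:
git log --format='%b' -- client.py | grep 'Rationale:'
```

Unlike a rules file, this record is scoped to the change that motivated it and survives refactors through `git log -- <path>`, which is roughly the retrieval structure the commenter calls underexplored.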
Vidimitrov's critique, that rules files capture decisions but not reasoning and so prime autonomous agents for confident mistakes, is the most important open problem the framework raises. For engineering leaders, the multiplayer effect is the operative frame: advancing at scale requires pulling the whole team up, not just empowering the most advanced practitioners. And building institutional memory that agents can reason from, not merely retrieve, is the work no maturity model has fully answered.