AI agents now write more code, faster, than any team can meaningfully review.
The new code looks right when it’s right and looks right when it’s
wrong; at this volume, review can’t tell which is which. In
New Relic’s June 2026 State of AI Coding survey, 94% of technology leaders rated AI-generated code higher quality than
human-written code at review; 78% reported more production incidents once it
shipped. The industry calls that spread the verification gap
(Sonar’s January 2026 State of Code: 96% of developers don’t fully trust AI-generated code, yet only 48%
always verify it before committing), and the work accumulating inside it is
what Werner Vogels calls
verification debt.
The bottleneck in software was never generation. It’s verification: knowing
what a system does, proving what changed, and deciding whether to believe a
quality claim. We built machinery for that.