Discussion about this post

Max Andreacchi

I appreciate the breadth you cover on this topic. A lot of research shows agentic workflow developers introducing LLM judges in a security capacity, and I think your allusion to “not letting the perfect be the enemy of the good” is the best lens to view this through. Defense-in-depth remains very much alive and well, especially in an era where risk isn’t mitigated with a deterministic patch solution. Great post!

richardstevenhack

Also, human-in-the-loop does not scale to handling armies of agents.

"Agents are building the tools that other agents use"

And it's a known fact that AIs produce insecure code frequently.

Double-whammy.
