Discussion about this post

User's avatar
richardstevenhack's avatar

Cloudflare tested Mythos against 50 internal code repositories and concluded the jump from prior frontier models is “not just a refinement of what came before."

I have long suspected that Anthropic explicitly designed Mythos to have that capability. While other LLMs of the current generation are nearly as good, Mythos in my opinion was designed to be a cybersecurity model - precisely to enable Anthropic to penetrate the US government market after the whole Pentagon fiasco.

I can't PROVE that, of course, without access to Anthropic internal documents.

As for prompt injection, I thought that was already conceded to be an unsolvable problem.

It gets worse if you follow AI cybersecurity expert Disesdi Shoshana Cox here on Substack, who says basically most of "AI security" is impossible and all of "AI red teaming" is fake.

No posts

Ready for more?