Microsoft’s multi-agent AI system tops Anthropic’s Mythos on cybersecurity benchmark

GeekWire — May 14, 00:16 AM

https://cdn.geekwire.com/wp-content/uploads/2026/05/cyber.png" width="968" />
Microsoft's new vulnerability-scanning system, codenamed MDASH, scored 88.45% on the CyberGym benchmark, surpassing single-model systems from Anthropic and OpenAI by using more than 100 specialized AI agents across multiple models. https://www.geekwire.com/2026/microsofts-multi-agent-ai-system-tops-anthropics-mythos-on-cybersecurity-benchmark/">Read More

Read full article at GeekWire →