Anthropic Claude Mythos Preview: Project Glasswing AI Security

Anthropic's Claude Mythos Preview is officially stepping into the cybersecurity arena to autonomously hunt down critical system vulnerabilities before malicious actors can exploit them. Through a newly announced initiative called Project Glasswing, the AI giant is partnering with industry heavyweights to deploy this frontier model exclusively for defensive security.

The initiative brings together a massive coalition of tech leaders, including Nvidia, Google, Amazon Web Services, Apple, and Microsoft. Access to the model is strictly limited to these launch partners to prevent adversaries from weaponizing its capabilities. According to Newton Cheng, the cyber lead for Anthropic’s frontier red team in an interview with The Verge, the goal is to give cyber defenders a crucial head start against potential threats.

Autonomous Threat Detection and Exploits

While not explicitly trained solely for cybersecurity, the model leverages strong agentic coding and reasoning skills to analyze complex software infrastructure. Anthropic reports that the AI has already flagged thousands of high-severity vulnerabilities across every major operating system and web browser. Strikingly, the model identified these flaws and developed related exploits entirely autonomously, requiring zero human steering.

Cheng declined to share specific details of the model’s cybersecurity successes beyond the company’s publicly-released examples. However, the sheer scale of its autonomous detection capabilities marks a significant leap in how enterprise networks can be secured without constant manual oversight.

Financial Backing and Government Briefings

To accelerate adoption, Anthropic is heavily subsidizing the initial rollout of the program. The company is committing up to $100 million in usage credits for its partners, alongside $4 million in direct donations to the Linux Foundation and the Apache Software Foundation. Other key partners in the Glasswing initiative include JPMorgan Chase, Broadcom, Cisco, CrowdStrike, and Palo Alto Networks, joining roughly 40 other infrastructure organizations.

The official reveal follows a data leak last month that prematurely exposed the model's existence. Dianne Penn, head of product management at Anthropic, clarified to The Verge that the leak was due to human error and entirely unrelated to software vulnerabilities. Furthermore, Anthropic has briefed senior US government officials regarding the model's offensive and defensive cyber capabilities, maintaining ongoing discussions amid recent political clashes.

The Strategic Shift Toward AI Security Agents

Anthropic’s decision to restrict Claude Mythos Preview to defensive partners highlights a growing industry awareness of the dual-use nature of advanced AI. By subsidizing the initial $100 million in usage credits, the company is effectively building a massive, real-world testing ground for autonomous security agents.

If the model proves indispensable for giants like Microsoft and CrowdStrike, this subsidized program will likely transition into a highly lucrative enterprise tier. This move not only positions Anthropic as a critical infrastructure protector but also sets a new standard for how frontier models are deployed in high-stakes environments.