5d ago

Firefox team fixes 423 vulnerabilities using Anthropic's Claude Mythos Preview

0

Mozilla's Firefox team fixed 423 security vulnerabilities in April 2026 using Anthropic's Claude Mythos Preview. The single-month total exceeded fixes across the prior 15 months, where monthly totals ranged from 17 to 76. The AI tool detected and patched complex issues like sandbox escapes. Anthropic Head of Developer Relations Alex Albert detailed the effort.

Original post

With the help of Claude Mythos Preview, the Firefox team fixed more security bugs in April than in the past 15 months combined.

12:20 PM · May 7, 2026 View on X
Reposted by

@emollick I mean this is not at all evidence that some post train of Mythos was tooo dangerous for the public

Ethan Mollick@emollick

So Mythos was, indeed, not marketing hype. Remember this is a general purpose model that just happens to be good at finding exploits because good models are good at lots of things. Expect similar from OpenAI & Google. And from open models in 8 months. https://hacks.mozilla.org/2026/05/behind-the-scenes-hardening-firefox/

10:44 PM · May 7, 2026 · 439.1K Views
2:05 AM · May 8, 2026 · 21.6K Views

I agree that fundamentally, AI favors defense in cyber. We know that theoretically, bug free software (at least in narrow sense of conforming with spec) can exist, and as we find and fix more bugs, we can approach this asymptote.

Logan Graham@logangraham

I'm optimistic this eventually favors defense over offense. We wanted to start this transition cautiously. I've honestly been inspired by what orgs have been able to do with Mythos. More to come!

12:07 AM · May 8, 2026 · 42.2K Views
2:00 AM · May 8, 2026 · 22.6K Views

There is another reason why AI favors the defender. A lot of small but important systems (e.g. hospitals) are insecure because they are limited in the security staff they can hire. Now you can have a team of a 100 top security engineers watching over your system.

Boaz Barak@boazbaraktcs

I agree that fundamentally, AI favors defense in cyber. We know that theoretically, bug free software (at least in narrow sense of conforming with spec) can exist, and as we find and fix more bugs, we can approach this asymptote.

2:00 AM · May 8, 2026 · 22.6K Views
2:05 AM · May 8, 2026 · 2.5K Views

@tszzl I guess I didn't interpret that as the claim? I think Anthropic has its own ideas of what constitutes a threat, for better or worse - a model finding a large number of exploits. This seems to suggest that Mythos can do this (I do not think it is the only model that can do this)

roon@tszzl

@emollick I mean this is not at all evidence that some post train of Mythos was tooo dangerous for the public

2:05 AM · May 8, 2026 · 21.6K Views
2:36 AM · May 8, 2026 · 16K Views

And also https://www.paloaltonetworks.com/blog/2026/05/frontier-ai-defense/

Ethan Mollick@emollick

So Mythos was, indeed, not marketing hype. Remember this is a general purpose model that just happens to be good at finding exploits because good models are good at lots of things. Expect similar from OpenAI & Google. And from open models in 8 months. https://hacks.mozilla.org/2026/05/behind-the-scenes-hardening-firefox/

10:44 PM · May 7, 2026 · 439.1K Views
9:10 PM · May 8, 2026 · 11.4K Views

@DKThomp my view, somewhere in middle

Gary Marcus@GaryMarcus

My view on Mythos was and is somewhere in between. It’s real, it’s a wakeup call, and it’s not quite what some of the media coverage suggested. • Mythos is not going to allow an 8 year old to accidentally take down a power grid. (Someone writing for the NYT thought that was a real possibility). • The new Mozilla data certainly show that Mythos is better then its predecessors at detecting bugs. • But the UK AI institute’s study showed (and I still think this is true) that well-secured systems are not immediately at risk. • Consistent with all this the Mozilla report notes “Note that a number of these bugs are sandbox escapes, which would need to be combined with other exploits to achieve a full-chain Firefox compromise” I stand by my middle view; it’s not marketing hype, but it’s not quite as potent as some people thought. One other thing to note is that whatever advances there are not necessarily general to many or all domains; we will have to wait and see on that.

12:54 AM · May 8, 2026 · 43K Views
4:19 AM · May 8, 2026 · 2.1K Views

This is confused, but popular.

Popular because it tells a bunch of people what they want to hear.

Confused for a couple reasons: first, Mythos probably isn’t a pure LLM. (Claude Code isn’t, and it probably uses some similar techniques). [Also critics such as myself never called LLMs a “scam”; rather we said that LLMs need to be supplemented with other techniques, and wouldn’t be enough on their own.]

And on @EpochAIResearch’s important ECI benchmark it’s NOT hugely better than other models.

It’s better at bug finding, but doesn’t mean it’s solved hallucinations, boneheaded reasoning errors etc.

prinz@deredleritt3r

Old enough to remember when the prevailing view on AI was that LLMs are a scam, actually, and the bubble is about to pop (6 months ago)

8:22 PM · May 7, 2026 · 296.3K Views
1:27 AM · May 9, 2026 · 17K Views

My view on Mythos was and is somewhere in between. It’s real, it’s a wakeup call, and it’s not quite what some of the media coverage suggested.

• Mythos is not going to allow an 8 year old to accidentally take down a power grid. (Someone writing for the NYT thought that was a real possibility).

• The new Mozilla data certainly show that Mythos is better then its predecessors at detecting bugs.

• But the UK AI institute’s study showed (and I still think this is true) that well-secured systems are not immediately at risk.

• Consistent with all this the Mozilla report notes “Note that a number of these bugs are sandbox escapes, which would need to be combined with other exploits to achieve a full-chain Firefox compromise”

I stand by my middle view; it’s not marketing hype, but it’s not quite as potent as some people thought.

One other thing to note is that whatever advances there are not necessarily general to many or all domains; we will have to wait and see on that.

Ethan Mollick@emollick

So Mythos was, indeed, not marketing hype. Remember this is a general purpose model that just happens to be good at finding exploits because good models are good at lots of things. Expect similar from OpenAI & Google. And from open models in 8 months. https://hacks.mozilla.org/2026/05/behind-the-scenes-hardening-firefox/

10:44 PM · May 7, 2026 · 439.1K Views
12:54 AM · May 8, 2026 · 43K Views

@emollick my somewhat intermediate take; also re your second paragraph I don’t think we actually know how much bug-detecting-specific harnessing there is. so i am not sure your second sentence will stand the test of time. some data i saw briefly suggest otherwise, but we will have to see.

Gary Marcus@GaryMarcus

My view on Mythos was and is somewhere in between. It’s real, it’s a wakeup call, and it’s not quite what some of the media coverage suggested. • Mythos is not going to allow an 8 year old to accidentally take down a power grid. (Someone writing for the NYT thought that was a real possibility). • The new Mozilla data certainly show that Mythos is better then its predecessors at detecting bugs. • But the UK AI institute’s study showed (and I still think this is true) that well-secured systems are not immediately at risk. • Consistent with all this the Mozilla report notes “Note that a number of these bugs are sandbox escapes, which would need to be combined with other exploits to achieve a full-chain Firefox compromise” I stand by my middle view; it’s not marketing hype, but it’s not quite as potent as some people thought. One other thing to note is that whatever advances there are not necessarily general to many or all domains; we will have to wait and see on that.

12:54 AM · May 8, 2026 · 43K Views
12:55 AM · May 8, 2026 · 6.4K Views

Post here, including example vulnerabilities and tips for other orgs on how to build in-house harnesses that let you make the most of new models: https://hacks.mozilla.org/2026/05/behind-the-scenes-hardening-firefox/

Helen Toner@hlntnr

One of the things that made the Mythos release hard to interpret is that Anthropic held back details on most vulns they found, to give defenders time to patch. 1 month later, info from orgs with access to Mythos is starting to trickle out, e.g. this post from Mozilla today:

8:03 PM · May 7, 2026 · 224K Views
8:03 PM · May 7, 2026 · 17.2K Views

Now imagine they had used this knowledge for nefarious purposes.

Kudos to Anthropic, but I really think these high-stakes situations should not rely on "company defies their incentives to do the right thing"

Helen Toner@hlntnr

One of the things that made the Mythos release hard to interpret is that Anthropic held back details on most vulns they found, to give defenders time to patch. 1 month later, info from orgs with access to Mythos is starting to trickle out, e.g. this post from Mozilla today:

8:03 PM · May 7, 2026 · 224K Views
4:38 PM · May 8, 2026 · 2.5K Views

#ICMYI today's AI is the worst AI we'll ever have.

A Kick the Tires period - as @deanwball and I call for in @lawfare - would ensure others could take such steps and expedite the process of seeing the latest and greatest AI tools be deployed.

Alex Albert@alexalbert__

With the help of Claude Mythos Preview, the Firefox team fixed more security bugs in April than in the past 15 months combined.

7:20 PM · May 7, 2026 · 1.2M Views
7:51 PM · May 7, 2026 · 3.8K Views

Gotta do the right thing even when getting shit on for literally no reason

Ethan Mollick@emollick

So Mythos was, indeed, not marketing hype. Remember this is a general purpose model that just happens to be good at finding exploits because good models are good at lots of things. Expect similar from OpenAI & Google. And from open models in 8 months. https://hacks.mozilla.org/2026/05/behind-the-scenes-hardening-firefox/

10:44 PM · May 7, 2026 · 439.1K Views
1:56 AM · May 8, 2026 · 13.8K Views