Firefox team fixes 423 vulnerabilities using Anthropic's Claude Mythos Preview

@emollick I mean this is not at all evidence that some post train of Mythos was tooo dangerous for the public

So Mythos was, indeed, not marketing hype. Remember this is a general purpose model that just happens to be good at finding exploits because good models are good at lots of things. Expect similar from OpenAI & Google. And from open models in 8 months. https://hacks.mozilla.org/2026/05/behind-the-scenes-hardening-firefox/

10:44 PM · May 7, 2026 · 439.1K Views

2:05 AM · May 8, 2026 · 21.6K Views

REPLYDH #114Dan Hendrycks@HENDRYCKS

@logangraham Eventually could be a long time, especially in critical infrastructure, much of which is running on Windows 7.

Logan Graham@logangraham

I'm optimistic this eventually favors defense over offense. We wanted to start this transition cautiously. I've honestly been inspired by what orgs have been able to do with Mythos. More to come!

12:07 AM · May 8, 2026 · 42.2K Views

12:50 AM · May 8, 2026 · 1.6K Views

QUOTE POSTBB #118Boaz Barak@BOAZBARAKTCS

Credit where credit is due. This is genuinely impressive.

Alex Albert@alexalbert__

With the help of Claude Mythos Preview, the Firefox team fixed more security bugs in April than in the past 15 months combined.

7:20 PM · May 7, 2026 · 1.2M Views

10:17 PM · May 7, 2026 · 29.4K Views

QUOTE POSTBB #118Boaz Barak@BOAZBARAKTCS

I agree that fundamentally, AI favors defense in cyber. We know that theoretically, bug free software (at least in narrow sense of conforming with spec) can exist, and as we find and fix more bugs, we can approach this asymptote.

Logan Graham@logangraham

I'm optimistic this eventually favors defense over offense. We wanted to start this transition cautiously. I've honestly been inspired by what orgs have been able to do with Mythos. More to come!

12:07 AM · May 8, 2026 · 42.2K Views

2:00 AM · May 8, 2026 · 22.6K Views

REPLYBB #118Boaz Barak@BOAZBARAKTCS

There is another reason why AI favors the defender. A lot of small but important systems (e.g. hospitals) are insecure because they are limited in the security staff they can hire. Now you can have a team of a 100 top security engineers watching over your system.

Boaz Barak@boazbaraktcs

I agree that fundamentally, AI favors defense in cyber. We know that theoretically, bug free software (at least in narrow sense of conforming with spec) can exist, and as we find and fix more bugs, we can approach this asymptote.

2:00 AM · May 8, 2026 · 22.6K Views

2:05 AM · May 8, 2026 · 2.5K Views

ORIGINAL POSTEM #127Ethan Mollick@EMOLLICK

So Mythos was, indeed, not marketing hype.

Remember this is a general purpose model that just happens to be good at finding exploits because good models are good at lots of things. Expect similar from OpenAI & Google. And from open models in 8 months. https://hacks.mozilla.org/2026/05/behind-the-scenes-hardening-firefox/

10:44 PM · May 7, 2026 · 439.1K Views

REPLYEM #127Ethan Mollick@EMOLLICK

@tszzl I guess I didn't interpret that as the claim? I think Anthropic has its own ideas of what constitutes a threat, for better or worse - a model finding a large number of exploits. This seems to suggest that Mythos can do this (I do not think it is the only model that can do this)

roon@tszzl

@emollick I mean this is not at all evidence that some post train of Mythos was tooo dangerous for the public

2:05 AM · May 8, 2026 · 21.6K Views

2:36 AM · May 8, 2026 · 16K Views

REPLYEM #127Ethan Mollick@EMOLLICK

And also https://www.paloaltonetworks.com/blog/2026/05/frontier-ai-defense/

Ethan Mollick@emollick

So Mythos was, indeed, not marketing hype. Remember this is a general purpose model that just happens to be good at finding exploits because good models are good at lots of things. Expect similar from OpenAI & Google. And from open models in 8 months. https://hacks.mozilla.org/2026/05/behind-the-scenes-hardening-firefox/

10:44 PM · May 7, 2026 · 439.1K Views

9:10 PM · May 8, 2026 · 11.4K Views

QUOTE POSTGM #133Gary Marcus@GARYMARCUS

@DKThomp my view, somewhere in middle

Gary Marcus@GaryMarcus

My view on Mythos was and is somewhere in between. It’s real, it’s a wakeup call, and it’s not quite what some of the media coverage suggested. • Mythos is not going to allow an 8 year old to accidentally take down a power grid. (Someone writing for the NYT thought that was a real possibility). • The new Mozilla data certainly show that Mythos is better then its predecessors at detecting bugs. • But the UK AI institute’s study showed (and I still think this is true) that well-secured systems are not immediately at risk. • Consistent with all this the Mozilla report notes “Note that a number of these bugs are sandbox escapes, which would need to be combined with other exploits to achieve a full-chain Firefox compromise” I stand by my middle view; it’s not marketing hype, but it’s not quite as potent as some people thought. One other thing to note is that whatever advances there are not necessarily general to many or all domains; we will have to wait and see on that.

12:54 AM · May 8, 2026 · 43K Views

4:19 AM · May 8, 2026 · 2.1K Views

QUOTE POSTGM #133Gary Marcus@GARYMARCUS

This is confused, but popular.

Popular because it tells a bunch of people what they want to hear.

Confused for a couple reasons: first, Mythos probably isn’t a pure LLM. (Claude Code isn’t, and it probably uses some similar techniques). [Also critics such as myself never called LLMs a “scam”; rather we said that LLMs need to be supplemented with other techniques, and wouldn’t be enough on their own.]

And on @EpochAIResearch’s important ECI benchmark it’s NOT hugely better than other models.

It’s better at bug finding, but doesn’t mean it’s solved hallucinations, boneheaded reasoning errors etc.

prinz@deredleritt3r

Old enough to remember when the prevailing view on AI was that LLMs are a scam, actually, and the bubble is about to pop (6 months ago)

8:22 PM · May 7, 2026 · 296.3K Views

1:27 AM · May 9, 2026 · 17K Views

QUOTE POSTGM #133Gary Marcus@GARYMARCUS

My view on Mythos was and is somewhere in between. It’s real, it’s a wakeup call, and it’s not quite what some of the media coverage suggested.

• Mythos is not going to allow an 8 year old to accidentally take down a power grid. (Someone writing for the NYT thought that was a real possibility).

• The new Mozilla data certainly show that Mythos is better then its predecessors at detecting bugs.

• But the UK AI institute’s study showed (and I still think this is true) that well-secured systems are not immediately at risk.

• Consistent with all this the Mozilla report notes “Note that a number of these bugs are sandbox escapes, which would need to be combined with other exploits to achieve a full-chain Firefox compromise”

I stand by my middle view; it’s not marketing hype, but it’s not quite as potent as some people thought.

One other thing to note is that whatever advances there are not necessarily general to many or all domains; we will have to wait and see on that.

Ethan Mollick@emollick

So Mythos was, indeed, not marketing hype. Remember this is a general purpose model that just happens to be good at finding exploits because good models are good at lots of things. Expect similar from OpenAI & Google. And from open models in 8 months. https://hacks.mozilla.org/2026/05/behind-the-scenes-hardening-firefox/

10:44 PM · May 7, 2026 · 439.1K Views

12:54 AM · May 8, 2026 · 43K Views

QUOTE POSTGM #133Gary Marcus@GARYMARCUS

@emollick my somewhat intermediate take; also re your second paragraph I don’t think we actually know how much bug-detecting-specific harnessing there is. so i am not sure your second sentence will stand the test of time. some data i saw briefly suggest otherwise, but we will have to see.

Gary Marcus@GaryMarcus

My view on Mythos was and is somewhere in between. It’s real, it’s a wakeup call, and it’s not quite what some of the media coverage suggested. • Mythos is not going to allow an 8 year old to accidentally take down a power grid. (Someone writing for the NYT thought that was a real possibility). • The new Mozilla data certainly show that Mythos is better then its predecessors at detecting bugs. • But the UK AI institute’s study showed (and I still think this is true) that well-secured systems are not immediately at risk. • Consistent with all this the Mozilla report notes “Note that a number of these bugs are sandbox escapes, which would need to be combined with other exploits to achieve a full-chain Firefox compromise” I stand by my middle view; it’s not marketing hype, but it’s not quite as potent as some people thought. One other thing to note is that whatever advances there are not necessarily general to many or all domains; we will have to wait and see on that.

12:54 AM · May 8, 2026 · 43K Views

12:55 AM · May 8, 2026 · 6.4K Views

REPLYDW #206Dean W. Ball@DEANWBALL

@GaryMarcus @binarybits This strikes me as a reasonable take.

12:57 AM · May 8, 2026 · 468 Views

REPLYWD #242will depue@WILLDEPUE

@alexalbert__ wowzers

Alex Albert@alexalbert__

With the help of Claude Mythos Preview, the Firefox team fixed more security bugs in April than in the past 15 months combined.

7:20 PM · May 7, 2026 · 1.2M Views

8:19 PM · May 7, 2026 · 1.6K Views

REPLYHT #276Helen Toner@HLNTNR

Post here, including example vulnerabilities and tips for other orgs on how to build in-house harnesses that let you make the most of new models: https://hacks.mozilla.org/2026/05/behind-the-scenes-hardening-firefox/

Helen Toner@hlntnr

One of the things that made the Mythos release hard to interpret is that Anthropic held back details on most vulns they found, to give defenders time to patch. 1 month later, info from orgs with access to Mythos is starting to trickle out, e.g. this post from Mozilla today:

8:03 PM · May 7, 2026 · 224K Views

8:03 PM · May 7, 2026 · 17.2K Views

ORIGINAL POSTHT #276Helen Toner@HLNTNR

One of the things that made the Mythos release hard to interpret is that Anthropic held back details on most vulns they found, to give defenders time to patch.

1 month later, info from orgs with access to Mythos is starting to trickle out, e.g. this post from Mozilla today:

8:03 PM · May 7, 2026 · 224K Views

REPLYHT #276Helen Toner@HLNTNR

Super interesting, on building their own harness:

Helen Toner@hlntnr

Post here, including example vulnerabilities and tips for other orgs on how to build in-house harnesses that let you make the most of new models: https://hacks.mozilla.org/2026/05/behind-the-scenes-hardening-firefox/

8:03 PM · May 7, 2026 · 17.2K Views

8:04 PM · May 7, 2026 · 17.3K Views

QUOTE POSTNS #287Noah Smith 🐇🇺🇸🇺🇦🇹🇼@NOAHPINION

Maybe powerful AI will favor the cyber defense.

Vulnerabilities are finite in number, so if you can fix them all, maybe software just becomes much more secure.

Alex Albert@alexalbert__

With the help of Claude Mythos Preview, the Firefox team fixed more security bugs in April than in the past 15 months combined.

7:20 PM · May 7, 2026 · 1.2M Views

8:34 PM · May 7, 2026 · 19.3K Views

QUOTE POST⿻A #366⿻ Andrew Trask@IAMTRASK

Future historians might say that cybersecurity was unsolvable *until* tools like Mythos came around and plugged all the holes.

Alex Albert@alexalbert__

With the help of Claude Mythos Preview, the Firefox team fixed more security bugs in April than in the past 15 months combined.

7:20 PM · May 7, 2026 · 1.2M Views

9:06 PM · May 7, 2026 · 324 Views

REPLYKA #506kache@YACINEMTB

@emollick Gpt 5.5 is already capable of this

Ethan Mollick@emollick

So Mythos was, indeed, not marketing hype. Remember this is a general purpose model that just happens to be good at finding exploits because good models are good at lots of things. Expect similar from OpenAI & Google. And from open models in 8 months. https://hacks.mozilla.org/2026/05/behind-the-scenes-hardening-firefox/

10:44 PM · May 7, 2026 · 439.1K Views

11:52 PM · May 7, 2026 · 8.6K Views

ORIGINAL POSTAA #558Alex Albert@ALEXALBERT__

With the help of Claude Mythos Preview, the Firefox team fixed more security bugs in April than in the past 15 months combined.

7:20 PM · May 7, 2026 · 1.2M Views

REPLYAA #558Alex Albert@ALEXALBERT__

Pulled from this great blog post: https://hacks.mozilla.org/2026/05/behind-the-scenes-hardening-firefox/

Alex Albert@alexalbert__

With the help of Claude Mythos Preview, the Firefox team fixed more security bugs in April than in the past 15 months combined.

7:20 PM · May 7, 2026 · 1.2M Views

7:20 PM · May 7, 2026 · 50.9K Views

QUOTE POSTLG #791Logan Graham@LOGANGRAHAM

I'm optimistic this eventually favors defense over offense. We wanted to start this transition cautiously.

I've honestly been inspired by what orgs have been able to do with Mythos. More to come!

Alex Albert@alexalbert__

With the help of Claude Mythos Preview, the Firefox team fixed more security bugs in April than in the past 15 months combined.

7:20 PM · May 7, 2026 · 1.2M Views

12:07 AM · May 8, 2026 · 42.2K Views

REPLYLG #791Logan Graham@LOGANGRAHAM

@hendrycks Yeah. That's probably the scenario we think about the most. The entire question is then how to smooth the transition as much as possible.

(which I think could require some unprecedented innovations in security)

Dan Hendrycks@hendrycks

@logangraham Eventually could be a long time, especially in critical infrastructure, much of which is running on Windows 7.

12:50 AM · May 8, 2026 · 1.6K Views

12:56 AM · May 8, 2026 · 885 Views

ORIGINAL POSTDT #915Derek Thompson@DKTHOMP

Skepticism of corporate marketing and AI boosterism is always warranted, but I think the folks who accused Anthropic of overrating Mythos should check out this post by Mozilla developers indicating that the Firefox team fixed more security bugs in April using Mythos than in the past 15 months combined. https://hacks.mozilla.org/2026/05/behind-the-scenes-hardening-firefox/

8:23 PM · May 7, 2026 · 106.7K Views

QUOTE POSTMH #936Marius Hobbhahn@MARIUSHOBBHAHN

Now imagine they had used this knowledge for nefarious purposes.

Kudos to Anthropic, but I really think these high-stakes situations should not rely on "company defies their incentives to do the right thing"

Helen Toner@hlntnr

One of the things that made the Mythos release hard to interpret is that Anthropic held back details on most vulns they found, to give defenders time to patch. 1 month later, info from orgs with access to Mythos is starting to trickle out, e.g. this post from Mozilla today:

8:03 PM · May 7, 2026 · 224K Views

4:38 PM · May 8, 2026 · 2.5K Views

QUOTE POSTMH #936Marius Hobbhahn@MARIUSHOBBHAHN

Important clarification:

6:31 PM · May 8, 2026 · 347 Views

QUOTE POSTPW #959Peter Wildeford🇺🇸🚀@PETERWILDEFORD

Wow, Mythos is really cooking over at Firefox

Helen Toner@hlntnr

One of the things that made the Mythos release hard to interpret is that Anthropic held back details on most vulns they found, to give defenders time to patch. 1 month later, info from orgs with access to Mythos is starting to trickle out, e.g. this post from Mozilla today:

8:03 PM · May 7, 2026 · 224K Views

1:16 AM · May 8, 2026 · 3.2K Views

ORIGINAL POSTTB #1304Timothy B. Lee@BINARYBITS

Not a good day for team "Claude Mythos is just marketing hype."

8:45 PM · May 7, 2026 · 45.2K Views

QUOTE POSTPR prinz@DEREDLERITT3R

Old enough to remember when the prevailing view on AI was that LLMs are a scam, actually, and the bubble is about to pop (6 months ago)

Alex Albert@alexalbert__

With the help of Claude Mythos Preview, the Firefox team fixed more security bugs in April than in the past 15 months combined.

7:20 PM · May 7, 2026 · 1.2M Views

8:22 PM · May 7, 2026 · 296.3K Views

QUOTE POSTKF Kevin Frazier@KEVINTFRAZIER

#ICMYI today's AI is the worst AI we'll ever have.

A Kick the Tires period - as @deanwball and I call for in @lawfare - would ensure others could take such steps and expedite the process of seeing the latest and greatest AI tools be deployed.

Alex Albert@alexalbert__

With the help of Claude Mythos Preview, the Firefox team fixed more security bugs in April than in the past 15 months combined.

7:20 PM · May 7, 2026 · 1.2M Views

7:51 PM · May 7, 2026 · 3.8K Views

QUOTE POSTLE levent@__ALPOGE__

Gotta do the right thing even when getting shit on for literally no reason

Ethan Mollick@emollick

So Mythos was, indeed, not marketing hype. Remember this is a general purpose model that just happens to be good at finding exploits because good models are good at lots of things. Expect similar from OpenAI & Google. And from open models in 8 months. https://hacks.mozilla.org/2026/05/behind-the-scenes-hardening-firefox/

10:44 PM · May 7, 2026 · 439.1K Views

1:56 AM · May 8, 2026 · 13.8K Views

Cluster engagement

Sentiment

Cluster engagement