Claude Fable Blocked - 11 Quiet Details on What’s Next

Overview

This episode breaks down the sudden shutdown of Anthropic's Fable 5 after a US government export restriction made it effectively impossible to offer the model to foreign nationals. The host argues that the move was far bigger than a routine safety response and may signal a new, messy phase in AI regulation, where jailbreaks, geopolitics, and company messaging collide.

Key Takeaways

The core claim is that the government did not just ask Anthropic to patch a problem. According to the reporting cited here, officials chose export controls because they were the fastest way to force action, knowing Anthropic would likely have to block the model for everyone rather than try to screen every user and employee by nationality.

A big part of the episode is the split between two explanations. The less cynical read is that officials, under pressure to show they were taking frontier-model risks seriously, reacted quickly after being shown a jailbreak by a trusted outside party. The more cynical read is that the decision was already made, and the jailbreak served as a pretext for singling out Anthropic while leaving rivals untouched.

The host leans hard on a contradiction: Anthropic says the capability exposed by the jailbreak was already common across other models, and in some benchmarked areas like prompt injection resistance, Mythos and Fable were said to perform far better than GPT or Gemini. If that is right, then the question becomes why Anthropic drew the export ban while others did not.

Another thread is that the reported jailbreak itself may not have been the nightmare scenario the public might assume. One cited account says the model was prompted to help patch security vulnerabilities, which a cybersecurity firm reportedly viewed as normal defensive use rather than a major threat. At the same time, the host concedes there are separate reports of more harmful jailbreaks, including one involving explosives, which makes the official focus look selective.

The episode also points out a problem no one has solved: jailbreak resistance is not absolute. Anthropic's defense, as presented here, is that narrow jailbreaks are different from universal ones, and that "perfect" resistance is not available from any major lab. That matters because if one exploitable edge case becomes enough to trigger export controls, the standard could freeze releases across the whole industry.

Practical Steps

For people building with frontier models, the immediate lesson is to plan for policy shock, not just technical failure.

Avoid depending on a single model provider for production use. Keep backups tested and ready.
Separate tasks by risk. Do not route sensitive biosecurity or cybersecurity workflows through one model without fallback rules and human review.
Track policy signals, not just model performance. White House statements, export rules, and company system cards now matter as much as benchmarks.
Read safety claims closely. If a lab markets itself around extreme caution, that language can later be used against it by regulators.
If you run a company, map out what happens if access is restricted by geography, identity verification, or employee nationality. The host suggests those questions are no longer hypothetical.

For regular users, the practical move is simpler: expect access instability, export controls, and tighter identity checks to become part of using top-tier AI systems.

Notable Quotes

"The quickest way to shut Fable down was to ban its export."
"We suspect that perfect jailbreak resistance is not currently possible for any model provider." - Anthropic statement
"Who at the White House evaluated this and thought it was a threat? It's a complete overreaction." - independent cybersecurity firm, as quoted by the host

If this new standard was applied across the industry, we believe it would essentially halt all new model deployments for all frontier model providers. — From the episode

Full Transcript

Source: openai 13m runtime

Something extremely strange happened in the last 36 hours, and if it holds, it will have massive ramifications for how we all access and use AI. Claude Fable 5, the latest model from Anthropic and one of the best LLMs in the world, if not the best, has been blocked for everyone. That's on the orders of the US government, who wanted it blocked for any foreign national inside or outside the United States, including foreign nationals that were Anthropic employees. Anthropic reacted by blocking it for everyone. Now, you had probably gathered that Fable 5 had been disabled by looking at the headlines or seeing it being blocked when you tried to use it. I wouldn't make a video just telling you that. So what I've compiled are 11 things you might not have noticed, contextualizing facts that might help explain what happened and what's next for AI. Let's start with the first fact that you may or may not know, courtesy of an exclusive in The Information. You might have heard that it was a call from Amazon CEO Andy Jassy, as well as other tech leaders, that informed the US government that there were such jailbreaks that could be done on Mythos 5 or Fable 5. Fable 5 is or was the public version of the model with added safeguards. Your first thought might be that this is a big tech rival trying to hobble Anthropic. But did you know that Amazon is one of Anthropic's biggest investors and vendors? After discussions with Amazon CEO Jassy and other tech leaders, US national cyber director Sean Cankross called a meeting with senior White House officials, who decided that export restriction was the most straightforward way to take action against Anthropic. Translated, the quickest way to shut Fable down was to ban its export. They knew that the only way for Anthropic to enforce this was to shut the model down for everyone. Next comes two follow on details in this article, one that will make you more cynical about the government's motives and one perhaps less cynical. Let's start with the less cynical one. We're going to go back to that US national cyber director, Sean Cankross. In May, Politico reported that he was already under immense pressure to do more to handle the release of these frontier models with the cybersecurity issues that they brought with them. Given his relative lack of experience in cybersecurity, people were questioning his expertise to head this effort. For example, JPMorgan CEO Jamie Dimon had relayed to Treasury Secretary Scott Besant his concerns about the speed of the government's response to major potential risks to critical infrastructure posed by these new forms of AI like Mythos. Without huge experience himself in this area, he might have deferred to outside experts or CEOs and thought, well, if they're worried, I better do more. I better act faster. This version of events is backed up by David Sachs, AI star for the US government, who said a highly credible trusted partner of both Anthropic and the US government, who was testing Fable, came forward with a jailbreak of guardrails. The admin asked Dario to fix the jailbreak or de-deploy the model. Dario refused. Now, though, for the more cynical read and set of contextualizing facts. As you'd expect, Anthropic CEO Dario Amadei and administration officials spoke. And during those conversations, Anthropic officials laid out how the security vulnerabilities found through the alleged jailbreak were relatively simple and could be achieved with other models. This is what Anthropic put on their statement viewed by millions. This jailbreak unearthed capability that, they say, is widely available from other models, dobbing in this time OpenAI's GPT 5.5. Here's the key line, though. But the government told Anthropic that it had already decided to implement the export control. Why, over a seemingly narrow and commonplace jailbreak, had the government already decided to implement extreme export controls on Frontier AI at Anthropic? They knew it would affect millions, tens of millions of users. Why have they gone further in publicly saying that they're unlikely to extend those export restrictions to other AI companies? Let's look at the hard facts as found on page 233 of the Mythos system card. I've done more detailed videos on both Mythos and Fable 5, so do check them out. In one of the few areas where we can directly compare Frontier models on what's called prompt injection, where models are tricked into disregarding their system guidelines into doing unintended actions, Mythos and the Fable series are by far the most robust against this, as compared to, for example, the GPT series or the Gemini series. Not by a small margin, by the way, 5 or 10 times more robust. Moreover, according to the Wall Street Journal, the jailbreak involved getting Mythos or Fable to help patch security vulnerabilities. This is according to an independent cybersecurity firm. They saw the US government report that Anthropic had received, and this independent firm said, Who at the White House evaluated this and thought it was a threat? It's a complete overreaction. This is exactly the kind of prompting that defenders would do. As always, I'm trying to give you the full story. There are other independent reports of Fable 5 being jailbroken to do less positive things. Famously, Pliny the Liberator was able to get it to give instructions on making explosives. But it's weird that that evidence wasn't counted. Instead, it was evidence of Mythos being helpful for security. Then there are concerns of a hatchet job. In the briefings to Politico, the US government said, Oh, we attempted to reach Dario Amadei, but he was unavailable because he was attending a wellness retreat. That obviously makes him look pretty silly. Not only did Anthropic reject that claim and say, This is absolutely false, but one journalist, Ashley Vance, was actually reporting at Anthropic at the time and said, Dario is not at a wellness retreat. The feds seem to be scrambling to try and make an example of Anthropic again. It actually really matters whether the more cynical or less cynical version of events is the correct one. Because if this is just an honest reaction to jailbreaks, the issue can be resolved quite quickly. I and any other, quote, foreigner can regain access to Mythos or Fable quite quickly, hopefully. If it's the more cynical version and this stays in place, then you don't need me to say this will have huge consequences. Will everyone's ID have to be checked before they use a model? Will Anthropic have to fire all their foreign-born workers like Andre Carpathie or even Chris Ola, a co-founder? What about Ilya Sutskever or Demis Hassabis or Geoffrey Hinton? This is why I think whether or not it was a misunderstanding, I can only see them retracting this in the coming few days, if only for concerns about the stock market. We're not done, though, with contextualizing facts. In June, in response to another matter, the White House stressed this. We are not conducting oversight of all new models, as that level of government overreach would have a chilling effect on free speech and innovation. Anthropic agrees. That seems a 180 from their current actions. If this new standard was applied across the industry, we believe it would essentially halt all new model deployments for all frontier model providers. In case you didn't know, jailbreaking is an unsolved issue. No one, to my knowledge, has been able to produce a functional model of any size that is resistant to all jailbreaks. That's why Amadei, somewhat understandably, said this must be a misunderstanding. He assumed it was a misunderstanding. He explained that narrow jailbreaks were different from a universal jailbreak, wherein you could bypass the defenses and get the model to do anything. That's very different from getting it to do one specific narrow thing. Later, he asked for more time. But then he apparently passed the tipping point and said he'll make no commitments to pull Mythos 5 or Fable 5. That's when Treasury Secretary Scott Besant told Amadei directly he was making a bad decision. Hours later, Fable was banned. David Sachs said Anthropic had prioritized the continued offering of the consumer model over safety. He was surprised that Anthropic hadn't wanted to cooperate and fix the jailbreak issue. Well, perhaps because they knew that there would always be another jailbreak if they fixed one of them. Again, Anthropic say, we suspect that perfect jailbreak resistance is not currently possible for any model provider. Of course, as they point out, they added in so many safeguards with Fable that people, you could say, including me, complained about the over-restriction of the model. Ask it anything about biology and it would just shut down and give you Opus 4.8. Now, I will say this to keep the story as balanced as I can. Earlier this month, a senior official Anthropic, Jason Clinton, had said this. We are seven to 10 months before open weight models have the capability of the Mythos-class models. When that happens, we will have the roughest, most difficult time in our careers that we have had as cybersecurity folks. The duration of that difficult time, I would guess 18 months. That's why some of you watching will be saying, well, Anthropic kind of set this up for themselves. They said that Mythos had all of these dangers when it came to cybersecurity and they admitted that Mythos and any model could be jailbroken. Put the two together and the government weren't entirely wrong to act. That's the point that David Sachs is making when he said that this is not on brand for Anthropic as a safety company. It's difficult to fathom how they could claim a jailbreak allowing operability of a cyberweapon could be defined as not serious. That's the one twist in the tale that you could say Anthropic had coming. They said, as I've reported many times, three or four years ago that the only reason they wanted to make