US government orders Anthropic to suspend Fable 5 and Mythos 5 access

A quiet Friday with no marquee model launch, but plenty of plumbing for builders: OpenAI loosens Codex rate limits, GitHub tightens Copilot CLI's delegation logic, and a GPT-5-class realtime voice model quietly surfaces over WebRTC. On the business side, Washington pulled the plug on Anthropic's Fable 5 and Mythos 5, and Mistral is reportedly raising again at roughly double its last mark.

US government orders Anthropic to suspend Fable 5 and Mythos 5 access

Anthropic issued a statement responding to a US government directive to suspend access to its Fable 5 and Mythos 5 models. The company published its position publicly rather than quietly complying, framing it as a government-mandated suspension rather than a product decision. Details on scope, duration, and affected customers are thin in the statement itself.

Why it matters: If you have anything in production on these models, treat this as an availability risk today and check your fallbacks — a regulator can now apparently switch a frontier model off.

OpenAI lets Codex users bank and manually trigger rate-limit resets

OpenAI changed how usage caps work for its Codex coding agent: instead of resets expiring on a fixed schedule, users can now save them and cash one in manually when they hit a cap mid-session. Go, Plus, Pro, and Business plans each get one free reset to start, with Plus and Pro able to unlock more via referrals. The Decoder frames it as an opening shot in a coding-agent price war.

Why it matters: A small but real quality-of-life change if you live in Codex — no more losing a session to a fixed-window reset clock.

GitHub made Copilot CLI more selective about delegating to sub-agents

GitHub detailed orchestration changes to Copilot CLI aimed at reducing unnecessary hand-offs between agents, claiming better progress with fewer delegations and no new user-facing settings. The writeup focuses on when the CLI should keep working in-context versus spinning up a delegate.

Why it matters: Over-eager delegation is a common failure mode in agentic CLIs; tuning it without adding knobs is the right direction if it holds up in practice.

OpenAI's GPT-Realtime-2 shows up in the WebRTC API with document context

Simon Willison revisited his OpenAI realtime-audio playground to test GPT-Realtime-2, which OpenAI bills as its first voice model with GPT-5-class reasoning and a Sep 30, 2024 knowledge cutoff. The updated tool lets you select the better model and paste in a large chunk of document text as context for a voice session. Notably the model still hasn't appeared in the ChatGPT iPhone app.

Why it matters: A reasoning-grade realtime voice model you can feed long context to over WebRTC is a concrete building block for voice agents — and it's reachable via API before it lands in OpenAI's own consumer app.

Anthropic's first Public Record survey: most Americans fear AI, daily users far less so

Anthropic published results from its first Public Record, a survey of nearly 52,000 Americans. 64% fear job losses and 56% worry about losing the ability to think for themselves, but daily AI users report much lower concern. Most respondents still reject AI in their own workplace, even for tasks they believe it could handle.

Why it matters: The usage-versus-fear gap is the interesting signal: adoption tracks familiarity, and broad workplace resistance is a real deployment constraint regardless of capability.

Allen AI ships olmo-eval, an evaluation workbench for the model dev loop

Allen AI published olmo-eval, described as an evaluation workbench designed to fit into the model development loop rather than as a one-off benchmark run. The Hugging Face post positions it around the OLMo development workflow.

Why it matters: Eval tooling tied to the training/dev loop is more useful than leaderboard chasing; worth a look if you're building or fine-tuning open models.

Mistral reportedly raising €3B at a ~€20B valuation

TechCrunch reports Mistral is rumored to be raising €3B at roughly a €20B (~$23.15B) valuation, nearly double its €11.7B Series C mark. The round is unconfirmed.

Why it matters: Europe's main open-weight challenger doubling its valuation signals continued capital flowing to non-US labs — relevant if you want a credible alternative model supplier.

Browse previous days →