Claude Fable 5 Lands
Anthropic's Claude Fable 5 dominates the day, with Simon Willison already rebuilding tooling with it and Karpathy waxing about Jevon's paradox. Google ships Gemma 4 12B and a Gemini 3.5 Live Translate update, while OpenAI runs a Codex customer-story PR cycle and Cohere quietly drops its first dev model.
Anthropic ships Claude Fable 5 and Mythos 5
Anthropic released two new frontier models: Claude Mythos 5 and Claude Fable 5, with Anthropic claiming Fable matches Mythos performance but with stricter guardrails against misuse. Simon Willison spent ~5.5 hours stress-testing Fable 5, calling it slow, expensive, and hard to stump on real tasks. Interconnects frames the dual release as another move in frontier-AI safety and power politics.
Why it matters: A new frontier model from Anthropic is an immediate API decision point — and the Fable/Mythos split signals that safety-gated variants are becoming a productized tier rather than an afterthought.
- Claude Fable 5 and Claude Mythos 5 (Anthropic)
- Initial impressions of Claude Fable 5 (Simon Willison)
- Claude Fable 5 and new AI safety fables (Interconnects)
Fable 5 in practice: llm 0.32a3 written almost entirely by the new model
Willison shipped llm 0.32a3, noting it was almost entirely authored by Claude Fable 5. He also documented reverse-engineering Wes McKinney's AgentsView to add custom pricing for Fable 5, which wasn't yet in the pricing database. Karpathy, reflecting on Fable 5, argued that cheap on-tap software triggers Jevon's paradox — demand for bespoke tooling grows rather than shrinks.
Why it matters: This is the concrete other half of the launch: a frontier model used as the primary author of real OSS, plus the unglamorous reality that pricing and tooling lag a same-day release.
- llm 0.32a3 (Simon Willison)
- Setting a custom price for a model in AgentsView (Simon Willison)
- Quoting Andrej Karpathy (Simon Willison)
Google launches Gemma 4 12B, an encoder-free multimodal model
Google DeepMind released Gemma 4 12B, described as a unified, encoder-free multimodal model. The encoder-free design folds vision directly into the model rather than relying on a separate vision tower.
Why it matters: An open-weights 12B multimodal model with a simplified architecture is an attractive target for local and fine-tuned deployments where you want one model handling text and images.
- Introducing Gemma 4 12B: a unified, encoder-free multimodal model (Google DeepMind)
FrontierCode: a benchmark for code quality over slop
Latent Space introduced FrontierCode, a new benchmark aimed at measuring code quality rather than just pass rates — explicitly targeting the 'slop' problem in AI-generated code.
Why it matters: As coding agents flood repos with technically-passing-but-bad code, a benchmark that grades quality rather than mere correctness is the kind of signal developers actually need.
- [AINews] FrontierCode: Benchmarking for Code Quality over Slop (Latent Space (swyx))
Cohere debuts North Mini Code, its first developer-focused model
Cohere Labs introduced North Mini Code, billed as Cohere's first model aimed specifically at developers and coding tasks. The release is available via Hugging Face.
Why it matters: Cohere staking out the small coding-model niche adds another option against Codex/Gemma/Qwen-class competitors for embedded dev tooling.
Gemini 3.5 Live Translate brings near real-time voice translation
Google DeepMind launched Gemini 3.5 Live Translate, offering near real-time natural speech translation across Google AI Studio, Google Translate, and Google Meet.
Why it matters: Low-latency speech-to-speech translation exposed through AI Studio gives developers a building block for live multilingual voice apps without stitching together separate ASR/MT/TTS pipelines.
- Fluid, natural voice translation with Gemini 3.5 Live Translate (Google DeepMind)
OpenAI runs the Codex customer-story tour with GPT-5.5
OpenAI published two case studies on Codex powered by GPT-5.5: Nextdoor using it to investigate hard-to-reproduce bugs and build cross-platform, and Notion using it to one-shot specs and ship AI Voice Input for the web. Separately, OpenAI laid out an 'industrial policy for the Intelligence Age' on opportunity and institution-building.
Why it matters: These are marketing pieces, but the concrete workflows — flaky-bug triage, spec one-shotting, small-team leverage — are a useful read on what Codex plus GPT-5.5 is actually being used for in production.
Also worth a look
- From one-off prompts to workflows: How to use custom agents in GitHub Copilot CLI (GitHub Blog)
- Defend against frontier cyber models: Cloudflare's architecture as customer zero (Cloudflare Blog)
- How an Agent Built a 3D Paris Gallery by Chaining Two Hugging Face Spaces (Hugging Face)
- Migrating Your GitHub CI to Hugging Face Jobs (Hugging Face)
- Build an agentic incident triage assistant with Amazon Quick and New Relic (AWS Machine Learning)
- Hands-free first notice of loss: Strands Agents and Amazon Bedrock AgentCore Browser Tool (AWS Machine Learning)
- Scale Robot Reinforcement Learning with NVIDIA Isaac Lab on Amazon SageMaker AI (AWS Machine Learning)
- Powering the future of robotics in Europe (Google DeepMind)
- Learning to lead in a hybrid human-AI enterprise (MIT Technology Review)
- Five things you need to know about AI (MIT Technology Review)