OpenAI frontier models and Codex go GA on AWS
A heavy day on the business side: OpenAI's frontier models land on AWS and Anthropic quietly files to go public. On the tooling front, JetBrains ships an open coding MoE, and the discourse turns to where extra model intelligence actually pays off.
OpenAI frontier models and Codex go GA on AWS
OpenAI says its frontier models and the Codex coding agent are now generally available on AWS, letting enterprises consume them through existing AWS environments, IAM controls, and procurement. It positions OpenAI as multi-cloud rather than Azure-only and gives AWS-native shops a path from evaluation to production without leaving their account.
Why it matters: If you build on AWS, you can now wire in OpenAI models and Codex without standing up a separate vendor relationship — fewer billing and compliance hoops to clear.
Anthropic confidentially files a draft S-1 with the SEC
Anthropic disclosed that it has confidentially submitted a draft S-1 registration statement to the SEC, the standard first step toward a US IPO. The filing is confidential, so terms, financials, and timing are not public.
Why it matters: An Anthropic IPO would force unprecedented disclosure of frontier-lab economics — useful signal for anyone betting their stack on Claude's long-term pricing and viability.
JetBrains releases Mellum2, a 12B MoE coding model
JetBrains introduced Mellum2, a 12B-parameter mixture-of-experts model aimed at code, published on Hugging Face. It is the successor to the company's earlier Mellum coding model and is positioned as an open release for developer tooling.
Why it matters: An openly available MoE tuned for code from the IDE vendor itself is a concrete option for self-hosted completion and agentic coding, away from closed APIs.
Latent Space digs into xAI's Grok Imagine and the case for video agents
Latent Space interviews Ethan He, who led xAI's Grok Imagine, on building the video model in roughly three months and the distinction between video generation and world models. The episode argues video agents are the next frontier and that Grok Imagine is underrated.
Why it matters: A rare engineering-level look at how a frontier video model gets shipped fast, and a framing of where video-native agents go next.
- Why Video Agent models are next — Ethan He, xAI Grok Imagine (Latent Space (swyx))
NVIDIA pushes local agents onto RTX PCs and DGX Spark
NVIDIA's Computex-timed post highlights a wave of on-device personal agents, citing open source projects like OpenClaw and Hermes, that run locally to drive applications, generate content, and automate multi-step tasks. The pitch centers on RTX PCs and the DGX Spark desktop as the hardware to run them.
Why it matters: Local-first agents sidestep API costs and data egress, but watch the hardware floor — this is as much a GPU upsell as a software story.
Interconnects: open and closed models are on different exponentials
Nathan Lambert argues that open and closed models are improving along distinct curves, and that marginally higher intelligence creates value in some workloads while barely mattering in others. The piece is a framework for deciding when to pay for the frontier versus run open weights.
Why it matters: A useful lens for build-vs-buy decisions: it pushes you to map model capability to where your product actually feels the difference.
- Open and closed models are on different exponentials (Interconnects)
Also worth a look
- Building the infrastructure for the Intelligence Age in Michigan (OpenAI)
- How LinkedIn Uses PyTorch to Solve Extreme-Scale Optimization Problems (PyTorch)
- Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic (Hugging Face)
- Import AI 459: AI oversight is difficult; scaling laws for protein folding models; pricing AI extinction risk (Import AI (Jack Clark))
- How we used Gemini to build Google I/O 2026 (Google AI Blog)
- OpenAI's views on AI policy and political advocacy (OpenAI)