OpenAI frontier models and Codex go GA on AWS

A heavy day on the business side: OpenAI's frontier models land on AWS and Anthropic quietly files to go public. On the tooling front, JetBrains ships an open coding MoE, and the discourse turns to where extra model intelligence actually pays off.

OpenAI frontier models and Codex go GA on AWS

OpenAI says its frontier models and the Codex coding agent are now generally available on AWS, letting enterprises consume them through existing AWS environments, IAM controls, and procurement. It positions OpenAI as multi-cloud rather than Azure-only and gives AWS-native shops a path from evaluation to production without leaving their account.

Why it matters: If you build on AWS, you can now wire in OpenAI models and Codex without standing up a separate vendor relationship — fewer billing and compliance hoops to clear.

Anthropic confidentially files a draft S-1 with the SEC

Anthropic disclosed that it has confidentially submitted a draft S-1 registration statement to the SEC, the standard first step toward a US IPO. The filing is confidential, so terms, financials, and timing are not public.

Why it matters: An Anthropic IPO would force unprecedented disclosure of frontier-lab economics — useful signal for anyone betting their stack on Claude's long-term pricing and viability.

JetBrains releases Mellum2, a 12B MoE coding model

JetBrains introduced Mellum2, a 12B-parameter mixture-of-experts model aimed at code, published on Hugging Face. It is the successor to the company's earlier Mellum coding model and is positioned as an open release for developer tooling.

Why it matters: An openly available MoE tuned for code from the IDE vendor itself is a concrete option for self-hosted completion and agentic coding, away from closed APIs.

Latent Space digs into xAI's Grok Imagine and the case for video agents

Latent Space interviews Ethan He, who led xAI's Grok Imagine, on building the video model in roughly three months and the distinction between video generation and world models. The episode argues video agents are the next frontier and that Grok Imagine is underrated.

Why it matters: A rare engineering-level look at how a frontier video model gets shipped fast, and a framing of where video-native agents go next.

NVIDIA pushes local agents onto RTX PCs and DGX Spark

NVIDIA's Computex-timed post highlights a wave of on-device personal agents, citing open source projects like OpenClaw and Hermes, that run locally to drive applications, generate content, and automate multi-step tasks. The pitch centers on RTX PCs and the DGX Spark desktop as the hardware to run them.

Why it matters: Local-first agents sidestep API costs and data egress, but watch the hardware floor — this is as much a GPU upsell as a software story.

Interconnects: open and closed models are on different exponentials

Nathan Lambert argues that open and closed models are improving along distinct curves, and that marginally higher intelligence creates value in some workloads while barely mattering in others. The piece is a framework for deciding when to pay for the frontier versus run open weights.

Why it matters: A useful lens for build-vs-buy decisions: it pushes you to map model capability to where your product actually feels the difference.

Browse previous days →