MiniMax shipped M3 on June 1: frontier coding claims, 1M-token context, native multimodality, and pricing that undercuts Opus 4.7 by 10-40x. It's already on Ollama Cloud and OpenRouter, so you can point Claude Code at it today.
Read more →
A community file distilling Karpathy's coding-agent observations hit 60K stars on four principles. I opened my own CLAUDE.md to compare. I'd independently written two of them. The two I hadn't are the ones that matter most.
Read more →
Frontier models top out at 68% compliance with 500 instructions. Every rule you add makes every other rule less likely to be followed. The research explains why.
Read more →
Playwright MCP's 26 tools are killing your context window. Vercel's agent-browser shows a better way: fewer tools, smarter snapshots, 93% less overhead.
Read more →
ENABLE_EXPERIMENTAL_MCP_CLI eliminates MCP tool schema overhead entirely. Undocumented, untested in the wild, but it works. Here's what I found.
Read more →
HumanLayer's 12-factor agents codifies what works in production AI: own your context, keep agents small, stay out of the dumb zone.
Read more →
Claude's new model ships with defer_loading for tools. The MCP isolation patterns I built are now (mostly) obsolete.
Read more →
From Chrome DevTools experiment to universal MCP wrapper: progressive discovery works with any server, Skills integration, and smart deduplication
Read more →
How to use slash commands to keep specialized MCP tools isolated from your main conversation context.
Read more →