Three Ways to Build Deep Research with Claude
From 20 lines of shell to production apps. Anthropic renamed Claude Code SDK to Agent SDK because deep research is now a first-class use case.
Read more →
Anthropic credits Steve Yegge's Beads as inspiration for Claude Code's new task system. The pattern-to-product cycle continues.
Read more →
Claude Code can now watch your PRs in the cloud, fix CI failures, and address reviewer comments while you're away. It's the logical next step after auto mode - and it raises the same trust questions, harder.
Read more →
Most agent discourse is about coding agents. The agents quietly running in production right now are simpler, dumber, and more useful: stale-ticket sweepers, board monitors, three-line incident explainers, leadership briefs from a fixed template. Different species. Different shape. Already shipping.
Read more →
OpenAI publishes weekly active developers. Anthropic publishes annual recurring revenue. Each company brags about the metric it can defend. Codex went 600K to 4M in four months. The 'Claude Code is better' discourse is a quality argument because quality is what's left when scale goes the other way.
Read more →
Opus 4.5 scores 80.9% on SWE-bench Verified. The same model scores 45.89% on the contamination-free Pro split. OpenAI has quietly stopped reporting Verified at all. Vendor benchmark cards are marketing.
Read more →
Lightrun: 43% of AI-generated code changes need debugging in production after passing QA. CodeRabbit: 1.7x bugs, 2.74x security, 8x I/O. METR: 19% slower while feeling 20% faster. The numerator is what gets reported. The denominator is what nobody puts in the deck.
Read more →
GitHub paused Copilot Pro signups, killed Opus on the Pro plan, and leaked a June 1 move to token-based billing. Three moves, one vendor, three different ways not to say 'price hike.'
Read more →
Anthropic markets MCP as the universal AI tooling standard, but a 200,000-server RCE class is 'expected behavior.' You can't be both.
Read more →
Two weeks after Anthropic said Mythos was too dangerous to release, OpenAI shipped a model with comparable cyber capabilities to anyone with a $20 ChatGPT subscription. The gating posture didn't survive a single news cycle.
Read more →
Anthropic's April 23 postmortem confirms three Claude Code regressions, including one where Opus 4.7 caught a bug Opus 4.6 shipped past human and automated review. What happens when the reviewer is a version of the product being reviewed?
Read more →
Uncle Bob, TDD's best-known evangelist, posted on X that TDD is 'very inefficient for AIs' and that the agent is best thought of as 'a highly focused idiot savant.' Testing didn't die. It got more important. And the review target flipped.
Read more →
Traditional coders touched a file and tidied it. The Boy Scout Rule. Now nobody does. Agents add, they don't subtract, and the codebase accretes faster than ever. A technique for putting cleanup back in as an explicit gate, not a virtue you hope for.
Read more →
A CTO once told me not to take people's Legos away. I ignored him, solved the team's problems myself, and got exactly what I optimised for: a sound plan and a team that couldn't stand me. In 2026, with agents doing the bricks, this is the lesson that matters.
Read more →
Vercel got breached through Context.ai, an AI tool an employee installed with OAuth scopes into Google Workspace. It's the latest in a pattern: Trivy into litellm, axios maintainer hijack, now this. The safest AI tool is the one you didn't install.
Read more →