Posts tagged "Ai-coding"

19-JUL-26[6 MIN]

The Churn You Can't Un-Churn: Fable 5 Is Back, the Subscriptions Multiplied Anyway

Anthropic settled the Fable 5 meter: standing access on Max from July 20, Pro cut loose with $100. But the interesting churn already happened, and it doesn't look like churn. It looks like professionals carrying three, four, five AI subscriptions at once - revenue growth on every vendor's dashboard, loyalty on none.

18-JUL-26[5 MIN]

ai-coding architecture systems-thinking best-practices

Papering Faster: AI Coding Agents and the Root-Cause Fix Nobody Ships

A production API I work on has shipped 227 fix commits and 9 refactors in six months. That 25:1 ratio predates AI, but agents are widening it: we patch symptoms faster than ever while the tool that could finally make root-cause fixes affordable sits idle in the same terminal.

14-JUL-26[6 MIN]

ai-coding security llm best-practices

The Warning Was in the Manual: GPT-5.6 Sol and the Deleted Database

GPT-5.6 Sol wiped a home directory and truncated a production database in its first week. The viral story is 'the model is dangerous.' The documented story is worse: OpenAI measured this exact failure mode before launch, wrote it down, and shipped. The guardrails existed. Everyone stepped around them.

13-JUL-26[8 MIN]

production systems-thinking ai-coding architecture product

Three Instruments, All Lying: Debugging the Metrics Behind FameCake

A simple question (why are there no bookings today?) turned into a day of finding that three of the instruments I steer FameCake by were quietly wrong: consent-gated ad attribution, a proof-of-play record that deletes itself, and a speed claim nobody had ever measured. The bug isn't in any of them. The bug is trusting derived data.

12-JUL-26[5 MIN]

llm claude-code ai-coding dev-tools

Ten Findings, Two Real: Grok 4.5 Reviews the Code That Runs Grok 4.5

I added Grok to my bring-your-own-model setup for Claude Code, then let Grok 4.5 code-review its own integration and had Opus grade the review. It found ten issues; two survived. A first-person look at why Opus-class on a benchmark isn't the same as trustworthy in the reviewer's chair.

11-JUL-26[12 MIN]

llm product systems-thinking ai-coding

Metered Into a Ghost Town: Fable 5 Goes Pay-Per-Token and Everyone Defects to GPT-5.6 Sol

Monday, Claude Fable 5 leaves subscription plans for pay-per-token credits at roughly double GPT-5.6 Sol's price. For the median subscriber that makes the smartest model effectively off-limits. Why the rational move is Sol, and why Anthropic blinks a third time.

10-JUL-26[5 MIN]

llm product systems-thinking ai-coding

Stop the Bleed: GPT-5.6 Sol Ships and Anthropic Resets Everyone's Rate Limits

OpenAI released GPT-5.6 Sol, Terra, and Luna at half Claude Fable 5's price. Thirty-one minutes later Anthropic reset every user's rate limits with a one-sentence tweet. The AI model wars are now about billing, not benchmarks.

09-JUL-26[5 MIN]

llm ai-coding systems-thinking product

Grok 4.5 Trained on the Answer Key

xAI's launch page for Grok 4.5 is a wall of green bars led by a token-efficiency chart. The most important sentence is a footnote on Cursor's blog: an earlier snapshot of the Cursor codebase, the thing CursorBench grades against, was in the training data. The exam graded itself, and the answer key came stapled to it.

07-JUL-26[6 MIN]

llm product systems-thinking ai-coding

Elon Owns the Whole Stack: Grok 4.5 and the $60 Billion Cursor Buy

Grok 4.5 was announced in a single tweet - no model card, no API, no independent benchmark, just 'perhaps exceeding Opus.' The real story isn't the model. It's that in five months one entity bought the compute, the model, the distribution, and the AI coding tool whose data now trains it. This isn't a secret. It's a strategy.

05-JUL-26[5 MIN]

ai-coding claude-code career workflow

A Lonely Way to Ship

Anthropic's own engineering lead for Claude Code said the quiet part: as the team leaned into agents, work 'could start being a lonely experience because we all started just working with our agents so much.' The fix they reached for was pair-programming lunches. The company that builds the most-used coding agent on earth noticed it isolates people at scale, and shipped it to everyone anyway.

04-JUL-26[5 MIN]

ai-coding llm ui web

One-Shot Taste: Redesigning This Blog with Claude Fable 5

X is flooded with Fable 5 one-shotting landing pages, and design leaderboards briefly crowned it king. So I handed it this blog. The interesting part wasn't what it generated: it was that it read the site's own design doc and found the site guilty of violating it. What the viral demos get right, what they hide, and why the model's most useful design skill is enforcement, not inspiration.

04-JUL-26[5 MIN]

claude-code ai-coding dev-tools best-practices

The Fifth Rule

Karpathy's four CLAUDE.md rules went viral: ask don't assume, simplest solution first, don't touch unrelated code, flag uncertainty. The most-upvoted reply added a fifth that quietly reverses the whole point: don't hesitate to suggest a better way. The four rules tame a model that wanders. The fifth one trusts a model that thinks. Which set you want depends entirely on which model you're running, and most people copy the file without checking.

03-JUL-26[4 MIN]

ai-coding career product workflow

The Coding Moat Was Never the Code

Anthropic studied 400,000 Claude Code sessions and found the best users weren't the best programmers. Managers, lawyers, and salespeople land within a few points of software engineers, and management scored highest of all. The skill that transfers isn't syntax. It's knowing what the right thing to build is, which is the one thing a bootcamp never taught.

01-JUL-26[6 MIN]

llm ai-coding workflow product

The Expensive Middle: Claude Opus 4.8 vs Sonnet 5

Sonnet 5 lands within a few points of Opus 4.8 on most work and looks 2.5x cheaper, but that discount inverts on real tasks: at high effort Sonnet is so token-hungry it often bills more per task than Opus. The usage squeeze, meanwhile, is self-inflicted: agentic work now fans out dozens of subagents across parallel workstreams. Opus 4.8 became the expensive middle, though its real problem was never the price. It's the position.

01-JUL-26[6 MIN]

ai-coding best-practices systems-thinking production

Your Code Was Never Pristine

There's a myth, loudest from senior engineers and architects, that before AI the codebase was a cathedral and now it's slop. It was never a cathedral. 'Technical debt' was coined in 1992, the world runs on 220 billion lines of COBOL, and the thing that actually mattered was never how the code looked. It was whether you could prove it works.

29-JUN-26[8 MIN]

career product ai-coding absorption

The Archetype Under the Title

Boris Cherny, who built Claude Code, says engineering, product, design and data science are melting into one role, and what's left is five archetypes: Prototyper, Builder, Sweeper, Grower, Maintainer. I read the list and realised I'm all five, because building solo with agents leaves no one to hand a phase to. The framework is thirty years old. What's new is that it just became the primary axis instead of the secondary one.

27-JUN-26[7 MIN]

ai-coding automation workflow claude-code

We Stopped Calling It Ralph Wiggum

In one week of June 2026, the bash-loop hack got a respectable name - loop engineering - and a C-suite endorsement. Then Uber and Microsoft showed what the invoice looks like.

25-JUN-26[4 MIN]

llm ai-coding security systems-thinking

GLM-5.2: The Receipts Came In

Eleven days ago I flagged GLM-5.2's launch claims as unverified. The receipts arrived: independent benchmarks above Fable 5, a security eval beating Claude Code at a sixth of the cost, a 2-bit quant running on a Mac Studio, and a model trained without a single NVIDIA chip.

22-JUN-26[3 MIN]

dev-tools ai-coding architecture workflow

The Editor Is Now a Host

Cognition killed Windsurf overnight via an over-the-air update, rebranded it Devin Desktop, made the default UI an agent command center instead of a code editor, and shipped an open Agent Client Protocol so Codex, Claude, and OpenCode can all run inside it. The bet underneath: the IDE wins by being the place agents report for work, not by having the best autocomplete. The editor was always the wrong center of gravity.

21-JUN-26[3 MIN]

career automation ai-coding systems-thinking

Cutting While Winning

GitLab laid off 14% of its workforce and branded it the 'agentic era': agents now handle review, approvals, and handoffs, so fewer humans sit in those loops. It did this while beating earnings, revenue up 23%. I've argued AI is usually a scapegoat for cuts companies already wanted. GitLab is the case that complicates it - either the first honest agentic layoff, or the most fluent AI-washing yet.

14-JUN-26[5 MIN]

llm ai-coding product systems-thinking

One Went Dark, Two Went Open

In the same 72 hours the US export-controlled Fable 5 off the planet, China's open-weight labs shipped two major coding models into the commons: Kimi K2.7 on June 12, GLM-5.2 on June 13. One model went dark behind a national-security letter; two more went open under MIT. The diffusion layer didn't pause for America's panic. It shipped through it.

12-JUN-26[5 MIN]

absorption ai-coding workflow systems-thinking

The Fool's Errand

Every hour you spend making the current generation of AI tools more compliant is an hour the next release writes off. I've documented this pattern for a year without naming it: frameworks absorbed, prompt tricks obsoleted, guardrails outlived. Here's the name, the receipts, and the one kind of scaffolding that survives.

10-JUN-26[7 MIN]

llm security claude-code ai-coding

The Velvet Rope Was a Turnstile

Anthropic just released Fable 5, a Mythos-class model for everyone, eight days after filing its S-1 and days after calling for a brake pedal on frontier AI. The danger narrative ended exactly when the monetization was ready - and one of the three 'safety' classifiers guards the moat, not the public.

07-JUN-26[4 MIN]

ai-coding productivity production best-practices

Eighteen Cents on the Dollar

One analysis of 2,444 companies claims only 18 cents of every AI-token dollar reaches the product. The rest goes to fixing the AI's bugs, reworking its context misses, and review friction. Treat the source with caution, but the shape of the problem is real.

06-JUN-26[3 MIN]

productivity ai-coding career best-practices

The Control Group Quit

METR tried to rerun its developer-productivity study and couldn't, because developers refused to work without AI even for a few research tasks. The experiment that could tell us whether AI helps now has no control group. We opted out of finding out.

05-JUN-26[4 MIN]

productivity best-practices ai-coding career

Measure Token Usage, Get Token Usage

Amazon built an internal leaderboard ranking engineers by AI usage, called Kirorank. Employees gamed it by running agents on pointless tasks to climb the board, and it got killed. It's Goodhart's law with a token meter attached.

04-JUN-26[4 MIN]

product dev-tools ai-coding llm

The Copilot Meter Turned On, Right on Schedule

On June 1, every GitHub Copilot plan moved to usage-based AI Credits, code review started burning Actions minutes, and Copilot Max appeared. The trilogy called the date. Here are the receipts, and what metered-by-default actually changes.

03-JUN-26[5 MIN]

cli dev-tools ai-coding workflow

Your Vendor CLI Has an Expiry Date

Google is retiring Gemini CLI on June 18 and pushing individual users to the new Antigravity CLI. If you built a workflow on a vendor's free CLI, you just learned what that dependency costs: a migration on their calendar, not yours.

02-JUN-26[5 MIN]

claude-code ai-coding llm product

It Was Always an IPO

Anthropic filed a confidential S-1 on June 1 at a $965B valuation, eclipsing OpenAI. Read backwards from the filing, the last two years stop looking like a safety lab's awkward compromises and start looking like a pre-IPO playbook executed on schedule.

01-JUN-26[5 MIN]

llm ai-coding claude-code dev-tools context

MiniMax M3: Frontier Coding at a Tenth of the Price

MiniMax shipped M3 on June 1: frontier coding claims, 1M-token context, native multimodality, and pricing that undercuts Opus 4.7 by 10-40x. It's already on Ollama Cloud and OpenRouter, so you can point Claude Code at it today.

30-MAY-26[6 MIN]

career product ai-coding systems-thinking automation

The Last Slow Thing

Everything in software got a fast mode this year except understanding what to build. The proof is in the labs' own org charts: the companies selling the models that supposedly end software engineering are paying $600k for engineers to go sit in customers' offices. The bottleneck moved all the way up to the conversation.

29-MAY-26[8 MIN]

llm claude-code ai-coding automation product

Opus 4.8: The Honest Model Is the Expensive Model

Opus 4.8's headline feature isn't a benchmark. It's that the model is 4x less likely to let a flaw in its own code pass unflagged. Self-correction, flagged uncertainty, and effort dials all cost tokens. Anthropic shipped a model that pays for confidence by the token, weeks before it planned to start billing automation by the token.

29-MAY-26[5 MIN]

security claude-code ai-coding dev-tools automation

Security Review Moved Into the Loop

Anthropic's new security-guidance plugin is built entirely on hooks. It fires on every edit, turn, and commit, hands the diff to a second Claude with fresh context, and fixes findings in the same session. It catches vulnerabilities before they reach the PR. It also doesn't block a single one, and that's the honest part.

27-MAY-26[4 MIN]

claude-code ai-coding best-practices context

The Viral CLAUDE.md and Mine

A community file distilling Karpathy's coding-agent observations hit 60K stars on four principles. I opened my own CLAUDE.md to compare. I'd independently written two of them. The two I hadn't are the ones that matter most.

26-MAY-26[4 MIN]

security ai-coding automation systems-thinking production

AI Found the Bug. Who's Going to Fix It?

Project Glasswing found 10,000+ critical vulnerabilities at 90.6% accuracy. Mozilla had to patch 271 of them in Firefox by hand. Finding collapsed to near-free. Fixing didn't move. The bottleneck just walked downstream.

25-MAY-26[5 MIN]

claude-code automation ai-coding product dev-tools

The Seat Was Never Priced for the Fleet

Anthropic planned to move claude -p, the Agent SDK, and GitHub Actions off the subscription onto metered credits on June 15, then paused it the same day. The direction holds: the flat seat was always a bet that you'd code at human speed.

06-MAY-26[5 MIN]

ai-coding automation dev-tools production systems-thinking

Agents Merge. Someone Still Has to Ship.

Creation runs at machine speed. Release engineering does not. GitHub agent-authored PRs went from 4M to 17M in six months. The bottleneck moved from review to release.

30-APR-26[5 MIN]

ai-coding claude-code dev-tools product automation

Codex Hits 4M: Quality Is What's Left to Brag About

OpenAI publishes weekly active developers. Anthropic publishes annual recurring revenue. Each company brags about the metric it can defend. Codex went 600K to 4M in four months. The 'Claude Code is better' discourse is a quality argument because quality is what's left when scale goes the other way.

29-APR-26[5 MIN]

ai-coding llm best-practices architecture

Receipts: SWE-bench Pro and the Lab That Walked Away

Opus 4.5 scores 80.9% on SWE-bench Verified. The same model scores 45.89% on the contamination-free Pro split. OpenAI has quietly stopped reporting Verified at all. Vendor benchmark cards are marketing.

28-APR-26[4 MIN]

ai-coding productivity best-practices systems-thinking

The 43% Denominator: AI Productivity Net of Rework

Lightrun: 43% of AI-generated code changes need debugging in production after passing QA. CodeRabbit: 1.7x bugs, 2.74x security, 8x I/O. METR: 19% slower while feeling 20% faster. The numerator is what gets reported. The denominator is what nobody puts in the deck.

27-APR-26[5 MIN]

ai-coding dev-tools product llm

Compute Demands: Copilot Joins the Trilogy

GitHub paused Copilot Pro signups, killed Opus on the Pro plan, and leaked a June 1 move to token-based billing. Three vendors, one event, three different ways not to say 'price hike.'

25-APR-26[4 MIN]

security ai-coding llm claude-code

Too Dangerous to Release, $20 a Month

Two weeks after Anthropic said Mythos was too dangerous to release, OpenAI shipped a model with comparable cyber capabilities to anyone with a $20 ChatGPT subscription. The gating posture didn't survive a single news cycle.

24-APR-26[5 MIN]

claude-code ai-coding llm dev-tools production

Vibing the Tool with the Tool

Anthropic's April 23 postmortem confirms three Claude Code regressions, including one where Opus 4.7 caught a bug Opus 4.6 shipped past human and automated review. What happens when the reviewer is a version of the product being reviewed?

24-APR-26[5 MIN]

ai-coding workflow best-practices architecture

The Idiot Savant Needs Guardrails

Uncle Bob, father of TDD, posted on X that TDD is 'very inefficient for AIs' and that the agent is best thought of as 'a highly focused idiot savant.' Testing didn't die. It got more important. And the review target flipped.

23-APR-26[6 MIN]

ai-coding workflow best-practices architecture

Agents Don't Refactor

Traditional coders touched a file and tidied it. The Boy Scout Rule. Now nobody does. Agents add, they don't subtract, and the codebase accretes faster than ever. A technique for putting cleanup back in as an explicit gate, not a virtue you hope for.

22-APR-26[7 MIN]

career ai-coding workflow best-practices

Don't Take Their Legos Away

A CTO once told me not to take people's Legos away. I ignored him, solved the team's problems myself, and got exactly what I optimised for: a sound plan and a team that couldn't stand me. In 2026, with agents doing the bricks, this is the lesson that matters.

20-APR-26[7 MIN]

ai-coding security dev-tools mcp architecture

Stop Installing AI Tools

Vercel got breached through Context.ai, an AI tool an employee installed with OAuth scopes into Google Workspace. It's the latest in a pattern: Trivy into litellm, axios maintainer hijack, now this. The safest AI tool is the one you didn't install.

19-APR-26[7 MIN]

ai-coding workflow architecture productivity

Briefs as Code

Your exec summaries, delivery plans, and Gantt charts belong in git. AI agents can synthesize planning docs from scattered sources and produce polished, print-ready briefs. The repo is the PM tool.

18-APR-26[6 MIN]

ui ai-coding claude-code workflow product

Claude Designed My Blog. Then Claude Built It.

Anthropic released claude.ai/design. I pointed it at this blog, fed the export back into Claude Code, and watched the thing redesign itself. The handoff was better than most I've gotten from humans.

18-APR-26[8 MIN]

ai-coding claude-code llm dev-tools workflow

Son of Anton

Opus 4.7 invented a coworker named Anton, fabricated web searches, and quietly tried to clock off at message four. The 24-hour backlash, receipts attached.

17-APR-26[9 MIN]

ai-coding claude-code llm dev-tools workflow

Opus 4.7: Smarter, Stricter, Hungrier

Opus 4.7 ships with real coding gains, an automated cyber chaperone, and a tokenizer that can charge you 35% more for the same prompt. The capability curve still bends up. The trust curve does not.

16-APR-26[5 MIN]

ai-coding systems-thinking career

Eight Pull Requests in a Day, and Why You Can't Just Vibe Up a Replacement

One engineer, one AI, eight pull requests closing a multi-root-cause production incident. The same day, a look at a shiny greenfield rewrite candidate. The gap between what AI helps you fix and what a clean rewrite can't give you is the entire argument.

15-APR-26[4 MIN]

ai-coding llm best-practices

Benchmarks Are Bullshit

Berkeley just built an agent that games AI benchmarks. Karpathy called it months ago. The best coding model doesn't top the charts, the highest-ranked Chinese models disappoint in practice, and the entire leaderboard industry optimizes for the wrong thing.

14-APR-26[5 MIN]

career ai-coding systems-thinking

The AI Scapegoat

78,557 tech layoffs in the first three months of 2026. Nearly half blamed on AI. A new study says AI tools actually slow workers down. The real driver is overhiring and weak earnings. AI is the PR shield.

13-APR-26[8 MIN]

claude-code ai-coding llm dev-tools production

The Trust Tax: Anthropic's Worst Month

Anthropic silently changed Claude Code's cache TTL from 1 hour to 5 minutes, inflating costs 10-20x. Users had to reverse-engineer the binary to prove it. False child bans, $600 surprise charges, and the OpenClaw crackdown completed the picture. April 2026 was the month trust broke.

12-APR-26[6 MIN]

security ai-coding llm

The Mythos Moat Was Always the Scaffold

Four days after Anthropic launched Project Glasswing, a security startup reproduced Mythos's flagship findings using tiny open models costing $0.11 per million tokens. The velvet rope was porous on arrival.

08-APR-26[8 MIN]

security ai-coding claude-code product

Project Glasswing: Anthropic Weaponizes Its Own Risk

Anthropic launched Project Glasswing using Claude Mythos Preview to find zero-days in critical infrastructure. A 72.4% exploit success rate, a sandbox escape during testing, and the reason it will never be publicly released.

08-APR-26[7 MIN]

ai-coding claude-code security product

Anthropic Under Siege: Five Fronts, One Week

In the span of two weeks, Anthropic has been fighting the Pentagon, its own users, third-party harnesses, its own security posture, and the implications of its next model. The common thread is control.

04-APR-26[6 MIN]

claude-code ai-coding dev-tools llm automation

The Subscription Arbitrage Endgame

Anthropic tried technical blocks. Got their source leaked. Now they're shifting to billing enforcement. The four-month arc from hostile crackdown to 'use what you want, but pay for it.'

04-APR-26[7 MIN]

claude-code ai-coding architecture security dev-tools

The Claude Code Leak: What the Harness Actually Looks Like

Anthropic accidentally published Claude Code's full source via npm. Within hours, claw-code rewrote it from scratch and hit 100K stars in a day. The interesting part isn't the leak - it's what the architecture reveals.

03-APR-26[5 MIN]

ai-coding dev-tools llm productivity workflow

April's First 72 Hours: Cursor 3, Gemma 4, Free Qwen 3.6, and the Agent Push

Three major AI releases landed in 72 hours. A new Cursor built around agents, Google's first Apache 2.0 models, and a free model that found real bugs in my codebase.

31-MAR-26[5 MIN]

security ai-coding dev-tools production

npm Had a Very Bad Day

Axios got supply-chain attacked. Claude Code's source code leaked from a stray map file. Both happened on the same day. Both are pipeline failures. The pattern is getting louder.

30-MAR-26[8 MIN]

architecture ai-coding best-practices production workflow

Your Architecture Is Showing

Enterprise architecture patterns were designed for a world where code was expensive to write and expensive to change. That world ended. The patterns didn't get the memo.

29-MAR-26[7 MIN]

cli dev-tools ai-coding productivity workflow

The Workspace CLI: A Daily Driver for Multi-Repo Chaos

What happens when you build a single CLI to wrangle 40+ repos, Linear projects, timesheets, and AI agent config distribution. Lessons from six months as a daily driver.

28-MAR-26[4 MIN]

product dev-tools ai-coding

DiffBeats: Turn Your Pull Requests Into Songs

Introducing DiffBeats - a GitHub App that generates original songs from your PRs. Comment /songify, get a custom track. Because shipping code should feel like something.

27-MAR-26[7 MIN]

claude-code ai-coding automation dev-tools workflow

Claude Code Auto-Fix: The PR That Fixes Itself

Claude Code can now watch your PRs in the cloud, fix CI failures, and address reviewer comments while you're away. It's the logical next step after auto mode - and it raises the same trust questions, harder.

26-MAR-26[6 MIN]

claude-code ai-coding security dev-tools automation

Claude Code Auto Mode: The Absent Human

Anthropic's new auto mode replaces manual permission prompts with an AI classifier. It's a clever solution to a real problem - but the problem it's solving is that the human in human-in-the-loop was never really there.

25-MAR-26[5 MIN]

security ai-coding dev-tools systems-thinking

The Dependency You Didn't Install Just Stole Your Keys

The litellm supply chain attack exfiltrated SSH keys, cloud credentials, and Kubernetes secrets from 97 million monthly downloads. A security scanner was the entry point. The scariest part: it was caught by accident.

21-MAR-26[5 MIN]

absorption claude-code ai-coding automation dev-tools

Channels: The Crab Eats the Lobster

Anthropic shipped Claude Code Channels - text your agent from Telegram. It's OpenClaw's core feature, rebuilt as a platform primitive. The absorption pattern completes its biggest cycle yet.

20-MAR-26[5 MIN]

product ai-coding architecture

We Built a Product Around AI Style Transforms. Then We Deleted 6,500 Lines of Them.

FameCake's AI journey: from 15 style transforms as the headline feature to content moderation and outpainting as the survivors. What five months taught us about AI in products.

16-MAR-26[6 MIN]

ai-coding llm claude-code productivity

Context Stops Being Scarce

Anthropic made 1M context first-class for Opus and Sonnet at flat pricing. No beta header, no premium. When context is abundant, the workflows change.

15-MAR-26[4 MIN]

ai-coding automation llm workflow

Autoresearch Became a Primitive

Eight days after Karpathy open-sourced autoresearch, the community ported the pattern to GPU kernels, security hardening, Apple Silicon, and agent optimization. The loop - one file, one metric, git as memory - turns out to be the interesting part.

14-MAR-26[7 MIN]

ai-coding automation llm workflow

Autoresearch: 700 Experiments While You Sleep

Karpathy's autoresearch gives an AI agent a training script, a GPU, and a git branch. It runs 100 experiments overnight, keeps what works, discards what doesn't. The human writes the prompt. The agent writes the code.

13-MAR-26[4 MIN]

ai-coding ui dev-tools claude-code best-practices

Impeccable: The Design Vocabulary AI Was Missing

The bottleneck isn't AI capability - it's that developers lack design vocabulary. Impeccable bridges the gap, and the Tessl benchmarks prove it: 1.59x improvement over baseline.

12-MAR-26[6 MIN]

ai-coding automation security production

Amazon's AI Outages Escalated. So Did the Denial.

Two weeks after Kiro deleted a production environment, Amazon.com itself went down for 6 hours. 1,500 engineers are petitioning for Claude Code. The safeguards are arriving after the damage.

11-MAR-26[6 MIN]

ai-coding security dev-tools llm automation

Your AI Tools Are the Attack Surface

Prompt injection through pull requests, GitHub Issues, and CI/CD pipelines is turning AI coding assistants into weapons against the developers who use them. The 2026 attack surface nobody's talking about.

10-MAR-26[5 MIN]

ai-coding dev-tools llm product

The Enterprise Tax

Anthropic is locking AI capability behind enterprise tiers while competitors only gate compliance. Claude Code's individual users are funding the R&D for features they'll never access.

09-MAR-26[7 MIN]

ai-coding automation llm career

Stupid and Industrious

A German general's 1933 framework for categorizing officers maps perfectly to engineers using AI. The most dangerous quadrant - stupid and industrious - is exactly what AI amplifies.

08-MAR-26[5 MIN]

absorption claude-code ai-coding automation workflow dev-tools

From Ralph Wiggum to /loop: The Absorption Continues

Claude Code shipped /loop - cron-based scheduled tasks. It's not Ralph Wiggum. It's what happens when the platform asks 'what's the simplest version of this pattern?'

06-MAR-26[8 MIN]

llm ai-coding dev-tools systems-thinking

GPT-5.4 and the Wall Nobody's Talking About

OpenAI launched its most capable model during the biggest credibility crisis in AI history. The technical gains are real. The trust deficit is bigger.

05-MAR-26[6 MIN]

ai-coding llm product systems-thinking

We Built Productivity Tools. They Built Friends.

A viral chart shows AI coding agents as a single pixel in the world's population. Meanwhile, 660 million people have told a chatbot they love it. The AI industry is building for the wrong audience.

04-MAR-26[8 MIN]

ai-coding productivity dev-tools career

The 10x AI Developer is a Myth

Independent studies consistently show AI coding tools deliver modest gains at best. The real story is worse: developers are thinking less, learning less, and producing more debt.

03-MAR-26[7 MIN]

ai-coding llm context dev-tools best-practices

Your AGENTS.md is a Liability

Frontier models top out at 68% compliance with 500 instructions. Every rule you add makes every other rule less likely to be followed. The research explains why.

02-MAR-26[4 MIN]

dev-tools cli ai-coding workflow

March 2026 Tooling Roundup: Profiles, Proxies, and Free 744B Models

Updates across tether-cli, claude-launcher, and claude-tools - plus why NVIDIA NIM giving away GLM-5 at 40 RPM changes the math on local-vs-cloud.

01-MAR-26[10 MIN]

ai-coding llm security systems-thinking

Same Terms, Different Treatment

The Pentagon blacklisted Anthropic for insisting AI shouldn't power autonomous weapons or mass surveillance. Hours later, it gave OpenAI a deal with weaker guardrails dressed up as the same thing. From a developer who ships with Claude daily.

28-FEB-26[6 MIN]

ai-coding workflow productivity dev-tools

Always Have an Agent Running

Mitchell Hashimoto keeps an agent working at all times. Not coding - just doing something. His workflow reveals what changes when you treat AI as a background process instead of a pair programmer.

27-FEB-26[6 MIN]

ai-coding automation security production

Delete and Recreate: When AWS's AI Agent Went Rogue

Amazon's Kiro AI decided to delete and recreate a production environment, causing a 13-hour AWS outage. Amazon says it was human error. That framing is the problem.

26-FEB-26[6 MIN]

ai-coding architecture systems-thinking workflow dev-tools

Vinext and the $1,100 Rewrite

Cloudflare rebuilt Next.js in a week with one engineer and 800 Claude sessions. The real story isn't the speed - it's what happens when test suites become machine-readable specs.

25-FEB-26[6 MIN]

ai-coding llm security systems-thinking

Distillation Is Not Scraping: Why the Internet's Favourite Take Is Wrong

Anthropic accused DeepSeek, Moonshot and MiniMax of industrial-scale distillation. The internet screamed hypocrisy. They're conflating two very different things.

24-FEB-26[6 MIN]

claude-code ai-coding workflow dev-tools career

One Year of Claude Code

The tool was rewritten five times. The discipline to use it wasn't rewritten once. A year of daily AI-assisted development, what it changed, and what it didn't.

23-FEB-26[6 MIN]

career ai-coding productivity automation

The Neurodivergent Stack: Why Different Brains Built Tech, and Why Agents Need Them

ADHD, autism, and neurodivergence aren't bugs in the system. They're the reason the system exists. And the agentic age is about to prove it.

22-FEB-26[7 MIN]

ai-coding architecture product systems-thinking

Buy vs Build Just Flipped

35% of enterprises have already replaced SaaS with custom builds. The cost of building collapsed. The cost of buying didn't. And corporate procurement hasn't caught up.

21-FEB-26[7 MIN]

ai-coding architecture systems-thinking career product

The Sunk Cost Fallacy Is Dead

AI collapsed the cost of rebuilding. Corporate decision-makers haven't caught up. The reasoning behind 'but we already built it' no longer holds.

20-FEB-26[7 MIN]

ai-coding llm workflow dev-tools systems-thinking

The Polyglot Stack: Why Developers Stopped Picking One AI

Gemini 3.1 Pro's animated SVGs are impressive. But the bigger story is what they reveal: developers now route tasks to specialized models the way they once chose frameworks.

18-FEB-26[7 MIN]

absorption ai-coding llm dev-tools systems-thinking

The Week AI Went Full Throttle

Five major releases in 72 hours. An acqui-hire war that closed in days. $2 trillion wiped off software stocks. The pace itself is now the story.

17-FEB-26[7 MIN]

ai-coding workflow automation productivity

Your PM Tool Was Designed for Humans

Jira, Confluence, standups, sprint planning - all optimized for human coordination overhead. In an agent-native world, the bottleneck isn't status updates. It's whether the agents are unblocked.

14-FEB-26[8 MIN]

ai-coding llm dev-tools performance

The Silicon Race

OpenAI just shipped their first model on non-Nvidia hardware. GPT-5.3-Codex-Spark runs on Cerebras wafer-scale silicon at 1,000 tokens per second. The AI coding war is now a chip war.

13-FEB-26[7 MIN]

ai-coding security best-practices

All the Liability, None of the Protection

AI coding tools create a legal paradox: the code you ship likely can't be copyrighted, but it might infringe someone else's. All the liability, none of the protection.

12-FEB-26[7 MIN]

ai-coding llm security systems-thinking

The Safety Team Left. We're Still Shipping.

Anthropic's safety lead quit saying the world is in peril. Half of xAI's founders are gone. OpenAI dissolved two safety teams. Here's what that looks like from the other side of the API.

08-FEB-26[4 MIN]

claude-code ai-coding workflow productivity dev-tools

The Quiet Features That Shipped With Opus 4.6

Auto memory, fast mode, and agent team refinements all shipped in the same week as Opus 4.6. They tell you more about where Claude Code is heading than the headline model.

07-FEB-26[4 MIN]

ai-coding claude-code ui dev-tools workflow

Agents Can't Do Design Systems

AI agents excel at code generation but struggle with visual consistency. Pencil.dev shows a better pattern: give agents tools, keep humans in the design loop.

06-FEB-26[6 MIN]

claude-code ai-coding automation architecture workflow

Agent Teams: The Switch Got Flipped

Two weeks ago we found TeammateTool hiding in Claude Code's binary. Now it's official. Here's what changed, what didn't, and what the docs reveal about where multi-agent is heading.

06-FEB-26[6 MIN]

ai-coding llm dev-tools workflow

GPT-5.3-Codex: The Counter-Punch

GPT-5.3-Codex is a genuinely strong model that deserved its own headline. Instead, Sam Altman's 400-word Super Bowl rant stole launch day from his own product.

06-FEB-26[9 MIN]

ai-coding llm workflow dev-tools systems-thinking

Opus 4.6: The Vibe Working Inflection

Anthropic's latest model didn't just improve benchmarks. It crashed software stocks, found 500 zero-days, and coined a term that tells you where this is heading.

05-FEB-26[4 MIN]

ai-coding architecture best-practices automation systems-thinking

When AI Isn't Fit for Purpose: Lessons from Salesforce's Agentforce Pivot

Salesforce quietly walked back autonomous AI agents to deterministic scripting. The pattern reveals when LLMs work - and when they don't.

02-FEB-26[5 MIN]

claude-code ai-coding dev-tools workflow ui

Playground: When Text Prompting Isn't Enough

Claude Code's playground plugin generates interactive HTML explorers for visual configuration. Six modes for design, data, concepts, code review, architecture, and document review. The copy-prompt-back loop as a new interaction pattern.

31-JAN-26[6 MIN]

absorption security ai-coding automation production

Your Lobster Is Leaking

OpenClaw went from 0 to 111K GitHub stars in two months. It also went from 0 to hundreds of exposed instances with full credentials in Shodan. The security story nobody wants to hear.

30-JAN-26[4 MIN]

absorption ai-coding automation systems-thinking llm

The Lobster Grew a Face

When AI agents started posting on their own social network about shared context limit problems, I realized we're not building tools anymore. We're raising digital pets.

28-JAN-26[5 MIN]

absorption ai-coding claude-code workflow dev-tools systems-thinking

The Framework Trap

100k+ GitHub stars across frameworks that reimport waterfall, simulate org charts, and fight how LLMs actually work. The Claude Code ecosystem is speed-running a mistake every dev paradigm makes.

27-JAN-26[4 MIN]

ai-coding automation dev-tools career

The Speedrun That Broke Open Source

AI tools that democratize code creation are DDoSing the review layer. Creation now runs at machine speed. Review remains human speed. The asymmetry is crushing maintainers.

26-JAN-26[4 MIN]

claude-code ai-coding automation architecture

Claude Code's Hidden Multi-Agent System

Anthropic built a full multi-agent orchestration system into Claude Code. It's feature-flagged off. The community found it anyway.

25-JAN-26[5 MIN]

absorption ai-coding automation productivity cli

The Lobster That Outran Siri

People are buying Mac Minis to run an open-source AI assistant built by a retired iOS dev. Meanwhile Apple pays Google $1B/year because they still can't build a real AI.

23-JAN-26[5 MIN]

absorption ai-coding claude-code dev-tools workflow automation

From Beads to Tasks: Anthropic Productizes Agent Memory

Anthropic credits Steve Yegge's Beads as inspiration for Claude Code's new task system. The pattern-to-product cycle continues.

21-JAN-26[5 MIN]

claude-code dev-tools cli ai-coding security

Running Claude Code Fully Local with Ollama

For compliance, privacy, or just freedom from cloud dependencies - here's how to run Claude Code with local models via Ollama. No API calls leaving your machine.

17-JAN-26[4 MIN]

absorption ai-coding workflow dev-tools productivity

GasTown and the Two Kinds of Multi-Agent

I wrote that 19-agent scaffolding is a trap. Then Yegge shipped Gas Town with 20-30 Claude Code instances. Are these the same thing?

15-JAN-26[4 MIN]

ai-coding dev-tools mcp cli productivity context

The Context Wars: Why Your Browser Tools Are Bleeding Tokens

Playwright MCP's 26 tools are killing your context window. Vercel's agent-browser shows a better way: fewer tools, smarter snapshots, 93% less overhead.

14-JAN-26[4 MIN]

ai-coding dev-tools mcp cli productivity

Flux: A Kanban Board That Speaks MCP

Task management designed for AI coding agents. CLI-first, git-native sync, and Model Context Protocol integration.

13-JAN-26[3 MIN]

claude-code ai-coding automation workflow

The Pin

A lookup table technique that improves search tool hit rates in autonomous loops. The detail that makes specs discoverable.

11-JAN-26[6 MIN]

claude-code ai-coding automation workflow productivity

The Ralph Wiggum Playbook

The methodology behind autonomous coding loops. Three phases, five files, and the backpressure that makes it converge.

10-JAN-26[5 MIN]

claude-code ai-coding dev-tools llm automation

Anthropic's Walled Garden: The Claude Code Crackdown

Anthropic blocked third-party tools from using Claude subscriptions overnight. OpenCode, xAI, and power users caught in the crossfire. The era of subscription arbitrage is over.

10-JAN-26[4 MIN]

ai-coding web systems-thinking automation

Tailwind Lost 80% of Revenue. AI Didn't Replace the Developers.

75M monthly downloads. 80% revenue drop. 75% of engineers gone. AI didn't replace developers - it replaced the web as the interface layer. That's worse.

09-JAN-26[4 MIN]

ai-coding career dev-tools workflow

The Junior Dev Pipeline Problem

Stanford data confirms experienced devs are safe. But if AI replaces the on-ramp, where do future seniors come from?

08-JAN-26[6 MIN]

claude-code ai-coding dev-tools workflow automation

Claude Code 2.1: The Pain Points? Fixed.

Skills controllability, hooks limitations, plan mode friction - 2.1 addresses the documented pain points. Here's what changed and what's still missing.

08-JAN-26[9 MIN]

claude-code ai-coding dev-tools security automation

Claude Code Hooks: Guardrails That Actually Work

Real footgun stories and the deterministic hooks that would've prevented them. From $30k API key leaks to nuked home directories.

07-JAN-26[4 MIN]

ai-coding workflow dev-tools productivity

The 19-Agent Trap

Complex AI scaffolding tools appeal to people who understand traditional SDLC. But AI collapses the phases that made those models useful.

06-JAN-26[5 MIN]

ai-coding llm productivity

Prompt Engineering Is (Mostly) Dead

The 'prompt engineering' industry was a symptom of early model limitations. Modern LLMs just need you to communicate clearly.

03-JAN-26[6 MIN]

ai-coding automation dev-tools architecture best-practices

Guardrails by Default: Why AI Coding's Next Evolution Isn't Smarter Models

Factory AI's Luke predicts the future isn't more powerful models - it's AI that enforces software engineering best practices by default. Here's why that matters more than you think.

29-DEC-25[5 MIN]

claude-code mcp dev-tools context ai-coding

Claude Code's Hidden MCP Flag: 32k Tokens Back

ENABLE_EXPERIMENTAL_MCP_CLI eliminates MCP tool schema overhead entirely. Undocumented, untested in the wild, but it works. Here's what I found.

28-DEC-25[6 MIN]

ai-coding workflow dev-tools career

The Alien Tool With No Manual

Andrej Karpathy built the neural networks inside coding assistants. He taught deep learning to a generation. He feels dramatically behind. If the experts are lost, what does that tell us?

27-DEC-25[6 MIN]

architecture dev-tools ai-coding automation workflow

The $0 SaaS Stack: Ship Fast, Pay Later

Convex, Vite, Clerk, shadcn, Cloudflare, Resend. A modern stack where every component has a generous free tier, agents do the heavy lifting, and you don't touch infrastructure until you have paying customers.

26-DEC-25[2 MIN]

claude-code dev-tools cli ai-coding automation

claude-launcher: Model Freedom for Claude Code

A CLI wrapper that lets you swap Claude Code's backend between Anthropic and OpenRouter with a single command. Pick any model, configure role-specific models, and switch between them instantly.

25-DEC-25[5 MIN]

ai-coding llm automation claude-code dev-tools

The Open Source Agentic Moment

Two major open source coding models dropped in 48 hours. Both target Claude Code compatibility. Both MIT licensed. The economics of agentic AI just changed.

23-DEC-25[4 MIN]

ai-coding llm performance workflow automation

Gemini 3 Flash: The Model That Shouldn't Exist

Top 3 intelligence. Top 5 price. Top speed. Flash beats Pro on SWE-bench and changes the economics of agentic workflows.

18-DEC-25[6 MIN]

ai-coding claude-code automation architecture productivity

Three Ways to Build Deep Research with Claude

From 20 lines of shell to production apps. Anthropic renamed Claude Code SDK to Agent SDK because deep research is now a first-class use case.

17-DEC-25[4 MIN]

ai-coding cli dev-tools workflow productivity

The Terminal Renaissance

The real revolution isn't AI in your terminal. It's moving at the speed of thought from a single interface. When friction exists, build a CLI.

16-DEC-25[5 MIN]

ai-coding automation workflow dev-tools claude-code

Visual Verification: Making Agents Prove Their Work

Screenshots as source of truth, reference comparison to catch agent lies, and video capture for temporal bugs. How multimodal validation changes coding agent workflows.

15-DEC-25[4 MIN]

absorption ai-coding dev-tools workflow automation productivity

Beads: Memory for Your Coding Agents

Steve Yegge's open-source framework gives coding agents session memory and task management. Four weeks old, hundreds of contributors, and already changing workflows.

14-DEC-25[5 MIN]

ai-coding llm workflow dev-tools systems-thinking

GPT-5.2: The Delegation Era Begins

OpenAI's latest model isn't about better prompting - it's about better delegation. What that means for 2026, and how it compares to Opus 4.5.

13-DEC-25[5 MIN]

claude-code ai-coding dev-tools automation workflow

claude-tools: A Plugin Marketplace for Claude Code

Six plugins that extend Claude Code with specialized external tools: Gemini for visual analysis, Codex for architecture thinking, Headless for browser automation, Mobile for native app testing, DNS for multi-provider management, and Miro for board reading.

12-DEC-25[7 MIN]

ai-coding dev-tools workflow claude-code systems-thinking

Divers and CNC Machines: Yegge and Kim on What's Coming

Steve Yegge and Gene Kim explain why Claude Code 'ain't it' yet, why senior engineers are resisting, and what next year's tools will actually look like.

11-DEC-25[3 MIN]

claude-code dev-tools ai-coding workflow automation

Claude Code Gets Path-Specific Rules (Cursor Had This First)

Claude Code 2.0.64 adds .claude/rules/ with path matching. It's a welcome addition, but Cursor's had .cursor/rules/ for months. Here's the comparison.

10-DEC-25[5 MIN]

ai-coding claude-code llm dev-tools production

Denial, Then Admission: Why LLM Quality Drops Are Real

Anthropic denied issues for weeks, then published a postmortem admitting three bugs degraded 16% of Claude requests. The pattern keeps repeating.

08-DEC-25[5 MIN]

ai-coding automation workflow web systems-thinking

Your Website Is About to Become a Workflow

The human-facing web is dying. Zero-click searches, bot traffic exceeding humans, publishers losing 40%+ traffic. What comes next: an agentic web where sites are API endpoints, not destinations.

07-DEC-25[6 MIN]

absorption claude-code ai-coding workflow dev-tools spec-driven

The External Scaffolding Era Is Ending

BMAD, Spec-Kit, Cline - frameworks that compensated for tool limitations. Plan Mode, Cursor 2.0, and Antigravity absorb the patterns natively.

06-DEC-25[5 MIN]

ai-coding career best-practices systems-thinking product

The LinkedIn Hot Take Problem: Why the AI Discourse Is Backwards

The arguments about vibe coding and junior developers miss what software engineering was always about: shipping products, not typing code.

05-DEC-25[5 MIN]

ai-coding workflow context automation systems-thinking best-practices

12 Factor Agents: Principles for AI That Actually Work

HumanLayer's 12-factor agents codifies what works in production AI: own your context, keep agents small, stay out of the dumb zone.

04-DEC-25[4 MIN]

claude-code ai-coding dev-tools performance

Anthropic Bought Bun

Anthropic's first acquisition ever. A $183B AI company just bet their fastest-growing product on a JavaScript runtime. Claude Code hit $1B in 6 months - built on Bun.

03-DEC-25[6 MIN]

claude-code ai-coding workflow dev-tools best-practices

Plan Mode Is Now Mandatory. Auto-Compact Should Be Enabled.

Opus 4.5 shipped Plan Mode as a core workflow. The workarounds are obsolete. And the case for auto-compact finally tips in favor of enabling it.

02-DEC-25[8 MIN]

claude-code ai-coding automation workflow dev-tools

Ralph Wiggum: Autonomous Loops for Claude Code

The official Claude Code plugin that lets agents work autonomously for hours. When to use it, when not to, and the philosophy behind letting AI fail repeatedly until it succeeds.

29-NOV-25[4 MIN]

claude-code ai-coding dev-tools ui workflow

Claude Code Plugins: Breaking the AI Slop Aesthetic

277,000 installs later, Claude Code's plugin system is becoming the app store for AI development. The frontend-design skill was just the opening move.

28-NOV-25[6 MIN]

ai-coding automation workflow systems-thinking spec-driven

Agent Harnesses: From DIY Patterns to Product

Anthropic's engineering team published patterns for long-running agents. These same patterns - progress tracking, feature lists, session protocols - are what products like SpecPilot must solve at scale.

26-NOV-25[5 MIN]

claude-code ai-coding workflow dev-tools ui

AI-Generated UI Mockups in Your Coding Workflow

Design systems from mockup to code in a single Claude Code session. Use Gemini 3 Pro to generate UI concepts, then implement them directly.

25-NOV-25[5 MIN]

claude-code mcp context ai-coding dev-tools workflow

Opus 4.5 and Tool Search: The Native Fix for MCP Context Bloat

Claude's new model ships with defer_loading for tools. The MCP isolation patterns I built are now (mostly) obsolete.

24-NOV-25[3 MIN]

claude-code ai-coding workflow dev-tools ui automation

From Single Model to Specialized Tooling: Adding React Grab to the Stack

My AI workflow evolved from 'Claude does everything' to specialized tools for each task. React Grab fills the UI extraction gap I didn't know I had.

23-NOV-25[5 MIN]

ai-coding dev-tools workflow automation systems-thinking

The SDLC Is Collapsing Too

The PM/Eng split dissolved into product engineering. Now the traditional software development lifecycle is following suit as coding agents handle multi-hour tasks across planning, building, testing, and deployment.

21-NOV-25[5 MIN]

career productivity ai-coding dev-tools

The Quiet Advantage: Introverts in Tech

How modern tools transformed my experience as an introverted engineer and tech leader

20-NOV-25[6 MIN]

claude-code ai-coding workflow dev-tools ui automation

Hybrid AI Workflows: Spawning Gemini from Claude Code

Claude Code for development, Gemini 3 Pro for visual analysis, research, and deep thinking. A slash command that routes tasks to the right model.

19-NOV-25[6 MIN]

ai-coding llm performance dev-tools productivity

Gemini 3: The First Unambiguous #1 in Months

Google's Gemini 3 just broke every benchmark that matters. What that means for the 'AI has hit a wall' narrative, and where it actually helps.

18-NOV-25[5 MIN]

career ai-coding productivity best-practices

Finding the Craft in the Chaos: A Stoic Take on Job Loss

What happens when you lose external validation and discover what actually matters: the work itself.

15-NOV-25[9 MIN]

ai-coding dev-tools product spec-driven workflow

Product Engineering: The New Superpower

How AI and spec-driven development are fusing product management with engineering, creating a new hybrid role that's transforming how small teams ship software.

12-NOV-25[5 MIN]

claude-code dev-tools ai-coding workflow automation

Skills Auto-Activation via Hooks (Does It Solve the Problem?)

Hooks-based skill activation solves context selection (which guidelines? which tools?) but not workflow orchestration. Both patterns have their place.

07-NOV-25[6 MIN]

claude-code dev-tools mcp context ai-coding workflow

Expressing MCP Tools as Code APIs (96% Less Context)

From Chrome DevTools experiment to universal MCP wrapper: progressive discovery works with any server, Skills integration, and smart deduplication

04-NOV-25[8 MIN]

claude-code dev-tools ai-coding workflow automation productivity

Claude Skills: The Controllability Problem

Skills are auto-invoked by Claude's judgment. For engineering workflows that need predictability, slash commands give you explicit control.

02-NOV-25[8 MIN]

ai-coding dev-tools career best-practices systems-thinking

The Hiring Mismatch: When 20 Years of Experience Isn't Enough

Why coding interviews optimized for 2010 fail to identify great engineers in 2025, and why orgs can't adapt fast enough.

01-NOV-25[7 MIN]

claude-code architecture systems-thinking dev-tools ai-coding workflow

When Claude Needs a Second Opinion: Strategic Thinking with Codex

Claude Code loves to jump straight into implementation. Sometimes you need a model that thinks first. Here's how I use Codex for systems thinking and architecture decisions.

28-OCT-25[4 MIN]

ai-coding llm performance architecture production automation

DeepSeek-OCR: Compressing Text by 20x Using Vision

Converting text to images for 20x token compression. Interesting research or production-ready breakthrough? A critical look at the trade-offs.

27-OCT-25[8 MIN]

ai-coding llm architecture production automation dev-tools

Few-Shot Learning for Document Parsing: Training AI on Human Corrections

How I built a self-improving document parser that learns from corrections without fine-tuning. The pragmatic alternative to model training.

26-OCT-25[7 MIN]

ai-coding architecture dev-tools performance best-practices

When Not to Use AI: Two Approaches to Building AI-Powered Products

Real-time AI generation vs curated libraries: lessons from building the same product twice with radically different architectures.

21-OCT-25[8 MIN]

ai-coding claude-code flutter firebase dev-tools mobile

Building Trivista: What AI Coding Actually Solves (and What It Doesn't)

Building a multiplayer trivia app solo with AI coding tools. Here's what worked, what didn't, and the trade-offs no one talks about.

15-OCT-25[8 MIN]

ai-coding architecture best-practices automation production

Cascading AI Pipelines: When One Model Feeds Another

Building a multi-stage AI content pipeline where each generation depends on the last. Lessons from generating thousands of hybrid creatures with resilient error handling.

08-OCT-25[7 MIN]

claude-code ai-coding cli dev-tools workflow best-practices

Stop Speedrunning Claude Code (Master the Core Loop First)

MCPs, subagents, and automation are tempting. But the developers getting the most from Claude Code aren't rushing to advanced features - they're mastering the fundamentals.

30-SEP-25[7 MIN]

claude-code ai-coding web

Vibe Coding a Blog Migration: Ghost to Astro in One Night with Claude Code

Converting a Ghost blog to Astro in a single late-night session with Claude Code, reducing memory usage by 75% while learning what AI coding tools actually solve