Posts tagged "Architecture"

18-JUL-26[5 MIN]

Papering Faster: AI Coding Agents and the Root-Cause Fix Nobody Ships

A production API I work on has shipped 227 fix commits and 9 refactors in six months. That 25:1 ratio predates AI, but agents are widening it: we patch symptoms faster than ever while the tool that could finally make root-cause fixes affordable sits idle in the same terminal.

13-JUL-26[8 MIN]

production systems-thinking ai-coding architecture product

Three Instruments, All Lying: Debugging the Metrics Behind FameCake

A simple question (why are there no bookings today?) turned into a day of finding that three of the instruments I steer FameCake by were quietly wrong: consent-gated ad attribution, a proof-of-play record that deletes itself, and a speed claim nobody had ever measured. The bug isn't in any of them. The bug is trusting derived data.

11-JUL-26[5 MIN]

architecture systems-thinking product best-practices

The Like-for-Like Trap: How Vendor Lock-In Survives a Platform Migration

Replacing a locked-in legacy platform doesn't free you if the migration replicates the lock-in. The middleman play, lump-sum fees, and why 'replicate what we have' hands the vendor their leverage back.

02-JUL-26[5 MIN]

llm architecture production systems-thinking

Two Models or Nothing: LLM Consensus for Dirty Data

One LLM on the long tail is a coin flip. How I designed a product-enrichment pipeline around consensus voting, abstention, and content-hashed freshness gates.

29-JUN-26[6 MIN]

llm architecture context performance

Lazy Context: Why Retrieval Beats Compression

Headroom went from zero to 40k GitHub stars by attacking agentic token bloat. The durable idea isn't the tool - it's treating context compression as a retrieval problem.

28-JUN-26[6 MIN]

architecture product systems-thinking security

You Can't Bot a Billboard, But You Can Game a Like

Rewards built on likes are unverifiable and gameable. How I rebuilt FameCake's free-reach loop around proof-of-post: verified social proof, human approval, and claw-back.

23-JUN-26[8 MIN]

llm architecture automation systems-thinking

A Multi-Agent System Sold as a Model: Sakana's Fugu

Sakana AI's Fugu collapses a multi-agent orchestration system into one OpenAI-compatible endpoint. The idea is genuinely interesting. The benchmark and export-control claims need a second look.

22-JUN-26[3 MIN]

dev-tools ai-coding architecture workflow

The Editor Is Now a Host

Cognition killed Windsurf overnight via an over-the-air update, rebranded it Devin Desktop, made the default UI an agent command center instead of a code editor, and shipped an open Agent Client Protocol so Codex, Claude, and OpenCode can all run inside it. The bet underneath: the IDE wins by being the place agents report for work, not by having the best autocomplete. The editor was always the wrong center of gravity.

31-MAY-26[5 MIN]

llm systems-thinking product architecture performance

Cheap Is a Hardware Strategy

Google led I/O 2026 with a cheap, fast Gemini Flash instead of a frontier behemoth, and everyone read it as conceding the top of the market. Wrong read. Cheap isn't a model strategy, it's a silicon strategy. Google owns every layer from the TPU to the search box, which is why it can give intelligence away while its rivals rent the compute to compete with it, some of them for $40 billion.

29-APR-26[5 MIN]

ai-coding llm best-practices architecture

Receipts: SWE-bench Pro and the Lab That Walked Away

Opus 4.5 scores 80.9% on SWE-bench Verified. The same model scores 45.89% on the contamination-free Pro split. OpenAI has quietly stopped reporting Verified at all. Vendor benchmark cards are marketing.

26-APR-26[7 MIN]

mcp security architecture claude-code production

MCP By Design: The Protocol That Won't Be Patched

Anthropic markets MCP as the universal AI tooling standard, but a 200,000-server RCE class is 'expected behavior.' You can't be both.

24-APR-26[5 MIN]

ai-coding workflow best-practices architecture

The Idiot Savant Needs Guardrails

Uncle Bob, father of TDD, posted on X that TDD is 'very inefficient for AIs' and that the agent is best thought of as 'a highly focused idiot savant.' Testing didn't die. It got more important. And the review target flipped.

23-APR-26[6 MIN]

ai-coding workflow best-practices architecture

Agents Don't Refactor

Traditional coders touched a file and tidied it. The Boy Scout Rule. Now nobody does. Agents add, they don't subtract, and the codebase accretes faster than ever. A technique for putting cleanup back in as an explicit gate, not a virtue you hope for.

20-APR-26[7 MIN]

ai-coding security dev-tools mcp architecture

Stop Installing AI Tools

Vercel got breached through Context.ai, an AI tool an employee installed with OAuth scopes into Google Workspace. It's the latest in a pattern: Trivy into litellm, axios maintainer hijack, now this. The safest AI tool is the one you didn't install.

19-APR-26[7 MIN]

ai-coding workflow architecture productivity

Briefs as Code

Your exec summaries, delivery plans, and Gantt charts belong in git. AI agents can synthesize planning docs from scattered sources and produce polished, print-ready briefs. The repo is the PM tool.

04-APR-26[7 MIN]

claude-code ai-coding architecture security dev-tools

The Claude Code Leak: What the Harness Actually Looks Like

Anthropic accidentally published Claude Code's full source via npm. Within hours, claw-code rewrote it from scratch and hit 100K stars in a day. The interesting part isn't the leak - it's what the architecture reveals.

30-MAR-26[8 MIN]

architecture ai-coding best-practices production workflow

Your Architecture Is Showing

Enterprise architecture patterns were designed for a world where code was expensive to write and expensive to change. That world ended. The patterns didn't get the memo.

20-MAR-26[5 MIN]

product ai-coding architecture

We Built a Product Around AI Style Transforms. Then We Deleted 6,500 Lines of Them.

FameCake's AI journey: from 15 style transforms as the headline feature to content moderation and outpainting as the survivors. What five months taught us about AI in products.

26-FEB-26[6 MIN]

ai-coding architecture systems-thinking workflow dev-tools

Vinext and the $1,100 Rewrite

Cloudflare rebuilt Next.js in a week with one engineer and 800 Claude sessions. The real story isn't the speed - it's what happens when test suites become machine-readable specs.

22-FEB-26[7 MIN]

ai-coding architecture product systems-thinking

Buy vs Build Just Flipped

35% of enterprises have already replaced SaaS with custom builds. The cost of building collapsed. The cost of buying didn't. And corporate procurement hasn't caught up.

21-FEB-26[7 MIN]

ai-coding architecture systems-thinking career product

The Sunk Cost Fallacy Is Dead

AI collapsed the cost of rebuilding. Corporate decision-makers haven't caught up. The reasoning behind 'but we already built it' no longer holds.

11-FEB-26[6 MIN]

ui architecture best-practices systems-thinking

Stop Designing in Pixels

Tokens are nouns. Patterns are verbs. The missing layer is grammar: a shared vocabulary that spans Figma, web, and native without breaking when someone ships a 'small' change.

06-FEB-26[6 MIN]

claude-code ai-coding automation architecture workflow

Agent Teams: The Switch Got Flipped

Two weeks ago we found TeammateTool hiding in Claude Code's binary. Now it's official. Here's what changed, what didn't, and what the docs reveal about where multi-agent is heading.

05-FEB-26[4 MIN]

ai-coding architecture best-practices automation systems-thinking

When AI Isn't Fit for Purpose: Lessons from Salesforce's Agentforce Pivot

Salesforce quietly walked back autonomous AI agents to deterministic scripting. The pattern reveals when LLMs work - and when they don't.

26-JAN-26[4 MIN]

claude-code ai-coding automation architecture

Claude Code's Hidden Multi-Agent System

Anthropic built a full multi-agent orchestration system into Claude Code. It's feature-flagged off. The community found it anyway.

03-JAN-26[6 MIN]

ai-coding automation dev-tools architecture best-practices

Guardrails by Default: Why AI Coding's Next Evolution Isn't Smarter Models

Factory AI's Luke predicts the future isn't more powerful models - it's AI that enforces software engineering best practices by default. Here's why that matters more than you think.

27-DEC-25[6 MIN]

architecture dev-tools ai-coding automation workflow

The $0 SaaS Stack: Ship Fast, Pay Later

Convex, Vite, Clerk, shadcn, Cloudflare, Resend. A modern stack where every component has a generous free tier, agents do the heavy lifting, and you don't touch infrastructure until you have paying customers.

18-DEC-25[6 MIN]

ai-coding claude-code automation architecture productivity

Three Ways to Build Deep Research with Claude

From 20 lines of shell to production apps. Anthropic renamed Claude Code SDK to Agent SDK because deep research is now a first-class use case.

01-NOV-25[7 MIN]

claude-code architecture systems-thinking dev-tools ai-coding workflow

When Claude Needs a Second Opinion: Strategic Thinking with Codex

Claude Code loves to jump straight into implementation. Sometimes you need a model that thinks first. Here's how I use Codex for systems thinking and architecture decisions.

28-OCT-25[4 MIN]

ai-coding llm performance architecture production automation

DeepSeek-OCR: Compressing Text by 20x Using Vision

Converting text to images for 20x token compression. Interesting research or production-ready breakthrough? A critical look at the trade-offs.

27-OCT-25[8 MIN]

ai-coding llm architecture production automation dev-tools

Few-Shot Learning for Document Parsing: Training AI on Human Corrections

How I built a self-improving document parser that learns from corrections without fine-tuning. The pragmatic alternative to model training.

26-OCT-25[7 MIN]

ai-coding architecture dev-tools performance best-practices

When Not to Use AI: Two Approaches to Building AI-Powered Products

Real-time AI generation vs curated libraries: lessons from building the same product twice with radically different architectures.

24-OCT-25[6 MIN]

architecture security systems-thinking dotnet

How We Fixed Authorization by Moving Security from Code to Architecture

When expensive SSO was just a symptom of deeper architectural problems, we redesigned our multi-tenant system from first principles and cut costs significantly in the process.

15-OCT-25[8 MIN]

ai-coding architecture best-practices automation production

Cascading AI Pipelines: When One Model Feeds Another

Building a multi-stage AI content pipeline where each generation depends on the last. Lessons from generating thousands of hybrid creatures with resilient error handling.

28-SEP-25[5 MIN]

firebase security performance architecture

Building Real-Time Multiplayer with Firebase: What Works and What Doesn't

Real lessons from shipping multiplayer games with Firebase: what works for small groups, where it breaks down, and the scalability limits you need to know upfront.

08-MAY-19[8 MIN]

flutter dart architecture mobile