Opus 4.5 scores 80.9% on SWE-bench Verified. The same model scores 45.89% on the contamination-free Pro split. OpenAI has quietly stopped reporting Verified at all. Vendor benchmark cards are marketing.
Read more →
Lightrun: 43% of AI-generated code changes need debugging in production after passing QA. CodeRabbit: 1.7x the bugs, 2.74x the security issues, 8x the I/O problems. METR: 19% slower while feeling 20% faster. The numerator is what gets reported. The denominator is what nobody puts in the deck.
Read more →
Uncle Bob, TDD's most famous evangelist, posted on X that TDD is 'very inefficient for AIs' and that the agent is best thought of as 'a highly focused idiot savant.' Testing didn't die. It got more important. And the review target flipped.
Read more →
Traditional coders left every file they touched a little cleaner: the Boy Scout Rule. Agents broke it. They add, they don't subtract, and the codebase accretes faster than ever. A technique for putting cleanup back in as an explicit gate, not a virtue you hope for.
Read more →
A CTO once told me not to take people's Legos away. I ignored him, solved the team's problems myself, and got exactly what I optimised for: a sound plan and a team that couldn't stand me. In 2026, with agents laying the bricks, this is the lesson that matters.
Read more →
Berkeley just built an agent that games AI benchmarks. Karpathy called it months ago. The best coding model doesn't top the charts, the highest-ranked Chinese models disappoint in practice, and the entire leaderboard industry optimizes for the wrong thing.
Read more →
Enterprise architecture patterns were designed for a world where code was expensive to write and expensive to change. That world ended. The patterns didn't get the memo.
Read more →
The bottleneck isn't AI capability - it's that developers lack design vocabulary. Impeccable bridges the gap, and the Tessl benchmarks prove it: 1.59x improvement over baseline.
Read more →
Frontier models top out at 68% compliance with 500 instructions. Every rule you add makes every other rule less likely to be followed. The research explains why.
Read more →
AI coding tools create a legal paradox: the code you ship likely can't be copyrighted, but it might infringe someone else's. All the liability, none of the protection.
Read more →
Tokens are nouns. Patterns are verbs. The missing layer is grammar: a shared vocabulary that spans Figma, web, and native without breaking when someone ships a 'small' change.
Read more →
Salesforce quietly walked back autonomous AI agents to deterministic scripting. The pattern reveals when LLMs work - and when they don't.
Read more →
Boris Cherny followed up his personal workflow with tips from across the team. Same tool, different people, different approaches. The patterns worth stealing.
Read more →
Boris Cherny shared his workflow for the tool he built. The setup is surprisingly vanilla. The philosophy is worth studying.
Read more →
Factory AI's Luke predicts the future isn't more powerful models - it's AI that enforces software engineering best practices by default. Here's why that matters more than you think.
Read more →
The arguments about vibe coding and junior developers miss what software engineering was always about: shipping products, not typing code.
Read more →
HumanLayer's 12-factor agents codifies what works in production AI: own your context, keep agents small, stay out of the dumb zone.
Read more →
Opus 4.5 shipped Plan Mode as a core workflow. The workarounds are obsolete. And the case for auto-compact finally tips in favor of enabling it.
Read more →
At around 30 employees, growing companies either mature or become toxic. Here's the playbook for organizational dysfunction - and why your engineering leaders keep leaving.
Read more →
What happens when you lose external validation and discover what actually matters: the work itself.
Read more →
The latest C# release continues its quiet war on ceremony with field-backed properties, extension members, and smarter spans. Here's what matters and what doesn't.
Read more →
.NET 10's shebang support and file-based apps turn C# into a scripting language. No more context-switching to Python for quick scripts.
Read more →
Why coding interviews optimized for 2010 fail to identify great engineers in 2025, and why orgs can't adapt fast enough.
Read more →
Real-time AI generation vs curated libraries: lessons from building the same product twice with radically different architectures.
Read more →
Building a multi-stage AI content pipeline where each generation depends on the last. Lessons from generating thousands of hybrid creatures with resilient error handling.
Read more →
MCPs, subagents, and automation are tempting. But the developers getting the most from Claude Code aren't rushing to advanced features - they're mastering the fundamentals.
Read more →