Ai-coding Tag

Ai-coding

Posts related to ai-coding

128 posts

← Back to all posts

Benchmarks Are Bullshit

Berkeley just built an agent that games AI benchmarks. Karpathy called it months ago. The best coding model doesn't top the charts, the highest-ranked Chinese models disappoint in practice, and the entire leaderboard industry optimizes for the wrong thing.

Read more →