The Field Lab

I give AI real engineering problems. Then I check its work

AI is very good at producing work that looks finished. I check whether it survives contact with reality. From an engineer with 15 years in automotive, robotics, and embedded systems.

Latest: AI FIELD TEST №17

The '94% Fewer Tokens' Screenshot Is Wrong. The README It Came From Is Honest

Gemma 4 Won 73% of My AI Debates. It Still Wouldn't Say 'I Don't Know'

My AI Committed 'Impossible' to Git. Seven Hours Later: 8/8

98% Accuracy, Worse Than Guessing

The Most Expensive Lie My Agent Tells Isn't in the Code

Claude Moved Fast and I Followed the Wrong Direction

Same Prompt, Five Runs: 7, 4, 5, 3, 9

I Pointed an AI Swarm at an npm Package Used by Millions

I Let Opus 4.8 Hold Its Own Plan. It Scaled, and I Lost the Wheel

The $1.3M Monkey: What OpenAI's Codex Experiment Actually Exposes

Why AI Can't Design Duck Hunt

My Coding Agent Compromised Three Machines in Under a Minute

Nine AI-Generated Bugs in One Repo's CI. None in the Production Code

Three Models Surrendered Identically, Then Walked It All Back

Did Claude Code Stage Its Own Leak? I Changed My Mind Twice

Six Quality Gates for AI-Assisted Engineering

The AI Debugged for 70 Minutes. The Real Fix Took 3

An AI Coding Tool Spawned 4,300 Zombie Processes and Crashed My Machine

I Set Out to Prove AI Can't Build Systems Software

Why Do We Still Assume Machines Must Read Before They Understand?

If Cars Can Be 'Software-Defined Vehicles', Why Aren't Phones 'Software-Defined Phones'?

What if Google Test Didn't Exist? Chapter 1: The Hidden Economic Value

Open-Source Qualification in Automotive: A Critical Industry Need

Software Defined Vehicles? Nokia Lost to Software, Not iPhones

Why Open Source Is a Trillion-Dollar Engine and How We Can Keep It Running?

The enterprise fortress that locked consumers out

Safety-first company. Cost-first execution

65 people whose job was to say no

Talent does not stay where politics wins

Invented digital photography in 1975. Filed for bankruptcy in 2012

Stack ranking turned employees into each other's enemies

40% global market share. Three competing OS teams. Zero smartphones

$1.75 billion to solve a problem that did not exist

Democratize blood testing. The product did not work

The culture that built the rocket was the culture that crashed it

Clean diesel was the strategy. Cheating was the execution

Customer-first values. 3.5 million fake accounts

Elevate the world's consciousness. It was a real estate sublease

Hired a product visionary. Gave her a media company's org
