The for agent-built UI.

Your AI agent ships UI that's technically correct and experientially wrong. You can see it the moment the diff lands. You don't always have time to fix it.

Get API key →

The gap you keep hitting

Your agent reached for tabs again — even though the job is sequential.

It built another dashboard when the user just needed to do the thing.

The shadcn defaults are showing. Again.

04The cost of the default

Three patterns. Every project. Every model.

Not a benchmark suite — a list of moves your agent makes when it doesn't know any better.

“It built another dashboard.”

The job was do the thing, not watch the thing. Your agent didn't know the difference.

“Tabs again.”

High-stakes, sequential workflows split across status tabs. The default move when the agent doesn't know the flow.

“It looks fine. It's wrong.”

The UI ignores who's using it, when, and why. The diff passed review. The user can't get through it.

05What you get

Whoever you are, you've already redone this work.

Builders / FDEs

You've redone agent UI three times this week. Stop.

Ship the right thing on the first build — same agent, same model, no new tool to learn.

Designers

You watched your design system get flattened into shadcn defaults.

Encode your taste so the agent has to pass through it before the diff lands.

PMs

You shouldn't be the QA layer for agent slop.

Every build measured against the actual job, before it reaches you.

Forward Deployed

You ship custom UI in customer repos every week.

Drop-in in any repo, any agent, any stack. One pointer file. No infra.

06How it shows up in your loop

The agent calls it. You see the result.

01 — Drop in

Paste 6 lines into CLAUDE.md.

Or AGENTS.md. The agent reads it on every run. No SDK, no MCP server to host, no infra.

02 — Build normally

Your agent calls /ux or /visual mid-build.

On its own. It already knows what to send: the changed file, the user task, the page in context.

03 — Apply the fixes

Patches come back, prioritized.

The agent applies them before it opens the PR. The brain stays with us. One pointer file lives in your repo.

CLAUDE.md

## Design judgment

Before finalizing UI changes, call https://api.thedesignagent.ai/ux
with the changed component path and the user task it serves.
Apply returned patches before opening the PR.
Read .thedesignagent in repo root for project conventions.
Use /visual for craft review when shipping new screens.

07Before / After

What changes when the judgment layer is in the loop.

Same task, same agent, same model. The only difference: TheDesignAgent in the build loop.

OPERATIONSThe agent was asked to: build an order fulfillment interface for warehouse operators.

Default agent outputWith TheDesignAgent

localhost:3000

Orders

New (24)

In Progress (8)

Shipped (47)

Completed (132)

Order	Customer	Items	Service	Created
#A-19842	Acme Corp	12 items	Standard	2h ago	›
#A-19841	Northwind	3 items	Express	2h ago	›
#A-19839	Globex	27 items	Standard	3h ago	›
#A-19836	Initech	6 items	Standard	4h ago	›
#A-19834	Soylent	1 item	Express	5h ago	›
#A-19828	Umbrella	9 items	Standard	6h ago	›

localhost:3000

ORDER #A-19842 · ACME CORP · 12 ITEMS

Fulfilling order, end-to-end

operator: maria.k

✓

Claim

0:14

Pick

1:42

Pack

—

Ship

—

PICK LIST · 5 of 12 items

✓

Widget, blue, large

WGT-100 · bin A-12-04

2/2

✓

Widget, blue, small

WGT-100B · bin A-12-08

1/1

Gizmo, brushed steel

GZM-014 · bin C-04-01

1/3

Padded mailer, small

PKG-S · bin P-01-02

0/1

NEXT BIN

C-04-01

Aisle C · Bay 4 · Shelf 1

SCAN

Awaiting scan…

What the agent did wrong

“You’ve seen this exact tab pattern in fifty agent builds.”

Tabs by status — the default reach when data has categories. The operator's job is sequential, not categorical.

What TheDesignAgent saw

The job isn't “monitor orders by category.”

It's “fulfill one order, end-to-end.” The agent picked a layout that fights the workflow.

What shipped instead

Sequential workflow — Claim → Pick → Pack → Ship.

One operator, one order, end-to-end. Status comes from the rail, not from where you click.

09It gets sharper every call

Each call teaches it more about your product, your users, your taste.

The score trajectory is the receipt. No knobs to tune. Just keep building; the layer remembers.

The first time you saw your agent get a screen right, you wondered if it was a fluke. By the tenth call, it isn't.

FIT_SCORE · same project1.8 → 4.6

fit_score over callssame project · same agent

10Coming soon

AI Native Mode

UI built for the agent, not just by it.

A toggle. Same call, same agent. The output is a different kind of UI — chat-first, command-driven, agent-led. Built around how AI actually works inside the surface, not adapted to it.

Most agent-built UI today is conventional UI that an agent assembled. Forms, tables, tabs. AI Native Mode flips the assumption: the agent isn't the builder, it's the partner. The screens it generates are shaped around running an agent, not being a form.

Tied into Trailhead — our growing library of AI-native patterns: command surfaces, agent inboxes, generative previews, instruction-and-confirm flows. The mode picks from Trailhead automatically based on the JTBD.

Join the waitlist →Targeting Q3 2026

From the field notes

What we're learning while building this.

Concepts

The component library was the right answer to a problem agents no longer have

Component libraries exist because building UI was expensive and inconsistency was hard to prevent. AI agents changed the cost equation. The case for building unencumbered — and what you need instead.

5 min read →Anti-patterns

The dashboard default: why agents build viewers when the job is deciding

Four KPI cards, a table, a date filter. The dashboard is the agent's lowest-effort answer to 'build me a tool for this data.' It's also wrong most of the time.

4 min read →

All field notes →

Your agent will not become a designer.It can stop shipping like one who quit.

One price. Two ways to use it.

Flat per-call. Either lens. Patches included. No seats, no tiers, no negotiation.

Pay-as-you-go— v0.1

$0.88/ call

One call = one evaluation, either lens. Both lenses included in your account. The project memory grows with every call you make.

Both lenses — /ux and /visual
Drop-in on any repo, any agent, any stack
Project memory grows with each call
Pointer file in your repo. Brain stays with us.

For humans

Sign up & manage usage

Get an API key. View calls and fit-score history in your dashboard. Pay through Stripe like any other SaaS. Standard for teams.