JTBD-calibrated UX and visual evaluation for agent-built UI. Your agent calls /discover before the build, /ux and /visual after. Findings cite the job — not a generic rubric. 5 free credits at signup. No card.
Not a benchmark — the moves your agent reaches for when it doesn't know the job. You've fixed all three this week.
— when the user just needed to do the thing.
The job was do the thing, not watch the thing. Your agent didn't know the difference.
— even though the job is sequential.
High-stakes, sequential workflows split across status tabs. The default move when the agent doesn't know the flow.
— and the diff passed review.
The UI ignores who's using it, when, and why. It looks fine. It's wrong. The user can't get through it.
Identical prompt. Identical model. The only difference is the brief before the build and the scoring after. Drag the slider to see what the judgment layer caught.
/discover before the build, /ux after, /visual when the screenshot matters. Same agent. Same model. New behavior.
— a JTBD brief, before the agent draws a pixel —
— findings cite the job + the file:line —
— headless screenshot, scored vs. your tokens —
Four roles. Same gap. One install closes it — without changing your agent, your stack, or your taste.
— for the builder who's redone agent UI three times this week —
— custom UI in customer code, every week —
— stop watching your system flatten into shadcn defaults —
— every build scored before it reaches you —
AI-generated UI looks AI-generated.
That's a bug — not a vibe.
Component libraries exist because building UI was expensive and inconsistency was hard to prevent. AI agents changed the cost equation. The case for building unencumbered — and what you need instead.
5 min read →◢ Anti-patternsFour KPI cards, a table, a date filter. The dashboard is the agent's lowest-effort answer to 'build me a tool for this data.' It's also wrong most of the time.
4 min read →Your agent will not become a designer.It can ship like one is in the loop.
Every plan drops credits in monthly and resets at the end of the cycle. Bonus credits roll over and never expire — even after cancellation. We don't charge for errors, and you pay half on degraded calls.
Five credits drop in at signup — enough for 1 /discover + 1 /ux + 1 /visual with one credit to spare. Use them in any repo, any agent. When they run out, pick a channel below.
Building an agent that buys its own usage? agent-pay link ↗