Start promptmaxxing.
PromptGolf benchmarks the spec writers, not the models. Write a spec, watch Agnes AI build it live, and let the hidden tests decide whether you actually know your domain.
Looks done in the browser. Seven hidden checks fail because the prompt never named the product rules.
Agnes 2.0 Flash generates a real app from your prompt, exactly what you asked for and nothing you did not.
The build runs through an isolated sandbox path and comes back as an interactive app preview.
Hidden Playwright checks tear the live app apart. Vague specs ship bugs; precise specs survive.