Conjure Evals
An eval harness for production agents.
The pattern we run on every agent we ship. Golden sets, regression suites, traceable replays. Drop it next to your prompt, get coverage from day one.
Coming soon to public repos
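To make "golden sets, regression suites, traceable replays" concrete, here is a minimal sketch of the general pattern in TypeScript. It is not the Conjure Evals API, which is not yet public. Every name in it (GoldenCase, runGoldenSet, regressions, the stand-in agents) is hypothetical. It also assumes a synchronous agent and exact-match scoring to stay short. A real agent call would be async and would likely need a looser comparison.

```typescript
// Hypothetical types for illustration; none of these names come from Conjure Evals.
type GoldenCase = { id: string; input: string; expected: string };
type Agent = (input: string) => string; // assumption: synchronous for brevity
type CaseResult = GoldenCase & { actual: string; pass: boolean };
type EvalReport = { passed: number; failed: number; results: CaseResult[] };

// Run every golden case through the agent and keep the input, expected
// output, and actual output together so a failure can be replayed later.
function runGoldenSet(agent: Agent, cases: GoldenCase[]): EvalReport {
  const results: CaseResult[] = cases.map((c) => {
    const actual = agent(c.input);
    // Assumption: exact match after trimming whitespace.
    return { ...c, actual, pass: actual.trim() === c.expected.trim() };
  });
  const passed = results.filter((r) => r.pass).length;
  return { passed, failed: results.length - passed, results };
}

// Regression check: ids of cases that passed in the baseline run but fail now.
function regressions(baseline: EvalReport, current: EvalReport): string[] {
  const passedBefore = new Set(
    baseline.results.filter((r) => r.pass).map((r) => r.id),
  );
  return current.results
    .filter((r) => !r.pass && passedBefore.has(r.id))
    .map((r) => r.id);
}

// Stand-in agents: two versions of the same "prompt" under test.
const goldenSet: GoldenCase[] = [
  { id: "greet", input: "hello", expected: "HELLO" },
  { id: "pad", input: " hi ", expected: "HI" },
];
const agentV1: Agent = (s) => s.toUpperCase();
const agentV2: Agent = (s) => (s === "hello" ? "Hi there" : s.toUpperCase());

const baseline = runGoldenSet(agentV1, goldenSet);
const current = runGoldenSet(agentV2, goldenSet);
const regressed = regressions(baseline, current);
console.log(regressed); // ids that regressed between the two agent versions
```

Keeping the golden set as plain data beside the prompt is what lets the same cases serve as both the day-one coverage check and the regression baseline for later prompt changes.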
Client work is custom. Our products are the patterns we kept reaching for — evals, agent scaffolds, design-system primitives — released publicly because we'd rather sharpen them in the open than rebuild them on every engagement.
Every studio has a graveyard of internal tools. We release ours, because a tool only gets good once someone else puts it to work on their own project.
Each of these started as a problem we kept meeting on client engagements. By the time we had built the fix three times, we cared enough to make it durable, document it honestly, and ship it. We use these on our own work too, so they are not abandonware.
A craft-first library of React component primitives.
The components we build on top of for every product surface. Editorial-grade typography, motion-aware primitives, and the tokens that make a marketing site feel hand-made.
Coming soon to public repos
Conjuring's scaffolding for new 0→1 AI products.
TypeScript + TanStack + edge inference + observability, wired together the way we'd start a project at the studio. Opinionated, small, easy to delete.
Coming soon
The public products are the patterns. The client engagements are where we apply them — and where the next public release usually starts.