AI Engineering · Evals · 2 min read
The five layers of an LLM evaluation harness that survives a model migration
Most AI systems we replace have one eval — a notebook someone ran once. Here is the structure we use on every project to make the eval an artefact that survives upgrades to the underlying model, prompts, retrieval and product surface.
SaaS · Production · 3 min read
Inside TopPrix: a structured ingestion pipeline for weekly retailer flyers
Most deal aggregators are held together with cron jobs and goodwill. The retailer side is the real product. Here is the structured ingestion pipeline that lets TopPrix retailers update their own listings on a Sunday night without ever opening Slack.
Mobile · Process · 3 min read
Inside Roulocal: one design system across iOS, Android and web
How we shipped iOS, Android and web apps from one design-tokens package, with one product team and one Friday demo cadence, and why that decision saved the project six months.