Posts
All the articles I've posted.
-
Multimodal AI in the Field: Voice, Image, Form, Action
A closed-loop field inspection system that turns voice, a photo, and a half-filled form into a structured action, built for places where the network drops for hours.
-
RAG Over Enterprise Records: The Boring Parts That Matter
Enterprise RAG is trustworthy because of the unglamorous parts: per-user permissions enforced at retrieval, freshness, lineage, and handling records that change. Retrieval is an access-control problem wearing a search costume.
-
Sovereign AI: Running GPUs On-Prem When the Cloud Isn't an Option
For regulated workloads where the data legally cannot leave a building, on-prem GPU inference is back. The build-vs-rent math, the constraints nobody prices in, and the software that makes a fixed fleet feel like a platform.
-
Small Fine-Tuned Models Are Beating Frontier on My Workloads
On narrow, high-volume tasks a fine-tuned small model matches frontier quality at a fraction of the cost and latency. Here is the pipeline, the eval bar, and the maintenance bill nobody quotes you.