Tag: architecture
All the articles with the tag "architecture".
-
The LLM Observability Stack I Wish I'd Built Sooner
What to instrument for LLM and agent apps before the first incident: full request and tool-call tracing, token and cost per request, latency breakdown, eval scores in production, and turning real failures into eval cases.
-
Multimodal AI in the Field: Voice, Image, Form, Action
A closed-loop field inspection system that turns voice, a photo, and a half-filled form into a structured action, built for places where the network drops for hours.
-
Sovereign AI: Running GPUs On-Prem When the Cloud Isn't an Option
For regulated workloads where the data legally cannot leave a building, on-prem GPU inference is back. The build-vs-rent math, the constraints nobody prices in, and the software that makes a fixed fleet feel like a platform.
-
Building an MCP Server Fabric for Financial Operations
Instead of one large agent wired to every financial system, a fabric of small MCP servers, each wrapping one system with tightly scoped tools and an approval gate on anything that writes.