Tag: genai

All the articles with the tag "genai".

Agentic Workflows Need Guardrails, Not Vibes

18 Mar, 2025

How to put real constraints around an agent that touches money or production: bounded tools, approval gates on irreversible actions, dry-run modes, spend limits, and a tool-call audit trail you can actually read.
Agents Are Coming. Most Demos Are Lying.

15 Oct, 2024

A skeptical look at agent reliability in late 2024, where the impressive demos quietly fall apart in production, and the narrow places agents already pull their weight.
Getting JSON Out of LLMs Without Crying

20 Aug, 2024

Function calling and JSON mode get you syntactically valid JSON. They do nothing about a model that fills the right shape with confident nonsense. The validation-and-repair layer you still have to write.
Evals Are the New Unit Tests (And You're Not Writing Them)

13 Feb, 2024

Shipping an LLM feature with no evals is shipping with no tests, and almost everyone is doing it. A small, hand-written harness you run on every change, plus the honest limits of grading with another model.

Agentic Workflows Need Guardrails, Not Vibes