Tag: genai
All the articles with the tag "genai".
-
Agentic Workflows Need Guardrails, Not Vibes
How to put real constraints around an agent that touches money or production: bounded tools, approval gates on irreversible actions, dry-run modes, spend limits, and a tool-call audit trail you can actually read.
-
Agents Are Coming. Most Demos Are Lying.
A skeptical look at agent reliability in late 2024, where the impressive demos quietly fall apart in production, and the narrow places agents already pull their weight.
-
Getting JSON Out of LLMs Without Crying
Function calling and JSON mode get you syntactically valid JSON. They do nothing about a model that fills the right shape with confident nonsense. The validation-and-repair layer you still have to write.
-
Evals Are the New Unit Tests (And You're Not Writing Them)
Shipping an LLM feature with no evals is shipping with no tests, and almost everyone is doing it. A small, hand-written harness you run on every change, plus the honest limits of grading with another model.