Writing
Short posts on LLM systems, evals, and AgentOps.
- AgentOps — reliability, evals, tracing — Tool scopes, permissions, audit trails, regression gates, and tracing for agentic workflows in production.
- On building measurable LLM systems — Why reliability, cost, and latency matter in production LLM systems — and how to align architecture and evals with product goals.