BLOG

Writing about agent systems

Deep dives on building reliable tool calling: just-in-time auth, sandboxes, observability, and product decisions that make agents feel trustworthy.

EngineeringFeb 12, 202611 min read
Featured
Agent tools that scale: a practical checklist

A field guide to shipping dependable tool calling in production, from scoping and observability to rollback paths and user trust.

Read post →
SecurityFeb 02, 2026
10 min read
Just-in-time auth: reducing friction without losing control

Why deferred authorization can make agent experiences feel smoother while still preserving clear approvals, short-lived access, and policy control.

Read post →
ProductJan 22, 2026
11 min read
Designing tool interfaces your agent can’t misuse

Tool schemas shape behavior more than prompt wording does. Here is how to design interfaces that are narrow, legible, and resilient under real usage.

Read post →
EngineeringJan 10, 2026
10 min read
Observability for agents: what to log (and what not to)

A practical logging model for tracing agent behavior end to end without turning your observability stack into a liability.

Read post →
SecurityDec 18, 2025
11 min read
Sandboxed execution: why it matters for “run code” tools

If your agent can execute code, isolation is not optional. Here is what a practical sandbox needs before it is ready for real users.

Read post →
ProductDec 05, 2025
10 min read
Prompting vs product: where reliability really comes from

Prompts matter, but durable reliability comes from contracts, workflow design, and product choices users can actually feel.

Read post →
SecurityNov 22, 2025
10 min read
Building user trust in agent actions

Trust in agent products comes from previews, receipts, and reversible workflows—not from asking users to accept mysterious automation on faith.

Read post →
EngineeringNov 05, 2025
11 min read
How to evaluate an agent: metrics that actually predict success

A practical framework for measuring agent quality with metrics that correlate to user outcomes, workflow reliability, and operational cost.

Read post →
EngineeringOct 18, 2025
10 min read
From prototype to production: hardening your first agent workflow

A practical path from a promising demo to a dependable workflow, with better validation, timeouts, state handling, and user-visible recovery.

Read post →
PlatformSep 29, 2025
10 min read
Connecting 1,000+ apps: the hidden costs

At integration scale, the hard part is not OAuth. It is lifecycle management, schema drift, support load, and the long tail of provider behavior.

Read post →
SecuritySep 12, 2025
10 min read
Scoping permissions: least privilege for agent actions

A pragmatic guide to least privilege when an agent acts on a user’s behalf across tools, workflows, and shared business systems.

Read post →
PlatformAug 25, 2025
10 min read
Building a tool registry: taxonomy, ownership, and change control

A tool registry is not just a catalog. It is the governance layer that keeps a growing tool surface understandable, owned, and safe to evolve.

Read post →