The harness layer for agent products

Build your product, not the harness

Model-agnostic, fully loaded. Runtime, multi-agent orchestration, memory, tooling, policy, sandboxing, verification, and observability -- all delivered for you, so you can deliver for your customers.

Get started Read the docs

TypeScriptconst run = await harness.runs.create({
  input: { goal: "Process support tickets" },
  workspace_id: "ws_support",
});

for await (const event of harness.runs.streamEvents(run.id)) {
  console.log(`[${event.type}] ${event.data}`);
}

Built for every vertical

One harness, any agent product

The same production infrastructure powers agents across personal, team, enterprise, and small business use cases.

Personal AI assistants

Calendar, email, task, and communication management agents that handle daily workflows for individual users.

Chat & collaboration agents

Team-facing agents that integrate with project tools and communication platforms to coordinate group work.

Enterprise AI agents

HRIS, executive assistance, IT ops, and customer support agents that operate at enterprise scale with compliance controls.

Small business automation

Scheduling, invoicing, customer communication, and inventory management agents that automate routine operations.

Eight pillars

Everything your agent needs to ship

A complete platform, not a point solution. Every pillar is production-grade from day one.

Runtime

Multi-provider LLM execution. OpenAI and Anthropic at launch — not locked to one vendor.

Orchestration

Durable multi-step execution with planner-worker-judge coordination. Survives crashes and deploys.

Tooling

Extensible tool registry with structured invocation, sandboxed execution, and audit logging.

Memory

Persistent first-party memory with search, versioning, and session-scoped context that outlives any single run.

Policy

Behavioral constraints, cost ceilings, and safety guardrails enforced at the infrastructure level.

Sandboxing

Isolated execution environments with network controls, resource limits, and credential injection.

Verification

Outcome correctness checking with declarative manifests, gate evaluation, and evidence packages.

Observability

Structured event logs, real-time SSE streaming, usage metering, and cost attribution per workspace and user.

Production-grade

Built for trust at scale

Multi-tenant isolation

Every customer is isolated by a server-validated authorization boundary. Your data never leaks to other tenants.

Durable execution

Runs survive process restarts, deploys, and transient failures. Every run reaches a terminal state, guaranteed.

Structured observability

Append-only event logs, real-time SSE streaming, and cursor-based replay. Full audit trail for every agent action.

Usage attribution

Token consumption tracked per workspace and subject. Know exactly what your agents cost, broken down by team or user.

Stripe for agent execution

Ship your agent product, not your infrastructure

Provider-agnostic. Eight-pillar completeness. Purpose-built for non-coding agents. Get from sign-up to your first run in under 15 minutes.

Create your account Read the quickstart API reference