MULTI-AGENT ORCHESTRATION FOR SELF-HOSTED AI

Make specialized agents
actually work together.

ORCA is the orchestration mesh that coordinates LLM agents across long-running operational workflows. Persistent context, structured handoffs, full observability, running on your own infrastructure.

SELF-HOSTED. VENDOR-NEUTRAL. OBSERVABLE.
THE PROBLEM

Single agents hit a ceiling. Multi-agent setups break in production.

01

Context evaporates between agents

Spawn a sub-agent and the parent loses sight of what happened. Findings disappear. Decisions get re-litigated. Tasks duplicate.

02

No structured handoff contracts

Agents return whatever prose they feel like. The orchestrator has to re-parse and re-interpret. Brittle, ad-hoc, unreproducible.

03

Cloud lock-in or DIY duct tape

Existing orchestration platforms force cloud dependencies and proprietary state stores. Self-hosters end up with shell scripts and prayer.

HOW IT WORKS

A mesh, not a chain.

ORCA treats agents as specialized workers with declared scope, explicit handoff contracts, and shared persistent context. The orchestrator routes work, never re-parses.

AGENT ROSTER

Specialists, not generalists

Each agent declares scope, capabilities, and an explicit return contract. The orchestrator dispatches based on declared fit, not prompt-engineering luck.

agent: woocommerce-ops
scope: order-flow,inventory,refunds
returns: { status, payload, evidence }
context: persistent
PERSISTENT CONTEXT MESH

State survives handoffs

Shared context layer persists across agent boundaries. Findings from one agent become inputs to the next without re-discovery cost.

context.write(agent_a, finding)
context.read(agent_b, scope)
→ replayable, observable, durable
SELF-HOSTED BY DESIGN

Your infra, your data

Runs on a Tailnet, your KVMs, your hardware. No proprietary state store. Bring your own LLM endpoints (OpenAI, Anthropic, local Ollama, anything OpenAI-compatible).

OBSERVABLE WORKFLOWS

Every decision logged

Full trace of which agent ran when, with what inputs, what context it pulled, what it returned. Replayable. Auditable. Debuggable.

USE CASES

Built for operators running real workloads.

Multi-store commerce ops

Coordinate inventory, order, support, and marketing agents across multiple WooCommerce / Shopify stores from a single mesh.

Self-hosted infra ops

Agents that audit, patch, and report across your Tailnet hosts. No cloud agent platform required.

Long-running research

Multi-agent investigations with persistent ledgers, verifier passes, and durable on-disk artifacts.

WHY WE BUILT THIS

From the operator side of the table.

ORCA started as internal tooling for running a multi-store ecommerce operation and a self-hosted infra stack. Existing agent platforms were either too cloud-dependent, too generic, or too brittle for production ops work.

We rebuilt it from first principles: specialists with declared scope, persistent context that survives handoffs, full observability, and zero proprietary lock-in. Currently in private beta with a small group of self-hosted operators.

FOUNDER
Jake Bingham
Solo operator. Building ORCA on a self-hosted Tailnet stack across three production workloads.
PRIVATE BETA

Request access

We are onboarding a small number of self-hosted operators each month. Tell us what you would orchestrate.

We read every request. Typical response within 3 business days.