ZCP — Zerops Control Plane

MCP server that gives an LLM full control over a Zerops project. Runs as a zcp@1 service inside the project it manages.

Integration model

User ←→ Claude Code (terminal in code-server) ←→ ZCP (MCP over STDIO) ←→ Zerops API
                                                                        ←→ sibling services (SSH/SSHFS over VXLAN)

The user opens code-server on the zcp service subdomain. Claude Code is preconfigured with ZCP as its MCP server. The user describes what they want, the LLM figures out what to do, calls ZCP tools to make it happen.

ZCP authenticates once at startup (env var or zcli token), discovers which project it's in, and exposes everything as MCP tools. The LLM sees a system prompt with the environment concept, current service classification, and available workflows — the LLM decides what to do.

What the LLM can do

Through ZCP tools, the LLM can:

Bootstrap a full stack — from "I need a Node.js app with PostgreSQL" to running services with health checks, in one conversation
Deploy code — writes files via SSHFS mount, triggers build pipeline via SSH push
Debug — read logs, check events, verify service health
Scale — adjust CPU, RAM, disk, container count
Configure — manage env vars, subdomains, shared storage connections
Monitor — discover services, check statuses

Architecture

cmd/zcp/main.go → internal/server  → MCP tools  → internal/ops      → internal/platform → Zerops API
                                                 → internal/workflow  (orchestration + routing)
                                                 → internal/knowledge (text search)
                                                 → internal/auth      (token resolution)

Package	Responsibility
`cmd/zcp`	Entrypoint, STDIO server
`internal/server`	MCP server setup, tool registration, system prompt
`internal/tools`	MCP tool handlers (15 tools)
`internal/ops`	Business logic — deploy, verify, import, scale
`internal/workflow`	Bootstrap/deploy/recipe conductors, personalized guidance, checkers, session state, router
`internal/platform`	Zerops API client, types, error codes
`internal/auth`	Token resolution (env var / zcli), project discovery
`internal/knowledge`	Text search, embedded docs + recipes, session-aware briefings
`internal/schema`	Live Zerops YAML schema fetching, caching, enum extraction, LLM formatting
`internal/content`	Embedded workflow guides (bootstrap.md, deploy.md, recipe.md, cicd.md)

Flow routing

Every conversation starts with ZCP injecting a system prompt built from four layers:

Base instructions — workflow-first rules (always start a session before writing config)
Workflow hint — active sessions from registry (resume prompts)
Environment concept — container vs local: where code lives, how mounts work, deploy = rebuild
Project summary + Router — factual state (services, statuses, available workflows)

Router

The router is a pure function that returns factual data — no recommendations, no intent matching. The LLM decides what to do:

Route(RouterInput) → []FlowOffering{Workflow, Priority, Hint}

Service classification	Primary	Secondary
Empty project (no runtime services)	bootstrap (p1)	—
All managed (all runtimes have ZCP state)	strategy-based deploy (p1)	bootstrap (p2)
Unmanaged runtimes exist (services without ZCP state)	strategy-based or debug (p1-2)	bootstrap (p2)

Strategy-based routing reads ServiceMeta.DeployStrategy persisted from prior bootstraps. Utility offerings (recipe, scale) are always appended at priority 4-5. Scale is a direct tool — no workflow needed. Stale metas (hostnames deleted from API) are filtered out automatically.

Workflow types

Immediate (stateless)

cicd — return guidance markdown, no session tracking.

Orchestrated (session-tracked)

bootstrap, deploy, and recipe — create a session with state persistence, checker-based validation, and iteration support.

Bootstrap workflow

Bootstrap is the core flow. It takes a user request ("deploy a Go API with Postgres") and guides the LLM through 5 sequential steps with hard checks and an iteration loop.

The 5 steps

┌──────────┐   ┌───────────┐   ┌──────────┐   ┌────────┐   ┌───────┐
│ DISCOVER │──▶│ PROVISION │──▶│ GENERATE │──▶│ DEPLOY │──▶│ CLOSE │
│  (fixed) │   │  (fixed)  │   │(creative)│   │(branch)│   │(fixed)│
└──────────┘   └───────────┘   └──────────┘   └────────┘   └───────┘
                                (skippable)    (skippable)   (skip.)

Step	What happens	Hard check
discover	Classify services (via `managedByZCP`/`isInfrastructure` fields), plan services, validate types against live catalog, submit plan	—
provision	Generate import.yml, create services via API, mount dev filesystems via SSHFS, discover env vars from managed services	All services exist with expected status; managed deps have env vars
generate	Write zerops.yml + app code to mounted dev filesystem using real env vars from provision	zerops.yml valid, hostname match, env var refs valid
deploy	Deploy dev and stage services, enable subdomains, verify health, iteration loop (fix → redeploy)	All runtimes RUNNING; subdomain access enabled; health checks pass
close	Administrative closure — writes ServiceMeta files, presents strategy selection	—

generate, deploy, and close are skippable — but only for managed-only projects (no runtime services). Strategy selection happens after close via action="strategy".

Step categories

fixed — deterministic, always the same sequence of tool calls
creative — LLM generates code; requires judgment and knowledge
branching — per-service iteration with retry loops

Plan and service model

The discover step produces a plan that drives all subsequent steps:

ServicePlan
  └─ Targets[]
       ├─ Runtime
       │    ├─ DevHostname      "appdev"
       │    ├─ Type             "nodejs@22"
       │    ├─ BootstrapMode    "standard" | "dev" | "simple" (empty → standard)
       │    └─ StageHostname()  → "appstage" (auto-derived for standard mode)
       └─ Dependencies[]
            ├─ Hostname       "db"
            ├─ Type           "postgresql@16"
            ├─ Mode           "NON_HA" (auto-defaulted)
            └─ Resolution     "CREATE" | "EXISTS" | "SHARED"

Standard mode (default): every runtime gets a dev+stage pair. Dev uses deployFiles: [.] for fast iteration. Stage gets real build output.

Dev mode: single dev service, no stage. For prototyping and quick iterations.

Simple mode: single service with real start command + healthCheck. Auto-starts after deploy.

Hard checks

Before a step can complete, the engine runs a StepChecker — a function that queries the Zerops API to verify the step's postconditions:

LLM calls: zerops_workflow action="complete" step="provision" attestation="..."
  │
  ├─ Engine runs checkProvision()
  │    ├─ dev runtime RUNNING?
  │    ├─ stage runtime NEW or READY_TO_DEPLOY?
  │    ├─ dependencies RUNNING?
  │    └─ managed deps have env vars?
  │
  ├─ All pass → step completes, advance to next
  └─ Any fail → return CheckResult (not error), LLM can fix and retry

This prevents the LLM from advancing past a broken step. The check result is returned in the response so the LLM knows exactly what failed.

Iteration loop

When deploy fails, the LLM iterates:

deploy → FAIL → fix code on mount → redeploy → re-verify
                                     (max 10 attempts, configurable via ZCP_MAX_ITERATIONS)

Each iteration resets generate+deploy steps and increments the counter. Escalating diagnostic guidance is delivered on each retry.

Guidance delivery

Bootstrap and deploy use different guidance models:

Bootstrap = creative workflow — injects full knowledge (runtime briefings, schema, env vars) because the agent is creating configuration from scratch.
Deploy = operational workflow — injects compact personalized guidance (15-55 lines) with knowledge pointers. Agent pulls knowledge on demand via zerops_knowledge.
On-demand knowledge = session-aware. zerops_knowledge auto-detects the active workflow mode and filters runtime guides (Dev/Prod patterns) and recipes (mode-adapted headers) accordingly. Agent can override with explicit mode parameter.

Deploy guidance is assembled from DeployState + ServiceMeta — the agent sees their actual hostnames, mode-specific workflow steps, and strategy commands. Not generic templates.

See docs/spec-guidance-philosophy.md for the full guidance delivery specification.

Recipe workflow

Recipe is a 6-step workflow that creates deployable recipe repositories — reference implementations with 6 environment tiers (AI Agent, Remote CDE, Local, Stage, Small Production, HA Production). Supports 4 recipe types: runtime hello world (go-hello-world), frontend static (react-hello-world), frontend SSR (nextjs-hello-world), and backend framework (laravel-minimal, laravel-showcase).

The 6 steps

┌──────────┐   ┌───────────┐   ┌──────────┐   ┌────────┐   ┌──────────┐   ┌───────┐
│ RESEARCH │──▶│ PROVISION │──▶│ GENERATE │──▶│ DEPLOY │──▶│ FINALIZE │──▶│ CLOSE │
│  (fixed) │   │  (fixed)  │   │(creative)│   │(branch)│   │(creative)│   │(skip.)│
└──────────┘   └───────────┘   └──────────┘   └────────┘   └──────────┘   └───────┘

Step	What happens	Hard check
research	Fill framework research fields, submit `RecipePlan` with type/slug/targets validated against live catalog	Plan validation (slug format, types, required fields, showcase extras)
provision	Create dev/stage service pairs via import.yaml, mount dev filesystem, discover env vars	— (self-attested)
generate	Write app code + zerops.yml + README with extract fragments	Fragment markers present, YAML code block, comment ratio ≥ 30%, Gotchas section, no placeholders
deploy	Deploy dev, start server, verify, generate stage entry, cross-deploy stage, verify	— (self-attested, uses iteration escalation)
finalize	Generate 13 recipe repo files (6 import.yaml + 6 env README + main README)	All files exist, valid YAML, project naming, priority/HA/scaling per env tier, comment quality
close	Write `RecipeMeta`, present publish commands	— (administrative, skippable)

Only close is skippable. Iteration resets generate + deploy + finalize while preserving research + provision.

Recipe plan model

The research step produces a RecipePlan that drives all subsequent steps:

RecipePlan
  ├─ Framework     "laravel"
  ├─ Tier          "minimal" | "showcase"
  ├─ Slug          "laravel-minimal"
  ├─ RuntimeType   "php-nginx@8.4"
  ├─ Decisions     {WebServer, BuildBase, OS, DevTooling}
  ├─ Research      {ServiceType, PackageManager, HTTPPort, BuildCommands, ...}
  └─ Targets[]     {Hostname, Type, Role, Environments[]}

Headless creation (eval)

zcp eval create --framework laravel --tier minimal           # Single recipe
zcp eval create-suite --frameworks laravel,nestjs --tier minimal  # Batch

Spawns Claude CLI headlessly against the recipe workflow. Results in .zcp/eval/results/.

Publish flow

recipe complete → zcp sync push recipes {slug} → merge PR → zcp sync cache-clear {slug} → zcp sync pull recipes {slug}

Recipe metadata persists at {stateDir}/recipes/{slug}.json.

Post-bootstrap: ServiceMeta persistence

Bootstrap writes per-service metadata at two points:

When	What
After provision	Partial meta (hostname, mode, stage pairing — no `BootstrappedAt`)
After close step	Complete meta (`BootstrappedAt` set — marks bootstrap as finished)

Strategy is set separately via action="strategy" after bootstrap (never auto-assigned).

Stored at {stateDir}/services/{hostname}.json. These metas persist across conversations — the develop workflow reads them on start for mode, strategy, and preflight validation.

Deploy mechanics

ZCP sits on the same VXLAN network as all project services. It deploys via SSH:

SSHFS mount gives filesystem access to the target container
LLM writes code + zerops.yml directly to the mount path
zerops_deploy SSHes into the target, initializes git, runs zcli push
Zerops build pipeline picks it up from there

Dev services get source-deployed (deployFiles: [.]). Stage services get proper build output. Dev uses startWithoutCode: true so the container is already running before the first deploy.

Knowledge system

Platform knowledge comes from two sources: embedded docs (compiled into the binary) and live schemas (fetched from the Zerops API at runtime).

Briefings — stack-specific rules (e.g., "Node.js must bind 0.0.0.0, use these env var patterns for PostgreSQL wiring")
Recipes — complete framework configs (Laravel, Next.js, Django, etc.) with zerops.yml + import.yml
Live schemas — zerops.yml and import.yaml JSON schemas fetched from the public API, cached 24h. Provides authoritative enum values (119 service types, 79 build bases, 97 run bases, modes, policies) and field descriptions. Injected into workflow responses per-step — bootstrap gets import.yaml at provision, zerops.yml at generate; recipe gets step-appropriate schemas. Also used for validation in import and recipe plan submission.
Text search — search across all embedded docs by title + content matching

This prevents the LLM from guessing Zerops-specific syntax. It reads the rules and live schemas, then generates config.

Knowledge sync

Recipe and guide files are gitignored — they're pulled from external sources before build. Edits are pushed back as GitHub PRs.

# Pull (external → ZCP, before build)
zcp sync pull recipes                       # All recipes from API
zcp sync pull guides                        # All guides from zeropsio/docs (GitHub API)

# Edit locally, then push (ZCP → GitHub PRs)
zcp sync push recipes bun-hello-world       # Creates PR on app repo
zcp sync push guides                        # Creates PR on zeropsio/docs

# After PR is merged, refresh API cache and re-pull
zcp sync cache-clear bun-hello-world        # Invalidate Strapi cache
zcp sync pull recipes bun-hello-world       # Pull merged changes

Push decomposes the monolithic recipe .md into fragments (knowledge-base, integration-guide, zerops.yaml) and injects them into the correct marker regions in the app repo README. No local clones needed — everything goes through gh CLI and the GitHub API.

Config: .sync.yaml. Strapi token for cache-clear: .env (see .env.example).

Session persistence

All workflow state persists locally at .zcp/state/:

File	Purpose
`sessions/{id}.json`	Session state: bootstrap/deploy/recipe steps, plan, env vars, iteration
`services/{hostname}.json`	Per-service metadata (mode, strategy, stage pairing)
`recipes/{slug}.json`	Recipe metadata (slug, framework, tier, runtimeType)
`registry.json`	Active session tracking with PID-based ownership

Sessions survive process restarts. The MCP system prompt shows the active session state so the LLM can resume where it left off. Dead sessions (stale PID) can be taken over via zerops_workflow action="resume".

Development

go test ./... -count=1 -short    # All tests, fast
go test ./... -count=1 -race     # All tests with race detection
go build -o bin/zcp ./cmd/zcp    # Build
make lint-fast                   # Lint (~3s)

E2E tests need a real Zerops project: go test ./e2e/ -tags e2e (requires ZCP_API_KEY or zcli login).

Release

make release        # Minor bump (v2.62.0 → v2.63.0)
make release-patch  # Patch bump (v2.62.0 → v2.62.1)

Both run tests before tagging. If tests fail, the release is aborted. Requires a clean worktree (no uncommitted changes to tracked files; untracked files are ignored).

Name		Name	Last commit message	Last commit date
Latest commit History 553 Commits
.claude		.claude
.githooks		.githooks
.github/workflows		.github/workflows
cmd/zcp		cmd/zcp
docs		docs
e2e		e2e
eval		eval
integration		integration
internal		internal
tools		tools
.env.example		.env.example
.gitignore		.gitignore
.golangci.yaml		.golangci.yaml
.sync.yaml		.sync.yaml
CLAUDE.md		CLAUDE.md
KNOWLEDGE_DELIVERY_MAP.md		KNOWLEDGE_DELIVERY_MAP.md
Makefile		Makefile
README.md		README.md
go.mod		go.mod
go.sum		go.sum
install.sh		install.sh
integration.test		integration.test

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ZCP — Zerops Control Plane

Integration model

What the LLM can do

Architecture

Flow routing

Router

Workflow types

Immediate (stateless)

Orchestrated (session-tracked)

Bootstrap workflow

The 5 steps

Step categories

Plan and service model

Hard checks

Iteration loop

Guidance delivery

Recipe workflow

The 6 steps

Recipe plan model

Headless creation (eval)

Publish flow

Post-bootstrap: ServiceMeta persistence

Deploy mechanics

Knowledge system

Knowledge sync

Session persistence

Development

Release

About

Uh oh!

Releases 257

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ZCP — Zerops Control Plane

Integration model

What the LLM can do

Architecture

Flow routing

Router

Workflow types

Immediate (stateless)

Orchestrated (session-tracked)

Bootstrap workflow

The 5 steps

Step categories

Plan and service model

Hard checks

Iteration loop

Guidance delivery

Recipe workflow

The 6 steps

Recipe plan model

Headless creation (eval)

Publish flow

Post-bootstrap: ServiceMeta persistence

Deploy mechanics

Knowledge system

Knowledge sync

Session persistence

Development

Release

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 257

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages