sandy — an isolated sibling for your coding agents

When you're giving AI agents real autonomy to write code, run tests, and modify systems, the environment needs OS-enforced boundaries, not permission prompts. Sandy is the tool we built to make that work.

Install it, run it. That's it.

curl -fsSL https://raw.githubusercontent.com/rappdw/sandy/main/install.sh | bash
cd /path/to/your/project
sandy

Sandy runs Claude Code, Gemini CLI, OpenAI Codex CLI, OpenCode (provider-agnostic), or any combination of them side-by-side in a Docker container with agent permission checks disabled — so the agent works without interruption while your system stays protected:

Filesystem: Read/write limited to the mounted working directory only
Network: Public internet only — all LAN/private networks blocked
Resources: Capped CPU and memory (auto-detected from host)
Security: Non-root user, read-only root filesystem, no privilege escalation
Protected files: Shell configs, git hooks, and Claude commands mounted read-only
Per-project sandboxes: Isolated ~/.claude, credentials, and package storage per project
Dev environments: Python, Node.js, Go, Rust, and C/C++ with persistent package installs
Terminal notifications: OSC passthrough enabled — works with cmux, iTerm2, and other notification-aware terminals

No ANTHROPIC_API_KEY required if using a Claude paid account (Pro/Max) — credentials are seeded from the host on first run.

Why Sandy — Virtual Environments for Claude Code

Claude Code stores plugins, memory, hooks, credentials, and session history in a single global ~/.claude/ directory — shared across every project on your machine. This means a plugin installed for one project is active in all of them. Credentials are shared. Memory bleeds between contexts.

Sandy fixes this with per-project sandboxes — the same idea as Python virtual environments, but for your entire Claude Code environment:

~/.sandy/sandboxes/
├── webapp-a1b2c3d4/         # project A — one subdir per agent
│   ├── claude/              # mounted at ~/.claude in the container
│   │   ├── plugins/         # plugins installed here stay here
│   │   ├── memory/          # auto-memory is project-scoped
│   │   └── settings.json    # settings don't leak across projects
│   ├── gemini/              # mounted at ~/.gemini (only if SANDY_AGENT uses gemini)
│   ├── codex/               # mounted at ~/.codex  (only if SANDY_AGENT uses codex)
│   └── opencode/            # mounted at ~/.config/opencode + ~/.local/share/opencode
│       │                    # (only if SANDY_AGENT uses opencode)
│       ├── config/
│       └── share/
└── ml-pipeline-e5f6g7h8/    # project B is completely independent
    └── claude/
        └── ...

Each project sandbox also gets isolated package storage — pip, npm, go, cargo, and uv installs persist across sessions but never leak between projects. Credentials are read fresh from the host each launch and mounted ephemerally — never persisted to the sandbox.

This means you can run multiple sandy sessions across different projects simultaneously, each with its own plugins, memory, context, and installed tools — just like activating different Python venvs.

Isolation layers

Sandy wraps the agent in seven layers of OS-enforced isolation. For the assumed adversary and the honest residual risks, see docs/security/THREAT_MODEL.md.

Network egress. The agent runs on a Docker --internal network with no route off it except a TCP-only proxy sidecar. Permissive (default) blocks LAN/host/cloud-metadata and allows the public internet; strict allows only an allowlist (model providers, GitHub, package registries) plus SANDY_ALLOW_HOSTS. Non-TCP traffic (UDP/QUIC/ICMP/IPv6) is dropped by the topology itself. See How Network Isolation Works.
Filesystem. Read-only root filesystem; ephemeral tmpfs for /tmp and the home directory (lost on exit); only the working directory is bind-mounted from the host. ~25 sensitive paths inside it (shell configs, .git/config, .git/hooks/, .github/workflows/, .vscode/, .devcontainer/, …) are mounted read-only.
Credentials. Each project gets its own credential sandbox. The session's .credentials.json is mounted ephemerally and never persisted; codex/gemini OAuth files are mounted read-only. One project's credentials never leak to another.
Process & privilege. Non-root user, --cap-drop ALL, --security-opt no-new-privileges, Docker's default seccomp + AppArmor profiles, and no Docker socket — so the agent can't escalate or reach the daemon.
Resources. Capped CPU, memory, PID count, and tmpfs sizes (auto-detected from the host).
Config trust-tier. A committed .sandy/config is parsed as KEY=VALUE, never sourced (no shell execution), and can only set non-privileged keys; isolation toggles and credential variables require an explicit per-workspace approval prompt.
Per-instance. Per-session Docker networks, a per-workspace mutex (one sandy per workspace), and per-project sandboxes keep concurrent sessions and different projects fully independent.

Prerequisites

Sandy works with any Docker-compatible runtime:

Not sure whether your machine is ready? Run the doctor script — it checks the host for everything sandy needs (Docker daemon reachable, git, curl, gh CLI for default token auth, Claude credentials, $HOME/.local/bin on PATH) and prints copy-pasteable install commands for anything missing. It never installs or modifies anything itself.

curl -fsSL https://raw.githubusercontent.com/rappdw/sandy/main/doctor.sh | bash

Installation

curl -fsSL https://raw.githubusercontent.com/rappdw/sandy/main/install.sh | bash

Or install locally from a clone:

LOCAL_INSTALL=./sandy ./install.sh

Usage

cd /path/to/your/project
sandy                                              # interactive session
sandy -p "Review the code in src/ for security issues"  # one-shot prompt
sandy --remote                                     # remote-control server mode

Configuration

Per-project config (`.sandy/config`)

Sandy loads config from two levels, with project overriding user:

User-level: ~/.sandy/config and ~/.sandy/.secrets — apply to all projects on this machine
Per-project: .sandy/config and .sandy/.secrets — override user-level for this project

# ~/.sandy/config — user-level defaults for all projects
CLAUDE_CODE_OAUTH_TOKEN=sk-ant-...       # long-lived token (better in ~/.sandy/.secrets)

# .sandy/config — per-project overrides
SANDY_SSH=agent                          # use SSH agent instead of gh token
SANDY_MODEL=claude-sonnet-4-5-20250929   # override default model

Only allowlisted KEY=VALUE lines are parsed (not sourced as a shell script). Use .secrets files for credentials — they should not be committed. See the environment variables table below for supported keys.

Environment variables

Variable	Default	Description
`SANDY_AGENT`	`claude`	AI agent(s) to run. Single: `claude`, `gemini`, `codex`, `opencode`. Multi (comma-separated, 2–4 panes in tmux): e.g. `claude,gemini` or `claude,gemini,codex,opencode`. Alias: `all` = `claude,gemini,codex,opencode`
`SANDY_MODEL`	`claude-opus-4-8`	Claude model to use (applies whenever `claude` is in `SANDY_AGENT`)
`GEMINI_API_KEY`	(unset)	Google API key for Gemini CLI. Put in `.sandy/.secrets`
`GEMINI_MODEL`	(unset)	Gemini model override
`SANDY_GEMINI_AUTH`	`auto`	Force Gemini auth path: `auto`, `api_key`, `oauth`, or `adc`
`SANDY_GEMINI_EXTENSIONS`	(unset)	Comma-separated Gemini extension URLs/paths to install on first launch
`OPENAI_API_KEY`	(unset)	OpenAI API key for Codex CLI. Put in `.sandy/.secrets` — sandy materializes it as an ephemeral read-only `auth.json` for codex (codex 0.139+ no longer reads the env var for auth)
`CODEX_MODEL`	(unset)	Codex model override
`SANDY_CODEX_AUTH`	`auto`	Force Codex auth path: `auto`, `api_key`, or `oauth`
`OPENCODE_MODEL`	(unset)	OpenCode model override (`provider/model` format, e.g. `anthropic/claude-sonnet-4`)
`SANDY_OPENCODE_AUTH`	`auto`	Force OpenCode auth path: `auto`, `api_key`, or `oauth`
`SANDY_LOCAL_LLM_HOST`	(unset)	`host:port` to allow through LAN isolation, typically for a local LLM (e.g. `127.0.0.1:11434` for Ollama). Inserts a single iptables ACCEPT rule and (Linux) maps `host.docker.internal` to the bridge gateway
`GOOGLE_CLOUD_PROJECT`	(unset)	GCP project ID (Vertex AI)
`GOOGLE_CLOUD_LOCATION`	(unset)	GCP region (Vertex AI)
`GOOGLE_GENAI_USE_VERTEXAI`	(unset)	Set `true` to route Gemini through Vertex AI
`SANDY_CHANNEL_TARGET_PANE`	`0`	tmux pane target for Telegram relay in multi-agent mode. `0` = first agent in `SANDY_AGENT`, `1` = second, `2` = third
`SANDY_SSH`	`token`	Git auth method: `token` (gh CLI + HTTPS) or `agent` (SSH agent forwarding)
`SANDY_SKIP_PERMISSIONS`	`true`	Set to `false` to keep Claude Code's permission system active
`SANDY_HOME`	`~/.sandy`	Sandy config/build/sandbox directory
`SANDY_CPUS`	auto-detected	CPU limit for the container
`SANDY_MEM`	auto-detected	Memory limit for the container
`SANDY_ALLOW_LAN_HOSTS`	(unset)	Comma-separated IPs/CIDRs to allow through LAN isolation (e.g. `192.168.1.50,10.0.0.0/24`)
`SANDY_EGRESS_PROXY`	`0`	Cross-platform egress isolation. `0`=off, `1`=permissive (block LAN/host, allow internet), `2`=strict (allowlist only). The only network isolation that works on macOS — see "How Network Isolation Works"
`SANDY_ALLOW_HOSTS`	(unset)	Comma-separated extra egress-proxy allowlist entries (`host`, `*.suffix`, or `host:port`), appended to the built-in default set. Privileged tier
`SANDY_ALLOW_NO_ISOLATION`	(unset)	Set to `1` to allow launch without iptables rules (Linux)
`CLAUDE_CODE_OAUTH_TOKEN`	(unset)	Long-lived OAuth token from `claude setup-token`. Put in `.sandy/.secrets`. Recommended for headless servers
`ANTHROPIC_API_KEY`	(unset)	API key — not needed with Claude Pro/Max (OAuth)
`CLAUDE_CODE_MAX_OUTPUT_TOKENS`	`128000`	Max output tokens per response (Claude Code default is 32K)
`SANDY_SKILL_PACKS`	(unset)	Comma-separated skill packs to install (e.g. `gstack`). Built as a cached Docker layer
`SANDY_GPU`	(disabled)	GPU passthrough: `all` for all GPUs, or device IDs like `0` or `0,1`. Requires NVIDIA Container Toolkit
`SANDY_SCREENSHOT_DIR`	(unset)	Host directory of screenshots to mount into the container (read-only at `/home/claude/screenshots`). When set, sandy generates a `/ss` slash command for Claude/Gemini and a screenshot skill for Codex — type `/ss huh` to have the agent describe your latest screenshot, `/ss 3 explain` for the last three, etc. See "Screenshot skill" below
`SANDY_EXTRA_ENV`	(unset)	Comma-separated env-var names to forward into the container (e.g. `HA_TOKEN,LINEAR_API_KEY`). Values come from env (wins) or any of the four config files (workspace overrides host). Lets you wire up tokens for user-installed MCP servers without patching sandy. Privileged tier; workspace usage requires approval
`SANDY_CHANNELS`	(unset)	Channel plugins to enable (e.g. `plugin:telegram@claude-plugins-official`)
`TELEGRAM_BOT_TOKEN`	(unset)	Telegram bot token (from BotFather). Put in `.sandy/.secrets`, not `.sandy/config`
`TELEGRAM_ALLOWED_SENDERS`	(unset)	Comma-separated Telegram user IDs for allowlist (e.g. `123456,789012`)
`DISCORD_BOT_TOKEN`	(unset)	Discord bot token. Put in `.sandy/.secrets`, not `.sandy/config`
`DISCORD_ALLOWED_SENDERS`	(unset)	Comma-separated Discord user IDs for allowlist
`CLAUDE_CODE_EXPERIMENTAL_AGENT_TEAMS`	(unset)	Set to `1` to enable experimental agent teams

Flags

Flag	Description
`--new`	Start a fresh session (default: resume last)
`--resume`	Open session picker (forwarded to claude)
`--remote`	Start in remote-control server mode (connect from browser/phone)
`--rebuild`	Force rebuild of the Docker image
`--build-only`	Build images and exit (for CI)
`--upgrade`	Update sandy to the latest version from GitHub
`--agent <list>`	Agent(s) to launch — overrides `SANDY_AGENT` and `.sandy/config` (e.g. `--agent claude,gemini`)
`-p "prompt"`	One-shot prompt (no interactive session)

All other arguments are forwarded to claude.

Headless / remote servers

Recommended: Use a long-lived token (valid 1 year) to avoid OAuth expiry entirely:

On a machine with a browser, run: claude setup-token
Copy the token and add to ~/.sandy/.secrets on the headless server (applies to all projects):
```
CLAUDE_CODE_OAUTH_TOKEN=your_token_here
```
Run sandy — no browser needed, no /login needed

Fallback (without a long-lived token): Sandy skips the browser-based OAuth flow on Linux and directs you to use /login inside the session.

The /login URL is long and Claude Code wraps it with indentation, which breaks copy-paste. To work around this on macOS:

Select the wrapped URL text in your terminal and copy it (Cmd+C)
Clean and open it with: pbpaste | tr -d ' \n\t' | xargs open

To automate this as a global keyboard shortcut (e.g., Ctrl+Cmd+U):

Open Automator > File > New > Quick Action
Set "Workflow receives" to no input in any application
Add a Run Shell Script action with: pbpaste | tr -d ' \n\t' | xargs open
Save as "Open Cleaned URL"
Assign a shortcut in System Settings > Keyboard > Keyboard Shortcuts > Services

Running Gemini CLI (`SANDY_AGENT=gemini`)

Sandy supports four Gemini auth paths, probed automatically unless SANDY_GEMINI_AUTH pins a specific one:

Path	How to set up	When to use
API key	`GEMINI_API_KEY=...` in `.sandy/.secrets`	Simplest; works on headless servers
OAuth	Run `gemini auth` on the host once — sandy copies `~/.gemini/tokens.json` into the container ephemerally on each launch	Free-tier Gemini with browser login
ADC	`gcloud auth application-default login` on the host	Google Cloud / Vertex AI workflows
Vertex AI	ADC + `GOOGLE_GENAI_USE_VERTEXAI=true`, `GOOGLE_CLOUD_PROJECT=...`, `GOOGLE_CLOUD_LOCATION=...`	Enterprise / Vertex billing

gemini auth must be run on the host because the container is headless and cannot open a browser. --remote is not supported in any multi-agent combo — only SANDY_AGENT=claude (single) works with --remote.

Running Codex CLI (`SANDY_AGENT=codex`)

Sandy supports two Codex auth paths, probed automatically unless SANDY_CODEX_AUTH pins a specific one:

Path	How to set up	When to use
API key	`OPENAI_API_KEY=sk-...` in `.sandy/.secrets` — sandy writes it into an ephemeral `auth.json` (what `codex login --with-api-key` would create) and mounts it read-only; codex 0.139+ ignores the bare env var for auth	Simplest; works on headless servers
OAuth (ChatGPT)	Run `codex login` on the host once — sandy copies `~/.codex/auth.json` into the container as a read-only mount on each launch	ChatGPT Plus/Team/Enterprise accounts

Because the OAuth mount is read-only, in-session token refresh will fail — if your token expires, run codex login inside the sandy session (or back on the host for next launch). This is intentional: a writable mount would leak refreshed tokens back to the host and open a stale-token race on session exit.

Sandy forces sandbox_mode = "danger-full-access" in the container's ~/.codex/config.toml and passes --sandbox danger-full-access on the CLI (belt-and-suspenders). Codex's Landlock sandbox does not nest cleanly inside Docker — sandy provides the outer isolation. On first launch sandy also seeds a full [notice] block in config.toml to suppress all first-run prompts and appends a trusted-project entry for your workspace.

Headless mode (-p / --print / --prompt "...") translates to codex exec --skip-git-repo-check — the prompt is passed as a positional arg, not a flag, and the trust/git-repo gate is skipped (codex 0.139+ otherwise refuses to run headless outside a git repo; sandy already provides the isolation). codex exec only returns exit codes 0 (success) or 1 (failure), with no nuanced exit codes. --continue / -c is silently dropped (codex has codex resume, but no headless continuation flag).

Not supported with codex: --remote, SANDY_SKILL_PACKS, SANDY_CHANNELS=discord. Telegram channels work via the host-side tmux relay.

Running OpenCode (`SANDY_AGENT=opencode`)

OpenCode (sst/opencode) is a provider-agnostic agent — sandy doesn't bind it to a specific LLM vendor. Set whichever provider key you have (in .sandy/.secrets or env), tweak ~/.config/opencode/opencode.json, and OpenCode picks the matching provider:

Provider	Key	Notes
Anthropic	`ANTHROPIC_API_KEY=sk-ant-...`	OpenCode reads natively
OpenAI	`OPENAI_API_KEY=sk-...`	Shared with Codex agent if both active
Google	`GEMINI_API_KEY=...`	Shared with Gemini agent if both active
OAuth	`opencode auth login` on host → `~/.local/share/opencode/auth.json`	Mounted read-only into the container

Sandy seeds ~/.config/opencode/opencode.json from the host's copy on first launch — point it at any provider OpenCode supports, including a local LLM.

Local LLM passthrough. Pair OpenCode with SANDY_LOCAL_LLM_HOST=<ip>:<port> (e.g. 127.0.0.1:11434 for Ollama, localhost:8000 for vLLM, etc.) to allow the container to reach a local LLM running on the Docker host. Sandy inserts a single narrow iptables ACCEPT for that exact host:port (Linux), maps host.docker.internal to the bridge gateway (Linux Docker doesn't auto-resolve it), and rejects world-open IPs (0.0.0.0) and bare IPs without ports. Edit ~/.config/opencode/opencode.json to set the provider's baseURL to http://host.docker.internal:<port>/v1. The rule is removed on session exit. The rest of LAN remains blocked.

Headless mode (-p / --print / --prompt "...") translates to opencode run — the prompt is positional. --continue / -c is silently dropped (no headless resume flag yet).

Not supported with opencode in v0: --remote, SANDY_SKILL_PACKS, SANDY_CHANNELS=discord. Synthkit is installed in the image but skill auto-discovery for opencode is deferred until upstream support stabilizes.

Multi-agent mode

Sandy runs any combination of Claude, Gemini, Codex, and OpenCode side-by-side in a single tmux session, selected via comma-separated values in SANDY_AGENT:

SANDY_AGENT=claude,gemini              # two panes
SANDY_AGENT=claude,codex               # two panes
SANDY_AGENT=claude,opencode            # two panes
SANDY_AGENT=claude,gemini,codex,opencode  # four panes
SANDY_AGENT=all                        # alias for claude,gemini,codex,opencode

Panes appear in the order listed. Each agent has its own config dir(s): ~/.claude, ~/.gemini, ~/.codex, and ~/.config/opencode + ~/.local/share/opencode. All panes share the same workspace mount. Exiting one pane leaves the others running. Single-agent modes use their own Docker images (sandy-claude-code, sandy-gemini-cli, sandy-codex, sandy-opencode); any multi-agent combo uses the sandy-full image, which bundles all four CLIs.

Feature support in multi-agent mode: skill packs apply to the Claude pane only. Telegram channels use the host-side relay and are routed to pane 0 by default — override with SANDY_CHANNEL_TARGET_PANE=0|1|2|3. --remote is not supported in any multi-agent combo. SANDY_LOCAL_LLM_HOST works in any combo that includes opencode (or any agent that wants to reach a host-side service over the gateway).

Screenshot skill (`/ss`)

Set SANDY_SCREENSHOT_DIR=<host-path> in ~/.sandy/config (or per-workspace .sandy/config) to give the agent eyes: sandy mounts the folder read-only into the container and generates a per-agent /ss skill that finds the newest screenshots and feeds them to the model.

# in ~/.sandy/config
SANDY_SCREENSHOT_DIR=/Users/you/Desktop/organized-screenshots

Usage inside a session:

/ss              # newest screenshot, no instruction → agent describes it
/ss huh          # newest screenshot + "huh" → agent explains
/ss 3 explain    # 3 newest screenshots, agent explains the set
/ss fix          # newest is an error message → agent diagnoses + fixes
/ss do this      # newest is a technique you saw → agent applies it
/ss make infographic  # newest N → agent synthesizes one

Agent	How to invoke
Claude	`/ss [N] [action]` slash command
Gemini	`/ss [N] [action]` slash command
Codex	"look at my recent screenshot" — codex matches by description
OpenCode	manual: `opencode "explain $(sandy-ss-paths 1)"` (no slash-command surface in v0)

No default — leaving SANDY_SCREENSHOT_DIR unset disables the feature entirely. macOS users typically point at ~/Desktop (the macOS default for Cmd+Shift+4 captures) or a custom folder configured via defaults write com.apple.screencapture location <path>. The mount is read-only by design; the agent should never modify your screenshot folder.

How Network Isolation Works

Network egress is one of sandy's isolation layers. For the full picture — the assumed adversary, every layer, and the honest residual risks — see THREAT_MODEL.md. Empirical bypass attempts are in ISOLATION_STRESS.md.

Egress proxy (`SANDY_EGRESS_PROXY`) — cross-platform isolation

The egress proxy is the recommended isolation mechanism and the only one that works on macOS. It routes the agent through a small proxy sidecar on a Docker --internal network (no route off the bridge except through the proxy), so it behaves identically on macOS and Linux. It's a tri-state:

Value	Mode	Behavior
`0` (default)	off	Linux iptables only; macOS has no network isolation (see below).
`1`	permissive	Blocks private/LAN/host/cloud-metadata destinations, allows all internet. Closes the macOS LAN gap with ~zero friction — recommended.
`2`	strict	Allows only a built-in default allowlist (model providers, GitHub incl. SSH, npm/PyPI/crates/Go/Debian) plus `SANDY_ALLOW_HOSTS`. Fails closed on everything else.

SANDY_EGRESS_PROXY=1   # in ~/.sandy/config or a workspace .sandy/config

Add extra reachable hosts with SANDY_ALLOW_HOSTS (privileged; comma-separated host, *.suffix, or host:port). git-over-SSH (SANDY_SSH=agent) is tunneled through the proxy automatically on both platforms; on macOS, host-agent key signing is unavailable under the proxy (use SANDY_SSH=token for a fully-supported HTTPS path). A local LLM (SANDY_LOCAL_LLM_HOST) is forwarded through the proxy rather than an iptables hole. See CLAUDE.md → "Egress Proxy" for the full topology.

macOS (Docker Desktop) — not isolated when the proxy is off

Warning: if you turn the proxy off with SANDY_EGRESS_PROXY=0 (the default is 1 — permissive), Docker Desktop does not provide LAN isolation. The container can reach host.docker.internal (→ your Mac's gateway), your host's localhost services, and any device on your physical LAN — your home router at 192.168.1.1, a NAS, a printer, an internal dashboard, your SSH daemon. A stress test in April 2026 opened a live TCP connection from inside the container to the host's SSHD and read its banner (see ISOLATION_STRESS.md, finding F2).

As defense-in-depth, sandy nullifies the Docker Desktop magic hostnames (gateway.docker.internal, metadata.google.internal, and — when SANDY_SSH != agent — host.docker.internal) via --add-host, and prints a launch-time warning banner on macOS. But raw-IP access is unaffected, and the banner is a warning, not a fix.

Fix: set SANDY_EGRESS_PROXY=1 (or =2), which applies real isolation on macOS. Otherwise treat proxy-off macOS sandy as "process and filesystem isolation only; no network isolation."

Linux

Sandy automatically inserts iptables rules into the DOCKER-USER chain that block all RFC 1918 traffic from the container's bridge interface:

Range	What it blocks
`10.0.0.0/8`	Home/office LAN, VPNs
`172.16.0.0/12`	Docker internals, some LANs
`192.168.0.0/16`	Home/office LAN
`169.254.0.0/16`	Link-local
`100.64.0.0/10`	CGNAT, Tailscale

Rules are automatically cleaned up when sandy exits. Stale rules from a previous unclean exit are cleaned up on startup. If iptables is not accessible, sandy warns that LAN isolation is not active.

Verifying Isolation

From inside the container, you can verify:

# Should FAIL — LAN is blocked
curl -m 5 http://192.168.1.1

# Should SUCCEED — public internet works
curl -m 5 https://api.anthropic.com

What's in the Box

Sandy's base image is a self-contained development environment. Everything below is pre-installed and ready to use — no setup required.

Language toolchains

Toolchain	Version	Notes
Python 3	Debian bookworm default	System Python; use `uv` for other versions
Node.js	22 LTS	Via NodeSource
Go	1.24
Rust	stable	Via rustup
C/C++	build-essential	gcc, g++, make, libc-dev

System tools

Tool	Purpose
`git`	Version control
`git-lfs`	Large file storage (auto-detected, auto-configured)
`gh`	GitHub CLI — PRs, issues, releases
`jq`	JSON processor
`ripgrep` (`rg`)	Fast code search
`curl`	HTTP client
`cmake`	Build system
`pkg-config`	Build helper
`socat`	Socket relay (SSH agent forwarding)
`tmux`	Terminal multiplexer (sandy's session wrapper)
`less`	Pager
`openssh-client`	SSH client

Python tools

Tool	Purpose
`uv`	Fast Python version & package manager
`pip` / `pip3`	Package installer (auto `--user` outside venvs)
`python3-venv`	Virtual environment support

Libraries

Library	Purpose
`libcairo2`	2D graphics / PDF rendering
`libpango1.0-0`	Text layout / PDF rendering
`libgdk-pixbuf-2.0-0`	Image loading / PDF rendering
`libssl-dev`	TLS development headers
`ncurses-term`	Terminal definitions

Plugin marketplace

Two plugin marketplaces are pre-configured in every sandbox: claude-plugins-official and sandy-plugins. Browse and install plugins with:

/plugin                                    # browse available plugins
/plugin install <name>@<marketplace>       # install a plugin
/plugin update                             # update installed plugins

Note — synthkit is no longer a plugin. Earlier versions of sandy referenced /plugin install synthkit@thinkkit. Synthkit is now a regular CLI tool (uv tool install synthkit, or uvx synthkit ...) and is pre-installed in the base image — the /md2pdf, /md2doc, /md2html, /md2email slash commands are generated by sandy at session start (no /plugin install needed).

Known issue — slash command autocomplete: Plugin skills are lazy-loaded by Claude Code and won't appear in slash command autocomplete until invoked once — either by typing the request naturally or via the fully qualified name (e.g. <plugin>:<skill>). After first invocation, they appear in autocomplete for the rest of the session. This is a known Claude Code bug — the slash command resolver only indexes the legacy commands/ system and ignores skills/ entries (despite commands being merged into skills). Sandy's own builtins (/md2pdf, /ss, etc.) live under commands/ so they're not affected.

Skill packs

Skill packs are optional Docker image layers that bake curated skill collections into the container. They're not included by default — enable them per-project and they're built once, cached, and instantly available on subsequent launches.

# .sandy/config
SANDY_SKILL_PACKS=gstack

Pack	Description	Source
`gstack`	28 Claude Code skills (QA, review, ship, browse, etc.) + headless Chromium browser engine	garrytan/gstack

First launch with a new skill pack takes a few minutes (downloading, compiling, installing Chromium). After that, launches are instant — everything is cached in a Docker image layer. Sandy auto-checks for newer skill pack releases on each launch and rebuilds when updates are available.

Skills are automatically discovered by Claude Code at session start. Skill pack bin/ directories are added to PATH.

GPU support

Sandy can pass host GPUs into the container for ML/AI workloads. This requires the NVIDIA Container Toolkit installed on the host.

Enable via environment variable or .sandy/config:

# .sandy/config
SANDY_GPU=all          # all GPUs
SANDY_GPU=0            # specific GPU
SANDY_GPU=0,1          # multiple GPUs

The sandy base image does not include CUDA. Use .sandy/Dockerfile to layer GPU tools for projects that need them (see examples/gpu/Dockerfile for a ready-to-copy starting point). The per-project image is cached and only rebuilds when .sandy/Dockerfile changes.

Example — CUDA + Python ML (works on x86_64 and arm64, including DGX Spark):

ARG BASE_IMAGE
FROM ${BASE_IMAGE}

# Add NVIDIA CUDA apt repository (arch-aware — maps aarch64 to sbsa for Debian)
RUN CUDA_ARCH="$(uname -m)"; [ "$CUDA_ARCH" = "aarch64" ] && CUDA_ARCH="sbsa"; \
    curl -fsSL "https://developer.download.nvidia.com/compute/cuda/repos/debian12/${CUDA_ARCH}/cuda-keyring_1.1-1_all.deb" \
        -o /tmp/cuda-keyring.deb \
    && dpkg -i /tmp/cuda-keyring.deb && rm /tmp/cuda-keyring.deb \
    && apt-get update \
    && apt-get install -y --no-install-recommends cuda-toolkit \
    && rm -rf /var/lib/apt/lists/*

For a lighter setup that skips system CUDA and uses pre-built wheels:

ARG BASE_IMAGE
FROM ${BASE_IMAGE}
RUN pip install --user torch torchvision torchaudio

Platform notes:

x86_64: Standard NVIDIA GPUs (RTX, A100, H100, etc.) — fully supported
arm64 / DGX Spark: Grace Blackwell architecture — fully supported (base image and CUDA repo are multi-arch)
macOS: Docker Desktop does not support GPU passthrough; SANDY_GPU has no effect

Persistent packages

Packages installed inside sandy persist across sessions per project. Each project sandbox has dedicated bind-mounted directories for each package manager:

pip install flask          # persists in sandbox pip/ dir
npm install -g typescript  # persists in sandbox npm-global/ dir
go install golang.org/x/tools/gopls@latest  # persists in sandbox go/ dir
cargo install ripgrep      # persists in sandbox cargo/ dir

These are per-project — packages installed in one project don't leak to another.

Python version management

The base image ships one system Python (Debian bookworm's default). If your project needs a specific version, use uv:

uv python install 3.11        # downloads once, persists across sessions
uv venv --python 3.11         # creates .venv in project dir
source .venv/bin/activate
uv pip install -r requirements.txt

Different projects can use different Python versions with the same sandy image — each project's sandbox stores its own uv-managed Python installations.

Using host virtual environments and build artifacts

Your project directory is bind-mounted read-write, so node_modules/, target/, and other build directories from the host are visible inside the container. .venv/ is a special case — see below.

Python .venv/ — a host-created venv's bin/python is a symlink to a host-only interpreter path (e.g. /Users/you/.local/share/uv/python/cpython-3.12-macos-aarch64/bin/python3.12) that doesn't exist inside the Linux container. To avoid breaking the host venv and to give the container a working venv, sandy shadows .venv/ with a sandbox-owned overlay:
- On launch, if $WORKSPACE/.venv exists on the host and isn't a symlink, sandy bind-mounts a sandbox-owned directory over it inside the container. The host venv on disk is untouched.
- On first launch the overlay is empty; sandy runs uv venv --clear --python <version> to materialize a fresh venv. The Python version comes from .python-version if present (authoritative), otherwise from the host's pyvenv.cfg.
- Populate it once: uv sync / uv pip install -e . / pip install -r requirements.txt. Subsequent launches skip straight to activation — the overlay persists in ~/.sandy/sandboxes/<project>/venv/.
- Sandy sets VIRTUAL_ENV and prepends .venv/bin to PATH automatically — no need to source .venv/bin/activate.
- If you bump .python-version after the overlay was built, sandy prints a drift warning with the recreate command (auto-recreate is deliberately not done — it would nuke installed packages).
- Opt out with SANDY_VENV_OVERLAY=0 in .sandy/config if you want the raw bind mount. The host and container then need matching Python versions at matching paths or the venv is broken.
- Non-standard venv names (venv/, .venv-py311/) are not overlaid — only the standard .venv/ is.
Node.js node_modules/ — pure JS packages work fine. Native addons compiled on the host work if the host is also Linux with compatible glibc. Fix: npm rebuild
Rust target/ — reusable if both sides are Linux x86_64. macOS host → Linux container triggers a full rebuild automatically
Go vendor/ — pure source, always works

Per-project tooling (`.sandy/Dockerfile`)

If your project needs system tools beyond the base image, create a .sandy/Dockerfile in your project directory:

ARG BASE_IMAGE
FROM ${BASE_IMAGE}

# No USER directive needed — entrypoint handles privilege dropping
RUN curl -LsSf https://github.com/typst/typst/releases/latest/download/typst-x86_64-unknown-linux-musl.tar.xz \
    | tar -xJ --strip-components=1 -C /usr/local/bin
ARG QUARTO_VERSION=1.9.36
RUN curl -fL "https://github.com/quarto-dev/quarto-cli/releases/download/v${QUARTO_VERSION}/quarto-${QUARTO_VERSION}-linux-amd64.tar.gz" \
    | tar -xz -C /opt \
    && ln -s /opt/quarto-${QUARTO_VERSION}/bin/quarto /usr/local/bin/quarto

Sandy detects this file and builds a project-specific image layered on top of the standard sandy image. The project image:

Rebuilds automatically when the Dockerfile changes or the base sandy image updates
Is cached per-project (tagged as sandy-project-<name>-<hash>)
Uses the .sandy/ directory as build context, so you can COPY files from there

This is the right approach for system packages (apt-get), large binary tools, or anything that needs root to install. See examples/ for ready-to-use configurations.

Automatic environment detection

Sandy checks your project on startup and handles common issues:

.python-version — if present, sandy auto-installs that Python version via uv (persists across sessions)
Host .venv/ — shadowed with a sandbox-owned overlay (see above). The host venv is never modified; the container gets its own materialized venv matching the host's Python version, auto-activated via VIRTUAL_ENV + PATH. Drift between the overlay and .python-version triggers a warning on relaunch
Foreign native modules — if node_modules/ contains native addons compiled for a different platform (e.g. macOS), sandy warns with npm rebuild as the fix

These checks run on every session start and add negligible overhead.

Terminal Notifications

Sandy passes through OSC escape sequences (9/99/777) from Claude Code to the outer terminal. This enables notification features in terminals like cmux and iTerm2 — pane rings, desktop alerts, and badges when Claude needs attention.

cmux auto-setup: When sandy detects it's running inside cmux (via the CMUX_WORKSPACE_ID environment variable), it automatically installs a notification hook that emits OSC 777 sequences on Claude Code events. No manual configuration needed — just run sandy in a cmux pane.

Custom hooks: If you have Claude Code hooks configured on the host (~/.claude/hooks/), sandy mounts them read-only into the container automatically. Host hooks take precedence over auto-setup (cmux auto-setup is skipped if ~/.claude/hooks/ exists on the host).

Clipboard: Sandy's tmux uses OSC 52 to copy mouse selections to the system clipboard. In iTerm2, enable this under Settings > General > Selection > "Applications in terminal may access clipboard". With this enabled, click-drag selections in the container are automatically copied to your Mac clipboard.

Channels (Telegram, Discord)

Sandy supports Claude Code channels — push messages from Telegram or Discord into your running session. Sandy auto-installs the channel plugin and seeds credentials on startup.

Quick setup (Telegram)

Create a bot via BotFather and copy the token

Add to .sandy/.secrets (gitignored):

TELEGRAM_BOT_TOKEN=123456789:AAH...
TELEGRAM_ALLOWED_SENDERS=your_telegram_user_id

Add to .sandy/config:

SANDY_CHANNELS=plugin:telegram@claude-plugins-official

Run sandy — the plugin is auto-installed, credentials are seeded, and Claude starts with the channel active

To find your Telegram user ID, message @userinfobot. If TELEGRAM_ALLOWED_SENDERS is omitted, sandy starts in pairing mode — DM your bot, then run /telegram:access pair <code> inside the session.

Quick setup (Discord)

Create an application at the Discord Developer Portal
In the Bot section, create a bot, reset the token, and copy it
Enable Message Content Intent under Privileged Gateway Intents
Use OAuth2 > URL Generator with the bot scope and these permissions: View Channels, Send Messages, Send Messages in Threads, Read Message History, Attach Files, Add Reactions. Open the generated URL to invite the bot to your server.

Add to .sandy/.secrets (gitignored):

DISCORD_BOT_TOKEN=your_discord_bot_token
DISCORD_ALLOWED_SENDERS=your_discord_user_id

Add to .sandy/config:

SANDY_CHANNELS=plugin:discord@claude-plugins-official

Run sandy — the plugin is auto-installed, credentials are seeded, and Claude starts with the channel active

If DISCORD_ALLOWED_SENDERS is omitted, sandy starts in pairing mode — DM your bot, then run /discord:access pair <code> inside the session.

Using both channels

Set both tokens in .sandy/.secrets and list both plugins in .sandy/config:

SANDY_CHANNELS=plugin:telegram@claude-plugins-official plugin:discord@claude-plugins-official

Channels with Gemini / Codex / multi-agent mode

For any SANDY_AGENT value other than single-agent claude, sandy uses a host-side Telegram relay instead of the in-container plugin — it long-polls the Telegram Bot API on the host and injects messages into the container's tmux session via docker exec … tmux send-keys. This is agent-agnostic but lower-fidelity: no chat threading, no edit-message updates, no attachments. Set SANDY_CHANNEL_TARGET_PANE=0|1|2 to route messages to a specific pane in multi-agent mode (default is pane 0 = the first agent listed in SANDY_AGENT). Discord via relay is not supported yet — use single-agent SANDY_AGENT=claude for Discord.

Per-project secrets

.sandy/.secrets uses the same KEY=VALUE format as .sandy/config but is intended for credentials. Add it to .gitignore:

.sandy/.secrets

Security Notes

The container runs as a non-root user (claude, mapped to host UID)
The root filesystem is read-only (/tmp and /home/claude are tmpfs)
no-new-privileges prevents privilege escalation
Credentials are seeded into per-project sandboxes, not shared across projects
The working directory is bind-mounted read/write — Claude can modify your files there (that's the point)

Protected files and directories

The workspace is bind-mounted read/write so Claude can modify your project files. However, certain files and directories are overlaid with read-only or sandbox mounts to block the most dangerous attack vectors for an AI coding agent: shell config injection, git hook injection, and tool config tampering.

Read-only mounts — host content is visible but cannot be modified:

Path	Why
`.bashrc`, `.bash_profile`, `.zshrc`, `.zprofile`, `.profile`	Blocks shell config injection (e.g. aliases, PATH hijacking)
`.gitconfig`	Blocks git config tampering (e.g. credential helpers, aliases)
`.ripgreprc`	Blocks search config injection
`.mcp.json`	Blocks MCP server config tampering
`.envrc`	Blocks `direnv` auto-sourcing (executes on `cd`)
`.tool-versions`, `.mise.toml`, `.nvmrc`, `.node-version`, `.python-version`, `.ruby-version`	Blocks asdf/mise/nvm/pyenv/rbenv toolchain hijacking
`.npmrc`, `.yarnrc`, `.yarnrc.yml`, `.pypirc`, `.netrc`	Blocks registry hijacking and credential exfiltration
`.pre-commit-config.yaml`	Blocks pre-commit hook injection
`.git/config`, `.gitmodules`, `.git/HEAD`, `.git/packed-refs`	Blocks git remote/hook path manipulation and ref spoofing
`.git/hooks/`	Blocks git hook injection (pre-commit, post-checkout, etc.)
`.git/info/`	Blocks `.git/info/attributes` filter-driver injection
`.git/modules/<sub>/{config,hooks,info}`	Same, for every submodule gitdir (walked recursively)
`.vscode/`, `.idea/`	Blocks IDE task/launch config injection
`.github/workflows/`	Blocks CI pipeline escape (opt-out via `SANDY_ALLOW_WORKFLOW_EDIT=1`)
`.circleci/`, `.devcontainer/`	Blocks CircleCI and devcontainer escape

Sandbox-mounted directories — overlaid with writable sandbox copies so Claude can create and modify them without touching the host:

Path	Behavior
`.claude/commands/`	Starts empty. Claude can create new slash commands
`.claude/agents/`	Starts empty. Claude can create new agents
`.claude/plugins/`	Starts empty. Managed via `/plugin install` inside the container

Always-mount pattern (1.0-rc1). Since 1.0-rc1, protected files and directories are always mounted — if the host has no corresponding file, sandy overlays an empty zero-byte file or empty directory read-only at that path instead. This closes the pre-1.0 gap where absent protected files could be created inside the container and silently loaded back on the host on first read. The one exception is the git-file set (.git/config, .gitmodules, .git/HEAD, .git/packed-refs), which is still gated on existence — those files are meaningless without a real git repo.

Name		Name	Last commit message	Last commit date
Latest commit History 221 Commits
.github/workflows		.github/workflows
analysis		analysis
docs		docs
examples		examples
proxy		proxy
research		research
templates		templates
test		test
.gitignore		.gitignore
.gitmodules		.gitmodules
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
RELEASE_NOTES.md		RELEASE_NOTES.md
SPECIFICATION.md		SPECIFICATION.md
SPEC_INTROSPECTION.md		SPEC_INTROSPECTION.md
doctor.sh		doctor.sh
install.sh		install.sh
sandy		sandy

Folders and files

Latest commit

History

Repository files navigation

sandy — an isolated sibling for your coding agents

Why Sandy — Virtual Environments for Claude Code

Isolation layers

Prerequisites

Installation

Usage

Configuration

Per-project config (.sandy/config)

Environment variables

Flags

Headless / remote servers

Running Gemini CLI (SANDY_AGENT=gemini)

Running Codex CLI (SANDY_AGENT=codex)

Running OpenCode (SANDY_AGENT=opencode)

Multi-agent mode

Screenshot skill (/ss)

How Network Isolation Works

Egress proxy (SANDY_EGRESS_PROXY) — cross-platform isolation

macOS (Docker Desktop) — not isolated when the proxy is off

Linux

Verifying Isolation

What's in the Box

Language toolchains

System tools

Python tools

Libraries

Plugin marketplace

Skill packs

GPU support

Persistent packages

Python version management

Using host virtual environments and build artifacts

Per-project tooling (.sandy/Dockerfile)

Automatic environment detection

Terminal Notifications

Channels (Telegram, Discord)

Quick setup (Telegram)

Quick setup (Discord)

Using both channels

Channels with Gemini / Codex / multi-agent mode

Per-project secrets

Security Notes

Protected files and directories

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 27

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Per-project config (`.sandy/config`)

Running Gemini CLI (`SANDY_AGENT=gemini`)

Running Codex CLI (`SANDY_AGENT=codex`)

Running OpenCode (`SANDY_AGENT=opencode`)

Screenshot skill (`/ss`)

Egress proxy (`SANDY_EGRESS_PROXY`) — cross-platform isolation

Per-project tooling (`.sandy/Dockerfile`)

Packages