feat(daemon): gate direct session shell behind explicit opt-in by doudouOUC · Pull Request #5031 · QwenLM/qwen-code

doudouOUC · 2026-06-12T08:09:29Z

What this PR does

This PR turns direct session shell execution into an explicit daemon opt-in. qwen serve --enable-session-shell is now required, and the effective policy is only enabled when a bearer token is configured. REST, ACP _qwen/session/shell, and the bridge execution sink all enforce the same policy.

It also requires direct shell callers to provide a client id already bound to the target session. ACP calls use the bridge-stamped client id from the owned session binding, and SDK docs now call out the opt-in, bearer auth, and session-bound client id requirements.

For direct createServeApp() embedders, token: '' is treated as tokenless rather than configured. Strict mutation routes return token_required, and the session shell capability stays hidden until a non-empty token is supplied.

Passing --enable-session-shell without a token now emits a boot warning and leaves direct session shell disabled.

Why it's needed

POST /session/:id/shell bypasses the normal agent tool approval flow and executes directly through the daemon. Leaving that surface reachable by default, or reachable with only a daemon token plus a session id, gives too much authority to a high-risk endpoint. This PR makes the capability disabled by default and adds a session ownership proof before any command can execute.

Reviewer Test Plan

How to verify

Start a default loopback daemon without a bearer token and confirm /capabilities.features does not include session_shell_command, ACP initialize does not advertise _qwen/session/shell, and REST shell returns 401 token_required.

Start an authenticated daemon without --enable-session-shell and confirm authenticated direct shell calls return session_shell_disabled without reaching the bridge.

Start an authenticated daemon with --enable-session-shell and confirm shell calls without X-Qwen-Client-Id return client_id_required, while calls with a client id bound to the session reach the bridge and execute normally.

For direct embedders, call createServeApp({ token: '' }) and confirm it behaves as tokenless: strict shell mutation returns token_required, and session_shell_command is not advertised.

Evidence (Before & After)

N/A for UI. Local automated verification covered the REST, ACP, bridge, CLI argument, build, and typecheck paths.

Tested on

OS	Status
🍏 macOS	✅ tested
🪟 Windows	⚠️ not tested
🐧 Linux	✅ tested

Environment (optional)

Local Node/npm workspace in the repository worktree.

Risk & Scope

Main risk or tradeoff: this is a behavior change for direct daemon shell clients; callers must explicitly enable the daemon capability and provide a session-bound client id.
Not validated / out of scope: no shell-specific rate limiter, no PermissionMediator synthetic approval flow, no changes to normal agent shell tool approvals or prompt queue behavior.
Breaking changes / migration notes: bare DaemonClient.shellCommand(sessionId, command) calls must pass opts.clientId when the daemon has direct session shell enabled; DaemonSessionClient.shellCommand() continues to forward the session-bound client id automatically. Direct embedders that pass an empty string token should pass a non-empty bearer token or omit token; an empty string is not treated as a configured token.

Linked Issues

References #4490.

中文说明

What this PR does

这个 PR 把 direct session shell 执行改成 daemon 显式 opt-in 能力。现在必须传 qwen serve --enable-session-shell，并且只有配置了 bearer token 时 effective policy 才会开启。REST、ACP _qwen/session/shell 和 bridge 执行入口都会执行同一套策略。

它还要求 direct shell 调用方提供已经绑定到目标 session 的 client id。ACP 调用会使用 owned session binding 里由 bridge stamp 的 client id，SDK 注释也补充了 opt-in、bearer auth 和 session-bound client id 的要求。

对直接使用 createServeApp() 的 embedder，token: '' 会被视为没有配置 token。strict mutation route 会返回 token_required，并且只有提供非空 token 后才会暴露 session_shell_command capability。

传了 --enable-session-shell 但没有配置 token 时，现在启动阶段会打印 warning，并保持 direct session shell disabled。

Why it's needed

POST /session/:id/shell 会绕过普通 agent tool approval flow，直接通过 daemon 执行命令。默认可达，或者只凭 daemon token 加 session id 可达，对高风险入口来说权限过大。这个 PR 默认禁用该能力，并在执行任何命令前增加 session ownership proof。

Reviewer Test Plan

How to verify

启动一个没有 bearer token 的默认 loopback daemon，确认 /capabilities.features 不包含 session_shell_command，ACP initialize 不广告 _qwen/session/shell，REST shell 返回 401 token_required。

启动一个带认证但没有 --enable-session-shell 的 daemon，确认认证后的 direct shell 调用返回 session_shell_disabled，并且不会进入 bridge。

启动一个带认证且开启 --enable-session-shell 的 daemon，确认缺少 X-Qwen-Client-Id 的 shell 调用返回 client_id_required，而携带绑定到该 session 的 client id 时会进入 bridge 并正常执行。

对直接 embed 的路径，调用 createServeApp({ token: '' })，确认它按未配置 token 处理：strict shell mutation 返回 token_required，且不广告 session_shell_command。

Evidence (Before & After)

非 UI 变更，本地自动化验证覆盖了 REST、ACP、bridge、CLI 参数、build 和 typecheck 路径。

Tested on

OS	Status
🍏 macOS	✅ tested
🪟 Windows	⚠️ not tested
🐧 Linux	✅ tested

Environment (optional)

本地 Node/npm 仓库 worktree。

Risk & Scope

Main risk or tradeoff: 这是 direct daemon shell client 的行为变更；调用方必须显式开启 daemon 能力，并提供 session-bound client id。
Not validated / out of scope: 不新增 shell-specific rate limiter，不接入 PermissionMediator synthetic approval flow，不改变普通 agent shell tool approval 或 prompt queue 行为。
Breaking changes / migration notes: 裸 DaemonClient.shellCommand(sessionId, command) 在 daemon 开启 direct session shell 后必须传 opts.clientId；DaemonSessionClient.shellCommand() 会继续自动转发 session-bound client id。直接 embedder 如果传了空字符串 token，应改为传非空 bearer token 或省略 token；空字符串不会被视为已配置 token。

Linked Issues

参考 #4490。

doudouOUC · 2026-06-12T08:09:43Z

E2E / Verification Report

Local verification was run on macOS for the direct session shell permission-policy follow-up.

Commands run:

cd packages/acp-bridge && npx vitest run src/bridge.test.ts
cd packages/cli && npx vitest run src/serve/server.test.ts src/serve/acpHttp/transport.test.ts src/commands/serve.test.ts
npm run build
npm run typecheck
git diff --check

Results:

packages/acp-bridge: 255 tests passed.
packages/cli: 514 focused serve/ACP/command tests passed.
npm run build: passed; emitted existing vscode companion curly warnings and stale browserslist data warnings only.
npm run typecheck: passed.
git diff --check: passed.

Manual audit coverage:

REST direct shell stays strict-gated and default-disabled.
ACP initialize only advertises _qwen/session/shell when the effective policy is enabled.
Disabled ACP shell calls still route to the connection stream and return a stable session_shell_disabled error.
Bridge executeShellCommand() requires enabled policy and a session-bound client id before session lookup/execution.
SDK session wrapper continues to forward the bound client id automatically.

Copilot

Copilot was unable to review this pull request because the user who requested the review has reached their quota limit.

wenshao

All three R1 findings addressed in this push:

Critical server.test.ts TS2367 — session_shell_command added to EXPECTED_REGISTERED_FEATURES with full conditional predicate coverage. Fixed.
Suggestion dispatch.ts error code — toRpcError now maps shell policy errors to RPC.INVALID_PARAMS (-32602), consistent with JSON-RPC 2.0 semantics. All test assertions updated. Fixed.
Suggestion dispatch.ts:1131 abort signal — executeShellCommand now forwards binding.abort.signal from the owned session binding. Test asserts the signal is wired and not pre-aborted. Fixed.

Deterministic analysis: tsc 0 errors (after build), eslint 0 errors. Tests: 512/512 passed (transport.test.ts + server.test.ts).

Downgraded from Approve to Comment: CI still running.

— qwen3.7-max via Qwen Code /review

wenshao · 2026-06-12T11:23:48Z

Local runtime verification on Linux (real daemon over HTTP) — PASS ✅

I built this branch (d97d93d) from a fresh npm ci and drove the real qwen serve daemon over HTTP in all three policy states plus the base build for contrast — five daemons under tmux, exercised with curl through the actual REST + ACP surface. Every line of the Reviewer Test Plan holds, the ownership proof is genuinely session-scoped, and the authority gap this PR closes reproduces on main. This also covers the Linux row (author tested macOS).

Setup: Linux, fresh npm ci + npm run build, daemons launched with the suite-style flags (serve --port … --hostname 127.0.0.1 --workspace <scratch>), real ShellExecutionService (no mocks). Base contrast = built main (531a15d).

Policy matrix (all observed at runtime)

State	`capabilities.features` has `session_shell_command`	ACP initialize advertises `_qwen/session/shell`	`POST /session/:id/shell`
A — no token, no flag	❌ (59 feats)	❌ (40 methods)	401 `token_required` (strict gate fires first)
B — token, no flag	❌ (59)	❌ (40)	403 `session_shell_disabled` — bridge never reached
C — token + `--enable-session-shell`	✅ (60)	✅ (41)	gated chain below → executes
D — flag without token	❌ (59)	❌ (40)	401 `token_required` (opt-in inert without auth)
base `main` — no flag	❌	n/a	200, executes ungated ← the gap this PR closes

State A — default tokenless loopback

session_shell_command in features: False | total: 59
_qwen/session/shell advertised:    False | total methods: 40
POST /session/<id>/shell (no token) → 401
  {"error":"This route requires the daemon to be configured with a bearer token…","code":"token_required"}

State B — authenticated, opt-in NOT set

session_shell_command in features: False        ACP _qwen/session/shell: not advertised
POST shell (token, no client-id)       → 403 {"code":"session_shell_disabled","errorKind":"session_shell_disabled"}
POST shell (token, WITH bound client-id) → 403 session_shell_disabled   ← disabled beats ownership (correct order)
daemon log: 2× `status=403 request completed`, 0× `shell command completed`  ← bridge sink never reached

State C — authenticated + `--enable-session-shell` (the ownership chain)

session_shell_command in features: True (60)    ACP _qwen/session/shell: advertised (41 methods)

POST shell, NO X-Qwen-Client-Id            → 403 {"code":"client_id_required"}
POST shell, malformed client-id "bad id…"  → 400 {"code":"invalid_client_id"}  (format gate)
POST shell, well-formed UNBOUND client-id  → 400 invalid_client_id  "…is not registered for session f20fd677…"
POST shell, real client-id from a DIFFERENT daemon → 400 invalid_client_id  ← cross-session isolation
POST shell, bound client-id, EMPTY command → 400 "`command` is required and must be a non-empty string"
POST shell, bound client-id, real command  → 200 {"exitCode":0,"output":"gate-open\n","aborted":false}
daemon log: `… clientId=client_6a95f11f… exitCode=0 shell command completed`  ← bridge executed

🔍 The ownership check is provably session-scoped, not "any token-holder": resolveTrustedClientId does entry.clientIds.has(clientId) on the addressed session. To prove this at runtime (a single --workspace daemon attaches every POST /session to one shared session, so I couldn't get two sessions in one registry) I spun up a second enabled daemon on a second workspace, took its real server-issued client-id, and presented it to the first daemon's session → 400 invalid_client_id, while that daemon's own bound id returned 200. A valid, server-issued client-id from elsewhere does not unlock a session it isn't bound to.

Probes beyond the plan

🔍 Wrong bearer token on the enabled shell route → 401 Unauthorized (auth gate precedes the shell policy — defense in depth).
🔍 --enable-session-shell without --token (State D) → capability stays hidden and REST still returns 401 token_required. Confirms the enableSessionShell === true && token !== undefined effective-policy fold: opt-in alone is inert.
🔍 Base main rejects the flag outright: Unknown arguments: enable-session-shell, enableSessionShell (yargs strict) — the flag is genuinely PR-new.
🔍 Malformed vs unbound client-id are distinct paths: format violation → REST parseClientIdHeader 400; well-formed-but-unregistered → bridge InvalidClientIdError 400 naming the session. Both surface invalid_client_id.

Findings

✅ All three enforcement layers behave as designed. I drove the REST route fully (every error branch + success) and the ACP initialize advertisement across all four states. I did not drive the ACP dispatch path (_qwen/session/shell JSON-RPC call → session_shell_disabled/client_id_required) end-to-end — that needs the full ACP connection+ownership SSE handshake — but it shares the same bridge sink I exercised via REST, and it's covered by the PR's transport unit tests. Worth a note for the record, not a gap.
The base contrast makes the motivation concrete: on main, a tokenless loopback daemon executes POST /session/<id>/shell with only a session id, no client id (200 … "MAIN-EXECUTES-UNGATED\n"). That's exactly the over-broad authority the PR removes.
Env note (not a bug): a single---workspace daemon attaches all POST /session (even {"forceNew":true}) to one shared session, minting a fresh client-id each time — all bound to that one session. So a "two sessions, same daemon" cross-ownership test isn't reachable from one daemon; the second-daemon approach above is the right way to exercise it.
Error contract matches the design doc exactly: REST 401 token_required / 403 session_shell_disabled / 403 client_id_required / 400 invalid_client_id, with both code and errorKind populated on the two new 403s.

中文版（点击展开）

Linux 本地真实运行验证（真实守护进程 + HTTP）— 通过 ✅

我从全新 npm ci 构建本分支（d97d93d），在 tmux 下对真实 qwen serve 守护进程的三种策略状态外加 main 基线共五个守护进程做了 HTTP 驱动验证，全部经由真实的 REST + ACP 接口用 curl 调用。Reviewer Test Plan 每一条都成立，ownership 证明确实是按 session 范围校验的，本 PR 所封堵的越权能力在 main 上可复现。本次同时覆盖 Linux 一行（作者本地为 macOS）。

环境： Linux，全新 npm ci + npm run build，守护进程以套件风格参数启动（serve --port … --hostname 127.0.0.1 --workspace <临时目录>），真实 ShellExecutionService（无 mock）。基线对比 = 构建后的 main（531a15d）。

策略矩阵（均为运行时实测）

状态	`capabilities.features` 含 `session_shell_command`	ACP initialize 广告 `_qwen/session/shell`	`POST /session/:id/shell`
A — 无 token、无 flag	❌（59 项）	❌（40 个方法）	401 `token_required`（strict 门先触发）
B — 有 token、无 flag	❌（59）	❌（40）	403 `session_shell_disabled` —— 不进入 bridge
C — token + `--enable-session-shell`	✅（60）	✅（41）	见下方门控链 → 执行
D — 有 flag 但无 token	❌（59）	❌（40）	401 `token_required`（无认证则 opt-in 无效）
基线 `main` — 无 flag	❌	不适用	200，无门控直接执行 ← 本 PR 封堵的缺口

状态 A — 默认无 token loopback

features 含 session_shell_command: False | 共 59
ACP _qwen/session/shell 广告:       False | 共 40 个方法
POST /session/<id>/shell（无 token）→ 401 {"code":"token_required"}

状态 B — 已认证、未开启 opt-in

features 含 session_shell_command: False        ACP _qwen/session/shell：未广告
POST shell（token、无 client-id）        → 403 {"code":"session_shell_disabled","errorKind":"session_shell_disabled"}
POST shell（token、携带已绑定 client-id） → 403 session_shell_disabled   ← disabled 优先于 ownership（顺序正确）
守护进程日志：2 次 `status=403 request completed`，0 次 `shell command completed`  ← bridge 入口从未到达

状态 C — 已认证 + `--enable-session-shell`（ownership 链）

features 含 session_shell_command: True（60）    ACP _qwen/session/shell：已广告（41 个方法）

POST shell，无 X-Qwen-Client-Id            → 403 {"code":"client_id_required"}
POST shell，格式非法 client-id "bad id…"   → 400 {"code":"invalid_client_id"}（格式门）
POST shell，格式合法但未绑定 client-id      → 400 invalid_client_id  "…is not registered for session f20fd677…"
POST shell，来自另一守护进程的真实 client-id → 400 invalid_client_id  ← 跨 session 隔离
POST shell，已绑定 client-id，空命令         → 400 "`command` is required and must be a non-empty string"
POST shell，已绑定 client-id，真实命令        → 200 {"exitCode":0,"output":"gate-open\n","aborted":false}
守护进程日志：`… clientId=client_6a95f11f… exitCode=0 shell command completed`  ← bridge 执行

🔍 ownership 校验可证明是按 session 范围的，而非"任意持 token 者"：resolveTrustedClientId 对被寻址的那个 session 做 entry.clientIds.has(clientId)。为在运行时证明这点（单一 --workspace 守护进程会把每次 POST /session 都 attach 到同一个共享 session，因此无法在一个注册表里凑出两个 session），我在第二个工作区另起了一个开启该能力的守护进程，取它真实下发的 client-id，拿去访问第一个守护进程的 session → 400 invalid_client_id；而该守护进程自己绑定的 id 则返回 200。一个合法、由服务端下发、但属于别处的 client-id，无法解锁未绑定它的 session。

计划之外的探测

🔍 对已开启的 shell 路由用错误 bearer token → 401 Unauthorized（认证门先于 shell 策略——纵深防御）。
🔍 --enable-session-shell 但无 --token（状态 D）→ 能力保持隐藏，REST 仍返回 401 token_required。证实 enableSessionShell === true && token !== undefined 的有效策略折叠：仅 opt-in 无效。
🔍 基线 main 直接拒绝该 flag：Unknown arguments: enable-session-shell, enableSessionShell（yargs 严格模式）——该 flag 确为 PR 新增。
🔍 格式非法与未绑定是两条不同路径：格式违规 → REST parseClientIdHeader 400；格式合法但未注册 → bridge InvalidClientIdError 400 并指明 session。二者都呈现 invalid_client_id。

观察

✅ 三层防御均按设计工作。我完整驱动了 REST 路由（每个错误分支 + 成功路径）以及四种状态下的 ACP initialize 广告。我没有端到端驱动 ACP dispatch 路径（_qwen/session/shell JSON-RPC 调用 → session_shell_disabled/client_id_required）——那需要完整的 ACP 连接+ownership 的 SSE 握手——但它与我经 REST 驱动的 bridge 入口共用同一处，且本 PR 的 transport 单测已覆盖。如实记录，非缺口。
基线对比让动机具体化：在 main 上，无 token 的 loopback 守护进程仅凭 session id（无 client id）即可执行 POST /session/<id>/shell（200 … "MAIN-EXECUTES-UNGATED\n"）。这正是本 PR 移除的过度权限。
环境说明（非 bug）：单 --workspace 守护进程会把所有 POST /session（包括 {"forceNew":true}）attach 到同一个共享 session，每次只新铸一个 client-id，全部绑定到那一个 session。因此"同一守护进程两个 session"的跨 ownership 测试无法从单个守护进程触达；上面的第二守护进程方案是正确的验证方式。
错误契约与设计文档完全一致：REST 401 token_required / 403 session_shell_disabled / 403 client_id_required / 400 invalid_client_id，两个新增 403 的 code 与 errorKind 均填充。

qqqys

Previous critical feedback appears resolved on head 0daa475; I did not find any new critical blocker in this pass.

wenshao · 2026-06-12T13:11:16Z

🧪 Local runtime verification (built `qwen serve` daemons in tmux + curl, real REST + ACP) — ✅ PASS (full three-layer gate confirmed, incl. an adversarial bypass attempt)

Built the PR head (0daa475, 3 commits, ~38 behind main) and drove four real daemon configurations from dist/cli.js in tmux, hitting the policy at every surface it touches — REST, ACP JSON-RPC (over a live SSE connection), the capability registry, and the bridge ownership check. Each row below is an actual request against a running daemon, not a unit test.

Configs: 4601 no-token loopback · 4602 --token (no flag) · 4603 --token --enable-session-shell · 4604 --enable-session-shell without token. Scratch workspace, isolated tmux socket.

Advertisement layer

Surface	4601 no-token	4602 token, no flag	4603 token + flag
`/capabilities.features` has `session_shell_command`	❌ absent	❌ absent	✅ present
ACP `initialize` methods has `_qwen/session/shell`	❌ (40 methods)	❌ (40 methods)	✅ (41 methods)

Both gated strictly on token-configured AND flag-set — exactly enableSessionShell === true && tokenConfigured.

Enforcement layer (the part that actually matters)

✅ No-token daemon → 401 token_required. POST /4601/session/<id>/shell → 401 {code:"token_required"}. 🔍 Even with a bogus Authorization: Bearer anything it's still 401 — the strict mutation gate keys on the daemon having a token configured, not on the request carrying one. (This route went from mutate() to mutate({strict:true}), so no-token loopback shell is now refused — the intended hardening, and a real behavior change for that setup.)
✅ Token, no flag → 403 session_shell_disabled, before the bridge. POST /4602/...shell with a valid bearer → 403 {code:"session_shell_disabled"}. I sent a non-existent session id and got session_shell_disabled, not session_not_found — proving the disabled check fires before session lookup / bridge dispatch.
✅ Token + flag, no X-Qwen-Client-Id → 403 client_id_required.
✅ Token + flag, session-bound client id → executes. Created a session (daemon-assigned clientId), then POST /4603/.../shell with that id → 200 {"exitCode":0,"output":"GATE-EXEC-OK-7777\n"}. Real command, real bridge.
✅ Token + flag, unbound client id → 400 invalid_client_id. X-Qwen-Client-Id: cid.unbound999 → Client id "cid.unbound999" is not registered for session <id> — the bridge's ownership proof (InvalidClientIdError) rejects a caller who authenticated to the daemon but never owned the session. This is the core of the PR's "session ownership proof."
✅ ACP execution path works. Over a live ACP connection on 4603 (init → SSE → session/new → _qwen/session/shell) → {"exitCode":0,"output":"ACP-EXEC-OK-5555\n"}.
✅ 🔍 Adversarial: unadvertised method on the disabled daemon is still rejected. A client that ignores the advertise list and calls _qwen/session/shell on 4602 (where it's hidden) gets {code:-32602, errorKind:"session_shell_disabled"}. Advertise-hiding is defense-in-depth; the real gate is in the handler. This is the most important result — the security doesn't depend on clients respecting the capability list.
✅ Boot warning. Launching --enable-session-shell without a token (4604) emitted exactly: qwen serve: --enable-session-shell ignored because no bearer token is configured. Set QWEN_SERVER_TOKEN or pass --token… and the capability stayed off.

Probes (held)

🔍 Empty command (" ") with a bound client → 400 \command` is required and must be a non-empty string`.
🔍 Malformed client id (bad id!!#) → 400 (fails the ^[A-Za-z0-9._:-]+$ validation before any dispatch).

Findings

⚠️ The session-bound client id is daemon-assigned, not caller-chosen. I sent X-Qwen-Client-Id: cid.alpha on session-create and the response came back bound to a generated client_b24c… instead. So a raw REST caller must read clientId from the create/load response and echo it back on the shell call — they cannot predict or pin it. The SDK's DaemonSessionClient.shellCommand() forwards it automatically (per the PR), so SDK users are fine; this is only a sharp edge for someone wiring the REST route by hand, and it'd be worth one line in the SDK docs' REST example.
The "without reaching the bridge" claim (scenario B) is solidly true and I verified it behaviorally (disabled-error for a non-existent session, not not-found) rather than relying on logs — note the bridge's executeShellCommand debug line is debug-level and wouldn't appear on stdout anyway, so don't use log-absence as the proof.
Env/scope: branch is ~38 commits behind main; a rebase before merge is advisable. ACP responses ride the connection SSE stream, and repeated bare POST /acp inits accumulate connections — a harness detail (I restarted daemons between ACP captures), not a product issue.

Verdict (merge reference)

PASS. Every layer the PR claims to enforce holds at runtime: the capability and ACP method are advertised only under token+flag; REST returns token_required / session_shell_disabled / client_id_required in the right order and before the bridge; bound client ids execute and unbound ids are rejected by the bridge ownership check; the ACP path enforces identically; and — the part I most wanted to break — a client bypassing the advertise list still hits a handler-level session_shell_disabled. The boot warning fires. Nothing here blocks merging; the daemon-assigned-clientId note is a docs nicety, not a defect.

🇨🇳 中文版（点击展开）

🧪 本地运行时验证（tmux 中构建运行 `qwen serve` 守护进程 + curl，真实 REST + ACP）— ✅ 通过（完整三层门控确认，含一次对抗性绕过尝试）

构建 PR head（0daa475，3 个 commit，落后 main 约 38 个），在 tmux 中用 dist/cli.js 驱动四种真实守护进程配置，在该策略涉及的每个表面上施压——REST、ACP JSON-RPC（经真实 SSE 连接）、capability 注册表、以及 bridge 所有权校验。下面每一行都是对运行中守护进程的真实请求，而非单测。

配置： 4601 无 token loopback · 4602 仅 --token（无 flag）· 4603 --token --enable-session-shell · 4604 有 --enable-session-shell 但无 token。临时工作区，独立 tmux socket。

广告层

表面	4601 无 token	4602 有 token 无 flag	4603 token + flag
`/capabilities.features` 含 `session_shell_command`	❌ 无	❌ 无	✅ 有
ACP `initialize` methods 含 `_qwen/session/shell`	❌（40 个）	❌（40 个）	✅（41 个）

两者都严格门控在 配置了 token 且设置了 flag——正是 enableSessionShell === true && tokenConfigured。

强制层（真正要紧的部分）

✅ 无 token 守护进程 → 401 token_required。 POST /4601/session/<id>/shell → 401 {code:"token_required"}。🔍 即便带上伪造的 Authorization: Bearer anything 仍是 401——strict mutation gate 看的是守护进程是否配置了 token，而非请求是否携带。（该路由从 mutate() 改为 mutate({strict:true})，所以无 token loopback 的 shell 现在被拒——这是有意的加固，也是该场景下的真实行为变化。）
✅ 有 token 无 flag → 403 session_shell_disabled，且未到 bridge。 用有效 bearer POST /4602/...shell → 403 {code:"session_shell_disabled"}。我发了一个不存在的 session id，得到的是 session_shell_disabled 而非 session_not_found——证明 disabled 检查在 session 查找 / bridge 派发之前就触发。
✅ token + flag，无 X-Qwen-Client-Id → 403 client_id_required。
✅ token + flag，session 绑定的 client id → 执行。 创建一个 session（守护进程分配 clientId），再用该 id POST /4603/.../shell → 200 {"exitCode":0,"output":"GATE-EXEC-OK-7777\n"}。真实命令、真实 bridge。
✅ token + flag，未绑定 client id → 400 invalid_client_id。 X-Qwen-Client-Id: cid.unbound999 → Client id "cid.unbound999" is not registered for session <id>——bridge 的所有权证明（InvalidClientIdError）拒绝了一个已向守护进程认证但从未拥有该 session 的调用方。这是本 PR "session ownership proof" 的核心。
✅ ACP 执行路径可用。 在 4603 上经真实 ACP 连接（init → SSE → session/new → _qwen/session/shell）→ {"exitCode":0,"output":"ACP-EXEC-OK-5555\n"}。
✅ 🔍 对抗性：在禁用守护进程上调用未广告的方法仍被拒。 一个无视广告列表、对 4602（该方法被隐藏）调用 _qwen/session/shell 的客户端，得到 {code:-32602, errorKind:"session_shell_disabled"}。**隐藏广告是纵深防御；真正的门在 handler 里。**这是最重要的结果——安全性不依赖客户端尊重 capability 列表。
✅ 启动警告。 无 token 启动 --enable-session-shell（4604）精确打印：qwen serve: --enable-session-shell ignored because no bearer token is configured. Set QWEN_SERVER_TOKEN or pass --token…，且能力保持关闭。

探针（守住）

🔍 绑定 client 下空命令（" "）→ 400 \command` is required and must be a non-empty string`。
🔍 非法 client id（bad id!!#）→ 400（在任何派发前未过 ^[A-Za-z0-9._:-]+$ 校验）。

发现

⚠️ session 绑定的 client id 由守护进程分配，而非调用方自选。 我在创建 session 时发了 X-Qwen-Client-Id: cid.alpha，响应却绑定到一个生成的 client_b24c…。所以裸 REST 调用方必须从 create/load 响应里读出 clientId 并在 shell 调用时回传——无法预测或钉死它。SDK 的 DaemonSessionClient.shellCommand() 会自动转发（如 PR 所述），故 SDK 用户无碍；这只是手工接 REST 路由者的一处尖角，值得在 SDK 文档的 REST 示例里加一行。
"未到 bridge"（场景 B）的声明确实成立，我用行为验证（对不存在 session 返回 disabled 而非 not-found）而非依赖日志——注意 bridge 的 executeShellCommand 调试行是 debug 级、本就不会出现在 stdout，所以别拿"日志缺失"当证据。
环境/范围：分支落后 main 约 38 个 commit，合并前建议 rebase。ACP 响应走连接 SSE 流，重复裸 POST /acp init 会累积连接——这是 harness 细节（我在 ACP 抓取间重启了守护进程），非产品问题。

结论（合并参考）

**通过。**PR 声称强制的每一层在运行时都成立：capability 与 ACP 方法仅在 token+flag 下广告；REST 按正确顺序、在 bridge 之前返回 token_required / session_shell_disabled / client_id_required；绑定 client id 可执行、未绑定被 bridge 所有权校验拒绝；ACP 路径同样强制；而且——我最想破坏的部分——绕过广告列表的客户端仍命中 handler 级的 session_shell_disabled。启动警告也触发。没有阻碍合并的问题；守护进程分配 clientId 那点是文档优化，非缺陷。

* feat(cli): gate direct session shell execution * fix(cli): address session shell review feedback * codex: address PR review feedback (#5031)

feat(cli): gate direct session shell execution

54e366b

doudouOUC marked this pull request as ready for review June 12, 2026 08:17

doudouOUC requested review from Copilot and wenshao June 12, 2026 08:17

Copilot AI reviewed Jun 12, 2026

doudouOUC changed the title ~~[codex] Gate direct session shell behind explicit opt-in~~ feat(daemon): gate direct session shell behind explicit opt-in Jun 12, 2026

wenshao requested changes Jun 12, 2026

View reviewed changes

Comment thread packages/cli/src/serve/server.test.ts Outdated

Comment thread packages/cli/src/serve/acpHttp/dispatch.ts Outdated

Comment thread packages/cli/src/serve/acpHttp/dispatch.ts Outdated

fix(cli): address session shell review feedback

d97d93d

doudouOUC requested a review from wenshao June 12, 2026 09:32

wenshao reviewed Jun 12, 2026

View reviewed changes

doudouOUC requested a review from wenshao June 12, 2026 11:06

wenshao previously approved these changes Jun 12, 2026

View reviewed changes

wenshao reviewed Jun 12, 2026

View reviewed changes

Comment thread packages/cli/src/serve/server.ts

Comment thread packages/cli/src/serve/runQwenServe.ts

Comment thread packages/cli/src/serve/acpHttp/dispatch.ts

Comment thread packages/cli/src/serve/acpHttp/dispatch.ts

Comment thread packages/cli/src/serve/acpHttp/dispatch.ts Outdated

codex: address PR review feedback (QwenLM#5031)

0daa475

doudouOUC dismissed wenshao’s stale review via 0daa475 June 12, 2026 11:54

doudouOUC requested a review from wenshao June 12, 2026 11:55

qqqys approved these changes Jun 12, 2026

View reviewed changes

wenshao approved these changes Jun 12, 2026

View reviewed changes

wenshao merged commit a064779 into QwenLM:main Jun 12, 2026
25 checks passed

doudouOUC deleted the codex/session-shell-permission-policy branch June 12, 2026 15:08

doudouOUC added a commit that referenced this pull request Jun 15, 2026

feat(daemon): gate direct session shell behind explicit opt-in (#5031)

55eab8f

* feat(cli): gate direct session shell execution * fix(cli): address session shell review feedback * codex: address PR review feedback (#5031)

github-actions Bot mentioned this pull request Jun 15, 2026

Upstream changes detected: Gemini CLI, Codex CLI (App Server Protocol), GitHub Copilot CLI, Qwen Code greenpolo/cc-multi-cli-plugin#8

Open

Conversation

doudouOUC commented Jun 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What this PR does

Why it's needed

Reviewer Test Plan

How to verify

Evidence (Before & After)

Tested on

Environment (optional)

Risk & Scope

Linked Issues

What this PR does

Why it's needed

Reviewer Test Plan

How to verify

Evidence (Before & After)

Tested on

Environment (optional)

Risk & Scope

Linked Issues

Uh oh!

doudouOUC commented Jun 12, 2026

E2E / Verification Report

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

wenshao left a comment

Choose a reason for hiding this comment

Uh oh!

wenshao commented Jun 12, 2026

Local runtime verification on Linux (real daemon over HTTP) — PASS ✅

Policy matrix (all observed at runtime)

State A — default tokenless loopback

State B — authenticated, opt-in NOT set

State C — authenticated + --enable-session-shell (the ownership chain)

Probes beyond the plan

Findings

Linux 本地真实运行验证（真实守护进程 + HTTP）— 通过 ✅

策略矩阵（均为运行时实测）

状态 A — 默认无 token loopback

状态 B — 已认证、未开启 opt-in

状态 C — 已认证 + --enable-session-shell（ownership 链）

计划之外的探测

观察

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

qqqys left a comment

Choose a reason for hiding this comment

Uh oh!

wenshao commented Jun 12, 2026

🧪 Local runtime verification (built qwen serve daemons in tmux + curl, real REST + ACP) — ✅ PASS (full three-layer gate confirmed, incl. an adversarial bypass attempt)

Advertisement layer

Enforcement layer (the part that actually matters)

Probes (held)

Findings

Verdict (merge reference)

🧪 本地运行时验证（tmux 中构建运行 qwen serve 守护进程 + curl，真实 REST + ACP）— ✅ 通过（完整三层门控确认，含一次对抗性绕过尝试）

广告层

强制层（真正要紧的部分）

探针（守住）

发现

结论（合并参考）

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

doudouOUC commented Jun 12, 2026 •

edited

Loading

State C — authenticated + `--enable-session-shell` (the ownership chain)

状态 C — 已认证 + `--enable-session-shell`（ownership 链）

🧪 Local runtime verification (built `qwen serve` daemons in tmux + curl, real REST + ACP) — ✅ PASS (full three-layer gate confirmed, incl. an adversarial bypass attempt)

🧪 本地运行时验证（tmux 中构建运行 `qwen serve` 守护进程 + curl，真实 REST + ACP）— ✅ 通过（完整三层门控确认，含一次对抗性绕过尝试）