Skill

backends

Manage the qwen-stack supervisor's backend list — list configured backends with live health, add a new backend, remove one, or test connectivity. Operates on `~/.qwen-coprocessor-stack/config.json` and hot-reloads in the running supervisor without restart. Use when the user types `/qwen-stack:backends ...`.

Popularity

Stars

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/qwen-stack:backends list | add <id> <url> [model] [tier] [capacity] [weight] [ctx_size] | remove <id> | test [id]

User invocable

Model invocable

Inline context

Default effort

Argument hintlist | add <id> <url> [model] [tier] [capacity] [weight] [ctx_size] | remove <id> | test [id]

Tool Access

This skill is limited to the following tools:

BashReadWritemcp__plugin_qwen-stack_supervisor__qwen_backends

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Lifecycle and discovery for the supervisor's backend list. Edits to `~/.qwen-coprocessor-stack/config.json` hot-apply on the next `qwen_spawn` or `qwen_backends` call — no supervisor restart required. Existing sessions stay pinned to their backend (RDR-001 §Q3).

SKILL.md

92 lines · ~1.6k tokens

Stats

LanguageTypeScript

Stars1

MaintenanceExcellent

Last CommitJun 11, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

/qwen-stack:backends

Lifecycle and discovery for the supervisor's backend list. Edits to ~/.qwen-coprocessor-stack/config.json hot-apply on the next qwen_spawn or qwen_backends call — no supervisor restart required. Existing sessions stay pinned to their backend (RDR-001 §Q3).

Resolution priority (read by the supervisor)

QWEN_BACKENDS env var — if set, overrides the config file. If the user has this set in their shell, prefer editing the shell rc instead of the file (and tell them so).
~/.qwen-coprocessor-stack/config.json { "backends": [...] } — the file you read/write here.
Built-in single-local default (local-27b at localhost:8080/v1) when neither is set.

Backend object shape

{
  "id":       "qwentescence",                  // unique handle, kebab-case
  "url":      "http://qwentescence:1234/v1",   // OpenAI-compatible base
  "model":    "qwen3.6-35b-a3b",               // identifier returned by /v1/models
  "tier":     "remote",                        // "local" | "remote"
  "capacity": "heavy",                         // "fast" | "heavy"
  "weight":   1,                               // optional, default 1
  "ctx_size": 131072                           // optional; matches llama-server --ctx-size
}

ctx_size is operator-declared (the supervisor doesn't probe). When set and no per-spawn / env / config tier resolves max_context_tokens, the supervisor uses floor(0.85 * ctx_size) as the default cap for spawns that route to this backend (RDR-002 v0.7 amendment). Without it, spawns fall through to the hardcoded 111000 default — fine for a 131072-ctx backend, silently no-guardrail for an 8K-ctx local. Set it.

Subcommand routing

Parse the first positional arg as the subcommand. If absent or list, run list. Otherwise dispatch on add, remove, or test.

list (default when no args)

Call the MCP tool qwen_backends (no args). The supervisor returns each backend with a live healthy field (true/false/null).
Render a compact table: id, url, model, tier, capacity, healthy. Use ✓ / ✗ / ? glyphs for healthy true/false/null.
Note the count and where the supervisor read the list from. Detect this by checking:
- If QWEN_BACKENDS env var is set in the user's shell (echo $QWEN_BACKENDS), say "(from QWEN_BACKENDS env)".
- Else if ~/.qwen-coprocessor-stack/config.json exists, say "(from ~/.qwen-coprocessor-stack/config.json)".
- Else "(built-in default — file not present, env not set)".

add [model] [tier] [capacity] [weight]

Args:

<id> (required) — kebab-case unique handle.
<url> (required) — OpenAI-compatible base ending in /v1 (warn if missing).
[model] — defaults to the value /v1/models returns from the URL. If absent, do a quick curl -sf -m 5 <url>/models to discover, falling back to qwen3.6-35b-a3b with a note.
[tier] — local or remote. Default: remote if URL host is not localhost/127.0.0.1, else local.
[capacity] — fast or heavy. Default: heavy for remote, fast for local.
[weight] — integer ≥ 1. Default: 1.
[ctx_size] — positive integer matching the backend's --ctx-size. Optional; when supplied, the supervisor derives the default max_context_tokens cap as floor(0.85 * ctx_size) for spawns routed to this backend. Skip only if you intend to fall through to the hardcoded 111000 default (fine for a 131072-ctx backend, dangerous for an 8K local).

Steps:

Refuse if QWEN_BACKENDS env is set — it would silently override the file edit. Tell the user to either unset QWEN_BACKENDS in their shell or edit the env directly.
Probe <url>/health and <url>/v1/models (whichever responds) with curl -sf -m 5. If both fail, ask the user whether to add anyway. If they confirm, proceed.
Read ~/.qwen-coprocessor-stack/config.json (if it doesn't exist, treat as { "backends": [] }). Validate JSON shape.
Reject if <id> already exists in the list (case-insensitive). Tell the user to remove the old one first.
Append the new backend object with the resolved defaults filled in.
mkdir -p ~/.qwen-coprocessor-stack then write the JSON back with 2-space indent.
Confirm the write happened. Then call qwen_backends MCP tool to verify the supervisor's hot-reload picked it up — the new entry should appear with a healthy value.

remove

Refuse if QWEN_BACKENDS env is set — same reasoning as add.
Read the config file. Find the entry by id (case-insensitive). If not found, list the current ids and stop.
Write the filtered config back.
Verify via qwen_backends that the entry is gone.

test [id]

Call qwen_backends MCP tool.
If [id] provided, filter to that one. If not found, list available ids.
For each shown backend, also do a direct curl -sf -m 5 <url>/health (or <url>/v1/models if /health 404s) to confirm the supervisor's cached health matches reality. Report any divergence.

Error handling

File-write failure (permissions, disk full): show the underlying error, do not partially update.
Invalid JSON in existing config file: do not overwrite. Show the parse error and the offending content (first 200 chars). Suggest the user inspect manually.
The supervisor's hot-reload re-reads on the next qwen_spawn or qwen_backends call. If a confirmation qwen_backends call shows stale data, mention that running sessions are still using the old list (expected — RDR-001 §Q3) but new spawns will see the updated list.

Output style

Concise. Tables for list/test; one-line confirmations for add/remove.
Surface the file path edited so the user knows where state lives.
No emojis unless the user already uses them.
No "I will now…" preamble.

backends

Popularity

Invocation

Tool Access

Context Preview

SKILL.md

backends

Popularity

Invocation

Tool Access

Context Preview

SKILL.md

/qwen-stack:backends

Resolution priority (read by the supervisor)

Backend object shape

Subcommand routing

list (default when no args)

add [model] [tier] [capacity] [weight]

remove

test [id]

Error handling

Output style

Similar Skills

/qwen-stack:backends

Resolution priority (read by the supervisor)

Backend object shape

Subcommand routing

list (default when no args)

add [model] [tier] [capacity] [weight]

remove

test [id]

Error handling

Output style

Similar Skills