Turn an audited RL task into a user-confirmed experiment plan with metrics, tuning scope, GPU budget, launcher commands, report skeleton, and first baseline/ablation proposal. Use before launching RL training jobs.
How this skill is triggered — by the user, by Claude, or both
Slash command
/rl-experiment-assistant:rl-experiment-planThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Create a consensus-gated experiment plan for RL training. This skill runs after `rl-task-audit` or after the user provides equivalent task/context details.
Create a consensus-gated experiment plan for RL training. This skill runs after rl-task-audit or after the user provides equivalent task/context details.
Before any GPU-consuming run, obtain or record user confirmation for:
.rlxp/contract.yaml records the launch gate and explicitly lists missing confirmations.If confirmation is absent, create/update .rlxp/report.md and list exact missing confirmations. Do not launch training.
Run initialization and report setup yourself when local shell access and edit permission are available. The user-facing interface is the plugin prompt, not manual execution of bundled Python scripts.
draft_blocked: generated default; training_allowed: false; required confirmations missing.ready_for_user_confirmation: audit/plan filled enough fields for user review; launch still blocked.approved_for_launch: all required confirmations and approval record are explicit. training_allowed is derived from those records and applies only to queued experiments inside approved scope/budget.Create .rlxp/ if missing by running the bundled helper internally:
python <plugin-root>/scripts/rlxp_init.py --root . --project-name <name>
For Holosoma scene traversal, when the user supplies equivalent setup details, prefer split command/fragments to avoid duplicate config fragments:
python <plugin-root>/scripts/rlxp_init.py \
--root . \
--project-name holosoma-scene-traversal \
--profile holosoma-scene-traversal \
--mamba-env hssim \
--train-command "python -m holosoma.train_agent" \
--exp-fragment "exp:g1-29dof-scene-traversal-hurdle" \
--logger-fragment "logger:wandb-packman-scene-traversal"
Record raw and runnable commands separately:
python -m holosoma.train_agent exp:g1-29dof-scene-traversal-hurdle logger:wandb-packman-scene-traversal
mamba run -n hssim python -m holosoma.train_agent exp:g1-29dof-scene-traversal-hurdle logger:wandb-packman-scene-traversal
Then update:
.rlxp/adapter.yaml using canonical templates/adapter-template.yaml fields..rlxp/contract.yaml using canonical templates/contract-template.yaml status and confirmation fields..rlxp/report.md..rlxp/experiments.yaml.If the current working directory is not the target RL repository, use the absolute target path for --root and write updates there. Do not write generated .rlxp/ state into the plugin package or installed plugin cache.
The first proposal should usually be one of:
Only propose reward changes after baseline evidence exists, unless the codebase already has a known failing baseline and the user explicitly approves direct intervention.
Return:
# Experiment Plan
## Confirmed objective
## Metrics and score formula
## Guardrails
## Approved tuning scope
## Budget and hardware
## Baseline command
## Evaluation command
## Contract gate
## First experiment proposal
## Report files created/updated
## Items still requiring confirmation
Guides creation, editing, and verification of skills for AI coding agents using test-driven development with subagent scenarios. Use when authoring or debugging skills.
npx claudepluginhub junhyekh/rlxp --plugin rl-experiment-assistant