Slash Command

/add-scenario

From deepagents-builder

Add a single eval scenario to an existing dataset interactively or from a production trace.

Invocation

How this command is triggered — by the user, by Claude, or both

Slash command

/deepagents-builder:add-scenario [--from-trace <path-or-id>]

Model invocable

No pre-commands

Tool Access

This command is limited to the following tools:

Read, Write, Glob, Grep, AskUserQuestion

Context Preview

The summary Claude sees in its command listing — used to decide when to auto-load this command

# Add Eval Scenario

Add a single scenario to an existing eval dataset.

## Workflow

### Step 1: Parse Arguments

Check `$ARGUMENTS` for `--from-trace`:

- **`--from-trace <local-path>`**: Load a local trace file (JSON)
- **`--from-trace langsmith:<run_id>`**: Fetch trace from LangSmith
- **No arguments**: Interactive mode — ask user to describe the scenario

### Step 2: Find Existing Dataset

Locate the dataset to add to:

1. Search `evals/datasets/*.yaml` and `evals/datasets/*.json`
2. If multiple datasets exist, ask which one to add to
3. If no dataset exists, suggest running `/design-e...

Command Content

72 lines · ~545 tokens

Stats

Language: PowerShell

Parent stars: 1

Maintenance: Excellent

Last Commit: Mar 26, 2026

# Add Eval Scenario

Add a single scenario to an existing eval dataset.

## Workflow

### Step 1: Parse Arguments

Check `$ARGUMENTS` for `--from-trace`:

- **`--from-trace <local-path>`**: Load a local trace file (JSON)
- **`--from-trace langsmith:<run_id>`**: Fetch trace from LangSmith
- **No arguments**: Interactive mode (ask the user to describe the scenario)
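The three cases above can be told apart with a small amount of string handling. The following is a minimal Python sketch, not the plugin's actual implementation: the `TraceSource` type and the `parse_arguments` helper are hypothetical names introduced for illustration, and the sketch assumes the arguments arrive as a single shell-style string.

```python
import shlex
from dataclasses import dataclass
from typing import Optional


@dataclass
class TraceSource:
    """Hypothetical result type: where the scenario should come from."""
    kind: str                    # "interactive", "local", or "langsmith"
    value: Optional[str] = None  # file path or LangSmith run id


def parse_arguments(arguments: str) -> TraceSource:
    """Classify the raw argument string into one of the three modes."""
    tokens = shlex.split(arguments)
    if "--from-trace" not in tokens:
        # No flag: fall back to interactive mode.
        return TraceSource("interactive")

    index = tokens.index("--from-trace")
    if index + 1 >= len(tokens):
        raise ValueError("--from-trace requires a path or langsmith:<run_id>")

    target = tokens[index + 1]
    prefix = "langsmith:"
    if target.startswith(prefix):
        # Everything after the prefix is treated as the run id.
        return TraceSource("langsmith", target[len(prefix):])
    return TraceSource("local", target)
```

For example, `parse_arguments("--from-trace langsmith:abc123")` yields a `TraceSource` with `kind="langsmith"` and `value="abc123"`, while an empty argument string selects interactive mode.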

### Step 2: Find Existing Dataset

Locate the dataset to add to:

1. Search `evals/datasets/*.yaml` and `evals/datasets/*.json`
2. If multiple datasets exist, ask which one to add to
3. If no dataset exists, suggest running `/design-evals` first
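The lookup and the three-way decision could be sketched roughly as follows. This is an illustrative Python sketch only: `find_datasets` and `next_action` are hypothetical helpers, and the returned action labels are made up for the example rather than taken from the plugin.

```python
from pathlib import Path
from typing import List


def find_datasets(root: Path = Path(".")) -> List[Path]:
    """Collect YAML and JSON datasets under <root>/evals/datasets."""
    dataset_dir = root / "evals" / "datasets"
    if not dataset_dir.is_dir():
        return []
    matches: List[Path] = []
    for pattern in ("*.yaml", "*.json"):
        matches.extend(dataset_dir.glob(pattern))
    return sorted(matches)


def next_action(datasets: List[Path]) -> str:
    """Map the number of datasets found to the step the command takes next."""
    if not datasets:
        return "suggest-design-evals"   # no dataset: point to /design-evals
    if len(datasets) == 1:
        return "use-only-dataset"       # unambiguous: use it directly
    return "ask-user-to-choose"         # several: ask which one to add to
```

Sorting the matches keeps the list shown to the user stable between runs.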

### Step 3a: From Trace

If `--from-trace` was provided:

1. **Load trace**: Read the local JSON file or fetch from LangSmith
2. **Display conversation**: Show the user/assistant/tool turns
3. **Ask for expected behavior**: "This is what happened. What should have happened?"
   - Which tools should have been called?
   - What should the agent have responded?
   - Is this a regression (agent used to work) or a new capability needed?
4. **Generate scenario YAML**: Convert the trace into structured scenario format
   - Use actual tool calls as basis for `expected_tools`
   - Generate `mock_responses` from actual tool responses
   - Ask user for `success_criteria`
   - Tag as `regression` if it's a bug fix scenario
5. **Append to dataset**
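To make the conversion step concrete, here is a rough Python sketch of turning a trace into a scenario record. The field names `expected_tools`, `mock_responses`, `success_criteria`, and the `regression` tag come from the steps above; everything else is an assumption made for illustration, including the trace shape (a `messages` list with `role`, `content`, `tool_calls`, and `name` keys), the `name` and `input` scenario keys, and the `trace_to_scenario` helper itself. Real trace files and the real dataset schema may differ.

```python
from typing import Any, Dict, List


def trace_to_scenario(
    trace: Dict[str, Any],
    name: str,
    success_criteria: List[str],
    is_regression: bool,
) -> Dict[str, Any]:
    """Build a scenario dict from a trace in the assumed (hypothetical) shape."""
    messages = trace.get("messages", [])

    # First user turn becomes the scenario input (assumed key name).
    user_inputs = [m["content"] for m in messages if m.get("role") == "user"]

    # Actual tool calls, in order of first use, seed expected_tools.
    expected_tools: List[str] = []
    for message in messages:
        if message.get("role") == "assistant":
            for call in message.get("tool_calls") or []:
                if call["name"] not in expected_tools:
                    expected_tools.append(call["name"])

    # Actual tool responses seed mock_responses (first response per tool).
    mock_responses: Dict[str, Any] = {}
    for message in messages:
        if message.get("role") == "tool":
            mock_responses.setdefault(message["name"], message["content"])

    return {
        "name": name,
        "input": user_inputs[0] if user_inputs else "",
        "expected_tools": expected_tools,
        "mock_responses": mock_responses,
        "success_criteria": success_criteria,  # supplied by the user
        "tags": ["regression"] if is_regression else [],
    }
```

The resulting dict would still need to be reviewed against the user's answers (the tools that *should* have been called may differ from the ones that were) and serialized to YAML before being appended to the dataset.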

### Step 3b: Interactive Mode

If no `--from-trace`:

1. Trigger the eval-designer agent in single-scenario mode
2. The designer asks:
   - "Describe what happened or what should happen"
   - "Which job does this relate to?" (show existing jobs)
   - "Is this a happy path, edge case, or failure scenario?"
3. Generate scenario YAML
4. Append to the appropriate job section in the dataset

### Step 4: Confirm and Suggest

Scenario '{name}' added to evals/datasets/{file}.yaml
Tags: [{tags}]

Next: Run /eval to generate the initial snapshot for this scenario.
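As an illustration of how the placeholders in this message might be filled, the sketch below renders the template in Python. `format_confirmation` is a hypothetical helper; it assumes `{file}` is the dataset's file name without its extension and that tags are joined with a comma and a space.

```python
from typing import Sequence


def format_confirmation(name: str, dataset_file: str, tags: Sequence[str]) -> str:
    """Render the confirmation message for a newly added scenario."""
    lines = [
        f"Scenario '{name}' added to evals/datasets/{dataset_file}.yaml",
        f"Tags: [{', '.join(tags)}]",  # assumed separator: ", "
        "",
        "Next: Run /eval to generate the initial snapshot for this scenario.",
    ]
    return "\n".join(lines)
```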
