Designs, codes, simulates, and deploys AI agents targeting DataRobot. Covers agent_spec.md authoring, dress-rehearsal simulation, template-based coding, and DataRobot CLI deployment.
How this skill is triggered — by the user, by Claude, or both
Slash command
/datarobot-agent-skills:datarobot-agent-assistThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
This skill merges **agent design, coding, and deployment** with **interactive dress-rehearsal simulation** in one place.
This skill merges agent design, coding, and deployment with interactive dress-rehearsal simulation in one place.
Assistance falls into three categories:
agent_spec.md, optionally simulate the agent before codingAGENTS.md deployment instructionsIf the user's first message is simply 1, 2, or 3, treat it as selecting one of these categories.
Present the three options clearly:
Welcome! I help you design, code, and deploy AI agents (with optional dress-rehearsal simulation before coding).
What would you like to do?
1. Design an AI agent → Describe your idea
2. Code an AI agent → Load and implement an existing agent_spec.md
3. Deploy an AI agent → Deploy an implemented agent to DataRobot
Show this menu first. After the user selects an option (1, 2, or 3), run the Pre-requisite Check and then the Script Path Resolution before doing anything else for that option.
Before invoking any helper script, resolve <skill_scripts_dir> once for the session:
<skill_scripts_dir> is the scripts/ subdirectory of the directory containing this SKILL.md file.ls <path_to_this_skill_dir>/scripts/. If the directory is missing, tell the user the skill installation is incomplete and stop.<skill_scripts_dir>/... reference in this skill.Run in order before proceeding:
git --version. If missing, tell the user to install from https://git-scm.com and stop.python --version. If missing or below 3.11, tell the user to install Python 3.11+ from https://python.org and stop.Ask at most 2 rounds of clarifying questions before proposing an initial draft spec. If tools are still ambiguous after two rounds, start simple.
Focus questions on:
If the user mentions UI-related needs early ("dashboard", "visualization", "multi-page", "admin panel", "settings page"), capture it immediately in the frontend field — do not defer.
To check available models: Run the helper script:
python <skill_scripts_dir>/list_llm_models.py \
--json
CRITICAL: In case the script fails due to any reason, do not proceed. Instead, return the error message to the user and ask how they want to proceed.
Recommend a gpt-5, claude-4-5, or gemini-2.5 model from the list unless the user specifies cost or other constraints.
If none of those preferred families appear in the catalog, pick the highest-capability available model by name — prefer ones containing large, pro, opus, or sonnet over mini, haiku, or flash.
Only display the full model catalog when the user explicitly asks to browse models.
If the user's desired model is unavailable, suggest starting with an available one and updating after implementation.
agent_spec.md (YAML format) whenever showing it to the user.Before offering to simulate or code, if the spec does not already have a frontend field set, always ask:
"The template includes a default chat UI — is that sufficient, or would you like a custom frontend such as a dashboard, data visualization, or multi-page app?"
Then update the spec accordingly:
frontend.type: "chat"frontend.type: "multi-page" or "custom" with pages and optional requirementsBefore transitioning to coding, ask the user (exact wording):
"Would you like to run a dress rehearsal simulation first? (recommended)"
Wait for their reply. If they say yes, follow Dress Rehearsal end to end: initialize with rehearsal.py --init, drive turns with --session, handle NOTE: / DONE, and produce the feedback report. Do not substitute improvised role-play or manual mock tool traces. If they decline or skip, proceed directly to coding — do not simulate without explicit confirmation.
Script path: python <skill_scripts_dir>/rehearsal.py ...
Simulate an agent_spec.md interactively before writing any code. Responses go through the DataRobot LLM Gateway; the rehearsal script handles API calls, state, and output. You orchestrate the loop, handle out-of-character commands, and produce the feedback report at the end.
Engine location: <skill_scripts_dir>/rehearsal.py (relative to repository root).
python <skill_scripts_dir>/rehearsal.py --init [--spec agent_spec.md]
If agent_spec.md does not exist and no path was provided, say so and stop.
The script creates a unique session directory in the system temp dir and prints two lines:
session=<session_dir>
output=<output_file>
Retain session_dir for all subsequent calls. Read the output_file and display its contents verbatim, then say:
You are now the end user of this agent. Type messages as a real user would.
Out-of-character commands:
NOTE: <text>— record a design observationDONE— end the session and generate your feedback report
Keep track of any notes and the number of turns as the session progresses — you'll need these for the report.
On each user message:
NOTE: — acknowledge the note, prompt for next message. Do not call the script.DONE — proceed to Step 3.python <skill_scripts_dir>/rehearsal.py --session {session_dir} "{user_message}"
The script prints output=<output_file>. Read that file and display its contents verbatim. It will contain [TOOL CALL], [SIMULATED RETURN], and [Agent]: blocks as appropriate.
If the script exits non-zero, display the error and ask whether to continue or abort.
Before writing the report, review the session and consider each of these areas — only surface the ones where you have something concrete to say:
Then write the report in this format:
════════════════════════════════════════════
DRESS REHEARSAL REPORT
════════════════════════════════════════════
{1–2 sentences: what was tested and how the agent performed overall}
{If notes were recorded: "Notes: " followed by each note on its own line, prefixed with —}
Suggested changes:
1. {specific, actionable change}
2. {specific, actionable change}
…
{If nothing worth changing: "No changes recommended."}
════════════════════════════════════════════
Then offer to implement any changes to agent_spec.md.
On Windows: coding is not supported. STOP and do NOT proceed with the next steps!
Verify agent_spec.md contains at minimum:
model — a valid LLM Gateway model IDsystem_prompt — non-emptytools — at least one tool defined (or explicit confirmation from the user that no tools are needed)frontend.type — setIf agent_spec.md does not exist, inform the user and offer to run the Design phase (option 1) first. If any required field above is missing, surface the gap and update the spec before continuing. Do not start coding against an incomplete spec.
Read agent_spec.md — it must exist (see gate above).
Check if AGENTS.md exists in the template directory (default: current working directory).
If AGENTS.md does not exist, prepare the template with these steps in order. ALWAYS follow the steps in order and do not skip any, even if they seem redundant. This is critical for ensuring the template is properly set up and avoiding wasted effort coding on a broken foundation.
a. Check the working directory — if it contains files other than agent_spec.md, warn the user and ask them to clear it before proceeding.
b. Move agent_spec.md aside if present — if the file exists in the working directory, move it to a temp location (e.g. /tmp/agent_spec.md.bak) before cloning so it isn't overwritten. Restore it after cloning completes.
c. Clone the template: Run the helper script:
python <skill_scripts_dir>/clone_template.py
d. Select the agentic framework:
STOP. Do NOT proceed until the user has replied with their framework choice.
Ask the user (exact message):
Which agentic framework would you like to use?
- LangGraph
- CrewAI
- LlamaIndex
- NeMo Agent Toolkit (NAT)
- Base
Wait for the user's reply. Do not assume or default to any framework. If their next message is not a framework choice (silence, unrelated text), re-display the options and wait again — do not proceed with any other coding step. Once the user replies, map their choice to the corresponding value (langgraph, crewai, llamaindex, nat, base) and run:
python <skill_scripts_dir>/select_framework.py \
--target-dir . \
--framework <value>
e. Validate the template: Run dr dependency check. Treat any non-zero exit as a hard error — do not attempt to resolve it automatically. Return the full output to the user and stop.
f. Setup the template: Run the helper script. Use the model field from agent_spec.md as --llm-model; if absent, use the model selected during the design phase.
python <skill_scripts_dir>/setup_template.py \
--llm-model <model-name> \
--target-dir .
CRITICAL: In case any of the above scripts fail due to any reason, do not proceed with coding. Instead, return the error message to the user and ask how they want to proceed.
g. Re-read AGENTS.md now that the template is ready.
Recreate the TODO list based on agent_spec.md — break down the implementation into discrete steps and add them to the TodoWrite tool.
.env files (.env.template files are OK)AGENTS.md to find the local test command.AGENTS.md for deployment instructionsThe following are the examples of helper scripts used in the skill. They are located in the scripts directory and are designed to assist with various tasks.
Lists available LLM models from DataRobot LLM Gateway.
Fetches and displays active models from the DataRobot LLM Gateway catalog:
python <scripts_dir>/list_llm_models.py \
--json
Requires env vars: DATAROBOT_API_TOKEN, DATAROBOT_ENDPOINT
Clones the DataRobot agent application template repository.
Clones the template to the current directory (repository URL and branch are hardcoded):
python <scripts_dir>/clone_template.py
Clone to a specific directory:
python <scripts_dir>/clone_template.py \
--target-dir ./my-project
Sets up a template repository for initializing a new agent project.
python <scripts_dir>/setup_template.py \
--llm-model <model-name> \
--target-dir .
Saves the chosen agentic framework to .datarobot/answers/agent-agent.yml
(field agent_template_framework). Preserves all other fields in the file.
python <scripts_dir>/select_framework.py \
--framework langgraph \
--target-dir .
Valid --framework values: langgraph, crewai, llamaindex, nat, base
Write specs in YAML to agent_spec.md in the working directory. Fields are optional when the spec is still evolving.
model: "anthropic/claude-sonnet-4-5-20250929" # DataRobot LLM Gateway model ID
system_prompt: "Your agent's instructions..."
tools:
- function_name: tool_name
inputs:
- arg_name: input_arg
type: str # one of: str, int, float, bool, list, dict
object_schema: "(optional: schema of dict/list contents)"
out:
- arg_name: output_arg
type: str
auth_spec:
service_name: "External API Service"
auth_method: api_key # api_key | oauth2 | basic_auth | bearer_token | service_account | other
examples:
- "Example user query 1"
- "Example user query 2"
frontend:
type: "chat" # chat | multi-page | custom
pages:
- "Analytics - shows search history and top topics"
requirements: "(optional additional UI requirements)"
When tools require external service auth, note that credentials must be configured as runtime parameters in the infrastructure code (see AGENTS.md for the pattern).
See references/agent-spec-examples.md for complete working examples.
Claude's built-in tools replace the plugin's custom Python tools:
| Plugin Tool | Claude Tool |
|---|---|
read_file | Read |
write_file | Write |
edit_file | Edit |
shell | Bash |
list_dir | Glob or Bash (ls) |
grep_files | Grep |
glob | Glob |
web_search | WebSearch |
get_web_page | WebFetch |
write_todos / read_todos | TodoWrite |
show_agent_spec | Write to agent_spec.md + display as YAML |
prepare_to_code | Bash (git clone + dr start) |
list_available_models | WebFetch (DataRobot API) |
code_research | Agent (Explore subagent) |
| Agent simulation (dress rehearsal) | Dress Rehearsal + <skill_scripts_dir>/rehearsal.py in this skill directory |
The DataRobot CLI (dr) is required for managing DataRobot custom applications.
Check if the CLI is installed:
dr --version
Expected output: DataRobot CLI version: v0.2.66 (or similar)
If not installed, run:
macOS/Linux:
curl https://cli.datarobot.com/install | sh
Windows:
irm https://cli.datarobot.com/winstall | iex
If the CLI version is too old, run to upgrade:
dr self update --force
Verify the CLI is authenticated:
dr auth check
If not authenticated, run:
dr auth login
This will guide the user through the authentication process interactively.
npx claudepluginhub datarobot-oss/datarobot-agent-skills --plugin datarobot-agent-skillsProvides the full ADK agent development lifecycle: scaffold, build, evaluate, deploy, publish, and observe. Entrypoint for building agents with google-agents-cli.
Creates Claude Code agents from scratch or by adapting templates. Guides requirements gathering, template selection, and file generation following Anthropic best practices (v2.1.63+).
Generates comprehensive, PhD-level expert agent prompts for Claude Code with code examples, patterns, and best practices. Supports single, batch, and architecture-driven agent creation.