Skill

intern-training

Reference for intern training data formats, benchmark structure, and evaluation protocol. Use when running aptitude tests, collecting training data, or evaluating intern performance.

Popularity

Shared by

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/nerd:intern-training

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Located at `${CLAUDE_PLUGIN_ROOT}/skills/intern-training/benchmark-seed/`.

Supporting Files

SKILL.md

66 lines · ~585 tokens

Stats

Parent stars0

MaintenanceExcellent

Last CommitMar 16, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

Intern Training Reference

Benchmark Seed Data

Located at ${CLAUDE_PLUGIN_ROOT}/skills/intern-training/benchmark-seed/.

Structure

benchmark-seed/
├── manifest.json                    # Version, counts, language coverage
├── parameter-detection/             # 5 examples
│   ├── pd-001-rust-search.json
│   ├── pd-002-python-retry.json
│   ├── pd-003-ts-cache.json         # Includes false positives (PI, HTTP_OK)
│   ├── pd-004-go-ratelimit.json
│   └── pd-005-python-ml.json        # Includes RANDOM_SEED (not tunable)
├── result-classification/           # 5 examples
│   ├── rc-001-clear-improvement.json
│   ├── rc-002-clear-regression.json
│   ├── rc-003-neutral.json
│   ├── rc-004-mixed-signals.json    # Hard: throughput up but latency/memory up
│   └── rc-005-subtle-regression.json # Medium: overfitting pattern
└── context-extraction/              # 4 examples
    ├── ce-001-auth-middleware.json
    ├── ce-002-cache-eviction.json
    ├── ce-003-rate-limiter.json
    └── ce-004-retry-backoff.json

Example Format

Each benchmark example is a JSON file with:

id: Unique identifier (e.g., "pd-001-rust-search")
language: Programming language (for parameter-detection and context-extraction)
difficulty: easy, medium, or hard
context_tokens: Approximate token count of input
input: The input to send to the intern
expected_output: The ground truth output to score against
notes: Optional notes about tricky aspects (false positives to avoid, etc.)

Training Data Format (JSONL)

Stored at .nerd/intern/training-data/{task_type}.jsonl.

Each line is a JSON object:

{
  "task_type": "result-classification",
  "input": {"experiment_id": "E001", "results": {}},
  "output": {"classification": "improved", "evidence": "..."},
  "reasoning": "Claude's chain-of-thought explanation",
  "source_agent": "report-compiler",
  "created_at": "2026-03-15T10:30:00Z",
  "run_id": "run-2026-03-15-001",
  "dedup_key": "E001:result-classification"
}

The reasoning field captures Claude's chain-of-thought for knowledge distillation in v2.

intern-training

Popularity

Invocation

Context Preview

Supporting Files

SKILL.md

intern-training

Popularity

Invocation

Context Preview

Supporting Files

SKILL.md

Intern Training Reference

Benchmark Seed Data

Structure

Example Format

Training Data Format (JSONL)

Similar Skills

Intern Training Reference

Benchmark Seed Data

Structure

Example Format

Training Data Format (JSONL)

Similar Skills