From dotforge
Runs standardized tasks in two git worktrees (full config vs minimal) to compare Claude Code output quality. Use after Fases 0-2 to validate configuration ROI.
How this skill is triggered — by the user, by Claude, or both
Slash command
/dotforge:benchmarkThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Compare the effectiveness of a project's full dotforge configuration against a minimal baseline by executing the same standardized task in two isolated worktrees.
Compare the effectiveness of a project's full dotforge configuration against a minimal baseline by executing the same standardized task in two isolated worktrees.
Cost warning: Each benchmark runs Claude Code twice (full + minimal). Use sparingly and only after Fases 0-2 are working.
.claude/settings.json and CLAUDE.md$DOTFORGE_DIR/tests/benchmark-tasks/.claude/.forge-manifest.json or infer from project files$DOTFORGE_DIR/tests/benchmark-tasks/{stack}.ymlgeneric.ymlDisplay:
═══ BENCHMARK SETUP ═══
Project: {{name}}
Stack detected: {{stack}}
Task: {{task title}}
Description: {{task description}}
⚠ This will run Claude Code twice in isolated worktrees.
Proceed? (yes/no)
Create two git worktrees from the current HEAD:
Full config — git worktree add /tmp/bench-full-{{slug}} HEAD
.claude/ directory as-isCLAUDE.md as-isMinimal config — git worktree add /tmp/bench-minimal-{{slug}} HEAD
CLAUDE.md with only project name and "Build & Test" section.claude/settings.json with only allowedTools (no hooks, no deny list).claude/rules/, no hooks, no agentsFor each worktree, run the task prompt using Claude Code in non-interactive mode:
cd /tmp/bench-full-{{slug}}
claude --print "{{task prompt}}" --allowedTools "Bash,Read,Write,Edit,Glob,Grep"
Same for minimal worktree.
Capture for each run:
═══ BENCHMARK RESULTS — {{project}} ═══
Task: {{task title}}
Stack: {{stack}}
Date: {{YYYY-MM-DD}}
Full Config Minimal Config Delta
Files created: {{N}} {{N}} {{+/-N}}
Tests created: {{yes/no}} {{yes/no}} —
Tests passing: {{N/M}} {{N/M}} {{+/-N}}
Lint issues: {{N}} {{N}} {{+/-N}}
Errors: {{N}} {{N}} {{+/-N}}
── ANALYSIS ──
{{if full is better across metrics:
"Full config prevented {{N}} lint issues and {{N}} errors.
ROI: rules + hooks justified for this project."}}
{{if similar:
"Minimal difference detected. Consider simplifying configuration
or running /forge rule-check to identify inert rules."}}
{{if minimal is better:
"⚠ Full config may be adding overhead without benefit.
Review rules for contradictions or excessive constraints."}}
git worktree remove /tmp/bench-full-{{slug}} and minimal~/.claude/metrics/{{slug}}/benchmark-{{date}}.jsonResults JSON schema:
{
"project": "{{slug}}",
"date": "{{YYYY-MM-DD}}",
"stack": "{{stack}}",
"task": "{{task id}}",
"full": {
"files_created": 0,
"tests_passing": 0,
"lint_issues": 0,
"errors": 0,
"has_test": false
},
"minimal": {
"files_created": 0,
"tests_passing": 0,
"lint_issues": 0,
"errors": 0,
"has_test": false
}
}
npx claudepluginhub luiseiman/dotforge --plugin dotforgeProvides UI/UX resources: 50+ styles, color palettes, font pairings, guidelines, charts for web/mobile across React, Next.js, Vue, Svelte, Tailwind, React Native, Flutter. Aids planning, building, reviewing interfaces.
Fetches up-to-date documentation from Context7 for libraries and frameworks like React, Next.js, Prisma. Use for setup questions, API references, and code examples.