By ArtMin96
Evaluator-optimizer pattern: adversarial review, verification, static analysis gates, postmortems
Draft Verification critique: self-review + git history comparison. Run before marking any non-trivial task complete.
Argue against a design decision or implementation approach. Forces consideration of alternatives before committing. Use when evaluating architecture or design choices.
Aggregate all quality warnings from the current session. Use before committing to see a summary of all hook-generated warnings.
Interview the user relentlessly about a plan or design until reaching shared understanding, resolving each branch of the decision tree. Use when the user is in plan mode, planning an implementation, wants to stress-test a plan, get grilled on their design, or mentions "grill me".
Run a full project health check. Auto-detects PHP and/or JS/TS projects and runs the appropriate quality pipeline. Use before committing or when you want a quality snapshot.
Executes bash commands
Hook triggers when Bash tool is used
Modifies files
Hook triggers on file write and edit operations
Uses power tools
Uses Bash, Write, or Edit tools
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Agent = Model + Harness. The harness is everything except the model: behavioral steering, context management, memory, evaluation, orchestration, and multi-agent decomposition. Research shows changing only the harness can produce a 6x performance gap.
Forge Studio implements harness principles as composable Claude Code plugins.
9 plugins. 35 skills. 22 hooks. 4 agents.
# Add the marketplace
/plugin marketplace add ArtMin96/forge-studio
# Install by layer — pick what you need
# Behavioral Steering (recommended: start here)
/plugin install behavioral-core@forge-studio
# Context Management
/plugin install context-engine@forge-studio
# Memory Architecture
/plugin install memory@forge-studio
# Evaluation & Quality Gates
/plugin install evaluator@forge-studio
# Orchestration
/plugin install workflow@forge-studio
# Multi-Agent Decomposition
/plugin install agents@forge-studio
# Reference & Tips
/plugin install reference@forge-studio
# Execution Trace Collection
/plugin install traces@forge-studio
# Token-Optimized Output (always-on compressed communication)
/plugin install caveman@forge-studio
After installing, start a new session for plugins to load.
A lean CLAUDE.md template is included at templates/CLAUDE.md. Designed to work with forge-studio plugins — covers personality, judgment, context management, self-evaluation, and project config without repeating what hooks enforce.
cp templates/CLAUDE.md ./CLAUDE.md
# Edit the Project Config and Conventions sections for your project
A power-user settings.json template is included at templates/settings.json. Enables extended thinking, maximum effort, LSP tools, and bypass permissions with a deny list for destructive commands.
# Copy to your global Claude Code config
cp templates/settings.json ~/.claude/settings.json
Key choices:
See Settings Best Practices for detailed documentation.
┌─────────────────────────────────────────────┐
│ User / IDE │
├─────────────────────────────────────────────┤
│ Harness (Forge Studio) │
│ │
│ behavioral-core ──── Steering & discipline │
│ context-engine ───── Context window mgmt │
│ memory ───────────── Cross-session recall │
│ evaluator ────────── Quality gates & review│
│ workflow ─────────── Orchestration patterns│
│ agents ───────────── Multi-agent triad │
│ reference ────────── Power-user tips │
│ caveman ──────────── Token-optimized output│
│ │
├─────────────────────────────────────────────┤
│ Claude Model │
└─────────────────────────────────────────────┘
See docs/architecture.md for the full design rationale.
Re-injects behavioral rules on every message to prevent drift in long sessions. Rules live in rules.d/ as individual files, priority-ordered by numeric prefix (10-no-sycophancy, 20-no-filler, etc.). Add, remove, or reorder rules by managing files. Hooks enforce ~100% compliance where system prompt instructions degrade to ~80%.
| Skill | Purpose |
|---|---|
/rules-audit | Audit session for sycophancy, apologies, scope creep, filler |
/scope <task> | Define task boundaries and acceptance criteria |
/timebox [N] | Set a message budget (default 15) for the current task |
Progressive 5-stage context pressure tracking replaces fixed message-count thresholds. Warns at ~50% (re-read files), ~65% (consider compact), ~75% (recommend compact), ~85% (recommend handoff), ~92% (critical — handoff now). Automatically uses actual context percentage when Claude Code exposes it.
npx claudepluginhub artmin96/forge-studio --plugin evaluatorOrchestration patterns: task routing, explore-plan-implement cycle, daily rituals
Execution trace collection, analysis, and harness evolution. Stores structured JSONL traces for pattern detection and failure-driven improvement.
Context management: progressive pressure tracking, session handoffs, drift detection, edit safety, large-file warnings, plan mode hooks, MCP monitoring, prompt budget awareness
Multi-agent decomposition: planner/generator/reviewer triad with tool-isolated capability boundaries
Three-tier memory architecture: pointer index, on-demand topics, searchable transcripts
Harness for Claude Code — skills, /harness:* slash commands, persona subagents, lifecycle hooks, and MCP tools without per-repo `harness setup`. Sibling plugins exist for Cursor, Gemini CLI, and Codex.
Opinionated workflow guides and best practices - the preacher's proven patterns for Claude Code projects
Complete collection of battle-tested Claude Code configs from an Anthropic hackathon winner - agents, skills, hooks, and rules evolved over 10+ months of intensive daily use
Ultra-compressed communication mode. Cuts ~75% of tokens while keeping full technical accuracy by speaking like a caveman.
Multi-model consensus engine integrating OpenAI Codex CLI, Gemini CLI, and Claude CLI for collaborative code review and problem-solving.
Curate auto-memory, promote learnings to CLAUDE.md and rules, extract proven patterns into reusable skills.