Plan Stack methodology plugins for AI-native development
npx claudepluginhub planstack-ai/planstackPlan Stack - AI-native development workflow built on Context Engineering. Research → Plan → Implement → Review.
Stop stuffing context. Start engineering it.
A development workflow built on Context Engineering principles for reliable AI-assisted system development.
See the Principles · See the Workflow
In two years, context windows grew 62×—from 32K to 2M tokens.
The intuition was obvious: "If everything fits, just put everything in."
It doesn't work.
The uncomfortable truth:
A context window is not storage. It is cognitive load.
| Approach | Why It Breaks |
|---|---|
| Long chat sessions | Context degrades long before the token limit. Earlier instructions fade. The model becomes repetitive or contradictory. |
| Dump the whole repo | Signal drowns in noise. The model misinterprets architectural intent because it can't distinguish core from peripheral. |
| Detailed documentation | More detail ≠ better understanding. Comprehensive docs produce unfocused responses that drift from the actual question. |
| RAG everything | Retrieval finds related content, not relevant content. Without curation, you're still stuffing context—just automatically. |
These approaches share a common assumption:
"If the model can access it, the model can use it."
This assumption is wrong.
Access is not understanding. Retrieval is not reasoning. The bottleneck was never capacity—it was curation.
Don't dump the monolith. Scope by responsibility.
Provide the smallest effective context for the task—not the smallest file, but the smallest bounded context.
For "Add OAuth2 authentication," the model needs:
User model, SessionController, routes.rb, auth middlewareIt does not need:
Ask: "What is the minimum context required to solve this specific problem?"
Pass artifacts, not histories.
Break work into stages. Each stage receives the output of the previous stage—not the entire conversation.
Plan → Execute → Reflect
This keeps signal density high and context fresh.
Ask: "Can I decompose this into stages that pass summaries, not transcripts?"
Never run at 100% capacity. Reserve space to reason.
Token limits cover input and output combined. Stuffing 195K into a 200K window leaves almost no room for the model to think.
Claude Code's quality improved dramatically when Anthropic enforced the 75% rule—not despite the constraint, but because of it.
Ask: "Have I left enough space for the model to think—not just respond?"
Plan Stack is a development workflow that implements all three principles simultaneously.
| Phase | What Happens | Principles Applied |
|---|---|---|
| Research | AI checks docs/plans/ for similar past implementations | Isolation: Starts from curated context, not raw codebase |
| Plan | AI generates implementation plan, human reviews | Headroom: No implementation yet—reasoning gets full capacity |
| Execute | Code with the plan as guide | Chaining: Receives plan artifact, not conversation history |
| Review | AI compares plan vs implementation, detects drift | Isolation + Chaining: Fresh context, artifact-based evaluation |
Implementing a feature involves:
A good plan captures all of that in 200–300 lines.
Six months later:
| Without a plan | With a plan |
|---|---|
| Re-read 50 files | Read one file |
| Re-discover everything | Understand intent |
| Re-make decisions | Modify with confidence |
Plans are curated context—expensive research distilled into something both humans and AI can reliably consume.
/clear PatternContext degrades long before it overflows.
Plan Stack embraces this:
You restart at 0% context—without starting over.
Production-ready workflow orchestration with 84 marketplace plugins, 192 local specialized agents, and 156 local skills - optimized for granular installation and minimal token usage
Directory of popular Claude Code extensions including development tools, productivity plugins, and MCP integrations
Curated collection of 154 specialized Claude Code subagents organized into 10 focused categories