Intelligence engineering plugin for building systems that learn from their own operation. Successor to SciPy-Pro v4. Provides 7 workflow commands (/reason, /graph, /train, /architect, /simulate, /measure, /learn), 24 domain-expert agents across three tiers, and compound-learning + enrichment-pipeline guidance for EpiGNN, KGE RotatE, SBERT, GL-Fusion, model-swarm routing, and network effects.
Based on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
System design -- two-mode deployment, feedback loop control, pipeline optimization, layer separation, domain pack architecture.
Graph intelligence -- from objects to structure. Community detection, causal inference, self-organization, GNN embeddings, temporal memory.
Run the compound learning pipeline. Save the session log, run Bayesian confidence updates, and present the review queue with auto-captured claims, tensions, and attention items.
Intelligence measurement -- IQ tracking across 7 axes, benchmarking, trend analysis, sensitivity analysis, leverage identification.
Epistemic reasoning -- from raw text to structured claims, tensions, and models. NER, NLI scoring, claim decomposition, confidence calibration.
Specialist in tracing how ideas influenced each other over time. Handles temporal precedence filtering, influence DAG construction, provenance chains, and lineage tracing. Invoke when working on causal_engine.py, provenance.py, engine.py Pass 7, or any code that tracks the temporal evolution of knowledge and how one piece of evidence led to another. Examples: - <example>User asks "trace the lineage of this claim back to its sources"</example> - <example>User asks "build an influence DAG for this notebook"</example> - <example>User asks "generate a provenance narrative for this object"</example> - <example>User asks "why does the causal engine include this edge?"</example> - <example>User asks "find the root sources that originated this line of thinking"</example> - <example>User asks "feed temporal precedence features into the learned scorer"</example>
Specialist in propositional reasoning — decomposing text into atomic claims and determining support, contradiction, or neutrality between them. Handles claim decomposition (LLM and rule-based), NLI classification with CrossEncoder, stance detection, and epistemic status tracking. Invoke when working on claim extraction, contradiction detection, NLI scoring, the epistemic status lifecycle, or Tension features for the IQ Tracker. Examples: - <example>User asks "add claim-level NLI to the compose engine"</example> - <example>User asks "why are contradictions not being detected?"</example> - <example>User asks "decompose this object into individual claims"</example> - <example>User asks "track the epistemic status of extracted claims"</example> - <example>User asks "improve the stance detection accuracy"</example> - <example>User asks "build NLI features for the Level 2 learned scorer"</example>
Specialist in "what if?" reasoning over the knowledge graph. Handles dependency tree construction, counterfactual retraction cascading, fragility analysis, ATMS-style multi-context belief management, and alternative graph state comparison. Invoke when building Level 7 counterfactual simulation or any system that models consequences of removing or accepting knowledge elements. Examples: - <example>User asks "what happens if I remove this source?"</example> - <example>User asks "which claims are load-bearing on this evidence?"</example> - <example>User asks "simulate accepting this contested claim"</example> - <example>User asks "measure the fragility of my understanding"</example> - <example>User asks "build the claim dependency graph"</example>
Specialist in adapting the engine to perform differently in different knowledge domains. Handles per-cluster model adaptation, feature importance decomposition, meta-learning for rapid domain adaptation, few-shot relation learning, schema induction, and domain pack construction. Invoke when building Level 5 self-modifying pipeline, domain packs, or any system that adapts its behavior to knowledge domain characteristics. Examples: - <example>User asks "make the engine weight signals differently per domain"</example> - <example>User asks "build a domain pack for legal research"</example> - <example>User asks "learn new relation types from a few examples"</example> - <example>User asks "auto-detect which domain a cluster belongs to"</example> - <example>User asks "induce ontology from corpus content"</example>
Specialist in population-based search for optimal system configurations. Handles genetic algorithms, NSGA-II multi-objective optimization, CMA-ES for continuous parameters, neuroevolution, and fitness functions derived from the IQ Tracker. Invoke when tuning engine hyperparameters, optimizing multi-axis performance, or searching configuration spaces. Examples: - <example>User asks "evolve engine hyperparameters using IQ scores"</example> - <example>User asks "find Pareto-optimal configurations across IQ axes"</example> - <example>User asks "optimize engine thresholds using CMA-ES"</example> - <example>User asks "run multi-objective search over engine configs"</example>
Uses power tools
Uses Bash, Write, or Edit tools
Most Claude Code plugins give you a set of slash commands and some domain knowledge. These plugins do something different: they learn.
Each plugin in this repo is a domain-specialized engineering intelligence that accumulates knowledge across sessions, grounds itself in real library source code (not training data), and coordinates with a companion chat skill on Claude.ai. The plugin implements. The chat skill plans. Over time, the plugin gets better at its job because it tracks what works, what doesn't, and what it's still uncertain about.
This is the two-surface architecture: one surface for thinking, one for building.
A typical plugin contains four layers:
Specialist agents and slash commands. Each plugin ships with 3 to 7 agents that handle specific subtasks. UI-Design-Pro has a design critic, a component builder, an accessibility auditor, an animation engineer, and a visual architect. Django-Engine-Pro has agents for model design, ORM optimization, migration planning, and MCP server exposure. Agents compose in defined sequences: you always run the stack detector before the component builder, always run the design critic after.
Source-code references. Plugins include install.sh scripts that shallow-clone real library repos into a local refs/ directory. When UI-Design-Pro needs to know how Radix handles focus restoration, it greps the actual Radix source, not its training data. When D3-Pro needs to verify a scale constructor's API, it reads the Observable source directly. This matters because training data goes stale. Source code doesn't.
Skills and decision frameworks. Static knowledge: inheritance decision tables, ORM anti-pattern catalogs, polymorphic rendering rules, animation physics constants. These encode the expert judgment that doesn't change between sessions.
An epistemic knowledge layer. This is the part that learns. Each plugin maintains a knowledge/ directory containing typed claims in JSONL, confidence scores, session logs, and (for some plugins) SBERT embeddings. Claims start as drafts. After review, they become active. Active claims carry Bayesian confidence that updates based on session outcomes: when a suggestion informed by a claim gets accepted, confidence rises; when it gets rejected, confidence drops. Over time, each plugin develops its own body of verified, weighted knowledge about its domain.
Each plugin here has a counterpart: a chat skill that runs on Claude.ai (or Claude Desktop). The division of labor is deliberate.
The chat skill handles planning, reasoning, and decision-making. When you're deciding between DRF and Ninja for an API, or choosing an inheritance strategy for a model hierarchy, or evaluating whether a component needs polymorphic rendering, the chat skill walks you through the tradeoffs and produces a structured handoff document.
The Claude Code plugin handles implementation and learning. It takes the handoff document, builds the thing, greps real source code when it needs to verify an API, logs what it tried, and updates its knowledge base with what it learned.
The chat skill never sees knowledge/claims.jsonl. The plugin never produces planning documents. Each surface does what it's good at.
| Chat Skill (Claude.ai) | Claude Code Plugin |
|---|---|
| Decision frameworks | Slash commands and agents |
| Tradeoff analysis | Source-code grepping |
| Structured handoff docs | Implementation and testing |
| Domain reasoning | Session logging and learning |
| Static (expert knowledge) | Dynamic (knowledge that evolves) |
Every plugin with a knowledge/ directory runs the same protocol:
Session start: Read manifest.json for current state. Load active claims sorted by confidence. Check tensions.jsonl for unresolved conflicts in the task's domain. Surface tensions before making decisions, not after.
During work: Track which claims informed each suggestion. Note when the user accepts, modifies, or rejects a recommendation.
Session end: Write observations to session_log/. Flag contradictions as tension signals. Note recurring patterns the knowledge base doesn't yet cover.
The knowledge types are borrowed from Theseus (a separate epistemic engine project):
Current knowledge stats across the fleet:
| Plugin | Total Claims | Active | Avg Confidence |
|---|---|---|---|
| UI-Design-Pro | 140 | 135 | 0.667 |
| Django-Engine-Pro | 111 | 29 | 0.75 |
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimnpx claudepluginhub travis-gilbert/claude-marketplace --plugin theseus-proMobile app development specialist: PWA retrofitting, React Native architecture, offline-first sync, mobile API design, touch optimization, and mobile visualization adaptation.
Makes Claude Code extraordinarily good at transforming websites into applications: converting page-based Next.js sites into app-like experiences with persistent layouts, command palettes, and background sync; wrapping them in Tauri desktop shells with native OS integration; and producing architecture handoffs for native Swift/AppKit macOS apps.
Makes Claude Code genuinely good at designing knowledge-graph answer experiences with cosmos.gl on top of DuckDB-WASM, Mosaic, and vgplot. Owns the SceneDirective adapter, the three-picker ControlDock (Position, Weight, Edges), and the recipe library that turns novel ideas into usable images.
Git and deployment automation with verification at every step. Staged file review, conventional commits, pre-commit checks, push with CI/CD detection, and post-deploy health verification.
Makes Claude Code extraordinarily good at building D3 visualizations that are mathematically accurate, physically believable, and aesthetically grounded in the Mike Bostock / Observable canon.
Comprehensive feature development workflow with specialized agents for codebase exploration, architecture design, and quality review
Harness-native ECC operator layer - 67 agents, 271 skills, 92 legacy command shims, reusable hooks, rules, selective install profiles, and production-ready workflows for Claude Code, Codex, OpenCode, Cursor, and related agent harnesses
Upstash Context7 MCP server for up-to-date documentation lookup. Pull version-specific documentation and code examples directly from source repositories into your LLM context.
Comprehensive skill pack with 66 specialized skills for full-stack developers: 12 language experts (Python, TypeScript, Go, Rust, C++, Swift, Kotlin, C#, PHP, Java, SQL, JavaScript), 10 backend frameworks, 6 frontend/mobile, plus infrastructure, DevOps, security, and testing. Features progressive disclosure architecture for 50% faster loading.
A growing collection of Claude-compatible academic workflow bundles. Covers scientific figures, manuscript writing and polishing, reviewer assessment, citation retrieval, data availability, paper reading, literature search, response letters, paper-to-PPTX conversion, and evidence-grounded Chinese invention patent drafting. Rules are organized as reusable skill folders with explicit workflows and quality checks.
Tools to maintain and improve CLAUDE.md files - audit quality, capture session learnings, and keep project memory current.