By mistakeknot
Agent trust scoring — reputation tracking, severity-weighted decay, and suppression candidates.
Agent trust scoring for Claude Code. Tracks which review agents produce useful findings and which waste tokens.
When interflux dispatches review agents, some consistently produce findings you act on — and some produce noise you dismiss. Intertrust closes the feedback loop: it records each accept/dismiss decision, computes a trust score per agent, and feeds that score back into dispatch priority so the good agents run first and the noisy ones get deprioritized.
The scoring algorithm uses severity-weighted time decay: a P0 finding accepted yesterday counts more than a P3 finding dismissed last month. Scores blend project-specific data with global data, so a new project inherits the agent's cross-project reputation until enough local data accumulates.
Intertrust was extracted from the interspect profiler to maintain single-responsibility: interspect handles evidence collection and routing overrides; intertrust handles reputation and trust.
First, add the interagency marketplace (one-time setup):
/plugin marketplace add mistakeknot/interagency-marketplace
Then install:
/plugin install intertrust
Check which agents are earning trust and which are candidates for suppression:
/trust-status
AGENT PROJECT TRUST ACCEPTED DISCARD REVIEWS
fd-safety my-project 0.92 18 2 20
fd-correctness my-project 0.85 12 3 15
fd-game-design my-project 0.15 1 12 13 <!>
Agents with trust < 0.30 are flagged with <!> as suppression candidates. These agents consistently produce findings that nobody acts on.
For a specific agent:
/trust-status fd-safety
Score range: 0.05 (floor) to 1.0 (ceiling).
Inputs: Every time you resolve a review finding (via /clavain:resolve), the outcome is recorded:
Severity weighting: A P0 finding counts 4x, P1 counts 2x, P2 counts 1x, P3 counts 0.5x. Catching a real security issue (P0, accepted) boosts trust much more than flagging a style nit (P3, accepted).
Time decay: Half-life of ~30 days. Recent outcomes matter more than old ones. An agent that improved its prompts last week shouldn't be penalized for noise it generated two months ago.
Project/global blending: New projects inherit the agent's global reputation until enough local data accumulates (blend weight reaches 1.0 at 20 local reviews).
Integration: interflux multiplies each agent's triage score by its trust score at dispatch time. High-trust agents get dispatched first. Low-trust agents may not get dispatched at all if the token budget is tight.
Trust scoring is progressive enhancement — it never blocks workflows. If intertrust is not installed, all agents get a neutral trust score of 1.0.
Trust data lives in the shared .interspect/interspect.db SQLite database (the trust_feedback table). The library is self-contained with no dependency on the interspect plugin — it creates its own table if needed.
intertrust/
├── .claude-plugin/plugin.json # Plugin manifest
├── hooks/
│ └── lib-trust.sh # Trust scoring library (233 lines)
├── commands/
│ └── trust-status.md # /trust-status command
└── tests/
└── test_trust_scoring.sh # 11 tests
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
npx claudepluginhub mistakeknot/interagency-marketplace --plugin intertrust[DEPRECATED — use intervoice] Analyze your writing style and adapt Claude's output to sound like you. Replaced by intervoice, which reads one global multi-register profile instead of per-project glob-routed files.
Recursive AGENTS.md generator with integrated Oracle critique, CLAUDE.md harmonization, incremental updates, diff previews, and smart monorepo scoping. Cross-AI compatible.
Token-efficient code reconnaissance for LLMs. Autonomous skills save 48-85% tokens via diff-context, semantic search, structural patterns, and symbol analysis. Includes MCP server for direct tool integration.
Self-improving agent rig: codifies product and engineering discipline into composable workflows from brainstorm to ship. Compounds knowledge, generates domain agents, monitors its own docs, and surfaces conservative update drift. Orchestrates Claude, Codex, and Oracle through 6 agents, 52 commands, 19 skills, 0 MCP servers. Factory substrate: CXDB turn DAG, scenario bank with satisfaction scoring, evidence pipeline, agent capability policies. Companions: interspect, interphase, interline, interflux, interpath, interwatch, interslack, interform, intercraft, interdev, interpeer, intertest.
Token efficiency benchmarking, session analytics, and API-equivalent cost analysis for agent workflows
Complete AI coding workflow system. Self-correcting memory + persistent FTS5-indexed research wikis + auto-research loop + multi-LLM council on a single SQLite store. 33 skills, 8 agents, 22 commands, 37 hook scripts across 24 events. Cross-agent via SkillKit.
Comprehensive skill pack with 66 specialized skills for full-stack developers: 12 language experts (Python, TypeScript, Go, Rust, C++, Swift, Kotlin, C#, PHP, Java, SQL, JavaScript), 10 backend frameworks, 6 frontend/mobile, plus infrastructure, DevOps, security, and testing. Features progressive disclosure architecture for 50% faster loading.
Intelligent draw.io diagramming plugin with AI-powered diagram generation, multi-platform embedding (GitHub, Confluence, Azure DevOps, Notion, Teams, Harness), conditional formatting, live data binding, and MCP server integration for programmatic diagram creation and management.
Persistent file-based planning for AI coding agents. Crash-proof markdown plans (task_plan.md, findings.md, progress.md) that survive context loss and /clear, with an opt-in completion gate and multi-agent shared state. Manus-style. Works with Claude Code, Codex CLI, Cursor, Kiro, OpenCode and 60+ agents via the SKILL.md standard. Includes Arabic, German, Spanish, and Chinese (Simplified and Traditional).
Complete creative writing suite with 10 specialized agents covering the full writing process: research gathering, character development, story architecture, world-building, dialogue coaching, editing/review, outlining, content strategy, believability auditing, and prose style/voice analysis. Includes genre-specific guides, templates, and quality checklists.
TypeScript/JavaScript full-stack development with NestJS, React, and React Native