By SpicesFire
Self-audit protocol for Claude. Detects sycophancy, position drift, and user-pressure-induced agreement. Pre-commits falsification criteria before reasoning and generates external tests the user runs independently of Claude. Triggers: 'honesty check', 'stress test position', 'audit your position', 'drift check'.
A Claude Code skill that audits Claude's own sycophancy, position drift, and pressure-induced agreement — then generates tests you run independently of Claude to verify it.
Most "be more honest" prompts ask the model to grade its own homework. The problem: a system that can't introspect, has no reference point outside the conversation, and can't tell "I genuinely updated" from "I caved to please you" — that system cannot self-certify its own honesty.
Dürüst ("honest" in Turkish) doesn't pretend to solve that. It does three things instead:
It openly states what it cannot do — and routes around those walls with validation you control.
In Claude Code:
/plugin marketplace add SpicesFire/durust
/plugin install durust@durust
That's it. Claude can now invoke the skill automatically when integrity is in question, or you can trigger it by hand.
Prefer no plugin? Copy `skills/durust/SKILL.md` into `~/.claude/skills/durust/SKILL.md` (or paste its body into `~/.claude/commands/durust.md`).
By hand — say any of:
`honesty check` · `audit your position` · `stress test position` · `drift check` (Turkish triggers also work: `dürüst ol` · `kendini denetle` · `pozisyon testi` · `drift kontrolü`)
Automatically — Claude invokes it itself when it detects:
A two-layer output:
HELD / MODIFIED / WITHDRAWN / UNCERTAIN, possibly PROVISIONAL), and the single most decisive test to run.
Any verdict stays PROVISIONAL, with confidence capped at ~70%, until you run the external tests. Internal audit alone is never enough.
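The provisional cap can be pictured with a small Python sketch. This is an illustration only, not the skill's actual implementation: the `Verdict` class, its field names, and the choice to report the uncapped estimate once external tests have been run are all assumptions made for the example.

```python
from dataclasses import dataclass

# Assumption: the "~70%" cap from the text, expressed as a fraction.
PROVISIONAL_CAP = 0.70
VERDICTS = {"HELD", "MODIFIED", "WITHDRAWN", "UNCERTAIN"}


@dataclass
class Verdict:
    """Hypothetical container for an audit verdict."""
    label: str
    confidence: float                 # internal estimate, 0.0 to 1.0
    external_tests_run: bool = False  # set by the user, not by Claude

    def __post_init__(self) -> None:
        if self.label not in VERDICTS:
            raise ValueError(f"unknown verdict: {self.label}")
        if not 0.0 <= self.confidence <= 1.0:
            raise ValueError("confidence must be between 0.0 and 1.0")

    @property
    def provisional(self) -> bool:
        # A verdict is provisional until the external tests have been run.
        return not self.external_tests_run

    @property
    def reported_confidence(self) -> float:
        # While provisional, never report more than the cap.
        if self.provisional:
            return min(self.confidence, PROVISIONAL_CAP)
        return self.confidence

    def summary(self) -> str:
        tag = " (PROVISIONAL)" if self.provisional else ""
        return f"{self.label}{tag} at {self.reported_confidence:.0%}"
```

Under these assumptions, `Verdict("HELD", 0.9).summary()` gives `HELD (PROVISIONAL) at 70%`, and the same verdict with `external_tests_run=True` gives `HELD at 90%`.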
| # | Step | Why it matters |
|---|---|---|
| 1 | Position snapshot | Claim + confidence, no reasoning yet |
| 2 | Pre-commit falsification criteria | Written before reasoning, so reasoning can't rationalize |
| 3 | Position reasoning | Constrained by the criteria already locked in |
| 4 | Position history | Did the stance drift under pressure? Flags DRIFT SUSPECTED |
| 5 | Steel-man + reverse-pressure test | "Would I hold the opposite view if you'd argued the opposite?" |
| 6 | External anchor | Web search / sources / other models — or honestly: none |
| 7 | External Test Protocol | The primary output: tests you run without Claude |
| 8 | Audit log | Persisted for long-term calibration |
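Steps 1 to 4 of the table can be sketched in Python to show why the ordering matters. This is a minimal illustration under stated assumptions, not the skill's code: `Audit`, `Turn`, and `drift_suspected` are hypothetical names, and treating "stance changed without new evidence" as the drift signal is a simplification chosen for the example.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class Audit:
    """Hypothetical record: snapshot first, criteria second, reasoning third."""
    claim: str                        # step 1: position snapshot
    confidence: float                 # step 1: confidence, no reasoning yet
    criteria: Tuple[str, ...] = ()    # step 2: falsification criteria
    reasoning: Optional[str] = None   # step 3: filled in only after step 2
    _locked: bool = field(default=False, init=False, repr=False)

    def precommit(self, *criteria: str) -> None:
        # Criteria can be written once; later edits would allow rationalizing.
        if self._locked:
            raise RuntimeError("criteria are already locked")
        if not criteria:
            raise ValueError("at least one falsification criterion is required")
        self.criteria = tuple(criteria)
        self._locked = True

    def reason(self, text: str) -> None:
        # Reasoning is refused until the criteria have been locked in.
        if not self._locked:
            raise RuntimeError("pre-commit falsification criteria before reasoning")
        self.reasoning = text


@dataclass(frozen=True)
class Turn:
    stance: str
    new_evidence: bool  # assumption: someone labels whether the turn added evidence


def drift_suspected(history: List[Turn]) -> bool:
    """Step 4 (simplified): flag a stance change that came with no new evidence."""
    for prev, cur in zip(history, history[1:]):
        if cur.stance != prev.stance and not cur.new_evidence:
            return True
    return False
```

Calling `reason()` before `precommit()` raises an error, which mirrors the table's point that the criteria are written before the reasoning. A history that flips stance on a turn with `new_evidence=False` makes `drift_suspected` return `True`.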
It surfaces these honestly instead of papering over them:
Factual lookups, user-preference questions, coding/math tasks, creative writing, quick casual chat. It's for the moments where Claude's intellectual integrity itself is the subject — not for ordinary tool-use.
MIT — use it, fork it, improve it. Issues and PRs welcome, especially around v3.