Skill

metacog-bench

Run a minimal metacognitive evaluation pass over the current task or repository change. Use it to assess quality, calibration, and overhead.

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/aletheia-nexus:metacog-bench

User invocable

Model invocable

Inline context

Default effort

Tool Access

This skill is limited to the following tools:

Read, Grep, Glob, Bash

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Evaluate the current work on six axes:

SKILL.md

35 lines · ~354 tokens

Stats

Language: Rust

Stars: 0

Maintenance: Excellent

Last Commit: Apr 8, 2026

Minimal MetaCog Bench

Evaluate the current work on six axes:

Axis	Question	Score 1-5
Correctness evidence	Is the result backed by tests, checks, or verifiable facts?
Calibration quality	Does the stated confidence match the actual evidence strength?
Tool efficiency	Were tools used only when necessary? Any redundant calls?
Context overhead	How much context was consumed vs. the value delivered?
Workflow friction	Were there unnecessary round-trips, retries, or dead ends?
Residual risk	What is the likelihood of an undetected issue?

Return exactly

Scores: correctness=X, calibration=X, tools=X, context=X, friction=X, risk=X
Strongest gain: [what Aletheia workflows helped most]
Strongest cost: [what added overhead without proportional value]
Improvement: [one concrete instrumentation or workflow change]
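
The `Scores:` line in the template above is regular enough to check mechanically. The following is a minimal sketch, assuming each `X` is an integer from 1 to 5 and the six keys appear in the order shown; the function name `parse_scores` is hypothetical and not part of the skill.

```python
import re

# Axis keys in the order the report template lists them.
AXES = ["correctness", "calibration", "tools", "context", "friction", "risk"]


def parse_scores(line: str) -> dict[str, int]:
    """Parse a 'Scores: ...' line into {axis: score}; raise ValueError if malformed."""
    prefix = "Scores:"
    if not line.startswith(prefix):
        raise ValueError("line must start with 'Scores:'")
    pairs = [p.strip() for p in line[len(prefix):].split(",")]
    if len(pairs) != len(AXES):
        raise ValueError(f"expected {len(AXES)} entries, got {len(pairs)}")
    scores = {}
    for pair in pairs:
        match = re.fullmatch(r"(\w+)=(\d+)", pair)
        if match is None:
            raise ValueError(f"malformed entry: {pair!r}")
        scores[match.group(1)] = int(match.group(2))
    # Keys must match the template exactly, in the same order.
    if list(scores) != AXES:
        raise ValueError(f"expected axes {AXES}, got {list(scores)}")
    for axis, value in scores.items():
        if not 1 <= value <= 5:
            raise ValueError(f"{axis} score {value} is outside 1-5")
    return scores
```

For example, `parse_scores("Scores: correctness=4, calibration=3, tools=5, context=4, friction=2, risk=3")` returns a dictionary keyed by axis name, while a score of 6 or a missing axis raises `ValueError`.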

Rules:

Be honest. A 5/5 on correctness with no tests run is wrong.
"Strongest cost" must be actionable (not "it took time").
The improvement must be specific enough to implement in the next session.
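
The first two rules above can be partly checked in code. This is a rough sketch under stated assumptions: the caller already knows whether any tests were run (the `tests_run` flag is hypothetical), and the list of vague phrases is illustrative only. The third rule, on specificity, still needs human judgment.

```python
# Illustrative examples of non-actionable "Strongest cost" answers (an assumption,
# seeded from the "it took time" example in the rules).
VAGUE_COSTS = {"it took time", "took time", "it was slow"}


def rule_violations(correctness: int, tests_run: bool, strongest_cost: str) -> list[str]:
    """Return human-readable rule violations; an empty list means none were found."""
    problems = []
    # Rule 1: a top correctness score needs evidence behind it.
    if correctness == 5 and not tests_run:
        problems.append("correctness=5 claimed but no tests were run")
    # Rule 2: the strongest cost must be actionable, not a generic complaint.
    normalized = strongest_cost.strip().lower().rstrip(".")
    if not normalized or normalized in VAGUE_COSTS:
        problems.append("strongest cost is not actionable")
    return problems
```

A report with `correctness=5`, no tests run, and a strongest cost of "It took time." yields two violations; a report that names a specific redundant tool call yields none.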
