From agent-trust
Local pre-action trust boundary for agents before installing skills, enabling tools/MCP servers, exposing secrets, or approving risky actions.
How this skill is triggered — by the user, by Claude, or both
Slash command
/agent-trust:agent-trustThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Before expanding into source reads, check [`docs/COVERAGE_MATRIX.md`](docs/COVERAGE_MATRIX.md). It maps the risky surfaces Agent Trust covers, the boundaries it deliberately refuses, and the local proof commands that verify those claims.
Before expanding into source reads, check docs/COVERAGE_MATRIX.md. It maps the risky surfaces Agent Trust covers, the boundaries it deliberately refuses, and the local proof commands that verify those claims.
Agent Trust is a local pre-action trust boundary and portable cross-agent skill for AI agents. Use it before an agent installs an external skill/plugin/MCP server, passes sensitive authority to a tool, approves an x402-style payment decision, or lets another repository/helper influence local actions.
The skill is intentionally contract-first: read SKILL.md, index.json, schemas, examples, and CLI contract output before broad repository scanning. It also includes a small local self-selection improvement loop so an agent can collect candidate improvements, rank them, apply one bounded change, and verify it without crossing safety boundaries.
Install this repository as a root skill source:
openclaw skills install git:tigrohvost/agent-trust@main
The repository is intentionally shaped for Git/local skill installation: this SKILL.md file is at the source root and name: agent-trust is the stable install slug / allowlist key.
Codex skills are discovered from a skills directory such as $CODEX_HOME/skills or ~/.codex/skills. Install Agent Trust by cloning this repository as a skill folder, then restart Codex so the name, description, and SKILL.md path are re-indexed:
mkdir -p "${CODEX_HOME:-$HOME/.codex}/skills"
git clone https://github.com/tigrohvost/agent-trust.git "${CODEX_HOME:-$HOME/.codex}/skills/agent-trust"
If you prefer not to install globally, point Codex at this file as an instruction source for a one-off check. Keep the repository root intact so relative examples and schemas remain available.
Claude-style agent skills also center on a SKILL.md file with frontmatter plus focused instructions. Use this repository as a portable skill folder, or copy the repository into your Claude skills location if your Claude environment supports local skill directories. The important invariant is the same: keep SKILL.md at the skill root, keep name: agent-trust, and install the Python package in the environment that will run the checks.
After skill installation, verify the Python package interface in the agent environment. Prefer a virtual environment or pipx-style isolated install; many modern agent hosts use PEP 668 externally-managed Python environments and will reject system-wide pip install:
python3 -m venv .venv
. .venv/bin/activate
python3 -m pip install -e '.[test]'
agent-trust --print-contract
agent-trust --input examples/input.json
agent-trust-skill manifest --compact
For a one-command remote package install outside a cloned repo, use the same isolation rule:
python3 -m venv /tmp/agent-trust-venv
. /tmp/agent-trust-venv/bin/activate
python3 -m pip install git+https://github.com/tigrohvost/agent-trust.git
agent-trust --print-contract
If console scripts are not on PATH, use module entrypoints instead:
python3 -m agent_trust.cli --print-contract
python3 -m agent_trust.cli --input examples/input.json
python3 -m agent_trust.skill manifest --compact
Before broad repository reading, grep, or ad-hoc exploration, use Agent Trust in a narrow contract-first path. This keeps the main agent context clean and makes the first investigation fast enough to be repeated.
agent-trust --print-contract and inspect SKILL.md, index.json, schemas, and examples before scanning implementation files.api_token, wallet_key, repo_write_token) rather than raw secret values.agent-trust --input examples/input.json or agent-trust-skill check ... --compact before wider exploration.agent_trust/, schemas/, or tests/ to answer a specific missing fact. Do not start with whole-repo grep when the contract already answers the question.This protocol is intentionally similar to how a good framework-aware sub-agent should work: predict the relevant surfaces, query structured metadata first, then use raw file search only as a fallback.
When improving Agent Trust, do not wait for a human to prescribe every micro-step. Build a short evidence-backed idea list, score it, choose one low-risk/high-impact improvement, apply the smallest complete version, and verify it. A local example is included:
python3 examples/agent_trust_idea_selector.py --ideas examples/ideas.json
Use this as a decision aid, not as a substitute for judgment. Never select an idea that crosses hard boundaries such as real secrets, wallet signing, payment, private outreach/posting, repository settings, KYC, debt, recurring obligations, or legal/compliance commitments without explicit authorization.
See docs/IMPROVEMENT_LOOP.md for the current loop and candidate idea table.
Use Agent Trust when you are about to:
Agent Trust is intentionally local-first:
If an action requires real money, mainnet financial activity, KYC, debt, recurring obligations, private outreach/posting, repository settings changes, or legal/compliance commitments, stop and request explicit human authorization.
{
"action": "install_external_skill",
"target": "git:tigrohvost/example-skill@main",
"declared_permissions": ["network", "filesystem"],
"sensitive_arguments": ["api_token"],
"context": "Agent wants to install a new helper before using account tools."
}
Do not include actual secret values. Use labels like api_token, wallet_key, customer_email, or repo_write_token instead.
allow, continue only within the stated boundary.review, ask for human/maintainer review or collect more evidence.deny, do not execute the action. Choose a smaller safe inspection step instead.${CODEX_HOME:-$HOME/.codex}/skills/agent-trust, keep SKILL.md at that folder root, and restart Codex after install/update.SKILL.md not found during OpenClaw install: install from the repository root (git:tigrohvost/agent-trust@main), not from a subdirectory.pipx; do not force install into the system Python used by the OS.agent-trust command not found: install the Python package in the same environment used by the agent, or run python3 -m agent_trust.cli ....agent-trust-skill command not found: run python3 -m agent_trust.skill manifest --compact.git clone https://github.com/tigrohvost/agent-trust.git
cd agent-trust
python3 -m pip install -e '.[test]'
python3 -m pytest -q
This skill intentionally follows a small common denominator for OpenClaw, Codex, Claude-style skills, and other SKILL.md consumers:
SKILL.md;name: agent-trust;description for progressive-disclosure loaders;Agent Trust — локальная граница доверия перед действием агента. Используй её перед установкой внешних skill/plugin/MCP, передачей секретов инструментам, запуском неизвестных helper-команд, x402/payment-like решениями и другими действиями с полномочиями.
OpenClaw-установка:
openclaw skills install git:tigrohvost/agent-trust@main
Граница безопасности: локальная проверка, без сетевых вызовов для самого trust check, без кошельков, подписей, платежей, real-money действий и без передачи реальных секретов во входных данных. Если действие касается реальных денег, KYC, долгов, recurring payments, private outreach/posting, настроек репозитория или юридических обязательств — остановись и запроси явное разрешение человека.
Guides creation, editing, and verification of skills for AI coding agents using test-driven development with subagent scenarios. Use when authoring or debugging skills.
npx claudepluginhub tigrohvost/agent-trust --plugin agent-trust