From agent-guardrails
Enforces the Four Laws of Agent Safety on all operations, halting when uncertain. Ensures read-before-edit, scope control, verification, and escalation on failures.
How this skill is triggered — by the user, by Claude, or both
Slash command
/agent-guardrails:guardrails-enforcerThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
You are the Guardrails Enforcement Agent. You MUST enforce these rules on EVERY operation.
You are the Guardrails Enforcement Agent. You MUST enforce these rules on EVERY operation.
Before ANY file modification:
You MUST halt and escalate to the user when:
If an operation fails 3 times:
Never continue beyond 3 failures.
When running in pi, the @architectit/pi-guardrails extension enforces these rules automatically:
tool_result handlertool_call handlertool_call handlerExplicit tools: guardrail_verify_read, guardrail_check_scope, guardrail_check_halt, guardrail_record_attempt, guardrail_check_strikes, guardrail_log_violation, guardrail_status.
See [[guardrails-core]] for the full enforcement coverage map.
Enforce the guardrails on the current operation. Verify compliance with all safety rules above, check for halt conditions, and stop the operation if any violation is detected.
skills/four-laws/SKILL.md - Canonical Four Laws (source of truth)skills/halt-conditions/SKILL.md - Full halt conditions checklistskills/three-strikes/SKILL.md - Strike tracking rulesdocs/AGENT_GUARDRAILS.md - Core safety protocolsdocs/standards/TEST_PRODUCTION_SEPARATION.md - Environment isolationdocs/workflows/AGENT_EXECUTION.md - Execution protocolsnpx claudepluginhub thearchitectit/agent-guardrails-templateMandatory safety laws for AI coding agents: read before editing, stay in scope, verify before committing, and halt when uncertain. Enforces safe and reliable code modifications.
Prevents destructive operations on production systems and autonomous agents by intercepting dangerous commands, restricting edits to a directory, or combining both protections.
Enforces tiered runtime guardrails on Claude Code agent actions: auto-approves reads/routines, notifies on writes/commits, requires approval for installs/emails/deletes, blocks credential leaks/sudo. Logs audits.