Skill

anonymize-doc

Detects and anonymizes PII (SSNs, cards, emails, phones, names) and business data (companies, revenue, costs, pricing) in files using Scrubadub and spaCy NER. Supports check-only or 5 strategies (mask/hash/pseudo/token/mixed); GDPR/HIPAA aware.

Python

Bash

security

Popularity

Parent stars

Parent forks

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/dstoic:anonymize-doc

User invocable

Model invocable

Inline context

Default effort

Configuration

Modelsonnet

Tool Access

This skill is limited to the following tools:

ReadBashWrite

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Detect and anonymize PII + sensitive business data using ML-powered detection (Scrubadub + spaCy NER).

Supporting Files

PHASE1_IMPROVEMENTS.mdexamples.mdreference.mdrequirements.txtscripts/anonymize.pyscripts/detect.pytest_names.txttest_names_fr.txt

SKILL.md

57 lines · ~507 tokens

Stats

LanguagePython

Parent stars12

Parent forks3

MaintenanceExcellent

Last CommitApr 6, 2026

Actions

View Source View Plugin View on GitHub View README

PII & Business Data Anonymization

Detect and anonymize PII + sensitive business data using ML-powered detection (Scrubadub + spaCy NER).

Instructions

0. Dependency Check

SKILL_DIR="$(dirname "$(realpath "$0")" 2>/dev/null || echo /home/mat/dev/agent-skills/dstoic/skills/anonymize-doc)"
pip install -q -r "$SKILL_DIR/requirements.txt" && python -m spacy download -q en_core_web_sm 2>/dev/null

1. Ask Mode

Question: "Check for PII/business data (detection only) or anonymize?"

2. Detection

Run: python "$SKILL_DIR/scripts/detect.py" <file_path>
Parse JSON output → format report (type, line/col, severity, category)
Ask: "Proceed with anonymization?"

3. Anonymization

Ask strategy:

Strategy	Use Case	Reversible	GDPR
`mask`	Max privacy, redaction	No	✅ Full
`hash`	Analytics, tracking	No	✅ Full
`pseudo`	Demos, case studies	Yes	⚠️ Partial
`token`	Financial, vault-backed	Yes	⚠️ Partial
`mixed`	Complex docs (auto per severity)	Mixed	⚠️ Partial

Run: python "$SKILL_DIR/scripts/anonymize.py" <file_path> --strategy <choice>

Outputs: <file>-anonymized.<ext> + <file>-audit-log.json

4. Safety Rules

Preserve original file — never overwrite
Never commit audit logs to git — check .gitignore has *-audit-log.json
Warn: reversible strategies need secure mapping storage
GDPR requires irreversible (mask/hash) for full compliance

See reference.md for entity types, severity tiers, compliance details. See examples.md for before/after samples.

anonymize-doc

Popularity

Invocation

Configuration

Tool Access

Context Preview

Supporting Files

SKILL.md

anonymize-doc

Popularity

Invocation

Configuration

Tool Access

Context Preview

Supporting Files

SKILL.md

PII & Business Data Anonymization

Instructions

0. Dependency Check

1. Ask Mode

2. Detection

3. Anonymization

4. Safety Rules

Similar Skills

PII & Business Data Anonymization

Instructions

0. Dependency Check

1. Ask Mode

2. Detection

3. Anonymization

4. Safety Rules

Similar Skills