tokenkrush

Crushes tokens in CLAUDE.md / AGENTS.md / SOUL.md and other AI agent instruction files. Keeps every directive intact.

Spelled with a K because I'm Austrian and we like our Ks.

Why

Vercel's "AGENTS.md outperforms Skills in our agent evals" (Jan 27, 2026) showed that a compressed AGENTS.md (pipe-delimited, telegraphic prose, 40 KB → 8 KB) outperformed more sophisticated skill-retrieval systems — 100% pass rate vs 79%. The insight: agents parse dense markdown fine. Token budget spent on padding is wasted.

If you want the full argument for fighting context bloat in agent instruction files, read the post. tokenkrush encodes that post's conclusions as a reusable skill.

Measured token savings

Tested on a real 187-line global CLAUDE.md (8784 chars) compressed into the current tokenkrush v1 style, with token counts measured via each provider's live API:

Model	v0 verbose	v1 tokenkrush	Δ tokens
anthropic/claude-sonnet-4.6	2563	1560	−39.1%
anthropic/claude-opus-4.7	3459	2108	−39.1%
openai/gpt-5.4	2250	1367	−39.2%
google/gemini-3-flash-preview	2381	1471	−38.2%

~39% token reduction, consistent across every tested tokenizer, with zero directive loss. The four Karpathy principles, all ALWAYS / NEVER emphasis, all model IDs, and all code blocks were preserved byte-for-byte.

Why not go more aggressive?

While we took inspiration from the Vercel blog's ultra-dense format (single-line blobs, dash-joined words, curly-brace hierarchies), our own benchmarks against current 2026 tokenizers didn't support pushing that far. A Vercel-style "v2" compression of the same file actually lost tokens vs. v1 on every model tested:

Model	v1 tokenkrush	v2 Vercel-aggressive
anthropic/claude-sonnet-4.6	−39.1%	−27.5% (11.6 pp worse)
anthropic/claude-opus-4.7	−39.1%	−28.7% (10.4 pp worse)
openai/gpt-5.4	−39.2%	−30.4% ( 8.8 pp worse)
google/gemini-3-flash-preview	−38.2%	−23.7% (14.5 pp worse)

Why: dash-joining shatters common-word tokens (e.g., "Bun workspaces" is 2 tokens; "Bun-workspaces" can be 3-4). Curly-brace hierarchies introduce unfamiliar structural tokens. The v2 version even had slightly more characters than v1 (4832 vs 4778), proving that character count is a misleading proxy for tokens. The Vercel post's insight about density is correct; the extreme form isn't optimal against today's tokenizers.

A note on why this is LLM-driven, not a script

We benchmarked a regex-based deterministic compressor against the same fixtures and it capped at 7–9% token savings — roughly 5× less than the LLM-driven approach. The gap is structural: regex can strip bold markers, collapse table padding, and merge **Label:**-style bullet lists, but it can't abbreviate named entities (React Native → RN), remove verbose qualifiers ((primary for web apps) → (web)), or distinguish fact lists from prose lists. Compression at 39% is a judgment task. See closed issue #2 for the full benchmark.

See examples/before-after-claude-md.md for the full compression diff on a real-world global CLAUDE.md with all four Karpathy principles preserved verbatim.

What it compresses

Ecosystem	Files
Claude Code	`CLAUDE.md`, `.claude.md`, `.claude.local.md`
Cross-tool	`AGENTS.md`
Google	`GEMINI.md`
Cursor	`.cursor/rules`, `CURSOR.md`
Windsurf	`.windsurfrules`
GitHub Copilot	`copilot-instructions.md`
OpenClaw	`SOUL.md`, `IDENTITY`, `TOOLS.md`, `AGENTS.md`, `BOOTSTRAP.md`, `BOOT.md`, `HEARTBEAT.md`, `USER`
Memory files	`MEMORY.md`, `memory/.md`, `.memory.md`

Install

Three paths. Pick whichever matches your tool.

1. Claude Code plugin marketplace

/plugin marketplace add gregoramon/tokenkrush
/plugin install tokenkrush

2. npx (works for any tool ecosystem)

npx @gregoramon/tokenkrush init

Detects installed AI tool directories (~/.claude/, ~/.openclaw/workspace/, ~/.cursor/, ~/.gemini/) and installs the skill into each.

Flags:

--target <path> — install under <path>/skills/tokenkrush/ (bypasses ecosystem detection)
--all — install to all detected ecosystems without prompting (currently the default)
--link — reserved for symlink mode; not yet implemented
--help, -h — show usage

tokenkrush

Popularity

What's Inside

README

tokenkrush

Why

Measured token savings

Why not go more aggressive?

A note on why this is LLM-driven, not a script

What it compresses

Install

1. Claude Code plugin marketplace

2. npx (works for any tool ecosystem)

3. OpenClaw via ClawHub

Confidence

Similar Plugins

claude-mem

antigravity-bundle-web-designer

caveman

frontend-design

More by gregoramon

shotcraft

More by gregoramon

shotcraft

Popularity

Health & Quality

Similar Plugins

claude-mem

antigravity-bundle-web-designer

caveman

frontend-design

ui-design

marketing-skills