Benchmarking framework to analyse tokenization per file format.
npx claudepluginhub thoeltig/file-format-token-accuracy-benchmarkBenchmarking framework to analyse tokenization per file format.
Comprehensive benchmarking suite for measuring token usage and retrieval accuracy across file formats for LLM consumption to find the most efficient format.
This framework validates which file formats deliver the most reliable information to LLMs with optimal token efficiency. The benchmark focuses on structural and retrieval accuracy understanding data organization and extracting specific values which are fundamental to avoiding context confusion.
Note on Question Categories: Complex filtering and mathematical aggregation are interesting but test the model's intelligence rather than format effectiveness. Also the accuracy for filtering and aggregation would be irrelevant if the data is prefiltered or the data contains fields with the calculated values. Structural questions (understanding data shape/organization) and retrieval questions (extracting specific values) directly validate format clarity.
The initial benchmark run (Dec 19, 2025) with Claude 4.5 Haiku tested 8 formats across 120 questions with weighted accuracy prioritizing structural understanding and retrieval (60% combined weight vs filtering/aggregation at 40%). 4 flat array data sets of variants 40 and 80 records, random optional fields and all mandatory fields.
[!NOTE] All benchmark results have been moved to a separate repository: file-format-token-accuracy-benchmark-results
Key Findings (see Initial BenchmarkReport.md):
Active Formats (8 total):
Removed after Initial Test:
Record Count: 31 records (single standardized count, replaces 40/80 records)
Structures: Flat and Nested per format (extends previous flat array structure)
Field Variants: Mandatory and Optional per format (same object but for optional random fields are set to null)
A complete benchmark run consists of 5 steps:
When you run a benchmark, it generates a folder like benchmark_format_all_structure_all_variant_all_haiku/ containing:
Claude Code marketplace entries for the plugin-safe Antigravity Awesome Skills library and its compatible editorial bundles.
Production-ready workflow orchestration with 84 marketplace plugins, 192 local specialized agents, and 156 local skills - optimized for granular installation and minimal token usage
Directory of popular Claude Code extensions including development tools, productivity plugins, and MCP integrations