Data cleaning, enrichment, restructuring, and packaging skills for tabular and JSON datasets. Excludes data visualisation.
Add or update a CHANGELOG.md in a data repository, recording dataset versions, schema changes, row-count deltas, enrichments applied, and re-publications. Follows Keep-a-Changelog conventions adapted for datasets. Use when the user wants versioned documentation of how a dataset has evolved over time.
Create a data dictionary for a dataset (CSV, JSON, JSONL, Parquet, Excel) that documents every column/field — name, type, description, units, example values, nulls allowed, source. Use when a dataset has no accompanying documentation and the user wants one generated.
Add ISO 3166 country codes (alpha-2, alpha-3, numeric) to a dataset that references countries by name but lacks standardised codes. Use when the user has a CSV/JSON/Parquet/Excel dataset with country names and wants ISO 3166 codes added as new columns/fields.
Prepare or refactor a dataset for upload into a REST API or MCP server — mapping dataset columns to API request fields, handling batching, pagination, rate limits, authentication, idempotency, and error retries. Works from an OpenAPI spec the user provides, a user-pointed MCP tool schema, or documentation for a well-known API (Salesforce, HubSpot, Airtable, Notion, Stripe, Shopify, Pipedrive, etc.). Generates a loader script plus a dry-run preview before executing.
Convert between CSV and JSON formats — CSV to JSON array, CSV to JSONL, JSON to CSV, JSONL to CSV. Handles type inference, header/record mapping, nested structure flattening, and encoding issues. Use when the user wants to reformat tabular data between row-oriented CSV and object-oriented JSON forms.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Data cleaning, enrichment, restructuring, and packaging skills for tabular and JSON datasets. Data visualisation is out of scope (handled by a separate plugin).
| Skill | Purpose |
|---|---|
data-cleanliness-scan | Scan flat files (CSV/Parquet/JSON/Excel) and flag columns likely to fail SQL ingestion or analysis |
standardise-country-names | Normalise inconsistent country names ("USA" vs "United States of America") |
text-to-numeric | Parse formatted strings like $4.27, 1,234.56, €1.2M, (500) into numeric columns |
unicode-consistency | Detect and fix mixed Unicode normalisation, mojibake, invisible chars, confusables |
date-wrangling | Convert dates/times between ISO 8601, epoch (s/ms/µs/ns), with/without timezone, fiscal, week-date |
iso-review | Audit the dataset for fields that could be standardised to an ISO standard (3166, 4217, 639, 8601, LEI, ISIN, …) and optionally refactor |
| Skill | Purpose |
|---|---|
add-iso3166 | Add ISO 3166 country codes (alpha-2/3, numeric) to datasets referencing countries |
enrich-with-currency | Map ISO 3166 codes to ISO 4217 currency codes (plus name / symbol) |
data-enrichment | Brainstorm and rank enrichment opportunities (temporal, geo, entity, FX, embeddings, holidays …) |
| Skill | Purpose |
|---|---|
add-data-dictionary | Generate a data dictionary (Markdown / YAML / JSON / CSV) for a dataset |
update-data-dictionary | Keep an existing data dictionary in sync after schema changes |
data-dictionary-export | Export a data dictionary to a polished PDF via Typst |
data-to-document | Render a dataset (or a filtered slice) to PDF via Typst, with layout auto-chosen from data shape, selectable fields, and custom column labels |
add-changelog | Maintain a dataset-focused CHANGELOG.md (Keep-a-Changelog, SemVer-adapted) |
| Skill | Purpose |
|---|---|
csv-to-json | Bidirectional CSV ↔ JSON / JSONL conversion |
json-restructure | Reshape JSON — flatten, nest, group-by, explode arrays, promote/demote fields |
data-shape | Propose a normalised SQL schema (tables, keys, relationships) from a flat source |
data-comparability | Align multiple datasets — reconcile headers, types, vocabularies, units — for merge/union |
| Skill | Purpose |
|---|---|
pii-flag | Detect PII (names, emails, IDs, cards, coords, …) at cell-level with confidence scores |
synthetic-data-overlay | Replace PII with realistic synthetic substitutes preserving shape and referential integrity |
| Skill | Purpose |
|---|---|
database-guide | Recommend a database backend (relational / analytical / document / graph / vector / time-series) |
parquet-jsonl-package | Package a dataset as Parquet and/or JSONL with compression and partitioning |
sql-load | Load a flat dataset into SQL (Postgres / MySQL / SQLite / MSSQL / DuckDB) with schema validation |
graph-database | Reshape tabular/JSON data into nodes + edges, emit Cypher / GraphML / CSV bulk loads |
vector-upsert | Embed text fields and upsert into a vector DB (Pinecone / Qdrant / Weaviate / pgvector / Chroma / Milvus) |
hf-dataset-push | Publish a packaged dataset to Hugging Face Hub with dataset card |
api-loader | Prepare and push data into a REST API or MCP server, from an OpenAPI spec or well-known SDK |
geodata-formatter | Convert CSV / tabular geodata into GeoJSON (or NDGeoJSON) with CRS reprojection and geometry inference |
divergent-data-pipe | Build an incremental sync from a canonical upstream into a downstream project that has diverged (renames / enrichments), preserving the divergence |
Every skill follows the safety and data-layout rules in CONVENTIONS.md. Highlights:
_iso3166, _numeric, _synthetic); overwrite only on explicit user request.npx claudepluginhub danielrosehill/claude-code-plugins --plugin Claude-Data-WranglerAgent Skills for AI/ML tasks including dataset creation, model training, evaluation, and research paper publishing on Hugging Face Hub
Comprehensive skill pack with 66 specialized skills for full-stack developers: 12 language experts (Python, TypeScript, Go, Rust, C++, Swift, Kotlin, C#, PHP, Java, SQL, JavaScript), 10 backend frameworks, 6 frontend/mobile, plus infrastructure, DevOps, security, and testing. Features progressive disclosure architecture for 50% faster loading.
A growing collection of Claude-compatible academic workflow bundles. Covers scientific figures, manuscript writing and polishing, reviewer assessment, citation retrieval, data availability, paper reading, literature search, response letters, paper-to-PPTX conversion, and evidence-grounded Chinese invention patent drafting. Rules are organized as reusable skill folders with explicit workflows and quality checks.
Intelligent draw.io diagramming plugin with AI-powered diagram generation, multi-platform embedding (GitHub, Confluence, Azure DevOps, Notion, Teams, Harness), conditional formatting, live data binding, and MCP server integration for programmatic diagram creation and management.
Persistent file-based planning for AI coding agents. Crash-proof markdown plans (task_plan.md, findings.md, progress.md) that survive context loss and /clear, with an opt-in completion gate and multi-agent shared state. Manus-style. Works with Claude Code, Codex CLI, Cursor, Kiro, OpenCode and 60+ agents via the SKILL.md standard. Includes Arabic, German, Spanish, and Chinese (Simplified and Traditional).
Complete creative writing suite with 10 specialized agents covering the full writing process: research gathering, character development, story architecture, world-building, dialogue coaching, editing/review, outlining, content strategy, believability auditing, and prose style/voice analysis. Includes genre-specific guides, templates, and quality checklists.
Claude Code plugin: ideation and planning workflow — capture, evaluate, rank, simulate, and plan ideas, with ideation/single-idea-eval/multi-idea-ranking/feature-ideas/simulation/idea-capture variants.
First-pass data analysis toolkit: correlations, PII flagging, anomalies, hypothesis tests, data dictionaries, and trend analysis on a dataset in a folder.
Claude Code plugin for generating personal user manuals and private documentation for codebases. Creates personalized, private reference guides with PDF output support.
Research, filter, compare, and evaluate AI models on OpenRouter — discover models by capability (tool use, vision, audio), get cost/context-aware recommendations, run head-to-head comparisons, and conduct deep research that goes beyond the OpenRouter catalog.
Claude Code plugin for writing assistance, proofreading, style editing, and text transformation workflows.