By guynachshon
ML engineering assistant for Hugging Face — research-first methodology, dataset auditing, HF Jobs orchestration, sandbox-driven development. Slash commands, a research subagent, and content-aware approval policy ported from the standalone ml-intern CLI.
Fine-tune a model on a dataset, end-to-end (research → validate → train → push).
Audit a HF dataset — schema, splits, sample rows, and red flags. Direct port of `hf_inspect_dataset`.
Default ML Intern entrypoint — equivalent to running `ml-intern "<your prompt>"` headlessly.
Force a literature-first research crawl — delegates immediately to the `research` subagent without doing anything else.
Submit an HF Job (training, eval, batch inference) with the ml-intern pre-flight checklist.
Admin access level
Server config contains admin-level keywords
Executes bash commands
Hook triggers when Bash tool is used
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
External network access
Connects to servers outside your machine
External network access
Connects to servers outside your machine
Requires secrets
Needs API keys or credentials to function
Requires secrets
Needs API keys or credentials to function
Uses power tools
Uses Bash, Write, or Edit tools
Uses power tools
Uses Bash, Write, or Edit tools
An ML intern that autonomously researches, writes, and ships good-quality ML code on the Hugging Face ecosystem — papers, datasets, training, evaluation, sandbox-driven development, and HF Jobs orchestration.
This repo ships three frontends on top of the same tools:
/ml-intern, /research, /finetune, etc. in any repo.claude from the root, get the same experience plus full-fidelity context.ml-intern CLI — the original, self-contained command-line agent.All three share the same tool implementations under agent/tools/ (the plugin vendors a slim copy under plugin/lib/ml_intern_lib/).
From any repo, inside Claude Code:
/plugin marketplace add guynachshon/claude-code-ml-intern
/plugin install ml-intern@ml-intern
That's it. Set HF_TOKEN and GITHUB_TOKEN in your shell so the MCP server can use them, then start a session and try:
/ml-intern fine-tune llama-3-8b on HuggingFaceH4/ultrachat_200k
/research best DPO recipe for instruction tuning, 7B-13B range
/inspect-dataset HuggingFaceH4/ultrachat_200k
Full plugin docs (slash commands, approval policy, env knobs, troubleshooting): plugin/README.md.
If you want to hack on ml-intern itself or run it as a project (not a plugin), clone the repo and use it as a Claude Code workspace:
git clone [email protected]:guynachshon/claude-code-ml-intern.git
cd claude-code-ml-intern
uv sync
claude
Claude Code picks up:
CLAUDE.md — persona and methodology (research-first, dataset audit, pre-flight checklist for jobs, error-recovery rules)..mcp.json — auto-starts two MCP servers:
ml-intern-tools (stdio) — hf_papers, hf_inspect_dataset, hf_jobs, hf_repo_files, hf_repo_git, explore_hf_docs, fetch_hf_docs, github_*, sandbox bash/read/write/edit.hf-mcp-server (HTTP, hosted at huggingface.co/mcp) — official HF tools..claude/agents/research.md — the parallel research subagent (read-only HF tools)..claude/commands/*.md — /ml-intern, /research, /inspect-dataset, /finetune, /run-job..claude/hooks/*.py — content-aware approval, session redaction+upload, dynamic context injection.Full project-mode guide: CLAUDE_CODE_GUIDE.md.
The original ml-intern CLI, for use without Claude Code:
git clone [email protected]:guynachshon/claude-code-ml-intern.git
cd claude-code-ml-intern
uv sync
uv tool install -e .
Then ml-intern works from any directory:
ml-intern # interactive
ml-intern "fine-tune llama on my dataset" # headless
ml-intern --model anthropic/claude-opus-4-6 "your prompt"
ml-intern --max-iterations 100 "your prompt"
ml-intern --no-stream "your prompt"
Create a .env in the project root (or export in your shell):
ANTHROPIC_API_KEY=<your-anthropic-api-key> # if using anthropic models
HF_TOKEN=<your-hugging-face-token>
GITHUB_TOKEN=<github-personal-access-token>
Without HF_TOKEN, the CLI prompts on first launch. To get a GITHUB_TOKEN, follow the GitHub PAT tutorial.
npx claudepluginhub guynachshon/claude-code-ml-intern --plugin ml-internTeaches Claude Code how to use @chenglou/pretext for DOM-free multiline text measurement and layout.
Intelligent draw.io diagramming plugin with AI-powered diagram generation, multi-platform embedding (GitHub, Confluence, Azure DevOps, Notion, Teams, Harness), conditional formatting, live data binding, and MCP server integration for programmatic diagram creation and management.
Access thousands of AI prompts and skills directly in your AI coding assistant. Search prompts, discover skills, save your own, and improve prompts with AI.
Feature development with code-architect/explorer/reviewer agents, CLAUDE.md audit and session learnings, and Agent Skills creation with eval benchmarking from Anthropic.
Production-grade engineering skills for AI coding agents — covering the full software development lifecycle from spec to ship.
Comprehensive skill pack with 66 specialized skills for full-stack developers: 12 language experts (Python, TypeScript, Go, Rust, C++, Swift, Kotlin, C#, PHP, Java, SQL, JavaScript), 10 backend frameworks, 6 frontend/mobile, plus infrastructure, DevOps, security, and testing. Features progressive disclosure architecture for 50% faster loading.
Persistent file-based planning for AI coding agents. Crash-proof markdown plans (task_plan.md, findings.md, progress.md) that survive context loss and /clear, with an opt-in completion gate and multi-agent shared state. Manus-style. Works with Claude Code, Codex CLI, Cursor, Kiro, OpenCode and 60+ agents via the SKILL.md standard. Includes Arabic, German, Spanish, and Chinese (Simplified and Traditional).