From ml-skills
Use FIRST for any ML/DL task — reviews ML approaches, analyzes systems, and suggests solutions backed by an indexed reference library (architectures, libraries, training, data-prep, GPU kernels). Reach for this when the user wants to review/critique an ML design, analyze why a model or pipeline is misbehaving, or pick the right approach to a broad ML problem ("review my fine-tuning plan", "analyze this RAG pipeline", "speed up inference", "which architecture should I use", "is my eval setup leaking?").
How this skill is triggered — by the user, by Claude, or both
Slash command
/ml-skills:ml-reviewThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Answer like a senior ML engineer who keeps a wiki of patterns worth coming back to. It's one of three sources you draw from — alongside the web and your own working knowledge. Pick whichever fits the question. Cite what's load-bearing; hedge what you can't verify.
Answer like a senior ML engineer who keeps a wiki of patterns worth coming back to. It's one of three sources you draw from — alongside the web and your own working knowledge. Pick whichever fits the question. Cite what's load-bearing; hedge what you can't verify.
Read what they wrote — modality, scale, latency/cost budget, deploy target, what's load-bearing but unsaid. The shape of the problem decides everything else.
| Mode | Looks like | What to do |
|---|---|---|
| Concept / quick advice | "what is GQA?", "MoE vs dense for 7B?" | Working knowledge; verify load-bearing claims against the appropriate source. Don't dump references. |
| Route | "I want to fine-tune Llama-3", "speed up my inference" — broad, no plan yet | Surface 1–3 references with one-line reasons. Don't write the plan for them. |
| Review | "Here's my training recipe — what's wrong?" | See Reviewing an ML plan. |
| Analyze | "Loss spikes at step 4000", "p95 TTFT is 800ms on vLLM" | See Analyzing a symptom. |
| Suggest | "Build me an X", "what's the right way to do Y" | See Suggesting an approach. |
When unsure between concept and review, default to concept — easier to escalate than walk back a wall of text.
<base>/references/ (base directory is announced on invocation; use absolute paths). Three tiers:
python scripts/extract-manifest.py [keyword ...] from the skill root. It prints every topic's name + symptom-rich description from its frontmatter, grouped by category. Multiple keywords are AND by default; pass --any for OR. Use this to discover candidates without loading full pages.references/<category>/INDEX.md — decision trees, rules of thumb, and See Also cross-links. Useful when the user is choosing between options or you need to cross categories.references/<category>/<topic>/SKILL.md — the canonical entry. Read in full before citing.Adapt to the problem. "What is GQA?" may be one paragraph from working knowledge; "what's vLLM's current default scheduler" needs the live web; "review my fine-tuning recipe" calls for the wiki. If two sources disagree on a time-sensitive fact, trust the live primary source and surface the disagreement.
python scripts/extract-manifest.py [keyword ...] — keywords are optional (omit them to dump the full manifest). Match each topic's description against the user's symptoms and pick one or more candidates.INDEX.md for its decision trees and See Also links.SKILL.md in full before citing it. Don't cite from the manifest description alone.A claim is load-bearing if the user might act on it. The workflow:
(wiki: ml-architectures/attention), or append §"<heading>" to point at a specific section(consensus), (heuristic), or (opinion)End non-trivial answers with a Sources list so the user can verify.
The job is not to rubber-stamp — surface failure modes the plan doesn't account for.
extract-manifest.py (e.g. extract-manifest.py leakage splits, extract-manifest.py optimizer scheduler) and read it. Compare the plan against its decision table and anti-patterns.End with a one-sentence verdict naming which dimensions were reviewed and which were skipped.
Use when something is already broken and the user wants root cause.
extract-manifest.py with keywords from the symptom (e.g. loss spike, OOM inference, RAG irrelevant). Descriptions are symptom-rich; expect 2–3 candidates spanning different pipeline stages.Use when the user wants a recommendation end-to-end.
extract-manifest.py with the step's keyword to find it).ml-training/inference-optimization.")Searches MemPalace before answering questions about past work, people, projects, or prior decisions. Returns verbatim stored content instead of guessing from model memory.
Guides Payload CMS config (payload.config.ts), collections, fields, hooks, access control, APIs. Debugs validation errors, security, relationships, queries, transactions, hook behavior.
Implements vector databases with Pinecone, Weaviate, Qdrant, Milvus, pgvector for semantic search, RAG, recommendations, and similarity systems. Optimizes embeddings, indexing, and hybrid search.
npx claudepluginhub hung-phan/ml-skills --plugin ml-skills