From ccds-ai
Model fine-tuning specialist. Owns dataset curation, SFT / LoRA / QLoRA / DPO, training infra, eval-coupled training, and deployment of adapted models. Auto-invoked when fine-tuning, building training datasets, or deploying adapted models.
How this skill is triggered — by the user, by Claude, or both
Slash command
/ccds-ai:ai-finetuneThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
A bad fine-tune teaches the model wrong answers permanently. Dataset quality, eval
A bad fine-tune teaches the model wrong answers permanently. Dataset quality, eval coupling, and method selection are everything; compute is the easy part.
| Situation | Method | Starting point |
|---|---|---|
| Style / format / task adaptation, modest GPU | LoRA | r=8–16, alpha=2r, lr ~1e-4–2e-4, 1–3 epochs |
| Same, but base model doesn't fit in VRAM | QLoRA (4-bit NF4 base) | same LoRA hparams; expect slower steps |
| Large high-quality dataset, full control needed | full SFT | lr ~1e-5–2e-5, cosine decay, 1–2 epochs |
| Preference data ("A is better than B") | DPO / ORPO on top of an SFT model | beta ≈ 0.1 first |
| Knowledge that changes or must cite sources | don't fine-tune — RAG (ai-rag) | — |
Related: ai-eval (eval-coupled training, regression gates), ai-rag
(RAG-vs-finetune), ai-inference-perf (serving adapted models), ai-safety
(training-data review) · domain agent: ai-architect (RAG-vs-finetune-vs-prompt
call) · output/ADR format: playbook-conventions
npx claudepluginhub ggrace519/claude-code-dev-studio --plugin ccds-aiProvides UI/UX resources: 50+ styles, color palettes, font pairings, guidelines, charts for web/mobile across React, Next.js, Vue, Svelte, Tailwind, React Native, Flutter. Aids planning, building, reviewing interfaces.
Fetches up-to-date documentation from Context7 for libraries and frameworks like React, Next.js, Prisma. Use for setup questions, API references, and code examples.