From training-hub
Guides users through setting up LLM training with training_hub: environment detection, installation, GPU checks, and configuration.
How this skill is triggered — by the user, by Claude, or both
Slash command
/training-hub:setup-guideThis skill is limited to the following tools:
The summary Claude sees in its skill listing — used to decide when to auto-load this skill
You are helping the user set up LLM training. For algorithm selection guidance, hyperparameter tuning, and troubleshooting, consult the `training-hub-guide` skill.
You are helping the user set up LLM training. For algorithm selection guidance, hyperparameter tuning, and troubleshooting, consult the training-hub-guide skill.
"${CLAUDE_PLUGIN_ROOT}/scripts/th_detect.sh"
If library=missing:
installer=uv: run uv pip install training-hubinstaller=pip: run pip install training-hubinstaller=none: tell the user they need Python and pip/uv installed firsttraining-hub[cuda] — flash-attn, bitsandbytes for GPU accelerationtraining-hub[lora] — Unsloth, TRL for parameter-efficient fine-tuningtraining-hub[grpo] — ART, veRL for reinforcement learningFor installation issues, consult the training-hub-guide skill (installation-troubleshooting section).
If gpu=unavailable, warn: "No GPU detected. Training requires CUDA-capable GPUs. You can still configure, but training will fail without a GPU."
Report GPU count if available.
If the user has a clear task ("fine-tune Llama on my data"), offer a fast path with sensible defaults:
"I detected N GPU(s). I can set up with these defaults:
- Algorithm:
lora_sft(parameter-efficient, works on a single GPU)- Learning rate:
1e-5- Epochs:
2- Batch size:
64- Max sequence length:
4096You'll just need to provide your model path and data path. Accept these defaults, or customize?"
If the user accepts, ask only for model path and data path, then skip to Step 7.
If the user wants to customize, proceed with the full configuration.
Ask these questions one at a time:
training-hub-guide skill for algorithm selection guidance if the user is unsure.meta-llama/Llama-3.1-8B-Instruct, or a local path.messages field../outputCollect hyperparameters based on the chosen algorithm. Consult the training-hub-guide skill (hyperparameter-guide section) for recommended defaults by dataset size and algorithm.
Ask: "Do you want to configure experiment tracking?" (W&B, MLflow, or TensorBoard). See the training-hub-guide skill for logger details.
Write the config to .training-hub/config.json:
{
"algorithm": "<algorithm>",
"model_path": "<model_path>",
"data_path": "<data_path>",
"ckpt_output_dir": "<output_dir>",
"nproc_per_node": N,
"hyperparams": { ... },
"algorithm_config": { ... },
"logging": { ... }
}
Add .training-hub/ to .gitignore if not already present.
Ask: "Want me to estimate GPU memory requirements before training?"
If yes, run:
"${CLAUDE_PLUGIN_ROOT}/scripts/th_estimate.sh"
If this skill is invoked again and a config already exists, ask: "You already have a configuration. Do you want to update it or start fresh?"
npx claudepluginhub red-hat-ai-innovation-team/training_hub --plugin training-hubGuides LLM post-training with Training Hub: installation, algorithm selection (SFT, OSFT, LoRA), hyperparameter tuning, OOM troubleshooting, loss curve interpretation, and CUDA/GPU debugging.
Trains or fine-tunes language/vision models using TRL or Unsloth on Hugging Face Jobs cloud GPUs. Supports SFT, DPO, GRPO, reward modeling, and GGUF export for local deployment.
Train or fine-tune TRL language models on Hugging Face Jobs using SFT, DPO, GRPO, Reward Modeling, with GGUF export for local deployment.