Run inference-time scaling on LLM prompts—batch process JSONL, CSV, or TXT files to generate multiple candidates and select the best using voting, scoring, or search algorithms like Self-Consistency, Best-of-N, and Beam Search
Use when the user wants to run inference-time scaling on multiple prompts from a file (JSONL, CSV, or TXT). Applies to batch processing, evaluation runs, or dataset-level scaling.
Guides users through inference-time scaling with its_hub, including algorithm selection (Self-Consistency, Best-of-N, Beam Search, Particle Filtering), budget tuning, reward model setup, tool-calling integration, interpreting results, and troubleshooting. Use when the user is working with its_hub, asking about scaling algorithms, debugging scaling issues, or tuning inference quality.
Use when the user wants to run inference-time scaling on a prompt — detect environment, execute scaling, and present results. For algorithm selection, budget tuning, reward models, and troubleshooting, consult the inference-scaling-guide skill.
Use when the user wants to set up inference-time scaling for the first time, or when its_hub is not yet installed/configured in the current environment.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
npx claudepluginhub red-hat-ai-innovation-team/its_hub --plugin its-hubSynthetic data generation — composable blocks and YAML-defined flows for building LLM training datasets
LLM post-training — unified interface for SFT, OSFT, LoRA fine-tuning, and GRPO reinforcement learning
Orchestrate multiple open-weight LLMs via Fireworks AI to deliberate on queries using Karpathy's LLM Council concept. Models respond individually, rank each other's responses, then a Chairman synthesizes the best answer. Powered entirely by fast, affordable open-weight models on Fireworks.
A real-time directory of AI models that allows your AI agent to advise and pick the ideal LLM for the user's task.
Professional AI/ML Engineering toolkit: Prompt engineering, LLM integration, RAG systems, AI safety with 12 expert plugins
Smart LLM routing with Claude subscription monitoring, complexity-first model selection, and 20+ AI providers
Evaluate and compare ML model performance metrics
OpenRouter SDK plugin - unified interface for 500+ LLM models with intelligent routing, cost optimization, and framework integrations (Vercel AI SDK, LangChain, OpenAI SDK, PydanticAI)