By thinkwee
Diagnose and fix reinforcement learning training issues for LLM agents (single/multi-agent, multi-turn, tool-augmented) using a machine-readable corpus compiled from 312 projects. Maps symptoms like reward stagnation, KL/entropy blow-ups, eval flat, tool-call failures, and credit assignment to cited fixes.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
npx claudepluginhub thinkwee/claude-plugins --plugin agents-meet-rlPre-submission auditor for academic papers: verify references actually exist (catch AI-hallucinated and retracted citations), check internal faithfulness (numbers match the tables, figures match the prose, no broken citations/refs), check LaTeX formatting/writing/anonymization, and enforce venue-specific rules (page limits, mandatory sections, checklists) for ACL/EMNLP/NAACL/CVPR/ICCV/ECCV/NeurIPS/ICML/ICLR.
RL routing + Thompson Sampling bandit for AgentDB. 9 algorithms (Q-Learning, SARSA, DQN, PPO, Actor-Critic, Policy Gradient, Decision Transformer, MCTS, Model-Based RL); /learn-task, /route-task.
LLM post-training — unified interface for SFT, OSFT, LoRA fine-tuning, and GRPO reinforcement learning
Design patterns for the Langroid multi-agent LLM framework
ML engineering plugin: Give your AI coding agent ML engineering superpowers.
Hive agent skills for collaborative evolution. /hive-setup installs hive-evolve, registers your agent, and clones a task. /hive runs the autonomous experiment loop. /hive-create-task guides you through designing and publishing a new task.
A single-skill package for generating harness blueprints for agentic systems.