My plugins for Claude Code.
/plugin install <plugin-name>@ben-claude-plugins
| Plugin | Description |
|---|---|
| rodney | Browser automation via the rodney CLI for web scraping, frontend verification, and page interaction |
| eval-designer | Design production-quality LLM evaluations for Langfuse |
| langfuse | Query Langfuse LLM observability platform via the lf CLI |
| obsidian-agent-tools | CDP tools for Obsidian plugin development and testing |
| readme-generator | Generate excellent README files following best practices from https://github.com/matiassingers/awesome-readme |
| resend | Send emails and manage domains, API keys, and templates via the Resend CLI |
| sprite | Manage Sprites - persistent, isolated Linux microVMs for safe code execution |
MIT
Based on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimManage Sprites - persistent, isolated Linux microVMs for safe code execution
Generate excellent README files following best practices from awesome-readme
Obsidian CLI tools for plugin development, testing, and vault automation
Browser automation via the rodney CLI for web scraping, frontend verification, and page interaction
npx claudepluginhub tavva/ben-claude-plugins --plugin eval-designerTeaches AI coding agents to create promptfoo eval suites with deterministic assertions, provider configs, and best practices
Skills for adding DeepEval evaluations, tracing, datasets, Confident AI reports, and iterative improvement loops to AI applications.
Claude Code skill pack for Langfuse LLM observability (24 skills)
Skills for building LLM evaluations: pipeline audit, error analysis, synthetic data generation, LLM-as-Judge design, evaluator validation, RAG evaluation, and annotation interfaces.
Measure AI output quality, user satisfaction, task success, and design effectiveness.
Skills for working with Langfuse, the open-source LLM engineering platform for tracing, prompt management, and evaluation.