By tessaryai
Generate a calibrated eval suite for your LLM product — graders, datasets, and a visual report, straight from your repo.
Claude Code plugins from Tessary AI — drop-in tooling for teams building with LLMs.
In any Claude Code session, add this marketplace and install a plugin:
/plugin marketplace add tessaryai/plugins
/plugin install <plugin-name>@tessary
To update later:
/plugin marketplace update tessary
Point evals at your repo and it generates a complete eval suite for your LLM features: one grader per failure mode (judge prompt, rubric, self-tests), plus a visual report you can open in your browser. Have production traces? Hand them in and graders get calibrated against real data.
/plugin install evals@tessary
Then run /evals:synthesize-graders in a Claude Code session, or just ask Claude to "synthesize evals for this repo."
See the plugin README for flags, packs, and validator usage.
crew is a multi-agent development harness. Hand it a goal and an orchestrator decides what
needs doing and does it — triaging issues, implementing changes through a deliberative team,
reviewing PRs, responding to review feedback, and keeping docs and knowledge current —
running unattended up to a review-ready PR (it never merges; you do). You can also invoke
any single capability directly.
/plugin install crew@tessary
Then either let it drive — /crew:run "close out the open bugs" — or call a primitive
yourself, e.g. /crew:review-pr 42 or /crew:triage-bug 17. Run /crew:init-config once to
tune it to your repo. This release is local-first (runs in your own Claude Code session);
GitHub Actions packaging is planned.
See the plugin README for the full skill list and configuration.
MIT — see LICENSE.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
npx claudepluginhub tessaryai/plugins --plugin evalsA multi-agent dev harness: hand it a goal and the orchestrator triages, implements, reviews, responds to feedback, and keeps docs and knowledge current — autonomously, up to a review-ready PR. Use the individual skills directly, or /crew:run to let it decide what to do.
Agent Skills for AI/ML tasks including dataset creation, model training, evaluation, and research paper publishing on Hugging Face Hub
Comprehensive skill pack with 66 specialized skills for full-stack developers: 12 language experts (Python, TypeScript, Go, Rust, C++, Swift, Kotlin, C#, PHP, Java, SQL, JavaScript), 10 backend frameworks, 6 frontend/mobile, plus infrastructure, DevOps, security, and testing. Features progressive disclosure architecture for 50% faster loading.
A growing collection of Claude-compatible academic workflow bundles. Covers scientific figures, manuscript writing and polishing, reviewer assessment, citation retrieval, data availability, paper reading, literature search, response letters, paper-to-PPTX conversion, and evidence-grounded Chinese invention patent drafting. Rules are organized as reusable skill folders with explicit workflows and quality checks.
Intelligent draw.io diagramming plugin with AI-powered diagram generation, multi-platform embedding (GitHub, Confluence, Azure DevOps, Notion, Teams, Harness), conditional formatting, live data binding, and MCP server integration for programmatic diagram creation and management.
Persistent file-based planning for AI coding agents. Crash-proof markdown plans (task_plan.md, findings.md, progress.md) that survive context loss and /clear, with an opt-in completion gate and multi-agent shared state. Manus-style. Works with Claude Code, Codex CLI, Cursor, Kiro, OpenCode and 60+ agents via the SKILL.md standard. Includes Arabic, German, Spanish, and Chinese (Simplified and Traditional).
Complete creative writing suite with 10 specialized agents covering the full writing process: research gathering, character development, story architecture, world-building, dialogue coaching, editing/review, outlining, content strategy, believability auditing, and prose style/voice analysis. Includes genre-specific guides, templates, and quality checklists.