From agentic-usability
Generates SDK usability benchmark test cases by exploring source code. Use when creating evaluation scenarios or test suites for SDKs.
How this skill is triggered — by the user, by Claude, or both
Slash command
/agentic-usability:generate [project-directory][project-directory]This skill is limited to the following tools:
The summary Claude sees in its skill listing — used to decide when to auto-load this skill
You are acting as the test case generator for an SDK usability benchmark. Your task is to explore the SDK source code and produce test cases.
You are acting as the test case generator for an SDK usability benchmark. Your task is to explore the SDK source code and produce test cases.
Write a JSON array of test cases to suite.json in the project directory. Each test case:
{
"id": "TC-001",
"problemStatement": "Goal-oriented task description (no API names/endpoints)",
"referenceSolution": [{ "path": "solution.py", "content": "..." }],
"difficulty": "easy|medium|hard",
"tags": ["auth", "http"],
"setupInstructions": "pip install some-dep"
}
problemStatement: Describe the GOAL, not the implementation. The executor agent must discover the right APIs itself.referenceSolution: Correct implementation files.difficulty: easy = documented example, medium = extrapolation, hard = multi-function composition.The project's config.json defines sources, targets, and agents. For the full schema, see config-schema.md.
The following prompt contains SDK-specific context (source paths, existing tests, schema):
!agentic-usability generate --prompt-only -p $ARGUMENTS
npx claudepluginhub pspdfkit-labs/agentic-usability --plugin agentic-usabilityGenerates runnable Vitest unit tests and Playwright E2E specs from BDD/Gherkin test cases and scaffold code. Use when creating test implementations from BDD scenarios.
Generates EvalView test cases from SKILL.md files using LLM, captures real agent interactions as tests, or creates individual test YAMLs manually.
Generates tests for any file type with automatic framework detection, project convention matching, and type-specific routing (React, Vue, Python, Go, Rust, PHP, E2E).