By richfrem
Meta-plugin containing the ecosystem generation primitives. Includes scaffolding for Agent Skills, Plugins, CLI sub-agents, autonomous GitHub workflows, Azure Foundry agents, and more.
Validate a plugin's structure, components, and security
../skills/create-agentic-workflow/SKILL.md
../skills/create-azure-agent/SKILL.md
../skills/create-docker-skill/SKILL.md
../skills/create-github-action/SKILL.md
Scaffold a GitHub agentic workflow from an existing skill
Deploy a skill as an Azure AI Foundry hosted agent
This skill should be used when the user asks to "create a slash command", "add a command", "write a custom command", "define a command with arguments", "create a command that runs bash", "add a /command to my plugin", "use $ARGUMENTS in a command", "set up argument-hint", "create a workflow command", "interactive command", or needs guidance on slash command structure, YAML frontmatter fields, file references, bash execution, command organization, or command best practices. Use this skill whenever Claude Code slash commands are mentioned even without the word "command" -- e.g. "I want a shortcut that reviews PRs" or "automate my deploy workflow" should trigger this. Do NOT use this for hooks (use create-hook), skills (use create-skill), or agents (use create-sub-agent).
Scaffold an agent skill with Docker runtime support
Scaffold a deterministic GitHub Actions CI/CD workflow
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
A strictly cross-platform (Windows, Mac, Ubuntu) library that serves as the universal upstream source for reusable AI agent plugins and skills across multiple IDEs and agent frameworks:
.agents/ folder standard (no duplicate copies needed for .github, .gemini, .agent, etc).120 skills across 29 plugins — all maintained from a single hub-and-spoke source tree.
This repository is built on a pragmatic acceptance of the current AI engineering landscape: the ecosystem changes weekly, and workflows that were revolutionary six months ago are obsolete today.
Frameworks like agent-agentic-os, spec-kitty, and agent-execution-disciplines are treated as Transitional Architectures — bridges between what agents need to do today and what native SDKs will eventually handle. When Anthropic, Google, and GitHub harden native memory persistence, execution safety, and multi-agent orchestration, large swaths of this tooling will be happily discarded.
Skills are Applications; the SDK is the OS. Individual skills must function in complete isolation — no hard dependencies on sibling plugins, no assumptions about which framework is running.
[!IMPORTANT] Start here — fresh clone or first-time setup. The single
.agents/environment directory is not committed to your repo. It will be empty by default.All installation methods (uvx, bootstrap.py, npx skills, and Claude Marketplace) are now consolidated in a single authoritative guide:
👉 Go to INSTALL.md
The agent-agentic-os plugin implements a Triple-Loop architecture for continuous, autonomous skill improvement:
| Layer | Agent | Role |
|---|---|---|
| L0 | triple-loop-architect (Claude) | Interactive setup: scaffolds isolated sibling lab, seeds all files, launches L1 |
| L1 | Gemini CLI (gemini --yolo --model gemini-3-flash-preview) | Headless orchestrator: reads eval-instructions.md, runs the loop, gates via evaluate.py |
| L2 | Copilot CLI (gpt-5-mini) | Cheap mutation proposer: proposes SKILL.md edits using free Copilot quota |
The loop is autonomous and cost-effective: L2 uses GitHub Copilot's gpt-5-mini (free quota), enabling 20–80 mutation proposals per run at near-zero cost. L1 (Gemini Flash) orchestrates unattended overnight. evaluate.py is the absolute gate — exit 0 = KEEP, exit 1 = DISCARD + auto-revert.
Not all skills are good candidates — the best targets have clear, objective routing criteria and adversarial eval cases. Use eval-autoresearch-fit to score a skill before running a loop.
To start a loop on any skill:
@triple-loop-architect
Kick off a 10-iteration Triple-Loop optimization run targeting the `<skill-name>` skill
inside the `<plugin-folder>` plugin. Use gemini-3-flash-preview as L1 and gpt-5-mini as L2.
See the full sample prompt: references/sample-prompts/triple-loop-architect-prompt.md
Live example — convert-mermaid skill, 26 iterations across 2 rounds: 0.61 → 1.00

Each blue diamond is a baseline anchor (one per session). Green = new best score. Amber = kept but not a record. The two-segment shape shows a fresh re-baseline for round 2 — the plotter handles this automatically.
Monitor a live run: python3 plugins/agent-agentic-os/scripts/plot_eval_progress.py --tsv <lab>/evals/ --live
Flywheel layers:
os-improvement-loop): improves OS-level protocols and session ledgers between sessionsos-eval-runner + os-skill-improvement): improves individual skill routing accuracy within a sessionos-nightly-evolver): runs the INNER flywheel unattended — see agents/os-nightly-evolver.mdSkills that score HIGH on the autoresearch viability rubric (objectivity + speed + frequency + utility) can run fully autonomous self-improvement loops:
mutate SKILL.md → evaluate.py → exit 0 (KEEP) or exit 1 (DISCARD) → repeat
Ecosystem Fitness Sweep v1 is complete — all 116/120 production skills scored for autoresearch viability. Results:
npx claudepluginhub richfrem/agent-plugins-skills --plugin agent-scaffoldersOrchestration hub for the plugin ecosystem. Manages structural audits, agent environment sync (install + cleanup orphans), cross-repo plugin replication, and Universal Bridge Installation to adapt standard plugins to 30+ target environments (GitHub Copilot, Gemini, Cursor, Roo, etc).
Python dependency management with pip-compile locked-file workflow and tiered hierarchy for Python backends.
Spec-Driven Development lifecycle + Universal Bridge sync engine — the flagship workflow plugin for AI-assisted feature development
Autonomous discovery, business requirements capture, and prototyping loop for spec-driven engineering.
An opinionated learning layer and harnessing discipline above what Claude Code ships natively. Provides a structured memory hierarchy, a continuous improvement loop for model instructions, and multi-agent event bus coordination. Designed for developers running long-horizon workflows who need a cohesive feedback control system rather than isolated orchestration primitives.
Unified capability management center for Skills, Agents, and Commands.
Create and validate production-grade agent skills with 100-point marketplace grading
Ultimate Claude Code skill creator. Design, scaffold, build, review, evolve, and publish production-grade AI agent skills following the Agent Skills open standard and 3-layer architecture.
Open collection of AI agent skills — reusable, framework-agnostic SKILL.md packages
Collection of agent skills
Skills for creating new agent skills for Claude Code and VS Code Copilot