Search everything...

Stats

Actions

Available In

agent-agentic-os

Name: agent-agentic-os
Author: richfrem

By richfrem

An opinionated learning layer and harnessing discipline above what Claude Code ships natively. Provides a structured memory hierarchy, a continuous improvement loop for model instructions, and multi-agent event bus coordination. Designed for developers running long-horizon workflows who need a cohesive feedback control system rather than isolated orchestration primitives.

npx claudepluginhub richfrem/agent-plugins-skills --plugin agent-agentic-os

Popularity

Stars

Above avg

Med: 0·Avg: 285

Installs

Med: 0·Avg: 1

What's Inside

Slash Commands2

os-init

/os-init

Bootstrap the project by triggering the agentic-os-setup conversational architect.

os-memory

/os-memory

Force garbage collection and conflict resolution on the tiered memory system.

Agents4

agentic-os-setup

/agentic-os-setup

Trigger with "use the agentic-os-setup agent", "run the setup agent", "set up an agentic OS", "persist memory", "add the OS harness", or when the user requires memory persistence, repository-level conventions, or autonomous background loops. Directs the orchestration, synthesis, and provisioning of a persistent AI environment. <example> Context: User wants to initialize their project for AI agents. user: "Can you help me set up an agentic OS in this folder?" assistant: "I'll use the agentic-os-setup agent to handle the full orchestration for you." <commentary> User requesting specific specialized task execution. Trigger agent. </commentary> </example> <example> Context: A non-technical user wants the AI to remember things. user: "How do I get Claude to persist its memory in my repo between sessions?" assistant: "I'll launch the agentic-os-setup agent to scaffold a persistent memory environment for you." <commentary> User asking for a core Agentic OS feature (persistence). Trigger agent. </commentary> </example> <example> Context: User has an existing codebase but no .claude config. user: "I already have a big project here, can you just add the OS harness without breaking it?" assistant: "Yes, I will run the agentic-os-setup agent to carefully layer the Agentic OS into your existing project." <commentary> Partial setup / integration requested. Trigger agent. </commentary> </example>

os-health-check

/os-health-check

Trigger with "run health check", "check os metrics", "system monitor", or when the user wants to review the Agentic OS liveness metrics across the Event Bus, locks, and memory arrays. <example> user: "Run a system monitor check on the OS." assistant: "I'll execute the os-health-check agent to scan the event bus and state file." <commentary> User explicitly requested a system diagnostic, triggering the health check agent. </commentary> </example>

triple-loop-architect

/triple-loop-architect

Interactive entry point for starting a skill evaluation loop via the Triple-Loop Learning System. Trigger with "eval [skill]", "evaluate [skill]", "run eval on [skill]", "setup triple-loop lab for [skill]". Handles full setup using the canonical Sibling Repo Labs protocol (creates an isolated repo for safe iteration). <example> Context: User wants to start an eval loop on a skill safely. user: "eval using-git-worktrees" assistant: [triggers triple-loop-architect, resolves skill path, scaffolds sibling lab repo, prepares evals] </example>

triple-loop-orchestrator

/triple-loop-orchestrator

Unattended overnight Triple-Loop Learning orchestrator. Oversees the autonomous INNER looping (Strategic Double-Loop and Tactical Single-Loop) on a target skill in its isolated sibling lab. Uses Gemini or Copilot CLI for proposals, gated strictly by objective `evaluate.py` performance. Trigger with "trigger the triple-loop-orchestrator on [skill] for [N] iterations", or "run orchestrator all night on [skill]". <example> Context: User wants to improve a skill headlessly. user: "Trigger triple-loop-orchestrator on link-checker for 80 iterations." assistant: "Launching the Triple-Loop Orchestrator to oversee unattended iterations on the link-checker lab..." </example>

Skills11

os-clean-locks

/os-clean-locks

Trigger with "/os-clean-locks", "clear all locks", "reset agent locks", or when an agent is deadlocked and cannot acquire a lock because a previous agent crashed and left a stale lock behind in `context/.locks/`. <example> Context: User is seeing errors about locks already existing. user: "/os-clean-locks" assistant: <Bash> rm -r context/.locks/ python3 context/kernel.py state_update active_agent os-clean-locks </Bash> </example> <example> Context: Agent detects a deadlock when trying to acquire a lock during a task. assistant: [autonomously] "The acquire_lock call for 'memory' failed -- a prior agent likely crashed and left a stale lock. I'll invoke os-clean-locks to clear it before retrying." <commentary> Implicit audit trigger -- agent detects deadlock from kernel output and self-heals using os-clean-locks without user prompting. </commentary> </example>

os-eval-backport

/os-eval-backport

Reviews a completed os-eval-runner lab run and backports approved changes to master plugin sources. Trigger with "backport the eval results", "review the lab run", "apply eval improvements to master", "check what the eval agent changed".

os-eval-lab-setup

/os-eval-lab-setup

Bootstraps a skill evaluation lab repo for an autoresearch improvement run. Trigger with "set up an eval lab", "bootstrap the eval repo", "prepare the test repo for skill evaluation", "create an eval environment for this skill", "set up the lab space for this skill", or when starting a new skill optimization run that needs a standalone test environment. <example> Context: User wants to start an improvement run on a skill in an isolated lab repo. user: "Set up an eval lab for the link-checker skill" assistant: [triggers os-eval-lab, runs intake interview, bootstraps lab repo, installs engine, copies plugin files, generates eval-instructions.md] </example> <example> Context: User has a lab repo but needs it configured. user: "Prepare the test repo at <USER_HOME>/Projects/test-my-skill-eval for skill evaluation" assistant: [triggers os-eval-lab, installs engine, copies plugin files, generates eval-instructions.md] </example>

os-eval-runner

/os-eval-runner

Trigger: "evaluate this skill", "run autoresearch loop on", "optimize this skill". Use when an agent proposes a change to an existing skill and needs empirical validation. <example> Context: Start autonomous improvement loop on a skill. user: "Run the autoresearch loop on <SKILL_PATH> for 20 iterations" assistant: [triggers os-eval-runner, runs Mode 1 intake] </example> <example> Context: Incomplete optimize request. user: "Optimize the commit skill" assistant: [triggers os-eval-runner, runs Phase 0 intake interview] </example> <example> Context: `Triple-Loop Retrospective` proposes a skill edit. assistant: [autonomously] "Before I apply this description change, I'll run os-eval-runner to confirm." </example> <example> Context: An agent is asking for general information about a skill, not evaluating a proposed change. agentic-os-setup: "Tell me about the os-clean-locks skill." assistant: "It cleans up stale lock files..." </example>

os-guide

/os-guide

Trigger with "explain agentic os", "how do I set up a persistent agent environment", "what is the CLAUDE.md hierarchy", "explain the context folder structure", "how does session memory work", "what is soul.md or user.md", "explain auto-memory or MEMORY.md", "what is a loop scheduler or heartbeat", or when the user asks for the canonical guide.

Hooks1

Event Hooks

All tools

3 hooks across 3 events

Stats

Version1.4.1

LanguagePython

Stars2

Forks2

MaintenanceExcellent

LicenseMIT

Last CommitApr 5, 2026

AddedMar 28, 2026

Actions

View on GitHub View README Plugin Marketplace JSON

Own this plugin?

Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).

Available In

richfrem-agent-plugins-skills3

Safety Signals

Critical

Matches all tools

Hooks run on every tool call, not just specific ones

Caution

Uses power tools

Uses Bash, Write, or Edit tools

README

Universal Agent Plugins & Skills Ecosystem

Project Overview

A strictly cross-platform (Windows, Mac, Ubuntu) library that serves as the universal upstream source for reusable AI agent plugins and skills across multiple IDEs and agent frameworks:

Claude Code, GitHub Copilot, Gemini CLI, Antigravity, Roo Code, Windsurf, Cursor, and other compliant integrations.
Now universally supporting the single .agents/ folder standard (no duplicate copies needed for .github, .gemini, .agent, etc).

120 skills across 29 plugins — all maintained from a single hub-and-spoke source tree.

Core Philosophy: Transitional Architectures & Decoupled Skills

This repository is built on a pragmatic acceptance of the current AI engineering landscape: the ecosystem changes weekly, and workflows that were revolutionary six months ago are obsolete today.

Frameworks like agent-agentic-os, spec-kitty, and agent-execution-disciplines are treated as Transitional Architectures — bridges between what agents need to do today and what native SDKs will eventually handle. When Anthropic, Google, and GitHub harden native memory persistence, execution safety, and multi-agent orchestration, large swaths of this tooling will be happily discarded.

Skills are Applications; the SDK is the OS. Individual skills must function in complete isolation — no hard dependencies on sibling plugins, no assumptions about which framework is running.

Installation

[!IMPORTANT] Start here — fresh clone or first-time setup. The single .agents/ environment directory is not committed to your repo. It will be empty by default.

All installation methods (uvx, bootstrap.py, npx skills, and Claude Marketplace) are now consolidated in a single authoritative guide:

👉 Go to INSTALL.md

Architecture Highlights

Triple-Loop Autonomous Skill Improvement

The agent-agentic-os plugin implements a Triple-Loop architecture for continuous, autonomous skill improvement:

Layer	Agent	Role
L0	`triple-loop-architect` (Claude)	Interactive setup: scaffolds isolated sibling lab, seeds all files, launches L1
L1	Gemini CLI (`gemini --yolo --model gemini-3-flash-preview`)	Headless orchestrator: reads `eval-instructions.md`, runs the loop, gates via `evaluate.py`
L2	Copilot CLI (`gpt-5-mini`)	Cheap mutation proposer: proposes SKILL.md edits using free Copilot quota

The loop is autonomous and cost-effective: L2 uses GitHub Copilot's gpt-5-mini (free quota), enabling 20–80 mutation proposals per run at near-zero cost. L1 (Gemini Flash) orchestrates unattended overnight. evaluate.py is the absolute gate — exit 0 = KEEP, exit 1 = DISCARD + auto-revert.

Not all skills are good candidates — the best targets have clear, objective routing criteria and adversarial eval cases. Use eval-autoresearch-fit to score a skill before running a loop.

To start a loop on any skill:

@triple-loop-architect

Kick off a 10-iteration Triple-Loop optimization run targeting the `<skill-name>` skill
inside the `<plugin-folder>` plugin. Use gemini-3-flash-preview as L1 and gpt-5-mini as L2.

See the full sample prompt: references/sample-prompts/triple-loop-architect-prompt.md

Live example — convert-mermaid skill, 26 iterations across 2 rounds: 0.61 → 1.00

convert-mermaid eval progress

Each blue diamond is a baseline anchor (one per session). Green = new best score. Amber = kept but not a record. The two-segment shape shows a fresh re-baseline for round 2 — the plotter handles this automatically.

Monitor a live run: python3 plugins/agent-agentic-os/scripts/plot_eval_progress.py --tsv <lab>/evals/ --live

Flywheel layers:

OUTER flywheel (os-improvement-loop): improves OS-level protocols and session ledgers between sessions
INNER flywheel (os-eval-runner + os-skill-improvement): improves individual skill routing accuracy within a session
Overnight (os-nightly-evolver): runs the INNER flywheel unattended — see agents/os-nightly-evolver.md

Karpathy Autoresearch Loop

Skills that score HIGH on the autoresearch viability rubric (objectivity + speed + frequency + utility) can run fully autonomous self-improvement loops:

mutate SKILL.md → evaluate.py → exit 0 (KEEP) or exit 1 (DISCARD) → repeat

Ecosystem Fitness Sweep v1 is complete — all 116/120 production skills scored for autoresearch viability. Results:

View full README on GitHub

agent-agentic-os

Popularity

What's Inside

Confidence

README

Universal Agent Plugins & Skills Ecosystem

Project Overview

Core Philosophy: Transitional Architectures & Decoupled Skills

Installation

👉 Go to INSTALL.md

Architecture Highlights

Triple-Loop Autonomous Skill Improvement

Karpathy Autoresearch Loop

Similar Plugins

open-agent-hub

agent-context-manager

agent-orchestration

helloagents

agenthub

fullstack-dev-skills

More by richfrem

plugin-manager

dependency-management

agent-scaffolders

spec-kitty

exploration-cycle-plugin

Universal Agent Plugins & Skills Ecosystem

Project Overview

Core Philosophy: Transitional Architectures & Decoupled Skills

Installation

👉 Go to INSTALL.md

Architecture Highlights

Triple-Loop Autonomous Skill Improvement

Karpathy Autoresearch Loop

More by richfrem

plugin-manager

dependency-management

agent-scaffolders

spec-kitty

exploration-cycle-plugin

Popularity

Health & Quality

Similar Plugins

open-agent-hub

agent-context-manager

agent-orchestration

helloagents

agenthub

fullstack-dev-skills