By blixxurd
A cited, fact-checked reference on building agent harnesses (the runtime/agentic loop around an LLM). A skill routes Claude through the bundled guide, code samples, and primary sources on demand.
Research, code samples, and a master guide on building agent harnesses — the runtime/agentic loop that wraps an LLM and turns it into an autonomous agent. Language-agnostic principles with the Claude Agent SDK as the concrete reference; code examples in TypeScript (primary) and Python.
Intended as durable context for humans and other AI agents working on harness code.
📖 docs/agent-harness-best-practices.md —
the master guide. 12 themed sections + a decision cheat-sheet + anti-patterns, every
claim cited, each linked to a runnable sample.
💻 code_samples/ — runnable, heavily-commented examples, one per
theme (TypeScript + Python). See the code samples index.
📦 raw_outputs/ — the raw research this was built from.
📐 docs/llm-navigable-repo-playbook.md — a
domain-agnostic playbook for emulating this repo's structure: the patterns that make it
cheap for an LLM agent to navigate, with a copyable template and checklist.
This repo is also a Claude Code plugin: it ships a skill that routes Claude through the bundled guide, samples, and sources on demand (it navigates the manifest and reads only the relevant section — it does not load the whole guide into context). Install it in any project:
/plugin marketplace add blixxurd/fidget-marketplace
/plugin install agent-harness@fidget
The skill then auto-triggers on agent-harness work (the loop, tool design, context,
permissions, subagents, evals, …). Packaging lives in
.claude-plugin/ and skills/; the content stays in
docs/, code_samples/, and raw_outputs/ — a single source of truth.
A deep-research workflow fanned out across 13 angles, ran parallel web searches, fetched and saved 25 primary sources, extracted 154 falsifiable claims, and put each through 3-vote adversarial verification (a claim survived only if fewer than 2 of 3 skeptics refuted it). 150 claims survived and were synthesized into the digest and this guide. Sources are overwhelmingly primary: Anthropic engineering & SDK docs, Cognition, OpenAI, and LangChain.
.
├── README.md # you are here
├── llms.txt # LLM-friendly index (llmstxt.org convention)
├── index.json # machine-readable manifest: section ↔ sample ↔ source
├── AGENTS.md # orientation + ground rules for agents/humans
├── LICENSE # MIT
├── docs/
│ └── agent-harness-best-practices.md # the master guide
├── code_samples/
│ ├── README.md # index mapping samples → doc sections
│ ├── typescript/ # 10 samples (primary), type-checked
│ └── python/ # 3 illustrative parallels
├── raw_outputs/ # raw research material
│ ├── README.md # provenance pipeline + join keys
│ ├── search/ # 13 per-angle search dumps
│ ├── sources/ # 26 fetched source pages (full text)
│ │ ├── INDEX.md # citation key ↔ source file ↔ URL ↔ claims
│ │ └── sources.json # machine-readable crosswalk
│ ├── claims/verified-claims.json # 150 survivors + 4 dropped, with sourceId
│ └── synthesis/
│ ├── findings-digest.md # granular 12-theme cited digest
│ └── themes.json # machine-readable themes
└── .github/workflows/typecheck.yml # CI: tsc --noEmit on the TS samples
Using this as LLM context? Point your tool at
llms.txt(a structured index of every doc, sample, and source) or feed it the master guide directly. Agents editing the repo should readAGENTS.mdfirst.
An agent is a bounded loop (
maxTurns+maxBudgetUsd) around a model calling well-designed tools (workflow-shaped, error-proofed args, errors-as-data, bounded output). Treat context as a finite, degrading budget. Keep authorization in code, never the prompt. Default to a single linear agent. Observe everything and evaluate by outcomes, not tool-call paths. Build the thinnest harness that makes today's model reliable — and design it so you can delete the scaffolding as models improve.
See the guide for the cited, detailed version.
MIT © 2026 Fidget Softworks, LLC. The guidance is distilled from publicly available primary sources, each cited in the guide; please consult the originals for authoritative detail.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
npx claudepluginhub blixxurd/fidget-marketplace --plugin agent-harnessRun deep research on a topic and turn its verified findings into a cited, LLM-navigable context-skill — a Claude Code plugin that routes Claude to the one relevant, sourced section on demand. Ships the build pipeline, a copyable repo skeleton, a report→raw_outputs mapper, and a manifest validator.
A growing collection of Claude-compatible academic workflow bundles. Covers scientific figures, manuscript writing and polishing, reviewer assessment, citation retrieval, data availability, paper reading, literature search, response letters, paper-to-PPTX conversion, and evidence-grounded Chinese invention patent drafting. Rules are organized as reusable skill folders with explicit workflows and quality checks.
Harness-native ECC operator layer - 67 agents, 271 skills, 92 legacy command shims, reusable hooks, rules, selective install profiles, and production-ready workflows for Claude Code, Codex, OpenCode, Cursor, and related agent harnesses
Comprehensive feature development workflow with specialized agents for codebase exploration, architecture design, and quality review
Core skills library for Claude Code: TDD, debugging, collaboration patterns, and proven techniques
UI/UX design intelligence. 67 styles, 161 palettes, 57 font pairings, 25 charts, 15 stacks (React, Next.js, Vue, Svelte, Astro, SwiftUI, React Native, Flutter, Tailwind, shadcn/ui, Nuxt, Jetpack Compose). Actions: plan, build, create, design, implement, review, fix, improve, optimize, enhance, refactor, check UI/UX code. Projects: website, landing page, dashboard, admin panel, e-commerce, SaaS, portfolio, blog, mobile app. Elements: button, modal, navbar, sidebar, card, table, form, chart. Styles: glassmorphism, claymorphism, minimalism, brutalism, neumorphism, bento grid, dark mode, responsive, skeuomorphism, flat design. Topics: color palette, accessibility, animation, layout, typography, font pairing, spacing, hover, shadow, gradient.
Real-time statusline HUD for Claude Code - context health, tool activity, agent tracking, and todo progress