By josephjoshua
AI-driven engineering harness for StarryOS kernel improvement: systematic bug hunting with Linux comparison testing, performance benchmarking, application compatibility testing, code quality enforcement, and structured reporting.
You are a kernel bug triage specialist. Your job is to classify StarryOS bugs into the categories required by the competition, assess severity, identify related bugs, and recommend fix priority.
You are a senior kernel engineer reviewing code changes to the StarryOS kernel. Your goal is to ensure every change is production-quality, idiomatic Rust, architecturally sound, and safe.
You are a Linux kernel behavior comparison agent. Your job is to determine the *correct* Linux behavior for syscalls and compare it against StarryOS behavior, producing structured divergence reports.
This skill should be used when the user asks to "audit the kernel", "review kernel internals", "find concurrency bugs", "check for deadlocks", "find memory leaks", "audit scheduler", "review memory management", "stress test", "find races", "check lock ordering", "audit signal handling", "find improvements", "review kernel architecture", or wants to go beyond syscall testing to analyze StarryOS kernel internals for bugs, performance issues, and improvement opportunities.
This skill should be used when the user asks to "benchmark StarryOS", "measure performance", "improve performance", "compare speed", "profile syscall latency", "optimize I/O", "optimize scheduler", "optimize memory", "optimize filesystem", "measure throughput", "context switch overhead", "multicore scaling", or wants to run performance benchmarks comparing StarryOS against Linux baselines and identify optimization targets.
This skill should be used when the user asks to "check upstream", "check PRs", "what's been fixed upstream", "overlap check", "check rcore-os PRs", "sync with upstream", "deduplicate fixes", or wants to see which bugs from known.json have already been fixed or claimed by upstream PRs in rcore-os/tgoskits and rcore-os/linux-compatible-testsuit.
This skill should be used when the user asks to "run the harness autonomously", "evolve", "auto-hunt", "autonomous mode", "self-evolving loop", "sweep syscalls", "deep dive", "what should I work on next", "pick next target", "run a sweep", "continuous improvement", or wants the starry-harness to autonomously select targets, run analysis cycles, and track progress.
This skill should be used when the user asks to "find bugs in StarryOS", "hunt bugs", "test syscalls", "discover vulnerabilities", "test starry", "fix syscall", "compare with Linux", "run syscall test", "check Linux compatibility", or wants to systematically discover, test, and fix StarryOS kernel bugs using Linux comparison testing. Supersedes the older test-starry skill.
Uses power tools
Uses Bash, Write, or Edit tools
A Claude Code plugin for systematic StarryOS kernel development. It provides bug hunting with Linux comparison testing, kernel internal auditing, performance benchmarking, application compatibility testing, code quality enforcement, upstream submission preparation, and structured reporting. These workflows are backed by deterministic static analysis tools, so findings rest on tool output rather than on the agent's guesses.
Add to ~/.claude/settings.json:
```json
{
  "extraKnownMarketplaces": {
    "starry-harness": {
      "source": { "source": "github", "repo": "JosephJoshua/starry-harness" }
    }
  },
  "enabledPlugins": {
    "starry-harness@starry-harness": true
  }
}
```
Restart Claude Code.
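If `settings.json` already holds other entries, the keys from the snippet above need to be merged in rather than pasted over the file. The following is a minimal sketch of such a merge, not part of the plugin itself. The helper names `merge_settings` and `install` are hypothetical, and it assumes the settings file is plain JSON.

```python
import json
from pathlib import Path

# Keys copied from the settings snippet above.
PLUGIN_SETTINGS = {
    "extraKnownMarketplaces": {
        "starry-harness": {
            "source": {"source": "github", "repo": "JosephJoshua/starry-harness"}
        }
    },
    "enabledPlugins": {"starry-harness@starry-harness": True},
}


def merge_settings(existing: dict, addition: dict) -> dict:
    """Recursively merge `addition` into a copy of `existing`.

    Keys that only appear in `existing` are kept untouched.
    """
    merged = dict(existing)
    for key, value in addition.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = merge_settings(merged[key], value)
        else:
            merged[key] = value
    return merged


def install(settings_path: Path) -> None:
    """Hypothetical helper: read the file if present, merge, write it back."""
    existing = json.loads(settings_path.read_text()) if settings_path.exists() else {}
    settings_path.parent.mkdir(parents=True, exist_ok=True)
    merged = merge_settings(existing, PLUGIN_SETTINGS)
    settings_path.write_text(json.dumps(merged, indent=2) + "\n")
```

Calling `install(Path.home() / ".claude" / "settings.json")` would apply the merge to the default location. Back up the file first if it contains settings you care about.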
| Skill | What it does |
|---|---|
| `hunt-bugs` | Syscall bug discovery: generate test, run on Docker Linux, run on StarryOS, diff, fix. Linux must pass first. |
| `audit-kernel` | Deep kernel internal audit beyond syscalls: scheduler, memory, concurrency, signals, filesystem. Uses lockdep, stress testing, property-based tests. |
| `benchmark` | Performance measurement against Linux baselines. Profile, optimize, re-measure. |
| `test-app` | Run real Linux applications (Nginx, PostgreSQL, Python, etc.) on StarryOS. Strace profiling, gap analysis, blocker fixes. |
| `review-quality` | Code quality gate for kernel changes. Rust idioms, safety, API design, framework reuse. |
| `report` | Structured bug reports, benchmark reports, app compatibility reports, and a running work journal. |
| `evolve` | Autonomous target selection with sweep/deep modes. Picks what to work on based on coverage gaps and past effectiveness. Enforces the review pipeline. |
| `start-submission` | Prepares upstream PRs: fresh clone, minimal fix port, test format conversion, verification, Chinese PR draft. Does everything except run `gh pr create`. |
| `check-upstream` | Compares `known.json` against open/merged upstream PRs to identify already-fixed bugs, claimed bugs, and safe-to-submit bugs. |
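The `hunt-bugs` rule that "Linux must pass first" can be sketched as a gate in front of the diff step: if a test fails on Linux, the test itself is wrong and says nothing about StarryOS. This is an illustrative sketch only. The result format (check name mapped to `"PASS"` or `"FAIL"`) and the names `Divergence` and `compare_runs` are assumptions, not the harness's actual data model.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Divergence:
    check: str
    linux: str
    starry: str


def compare_runs(linux: dict, starry: dict) -> list:
    """Diff per-check outcomes between a Linux run and a StarryOS run.

    Raises ValueError if any check fails on Linux, because the reference
    run must be clean before a difference can be blamed on the kernel.
    """
    linux_failures = sorted(name for name, outcome in linux.items() if outcome != "PASS")
    if linux_failures:
        raise ValueError("test invalid, fails on Linux: " + ", ".join(linux_failures))
    divergences = []
    for name in sorted(linux):
        # A check with no StarryOS result is reported as MISSING (e.g. a crash).
        outcome = starry.get(name, "MISSING")
        if outcome != "PASS":
            divergences.append(Divergence(name, "PASS", outcome))
    return divergences
```

An empty list means the two systems agree on every check. Each returned `Divergence` is a candidate bug to investigate.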
| Agent | Role |
|---|---|
| `linux-comparator` | Runs tests on Docker Linux and StarryOS, produces structured diff |
| `kernel-reviewer` | Read-only code quality review with fresh context |
| `bug-triager` | Classifies bugs by category and severity, recommends fix priority |
Static analysis scripts that produce deterministic, ground-truth output. The agent interprets the results, but the tools themselves cannot hallucinate.
| Script | What it does |
|---|---|
| `abi-check.py` | Compares StarryOS syscall argument counts against Linux `SYSCALL_DEFINE` signatures. Each entry is sourced from kernel v6.12 with verifiable URLs. |
| `lock-order-graph.py` | Builds a directed graph of lock acquisitions and detects cycles (potential deadlocks). Rust ownership-aware: distinguishes `let guard = x.lock()` from `x.lock().method()`. |
| `pattern-scanner.py` | Scans kernel source against regex rules (9 default patterns). Rules evolve as new bug classes are found. |
| `kernel-graph.py` | Maps all 204 syscalls to subsystems, files, locks, and unsafe blocks. |
| `change-tracker.py` | Identifies which tests need re-running based on `git diff` since the last run. |
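The idea behind a lock-order graph can be sketched in a few lines: add an edge `a -> b` whenever `b` is acquired while `a` is held, then search for a cycle with depth-first search. This is not the code of `lock-order-graph.py`; the function names and lock names are hypothetical. It also assumes the input already lists only locks that are held across later acquisitions. A guard bound with `let guard = x.lock()` qualifies, whereas the temporary in `x.lock().method()` is dropped at the end of its statement and would presumably not contribute an edge.

```python
from collections import defaultdict


def build_lock_graph(sequences):
    """Build edges from acquisition sequences.

    Each sequence lists locks in the order one code path acquires them,
    holding the earlier ones. Edge a -> b means "b was taken while a was held".
    """
    graph = defaultdict(set)
    for seq in sequences:
        for i, held in enumerate(seq):
            for later in seq[i + 1:]:
                graph[held].add(later)
    return graph


def find_cycle(graph):
    """Return one cycle as a list of lock names (first == last), or None."""
    WHITE, GREY, BLACK = 0, 1, 2  # unvisited, on the DFS stack, finished
    color = defaultdict(int)
    stack = []

    def visit(node):
        color[node] = GREY
        stack.append(node)
        for nxt in sorted(graph.get(node, ())):
            if color[nxt] == GREY:
                # Back edge to a node on the stack: that slice is the cycle.
                return stack[stack.index(nxt):] + [nxt]
            if color[nxt] == WHITE:
                cycle = visit(nxt)
                if cycle:
                    return cycle
        stack.pop()
        color[node] = BLACK
        return None

    for node in sorted(graph):
        if color[node] == WHITE:
            cycle = visit(node)
            if cycle:
                return cycle
    return None
```

For two code paths that take `proc_table` then `fd_table`, and `fd_table` then `proc_table`, `find_cycle` returns a path that starts and ends at the same lock. A cycle signals a possible deadlock, not a proven one, since the two paths may never run concurrently.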
| Script | What it does |
|---|---|
| `pipeline.sh` | Full compile → inject → build → QEMU boot → result capture. Supports `--arch` with `riscv64`, `aarch64`, `x86_64`, or `loongarch64`. |
| `linux-ref-test.sh` | Compile and run a C test inside Docker Linux. Supports `--arch` for cross-arch comparison via QEMU user-mode. |
| `stress-test.sh` | Multi-run test execution with SMP sweeping (`--smp 1,2,4`) and timeout-based deadlock detection. |
| `regression-check.sh` | Runs all tests in `known.json`, compares against expected pass/fail counts, flags regressions. |
| `strace-profiler.sh` | Runs an application under strace in Docker, produces a structured syscall profile with gap analysis. |
| `convert-test.py` | Converts `starry_test.h` test format to upstream `test_framework.h` format. |
| `update-known.sh` | Updates `known.json` with results from a pipeline run. |
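The comparison that `regression-check.sh` is described as doing, checking results against expected pass/fail counts, might look roughly like the sketch below. The real schema of `known.json` is not shown here, so the input shape (test name mapped to a `(passed, failed)` tuple) and the function name `find_regressions` are assumptions made for illustration.

```python
def find_regressions(expected, actual):
    """Compare actual pass/fail counts against expected ones.

    Both arguments map a test name to a (passed, failed) tuple.
    Returns a sorted list of human-readable regression messages.
    """
    problems = []
    for name in sorted(expected):
        exp_pass, exp_fail = expected[name]
        if name not in actual:
            # No result at all usually means the run hung or crashed.
            problems.append(f"{name}: no result (possible hang or crash)")
            continue
        act_pass, act_fail = actual[name]
        if act_pass < exp_pass or act_fail > exp_fail:
            problems.append(
                f"{name}: expected {exp_pass} pass/{exp_fail} fail, "
                f"got {act_pass} pass/{act_fail} fail"
            )
    return problems
```

A test that passes more checks than expected is not flagged here. That case is an improvement, and recording it would be a job for something like `update-known.sh`.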
| Script | What it does |
|---|---|
| `man-lookup.sh` | Fetches syscall man pages (local, Docker, or man7.org). |
| `journal-entry.sh` | Appends structured entries to the work journal. |
| `draft-pr.sh` | Generates a PR draft markdown file with a ready-to-paste `gh pr create` command. |
npx claudepluginhub josephjoshua/starry-harness --plugin starry-harness