By xiaolai
Score, lint, audit, and test natural-language programming artifacts (skills, agents, hooks, prompts, rules) across Claude Code, Codex CLI, and Antigravity. Enforce 50 quality rules with a 100-point rubric, auto-fix common issues, detect vocabulary drift, check cross-component consistency, and scan for security risks.
Check cross-component consistency — reference integrity, orphans, contradictions
Auto-fix fixable issues in NL artifacts — missing fields, heading gaps, field renames
Initialize NLPM for this project — detect artifacts, set lint strictness, capture baseline trend snapshot
Discover and inventory all natural language programming artifacts in a repository
Render a self-contained HTML report — per-file scores, score trend, cross-component graph, vocabulary noun-verb map, drift candidates, and findings — into .claude/nlpm-reports/. Opens directly via file://. No server, no build, no network.
Cross-component consistency analyzer for NL programming artifacts. Checks reference integrity, detects orphans, finds behavioral contradictions, and identifies terminology drift across plugin components. <example> Context: User runs /nlpm:check on a plugin directory user: "/nlpm:check" assistant: "I'll use the checker to verify cross-component consistency." </example> <example> Context: Developer renamed a skill directory and wants to verify no broken references assistant: "I'll dispatch the checker to find any broken skill references across agents and commands." </example>
Discover and classify all NL programming artifacts in a repository. <example> Context: User wants to inventory their NL artifacts user: "/nlpm:scan" assistant: "I'll use the scanner to discover all NL artifacts." </example> <example> Context: User wants to check a specific project user: "/nlpm:scan ~/github/myproject" assistant: "I'll scan that project for NL programming artifacts." </example>
Scores NL programming artifacts on a 100-point scale using deterministic penalties. Use this agent when scoring plugin components, checking artifact quality, or running quality analysis on commands, agents, skills, rules, hooks, or CLAUDE.md. <example> Context: User runs /nlpm:score on a directory assistant: "I'll use the scorer to analyze and score these artifacts." </example> <example> Context: Quality check before a plugin release assistant: "I'll dispatch the scorer to verify all artifacts meet the threshold." </example> <example> Context: Fix command needs to identify issues before applying repairs assistant: "I'll use the scorer to identify issues and their penalties." </example>
Scan NL programming plugins (Claude Code, Codex CLI, Antigravity) for security risks in executable artifacts: hooks, scripts, MCP configs, dependencies, and prompt injection surfaces. Recognizes per-tool layouts — `.claude/`, `.codex/`, `.gemini/` / `.agent/` — and per-tool config formats (JSON for Claude/Gemini hooks, TOML for Codex `config.toml`). <example> Context: Auditing an external plugin before submitting PRs user: "Scan this plugin for security issues" assistant: "I'll use the security-scanner agent to check all executable artifacts." <commentary> Pre-contribution security gate. Must pass before any PRs are submitted to external repos. </commentary> </example> <example> Context: User wants to vet a plugin before installing user: "Is this plugin safe to install?" assistant: "I'll use the security-scanner agent to check for dangerous patterns." <commentary> Safety check for plugin consumers. Reports execution surfaces and risk level. </commentary> </example>
Researches one tool's CURRENT official documentation and diffs it against an nlpm convention overlay, returning a structured gap report (ADD / FIX / REMOVE / CONFIRM / RESOLVED) so the overlay can be brought up to date. Read-only on both the web and the repo — it never edits nlpm files. Use when an overlay (conventions-claude, conventions-codex, conventions-antigravity) may have drifted from the upstream spec it documents. <example> Context: User runs /nlpm:spec-sync to refresh the Claude Code overlay user: "/nlpm:spec-sync claude" assistant: "I'll dispatch the spec-researcher on the conventions-claude overlay to fetch the current Claude Code docs and report what changed." </example> <example> Context: A new Codex CLI release shipped and the overlay's changes table stops months ago user: "Has the Codex overlay fallen behind the latest releases?" assistant: "I'll use the spec-researcher to pull the latest Codex docs and GitHub releases and diff them against conventions-codex." </example> <example> Context: Maintainer wants to verify the overlays before a release without editing anything user: "Check the overlays for drift before I cut the release, but don't change anything." assistant: "I'll dispatch the spec-researcher in report-only mode for each overlay to surface stale field names and resolved uncertainties." </example>
Use when scoring or writing Antigravity (or legacy Gemini CLI) artifacts — covers .gemini/ paths, .agent/ workspace skills, gemini-extension.json, GEMINI.md, TOML slash commands, Gemini-lineage hook events. Spec is unsettled (Antigravity 2.0 launched 2026-05-19); many checks are advisory until PR-B verification.
Use when scoring or writing Claude Code artifacts — covers .claude/ paths, plugin.json schema, command + agent + skill frontmatter, CLAUDE.md, hook events, hooks.json format, settings.json, LSP, monitors, memory file conventions, and the Claude Code built-in tool catalog. Refreshed 2026-06-07 against docs map dated 2026-06-05 (Claude Code ≥ v2.1.16x).
Use when scoring or writing Codex CLI artifacts — covers .codex/config.toml schema, .codex-plugin/plugin.json, .agents/skills/ layout, Codex hook events, AGENTS.md hierarchy, marketplace.json, and the agents/openai.yaml sidecar. Refreshed 2026-06-07 against Codex 0.137.0 (2026-06-04).
Universal NL programming conventions — SKILL.md open spec (agentskills.io), AGENTS.md as canonical universal memory file, vague-quantifier list, prompt engineering layers, naming conventions, the override system. Tool-specific schemas live in nlpm:conventions-claude / nlpm:conventions-codex / nlpm:conventions-antigravity.
Multi-agent workflow patterns for Claude Code -- parallel dispatch, sequential pipelines, QC gates, retry loops, shared partials. Use when designing systems with multiple agents, commands, or processing stages.
Modifies files
Hook triggers on file write and edit operations
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimnpx claudepluginhub xiaolai/nlpm --plugin nlpmBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Natural-Language Programming Manager — score, check, fix, and test NL artifacts across Claude Code, Codex CLI, and Antigravity. Tier-aware scoring with per-tool overlays.
Part of the xiaolai plugin marketplace.
NLPM is the only multi-tool NL artifact validator that systematically checks manifest-vs-disk consistency — the bug class where a SKILL.md exists on disk but is silently missing from plugin.json (and therefore invisible after claude plugin install). Verified across 8+ tools including Anthropic's official plugin-validator and the Linux Foundation's skills-ref. See analysis/ecosystem-gap.md for the research.
NLPM treats natural language artifacts as programs that can be linted. Just as ESLint scores JavaScript and ruff scores Python, NLPM scores the markdown files that drive AI behavior: skills, agents, commands, rules, hooks, prompts, CLAUDE.md, and memory files.
Eight commands, each doing one thing:
| Command | What it does |
|---|---|
/nlpm:ls | Discover and inventory all NL artifacts in a repo |
/nlpm:score | Score artifact quality (100-point scale) |
/nlpm:check | Cross-component consistency checks |
/nlpm:fix | Auto-fix fixable issues |
/nlpm:trend | Track quality score trends over time |
/nlpm:test | Run NL artifact tests against spec files (TDD) |
/nlpm:init | Initialize NLPM for a project |
/nlpm:security-scan | Scan plugins for security risks in executable artifacts |
Slash commands ship as a Claude Code plugin. The scoring rubric covers three ecosystems (Claude Code, Codex CLI, Antigravity) via tier-aware overlays — see analysis/multi-tool-design-2026-05.md. The standalone Python 3.11+ validator (bin/nlpm-check) has no Claude Code dependency and runs in pre-commit hooks or CI on any tool's artifacts.
NLPM also runs as a self-evolving GitHub Actions pipeline that audits real plugin repos, contributes fix PRs, harvests teaching examples from clean ones, and feeds learnings back into its own rule catalog:
auditor/exemplars/ — 62 published so far, covering 31 of the 50 Rules with real-world positive references. See the gallery.auditor-cite-exemplars.yml runs weekly and opens a human-gated PR adding > Real-world example: [<repo>] links to skills/nlpm/rules/SKILL.md, so each rule documents both the bad case (in the rule body) and the good case (in a real repo).auditor/scripts/validate-rule-ids.py re-validates every audit's rule_id against the rubric (type drift) and the rule's title keywords (semantic drift). The 2026-05-13 sweep found 990 mislabeled rule_ids across 128 historical audits; the validator is now wired as a soft-warn telemetry step in every new audit so future drift is caught immediately.auditor/scripts/rule-health.py reports validated_hits per rule (raw hits minus drift hits) and exemplars_count per rule, so the "needs attention" view is calibrated against actual rule violations rather than scorer noise.Two install paths — both reach the same code. Pick one:
Via Anthropic's official community marketplace (curated; updates lag the maintainer's marketplace by up to ~24h):
claude plugin marketplace add anthropics/claude-plugins-community
claude plugin install nlpm@claude-community --scope project # or --scope user
Via the xiaolai marketplace (latest version lands here first):
claude plugin marketplace add xiaolai/claude-plugin-marketplace
# Project scope (recommended)
claude plugin install nlpm@xiaolai --scope project
# Global (all projects)
claude plugin install nlpm@xiaolai --scope user
Install fails with "Plugin not found in marketplace 'xiaolai'"? Your local marketplace clone is stale. Run
claude plugin marketplace update xiaolaiand retry —plugin installdoes not auto-refresh. (The community marketplace doesn't have this caveat.)
In Claude Code:
A 260-token system prompt that overrides three structural presumptions every RLHF-trained LLM inherits from training: that you want confirmation, that old scarcity still applies, that best practices are ceilings.
English language coach for non-native speakers — auto-corrects prompts, translates non-English, refines with :: prefix, tracks improvement over time
One plugin to bridge and delegate across Claude Code, Codex CLI, and Gemini CLI — single-source AGENTS.md, shared skills, mirrored hooks and MCP servers, and full Claude↔Codex bidirectional delegation.
Auto-updated multi-skill reference for the whole Anthropic doc ecosystem. 8 skills covering claude-code, claude-agent-sdk, anthropic-api, anthropic-platform-features, claude-connectors, claude-cowork, mcp-spec, and anthropic-pulse (news + research digests). Pipeline refreshes daily.
Initialize a multi-agent workspace with shared configuration for Claude Code, Codex CLI, and Gemini CLI
A configurable linter for agent skills, plugins, and AI coding assistant context
Analyze Claude Code agent session transcripts to identify inefficiencies, anti-patterns, repeated mistakes, missing tooling opportunities, and user frustration signals for continuous improvement
Audit and optimize Claude Code configurations with dynamic best-practice research
No description provided.
Claude Code skill pack for Grammarly (24 skills)
Prompt engineering techniques for accurate, grounded Claude responses — anti-hallucination workflow with citation-backed analysis