By intertwine
Build production LLM pipelines with DSPy 3.2.x: define typed signatures and modules, evaluate with rich-feedback metrics, optimize via GEPA, and reason over 100k+ token contexts using recursive RLM agents.
Drive a complete DSPy 3.2.x project end-to-end — spec → program → metric → baseline → GEPA optimize → export → deploy. Orchestrates the other four DSPy skills (dspy-fundamentals, dspy-evaluation-harness, dspy-gepa-optimizer, dspy-rlm-module) in the correct order. Use this for any non-trivial DSPy build from scratch.
Build DSPy evaluation harnesses with rich-feedback metrics that are essential for GEPA optimization. Use when writing a metric function, calling dspy.Evaluate, splitting dev/val sets, debugging "why is my optimizer not improving?", or designing CI-ready DSPy eval suites.
Write idiomatic DSPy 3.2.x programs — typed Signatures, dspy.Module subclasses, Predict/ChainOfThought/ReAct/ProgramOfThought, and save/load. Use this when starting any new DSPy project or when fixing non-idiomatic DSPy code (hard-coded prompts, ad-hoc string templates, untyped outputs, non-serializable classes).
Optimize DSPy programs with dspy.GEPA — the reflective/evolutionary optimizer that is the 2026 gold standard for DSPy (beats MIPROv2 on complex tasks with far fewer rollouts when the metric returns rich feedback). Use when the user says optimize, compile, GEPA, reflective optimization, or "make this program better" and a DSPy program + metric + trainset exist.
Use dspy.RLM (Recursive Language Model) for reasoning over contexts too large to fit in an LLM's working window — entire codebases, long logs, massive documents, or multi-step data exploration that needs a sandboxed Python REPL. Use when the input is >100k tokens, needs recursive chunking, or benefits from the LLM writing and running code to probe data.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Production-grade DSPy 3.2.x skills for coding agents. A synthesized, spec-compliant pack of five agent skills that turns Claude Code, Codex CLI, and any other agentskills.io-compatible agent into a DSPy expert.
SKILL.md + deep reference.md)example_*.py scripts with offline --dry-runBetterTogether chaining example| Skill | When it auto-invokes |
|---|---|
dspy-fundamentals | Any new DSPy code: Signatures, Modules, Predict/ChainOfThought/ReAct, save/load |
dspy-evaluation-harness | Writing metrics, splitting dev/val sets, calling dspy.Evaluate |
dspy-gepa-optimizer | Optimizing/compiling DSPy programs with dspy.GEPA |
dspy-rlm-module | Long context, codebase QA, recursive exploration via dspy.RLM |
dspy-advanced-workflow | End-to-end builds — orchestrates the other four |
/plugin marketplace add intertwine/dspy-agent-skills
/plugin install dspy-agent-skills@dspy-agent-skills
npx skills)npx skills add intertwine/dspy-agent-skills --list
npx skills add intertwine/dspy-agent-skills --skill '*' -a codex -y
The Vercel skills CLI currently expects a GitHub owner/repo, URL, well-known HTTPS endpoint, or local path as its source. The bare form npx skills add dspy-agent-skills is not resolvable unless the upstream CLI adds a source alias, so use intertwine/dspy-agent-skills.
git clone https://github.com/intertwine/dspy-agent-skills
cd dspy-agent-skills
./scripts/install.sh # symlinks into ~/.claude/skills/ and ~/.agents/skills/
Flags: --claude-only, --codex-only, --copy (copy instead of symlink), --uninstall, --dry-run.
Drop skills/* into ~/.claude/skills/ (Claude Code) or ~/.agents/skills/ (Codex CLI). See docs/installation.md for all options.
In your agent, say:
"Build a DSPy sentiment classifier, optimize it with GEPA, and save the artifact."
The agent auto-loads dspy-advanced-workflow, which chains the other skills and outputs a full baseline → GEPA → export pipeline. No further prompting needed.
Three runnable demos under examples/ exercise every skill against real LMs and ship with committed baseline vs. GEPA-optimized numbers plus explicit 3.1.3 vs. 3.2.0 comparison notes.
| Example | Artifact DSPy | Task LM | Baseline | Optimized | Δ | Status |
|---|---|---|---|---|---|---|
| 01-rag-qa | 3.2.0 | Ministral 3B 2512 | 80.47 | 100.00 | +19.53 | Clean comparison refreshed on 2026-04-28 |
| 02-math-reasoning | 3.2.0 | Ministral 3B 2512 | 85.00 | 93.33 | +8.33 | Refreshed on 2026-04-21 |
| 03-invoice-extraction | 3.1.3 | Liquid LFM 2.5 1.2B (free) | 0.833 | 0.931 | +0.098 | Historical artifact retained |
The refreshed 01 and 02 artifacts use the paid pair openrouter/mistralai/ministral-3b-2512 + openrouter/qwen/qwen3-30b-a3b-instruct-2507. 03 stays on its historical DSPy 3.1.3 artifact because a clean DSPy 3.2.0 baseline on the same Liquid/Nemotron pair already reached 0.944, leaving little useful headroom for a replacement GEPA artifact. See examples/README.md and each example's version_comparison.md for the exact commands and caveats.
Every API claim is grounded in:
# Run validation suite
uv run --with pytest python -m pytest tests/ -v
# Smoke-test every example offline (no API key needed)
for f in skills/*/example_*.py; do uv run --with dspy python "$f" --dry-run; done
# Validate the current DSPy API surface used by these skills
env -u UV_EXCLUDE_NEWER uv run --with dspy==3.2.1 python scripts/check_dspy_surface.py
npx claudepluginhub intertwine/dspy-agent-skills --plugin dspy-agent-skillsSchedule recurring Claude Code tasks using native OS schedulers (launchd/crontab). Zero dependencies.
Collection of 22 focused skills for building, optimizing, evaluating, and deploying DSPy applications.
Professional AI/ML Engineering toolkit: Prompt engineering, LLM integration, RAG systems, AI safety with 12 expert plugins
Benchmark, evaluate, and optimize skills to ensure reliable performance across all LLMs
Editorial "LLM Application Developer" bundle for Claude Code from Antigravity Awesome Skills.
LLM application development with RAG, embeddings, LangChain, and prompt engineering
ML engineering plugin: Give your AI coding agent ML engineering superpowers.