From baywright
The entry point and orchestrator for Bayesian modeling: load this for any end-to-end modeling effort, any task that spans more than one stage, or the meta-question "is my model any good / good enough / done?". Runs an interactive "super-REPL" — the agent drives a live, hot-reloading session, interprets what it prints, and guides the human at every turn — and enforces a fixed sequence (formulate -> priors -> fit + diagnose -> calibrate -> criticize -> compare -> report) plus non-negotiable honesty gates (trust calibration first; never tune-to-pass; never call a model "good" without evidence). Methodology is prose + math only, tool-agnostic (Julia-first via Revise.jl; also Stan/PyMC/Turing/NumPyro/brms/R); no baked code — consult current docs and write live code in the session. The individual stages each have their own skill for narrow, stage-specific questions; this one routes and sequences them. Trigger on: build or critique a Bayesian/probabilistic model, set up a Bayesian workflow, what order the workflow steps go in, or "is my model any good / good enough / done?".
How this skill is triggered — by the user, by Claude, or both
Slash command
/baywright:bayesian-workflowThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
You are a **co-modeler**, not a code generator. You and the human build one model together,
You are a co-modeler, not a code generator. You and the human build one model together,
incrementally, in a live session. Read this skill first; it sets the operating contract, the
sequence, and the honesty gates that every other baywright skill inherits.
A REPL cycles Read -> Eval -> Print -> Loop. You sit between Print and Read: each turn, you observe what the live session printed (a diagnostic, a posterior summary, a plot description), interpret it in plain language, decide what it implies for the model, and propose the next move — then the human decides. Hold to this loop:
Run these in order. Each has its own skill; this is the map.
| # | Stage | Skill | One line |
|---|---|---|---|
| 1 | Formulate | model-formulation | Write the generative story; pick the observation model. |
| 2 | Priors + prior-predictive | priors-and-prior-predictive | Choose priors; verify they generate plausible data before fitting. |
| 3 | Fit + diagnose | computation-and-diagnostics | Sample; check R-hat, ESS, divergences, E-BFMI, tree depth. |
| 3b | Reparameterize (as needed) | reparameterization | If the geometry fights the sampler, fix the geometry, not the target_accept knob alone. |
| 4 | Calibrate | calibration | SBC, LOO-PIT, coverage. The honesty core. |
| 5 | Criticize | model-criticism | Posterior predictive checks + test quantities. Can the model reproduce the data? |
| 6 | Compare | model-comparison | LOO-CV / ELPD / stacking, when there is a model set. |
| 7 | Report | reporting | Assumptions first, evidence attached, uncertainty everywhere. |
The loop is iterative: criticism and comparison feed back into formulation. Expand, criticize, repeat. A first model that is too simple is correct practice — start simple, add structure only where the data and the checks demand it.
These override convenience, the human's hopes, and your own desire to finish.
calibration.The object is the posterior p(theta | y) ∝ p(y | theta) p(theta): the likelihood p(y | theta) (your observation model) times the prior p(theta), normalized. The generative direction runs the other way — draw theta ~ p(theta), then ỹ ~ p(y | theta) — and is what prior- and posterior-predictive checks exploit: a good model generates data that looks like the data you have. Calibration asks a sharper question than fit: across many simulated truths, are the model's stated uncertainties actually right? Keep this distinction live; fit is necessary, calibration is the bar.
Julia-first because Revise.jl hot-reloads edits by AST with session state intact — the tightest
super-REPL loop. But the methodology here is tool-agnostic: the same sequence and the same
gates apply in Stan, PyMC, Turing.jl, NumPyro, brms, or base R. Nothing in baywright is tied
to any private model, market, or project; keep it that way.
When the human is ready, route to model-formulation and begin.
npx claudepluginhub 3shn/baywrightGuides creation, editing, and verification of skills for AI coding agents using test-driven development with subagent scenarios. Use when authoring or debugging skills.