Stats

Actions

Available In

Tags

baywright

A Julia-first Bayesian-workflow copilot for Claude Code — a Socratic super-REPL that builds probabilistic models with you, one honest step at a time.

What it is

A REPL is Read → Eval → Print → Loop. baywright makes the agent sit inside that loop, between Print and Read: it watches what your live session prints (a divergence count, a rank histogram, a posterior interval), interprets it in plain language, and shapes what you do next. The model is not a script you run once and report on — it is a living session where edits hot-reload by AST (Revise.jl) with state preserved, and you stay in the loop at every turn.

baywright owns the process, not the syntax:

Opinionated on method, agnostic on tooling. It enforces a fixed workflow (formulate → priors → fit → calibrate → criticize → compare → report) and non-negotiable honesty gates, but the modeling library is yours to choose.

Prose + math, zero baked code. The skills carry reasoning and mathematics, never code snippets that rot or lock you to one library. When syntax is needed, the agent consults the current documentation for your chosen tool and writes live code into the session.

Julia-first. Julia uniquely hot-reloads live edits precisely by AST via Revise.jl, giving a Clojure/Erlang-style live-coding loop. Other ecosystems (Stan, PyMC, Turing, NumPyro, brms, R) are first-class for the methodology; the live-reload loop is sharpest in Julia.

Status

v0.2 — the MCMC lane is complete. This release has the full methodology (9 doctrine skills) plus the interactive loop: the /baywright:bw-* workflow verbs, the persistent Model Ledger, the on-demand specialist agents (prior-interviewer, model-critic, diagnostic-reader), and the fail-closed done-gate hook. Amortized / simulation-based inference (SBI) and causal inference are later phases.

Skills (v0.1)

Skill

Covers

bayesian-workflow

The spine: the workflow sequence, the operating contract, the honesty gates. Start here.

model-formulation

The generative story; choosing an observation model; decision-relevance.

priors-and-prior-predictive

Prior elicitation and prior predictive checks.

computation-and-diagnostics

Sampler choice; convergence (R-hat, ESS, divergences, E-BFMI, tree depth).

reparameterization

Funnels, non-centering, standardization, identifiability geometry.

calibration

Simulation-based calibration (SBC), LOO-PIT, coverage — the honesty core.

model-criticism

Posterior predictive checks, test quantities, identifiability.

model-comparison

LOO-CV, ELPD, stacking, WAIC.

reporting

The audit-trail report: assumptions first, evidence attached.

model-ledger

The format of the living per-model record the loop reads and writes.

The workflow loop

Drive the workflow with the /baywright:bw-* verbs (Claude Code namespaces plugin commands by plugin name, so your bw shorthand reads as /baywright:bw-…):

Verb

Does

/baywright:bw-start

Run the formulation intake and create the Model Ledger.

/baywright:bw-status

Show where the model stands and which gates are pending.

/baywright:bw-next

Advance one stage, gate-checked — won't pass a stage with no evidence.

/baywright:bw-gate

Run the calibration gate (prior-predictive, SBC, recovery), fail-closed.

/baywright:bw-criticize

Run a posterior-predictive criticism pass.

/baywright:bw-explain <concept>

Teaching mode: explain a concept, then point to current docs.

The Model Ledger (baywright-ledger.md) is the state: a machine-checkable YAML status block over human-readable prose — the generative story, priors and their justification, each stage's status and evidence, the honesty log, and the verdict.

On-demand specialist agents: prior-interviewer (Socratic prior elicitation), model-critic (adversarial "how is this wrong?"), diagnostic-reader (reads R-hat / SBC / LOO output in plain language).

The done-gate hook is a fail-closed Stop hook: it refuses to let a session end on a "model is good/done" claim unless the ledger records the required gates (formulation, priors, computation, calibration, criticism), or you have explicitly set verdict: pending-accepted. It is the firewall made mechanical — honest workflow even when the agent is the one doing the modeling.

The Julia live-session driver

The recommended Julia backend is a persistent REPL driven over MCP, with Revise.jl for hot-reload. baywright documents this posture but does not bundle an MCP config — bring your own REPL server (e.g. a kaimon-style Julia MCP) and the workflow uses whatever live-session and documentation tools you have. A user without a live REPL still gets the full doctrine and drives their tool however they like.

Install (local test)

baywright

A Julia-first Bayesian-workflow copilot for Claude Code — a Socratic super-REPL that builds probabilistic models with you, one honest step at a time.

What it is

baywright owns the process, not the syntax:

Opinionated on method, agnostic on tooling. It enforces a fixed workflow (formulate → priors → fit → calibrate → criticize → compare → report) and non-negotiable honesty gates, but the modeling library is yours to choose.
Prose + math, zero baked code. The skills carry reasoning and mathematics, never code snippets that rot or lock you to one library. When syntax is needed, the agent consults the current documentation for your chosen tool and writes live code into the session.
Julia-first. Julia uniquely hot-reloads live edits precisely by AST via Revise.jl, giving a Clojure/Erlang-style live-coding loop. Other ecosystems (Stan, PyMC, Turing, NumPyro, brms, R) are first-class for the methodology; the live-reload loop is sharpest in Julia.

Status

Skills (v0.1)

Skill	Covers
`bayesian-workflow`	The spine: the workflow sequence, the operating contract, the honesty gates. Start here.
`model-formulation`	The generative story; choosing an observation model; decision-relevance.
`priors-and-prior-predictive`	Prior elicitation and prior predictive checks.
`computation-and-diagnostics`	Sampler choice; convergence (R-hat, ESS, divergences, E-BFMI, tree depth).
`reparameterization`	Funnels, non-centering, standardization, identifiability geometry.
`calibration`	Simulation-based calibration (SBC), LOO-PIT, coverage — the honesty core.
`model-criticism`	Posterior predictive checks, test quantities, identifiability.
`model-comparison`	LOO-CV, ELPD, stacking, WAIC.
`reporting`	The audit-trail report: assumptions first, evidence attached.
`model-ledger`	The format of the living per-model record the loop reads and writes.

The workflow loop

Drive the workflow with the /baywright:bw-* verbs (Claude Code namespaces plugin commands by plugin name, so your bw shorthand reads as /baywright:bw-…):

Verb	Does
`/baywright:bw-start`	Run the formulation intake and create the Model Ledger.
`/baywright:bw-status`	Show where the model stands and which gates are pending.
`/baywright:bw-next`	Advance one stage, gate-checked — won't pass a stage with no evidence.
`/baywright:bw-gate`	Run the calibration gate (prior-predictive, SBC, recovery), fail-closed.
`/baywright:bw-criticize`	Run a posterior-predictive criticism pass.
`/baywright:bw-explain <concept>`	Teaching mode: explain a concept, then point to current docs.

baywright

Popularity

What's Inside

Confidence

README

baywright

What it is

Status

Skills (v0.1)

The workflow loop

The Julia live-session driver

Install (local test)

Similar Plugins

context7-plugin

ecc

fullstack-dev-skills

godot-skills

superpowers

prompts.chat

More by 3shn

SysML v2 Co-Pilot

baywright

What it is

Status

Skills (v0.1)

The workflow loop

The Julia live-session driver

Install (local test)

More by 3shn

SysML v2 Co-Pilot

Popularity

Health & Quality

Similar Plugins

context7-plugin

ecc

fullstack-dev-skills

godot-skills

superpowers

prompts.chat