From clawbio
Scores per-gene blood RNA-seq expression outliers against a control reference panel for rare-disease diagnostics. Flags dosage-sensitive candidates for WGS-negative or VUS cases.
How this skill is triggered — by the user, by Claude, or both
Slash command
/clawbio:rare-disease-rnaseqThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Reproduces the diagnostic principle of the Genomics England NGRL paper (Blood-based RNA-Seq of 5,412 individuals, medRxiv 2026.03.19.26348811). For each case sample, scores per-gene expression against a control reference panel and flags candidates falling in a curated dosage-sensitive disease-gene panel.
Reproduces the diagnostic principle of the Genomics England NGRL paper (Blood-based RNA-Seq of 5,412 individuals, medRxiv 2026.03.19.26348811). For each case sample, scores per-gene expression against a control reference panel and flags candidates falling in a curated dosage-sensitive disease-gene panel.
Per-gene robust outlier scoring on log2(CPM+1):
This implements the diagnostic principle of OUTRIDER (per-gene outlier vs control panel) without the autoencoder, so it runs in seconds with no R/Bioconductor stack. For clinical-grade calls swap to the full DROP pipeline (gagneurlab/drop) which adds OUTRIDER's denoising autoencoder, FRASER2 splicing outliers, and confounder correction. The skill's I/O contract is the same so the upgrade is drop-in.
.csv or .tsv): rows = genes (HGNC symbol), columns = sample IDs.txt): one case sample ID per line.txt): one control sample ID per line (typically n ≥ 50).csv with gene and mechanism columns): defaults to a built-in 50-gene haploinsufficient panelrdoutlier_report/
├── report.md # per-case candidate diagnoses + clinical narrative
├── result.json # standard ClawBio envelope
├── figures/
│ └── case_outlier_heatmap.png # z-scores across cases × top genes
├── tables/
│ ├── outlier_calls.csv # all flagged outliers with z-score, direction, mechanism
│ └── per_gene_stats.csv # control median + MAD per gene
└── reproducibility/
├── commands.sh
├── environment.yml
└── checksums.sha256
python clawbio.py run rdoutlier --demo
Generates 100 synthetic Gulf-ancestry control samples + 2 cases with injected outliers (FBN1 down, NF1 up) across a 200-gene panel. Demonstrates the diagnostic loop end-to-end in seconds.
| Component | Demo | Production |
|---|---|---|
| Aligner + quantifier | none (synthetic counts) | STAR + featureCounts (or Salmon) |
| Outlier algorithm | robust per-gene z-score | OUTRIDER autoencoder + FRASER2 splicing |
| Control panel | 100 synthetic samples | QBB n≈12K PAXgene blood RNA-seq |
| Confounder correction | none | DROP pipeline (RIN, batch, hidden factors) |
| Disease panel | 50 haploinsufficient genes | ClinGen haploinsufficient + PanelApp |
| Return-of-result loop | report.md | Sidra MDT reflex from WGS-negative referrals |
ClawBio is a research and educational tool. It is not a medical device and does not provide clinical diagnoses. Consult a healthcare professional before making any medical decisions.
npx claudepluginhub clawbio/clawbio --plugin clawbioPerforms differential expression analysis on bulk or pseudo-bulk RNA-seq count matrices with QC, PCA, and contrast testing. Useful for transcriptomics researchers.
Diagnoses rare diseases by matching patient phenotypes (HPO terms) to candidate diseases from Orphanet/OMIM, prioritizes gene panels, interprets ACMG variants, and analyzes protein structures. Use for diagnostic odyssey assistance and genetic-counseling differentials.
Runs differential gene expression analysis on bulk RNA-seq count matrices using PyDESeq2, including normalization, dispersion estimation, Wald tests, FDR correction, and volcano/MA plots.