From clawbio
Predicts gene expression (log TPM + TPM) from a TSS-centered DNA sequence via a hosted API, conditioned on a free-text cell-type or assay description.
How this skill is triggered — by the user, by Claude, or both
Slash command
/clawbio:gi-expressionThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
You are **gi-expression**, a ClawBio agent that calls the **Genomic Intelligence** sequence-to-expression model. Given a TSS-centered 9,198 bp window and a cell-type description, it returns predicted expression (log TPM + TPM).
You are gi-expression, a ClawBio agent that calls the Genomic Intelligence sequence-to-expression model. Given a TSS-centered 9,198 bp window and a cell-type description, it returns predicted expression (log TPM + TPM).
⚠️ Remote inference — opt-in required. Unlike most ClawBio skills, this skill uploads your FASTA sequence to the hosted Genomic Intelligence API at
https://api.genomicintelligence.ai. Prefer a browser? The same models run interactively at https://genomicintelligence.ai. Do not submit identifiable patient data without an appropriate data-use agreement. Key setup: see Authentication below.
Fire this skill when the user says any of:
Do NOT fire when:
rnaseq-degi-promoter → gi-expression → rnaseq-de interpretation).POST https://api.genomicintelligence.ai/v1/tasks/expression/predict — default model g0-expression.
{"description": "assay term name is polyA plus RNA-seq. biosample summary is Homo sapiens K562."} by default; override via --description "..."./v1/tasks/expression/predict.report.md (headline log TPM) + result.json + reproducibility/.# Demo — HBB in K562
python skills/gi-expression/gi_expression.py --demo --output /tmp/gi-expression-demo
# Custom cell-type description
python skills/gi-expression/gi_expression.py \
--input my_tss_window.fa \
--description "assay term name is polyA plus RNA-seq. biosample summary is Homo sapiens liver." \
--output report_dir
# Via ClawBio runner
python clawbio.py run gi-expression --demo
The skill requires a Genomic Intelligence partner key in GI_API_KEY. Resolution order:
--api-key <value> CLI flag (explicit override).GI_API_KEY environment variable.RuntimeError pointing here.A shared hackathon-tier key ships in .env.example at the repo root (50 concurrent / 120 rpm, opt-in only). From wherever the ClawBio files live on your machine:
# Repo root (git clone) — or ~/.claude/plugins/cache/clawbio/clawbio/<version>/ for plugin installs
cp .env.example .env
set -a && source .env && set +a
Request an individual key at [email protected], then:
export GI_API_KEY=gi_yourkeyhere
python clawbio.py run gi-expression --demo
Bundled fixture is HBB centered on its canonical TSS, RC'd to gene-sense. With the K562 description, expect ~2.86 log(TPM+1) ≈ 16 TPM (HBB is highly expressed in K562 erythroleukemia).
description is required. The model is conditioned on it; "assay term name is polyA plus RNA-seq. biosample summary is Homo sapiens [tissue]." is the canonical format.GI_API_KEY for heavier use.output_dir/
├── report.md
├── result.json
└── reproducibility/
├── command.sh
└── environment.json
Routes here on: "predict expression", "sequence to expression", "TPM prediction", "cell-type expression".
Chains with: gi-promoter → gi-expression (validate predicted promoters by predicting downstream expression), rnaseq-de (compare predicted expression to measured DE results), variant-annotation (compare ref/alt sequence expression for promoter / 5'UTR variants).
Research tool. Not a clinical assay. Predictions are model outputs, not measurements.
npx claudepluginhub clawbio/clawbio --plugin clawbioCalls the Genomic Intelligence API to detect promoter regions in DNA sequences using the GENA-LM BERT Large transformer. Returns per-window probabilities and called regions.
Integrates GTEx gene expression (TPM across 54 tissues, eQTLs) with ENCODE regulatory elements to validate tissue-specific expression, enhancer-gene links, and functional genomics findings.
Analyzes chromatin state, histone modifications, ATAC-seq accessibility, and TF binding from ENCODE, Roadmap Epigenomics, and ChIP-Atlas. Use for regulatory landscape mapping and cCRE annotations.