From clawbio
Predicts gene and transcript structure (intervals, exons, strand) from a DNA sequence using the Genomic Intelligence API. For de novo annotation without external databases.
How this skill is triggered — by the user, by Claude, or both
Slash command
/clawbio:gi-annotationThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
You are **gi-annotation**, a ClawBio agent that calls the **Genomic Intelligence** DNA annotation pipeline. Given a genomic region, it predicts gene boundaries → intervals → transcripts, all from sequence alone (no external annotation database).
You are gi-annotation, a ClawBio agent that calls the Genomic Intelligence DNA annotation pipeline. Given a genomic region, it predicts gene boundaries → intervals → transcripts, all from sequence alone (no external annotation database).
⚠️ Remote inference — opt-in required. Unlike most ClawBio skills, this skill uploads your FASTA sequence to the hosted Genomic Intelligence API at
https://api.genomicintelligence.ai. Prefer a browser? The same models run interactively at https://genomicintelligence.ai. Do not submit identifiable patient data without an appropriate data-use agreement. Key setup: see Authentication below.
Fire this skill when the user says any of:
Do NOT fire when:
variant-annotation (VEP)POST https://api.genomicintelligence.ai/v1/tasks/annotation/predict with Prefer: respond-async — annotation is async-only. The pipeline streams progress through GET /v1/tasks/jobs/{job_id} (typically: load → gene-boundaries → gene-intervals → transcripts).
POST /v1/tasks/annotation/predict with Prefer: respond-async → 202 + job_id.percent, message) until terminal.report.md (transcripts table) + result.json (full response) + reproducibility/.# Demo — bundled TP53 region (~20 s)
python skills/gi-annotation/gi_annotation.py --demo --output /tmp/gi-annotation-demo
# Your own FASTA
python skills/gi-annotation/gi_annotation.py --input my_region.fa --output report_dir
# Via ClawBio runner
python clawbio.py run gi-annotation --demo
The skill requires a Genomic Intelligence partner key in GI_API_KEY. Resolution order:
--api-key <value> CLI flag (explicit override).GI_API_KEY environment variable.RuntimeError pointing here.A shared hackathon-tier key ships in .env.example at the repo root (50 concurrent / 120 rpm, opt-in only). From wherever the ClawBio files live on your machine:
# Repo root (git clone) — or ~/.claude/plugins/cache/clawbio/clawbio/<version>/ for plugin installs
cp .env.example .env
set -a && source .env && set +a
Request an individual key at [email protected], then:
export GI_API_KEY=gi_yourkeyhere
python clawbio.py run gi-annotation --demo
Bundled fixture is the TP53 locus (19 kbp). Expect ~5 transcripts (TP53 has multiple annotated isoforms) and a ~20 s wall time.
output_dir/
├── report.md
├── result.json
└── reproducibility/
├── command.sh
└── environment.json
Routes here on: "annotate sequence", "predict genes", "gene structure", "de novo annotation".
Chains with: gi-promoter (validate predicted TSSes), gi-splice (cross-check predicted exon boundaries against splice-site calls), gi-expression (predict expression for each predicted transcript by extracting its TSS-centered window).
Research tool. Not a clinical assay. Predicted gene structures are model outputs, not curated reference annotations — for clinical interpretation, anchor to RefSeq / Ensembl.
npx claudepluginhub clawbio/clawbio --plugin clawbioCalls the Genomic Intelligence API to detect promoter regions in DNA sequences using the GENA-LM BERT Large transformer. Returns per-window probabilities and called regions.
Provides unified CLI and Python access to 20+ bioinformatics databases for gene info, sequence analysis, protein structures, and enrichment queries.
Processes molecular biology data with Biopython: parse FASTA/GenBank/PDB/FASTQ, query NCBI Entrez/PubMed, automate BLAST, align sequences, analyze protein structures, build phylogenetic trees. For batch processing and pipelines.