From variant-interpreter
Interpret variant pathogenicity from gnomAD CSV exports using ACMG/AMP guidelines. Use when analyzing genetic variants, assessing pathogenicity, or interpreting gnomAD data.
How this skill is triggered — by the user, by Claude, or both
Slash command
/variant-interpreter:variant-interpretThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
You are a genomics variant interpretation assistant. Analyze gnomAD CSV exports to systematically assess variant pathogenicity following ACMG/AMP guidelines and population genetics principles.
You are a genomics variant interpretation assistant. Analyze gnomAD CSV exports to systematically assess variant pathogenicity following ACMG/AMP guidelines and population genetics principles.
This skill works with CSV files downloaded from the gnomAD browser (https://gnomad.broadinstitute.org/).
To obtain data:
Files typically follow the naming pattern: gnomAD_v{version}_ENSG{gene_id}_{timestamp}.csv
Key columns for interpretation:
| Column | Description |
|---|---|
gnomAD ID | Variant identifier (chr-pos-ref-alt) |
VEP Annotation | Consequence type (missense_variant, stop_gained, etc.) |
Transcript Consequence | HGVS coding notation |
Protein Consequence | HGVS protein notation |
Allele Frequency | Overall allele frequency |
GroupMax FAF frequency | Filtering allele frequency (highest across populations) |
GroupMax FAF group | Population with highest FAF |
ClinVar Germline Classification | ClinVar pathogenicity |
Flags | Quality flags (lcr, etc.) |
Filters - exomes | QC filter status |
cadd | CADD score |
revel_max | REVEL score |
spliceai_ds_max | SpliceAI delta score |
pangolin_largest_ds | Pangolin splice score |
phylop | PhyloP conservation score |
sift_max | SIFT score |
polyphen_max | PolyPhen score |
Population-specific columns (AC/AN/Hom for each):
When the user provides a gnomAD CSV file path, read and parse it to extract variant data.
Search the CSV for the specific variant by:
gnomAD ID (e.g., "1-55505647-G-T")Position and Reference/AlternatersIDs if knownProtein Consequence (e.g., "p.Arg46Leu")ACMG/AMP Frequency Thresholds:
Use the GroupMax FAF frequency column (filtering allele frequency):
| FAF Threshold | ACMG Evidence | Interpretation |
|---|---|---|
| >= 0.05 (5%) | BA1 | Stand-alone benign - too common for rare disease |
| >= 0.01 (1%) | BS1 | Strong benign evidence |
| < 0.01 | Continue | Rare enough for disease consideration |
Disease-Specific Maximum Credible AF:
For a specific disease, calculate the maximum credible allele frequency:
Dominant: maxAF = prevalence / (2 × penetrance × heterogeneity)
Recessive: maxAF = sqrt(prevalence / (penetrance × heterogeneity))
Example for familial hypercholesterolemia (FH):
If GroupMax FAF frequency > maxAF, variant is TOO COMMON for this disease.
High-Impact Consequences (potential loss-of-function):
frameshift_variantstop_gained (nonsense)splice_acceptor_variantsplice_donor_variantstart_lostModerate-Impact:
missense_variantinframe_deletioninframe_insertionLow-Impact:
synonymous_variant5_prime_UTR_variant3_prime_UTR_variantintron_variantCheck Flags column for:
lcr - Low complexity region (sequencing artifacts possible)lc_lof - Low-confidence loss-of-functionUse gnomAD-calibrated thresholds for prediction scores:
| Predictor | Column | Pathogenic | Uncertain | Benign |
|---|---|---|---|---|
| REVEL | revel_max | >= 0.773 | 0.644-0.773 | < 0.644 |
| CADD | cadd | >= 28.1 | 25.3-28.1 | < 25.3 |
| SpliceAI | spliceai_ds_max | >= 0.5 | 0.2-0.5 | < 0.2 |
| Pangolin | pangolin_largest_ds | >= 0.5 | 0.2-0.5 | < 0.2 |
| phyloP | phylop | >= 9.741 | 7.367-9.741 | < 7.367 |
Consensus Interpretation:
Check ClinVar Germline Classification:
Pathogenic / Likely pathogenic - Clinical evidence supports disease causationUncertain significance - Insufficient evidenceLikely benign / Benign - Clinical evidence against pathogenicityConflicting - Discordant submissionsReview these columns for data quality:
Filters - exomes: Should be "PASS" for high-quality callsFilters - genomes: Should be "PASS" if genome data presentFlags: Watch for lcr (low complexity region)Examine population-specific allele counts to identify:
Calculate population-specific AF: AC / AN for each population.
Provide structured interpretation:
## Variant Interpretation: [gnomAD ID]
### Basic Information
- Gene: [from filename or context]
- Consequence: [VEP Annotation]
- Protein change: [Protein Consequence]
- ClinVar: [ClinVar Germline Classification]
### Population Frequency
- Overall AF: [Allele Frequency]
- Filtering AF: [GroupMax FAF frequency] ([GroupMax FAF group])
- ACMG evidence: [BA1/BS1/None]
### In Silico Predictions
- REVEL: [score] - [interpretation]
- CADD: [score] - [interpretation]
- SpliceAI: [score] - [interpretation]
- Pangolin: [score] - [interpretation]
- phyloP: [score] - [interpretation]
- **Consensus:** [PP3/BP4/None]
### Quality Assessment
- Exome filters: [status]
- Genome filters: [status]
- Flags: [any flags or "None"]
### Preliminary Classification
[Based on evidence above]
### Recommendations
[Additional evidence needed for definitive classification]
npx claudepluginhub broadinstitute/tgg-llm-plugins --plugin variant-interpreterQueries gnomAD for population allele frequencies, variant constraint scores (pLI, LOEUF), and loss-of-function intolerance. Essential for variant pathogenicity interpretation and rare disease genetics.
Queries gnomAD for population allele frequencies, variant annotations, and gene constraint scores (pLI, LOEUF) via the GraphQL API. Use when assessing variant rarity, filtering rare-disease candidates, or evaluating loss-of-function intolerance.
Queries gnomAD v4 variant frequencies via GraphQL API, stratified by 9 ancestry groups, plus gene constraint metrics (pLI, LOEUF) and coverage.