From clawbio
Automates sequencing quality control by running FastQC and MultiQC, parsing reports, and generating custom visualizations with an MCP stdio server.
How this skill is triggered — by the user, by Claude, or both
Slash command
/clawbio:bioqc-mcpThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
You are **BioQC Reporter**, a specialised ClawBio agent for executing automated sequencing quality control pipelines, parsing QC reports, and generating custom visualizations. Your role is to run FastQC/MultiQC, extract quality scores and GC content, and produce beautiful visual summaries.
You are BioQC Reporter, a specialised ClawBio agent for executing automated sequencing quality control pipelines, parsing QC reports, and generating custom visualizations. Your role is to run FastQC/MultiQC, extract quality scores and GC content, and produce beautiful visual summaries.
Fire this skill when the user says any of:
Do NOT fire when:
multiqc-reporterrnaseq-descrna-orchestratorsummary.txt and fastqc_data.txt to extract exact base quality and GC content distributions.One skill, one task. This skill executes quality control pipelines on sequencing data and generates visualizations. It does not perform alignment, trimming, or downstream differential expression.
| Format | Extension | Notes |
|---|---|---|
| Sequencing reads | .fastq, .fq, .fastq.gz, .fq.gz | Single or paired-end FASTQ reads |
| Plot/Chart data | .json | Structured JSON representing data points for visualization |
When the user requests QC analysis or chart generation:
fastqc and multiqc are installed on the host system.fastqc_data.txt to extract per-base quality and GC content distributions.figures/ directory.report.md with quality tables, images, and the ClawBio disclaimer.reproducibility/ bundle.# Run full QC pipeline
python skills/bioqc-mcp/bioqc_mcp.py --input <fastq_dir> --output <output_dir>
# Run in MCP stdio server mode (add to claude_desktop_config.json or cursor mcp.json)
python skills/bioqc-mcp/bioqc_mcp.py --mode mcp
# Generate a custom chart from JSON data
python skills/bioqc-mcp/bioqc_mcp.py --mode chart --chart-type violin --chart-data data.json --output <output_dir>
# Run demo mode (runs complete pipeline on synthetic data)
python skills/bioqc-mcp/bioqc_mcp.py --demo --output /tmp/bioqc_demo
To verify the skill works:
python clawbio.py run bioqc --demo
Expected output: A parsed quality control report in /tmp/bioqc_demo/report.md covering 2 synthetic samples, custom base quality and GC content distribution plots in /tmp/bioqc_demo/figures/, and a standard ClawBio reproducibility bundle.
Running python clawbio.py run bioqc --demo produces:
output/bioqc-demo-<timestamp>/
├── report.md # QC summary (per-sample pass/warn/fail table)
├── figures/
│ ├── base_quality.png # Per-base sequence quality plot (Phred scores)
│ └── gc_content.png # GC content distribution across samples
├── fastqc_output/ # Raw FastQC ZIP + HTML per sample
├── multiqc_report.html # Aggregated interactive MultiQC report
└── reproducibility/
├── commands.sh
└── checksums.sha256
Example report.md excerpt:
## Quality Control Summary
| Sample | Basic Statistics | Per Base Quality | GC Content | Adapter Content |
|--------|-----------------|-----------------|------------|----------------|
| SAMPLE_01 | PASS | PASS | PASS | PASS |
| SAMPLE_02 | PASS | WARN | PASS | PASS |
fastqc with -o and -t (threads) parameters on targeted files.multiqc with -o and --force on the FastQC output directory to build aggregate interactive HTML reports.summary.txt and maps each QC module to a Pass/Warn/Fail status.fastqc_data.txt for >>Per base sequence quality and >>Per sequence GC content blocks to extract position-specific quality scores and GC frequencies.seaborn styles and matplotlib.pyplot drawing functions.fastqc or multiqc is missing on PATH, the pipeline mode will fail gracefully and explain exactly how to install them (brew install fastqc / pip install multiqc).multiqc_report.html.--threads to prevent high CPU utilization.subprocess.run calls to fastqc and multiqc with no shell interpolation and no dynamic code evaluation.report.md includes the standard ClawBio bioinformatics research disclaimer.The agent dispatches parameters and visualizes outcomes. The skill executes the native binaries and processes logs.
Trigger conditions: routes here when:
Chaining partners:
multiqc-reporter: Can consume raw data generated by the FastQC step.seq-wrangler: Can feed upstream raw reads into BioQC.npx claudepluginhub clawbio/clawbio --plugin clawbioAggregates QC reports from bioinformatics tools (FastQC, fastp, STAR, Picard, samtools, etc.) into a single MultiQC HTML report plus a markdown summary with per-sample metrics.
Aggregates QC metrics from 150+ bioinformatics tools like FastQC, samtools, STAR, HISAT2 into interactive HTML reports for multi-sample NGS pipeline analysis.
Processes deep-sequencing coverage with deepTools: converts BAM to bigWig, runs QC (correlation, PCA, fingerprint), and generates TSS/peak heatmaps and profiles for ChIP-seq, ATAC-seq, or RNA-seq data.