chemaudit

Name: chemaudit
Author: kohulan


Validate, curate, audit, and filter chemical structures for QSAR/ML drug discovery workflows using ChemAudit tools. Batch-process SMILES/InChI/CSV/SDF files, standardize with ChEMBL-compatible pipelines and provenance tracking, detect dataset issues such as contradictory labels and parse errors, filter generative outputs with REINVENT scorers, and look up or resolve compounds across PubChem, ChEMBL, COCONUT, Wikidata, and SureChEMBL.

npx claudepluginhub kohulan/chemaudit

Popularity

Stars

Top 10%

Med: 0·Avg: 285

Installs

Med: 0·Avg: 1

What's Inside

Skills (8)

chemaudit-batch-validation

/chemaudit-batch-validation

Process CSV, TSV, TXT, or SDF files of molecules through ChemAudit's batch pipeline with Redis-backed progress tracking, on-demand analytics (scaffold, chemical space, clustering, MMP, taxonomy, R-group), and nine export formats. Use when user says "validate this file", "batch validation", "process these molecules", "upload CSV of SMILES", "export results", "scaffold analysis", "cluster compounds", "Butina clustering", or "PDF report for these molecules". File-size and molecule-count limits depend on the deployment profile, so query /config first.

chemaudit-database-integrations

/chemaudit-database-integrations

Look up molecules across PubChem, ChEMBL, COCONUT, Wikidata, and SureChEMBL patent databases, resolve chemical identifiers (CAS, ChEMBL ID, PubChem CID, ChEBI, UNII, DrugBank, names), and run cross-database consistency checks. Use when user says "look up in PubChem", "ChEMBL bioactivity", "natural product search", "COCONUT lookup", "Wikidata lookup", "resolve CAS number", "resolve ChEMBL ID", "cross-database check", "find this compound", "patent search", "SureChEMBL", or "identifier resolution". Supports SMILES, InChI, InChIKey, CAS, ChEMBL ID, PubChem CID, ChEBI, UNII, DrugBank ID, Wikipedia URL, and compound names (OPSIN + PubChem fallback).
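To make the identifier-resolution step more concrete, here is a minimal Python sketch of how an input string might be classified before it is sent to a lookup. It is not ChemAudit's resolver: the function names, the returned type labels, and the regular expressions are assumptions chosen for illustration. The CAS check-digit rule itself is the standard one (digits weighted 1, 2, 3, ... from the right, sum taken modulo 10).

```python
import re

# Hypothetical patterns for illustration; ChemAudit's own resolver may differ.
INCHIKEY_RE = re.compile(r"^[A-Z]{14}-[A-Z]{10}-[A-Z]$")
CHEMBL_RE = re.compile(r"^CHEMBL\d+$", re.IGNORECASE)
CAS_RE = re.compile(r"^(\d{2,7})-(\d{2})-(\d)$")


def is_valid_cas(text: str) -> bool:
    """Check CAS Registry Number format and its check digit."""
    match = CAS_RE.match(text)
    if not match:
        return False
    digits = match.group(1) + match.group(2)
    # Weight the digits 1, 2, 3, ... starting from the rightmost one.
    total = sum(int(d) * w for w, d in enumerate(reversed(digits), start=1))
    return total % 10 == int(match.group(3))


def guess_identifier_type(text: str) -> str:
    """Return a coarse, assumed label for the kind of identifier given."""
    text = text.strip()
    if text.startswith("InChI="):
        return "inchi"
    if INCHIKEY_RE.match(text):
        return "inchikey"
    if CHEMBL_RE.match(text):
        return "chembl_id"
    if is_valid_cas(text):
        return "cas"
    if text.isdigit():
        # Assumption: a bare integer is treated as a PubChem CID.
        return "pubchem_cid"
    # Anything else would need a SMILES parser or a name-to-structure service.
    return "name_or_smiles"


if __name__ == "__main__":
    for value in ["50-78-2", "CHEMBL25", "2244", "CCO"]:
        print(value, "->", guess_identifier_type(value))
```

A classifier like this only narrows the search; ambiguous inputs such as short names that also parse as SMILES still need the fallback chain described above.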

chemaudit-dataset-intelligence

/chemaudit-dataset-intelligence

Audit dataset health with a Fourches-style 5-component score, detect contradictory bioactivity labels across duplicates, compare two dataset versions (added / removed / modified / unchanged), and generate curation reports. Use when user says "audit this dataset", "dataset health score", "contradictory labels", "duplicate activity conflict", "dataset diff", "compare dataset versions", "curation report", "data quality check", or "clean this dataset for ML". Accepts CSV and SDF with optional activity columns.
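The contradiction check and the version diff described above can be sketched in a few lines of plain Python. This is an illustrative sketch, not ChemAudit's implementation: it assumes every record has already been reduced to a canonical structure key (for example an InChIKey), and the function names and output shape are hypothetical.

```python
from collections import defaultdict


def find_contradictions(records):
    """Return keys whose duplicate records carry more than one distinct label.

    `records` is an iterable of (structure_key, label) pairs. The key is
    assumed to be a canonical identifier such as an InChIKey.
    """
    labels_by_key = defaultdict(set)
    for key, label in records:
        labels_by_key[key].add(label)
    return {k: sorted(v) for k, v in labels_by_key.items() if len(v) > 1}


def diff_datasets(old, new):
    """Classify structure keys as added, removed, modified, or unchanged.

    `old` and `new` map a structure key to its row payload (any value that
    supports equality comparison).
    """
    shared = old.keys() & new.keys()
    return {
        "added": sorted(new.keys() - old.keys()),
        "removed": sorted(old.keys() - new.keys()),
        "modified": sorted(k for k in shared if old[k] != new[k]),
        "unchanged": sorted(k for k in shared if old[k] == new[k]),
    }


if __name__ == "__main__":
    rows = [("KEY-1", "active"), ("KEY-1", "inactive"), ("KEY-2", "active")]
    print(find_contradictions(rows))
    print(diff_datasets({"A": 1, "B": 2}, {"B": 3, "C": 4}))
```

Keying on a canonical identifier rather than the raw SMILES string matters here, because two different SMILES spellings of the same molecule would otherwise be missed as duplicates.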

chemaudit-diagnostics

/chemaudit-diagnostics

Diagnose SMILES parse errors with position and fix suggestions, compare InChI strings layer-by-layer, check format round-trip lossiness (SMILES→InChI→SMILES and SMILES→MOL→SMILES), compare three standardization pipelines side-by-side, and pre-validate SDF/CSV files for structural integrity. Use when user says "why does this SMILES fail", "diagnose this structure", "InChI diff", "layer comparison", "round-trip check", "compare InChI strings", "file pre-validation", "SDF integrity", "M END missing", or "fix this SMILES error".
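As a rough illustration of a layer-by-layer InChI comparison, the sketch below splits two InChI strings on "/" and reports the layers that differ. It is a simplification and not ChemAudit's diagnostic: the function names are hypothetical, and the dictionary keeps only the last occurrence of a repeated prefix, so strings with fixed-H or isotopic sublayers that reuse a prefix letter would need a richer structure.

```python
def split_inchi_layers(inchi: str) -> dict:
    """Split an InChI string into a {layer_name: value} mapping."""
    if not inchi.startswith("InChI="):
        raise ValueError("not an InChI string")
    parts = inchi[len("InChI="):].split("/")
    layers = {"version": parts[0]}
    rest = parts[1:]
    # The formula layer has no lowercase prefix letter; when present it
    # is the first segment after the version.
    if rest and not rest[0][:1].islower():
        layers["formula"] = rest[0]
        rest = rest[1:]
    for segment in rest:
        if segment:
            # Simplification: a repeated prefix overwrites the earlier one.
            layers[segment[0]] = segment[1:]
    return layers


def diff_inchi_layers(a: str, b: str) -> dict:
    """Return {layer: (value_in_a, value_in_b)} for layers that differ."""
    layers_a, layers_b = split_inchi_layers(a), split_inchi_layers(b)
    names = sorted(layers_a.keys() | layers_b.keys())
    return {
        name: (layers_a.get(name), layers_b.get(name))
        for name in names
        if layers_a.get(name) != layers_b.get(name)
    }


if __name__ == "__main__":
    # Two alanine enantiomers: only the /m stereo layer differs.
    first = "InChI=1S/C3H7NO2/c1-2(4)3(5)6/h2H,4H2,1H3,(H,5,6)/t2-/m0/s1"
    second = "InChI=1S/C3H7NO2/c1-2(4)3(5)6/h2H,4H2,1H3,(H,5,6)/t2-/m1/s1"
    print(diff_inchi_layers(first, second))
```

A layer missing from one string shows up as None on that side, which separates "stereo not specified" from "stereo specified differently".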

chemaudit-molecule-validation

/chemaudit-molecule-validation

Validate and score chemical structures using ChemAudit's 16 deep validation checks, 1,500+ structural alerts, and multi-rule drug-likeness scoring. Use when user says "validate this molecule", "check this SMILES", "is this compound valid", "ML-readiness score", "drug-likeness", "deep validation", "quality score", "PAINS check", or asks about stereochemistry, valence, tautomers, or structural issues. Supports SMILES, InChI, MOL blocks, IUPAC names (OPSIN + PubChem), and database identifiers (ChEMBL ID, CAS, PubChem CID, InChIKey) via the /resolve endpoint.

MCP Servers (1)

chemaudit