From encode-toolkit
Assesses trustworthiness of scientific publications by checking retractions, corrections, expressions of concern, and reproducibility failures via PubMed, bioRxiv, Consensus.
How this skill is triggered — by the user, by Claude, or both
Slash command
/encode-toolkit:publication-trustThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Evaluate the scientific integrity and reliability of publications before building analyses on their findings.
Evaluate the scientific integrity and reliability of publications before building analyses on their findings.
Not all published findings are reliable. Problems range from formal retractions (data fabrication, image manipulation) to informal contradictions where independent groups fail to reproduce key claims. In genomics and computational biology, building pipelines or analyses on unreliable findings wastes resources and propagates errors.
The problem is not always obvious. Formal retractions are rare compared to the number of problematic papers. More commonly:
This skill provides a systematic approach to detecting these issues.
| Level | Label | Meaning |
|---|---|---|
| 5 | High confidence | No issues found; replicated by independent groups |
| 4 | Standard | No issues found; not yet independently replicated |
| 3 | Caution advised | Minor corrections, errata, or partial contradictions exist |
| 2 | Reliability concerns | Key findings contradicted by independent study, or expression of concern issued |
| 1 | Compromised | Retracted, or key findings refuted with evidence of methodological problems |
Default trust level is 4 (standard) — absence of evidence is not evidence of absence. A paper starts at "standard" and moves up with independent replication or down with identified issues.
For any publication being assessed, retrieve full metadata:
get_article_metadata(pmids=["PMID"])
Record:
If starting from a DOI:
convert_article_ids(ids=["DOI"], id_type="doi")
Search PubMed for retraction notices linked to this paper:
search_articles(query="PMID[PMID] AND (Retracted Publication[pt] OR Retraction of Publication[pt])")
Also check the article_types field from Step 1 — if it contains "Retracted Publication", the paper has been formally retracted.
If retracted → Trust Level 1 (Compromised)
Search for corrections:
search_articles(query="PMID[PMID] AND (Published Erratum[pt] OR Correction[pt])")
Errata can be minor (typo in a table) or major (recalculated results that change conclusions). Read the erratum to distinguish:
search_articles(query="PMID[PMID] AND Expression of Concern[pt]")
An Expression of Concern from a journal editor indicates an active investigation. Trust Level 2 (Reliability concerns) until resolved.
This is the most important and most nuanced step. Many problematic findings are never formally retracted — they are contradicted by subsequent independent work.
find_related_articles(pmids=["PMID"], link_type="pubmed_pubmed", max_results=50)
This returns computationally similar articles. From these, search for contradiction signals.
Search PubMed for articles that cite the original AND contain contradiction language:
search_articles(query="\"[key claim from title]\" AND (\"fail to replicate\" OR \"unable to reproduce\" OR \"does not\" OR \"do not support\" OR \"contradicts\" OR \"challenges\" OR \"reanalysis\" OR \"re-analysis\" OR \"not reproducible\" OR \"could not confirm\")")
Also search with the first author's last name + key topic terms:
search_articles(query="[FirstAuthor] [KeyTopic] AND (Comment[pt] OR Letter[pt] OR \"fail\" OR \"does not\")")
Use Consensus to find contradicting evidence:
consensus_search(query="does [key claim from paper] replicate? contradiction evidence")
Contradictions sometimes appear first as preprints:
search_preprints(category="[relevant category]", date_from="[pub date]", date_to="[today]")
Not every disagreement is a refutation. Assess:
| Factor | Strengthens contradiction | Weakens contradiction |
|---|---|---|
| Independent lab | Yes — different group, different reagents | Same group correcting themselves (may be honest science) |
| Sample size | Larger sample in contradicting study | Smaller sample or different model system |
| Methodology | Direct replication attempt | Different methodology that may explain discrepancy |
| Specificity | Contradicts the specific key claim | Disagrees on secondary finding |
| Mechanism | Provides alternative explanation with evidence | Simply fails to replicate without explanation |
If key finding is contradicted by independent group with stronger methodology → Trust Level 2
If a contradiction or retraction is found, check whether the authors have other problematic publications:
search_articles(query="[SeniorAuthor][Author] AND Retracted Publication[pt]")
A pattern of retractions from the same group is a stronger signal than a single incident.
When flagging author concerns, use measured language:
Never use: fabrication, fraud, doctored, liar, shady, dishonest — unless a formal investigation has published findings using those terms. Stick to what the published record shows.
Present findings in a structured format:
## Publication Trust Assessment
**Paper**: [Title] ([Journal], [Year])
**PMID**: [PMID] | **DOI**: [DOI]
**Authors**: [First Author] ... [Senior Author]
### Trust Level: [X/5] — [Label]
### Formal Markers
- Retraction: None / Yes (date, reason)
- Corrections: None / [count] ([minor/major])
- Expression of Concern: None / Yes (date)
### Contradicting Evidence
- [Citation of contradicting paper] — [brief description of contradiction]
- Independent replication: Yes/No/Unknown
### Author Context
- [Any relevant notes about author track record]
### Recommendation
- [Clear guidance on whether to rely on this paper's findings]
When this skill identifies a trust issue, the finding should propagate:
Original: Li et al. 2016, Cell — "Artemisinins Target GABA Receptor Signaling and Impair Alpha Cell Identity" (Kubicek lab)
Contradicting: van der Meulen et al. 2017, Cell Metabolism — "Artemether Does Not Turn Alpha Cells into Beta Cells" (Huising lab)
Trust Level: 2 (Reliability concerns) — Key finding directly contradicted by independent group using primary cells (vs. cell lines in original).
Author context: When encountering other publications from this group, note that a previous high-profile finding was independently contradicted.
Any paper with article_types containing "Retracted Publication":
Original: Potti et al. 2006, Nature Medicine — Genomic signatures predicting individual patient chemotherapy response. Used microarray data to build classifiers for drug sensitivity.
Contradicting: Baggerly & Coombes 2009, Annals of Applied Statistics — Independent reanalysis found simple off-by-one indexing errors, mislabeled cell lines, and reversed sensitive/resistant labels. The genomic signatures were artifacts of data handling mistakes, not biology.
Outcome: Clinical trials based on the signatures were halted. The paper was retracted in 2011. Senior author had multiple subsequent retractions.
Trust Level: 1 (Compromised) — Statistical reanalysis using the original data conclusively demonstrated methodological errors. This case illustrates why reanalysis (using the same data to reach different conclusions) is the strongest form of contradiction — it eliminates methodology differences as an explanation.
If the user reviews a flagged paper and determines the contradiction doesn't apply (e.g., different model system, different question), they can override:
Trust assessments are living documents — they should be updated as new evidence emerges.
Goal: Validate key literature claims that inform your ENCODE analysis pipeline — ensuring you're building on solid scientific ground before investing analysis effort. Context: Genomics analysis choices (peak callers, normalization methods, quality thresholds) are often justified by citing papers. This skill verifies those citations are accurate and current.
Before running a ChIP-seq analysis, you might rely on these claims:
Search PubMed for the original MACS2 publication and recent benchmarks:
search_articles(query="MACS2 peak calling benchmark ChIP-seq", max_results=5, sort="relevance")
Key findings:
search_articles(query="Landt 2012 ChIP-seq quality guidelines ENCODE", max_results=3)
Key findings:
encode_track_experiment(accession="ENCSR000AKA", notes="H3K27ac ChIP-seq - QC thresholds verified per Landt 2012 (PMID:22955991)")
Expected output:
{
"status": "tracked",
"accession": "ENCSR000AKA",
"notes": "H3K27ac ChIP-seq - QC thresholds verified per Landt 2012 (PMID:22955991)"
}
Record which claims were verified, sources found, and any caveats:
encode_get_experiment(accession="ENCSR123ABC")
Expected output:
{
"accession": "ENCSR123ABC",
"assay_title": "Histone ChIP-seq",
"target": "H3K27ac-human",
"status": "released",
"audit": {"WARNING": 1, "NOT_COMPLIANT": 0, "ERROR": 0},
"replicates": 2,
"lab": "Bernstein, Broad",
"date_released": "2020-03-15"
}
Trust check: 0 errors, 2 replicates, released status — meets ENCODE standards.
encode_get_citations(accession="ENCSR123ABC")
Expected output:
{
"accession": "ENCSR123ABC",
"citations": {
"consortium": "ENCODE Project Consortium. Nature 2020;583:699-710",
"lab_publication": "Roadmap Epigenomics. Nature 2015;518:317-330",
"data_citation": "ENCODE Project. https://www.encodeproject.org/experiments/ENCSR123ABC/"
}
}
encode_compare_experiments(
accession1="ENCSR123ABC",
accession2="ENCSR456DEF"
)
Expected output:
{
"compatible": true,
"shared": {"organism": "Homo sapiens", "assembly": "GRCh38"},
"differences": {"lab": ["Bernstein, Broad", "Snyder, Stanford"]},
"warnings": ["Different labs — check for batch effects"]
}
| This skill produces... | Feed into... | Purpose |
|---|---|---|
| Verified QC thresholds | quality-assessment | Use validated thresholds for ENCODE data QC |
| Citation verification reports | cite-encode | Ensure cited methods papers are accurate |
| Validated pipeline recommendations | pipeline-guide | Select pipelines backed by verified benchmarks |
| Literature provenance records | data-provenance | Document which papers justified analysis choices |
| Verified methods claims | scientific-writing | Write methods sections with verified citations |
| Updated tool recommendations | bioinformatics-installer | Install tools backed by current benchmarks |
| Verified regulatory annotations | regulatory-elements | Confirm annotation sources are authoritative |
| Literature search results | cross-reference | Link ENCODE experiments to verified publications |
cite-encode — Citation management with trust integrationquality-assessment — ENCODE experiment quality (complementary to publication quality)data-provenance — Track trust assessments in provenance chainsdisease-research — Disease research workflows that depend on publication reliabilitynpx claudepluginhub ammawla/encode-toolkit --plugin encode-toolkitAssesses trustworthiness of scientific publications by checking retractions, corrections, expressions of concern, and reproducibility failures via PubMed, bioRxiv, Consensus.
Conducts systematic literature reviews across PubMed, arXiv, bioRxiv, and Semantic Scholar, producing markdown and PDF output with verified citations. Use for meta-analysis, research synthesis, or broad literature searches in biomedical and scientific domains.
Conducts systematic literature reviews across PubMed, arXiv, bioRxiv, Semantic Scholar and other academic databases. Generates professionally formatted markdown and PDF documents with verified citations in APA, Nature, Vancouver and other styles.