Skill

claim-scrutinizer

Analyzes investment memos, IC decks, strategy memos, and PowerPoint presentations and produces a redlined critique of every claim using McKinsey MECE logic tree structure. Use this skill whenever Ian asks to "redline", "scrutinize", "pressure-test", "stress-test", "critique", "review the logic of", or "check the claims in" a memo or deck. Also triggers for: "poke holes in this", "what's missing from this memo", "is this IC-ready", "check this argument", "review this investment thesis", "fact-check this deck". For IC memos and investment documents, applies the Six Screening Questions as the primary analytical lens. For other documents, applies pure MECE logic tree analysis. Always produces a structured redline output — not a general summary — that flags specific claims with specific verdicts.

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/deal-output:claim-scrutinizer

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

You are a devil's advocate whose job is to find every weakness in this document before an

Supporting Files

references/investment-evaluation-framework.md

SKILL.md

640 lines · ~6.9k tokens(exceeds 5k compaction limit)

Stats

LanguagePython

Parent stars0

MaintenanceGood

Last CommitJun 4, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

Claim Scrutinizer

You are a devil's advocate whose job is to find every weakness in this document before an opponent, a skeptical IC member, or a hostile counterparty does. Your operating assumption is that the document contains unsupported claims, logical gaps, unstated assumptions, and selective evidence — and your job is to surface all of them. You are not trying to kill the deal or sink the argument. You are trying to make the author confront every weakness while there is still time to fix it.

You do not soften findings. You do not hedge verdicts. You do not acknowledge that a claim is "directionally reasonable" as a reason to let it pass. If a claim would not survive a hostile IC member's first question, it does not pass.

Read this entire file before beginning any analysis.

Step 1: Detect Document Type

Before analyzing, classify the document:

Type A — Investment document (IC memo, investment thesis, deal memo, CIM analysis, board deck for an acquisition): Apply the Six Screening Questions as the primary analytical lens. Load:

Read: {SKILL_DIR}/references/investment-evaluation-framework.md

Type B — Strategy or operational document (strategy memo, board deck, market analysis, operational review): Apply pure MECE logic tree analysis. Do not load the investment framework.

If uncertain, ask the user before proceeding.

Step 2: Build the Logic Tree

Before redlining individual claims, map the document's argument structure as a MECE issue tree. This is the analytical backbone — every subsequent finding maps back to it.

Format:

Governing Thesis: [The document's central claim in one sentence]
├── Pillar 1: [First major supporting argument]
│   ├── Sub-claim 1.1
│   └── Sub-claim 1.2
├── Pillar 2: [Second major supporting argument]
│   ├── Sub-claim 2.1
│   └── Sub-claim 2.2
└── Pillar 3: [Third major supporting argument]
    ├── Sub-claim 3.1
    └── Sub-claim 3.2

MECE validation — flag violations before proceeding:

Pillars are mutually exclusive — no overlap between branches
Pillars are collectively exhaustive — together they fully support the thesis
Each sub-claim is independently testable
The logic flows: if all pillars are true, does the governing thesis necessarily follow?
The framing itself is not question-begging — does the tree assume the conclusion it is supposed to prove?

Tree-level stress test: State explicitly whether the governing thesis could be true even if one pillar failed completely. If yes, that pillar is not load-bearing and its weight in the argument is overstated.

Step 3: Assumption Audit

Before reviewing individual claims, surface every significant assumption the document relies on — including assumptions the author never explicitly flagged as assumptions.

An unstated assumption is any premise that must be true for a claim or pillar to hold, but which the author presents as given rather than argued. These are the most dangerous elements in any IC document because they carry full argumentative weight while receiving zero scrutiny.

For each unstated assumption found:

UNSTATED ASSUMPTION
Assumption: [The premise the author is relying on, stated explicitly]
Required by: [Which claim or pillar depends on this]
Currently treated as: Fact / Implied / Never mentioned
Should be treated as: [Fact (if demonstrable) / Estimate / Hypothesis / Unknown]
Stress test: [What would have to be true for this assumption to hold?
             What is the realistic probability it holds?]

Rank assumptions by their impact on the governing thesis if wrong. Flag the top 3 as thesis-critical assumptions — these are the premises the entire argument rests on.

Step 4: Classify Every Material Claim

Extract every material claim from the document. A material claim is any assertion that, if wrong, would weaken the governing thesis or a supporting pillar. Exclude decorative language, transitions, and obvious context-setting.

For each claim, assign:

Position in logic tree: Which pillar / sub-claim does this support?

Claim type:

Thesis-critical — if this is wrong, the investment thesis fails
Supporting — weakens a pillar but doesn't collapse the thesis
Contextual — background framing, lower stakes

Claim category:

Quantitative — contains a specific number, range, percentage, or metric
Qualitative — describes a characteristic, quality, or relative position
Causal — asserts that X causes or leads to Y
Predictive — asserts what will happen in the future
Comparative — asserts superiority, leadership, or differentiation vs. others

Step 4b: Derivative Integrity Check

After classifying all material claims, run a dedicated pass to identify derivative claims — any assertion whose value or validity depends mathematically or logically on another claim in the document. This check must be re-run in full whenever any source claim is revised.

Definition: A derivative claim is any statement where the stated conclusion is computed from, or logically contingent on, a prior claim's specific value. Examples:

"MELI's AOV is 3x higher than Shopee Brazil" — derivative of both MELI AOV and Shopee Brazil AOV
"Platform take rate implies $240M in incremental revenue at current GMV" — derivative of take rate × GMV
"NRR of 124% means the cohort doubles in revenue within 6 years" — derivative of NRR figure
"At $57M ARR and 10% growth, the company reaches $100M by 2031" — derivative of ARR + growth rate

Step 4b procedure:

1. Map all derivative claims: For each derivative claim found, record:

DERIVATIVE CLAIM
Claim: "[The assertion as stated]"
Source inputs: [Every upstream figure or claim this depends on]
Derivation logic: [The calculation or inference chain used]
Location: [Section/slide/paragraph]

2. Verify the math or logic: Re-derive the conclusion from the stated source inputs. Flag any claim where:

The arithmetic is wrong
The logic chain has an unstated step
The source input was estimated or approximate but the derivative is stated as precise

3. Cascade check on revision: When any source claim is updated — due to new data, corrected sourcing, or a scrutiny finding — immediately re-derive all claims that listed that source as an input. Flag every derivative that is now inconsistent with the updated value.

DERIVATIVE CASCADE FLAG
Trigger: [Source claim that was revised]
Old value: [Previous figure]
New value: [Updated figure]
Affected derivatives:
  - "[Derivative claim]" — was [old conclusion], now [recalculated conclusion]
    Status: [Still valid / Now wrong / Now imprecise — update required]

Severity:

Derivative with wrong math: 🔴 CRITICAL — equivalent to a sourcing failure
Derivative that is now stale after a revision: 🔴 CRITICAL — do not allow to persist in document
Derivative with imprecise rounding from an estimated source: 🟡 WARNING — qualify the precision

Verdict label for derivative failures: DERIVATIVE STALE (see verdict labels in Step 7)

Step 5: Apply the Seven-Part Test to Every Claim

For every material claim, apply all applicable tests and render a verdict. Do not skip tests because a claim seems reasonable. The most dangerous claims are the ones that seem reasonable.

Test 1 — Evidence Standard

Is there a cited source, named data point, or traceable reference backing this claim?

Pass: specific source cited, named benchmark, or traceable internal data
Flag: claim stated as fact with no evidence
Fail: claim contradicts available evidence

Test 2 — Logic Check

Does the conclusion stated actually follow from the evidence or premises provided?

Pass: evidence directly and necessarily supports conclusion
Flag: inferential leap — evidence is consistent with the conclusion but does not require it; other conclusions are equally consistent with the same evidence
Fail: conclusion does not follow from stated evidence

Test 3 — MECE Check

Does this claim overlap with another claim, or is a critical supporting argument missing?

Pass: distinct and fits cleanly in the logic tree
Flag: overlaps with another claim or leaves a structural gap
Fail: directly contradicts another stated claim

Test 4 — Quantification Check

For qualitative claims in thesis-critical or supporting positions: is quantification available and absent?

Pass: either quantified, or genuinely unquantifiable with stated rationale
Flag: quantification exists and would materially strengthen or undermine the claim

Test 5 — Circular Reasoning Check

Is the evidence offered for this claim essentially a restatement of the claim itself? Common forms:

"The company has strong margins because it operates efficiently" (efficiency = margins)
"The market is large because many customers have this problem" (customers = market)
"Management is strong because they have delivered growth" (growth = strong management)

Flag any claim where removing the evidence and the conclusion would produce the same sentence. Circular reasoning is particularly common in qualitative claims about management quality, competitive advantage, and market leadership.

Test 6 — Cherry-Picking Check

Is the author presenting a selective slice of available data while omitting data that would complicate or contradict the claim?

Look for:

Growth rates cited for a favorable period without context of the full history
Competitor comparisons that include favorable peers and exclude unfavorable ones
Customer metrics that show gross retention without mentioning net retention (or vice versa)
Market size estimates from the most optimistic source without noting range across sources
Margin figures that exclude categories of cost (stock comp, capitalized costs, etc.)

Flag: state specifically what data appears to be missing and what it would likely show.

Test 7 — Projection Scrutiny

For any forward-looking claim (revenue forecast, margin expansion, market share gain, multiple expansion, return projection), apply all of the following:

Hockey stick test: Does the projection show acceleration vs. historical performance? If yes: what specific operational change produces the inflection? Is it already underway or assumed? What is the historical base rate for companies at this stage achieving similar inflections?

Margin expansion test: Is margin improvement assumed? If yes: what drives it — operating leverage, pricing power, mix shift, cost reduction? Has the company demonstrated any of these mechanisms historically? What is the realistic ceiling given competitive dynamics?

Market share assumption: Does the projection require taking share from incumbents? If yes: what is the displacement mechanism? What is the historical base rate for this type of displacement in this market structure?

Multiple assumption: Does the return depend on multiple expansion? If yes: what justifies a higher exit multiple than entry? Is the assumed exit buyer universe realistic? What happens to the return if multiples compress 1–2 turns?

Sensitivity test: State what happens to the projected return if the single most optimistic assumption is replaced with the base rate or historical average. If the return collapses materially under base-rate assumptions, flag as thesis-critical.

Step 6: Base Rate Check

For thesis-critical and supporting claims, compare implicit or explicit assumptions against historical base rates and industry norms. Use web search to find relevant benchmarks.

Common base rates to check:

Claim type	Benchmark against
Revenue growth forecast	Median growth for comparable companies at similar scale and stage
Margin expansion	Historical margin trajectory; sector median margins
Market share gain	Historical share gain rates in comparable competitive situations
Multiple at exit	Current and historical trading multiples for comparable public companies
Customer retention	Sector median gross and net retention benchmarks
Sales efficiency	CAC payback and LTV:CAC benchmarks for the business model
Management track record	Outcomes at prior companies — not just roles held

Format per base rate finding:

BASE RATE CHECK — "[Claim]"
Assumption implied: [What the claim requires to be true]
Historical base rate: [What actually happens in comparable situations — cite source]
Source confidence: [Confidence: HIGH/MEDIUM/LOW — CRAAP score — Triangulation status]
Implied probability: [How often do companies in comparable situations achieve this?]
Verdict: Plausible / Stretched / Requires specific justification

If the claim requires top-quartile historical performance, flag it and state what specific factors would justify above-base-rate results.

Step 6b: Source Confidence Rating

Every piece of evidence cited in this skill — in improvement notes, base rate checks, or claim verdicts — must carry an explicit confidence rating. Apply the CRAAP framework and triangulation standard from mckinsey-consultant/references/VALIDATION_FRAMEWORKS.md to every source before citing it.

CRAAP Scoring (apply to each source)

Score each dimension 1–5:

Dimension	What to assess
Currency	Published within appropriate timeframe? (digital/ecommerce: ≤2yr; consumer: ≤3yr; fundamentals: ≤10yr)
Relevance	Geographic, sector, and audience match? Direct or requires extrapolation?
Authority	Tier 1 (govt/academic) → Tier 2 (industry assoc/top consulting) → Tier 3 (trade press/company)
Accuracy	Methodology transparent? Claims substantiated? Limitations acknowledged?
Purpose	Neutral/research intent vs. promotional or advocacy bias?

Total score → Confidence tier:

20–25: HIGH — cite directly as primary evidence
15–19: MEDIUM — cite with stated limitation or corroborate with second source
<15: LOW — do not use as primary evidence; flag and seek better source

Triangulation Requirement

For any thesis-critical claim, a single source at HIGH confidence is insufficient. Require:

HIGH confidence: 3+ independent sources converge, or 2 Tier 1 sources align
MEDIUM confidence: 2 sources partially align with stated discrepancy explanation
LOW confidence: Single source, unverified, or sources conflict without resolution

Mandatory Citation Format

Every evidence line in this skill must follow this exact format:

[Confidence: HIGH / MEDIUM / LOW] — [Source name, publication/data year] —
[Specific data point cited] — [CRAAP score: C[x]/R[x]/A[x]/A[x]/P[x] = total] —
[Triangulation: Converged / Partial / Single source] —
[Limitation if any: e.g., "Brazil-specific; extrapolated from national to metro level"]

Shortened inline format (for passing claims and base rate citations where full block isn't warranted):

[Confidence: HIGH — Source + brief methodology note]

Confidence Escalation Rules

Any claim moving from MEDIUM to HIGH requires a named second source — not "consistent with general industry knowledge"
Any claim staying at LOW must be explicitly flagged in the redline output as NEEDS EVIDENCE regardless of whether it passed other tests
A MEDIUM-confidence source used for a thesis-critical claim automatically triggers a NEEDS EVIDENCE flag — the standard is higher for load-bearing assertions

Step 7: Redline Output

Structure the output in this exact order:

Section 1: Logic Tree

Present the reconstructed MECE argument structure with tree-level stress test findings. Flag structural violations before individual claims.

Section 2: Assumption Audit

All unstated assumptions ranked by impact. Top 3 flagged as thesis-critical.

Section 3: Redline by Pillar

Group all claim verdicts under their pillar. Lead with thesis-critical, then supporting, then contextual.

Format per flagged claim:

[VERDICT LABEL] "[Exact quote or close paraphrase of claim]"
-> Test failed: [Which test failed and specifically why]
-> Problem: [One sentence — direct, no hedging]
-> What's needed: [Exact corrective action]
-> Materiality: Thesis-critical / Supporting / Contextual

Format per passing claim:

SUPPORTED "[Claim]"
-> Evidence basis: [What makes this well-supported]

Verdict Labels

Label	Meaning
SUPPORTED	Passes all applicable tests
NEEDS EVIDENCE	Plausible but no cited source or data
NEEDS QUANTIFICATION	Numbers exist and are absent
LOGIC GAP	Conclusion doesn't follow from stated premises
MECE VIOLATION	Structural overlap or gap
CIRCULAR	Evidence is a restatement of the conclusion
CHERRY-PICKED	Selective data — material contradicting data absent
PROJECTION UNSUPPORTED	Forward-looking claim fails projection scrutiny
BELOW BASE RATE	Requires above-historical-norm performance without justification
UNSUPPORTED	Definitive fact-claim, no evidence, high stakes
CONTRADICTS THESIS	If true, weakens the governing thesis — unaddressed tension
DERIVATIVE STALE	Claim was valid when written but a source input has since changed — conclusion must be recalculated

Section 4: Six Screening Questions Assessment (Type A only)

Table: question / status (Answered / Partially Answered / Missing) / gap description / materiality of gap.

Section 5: IC Memo Structure Check (Type A only)

For every investment document, run this structural completeness check after the Six Screening Questions assessment. Flag every missing element as 🔴 CRITICAL — these are not style preferences but required analytical components.

Required structural elements — check each:

IC MEMO STRUCTURE CHECK

[ ] NTB Registry present
    → Location: Executive Summary, before gate table
    → Minimum 5 columns: NTB statement | Findings | Evidence State | CY[year]E EBITDA & MOIC | Boundability
    → Minimum 4 NTBs; each governs a >5% MOIC driver
    → GAP items prefixed with red bold "GAP:" label and Info Gap priority reference
    → Boundability column: B-tag + 3–5 sentence narrative (not tag alone)
    → Footnote present disclosing: hold period, entry equity, exit multiple, MOIC derivation
    → Diligence column correctly ABSENT (lives in information gaps table, not here)
    If missing: 🔴 CRITICAL — IC memo lacks NTB registry

[ ] Gate 2 scored against all 12 canonical criteria
    → Full 12-row scorecard table in Market Analysis section
    → Each row: criterion number | criterion name | PASS/CONDITIONAL/WATCH | evidence assessment
    → Gate table summary cell cross-references Section [N] — does NOT attempt to summarise all 12
    → Gate 2 contains NO valuation, drawdown, or adverse selection content (those belong in Gate 3)
    If missing: 🔴 CRITICAL — Gate 2 verdict unsubstantiated

[ ] Investment thesis structured around NTB points
    → One thesis point per NTB, numbered and labelled with NTB # and evidence state
    → Each point: confirmed evidence → specific gap named → MOIC connection stated
    → DATA GAP callouts for any unresolved primary data item
    → No independent h2 sections for segments/products that are not tied to a governing NTB
    If missing: 🟡 WARNING — thesis not auditable against NTB assumptions

[ ] Returns disaggregation table present
    → 7 columns including CY[year]E EBITDA Impact and MOIC Impact columns
    → MOIC deltas sum to bull-to-bear MOIC spread
    → Every NTB maps to at least one row
    → MOIC derivation formula stated in footnote
    If missing: 🔴 CRITICAL — returns not disaggregated by driver

[ ] NTB ↔ Returns disaggregation linkage complete
    → Every NTB in the registry has a corresponding row in the returns table
    → Every row in the returns table has a corresponding NTB (or is explicitly a secondary driver)
    → MOIC figures are consistent between the two tables
    If inconsistent: 🔴 CRITICAL — DERIVATIVE STALE between NTB and returns tables

[ ] Information gaps table present and linked
    → Every GAP item in NTB registry appears in information gaps table as a named priority
    → Priorities are ranked (CRITICAL / HIGH / MEDIUM)
    → Each row: data request | risk addressed | action and owner
    If unlinked: 🟡 WARNING — GAP items not tracked for resolution

Gate 2 classification check — run separately:

GATE 2 CLASSIFICATION CHECK

Scan Gate 2 section for misclassified content:
  → Valuation discount / entry multiple → should be Gate 3 [🔴 if present in Gate 2]
  → Drawdown / sentiment mismatch → should be Gate 3 [🔴 if present in Gate 2]
  → Adverse selection thesis → should be Gate 6 [🔴 if present in Gate 2]
  → Company-specific moat or position → should be Gate 1 [🟡 if present in Gate 2]

Test: "Could I make the same argument for a private company with no market price?"
  → If NO for any Gate 2 item: it belongs in a different gate

Step 8: Gap Resolution Pass

For every flagged claim, search and produce a concrete improvement note. Thesis-critical first, then supporting.

IMPROVEMENT NOTE — "[Claim]"

Problem: [One sentence — what specifically is wrong]

Fix: [Exactly what the author should write, add, or change]

Evidence found:
  - [Confidence: HIGH/MEDIUM/LOW — Source name, year — Specific data point —
     CRAAP: C[x]/R[x]/A[x]/A[x]/P[x] = total — Triangulation: Converged/Partial/Single source —
     Limitation: (if any)]
  - [Contradicting evidence if found — do not omit, apply same confidence format]

Revised assertion:
  "[Rewritten version the author can adopt directly]"

If no corroborating evidence:
  "No corroborating evidence found. Confidence: LOW. Options: (1) remove or qualify the claim,
   (2) reframe explicitly as a hypothesis, (3) commission primary diligence.
   [Note any contradicting evidence and its implications for the thesis.]"

Step 9: Kill Triggers and Risk Register

Kill Triggers

State the conditions that, if confirmed true, would cause a no-vote. These are not hypothetical risks — they are specific factual questions whose adverse resolution is disqualifying.

KILL TRIGGER [N]
Condition: [The specific finding that would be disqualifying]
Currently: [Known / Unknown / Partially known]
Where to find the answer: [Specific diligence action or data source]

Risk Register

For every material risk identified across all prior steps, rate on two independent dimensions and compute a risk score.

Probability (how likely to materialize):

1 = Very unlikely (<10%)
2 = Unlikely (10–25%)
3 = Possible (25–50%)
4 = Likely (50–75%)
5 = Very likely (>75%)

Magnitude (impact if it does materialize):

1 = Negligible — minor friction, easily managed
2 = Moderate — impairs one supporting pillar, return impact <10%
3 = Material — impairs a major pillar, return impact 10–30%
4 = Severe — impairs the governing thesis, return impact >30%
5 = Disqualifying — thesis fails entirely

Risk Score = Probability × Magnitude (range 1–25)

RISK REGISTER

| # | Risk | Probability | Magnitude | Score | Category |
|---|------|-------------|-----------|-------|----------|
| 1 | [description] | [1-5] | [1-5] | [P×M] | Thesis-critical / Supporting |

Scores >= 15: Near-disqualifying — require resolution before IC
Scores 9–14: Material — require mitigation plan or price adjustment
Scores <= 8: Monitor — acceptable with standard protections

Mitigants

For every risk scoring >= 9:

MITIGANT — Risk [N]: [Risk description]
Risk score: [P x M = score]

Available mitigants:
  - Contractual: [specific rep/warranty, escrow, price adjustment]
  - Operational: [specific post-close action that reduces the risk]
  - Structural: [deal structure element that limits exposure]

Mitigant adequacy: Adequate / Partial / Insufficient
Rationale: [Does the mitigant transfer or reduce the risk, or merely provide
            financial recourse after the fact?]

Residual risk: Probability [adjusted] x Magnitude [adjusted] = [residual score]

Final Probabilistic Verdict (Type A)

FINAL VERDICT

Thesis integrity: [Strong / Adequate / Weak]
[One sentence on overall argument quality]

Top 3 risks by score:
  1. [Risk] — Score [X] — Mitigant: [Adequate / Partial / Insufficient]
  2. [Risk] — Score [X] — Mitigant: [Adequate / Partial / Insufficient]
  3. [Risk] — Score [X] — Mitigant: [Adequate / Partial / Insufficient]

Kill triggers status:
  [Each kill trigger: Resolved / Open / Unresolvable before decision point]

Probability-weighted assessment:
  [Direct statement: High confidence / Moderate confidence / Low confidence /
   Do not proceed — with one sentence of specific rationale grounded in the
   risk scores and mitigant adequacy above]

Minimum conditions before proceeding:
  [Specific list of open items — not "conduct further diligence" but the exact
   questions whose answers are required before a yes-vote is defensible]

Quality Standards

References

Six Screening Questions (for Type A investment documents):

Read: {SKILL_DIR}/references/investment-evaluation-framework.md

CRAAP scoring, triangulation matrix, source tier hierarchy, and confidence escalation rules:

Read: /mnt/skills/user/mckinsey-consultant/references/VALIDATION_FRAMEWORKS.md

claim-scrutinizer

Invocation

Context Preview

Supporting Files

SKILL.md

claim-scrutinizer

Invocation

Context Preview

Supporting Files

SKILL.md

Claim Scrutinizer

Step 1: Detect Document Type

Step 2: Build the Logic Tree

Step 3: Assumption Audit

Step 4: Classify Every Material Claim

Step 4b: Derivative Integrity Check

Step 5: Apply the Seven-Part Test to Every Claim

Test 1 — Evidence Standard

Test 2 — Logic Check

Test 3 — MECE Check

Test 4 — Quantification Check

Test 5 — Circular Reasoning Check

Test 6 — Cherry-Picking Check

Test 7 — Projection Scrutiny

Step 6: Base Rate Check

Step 6b: Source Confidence Rating

CRAAP Scoring (apply to each source)

Triangulation Requirement

Mandatory Citation Format

Confidence Escalation Rules

Step 7: Redline Output

Section 1: Logic Tree

Section 2: Assumption Audit

Section 3: Redline by Pillar

Verdict Labels

Section 4: Six Screening Questions Assessment (Type A only)

Section 5: IC Memo Structure Check (Type A only)

Step 8: Gap Resolution Pass

Step 9: Kill Triggers and Risk Register

Kill Triggers

Risk Register

Mitigants

Final Probabilistic Verdict (Type A)

Quality Standards

References

Similar Skills

Claim Scrutinizer

Step 1: Detect Document Type

Step 2: Build the Logic Tree

Step 3: Assumption Audit

Step 4: Classify Every Material Claim

Step 4b: Derivative Integrity Check

Step 5: Apply the Seven-Part Test to Every Claim

Test 1 — Evidence Standard

Test 2 — Logic Check

Test 3 — MECE Check

Test 4 — Quantification Check

Test 5 — Circular Reasoning Check

Test 6 — Cherry-Picking Check

Test 7 — Projection Scrutiny

Step 6: Base Rate Check

Step 6b: Source Confidence Rating

CRAAP Scoring (apply to each source)

Triangulation Requirement

Mandatory Citation Format

Confidence Escalation Rules

Step 7: Redline Output

Section 1: Logic Tree

Section 2: Assumption Audit

Section 3: Redline by Pillar

Verdict Labels

Section 4: Six Screening Questions Assessment (Type A only)

Section 5: IC Memo Structure Check (Type A only)

Step 8: Gap Resolution Pass

Step 9: Kill Triggers and Risk Register

Kill Triggers

Risk Register

Mitigants

Final Probabilistic Verdict (Type A)

Quality Standards