Skill

consolidate-research

Consolidate and synthesize research outputs from multiple AI models or sources into a unified, pattern-aware, provenance-enriched report with quality metrics. Use when the user has research outputs to consolidate, wants to synthesize multiple reports, asks to "consolidate", "synthesize", or "merge" research findings, or needs to reconcile conflicting information from different sources. Works with outputs from Claude, Gemini, GPT-5.2, or any combination of AI/human sources. Supports manifest-driven (from create-research-brief) and standalone operation.

Popularity

Parent stars

Parent forks

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/research-tools:consolidate-research

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Synthesize research outputs from multiple sources into a unified, confidence-tiered, pattern-aware report with provenance tracking and quality metrics. Works standalone or manifest-driven from `create-research-brief`.

Supporting Files

references/consolidation-manifest-schema.mdreferences/output-templates.mdreferences/pattern-registry.mdreferences/reconciliation.mdreferences/system-capabilities.md

SKILL.md

411 lines · ~5.7k tokens(exceeds 5k compaction limit)

Stats

LanguagePython

Parent stars2

Parent forks1

MaintenanceGood

Last CommitMar 4, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

Signal	Likely Source	Strength
Deep reasoning chains, cross-domain analogies, self-corrections, hedged nuance	Claude Opus 4.6	primary_researcher
Structured tables with dense citations, systematic catalogs, source appendix	Gemini 3.1 Pro Deep Research	structured_cataloger
Site-specific citations, temporal progression ("as of [date]"), intervention markers	GPT-5.2 Deep Research	targeted_investigator
Quick facts, recent dates, sentiment language, short-form responses	GPT-5.2 Chat	recency_validator
Proprietary data, specific methodologies, organizational context	Human analyst	domain_expert

Source	Extract	Tag
Claude Opus 4.6	Reasoning chains (analytical steps, not just conclusions)	`type: reasoning_chain`
	Conclusions linked to their supporting chains	`type: factual/causal`
	Web search findings with source URLs	`channel: web_search`
	Cross-domain insights and analogies	`type: cross_domain_synthesis`
	Self-review corrections (higher-confidence refinements)	`type: factual`
Gemini 3.1 Pro	Tables — preserve structure, do not flatten	`type: structural_artifact`
	Prose claims with inline citation mapping	`type: factual/causal`
	Source appendix — map citations to claims, compute quality scores	provenance metadata
	Comparison matrices with per-cell provenance	`type: structural_artifact`
	file_search-grounded claims	`channel: internal_document`
GPT-5.2 Deep	Assertions with temporal markers ("as of [date]")	`provenance.temporal_marker`
	Site-specific findings with source domain	`channel: site_restricted`
	Intervention-adjusted findings (higher targeted confidence)	boosted weight
	Timeline/progression narratives	`type: structural_artifact`
GPT-5.2 Chat	Quick facts with recency dates	`channel: quick_validation`
	Sentiment signals	`type: recommendation`
	All Chat claims default to lower provenance weight	validation role

Score	Source Type	Examples
5	Primary	SEC filings, peer-reviewed, official stats, vendor docs
4	High-quality secondary	Gartner, Forrester, named-source journalism
3	General secondary	News, press releases, industry publications
2	Tertiary	Wikipedia, blogs, aggregators
1	Unsourced assertion	No citation trail
0	Unverifiable	Contradicts known facts or cites non-existent sources

Type	Handling
Factual assertions	Cross-validate; trace to primary sources
Causal claims	Map reasoning chains; note mechanism divergence
Quantitative data	Flag discrepancies >10%; verify primary source agreement
Recommendations	Tag as interpretation; link to supporting facts
Unique insights	Preserve with provenance flag; do not discard
Reasoning chains	Preserve structure; validate logical steps; compare across sources
Structural artifacts	Preserve format (tables, matrices, timelines); merge in Step 5

Step	Test	Resolution
1	Coexist? Different scope/timeframe/definition?	Preserve both with clarifying context
2	Provenance? Higher channel? (`site_restricted` > `web_search` > `quick_validation`; `internal_document` > `web_search` for proprietary)	Favor higher provenance; note alternative
3	Citation dedup? Same primary source(s)?	Treat as SINGLE-SOURCE (Tier 2 cap), not independent
4	Specificity? One more specific or better sourced?	Favor specificity; note alternative
5	Majority? Most sources agree?	Lead with majority; preserve dissent
6	All diverge	Flag "unresolved." Agentic mode: web search. Else: present all views

Tier	Threshold	Criteria
1 (High)	>75%	Cross-model from DIFFERENT primaries + avg quality ≥4 + falsifiable
2 (Moderate)	50-75%	Same-primary cross-model (cap) OR single quality ≥4 OR majority w/ avg 3
3 (Low)	<50%	Single-source quality <4 OR contested OR unsourced

Check	Question
Citation diversity	Different primary sources, or all citing same 2-3?
Specificity test	Falsifiable claim, or vague enough to be unfalsifiable?
Recency check	Could this have changed? Source publication dates?
Contrarian search	Credible dissent? What would skeptics say?
Mechanism check	Same mechanism = shared bias risk. Different mechanisms converging = higher confidence.

#	Prompt	Seeks
1	"Which findings, when combined, imply something neither source stated?"	Emergent insights
2	"Which constraints interact with findings from another domain?"	Constraint interactions
3	"Which consensus views look different through an unrelated domain's lens?"	Frame-breaking
4	"What would need to be true for the consensus to be wrong?"	Contrarian check

Artifact	Merge Strategy
Tables	Union rows/columns; per-cell provenance; highlight conflicts
Timelines	Interleave chronologically; flag disputed dates with both versions
Decision trees	Merge branches; note divergent recommendations at same decision point
Matrices	Union dimensions; per-cell provenance; highlight scoring disagreements

#	Section	Content
1	Executive Summary	2-3 paragraphs: key findings, implications, overall confidence
2	Tier 1 Findings	High-confidence claims: claim + support + implication
3	Tier 2 Findings	Moderate-confidence: claim + support + caveat
4	Contested Areas	Per-source views + assessment + resolution path
5	Coverage Gaps	Expected vs actual coverage table with recommended actions
6	Unique Insights	Per-source single-source findings with provenance
7	Cross-Domain Synthesis	Emergent insights, constraint interactions, frame-breaking observations
8	For Downstream	Pattern-specific actionable outputs
9	Quality Metrics	Computed metrics block
10	Freshness Model	Staleness detection + refresh recommendations
11	Methodology Notes	Mode, sources, conflicts resolved, chain context, limitations
12	Self-Review Results	7-check outcomes summary

Pattern	Additional Sections
`landscape_mapping`	Taxonomy + Player Inventory per category, White Space Map
`comparative_evaluation`	Weighted Decision Matrix (options x criteria), Sensitivity Analysis, Recommendation with flip conditions
`implementation_pattern`	Architecture Decision Catalog, Pattern Catalog by phase, Anti-Pattern Register
`best_practices`	8-Dimension Knowledge Base sections, Quick Reference Card
`competitive_intelligence`	Per-Competitor Strategic Profile, Competitive Dynamics Analysis
`market_research`	TAM/SAM/SOM with ranges, Segmentation Framework, Demand Drivers/Inhibitors
`user_research`	Persona Cards, JTBD Map (functional/emotional/social), Unmet Needs Hierarchy
`economic_analysis`	Financial Model (cost + value + ROI), Sensitivity Table, Benchmark Comparison
`compliance_requirements`	Requirements Register, Constraint Map, Governance Recommendations

Pattern	Downstream Content	Primary Target
`landscape_mapping`	Shortlist criteria + recommended evaluation set	`comparative_evaluation`
`comparative_evaluation`	Decision recommendation + selection rationale + ADR draft	Foundry / `implementation_pattern`
`implementation_pattern`	Architecture decision log + implementation checklist	Foundry (requirements)
`best_practices`	Skill-ready KB structure + Quick Reference Card	Claude Code skills
`competitive_intelligence`	Positioning strategy + differentiation matrix	Ignite (GTM)
`market_research`	Segment prioritization + entry strategy recommendation	Spark (ideation)
`user_research`	Persona cards + JTBD map + ranked unmet needs	Foundry + Spark
`economic_analysis`	Financial model summary + key assumptions + sensitivity ranges	Vantage
`compliance_requirements`	Requirements register + constraint map	Foundry

Metric	Formula	Meaning
`coverage_ratio`	claims addressed / total claims	Input coverage completeness
`conflict_resolution_rate`	resolved / identified	Disagreement handling
`provenance_depth`	% Tier 1 with multi-source from DIFFERENT primaries	Independent corroboration
`actionability_score`	% findings with downstream action	Output usefulness
`staleness_risk`	% claims with sources older than volatility threshold	Temporal reliability
`dependency_chain_integrity`	% recommendations with validated chains	Logical soundness
`cross_domain_synthesis_yield`	count of cross-domain insights	Synthesis value-add

#	Check	Procedure	If Failed
1	Contradiction scan	Do claims in one section contradict another?	Reconcile or flag explicitly.
2	Confidence tier audit	Did any claim's tier shift during writing?	Update tier + propagate dependencies.
3	Dependency chain validation	Are all recommendation → fact chains consistent?	Flag broken chains; downgrade recommendations.
4	Coverage check	Compare actual coverage vs. manifest coverage_matrix (or inferred).	Document gaps in Coverage Gaps section.
5	Cross-domain synthesis check	Was the mandatory synthesis pass executed? Insights generated?	If skipped, execute now. If zero yield, document.
6	Quality metrics validation	Are metrics internally consistent? (e.g., denominator matches actual count)	Recompute.
7	Freshness check	Any source dates older than topic volatility threshold?	Flag in staleness_risk metric + Freshness Model.

Mode	Best For	Leads With	Key Differentiator
Standard	General synthesis	Convergent findings	Balanced tiering
Adversarial	High-stakes decisions	Stress-tested claims	False confidence audit on ALL Tier 1
Gap-Driven	Coverage completeness	Missing areas	Coverage matrix as primary frame
Confidence-Weighted	Executive / financial decisions	Evidence quality	Citation quality drives tiers
Depth-First	Strategic / analytical topics	Deepest reasoning	Reasoning chains featured
Breadth-First	Landscape / domain mapping	Comprehensive coverage	Completeness over depth
Agentic	Minimal-intervention consolidation	Full autonomous pipeline	Web search for gap-filling + verification

Pattern	Default Mode	Override Trigger
`landscape_mapping`	breadth_first	"deep-dive on key players" → depth_first
`comparative_evaluation`	confidence_weighted	"quick comparison" → standard
`implementation_pattern`	depth_first	"comprehensive catalog" → breadth_first
`best_practices`	gap_driven	"focus on anti-patterns only" → depth_first
`competitive_intelligence`	adversarial	"just the facts" → standard
`market_research`	standard	"high-stakes investment" → confidence_weighted
`user_research`	depth_first	"broad needs survey" → breadth_first
`economic_analysis`	confidence_weighted	"rough estimate" → standard
`compliance_requirements`	gap_driven	"highest-risk areas" → depth_first

Chain Link	Validation
Landscape → Comparative	Do all evaluated options appear in upstream landscape? Flag new ones.
Comparative → Implementation	Does selected option match upstream recommendation? If different, document why.
Market → Landscape	Are market segments consistent with landscape scope? Flag scope drift.
Market → Competitive	Are market structure assumptions consistent? Flag divergence.
Compliance → Implementation	Do all patterns satisfy upstream constraint map? Flag violations.
User Research → Comparative	Do evaluation criteria map to identified user needs? Flag orphan criteria.

Pattern	Default Mode	Override Trigger
`landscape_mapping`	breadth_first	"deep-dive on key players" → depth_first
`comparative_evaluation`	confidence_weighted	"quick comparison" → standard
`implementation_pattern`	depth_first	"comprehensive catalog" → breadth_first
`best_practices`	gap_driven	"focus on anti-patterns only" → depth_first
`competitive_intelligence`	adversarial	"just the facts" → standard
`market_research`	standard	"high-stakes investment" → confidence_weighted
`user_research`	depth_first	"broad needs survey" → breadth_first
`economic_analysis`	confidence_weighted	"rough estimate" → standard
`compliance_requirements`	gap_driven	"highest-risk areas" → depth_first

Chain Link	Validation
Landscape → Comparative	Do all evaluated options appear in upstream landscape? Flag new ones.
Comparative → Implementation	Does selected option match upstream recommendation? If different, document why.
Market → Landscape	Are market segments consistent with landscape scope? Flag scope drift.
Market → Competitive	Are market structure assumptions consistent? Flag divergence.
Compliance → Implementation	Do all patterns satisfy upstream constraint map? Flag violations.
User Research → Comparative	Do evaluation criteria map to identified user needs? Flag orphan criteria.

consolidate-research

Popularity

Invocation

Context Preview

Supporting Files

SKILL.md

consolidate-research

Popularity

Invocation

Context Preview

Supporting Files

SKILL.md

Consolidate Research

Workflow Overview

Step 1: Receive & Validate Inputs

Manifest Detection

Source Identification Heuristics

Validation

Step 2: Normalize Inputs

Extraction Patterns by Source

Normalized Claim Format

Citation Quality Scale

Step 3: Triage Claims

Claim Types

Claims Matrix

Claim Dependency Graphs

Step 4: Reconcile

Provenance-Weighted Disagreement Protocol

Confidence Tier Assignment

Dependency Propagation

False Confidence Audit

Step 5: Synthesize

Cross-Domain Synthesis Pass (4 Required Prompts)

Structural Artifact Merging

Step 6: Generate Output

Template Selection

Universal Sections (All Patterns)

Pattern-Specific Sections

"For Downstream" Actionability

Quality Metrics

Freshness Model

Step 7: Self-Review (Mandatory)

Consolidation Modes (7)

Mode Quick Reference

Mode Selection Logic

Pattern x Mode Defaults

Mode Procedures

Research Chain Protocol

Chain-Specific Validations

Constraint Propagation Rules

Integration Notes

Similar Skills

Consolidate Research

Workflow Overview

Step 1: Receive & Validate Inputs

Manifest Detection

Source Identification Heuristics

Validation

Step 2: Normalize Inputs

Extraction Patterns by Source

Normalized Claim Format

Citation Quality Scale

Step 3: Triage Claims

Claim Types

Claims Matrix

Claim Dependency Graphs

Step 4: Reconcile

Provenance-Weighted Disagreement Protocol

Confidence Tier Assignment

Dependency Propagation

False Confidence Audit

Step 5: Synthesize

Cross-Domain Synthesis Pass (4 Required Prompts)

Structural Artifact Merging

Step 6: Generate Output

Template Selection

Universal Sections (All Patterns)

Pattern-Specific Sections

"For Downstream" Actionability

Quality Metrics