Skill

Marketing Mix Modeling & Attribution

Use when the user mentions attribution, ROAS optimization, channel contribution, marketing mix model, MMM, media mix, budget allocation, budget optimization, incrementality, adstock, saturation curves, diminishing returns, channel effectiveness, media effectiveness, cross-channel attribution, multi-touch attribution, MTA, Shapley value attribution, or marketing ROI measurement. Also trigger when user asks 'which channel is driving results' or 'where should we spend more.' If campaign spend data is not yet extracted, suggest running data-extraction first. Results feed into reporting and paid-media skills.

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/marketing-analytics:attribution-analysis

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

**Skill ID:** attribution-analysis

Supporting Files

references/incrementality_calibration.mdreferences/mmm_methodology.mdreferences/pymc_marketing_api.mdscripts/_lightweight_mmm.pyscripts/compute_contributions.pyscripts/fit_mmm.pyscripts/optimize_budget.pyscripts/validate_model.py

SKILL.md

278 lines · ~3.6k tokens

Stats

LanguagePython

Parent stars0

MaintenanceGood

Last CommitMar 20, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

Marketing Mix Modeling & Attribution

Skill ID: attribution-analysis Priority: P0 — Foundational (highest strategic value) Category: Measurement & Attribution Depends On: data-extraction, paid-media Feeds Into: reporting, paid-media (budget optimization), experimentation (calibration)

Objective

You will automate the end-to-end marketing mix modeling pipeline: data ingestion from ad platforms, adstock and saturation curve fitting using Bayesian methods, posterior predictive validation, channel-level contribution decomposition, and budget optimization with scenario planning. Support calibration with incrementality lift test results. Produce executive-ready attribution reports with confidence intervals and actionable budget reallocation recommendations.

Process Steps

Follow these steps in order. Each step must complete successfully before proceeding to the next. If a step fails, diagnose the issue and retry before moving on.

Step 1: Data Validation and Preparation

Check that required input files exist in workspace/raw/:
- campaign_spend_{platform}.csv — Daily or weekly spend by channel
- conversions.csv — Outcome variable (leads, revenue, signups) at matching grain
- external_factors.csv — Optional: seasonality indices, competitor spend, macro indicators
If spend data is missing, instruct the user to run the data-extraction skill first.
Run scripts/fit_mmm.py --validate to check data quality:
- Verify date alignment across all input files
- Detect and impute missing data windows (flag confidence adjustments)
- Normalize spend, impressions, and conversion data into a unified grain
Default to weekly data grain. Only use daily grain when the dataset spans 2+ years.
Generate control variables automatically:
- Seasonality via Fourier terms (yearly and quarterly harmonics)
- Holiday indicators for the relevant market
- Macroeconomic indicators if provided in external_factors.csv

Step 2: Prior Specification

Check for workspace/analysis/incrementality_results.json from the experimentation skill.
If lift test results exist, use scripts/fit_mmm.py --calibrate to translate them into informative priors. See references/incrementality_calibration.md.
If no lift test data exists, use weakly informative priors based on:
- Industry benchmarks for the client's vertical
- Expert knowledge provided by the user
- Default half-normal or log-normal priors per references/mmm_methodology.md
Always run a prior sensitivity analysis: fit with both vague and informative priors, then compare posteriors to assess prior influence.

Step 3: Model Fitting

Execute scripts/fit_mmm.py to fit the Bayesian MMM:
- Use PyMC-Marketing's MMM class as the primary framework
- Sample with NUTS (No-U-Turn Sampler) via MCMC
- Estimate adstock (carry-over) parameters per channel: geometric or Weibull decay
- Estimate saturation (diminishing returns) parameters per channel: Hill function
Store all MCMC trace data for reproducibility using ArviZ InferenceData format.
If MCMC fails or is impractical, fall back to lightweight ridge regression (Robyn-style) and document the methodological tradeoff.

Step 4: Model Validation

Execute scripts/validate_model.py to assess model quality:
- Posterior predictive checks: 90%+ of observed data within 90% credible interval
- R-hat < 1.05 for all parameters
- Effective sample size > 400 for all parameters
- WAIC and LOO-CV for model comparison if multiple specifications are tested
If diagnostics fail, adjust the model (reparameterize, increase samples, tighten priors) and refit. Do not proceed with a poorly converged model.

Step 5: Contribution Decomposition

Execute scripts/compute_contributions.py to decompose the outcome into channel-level contributions:
- Compute posterior mean and credible intervals per channel
- Verify contributions sum to total observed outcome within 2% tolerance
- Identify the base (intercept + controls) vs. media-driven components
Write results to workspace/analysis/mmm_channel_contributions.json.

Step 6: Budget Optimization

Execute scripts/optimize_budget.py to find optimal budget allocation:
- Maximize total conversions/revenue subject to total budget constraint
- Respect per-channel minimum and maximum spend constraints if provided
- Propagate posterior uncertainty into optimization (use posterior samples)
Generate scenario analyses as requested by the user:
- "What if we shift X% from display to search?"
- "What if total budget increases/decreases by Y%?"
Produce marginal ROAS curves showing next-dollar efficiency per channel.
Flag channels approaching saturation ceilings.
Write results to workspace/analysis/mmm_budget_optimization.json.

Step 7: Reporting

Generate workspace/analysis/mmm_diagnostics.json with MCMC convergence metrics.
Generate workspace/reports/mmm_executive_summary.html containing:
- Channel contribution waterfall chart with uncertainty bands
- Budget reallocation recommendation table with expected lift estimates
- Model diagnostics: trace plots, R-hat, effective sample size, posterior predictive coverage
- Marginal ROAS curves per channel
All visualizations must include confidence/credible intervals. Never present point estimates without uncertainty.

Key Capabilities

Data Preparation

Normalize spend, impressions, and conversion data across platforms into a unified grain
Automatically detect and impute missing data windows with flagged confidence adjustments
Generate control variables: seasonality (Fourier), holidays, macroeconomic indicators, competitor activity

Model Fitting

Fit Bayesian MMM with MCMC sampling (NUTS) via PyMC-Marketing's MMM class
Estimate adstock (carry-over) and saturation (diminishing returns) parameters per channel
Compute posterior channel contributions with credible intervals
Run posterior predictive checks and WAIC/LOO model comparison

Budget Optimization

Solve constrained optimization: maximize total conversions/revenue subject to budget constraints
Generate scenario analyses: "what if we shift X% from display to search?"
Produce marginal ROAS curves showing next-dollar efficiency per channel
Account for saturation: flag channels approaching diminishing returns ceiling

Multi-Touch Attribution (Supplementary)

Shapley value attribution for fair credit allocation across touchpoints
Markov chain models for path-based attribution
Data-driven models when user-level journey data is available

Reporting

Channel contribution waterfall charts with uncertainty bands
Budget reallocation recommendation tables with expected lift estimates
Model diagnostics dashboard: trace plots, R-hat, ESS, posterior predictive coverage

Input / Output Data Contracts

Inputs

File	Description	Required
`workspace/raw/campaign_spend_{platform}.csv`	Daily/weekly spend by channel from data-extraction skill. Columns: `date`, `channel`, `spend`, `impressions`, `clicks`.	Yes
`workspace/raw/conversions.csv`	Outcome variable at matching grain. Columns: `date`, `conversions`, `revenue`.	Yes
`workspace/raw/external_factors.csv`	Seasonality indices, competitor spend, macro indicators. Columns: `date`, `factor_name`, `value`.	No
`workspace/analysis/incrementality_results.json`	Calibration priors from experimentation skill. Schema: `{"channel": str, "lift": float, "ci_lower": float, "ci_upper": float}`.	No

Outputs

File	Description
`workspace/analysis/mmm_channel_contributions.json`	Posterior mean and credible intervals per channel
`workspace/analysis/mmm_budget_optimization.json`	Optimal allocation under current and scenario budgets
`workspace/analysis/mmm_diagnostics.json`	MCMC convergence metrics, model fit statistics
`workspace/reports/mmm_executive_summary.html`	Interactive report with charts and recommendations

Cross-Skill Integration

This skill is the strategic hub of the marketing analytics portfolio:

data-extraction (upstream): Provides normalized campaign spend data. If spend files are missing, direct the user to run data-extraction first.
paid-media (bidirectional): Consumes spend metrics as input; budget optimization outputs directly inform paid-media's budget pacing and bid strategies.
experimentation (bidirectional): Incrementality test results from experimentation serve as calibration priors for the MMM. In return, MMM results identify channels where incrementality is uncertain, guiding future experiment design. This creates a virtuous cycle: experiments validate the model, the model guides budget allocation, and budget changes create new natural experiments.
reporting (downstream): All MMM outputs (contributions, optimization, diagnostics) feed into executive dashboards via the reporting skill.
compliance-review (downstream, financial services): In financial services mode, all reports must pass through compliance-review before distribution.

When invoking cross-skill workflows:

Always check for upstream data availability before starting analysis.
Write outputs in the canonical JSON schemas so downstream skills can parse them.
Include metadata (timestamp, model version, data range) in all output files.

Financial Services Considerations

When operating in a financial services context, apply these additional requirements:

Confidence intervals required: All performance claims derived from MMM must include credible intervals and methodology disclaimers. Never present bare point estimates.
SEC Marketing Rule compliance: Past performance language must comply with the SEC Marketing Rule. Before distributing any attribution report externally, invoke the compliance-review skill to validate language and disclosures.
Net-of-fees adjustment: Attribution to specific fund products requires net-of-fees performance calculation and benchmark comparison. Always ask whether results should be presented gross or net of fees.
Long sales cycles: Financial services sales cycles can span 12-36 months (especially institutional AUM acquisition). Account for this by:
- Using longer adstock decay windows
- Considering lagged conversion models
- Separating lead-generation attribution from AUM-conversion attribution
Audit trail: Maintain complete provenance of all model inputs, parameters, and outputs for regulatory audit purposes.

Development Guidelines

Follow these guidelines strictly when implementing and running MMM analyses:

PyMC-Marketing first: Use PyMC-Marketing as the primary framework. Include fallback to lightweight Robyn-style ridge regression only when MCMC infrastructure is unavailable or the user explicitly requests it.
Deterministic computation: All statistical computations must run in the Python scripts under scripts/. Never estimate statistical quantities, posterior distributions, p-values, or model parameters within the LLM. The LLM interprets results; scripts compute them.
Weekly default grain: Default to weekly data aggregation. Only use daily grain when the dataset contains 2+ years of daily history.
Prior sensitivity: Always run prior sensitivity analysis. Fit with both vague and informative priors, compare posteriors, and report the degree of prior influence.
Reproducibility: Store all MCMC trace data in ArviZ InferenceData format. Record random seeds, sampler settings, and PyMC-Marketing version in diagnostics output.
Progressive disclosure: This skill loads progressively. Metadata (name, description) loads at startup (~100 tokens). Full SKILL.md instructions load on activation (~5,000 tokens). Reference files and scripts load on-demand during execution.
Uncertainty everywhere: Never present a result without its uncertainty. Every channel contribution, ROAS estimate, and budget recommendation must include credible intervals or posterior distributions.

Acceptance Criteria

A completed MMM analysis must satisfy all of the following:

R-hat < 1.05 for all estimated parameters
Effective sample size > 400 for all estimated parameters
Posterior predictive check covers 90%+ of observed data within 90% credible interval
Budget optimizer produces feasible allocations that sum to the specified total budget
Scenario analysis correctly propagates uncertainty from posterior to predictions
Channel contribution decomposition sums to total observed outcome within 2% tolerance
Full pipeline executes end-to-end from raw data to executive report in a single session

Reference Files

references/mmm_methodology.md — Bayesian MMM theory, adstock/saturation math, prior guidance
references/pymc_marketing_api.md — PyMC-Marketing MMM class reference and usage patterns
references/incrementality_calibration.md — Translating lift test results into priors

Scripts

scripts/fit_mmm.py — Core MMM fitting with PyMC-Marketing (MCMC sampling, diagnostics)
scripts/optimize_budget.py — Constrained optimization using posterior samples
scripts/compute_contributions.py — Channel-level contribution decomposition
scripts/validate_model.py — Posterior predictive checks, WAIC, LOO-CV computation

Marketing Mix Modeling & Attribution

Invocation

Context Preview

Supporting Files

SKILL.md

Marketing Mix Modeling & Attribution

Invocation

Context Preview

Supporting Files

SKILL.md

Marketing Mix Modeling & Attribution

Objective

Process Steps

Step 1: Data Validation and Preparation

Step 2: Prior Specification

Step 3: Model Fitting

Step 4: Model Validation

Step 5: Contribution Decomposition

Step 6: Budget Optimization

Step 7: Reporting

Key Capabilities

Data Preparation

Model Fitting

Budget Optimization

Multi-Touch Attribution (Supplementary)

Reporting

Input / Output Data Contracts

Inputs

Outputs

Cross-Skill Integration

Financial Services Considerations

Development Guidelines

Acceptance Criteria

Reference Files

Scripts

Similar Skills

Marketing Mix Modeling & Attribution

Objective

Process Steps

Step 1: Data Validation and Preparation

Step 2: Prior Specification

Step 3: Model Fitting

Step 4: Model Validation

Step 5: Contribution Decomposition

Step 6: Budget Optimization

Step 7: Reporting

Key Capabilities

Data Preparation

Model Fitting

Budget Optimization

Multi-Touch Attribution (Supplementary)

Reporting

Input / Output Data Contracts

Inputs

Outputs

Cross-Skill Integration

Financial Services Considerations

Development Guidelines

Acceptance Criteria

Reference Files

Scripts

Similar Skills