Skill

auditing-experiments-flags

From posthog

Audit PostHog experiments and feature flags for configuration issues, staleness, and best-practice violations.

testing

code-quality

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/posthog:auditing-experiments-flags

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

This skill teaches you how to run configuration audits on experiments and feature flags.

Supporting Files

references/experiment-checks.md
references/finding-taxonomy.md
references/flag-checks.md
references/remediation-actions.md
references/synthesis-patterns.md

SKILL.md

81 lines · ~934 tokens

Stats

Language: Python

Stars: 54

Forks: 8

Maintenance: Excellent

Last Commit: Jun 17, 2026

Auditing experiments and feature flags

This skill teaches you how to run configuration audits on experiments and feature flags. All checks use the experiment and feature flag read tools (experiment-get, experiment-list, feature-flag-get-definition, feature-flag-get-all) — no SQL queries are needed for Phase 1 checks.

Usage modes

Quick check (single entity)

When the user asks about a specific experiment or flag:

1. Fetch the entity via experiment-get (experiment ID) or feature-flag-get-definition (numeric flag ID).
2. Apply the relevant checks from experiment checks or flag checks.
3. Report findings inline as markdown, grouped by severity (CRITICAL first, then WARNING, then INFO).
4. Include entity links as [Experiment: name](/experiments/id) or [Flag: key](/feature_flags/id).

Scoped audit (one domain)

When the user asks to audit all experiments or all flags:

1. Bulk-fetch via experiment-list or feature-flag-get-all.
2. Run all checks for that domain against each entity.
3. Group findings by severity, then by entity.
4. Report as inline markdown.
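
The grouping step above (severity first, then entity) can be sketched as follows. This is a minimal illustration, not part of the skill: the `Finding` dataclass, its field names, and `group_findings` are hypothetical; only the three severity levels and their order come from this document.

```python
from collections import defaultdict
from dataclasses import dataclass

# Severity order as stated in this skill: CRITICAL first, then WARNING, then INFO.
SEVERITY_ORDER = ["CRITICAL", "WARNING", "INFO"]


@dataclass(frozen=True)
class Finding:
    """Hypothetical record for one audit finding (field names are assumptions)."""
    severity: str
    check: str
    entity: str
    message: str


def group_findings(findings):
    """Group findings by severity (in SEVERITY_ORDER), then by entity."""
    grouped = {severity: defaultdict(list) for severity in SEVERITY_ORDER}
    for finding in findings:
        grouped[finding.severity][finding.entity].append(finding)
    # Drop severities with no findings and return plain dicts.
    return {sev: dict(by_entity) for sev, by_entity in grouped.items() if by_entity}
```

Because the outer dict is built from `SEVERITY_ORDER`, iterating over the result yields CRITICAL findings before WARNING and INFO regardless of the order in which findings were produced.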

Full audit (comprehensive)

When the user asks for a comprehensive audit of both experiments and flags:

1. Fetch all experiments via experiment-list and all flags via feature-flag-get-all.
2. Run all experiment checks and all flag checks.
3. Apply the recurring patterns reference to identify themes that span multiple findings.
4. If more than 5 entities have findings, output the report as a notebook artifact via notebooks-create for easier navigation. Otherwise, report inline.
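
The notebook-versus-inline rule can be expressed as a small decision function. This is a sketch under assumptions: `choose_output` and the shape of `findings_by_entity` (a mapping from entity identifier to a list of findings) are hypothetical; only the threshold of 5 comes from the text above.

```python
# Threshold from this skill: strictly more than 5 entities with findings.
NOTEBOOK_THRESHOLD = 5


def choose_output(findings_by_entity):
    """Return "notebook" or "inline" based on how many entities have findings.

    `findings_by_entity` is an assumed shape: entity identifier -> list of findings.
    Entities with an empty list are not counted.
    """
    entities_with_findings = sum(1 for f in findings_by_entity.values() if f)
    return "notebook" if entities_with_findings > NOTEBOOK_THRESHOLD else "inline"
```

Note that the comparison is strict, so exactly 5 entities with findings still produces an inline report.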

Output format

For each finding, include:

Severity badge: 🔴 CRITICAL, 🟡 WARNING, or 🔵 INFO
Check name: Which check produced this finding
Entity link: Markdown link to the entity
What's wrong: One-sentence description
Action: What to do about it (see remediation actions)

Example:

🟡 WARNING — Flag integration · Experiment: checkout-redesign
The linked feature flag is inactive (paused). Traffic is not being split.
Action: Re-enable the flag or end the experiment.
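
As an illustration of the output format, the sketch below renders one finding as markdown. The helper names `entity_link` and `render_finding` and the entity ID used in the usage note are hypothetical; the badges, link patterns, and field order follow the description above.

```python
# Severity badges as listed in this skill.
BADGES = {"CRITICAL": "🔴", "WARNING": "🟡", "INFO": "🔵"}

# URL path segment per entity kind, following the link patterns in this skill.
LINK_PATHS = {"Experiment": "experiments", "Flag": "feature_flags"}


def entity_link(kind, label, entity_id):
    """Build a markdown link such as [Flag: key](/feature_flags/id)."""
    return f"[{kind}: {label}](/{LINK_PATHS[kind]}/{entity_id})"


def render_finding(severity, check, link, problem, action):
    """Render one finding: header line, one-sentence problem, then the action."""
    # \u2014 is the dash and \u00b7 the middle dot used in the example above.
    header = f"{BADGES[severity]} {severity} \u2014 {check} \u00b7 {link}"
    return f"{header}\n{problem}\nAction: {action}"
```

For instance, `entity_link("Experiment", "checkout-redesign", 123)` yields `[Experiment: checkout-redesign](/experiments/123)`, where 123 is a made-up ID for illustration.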

Handling unavailable data

Some checks require activity logs (feature-flags-activity-retrieve for flags), which may not be available in every session. If activity log data is unavailable:

Skip checkActivityHistory (experiment check) entirely.
Skip the "toggle instability" and "never activated" sub-checks in flag lifecycle checks.
In your report, note which checks were skipped and why:

Skipped: Activity history checks (activity logs not available via current tools)
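
One way to keep skipped checks visible is to split the check list up front and carry the skipped names into the report. This is a sketch, not the skill's implementation: `plan_checks`, `skipped_note`, and the idea of identifying checks by plain strings are assumptions; the three activity-dependent names and the note wording come from the text above.

```python
# Checks and sub-checks that this skill says depend on activity logs.
ACTIVITY_DEPENDENT = {"checkActivityHistory", "toggle instability", "never activated"}


def plan_checks(all_checks, activity_logs_available):
    """Split check names into (to_run, skipped), preserving input order."""
    if activity_logs_available:
        return list(all_checks), []
    to_run = [c for c in all_checks if c not in ACTIVITY_DEPENDENT]
    skipped = [c for c in all_checks if c in ACTIVITY_DEPENDENT]
    return to_run, skipped


def skipped_note(skipped):
    """Return the report note for skipped checks, or an empty string if none."""
    if not skipped:
        return ""
    return "Skipped: Activity history checks (activity logs not available via current tools)"
```

Returning the skipped names alongside the runnable ones makes it harder to drop a check silently, since the caller has to decide what to do with the second list.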

Partial failures

If a fetch call fails for some entities:

Continue with the entities you could fetch.
Report which entities could not be assessed and why.
Do not silently omit entities from the audit.
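
The partial-failure rules can be sketched as a fetch loop that records failures instead of aborting. The names `fetch_entities` and `unassessed_report` are hypothetical, and `fetch_one` is a stand-in callable for whichever read tool is in use (for example experiment-get); the real tools are not Python functions.

```python
def fetch_entities(entity_ids, fetch_one):
    """Fetch each entity, continuing past failures.

    `fetch_one` is an assumed callable standing in for a read tool call.
    Returns (fetched, failed): fetched maps id -> entity, failed maps id -> reason.
    """
    fetched, failed = {}, {}
    for entity_id in entity_ids:
        try:
            fetched[entity_id] = fetch_one(entity_id)
        except Exception as exc:  # record the reason rather than stop the audit
            failed[entity_id] = str(exc)
    return fetched, failed


def unassessed_report(failed):
    """List every entity that could not be assessed, with the reason."""
    return [f"Could not assess {entity_id}: {reason}" for entity_id, reason in failed.items()]
```

The audit then runs its checks over `fetched` and appends the lines from `unassessed_report(failed)` to the report, so no entity disappears without explanation.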

Reference files

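Experiment checks (references/experiment-checks.md) — experiment configuration checks
Flag checks (references/flag-checks.md) — feature flag checks
Finding types (references/finding-taxonomy.md) — severity and category definitions
Recurring patterns (references/synthesis-patterns.md) — patterns across multiple findings
Remediation actions (references/remediation-actions.md) — what to do about each finding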
