Skill

amplitude-experiment-test

Wraps Amplitude Experiment SDK testing patterns: client initialization with API key (or local-flags JSON), the fetch / variant API, exposure-event suppression in tests, and assignment-integrity tests. Use when writing tests for code that uses Amplitude Experiment for A/B testing or flag management. Composes guardrail-metrics-reference + peeking-problem-reference + ab-test-validity-checklist.

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/qa-experimentation:amplitude-experiment-test

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Per

SKILL.md

179 lines · ~1.4k tokens

Stats

LanguagePython

Parent stars0

MaintenanceExcellent

Last CommitJun 4, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

amplitude-experiment-test

Overview

Per amplitude.com/docs/experiment, the Amplitude Experiment SDKs (server-side and client-side) expose fetch + variant APIs: fetch the user's assigned variants, then read each variant on demand.

Amplitude correlates exposure + outcome events via the same user ID space as Amplitude Analytics, so exposure-event suppression in tests is important to avoid polluting analytics.

When to use

Tests for code that reads an Amplitude Experiment variant.
Suppressing exposure events in non-production test runs.
Assignment-integrity tests per ab-test-validity-checklist Step 3.

Authoring

Install

pip install amplitude-experiment           # Python (server-side)
npm install --save-dev @amplitude/experiment-node-server

Initialize (server-side)

import * as Experiment from '@amplitude/experiment-node-server';

const client = Experiment.Experiment.initializeRemote(API_KEY, {
  // Suppress real fetches in tests
  fetchTimeoutMillis: 1000,
});

For fully-offline tests, use the local evaluation mode with a flags snapshot:

import { LocalEvaluationClient } from '@amplitude/experiment-node-server';

const localClient = new LocalEvaluationClient(API_KEY, {
  flagConfigPollerIntervalMillis: 0,   // No polling in tests
});

// Or provide flags directly via the local-evaluation-config fixture
await localClient.start(localFlagConfigJson);

Fetch + read variant

const user = { user_id: 'user-1', device_id: 'dev-1' };

test('user variant from local eval', async () => {
  const variants = localClient.evaluate(user);
  expect(variants['checkout-experiment'].value).toBe('treatment-a');
});

Force a variant for a test

Amplitude Experiment's standard pattern is via the flag config: override the flag's default-variant for a specific user ID by modifying the local-eval fixture. Alternatively, mock the evaluate method:

import { jest } from '@jest/globals';

test('user in treatment', () => {
  jest.spyOn(localClient, 'evaluate').mockReturnValue({
    'checkout-experiment': { value: 'treatment-a' } as any,
  });

  const variants = localClient.evaluate(user);
  expect(variants['checkout-experiment'].value).toBe('treatment-a');
});

Suppress exposure events in tests

Default behavior fires an exposure event on variant() read. Suppress per amplitude.com/docs/experiment:

// In test setup:
const client = Experiment.Experiment.initializeRemote(API_KEY, {
  // Disable automatic exposure tracking
  automaticExposureTracking: false,
});

Assignment integrity tests

test('deterministic assignment', () => {
  const v1 = localClient.evaluate({ user_id: 'user-1' });
  const v2 = localClient.evaluate({ user_id: 'user-1' });
  expect(v1).toEqual(v2);
});

Running

npm test

CI integration

jobs:
  amplitude-experiment-tests:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v5
      - uses: actions/setup-node@v4
      - run: npm ci
      - run: npm test
        env:
          AMPLITUDE_API_KEY: ${{ secrets.AMPLITUDE_TEST_KEY }}

For fully-offline CI: skip the env var and use local-eval with checked-in flag config JSON.

Anti-patterns

Anti-pattern	Why it fails	Fix
Tests use prod Amplitude key	Test users pollute analytics	Use test workspace + dev key
Exposure events enabled in CI	Spurious exposure tracking	`automaticExposureTracking: false`
Mocking variant() result without testing the fetch	Misses fetch-network bugs	Test both layers separately
Local-eval flag JSON not committed	Test flakes when prod changes	Commit fixture
Skipping `client.stop()` / cleanup	Network handles leak	Always teardown
Different user-ID space between test + analytics	Amplitude correlation broken	Match the prod user-ID strategy

Limitations

Local-evaluation mode is feature-limited. Some flag types (CMAB, multi-armed bandit) aren't supported offline.
Mocking variant() loses targeting-rule fidelity. Use real local-eval when targeting matters.
Exposure suppression is binary. Can't selectively suppress per-test.
Doesn't validate Amplitude's results analysis. Platform-side statistics separate.

References

Amplitude Experiment docs: amplitude.com/docs/experiment.
Local evaluation: amplitude.com/docs/experiment/general/evaluation/local-evaluation.
Companion catalogs: guardrail-metrics-reference, peeking-problem-reference, ab-test-validity-checklist.
Sibling SDKs: statsig-test, optimizely-test, vwo-test.

amplitude-experiment-test

Invocation

Context Preview

SKILL.md

amplitude-experiment-test

Invocation

Context Preview

SKILL.md

amplitude-experiment-test

Overview

When to use

Authoring

Install

Initialize (server-side)

Fetch + read variant

Force a variant for a test

Suppress exposure events in tests

Assignment integrity tests

Running

CI integration

Anti-patterns

Limitations

References

Similar Skills

amplitude-experiment-test

Overview

When to use

Authoring

Install

Initialize (server-side)

Fetch + read variant

Force a variant for a test

Suppress exposure events in tests

Assignment integrity tests

Running

CI integration

Anti-patterns

Limitations

References

Similar Skills