Skill

Testing Infrastructure

Provides centralized testing utilities, debug protocols, and CI/local parity guidelines. Includes shared lib modules for evidence capture, MCP client, campaign utilities, and browser testing guidance for Playwright vs chrome-superpower.

Python

testing

Popularity

Stars

Forks

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/claude-commands:testing-infrastructure

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

**Purpose**: Centralized testing utilities, debug protocols, and CI/local parity guidelines.

SKILL.md

145 lines · ~1.3k tokens

Stats

LanguagePython

Stars27

Forks4

MaintenanceExcellent

Last CommitJun 2, 2026

Actions

View Source View Plugin View on GitHub View README

Testing Infrastructure

Purpose: Centralized testing utilities, debug protocols, and CI/local parity guidelines.

Test Utilities (MANDATORY)

Always use testing_mcp/lib/ utilities - NEVER reimplement test infrastructure.

Available Shared Utilities

Module	Functions
`lib/evidence_utils.py`	`get_evidence_dir()`, `capture_provenance()`, `save_evidence()`, `write_with_checksum()`, `create_evidence_bundle()`, `save_request_responses()`
`lib/mcp_client.py`	`MCPClient(base_url, timeout)`, `client.tools_call(tool_name, args)`
`lib/campaign_utils.py`	`create_campaign()`, `process_action()`, `get_campaign_state()`, `ensure_game_state_seed()`
`lib/server_utils.py`	`start_local_mcp_server()`, `pick_free_port()`, `DEFAULT_EVIDENCE_ENV`
`lib/model_utils.py`	`settings_for_model()`, `update_user_settings()`
`lib/narrative_validation.py`	`validate_narrative_quality()`, `extract_dice_notation()`

Required Pattern

# Import from lib modules
from testing_mcp.lib.evidence_utils import get_evidence_dir, capture_provenance
from testing_mcp.lib.mcp_client import MCPClient
from testing_mcp.lib.campaign_utils import create_campaign, process_action

# NEVER reimplement these functions

Anti-Pattern

Writing custom capture_provenance(), get_evidence_dir(), save_evidence(), or any function that duplicates testing_mcp/lib/ functionality.

Browser Testing Tools

MANDATORY GUIDANCE: Choose the right tool for the task

chrome-superpower (MCP Tool)

Use for: Exploratory manual browsing and interactive testing

Quick browser exploration during development
Manual testing workflows that need human guidance
Inspecting page state and debugging UI issues
Taking screenshots and reading page content

DON'T use for: Deterministic automated tests (async operations don't complete reliably)

Playwright

Use for: Deterministic browser tests with validation

Automated end-to-end test suites
Tests that submit forms and verify streaming responses
Tests that need to wait for async operations to complete
CI-ready browser automation

Pattern:

from playwright.sync_api import sync_playwright

playwright = sync_playwright().start()
browser = playwright.chromium.launch(headless=True)
context = browser.new_context()
page = context.new_page()

# Navigate and interact
page.goto(url)
page.fill("#input", "text")
page.click("button[type='submit']")

# Validate results
elements = page.query_selector_all(".story-entry")
assert len(elements) > 0, "Expected story content"

Why this matters: chrome-superpower returns immediately from async JavaScript operations (shows [object Promise]), making it unsuitable for tests that need to validate streaming responses. Playwright properly waits for events and async operations.

CI/Local Parity

Mock external dependencies to ensure tests pass in both CI and local environments:

with patch('shutil.which', return_value='/usr/bin/command'):
    with patch('subprocess.run') as mock_run:
        mock_run.return_value = MagicMock(returncode=0)
        # test code here

Rules:

Mock shutil.which(), subprocess.run(), file ops
Never rely on system state in tests
Test files ($PROJECT_ROOT/tests/*) may use direct logging

Debug Protocol

Test Failure Debugging

Embed debug info in assertions, not print statements
Debugging order: Environment -> Function -> Logic -> Assertions
Test most basic assumption first: "Does the function actually work?"

# CORRECT - Debug info in assertion
debug_info = f"function_result={result}, context={context}"
self.assertTrue(result, f"FAIL DEBUG: {debug_info}")

# WRONG - Print statements (lost in CI)
print(f"Debug: {result}")

Testing Protocol

ZERO TOLERANCE: Fix ALL test failures in CI

LOCAL TESTING: Don't run full test suite locally - rely on GitHub CI

Run only SPECIFIC tests: TESTING=true python $PROJECT_ROOT/tests/test_<specific>.py
GitHub CI is the authoritative source for test results

README-aligned Runner Selection (Critical)

`testing_mcp` suites

Treat many testing_mcp/*.py files as script entrypoints, not pytest-collected test modules.
Prefer direct execution:
- cd testing_mcp && ../vpython test_<name>.py --server http://127.0.0.1:8001
- cd testing_mcp && ../vpython test_<name>.py --start-local
- Schema scripts: ./vpython testing_mcp/schema/test_schema_<name>.py
Avoid pytest testing_mcp/... for script-style files that parse CLI args or expect script runtime setup.

`testing_ui` browser auth bypass

Follow $PROJECT_ROOT/testing_ui/README_TEST_MODE.md exactly:
- Start backend with TESTING_AUTH_BYPASS=true.
- Open UI with ?test_mode=true&test_user_id=<id>.
- Verify browser flow sets/uses:
  - window.testAuthBypass.enabled
  - X-Test-Bypass-Auth: true
  - X-Test-User-ID: <id>
This is mandatory for browser E2E runs that cannot inject custom auth headers directly.

MCP Smoke Tests

MCP_SERVER_URL="https://..." MCP_TEST_MODE=real node scripts/mcp-smoke-tests.mjs

Hard-fails on any non-200 response
Results saved to /tmp/repo/branch/smoke_tests/

Related Skills

evidence-standards.md - Evidence capture standards
end2end-testing.md - E2E test patterns

Testing Infrastructure

Popularity

Invocation

Context Preview

SKILL.md

Testing Infrastructure

Popularity

Invocation

Context Preview

SKILL.md

Testing Infrastructure

Test Utilities (MANDATORY)

Available Shared Utilities

Required Pattern

Anti-Pattern

Browser Testing Tools

chrome-superpower (MCP Tool)

Playwright

CI/Local Parity

Debug Protocol

Test Failure Debugging

Testing Protocol

README-aligned Runner Selection (Critical)

testing_mcp suites

testing_ui browser auth bypass

MCP Smoke Tests

Related Skills

Similar Skills

Testing Infrastructure

Test Utilities (MANDATORY)

Available Shared Utilities

Required Pattern

Anti-Pattern

Browser Testing Tools

chrome-superpower (MCP Tool)

Playwright

CI/Local Parity

Debug Protocol

Test Failure Debugging

Testing Protocol

README-aligned Runner Selection (Critical)

testing_mcp suites

testing_ui browser auth bypass

MCP Smoke Tests

Related Skills

Similar Skills

`testing_mcp` suites

`testing_ui` browser auth bypass

`testing_mcp` suites

`testing_ui` browser auth bypass