Skill

tdd

From tdd

Use TDD for all code-writing tasks. TRIGGER when the user asks to implement, build, create, fix, refactor, or write any code — features, bug fixes, refactors, APIs, utilities, etc. DO NOT TRIGGER when the user is only debugging (reading logs, inspecting state, tracing execution), asking questions about code without wanting changes, doing exploratory/spike work, or writing tests for already-existing code.

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/tdd:tdd

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Strict TDD mode. Every line of production code must be justified by a failing test — this

SKILL.md

150 lines · ~1.9k tokens

Stats

Stars0

MaintenanceGood

Last CommitMar 10, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

Test Driven Development (TDD)

Strict TDD mode. Every line of production code must be justified by a failing test — this constraint is what makes TDD valuable, because it guarantees that tests actually verify behaviour rather than just existing alongside code.

Core Cycle: Red -> Green -> Refactor

Follow this cycle for every behaviour. Never skip a phase.

RED: Write a Failing Test

Choose the next behaviour from the test list.
Write ONE test asserting ONE new behaviour. Name it as a behaviour specification (e.g., test_returns_empty_list_when_no_items_match).
The test must compile, link, and run. If the test references functions or methods that don't exist yet (common in compiled languages like C/C++/Rust/Go/Java), add minimal stubs first — declare the function in the header and provide an empty body (or return a default value) in the source file. These stubs are scaffolding to make the test runnable, not production code. Do NOT write any real logic in stubs.
Run the test. It must fail.
Verify it fails for the RIGHT reason — a wrong return value, an incorrect state, or missing behaviour. The test output must show a test assertion failure, not a compile error, linker error, or crash. If it fails for the wrong reason, fix the test or stubs until it fails on an assertion.
If the test already passes, it is not testing new behaviour — discard it and write a different one.

Run the test and confirm the assertion failure before moving to Green. Skipping this step defeats the purpose of TDD — an unverified red phase means there's no proof the test can catch a regression. A compile or link error is NOT a valid red — it proves nothing about behaviour.

GREEN: Write the Minimum Code to Pass

Write the simplest, smallest amount of production code that makes the failing test pass.
Prefer simpler transformations (Transformation Priority Premise):
- Return a constant before introducing a variable
- Use a variable before adding a conditional
- Use a conditional before introducing a loop
- Use a loop before introducing recursion This ordering matters because jumping to complex transformations often introduces speculative code that no test demanded.
Do NOT write code that is not demanded by a failing test. Premature error handling, optimisation, or extra features undermine TDD's feedback loop — they introduce untested behaviour.
Run ALL tests. They must all pass.
If getting to green takes more than 1-2 attempts, the test covers too much ground. Delete it, write a simpler test, and try again.

Run all tests and confirm they pass before moving to Refactor.

REFACTOR: Improve Structure While Green

Look at both production code AND test code for improvements:
- Remove duplication (in code AND tests)
- Improve naming
- Extract methods, functions, or classes
- Simplify conditionals
- Restructure test helpers, fixtures, or setup
Test code deserves the same quality standards as production code. Hard-to-read tests become a maintenance burden that slows down future TDD cycles.
Make ONE refactoring change at a time. Run all tests after each change.
If a refactoring breaks a test, undo it immediately and take a smaller step.
Refactoring is optional in any given cycle — skip it if the code is already clean.

Run tests after every refactoring change to confirm they still pass.

Rules

No production code without a failing test. This is TDD's core discipline. If there's an urge to write implementation code, channel it into writing a test first.
Run real tests every time. Use the project's actual test runner. Never simulate or assume test output — the whole point is to let real test results drive decisions.
Minimise mocking. Mocks lie about how the system actually works. Only mock at hard architectural boundaries (external HTTP APIs, databases without in-memory alternatives, filesystem when truly necessary, system clock/randomness). If it can be tested with real code, test it with real code.
Keep steps small. Each cycle should touch a small, focused behaviour. Needing more than ~10 lines of production code to pass a test usually means the test covers too much.
Maintain a test list. Before starting, brainstorm the behaviours to test and write them as a checklist ordered from simplest to most complex. Update the list as new cases are discovered or existing ones are completed. Share the list with the user.
Refactor tests too. When test setup becomes repetitive or helper functions could be clearer, refactor them with the same rigour as production code.
Announce the phase. Before each action, state the current phase:
- RED: Writing a test for [behaviour]
- GREEN: Writing minimum code to pass
- REFACTOR: [what is being improved]

Getting Started

When beginning a TDD task:

Understand the goal. Read existing code, understand the requirement. Ask clarifying questions if the desired behaviour is ambiguous.
Detect the test framework. Check existing test files, package.json, pyproject.toml, Cargo.toml, go.mod, Gemfile, build.gradle, or equivalent. Use whatever framework and conventions the project already uses. If no tests exist, ask the user which framework to use.
Brainstorm the test list. List behaviours from simplest to most complex. Start with degenerate cases (empty input, zero, null) and build toward core logic. Share the list with the user.
Start the first Red-Green-Refactor cycle with the simplest test from the list.

Completion

The task is done when:

All behaviours on the test list are covered and passing
The user confirms the feature is complete
The code is in a clean, refactored state with all tests green

Run the full test suite one final time to confirm everything passes.

Common Pitfalls

Writing the whole implementation then backfilling tests is not TDD. It loses the design feedback that emerges from writing tests first.
Skipping Red means a passing test proves nothing — it might pass for the wrong reason entirely.
Over-mocking (mocking three things to test one function) suggests testing through a higher-level interface with fewer mocks would be more effective.
Refactoring and adding behaviour simultaneously conflates two concerns. Refactor only when green; add behaviour only through a new failing test.
Skipping test runs removes the evidence that TDD is working. Every phase transition requires actually running tests.
Treating compile/link errors as Red (compiled languages). A test that doesn't compile or link never ran — it proves nothing about behaviour. Add stubs to make the test runnable, then confirm it fails on an assertion. Only then is it genuinely Red.

tdd

Invocation

Context Preview

SKILL.md

tdd

Invocation

Context Preview

SKILL.md

Test Driven Development (TDD)

Core Cycle: Red -> Green -> Refactor

RED: Write a Failing Test

GREEN: Write the Minimum Code to Pass

REFACTOR: Improve Structure While Green

Rules

Getting Started

Completion

Common Pitfalls

Similar Skills

Test Driven Development (TDD)

Core Cycle: Red -> Green -> Refactor

RED: Write a Failing Test

GREEN: Write the Minimum Code to Pass

REFACTOR: Improve Structure While Green

Rules

Getting Started

Completion

Common Pitfalls

Similar Skills