From workflows
Enforces RED-GREEN-REFACTOR cycle: prompts writing a failing test first, verifying failure, then implementing to pass. Includes structured gates for build, launch, and log-based verification before E2E testing.
How this skill is triggered — by the user, by Claude, or both
Slash command
/workflows:dev-tddThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
- [The Iron Law](#the-iron-law-of-tdd)
Your job is NOT to implement features. Your job is to write tests that prove features work.
Reframe every task:
❌ "Implement user login"
✅ "Write a test that proves user login works. Then make it pass."
❌ "Fix the icon rendering bug"
✅ "Write a test that fails when icons render wrong. Then fix it."
The test IS your deliverable. The implementation just makes the test pass.
## File-Based Logging (MANDATORY)ALL CODE MUST USE FILE-BASED LOGGING.
Every application you write MUST redirect output to a log file:
./app > /tmp/app.log 2>&1 &./app --log-file=/tmp/app.log 2>&1 &pytest -v > /tmp/test.log 2>&1Why: Without log files, you have NO EVIDENCE of what happened. "I saw it in terminal" is not verification.
Read the full requirements: @references/logging-requirements.md
NO E2E TESTS WITHOUT PASSING THE EXECUTION GATE FIRST.
Before running E2E tests or taking screenshots, you MUST complete all 6 gates in order:
GATE 1: BUILD
GATE 2: LAUNCH (with file-based logging)
GATE 3: WAIT
GATE 4: CHECK PROCESS
GATE 5: READ LOGS ← MANDATORY, CANNOT SKIP
GATE 6: VERIFY LOGS
THEN AND ONLY THEN: E2E tests/screenshots
Key enforcement:
For GUI applications:
Read the complete gate sequence: @references/execution-gates.md
YOU MUST WRITE THE FAILING TEST FIRST. YOU MUST SEE IT FAIL. This is not negotiable.
Before writing ANY implementation code:
The RED step is not optional. If the test hasn't failed, you haven't practiced TDD.
RED → Write test → Run through GATES → See failure → Read logs → Document
GREEN → Minimal code → Run through GATES → See pass → Read logs → Document
REFACTOR → Clean up while staying green
# Write the test FIRST
def test_user_can_login():
result = login("[email protected]", "password123")
assert result.success == True
assert result.token is not None
Run the test through the execution gates:
# For unit tests, minimum gates are: EXECUTE + READ OUTPUT
pytest tests/test_auth.py::test_user_can_login -v 2>&1 | tee /tmp/test.log
# pytest: run specific test and see RED failure
# READ the output (MANDATORY)
cat /tmp/test.log
Output will show:
FAILED - NameError: name 'login' is not defined
Log to LEARNINGS.md:
## RED: test_user_can_login
- Test written
- Ran through gates (pytest executed, output read)
- Fails with: NameError: name 'login' is not defined
- Expected: function doesn't exist yet
Write the minimum code to make the test pass:
def login(email: str, password: str) -> LoginResult:
# Minimal implementation
return LoginResult(success=True, token="dummy-token")
Run the test through gates again:
pytest tests/test_auth.py::test_user_can_login -v 2>&1 | tee /tmp/test.log
# pytest: run test again and see GREEN success
# READ the output (MANDATORY)
cat /tmp/test.log
Output will show:
PASSED
Log to LEARNINGS.md:
## GREEN: test_user_can_login
- Minimal login() implemented
- Ran through gates (pytest executed, output read)
- Test passes
- No errors in output
- Ready for refactor
Clean up the code while keeping tests passing:
def login(email: str, password: str) -> LoginResult:
user = User.find_by_email(email)
if user and user.check_password(password):
return LoginResult(success=True, token=generate_token(user))
return LoginResult(success=False, token=None)
Verify tests remain green after refactoring:
pytest tests/test_auth.py -v
# pytest: run all tests and verify GREEN after refactor
Output will show:
All tests PASSED
Read the shared enforcement:
Read ${CLAUDE_SKILL_DIR}/../../references/constraints/real-test-enforcement.md.
Key rule: THE TEST MUST EXECUTE THE CODE AND VERIFY RUNTIME BEHAVIOR. Grepping is NOT testing. Log reading is NOT testing. Code review is NOT testing.
Document every TDD cycle in .planning/LEARNINGS.md:
## TDD Cycle: [Feature/Test Name]
### RED
- **Test:** `test_feature_works()`
- **Command:**
```bash
pytest tests/test_feature.py::test_feature_works -v
# pytest: run test and observe RED failure
FAILED - AssertionError: expected True, got None
feature_works() functionpytest tests/test_feature.py::test_feature_works -v
# pytest: run test and verify GREEN success
PASSED
pytest tests/test_feature.py -v
# pytest: run all tests and confirm GREEN after refactor
## TDD Facts
- A test that passes on its first run has proven nothing: it never failed, so it cannot distinguish working code from a test that exercises nothing. The RED run is the only evidence that the test actually tests the feature. Claiming TDD on a never-red test is an unverified claim presented as fact.
- When an assertion fails during GREEN, the test itself may be wrong — but editing the assertion to match observed output converts the test into a record of the bug. Question the assertion before changing it.
- Mocks, direct function calls, and substitute protocols verify the stand-in, not the production behavior — boundary bugs live exactly where the stand-in replaces the real thing. A test that doesn't replicate what the user does is a FAKE test, and unit tests alone hide those boundary bugs.
**If your test doesn't fail first, you haven't practiced TDD.**
## Delete & Restart
<EXTREMELY-IMPORTANT>
**Wrote implementation code before test? You MUST DELETE IT. No exceptions.**
When you discover implementation code that wasn't driven by a test:
1. **DELETE** your implementation code
2. **WRITE** the test first
3. **RUN** it, **SEE RED**
4. **REWRITE** the implementation
"But it works" is not an excuse. "But it would waste your time" is not an excuse.
**Code you wrote without TDD is UNTRUSTED code. You delete it and do it right.**
</EXTREMELY-IMPORTANT>
## E2E Test Requirement
<EXTREMELY-IMPORTANT>
### The Iron Law of E2E in TDD
**USER-FACING FEATURES REQUIRE E2E TESTS IN ADDITION TO UNIT TESTS.**
TDD cycle for user-facing changes:
Unit TDD: RED → GREEN → REFACTOR ↓ E2E TDD: RED → GREEN → REFACTOR
**Both cycles must complete. Unit GREEN does not mean DONE.**
### When E2E is Required
| Change Type | Unit Tests | E2E Required? |
|-------------|------------|---------------|
| Internal logic | Yes | No |
| API endpoint | Yes | Yes (test full request/response) |
| UI component | Yes | **Yes** (Playwright/automation) |
| CLI command | Yes | Yes (test actual invocation) |
| User workflow | Yes | **Yes** (simulate user actions) |
| Visual change | Yes | **Yes** (screenshot comparison) |
### E2E TDD Cycle
1. **RED**: Write E2E test simulating user action
- Run through ALL 6 GATES (BUILD → LAUNCH → WAIT → CHECK → READ LOGS → VERIFY LOGS)
- Only after GATE 6: Run the E2E test
- Observe the failure (feature doesn't exist)
- Document: "E2E RED: [test] fails with [error]. All gates passed, logs clean."
2. **GREEN**: Implement to make E2E pass (unit tests already green)
- Run through ALL 6 GATES again
- Only after GATE 6: Run the E2E test
- Verify the pass
- Document: "E2E GREEN: [test] passes. All gates passed, logs clean."
3. **REFACTOR**: Ensure both unit and E2E stay green
- Continue running through gates for each test run
### Delete & Restart (E2E)
**You shipped user-facing code without E2E test? You MUST WRITE ONE NOW.**
Retroactive E2E is better than no E2E. But next time: You write E2E FIRST.
</EXTREMELY-IMPORTANT>
## Integration
This skill is invoked by:
- `dev-implement` - for TDD during implementation
- `dev-debug` - for regression tests during debugging
For testing tool options (Playwright, ydotool, etc.), see:
Read `${CLAUDE_SKILL_DIR}/../../skills/dev-test/SKILL.md` and follow its instructions.
npx claudepluginhub edwinhu/workflows --plugin workflowsEnforces strict Test-Driven Development (TDD): write failing test first for features, bug fixes, refactors before any production code.
Enforces RED-GREEN-REFACTOR TDD cycle: write a failing test first, then minimal code to pass, then refactor. Use when implementing features or fixing bugs during the implement phase.