Testing

Scraut has three layers of tests, each targeting a different failure mode.

Layer 1 — Unit tests (Python)

What they cover: Individual Python functions with all external calls mocked. No real API calls, no file I/O to real paths, no Docker.

Run locally:

cd apps/automation
pytest test/unit/ -m unit -v

Run in CI: Every push to main or develop via test-ci.yml.

What they catch: Logic bugs, wrong argument handling, fallback provider routing, use_small_model selection, token tracking.

Layer 2 — Integration tests (Python)

What they cover: Every script's --help flag — verifies all scripts are importable and their CLI argument parsers are syntactically valid.

Run locally:

cd apps/automation
pytest test/integration/ -m integration -v

Run in CI: Every push, same test-ci.yml job.

What they catch: Import errors, broken argparse configs, missing __init__.py files.

Layer 3 — Workflow lint (`actionlint`)

What they cover: All 25+ .github/workflows/*.yml files statically. No Docker, no credentials.

Run locally:

# Install
brew install actionlint            # macOS
# or: https://github.com/rhysd/actionlint/releases

actionlint                         # runs from repo root

Run in CI: Every push via the lint job in test-ci.yml.

What they catch:

YAML syntax errors
Invalid workflow/job/step structure
Broken ${{ }} expression syntax
References to secrets or contexts that don't exist
Shell script errors (via integrated shellcheck)
Incorrect action input/output wiring

Layer 4 — `act` workflow tests (local, Docker)

What they cover: The actual GitHub Actions workflow files running in a Docker container. This is the only layer that exercises:

The composite setup-ollama action
That pip install resolves correctly
That env vars and workflow inputs flow through to Python scripts
That the --dry-run path works end-to-end through the YAML → script chain

Prerequisites:

# Install act
brew install act                   # macOS
# or: https://github.com/nektos/act/releases

# Docker must be running
docker info

# Optional but recommended: GitHub CLI for GITHUB_TOKEN
gh auth login

Set up secrets (one-time):

cp apps/automation/test/act/.secrets.example .secrets
# Set GITHUB_TOKEN — minimum required
gh auth token   # copy output into .secrets

Run all act tests:

./scripts/test-act.sh

Run a single workflow:

./scripts/test-act.sh daily-standup
./scripts/test-act.sh sprint-planning
./scripts/test-act.sh template-reset
./scripts/test-act.sh backlog-grooming
./scripts/test-act.sh issue-triage

What act tests verify (dry-run mode):

Workflow	Event	dry_run covers
`template-reset.yml`	`workflow_dispatch`	standup template creation, no commit
`daily-standup.yml`	`workflow_dispatch`	summary + coach steps, no commit
`sprint-planning.yml`	`workflow_dispatch`	sprint folder + planning PR, no writes
`backlog-grooming.yml`	`workflow_dispatch`	prioritisation + scope check, no posts
`issue-triage.yml`	`workflow_dispatch`	triage script, no label/comment writes

All tests pass dry_run=true — scripts receive --dry-run and skip all writes, GitHub mutations, and Slack posts.

LLM calls still happen. Most scripts call complete() before the if not dry_run: gate, so LLM is invoked even in dry-run mode. Without credentials the fallback returns "" silently. With a GITHUB_TOKEN in .secrets, the fallback chain automatically uses GitHub Models (gpt-4o-mini) — real LLM output is generated but never written anywhere.

Using real LLM in act tests

LLM calls already happen in dry-run act tests — scripts call complete() before the dry_run gate. The question is just which model responds.

Option 1: GitHub Models (recommended, zero extra config)

GITHUB_TOKEN in .secrets auto-activates the fallback chain in scraut.yml:

fallback:
  provider: github
  model: gpt-4o-mini

When the primary provider has no key, the fallback fires → real gpt-4o-mini output. No API cost, available to all GitHub accounts.

gh auth token   # copy into .secrets as GITHUB_TOKEN=<token>

Option 2: Ollama on host (fully offline)

act runs in Docker. Ollama on your Mac is reachable inside the container at host.docker.internal:11434. To use it:

Start Ollama on your Mac:

brew install ollama
ollama pull qwen2.5:0.5b    # 394 MB, only once
ollama serve                 # keep running during tests

Point workspace/scraut.yml at the host:

llm:
  provider: ollama
  model: qwen2.5:0.5b
  base_url: http://host.docker.internal:11434

No API key needed. qwen2.5:0.5b is the best sub-1B model — 394 MB, excellent instruction-following on CPU.

Why act tests are local-only

Running act inside GitHub Actions requires Docker-in-Docker (--privileged mode), which is not available on GitHub-hosted runners. actionlint (Layer 3) provides automated YAML validation in CI. Run act tests locally before pushing workflow changes.

How the layers complement each other

Unit tests        → Python logic is correct
Integration tests → Scripts are importable, CLI args valid
actionlint        → Workflow YAML is valid, expressions correct
act tests         → Workflows actually execute end-to-end

No single layer replaces another. A broken composite action (setup-ollama) would only be caught by the act layer. A logic bug in _resolve_model() would only be caught by unit tests.

`act` image and architecture

Scraut ships a .actrc at the repo root that configures act to use the lightweight ghcr.io/catthehacker/ubuntu:act-latest image (~500 MB vs ~18 GB for the full GitHub runner image) and forces linux/amd64 architecture for consistency on Apple Silicon Macs.

# .actrc
-P ubuntu-latest=ghcr.io/catthehacker/ubuntu:act-latest
--container-architecture linux/amd64

First run pulls the Docker image once (~500 MB). Subsequent runs use the cached image.

Test file map

apps/automation/
  test/
    unit/                            ← pytest unit tests (mock everything)
    integration/                     ← pytest integration tests (--help smoke tests)
    act/
      events/
        issues_opened.json           ← issue payload for issue-triage act test
        pull_request_opened.json     ← PR payload for pr-linker/dod-check act tests
      .secrets.example               ← copy to .secrets, fill in GITHUB_TOKEN

scripts/
  test-act.sh                        ← act test runner (./scripts/test-act.sh)

.actrc                               ← act platform + architecture config
.secrets                             ← gitignored; your local act secrets

Layer 1 — Unit tests (Python)​

Layer 2 — Integration tests (Python)​

Layer 3 — Workflow lint (actionlint)​

Layer 4 — act workflow tests (local, Docker)​

Using real LLM in act tests​

Option 1: GitHub Models (recommended, zero extra config)​

Option 2: Ollama on host (fully offline)​

How the layers complement each other​

act image and architecture​

Test file map​

Layer 1 — Unit tests (Python)

Layer 2 — Integration tests (Python)

Layer 3 — Workflow lint (`actionlint`)

Layer 4 — `act` workflow tests (local, Docker)

Using real LLM in act tests

Option 1: GitHub Models (recommended, zero extra config)

Option 2: Ollama on host (fully offline)

How the layers complement each other

`act` image and architecture

Test file map