The War Machine: Dispatch, Leasing, and a Fifty-Case Simulation

Stand up the autopilot proper — OODA dispatch, serverless git-ref CAS leasing, RICE triage, tiered pipelines, and a fifty-case end-to-end simulation that proves the fleet before it ever ships.

🔥 Lvl 1000Warrior 🏰 Main Quest ⚔️ Epic 4-6 hours

The War Machine: Dispatch, Leasing, and a Fifty-Case Simulation

The scouts have returned. In Chapter II — The Proving Grounds you taught your realm to look at itself — to survey the terrain, score its own weaknesses, and write an honest ledger of what was broken. But a ledger is not a campaign. A pile of grievances is not an army. Today you build the War Machine: the thing that decides what to fight, claims a single battle so no two soldiers swing at the same foe, and rehearses fifty skirmishes in a war-room sandbox before a single real blade is drawn.

Primary Tech: 🛠️ github-actions
Skill Focus: Devops
Series: The Autonomous Realm
Author: IT-Journey Team
XP Range: ⚡ 3000-3750

📈 Your Progress

Not started · 0%

Your progress is stored in your browser only. Use your inventory to back it up.

🗝️ Prerequisites

Recommended quests

The Proving Grounds: The Repo's First CI Gate

Knowledge requirements

Completed Chapter II — The Proving Grounds
Comfortable with Git, branches, and pull requests
Basic GitHub Actions familiarity

System requirements

A GitHub repository you own
Git and a text editor or IDE
A Claude Code OAuth token to drive the agent steps

The scouts have returned. In Chapter II — The Proving Grounds you taught your realm to look at itself — to survey the terrain, score its own weaknesses, and write an honest ledger of what was broken. But a ledger is not a campaign. A pile of grievances is not an army. Today you build the War Machine: the thing that decides what to fight, claims a single battle so no two soldiers swing at the same foe, and rehearses fifty skirmishes in a war-room sandbox before a single real blade is drawn.

The real-world skill underneath the fantasy is autonomous work orchestration: an OODA dispatch loop, distributed leasing with no database, RICE prioritization, and end-to-end simulation. Master this and you can build any fleet of agents that picks its own work and never trips over itself.

📖 The Legend Behind This Quest

Every great keep eventually outgrows the lone caretaker who walks the halls fixing things by hand. The realm needs a war machine — a mechanism that observes the state of the kingdom, orients on what matters most, decides on exactly one quest worth doing, and acts on it without waiting for a human to point. The danger is chaos: a dozen automatons all “fixing” the same crumbling wall, overwriting each other’s mortar. The genius of this chapter is that the machine needs no central overseer and no database — it uses the kingdom’s own chronicle (its git history) as the single source of truth for who claimed what. Git is the ledger. The lease is a ref. And before any of it touches the living castle, you run a fifty-case dress rehearsal in the war room.

🎯 Quest Objectives

Primary Objectives

Implement an OODA dispatch loop (Observe → Orient → Decide → Act) that selects exactly one unit of work per run
Build serverless work-leasing using git refs as a distributed compare-and-swap (CAS) lock so concurrent runners never collide
Score a backlog with RICE triage (Reach × Impact × Confidence ÷ Effort) and emit a deterministic priority order
Author a fifty-case end-to-end simulation that exercises the dispatcher offline and asserts no double-leasing

Mastery Indicators

Two parallel dispatch runs claim different work items, proven by a failing-then-passing collision test
The simulation runs in CI as a tiered pipeline (fast lint → unit → full E2E) and gates merges
You can explain why a git ref makes a correct distributed lock without Redis, a DB, or a queue

🗺️ Quest Prerequisites

Before you fire up the war machine, make sure your armory is stocked:

Prior chapter: Finish Chapter II — The Proving Grounds. You need its self-survey worklist — the ledger of issues — because that ledger is the backlog this dispatcher observes.
Tools on your bench: git (2.30+), python3 (3.10+ for the Task | None union syntax used below), and a text editor or IDE. The CI tier also installs ruff and pytest — locally, pip install ruff pytest.
Accounts & access: A GitHub repository you own (you must be able to push refs to it), and a Claude Code OAuth token stored as the CLAUDE_CODE_OAUTH_TOKEN repository secret to drive the agent steps that perform the leased work.
Concepts you should already hold: Git branches and pull requests, and a basic feel for GitHub Actions jobs and the needs: keyword.

🧙‍♂️ Chapter 1: The OODA Dispatcher — Deciding What to Fight

⚔️ Skills You’ll Forge

Modeling the OODA loop (Observe, Orient, Decide, Act) as discrete, testable stages
RICE scoring to turn a messy backlog into a single deterministic choice
Keeping “decide” pure so the same inputs always produce the same pick

A dispatcher is the brain of the fleet. Given a backlog of candidate tasks — each one a row in the ledger your scouts wrote in Chapter II — it must choose one to act on. Resist the urge to make it clever and stateful. The strongest dispatchers are boring: a pure function from (backlog, snapshot) → one task. Everything else (claiming, running, reporting) lives outside the decision.

The OODA loop gives us four named stages. Observe loads the world (the worklist file, open PRs, recent runs). Orient filters and enriches — drop anything already in flight, attach a freshness penalty. Decide ranks what remains and returns the top item. Act hands that item off to be leased and worked.

We rank with RICE: score = (Reach × Impact × Confidence) / Effort. Reach is how many pages or users a fix touches, Impact is how much it moves the needle, Confidence is how sure we are, and Effort is the cost. Dividing by effort means a cheap, broad, high-confidence win beats a heroic gamble every time.

Save these pure functions as scripts/decide_core.py — decide.py below imports them with from decide_core import Task, decide, so the file must sit next to decide.py in scripts/ (or be on your PYTHONPATH), otherwise you get ModuleNotFoundError: No module named 'decide_core'.

# scripts/decide_core.py — the pure OODA/RICE decision functions, imported by decide.py.
from dataclasses import dataclass

@dataclass(frozen=True)
class Task:
    id: str
    reach: float       # how many pages/users this touches
    impact: float      # 0.25 .. 3.0 (massive)
    confidence: float  # 0.0 .. 1.0
    effort: float      # person-hours, never zero

def rice_score(t: Task) -> float:
    # Cheap, broad, high-confidence wins float to the top.
    return (t.reach * t.impact * t.confidence) / max(t.effort, 0.5)

def decide(backlog: list[Task], in_flight: set[str]) -> Task | None:
    """The pure heart of OODA: same inputs -> same single pick."""
    candidates = [t for t in backlog if t.id not in in_flight]
    if not candidates:
        return None
    # Sort by score desc, then id for a stable, deterministic tiebreak.
    return sorted(candidates, key=lambda t: (-rice_score(t), t.id))[0]

Notice three deliberate choices. The decision is pure — no I/O, no clock, no randomness — so it is trivially unit-testable and reproducible. The tiebreak on t.id makes the pick stable: two runners reading the same backlog will choose the same task, which is exactly the property the leasing layer in Chapter 2 turns into a safe race. And in_flight is passed in, not fetched, so “observe” stays separate from “decide.”

The two helpers the dispatch script shells out to are thin. build_backlog.py is the Observe stage: it reads your Chapter-II worklist and prints a JSON array of task objects. decide.py is the Orient + Decide stage: it reads that JSON on stdin, builds Task objects, calls decide(), and prints the chosen id.

# scripts/build_backlog.py — Observe: emit the backlog as JSON.
# Output shape (one object per candidate task), consumed by decide.py:
#   [{"id": "fix-broken-link-42", "reach": 120, "impact": 2.0,
#     "confidence": 0.9, "effort": 1.5}, ...]
import json
import sys

def build_backlog() -> list[dict]:
    # Replace this stub with a read of your Chapter-II worklist.
    # Each row becomes one task dict with the five RICE-relevant fields.
    return [
        {"id": "fix-broken-link-42", "reach": 120, "impact": 2.0,
         "confidence": 0.9, "effort": 1.5},
        {"id": "add-meta-description-home", "reach": 300, "impact": 1.5,
         "confidence": 0.95, "effort": 0.5},
    ]

if __name__ == "__main__":
    json.dump(build_backlog(), sys.stdout)

# scripts/decide.py — Orient + Decide: read backlog JSON on stdin, print one id.
import json
import sys
from decide_core import Task, decide   # the pure functions defined above

def main() -> None:
    rows = json.load(sys.stdin)
    backlog = [Task(**row) for row in rows]
    # in_flight could be fetched from open lease refs; empty here for clarity.
    chosen = decide(backlog, in_flight=set())
    if chosen is not None:
        print(chosen.id)   # empty stdout => "nothing to dispatch"

if __name__ == "__main__":
    main()

The Act stage is thin — it just announces intent and delegates:

#!/usr/bin/env bash
# dispatch.sh — Observe -> Orient -> Decide, then hand off to Act (lease).
set -euo pipefail

BACKLOG_JSON="$(python3 scripts/build_backlog.py)"        # Observe
TASK_ID="$(echo "$BACKLOG_JSON" | python3 scripts/decide.py)"  # Orient + Decide

if [[ -z "$TASK_ID" ]]; then
  echo "::notice::Backlog empty after orient — nothing to dispatch."
  exit 0
fi

echo "Dispatcher selected task: $TASK_ID"
exec scripts/lease.sh "$TASK_ID"                          # Act

🔍 Knowledge Check

Why must decide() be a pure function for the leasing safety property to hold?
In RICE, what happens to the ranking if you forget to divide by Effort — and which kind of task wins instead?
Why does the dispatcher select exactly one task per run instead of a batch?

🧙‍♂️ Chapter 2: Git-Ref Leasing — The Database You Already Have

⚔️ Skills You’ll Forge

Using a git ref as an atomic compare-and-swap (CAS) lock
Building distributed mutual exclusion with no server, no DB, no queue
Writing a fifty-case simulation that proves no two runners claim the same work

Here is the trick that makes the whole fleet serverless. To claim a task safely, you need an atomic operation that exactly one racer can win. Most teams reach for Redis or a database row lock. But you already run a distributed, transactional, atomic store on every push: git. Creating a ref on the remote is atomic — if the ref already exists, a non-fast-forward push fails. That failure is your lock contention signal.

Here CAS means compare-and-swap — an atomic conditional update: “set this ref to X only if its current value is what I expect.” (This is the concurrency primitive; do not confuse it with a content-addressable store, which is a different idea about naming data by its hash.) Git exposes exactly this through --force-with-lease, whose value names the ref and the expected current value. We give an empty expected value, which asserts the ref must not already exist — a create-only CAS. Two runners that decide() the same task try to create the same ref, and the remote lets only one succeed.

Note: The raw/endraw tags you’ll see around the YAML below are Jekyll escapes for this site’s renderer — omit them when you copy the YAML into your own .github/workflows/.

#!/usr/bin/env bash
# lease.sh — claim a task by creating a git ref via create-only CAS. Atomic on the remote.
set -euo pipefail
TASK_ID="$1"

# Sanitize: the task id becomes part of a ref name, so reject anything risky.
[[ "$TASK_ID" =~ ^[A-Za-z0-9_-]+$ ]] || { echo "::error::bad TASK_ID: $TASK_ID"; exit 1; }

LEASE_REF="refs/leases/${TASK_ID}"

# Point the lease at a UNIQUE per-runner commit — NOT at HEAD.
# If two runners share the same HEAD SHA, pushing HEAD to the lease ref is
# byte-identical for both, so git reports "Everything up-to-date" (exit 0) for
# the second runner too and BOTH would believe they won. A per-attempt nonce
# commit makes the two pushes differ, so the create-only CAS below correctly
# rejects the loser with a stale-info error.
NONCE="${GITHUB_RUN_ID:-$$}-${RANDOM}-$(date +%s%N)"
LEASE_SHA="$(GIT_AUTHOR_NAME=war-machine GIT_AUTHOR_EMAIL=war-machine@ci \
  GIT_COMMITTER_NAME=war-machine GIT_COMMITTER_EMAIL=war-machine@ci \
  git commit-tree 'HEAD^{tree}' -p HEAD -m "lease ${TASK_ID} ${NONCE}")"
git update-ref "$LEASE_REF" "$LEASE_SHA"

# The push IS the compare-and-swap. An EMPTY expected value
# (the part after the trailing ':') asserts the ref must NOT already
# exist -> a create-only CAS. If another runner created it first,
# the expected value mismatches and the push is rejected.
if git push origin --force-with-lease=refs/leases/$TASK_ID: "$LEASE_REF" 2>/dev/null; then
  echo "::notice::Leased ${TASK_ID} — this runner owns the work."
  exec scripts/work.sh "$TASK_ID"     # we won the race; do the work
else
  echo "::notice::${TASK_ID} already leased by another runner — standing down."
  git update-ref -d "$LEASE_REF"      # clean up our local ref
  exit 0                              # losing is success, not failure
fi

Because the expected value is empty, the remote accepts the push only if the lease ref does not yet exist. So when two GitHub Actions runners fire at the same moment, both call decide(), both pick the same top task (thanks to the stable tiebreak), both try to create refs/leases/<id>, and exactly one push is accepted. The loser sees a stale-info / non-fast-forward rejection, treats it as “someone else has this,” and exits cleanly. No coordinator. No polling. No split brain.

work.sh is the winner’s payload: it runs the agent against the task and, win or lose at the end, releases the lease by deleting the ref so the task can be re-attempted later if needed.

#!/usr/bin/env bash
# work.sh — do the leased task, then release the lease no matter what.
set -euo pipefail
TASK_ID="$1"
LEASE_REF="refs/leases/${TASK_ID}"

release() { git push origin --delete "$LEASE_REF" 2>/dev/null || true; }
trap release EXIT   # release on success, failure, or interrupt

echo "Working task ${TASK_ID} ..."
# ... drive the Claude Code agent step here to perform the actual fix ...

Now prove it. A claim like “no double-leasing” is worthless until a test tries to break it. The fifty-case simulation spins up the dispatcher against a synthetic backlog across many concurrent “runners” and asserts every task is claimed at most once. The shared claimed dict stands in for the remote’s ref namespace, and threading.Lock() stands in for the remote’s atomic ref creation — keying on the plain task id, exactly as a refs/leases/<id> ref would:

import concurrent.futures, threading

def simulate_run(seed: int, backlog, claimed: dict, lock):
    """One simulated runner: decide, then attempt an atomic claim."""
    with lock:                                # stands in for the remote's atomicity
        in_flight = {t for t in claimed}      # the lease refs that already exist
        task = decide(backlog, in_flight=in_flight)
        if task is None:
            return None
        if task.id in claimed:                # ref already existed -> we lost
            return ("lost", task.id)
        claimed[task.id] = seed               # create-only CAS succeeds
        return ("won", task.id)

def test_no_double_lease():
    backlog = [Task(f"t{i}", reach=i+1, impact=1.0, confidence=0.9, effort=2.0)
               for i in range(50)]
    claimed, lock = {}, threading.Lock()
    with concurrent.futures.ThreadPoolExecutor(max_workers=50) as pool:
        results = list(pool.map(
            lambda s: simulate_run(s, backlog, claimed, lock), range(50)))
    won = [r[1] for r in results if r and r[0] == "won"]
    # The load-bearing assertion: every claim is unique. No collisions, ever.
    assert len(won) == len(set(won)), f"double-lease detected: {won}"

Because each runner re-reads claimed inside the lock and adds the chosen id to in_flight, the next runner to acquire the lock sees that id as already-leased and decide() skips it — so fifty runners spread across fifty distinct tasks rather than dogpiling one. Run this fifty times with fifty runners and the assertion holds every time. In CI you can run the real thing against a throwaway branch instead of the lock. Wire all of it into a tiered pipeline so cheap checks fail fast and the expensive simulation only runs when the cheap ones pass:

# .github/workflows/war-machine.yml — fast lint -> unit -> full E2E
name: War Machine
on: [pull_request]
jobs:
  lint:        # tier 1: seconds
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: '3.12'
      - run: pip install ruff pytest
      - run: ruff check scripts/
  unit:        # tier 2: the pure decide()/RICE tests
    needs: lint
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: '3.12'
      - run: pip install ruff pytest
      - run: pytest tests/unit -q
  simulation:  # tier 3: the 50-case end-to-end leasing sim — gates merge
    needs: unit
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: '3.12'
      - run: pip install ruff pytest
      - run: pytest tests/sim/test_dispatch_sim.py -q

🔍 Knowledge Check

Why does a create-only CAS (an empty expected value with --force-with-lease) give you a correct distributed lock for free?
What property of decide() guarantees two runners race on the same ref rather than quietly doubling up?
Why does the tiered pipeline put the fifty-case simulation last instead of first?

🔁 Reproduce It

This chapter is anchored to real merged work in the bamr87/lifehacker.dev fleet — the same War Machine, built for real:

bamr87/lifehacker.dev#7 (bamr87/lifehacker.dev@178756d97) — stood up the initial OODA dispatch loop and the worklist-driven backlog the dispatcher observes.
bamr87/lifehacker.dev#16 (bamr87/lifehacker.dev@412397b21) — added serverless git-ref CAS leasing so concurrent runners claim distinct work without a database.
bamr87/lifehacker.dev#21 (bamr87/lifehacker.dev@d93355df6) — introduced RICE triage and the fifty-case end-to-end simulation wired into the tiered pipeline.

Read the diffs in order and you’ll watch the war machine assemble exactly as this chapter teaches it.

🎮 Mastery Challenge

Objective: Prove your war machine cannot double-lease, under contention, in CI.

Two parallel dispatch runs against the same backlog claim two different task ids (capture both run logs as proof)
The fifty-case simulation passes 50/50 with the no-double-lease assertion enabled, and fails when you temporarily remove the atomic check (prove the test has teeth)
The tiered pipeline blocks a merge when the simulation tier fails, and allows it when all three tiers are green

🎁 Rewards & Progression

Badge earned: 🏗️ Castle Mechanic — the fleet and simulation stand up
Skills unlocked:
- 🛠️ Serverless work-leasing with git refs (CAS)
- 🧠 OODA dispatch + RICE triage
+150 XP

🗺️ Quest Network

graph LR
    A["Chapter II:<br/>The Proving Grounds"] --> B["Chapter III:<br/>The War Machine"]
    B --> C["Chapter IV:<br/>The Sigils of Trust"]
    style B fill:#7c3aed,stroke:#4c1d95,color:#fff

🔮 Next Adventures

The machine now picks its battles and claims them safely — but who signs off on the work it ships? Next you’ll forge the trust layer: gated permissions, smuggle-guards, and the sigils that let agents merge without a human on the keyboard.

Next chapter: The Sigils of Trust
Campaign hub: The Self-Operating Website

📚 Resource Codex

GitHub Actions documentation — workflows, jobs, and needs dependencies for tiered pipelines
git update-ref documentation — the low-level ref plumbing behind the lease
git push --force-with-lease — safe compare-and-swap semantics for refs
Claude Code documentation — driving the agent steps that perform the leased work

🕸️ Knowledge Graph

Structured wiki-links connect this quest to the IT-Journey knowledge graph. Open the Obsidian Graph View to explore connections.

Campaign hub: [[Epic Quest: The Self-Operating Website]] Previous: [[The Proving Grounds]] Next: [[The Sigils of Trust]] Obsidian docs: [[Obsidian Knowledge Graph and Wiki Links]]

🎁 Rewards

150 XP

Badges

🏗️ Castle Mechanic — the fleet and simulation stand up

Skills unlocked

🛠️ Serverless work-leasing with git refs (CAS)
🧠 OODA dispatch + RICE triage

Features unlocked

Continue The Self-Operating Website campaign

Unlocks

The Sigils of Trust: Auth, Secrets, and the Kill Switch

🕸️ Quest Network

Click a node to open the quest · ⌘/Ctrl-click for a new tab · drag to reposition · scroll to zoom.

Referenced by

Loading…

Layout	`quest`
Collection	`quests`
Path	`_quests/1000/self-operating-website-03-the-war-machine.md`
URL	`/quests/1000/self-operating-website-03-the-war-machine/`
Date	`2026-06-28`

Settings

Color Mode

Theme Skin

Background

Environment

Theme & Build

Page Location

Page Info

Source Code

The War Machine: Dispatch, Leasing, and a Fifty-Case Simulation

Table of Contents

The War Machine: Dispatch, Leasing, and a Fifty-Case Simulation

📖 The Legend Behind This Quest

🎯 Quest Objectives

Primary Objectives

Mastery Indicators

🗺️ Quest Prerequisites

🧙‍♂️ Chapter 1: The OODA Dispatcher — Deciding What to Fight

⚔️ Skills You’ll Forge

🔍 Knowledge Check

🧙‍♂️ Chapter 2: Git-Ref Leasing — The Database You Already Have

⚔️ Skills You’ll Forge

🔍 Knowledge Check

🔁 Reproduce It

🎮 Mastery Challenge

🎁 Rewards & Progression

🗺️ Quest Network

🔮 Next Adventures

📚 Resource Codex

🕸️ Knowledge Graph

🎁 Rewards

Badges

Skills unlocked

Features unlocked

Unlocks

🕸️ Quest Network

Referenced by