Context
BatchEngine.tick(verdict, ...) accepts a verdict(spec_tree, members, reps) -> BatchVerdict callable as its seam; today that seam is fed only by from_green and fakes. In production it must run the repo's real CI against the spec tree under repetition and return DET_PASS/DET_FAIL/UNRESOLVED per test.
Scope
- Dispatch a GitHub Actions run (or check-run) per spec ref; collect per-test outcomes across reps.
- Map results into BatchVerdict.per_test; run under the hermetic test runtime (see hermetic-test-runtime issue).
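One possible shape for the dispatch step, as a minimal sketch rather than the intended implementation. It builds (but does not send) one workflow_dispatch request per repetition against the GitHub REST endpoint for creating a workflow dispatch event. The function name build_dispatch_requests, the workflow file name, and the "rep" input are hypothetical; the sketch also assumes the sandbox workflow declares a workflow_dispatch trigger with a string input named rep, and that the spec ref is a branch or tag name.

```python
import json
import urllib.request
from typing import List


def build_dispatch_requests(
    owner: str,
    repo: str,
    workflow_file: str,  # hypothetical, e.g. "ci.yml"
    spec_ref: str,       # assumed to be a branch or tag name
    reps: int,
    token: str,
) -> List[urllib.request.Request]:
    """Build one workflow_dispatch request per repetition (nothing is sent)."""
    url = (
        f"https://api.github.com/repos/{owner}/{repo}"
        f"/actions/workflows/{workflow_file}/dispatches"
    )
    requests = []
    for rep in range(reps):
        # workflow_dispatch inputs are passed as strings; "rep" is an assumed input name.
        body = json.dumps({"ref": spec_ref, "inputs": {"rep": str(rep)}})
        requests.append(
            urllib.request.Request(
                url,
                data=body.encode("utf-8"),
                method="POST",
                headers={
                    "Accept": "application/vnd.github+json",
                    "Authorization": f"Bearer {token}",
                },
            )
        )
    return requests
```

Each request would then be sent with urllib.request.urlopen (or an equivalent client), and the resulting runs polled for per-test results. How those results are exported from a run (artifact, check-run annotations, or something else) is left open here.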
Acceptance
A real PR lands end-to-end on a sandbox repo via Actions, and a flaky test and a deterministic-red test are each classified correctly from real runs (no masking).
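The classification this criterion asks for could be sketched as below. This is an illustration under stated assumptions, not the actual verdict.py logic: the Outcome enum, the classify_per_test helper, and the input shape (one test-id-to-passed mapping per repetition) are hypothetical. In particular, mapping a flaky test (mixed results) or a test with missing repetitions to UNRESOLVED is an assumption about the intended semantics.

```python
from collections import defaultdict
from enum import Enum
from typing import Dict, Iterable, List, Mapping


class Outcome(Enum):
    DET_PASS = "DET_PASS"
    DET_FAIL = "DET_FAIL"
    UNRESOLVED = "UNRESOLVED"


def classify_per_test(
    runs: Iterable[Mapping[str, bool]], reps: int
) -> Dict[str, Outcome]:
    """Classify each test from its pass/fail results across repetitions.

    `runs` holds one mapping per repetition: test id -> True if it passed.
    """
    seen: Dict[str, List[bool]] = defaultdict(list)
    for run in runs:
        for test_id, passed in run.items():
            seen[test_id].append(passed)

    per_test: Dict[str, Outcome] = {}
    for test_id, results in seen.items():
        if len(results) < reps:
            # Assumption: a test missing from any repetition cannot be resolved.
            per_test[test_id] = Outcome.UNRESOLVED
        elif all(results):
            per_test[test_id] = Outcome.DET_PASS
        elif not any(results):
            per_test[test_id] = Outcome.DET_FAIL
        else:
            # Assumption: mixed results (flaky) are reported as UNRESOLVED,
            # so a flaky test is never masked as a deterministic pass or fail.
            per_test[test_id] = Outcome.UNRESOLVED
    return per_test
```

Only a test that fails in every repetition becomes DET_FAIL, which keeps a deterministic-red test distinguishable from a flaky one.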
Seam
tools/conductor/verdict.py, batch_engine.py (the verdict param).