Skip to content

Phase 23.1: Clean up evidence pipeline contract #219

Description

@jununfly

This was generated by AI during triage.

Parent

What to build

Tighten the benchmark/evidence tooling contract introduced in Phase 22 so future Rust indexing optimization candidates can reuse artifact comparison, ranking, and decision-draft output without hand-written interpretation.

The toolchain must remain local-only. It must not call GitHub, require network access, or update trackers automatically.

Acceptance criteria

  • Consolidate duplicated artifact parsing and comparison helpers introduced by the evidence pipeline.
  • Normalize target status, required/stress classification, empty-corpus status, sufficiency, graphStats parity, wall time, RSS, and unavailable-reason output.
  • Keep rollout-readiness disclaimers explicit in generated output.
  • Keep comparison, ranking, and decision-draft output reusable for later Post-PRD optimization tracker for Rust indexing performance #165 candidates.
  • Do not add GitHub/network behavior to the tooling.
  • Do not add new benchmark dimensions unless they directly support candidate selection.
  • Add or update fixture tests for artifact parsing and comparison output.
  • Add or update fixture tests for ranking and decision draft output.
  • Smoke-generate output using existing Phase 22 artifacts.
  • Do not run a full VS Code sparse scoreboard for this slice.

Blocked by

None - can start immediately.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or requestready-for-agentFully specified and ready for an AFK agent

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions