Problem Statement
templates/commands/clarify.md and templates/commands/checklist.md each run their own interactive questioning loop and instruct the agent to render every question as a Markdown table with A–E options, asking the user to reply with a letter:
/clarify — up to 5 questions, | Option | Description | table, with a "Recommended" row prefixed by reasoning.
/checklist — its own independent 3-question loop (up to 5) using | Option | Candidate | Why It Matters |.
The same templates ship to every supported agent. Inside Claude Code specifically, this causes friction:
UX inconsistency. Claude Code exposes a native AskUserQuestion tool that renders structured pickers with labels, descriptions, keyboard navigation, and multi-select support. Every other interactive flow in that host uses it; spec-kit is the one that asks users to type A/B/C.
Ambiguous parsing. Free-text replies like 2, "the second one", or "recommended" have to be interpreted by the model. Most of the time it works, but when the "Recommended" row reorders options or a user paraphrases, it occasionally misroutes.
Lost structure. Option labels, descriptions, the Recommended — <reasoning> callout, and the optional free-form short-answer escape hatch all have to be re-encoded in prose each question. A structured tool call would carry them natively.
Other agents without a native structured-question tool should keep the current Markdown-table behavior — no regression.
A note on scope. No command internally invokes /clarify — /specify, /plan, and /tasks emit [NEEDS CLARIFICATION] markers and delegate to /speckit.clarify via handoffs, which only suggest the next command to the user. Fixing /clarify in isolation would still leave /checklist's parallel questioning loop untouched, which is why both commands belong in the same fix.
Proposed Solution
Apply the transform at skill-generation time in ClaudeIntegration, matching the pattern already used for argument-hint, user-invocable, and disable-model-invocation injection in src/specify_cli/integrations/claude/__init__.py:
Mark the questioning sections as transformable in templates/commands/clarify.md and templates/commands/checklist.md. Keep the current numbered-list instructions as the default body; add explicit begin/end markers (e.g., an HTML-comment fence like <!-- speckit:question-render:begin --> ... <!-- speckit:question-render:end -->) around the question-rendering block in each template so a post-processor can target it surgically without touching the surrounding logic (question selection, 5-question cap, spec-file writes, validation pass, etc.).
Extend ClaudeIntegration.setup() (or a helper it calls) so that when generating speckit-clarify/SKILL.md and speckit-checklist/SKILL.md, the fenced block is replaced with instructions that tell Claude Code to invoke AskUserQuestion with a structured payload:
question: the clarification/checklist question text
options[]: {label, description}, with the recommended option placed first and its description prefixed Recommended — <reasoning>.
multiSelect: false
A final synthetic "Provide my own short answer (≤10 words)" option preserves the current free-form escape hatch.
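As a concrete illustration, the replacement instructions would have Claude Code assemble a payload along these lines (a hypothetical sketch: the question text and option wording are invented, and the exact tool-invocation mechanics belong to the host):

```python
# Hypothetical AskUserQuestion payload for one /clarify question.
# The recommended option is placed first with its description prefixed
# "Recommended — <reasoning>"; the last option is the free-form escape hatch.
payload = {
    "question": "How should the system handle expired user sessions?",
    "options": [
        {
            "label": "Force re-authentication",
            "description": "Recommended — matches the security posture already in the spec",
        },
        {
            "label": "Silent token refresh",
            "description": "Extends the session without user interaction",
        },
        {
            "label": "Provide my own short answer (≤10 words)",
            "description": "Free-form escape hatch for an unlisted answer",
        },
    ],
    "multiSelect": False,
}
```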
Preserve all downstream behavior. For /clarify: the 5-question cap, category coverage map, incremental spec-file writes after each accepted answer, and the post-questioning validation pass. For /checklist: the 3-initial / up-to-5-total question cap and the Q1–Q5 labeling. Only the rendering of each question changes in either command.
Normalize the two table schemas. /clarify uses Option | Description and /checklist uses Option | Candidate | Why It Matters. The post-processor should map both to AskUserQuestion's {label, description} shape — Candidate becomes the option label, and Why It Matters goes into description. Document this mapping inline in the integration code so the two templates can diverge further in the future without silently breaking the transform.
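A minimal sketch of that normalization, assuming each row's cells arrive stripped, left to right (the function name, dict layout, and /clarify's cell indices are illustrative assumptions; only the /checklist mapping is pinned down by the proposal):

```python
# Positional mapping from each command's table schema to AskUserQuestion's
# {label, description} option shape. Header names map straight across for
# /clarify; /checklist's extra column shifts the indices (Candidate becomes
# the label, Why It Matters becomes the description, per the proposal).
SCHEMA = {
    "clarify": {"label": 0, "description": 1},    # Option | Description
    "checklist": {"label": 1, "description": 2},  # Option | Candidate | Why It Matters
}

def option_from_row(cells: list[str], command: str) -> dict[str, str]:
    # cells: the cell contents of one Markdown-table row, left to right.
    idx = SCHEMA[command]
    return {
        "label": cells[idx["label"]].strip(),
        "description": cells[idx["description"]].strip(),
    }
```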
No changes for other integrations. MarkdownIntegration, TomlIntegration, and other SkillsIntegration subclasses continue shipping the templates unchanged — the fence is plain Markdown that renders correctly as-is. Snapshot tests should assert that every non-Claude integration's generated output is byte-identical to main.
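The fence-targeting transform itself could be as small as one regex substitution at skill-generation time. This is a sketch under stated assumptions: the marker names follow the HTML-comment example above, and the helper name and instruction wording are invented, not the actual implementation.

```python
import re

# Matches everything between the begin/end markers, inclusive.
QUESTION_RENDER_FENCE = re.compile(
    r"<!-- speckit:question-render:begin -->.*?<!-- speckit:question-render:end -->",
    re.DOTALL,
)

# Replacement body shipped only in Claude-specific skill output.
ASK_USER_QUESTION_INSTRUCTIONS = (
    "Present each question by invoking the AskUserQuestion tool with "
    "{question, options[], multiSelect: false}; place the recommended option "
    "first and append the free-form escape-hatch option last."
)

def render_for_claude(skill_md: str) -> str:
    # Swap the fenced Markdown-table instructions for AskUserQuestion
    # instructions; everything outside the fence stays byte-for-byte intact.
    # (A callable avoids backslash-escape surprises in the replacement text.)
    return QUESTION_RENDER_FENCE.sub(
        lambda _match: ASK_USER_QUESTION_INSTRUCTIONS, skill_md
    )
```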
Out of scope for this issue
Changing question-selection logic, caps, or taxonomy in either command.
/implement's "proceed anyway? (yes/no)" and /analyze's remediation-approval prompt. These are binary confirms — the same fencing pattern would cover them trivially once it exists, but the UX win is small and they can land in a follow-up PR to keep this one focused.
/specify, /plan, /tasks, /constitution, /taskstoissues. The first three emit [NEEDS CLARIFICATION] markers rather than asking interactively; /constitution is mostly free-form; /taskstoissues doesn't prompt.
Any changes to the extension-hooks system (.specify/extensions.yml).
Why this shape
Matches spec-kit's existing architecture (see AGENTS.md): per-agent behavior lives in src/specify_cli/integrations/<key>/, templates stay agent-agnostic, and post-processing on skill generation is already an established pattern.
Zero risk to non-Claude integrations — the fence is invisible to them.
Preserves the "Recommended option + reasoning" UX in /clarify and the Candidate / Why It Matters semantics in /checklist; both move from table rows into structured AskUserQuestion option fields.
No new runtime abstraction, no new CLI flag, no schema migration.
Alternatives Considered
Do nothing — keep the Markdown table rendering. The current flow works on every agent and is low-maintenance. Rejected because inside Claude Code, it's a visibly worse UX than every other interactive flow in the host (structured pickers via AskUserQuestion), and the ambiguous-parsing failure mode is already observable when users paraphrase answers or the "Recommended" row shifts option ordering.
Fix /clarify only, leave /checklist for later. Smallest diff, fastest to land. Rejected because /checklist has a parallel, independent questioning loop with essentially the same UX problem, and the fencing + post-processor infrastructure is the same shape for both — splitting them means building the mechanism twice or leaving an obvious gap between the two PRs.
Runtime abstraction — add an ask_user() helper in the CLI layer that integrations plug into. More flexible long-term. Rejected because spec-kit commands are executed by the agent, not by the Python CLI; there's no runtime point at which the CLI could intercept a question to re-render it. The question-rendering instructions live in the template text the agent reads, so the only place to transform them is at template/skill generation time, which is exactly where ClaudeIntegration already operates.
Fork the templates per-agent (clarify.claude.md, clarify.default.md, etc.). Rejected because it duplicates a ~200-line template body to change one rendering block, and it fights spec-kit's "one template per command, per-agent behavior in integration subpackages" architecture described in AGENTS.md.
Make the template itself conditional — something like "if your host supports a structured-question tool, use it; otherwise render a table." Rejected because it pushes host-capability detection onto every agent, most of which would interpret it inconsistently, and it makes the template body harder to read for the common case.
Add a user-facing CLI flag (e.g. specify init --claude-native-questions). Rejected as overkill — this is a rendering choice with no semantic impact on the generated spec, so it shouldn't need user configuration. The right default for Claude Code is "use Claude Code's native tool"; anything else is a regression.
The proposed solution is the only approach that (a) fits spec-kit's existing integration-layer pattern, (b) keeps the template single-sourced, (c) has zero risk to other agents, and (d) doesn't introduce new user-visible configuration.
Component
Agent integrations (command files, workflows)
AI Agent (if applicable)
Claude Code
Use Cases
User running /speckit.clarify on a freshly-specified feature inside Claude Code.
Today: sees an ASCII table with a "Recommended" row, reads A–E descriptions, types A (or "yes"), repeats up to 5 times.
After: sees a native Claude Code picker with the recommended option pre-highlighted, selects with arrow keys / enter, no free-text disambiguation.
User running /speckit.checklist to generate a UI-consistency or test-coverage checklist.
Today: answers 3 (up to 5) clarifying questions rendered as an Option | Candidate | Why It Matters table, each requiring a letter reply.
After: the same 3–5 questions rendered as native pickers, with Why It Matters shown as the option description on hover/selection.
User pastes a one-word free-form answer (e.g., "OAuth2") instead of picking a listed option.
Today: works, but relies on the model parsing "OAuth2" against the (≤5 words) constraint described in prose.
After: the AskUserQuestion payload includes a dedicated "Provide my own short answer (≤10 words)" option that, when selected, prompts for a free-text response — the escape hatch is preserved but is now a first-class, unambiguous path.
User running the same command on a non-Claude agent (Copilot, Gemini, Windsurf, Cursor, etc.).
Today: sees the Markdown table.
After: still sees the Markdown table, byte-identical. No regression, no divergence in behavior across agents beyond the rendering layer.
Maintainer reviewing a PR that touches /clarify or /checklist logic.
Today: edits one template, same behavior everywhere.
After: edits one template; the fenced rendering block is transformed at skill-generation time for Claude only, so question-selection logic, caps, taxonomy, and spec-file writes remain single-sourced and agent-agnostic.
Acceptance Criteria
templates/commands/clarify.md contains a fenced rendering block (begin/end markers) around the question-presentation instructions in the sequential questioning loop. The block's default contents are the current Markdown-table instructions, unchanged in wording.
templates/commands/checklist.md contains an equivalent fenced rendering block around its step-2 clarification-question rendering. Default contents unchanged.
ClaudeIntegration.setup() in src/specify_cli/integrations/claude/__init__.py post-processes the generated speckit-clarify/SKILL.md and speckit-checklist/SKILL.md files, replacing the fenced block with instructions that direct Claude Code to invoke AskUserQuestion with a structured payload (question, options[], multiSelect: false).
The post-processor maps /clarify's Option | Description rows and /checklist's Option | Candidate | Why It Matters rows to AskUserQuestion's {label, description} shape, with the mapping documented inline in src/specify_cli/integrations/claude/__init__.py.
The recommended option (from /clarify's "Recommended" row) is placed first in the options[] array and its description is prefixed Recommended — <reasoning>.
A final synthetic "Provide my own short answer (≤10 words)" option is appended to preserve the free-form escape hatch.
All downstream behavior is preserved: the 5-question cap (/clarify), 3-initial/5-total cap (/checklist), category coverage map, incremental spec-file writes after each accepted answer, the post-questioning validation pass, and the Q1–Q5 labeling in /checklist.
Snapshot tests under tests/ assert that every non-Claude integration's generated output for clarify and checklist is byte-identical to main. No regression anywhere except the Claude-specific skill files. Relevant integration subpackages: src/specify_cli/integrations/.
Snapshot tests assert that the Claude-specific output contains AskUserQuestion invocation instructions and no longer contains the | Option | Description | table for /clarify (and the corresponding | Option | Candidate | Why It Matters | table for /checklist).
Manual end-to-end verification through /speckit.clarify and /speckit.checklist inside Claude Code confirms: questions render as native pickers, the recommended option is visibly pre-highlighted, free-form answers still work, and the resulting spec.md / checklist file is equivalent to what the current flow produces.
No new user-facing CLI flags, no schema migration, no changes to .specify/extensions.yml, and no changes to any other command template under templates/commands/.
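As a sketch of how the snapshot criteria above might be asserted (helper names are invented and the real tests under tests/ would read the generated files rather than take bytes and strings directly):

```python
def assert_snapshot_unchanged(generated: bytes, snapshot: bytes) -> None:
    # Non-Claude integration output must be byte-identical to the committed
    # snapshot; in a real test both arguments would come from Path.read_bytes().
    assert generated == snapshot, "non-Claude integration output diverged from main"

def assert_claude_skill_transformed(skill_md: str) -> None:
    # Claude-specific output gains AskUserQuestion instructions and must no
    # longer contain either command's Markdown option table.
    assert "AskUserQuestion" in skill_md
    assert "| Option | Description |" not in skill_md
    assert "| Option | Candidate | Why It Matters |" not in skill_md
```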
Additional Context
Claude Code's AskUserQuestion tool is the native structured-question mechanism used across Claude Code's own features and by user-authored skills. It accepts a question string, an options[] array of {label, description}, and a multiSelect boolean. See the Claude Code documentation for host context.
spec-kit's integration architecture is documented in AGENTS.md. Each agent has a subpackage under src/specify_cli/integrations/<key>/ with setup-time hooks; ClaudeIntegration already post-processes generated skill files to inject user-invocable, disable-model-invocation, and argument-hint — the proposed change extends the same pattern to the question-rendering block.
templates/commands/clarify.md step 4 is the exact block that would be fenced (Sequential questioning loop → "For multiple-choice questions" → table render).
templates/commands/checklist.md step 2 ("Clarify intent (dynamic)") contains the parallel questioning loop and the Option | Candidate | Why It Matters table.
No other command template under templates/commands/ runs a structured multi-option picker loop; the remaining interactive prompts are binary yes/no confirms in /implement and /analyze, which are out of scope for this issue.
No command internally invokes /clarify — /specify, /plan, and /tasks use [NEEDS CLARIFICATION] markers and handoffs that only surface a suggested next command.
Environment where the issue is visible: Claude Code (CLI, desktop app, VS Code / JetBrains extensions) running spec-kit skills installed via specify init with the claude integration selected. Not reproducible on other agents — by design.
No screenshots attached — the rendering difference is textual (Markdown table vs. native picker) and I'd rather not embed images of an unmerged design. Happy to add side-by-side captures before/after if a maintainer wants them for the PR.