Feature request: real-time streaming text-to-speech as responses are generated #67554

@bastionshieldtechnologies

Problem Statement

I use voice/dictation to interact with Claude Code hands-free. Today the only option for spoken output is OS-level read-aloud (e.g. macOS Spoken Content), which speaks a message only after it has fully completed. During long agentic turns this means long silences followed by a wall of speech, and it often ends up reading stale (previous) messages. There is no way to hear Claude 'thinking out loud' in real time as it writes text and runs tools.

Proposed Solution

Stream text-to-speech incrementally as the assistant's text output is generated (sentence-by-sentence or chunk-by-chunk), so the user hears the response in real time, including brief status updates between tool calls. Ideally:

  • A toggle (e.g. /voice or a setting) to enable spoken output
  • Speak assistant text as it streams, not after turn completion
  • Skip or summarize tool output; speak only assistant prose
  • Available in CLI and desktop app
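
To illustrate the sentence-by-sentence idea, here is a minimal sketch of the buffering step, not a proposed implementation. It assumes the assistant stream can be seen as a sequence of events, each a dict with a `type` of `"text"` or something else (e.g. tool output); that event shape and the names `SentenceChunker` and `speakable_sentences` are hypothetical and do not reflect Claude Code internals. Each yielded sentence would be handed to whatever TTS engine is in use.

```python
import re
from typing import Dict, Iterable, Iterator, List

# Naive sentence boundary: '.', '!' or '?' followed by whitespace.
# This will also split after abbreviations such as "e.g. "; illustrative only.
_BOUNDARY = re.compile(r"(?<=[.!?])\s+")


class SentenceChunker:
    """Buffers streamed text deltas and releases only complete sentences."""

    def __init__(self) -> None:
        self._buffer = ""

    def feed(self, delta: str) -> List[str]:
        """Add a text delta; return any sentences that are now complete."""
        self._buffer += delta
        parts = _BOUNDARY.split(self._buffer)
        # The last part may still be mid-sentence, so keep it buffered.
        self._buffer = parts.pop()
        return [p.strip() for p in parts if p.strip()]

    def flush(self) -> List[str]:
        """Return whatever is left in the buffer, e.g. at a tool call or end of turn."""
        rest, self._buffer = self._buffer.strip(), ""
        return [rest] if rest else []


def speakable_sentences(events: Iterable[Dict[str, str]]) -> Iterator[str]:
    """Yield assistant prose sentence by sentence, skipping non-text events.

    The event shape ({"type": ..., "text": ...}) is an assumption for this sketch.
    """
    chunker = SentenceChunker()
    for event in events:
        if event.get("type") == "text":
            yield from chunker.feed(event["text"])
        else:
            # Non-prose event (e.g. tool output): speak pending prose, skip the rest.
            yield from chunker.flush()
    yield from chunker.flush()
```

Flushing at each non-text event is one possible design choice: it lets a short status line such as "Running the tests now" be spoken before the tool runs instead of waiting for the next sentence boundary.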

Alternatives

  • OS read-aloud (laggy, reads whole transcript, no streaming)
  • Claude mobile app voice mode (not available for Claude Code workflows)

Additional Context

This would make long-running agentic sessions usable hands-free: the user could follow progress by ear while doing other things.
