Live streaming models now front and centre

Speak live. See the words arrive.

Low-latency speech-to-text for Mac and iOS, built around streaming providers like Deepgram Nova-3, AssemblyAI Universal-3 Pro, ElevenLabs Scribe v2, Soniox, OpenAI Realtime Whisper, Modulate Velma-2, and Apple on-device recognition.

⬇️ Download for Mac 📱 Join iOS TestFlight Star on GitHub
$0 app download
7 live model options
BYOK with no price markup
Live Streaming

Choose the engine for the moment

JustSpeakToIt is tuned for streaming-first transcription: interim text while you speak, polished finals when the provider supports them, and instant insertion into the app you are already using.

Deepgram ~200ms

Nova-3 Streaming

Fast WebSocket transcription with interim results for the lowest-friction everyday dictation flow.

AssemblyAI ~250ms

Universal-3 Pro Streaming

High-accuracy multilingual streaming with formatted turn finalisation when you want cleaner long-form output.

OpenAI ~250ms

Realtime Whisper

Streaming transcription with built-in noise reduction using your existing OpenAI API key.

ElevenLabs ~200ms

Scribe v2 Streaming

Real-time Scribe transcription for teams already using ElevenLabs speech workflows.

Soniox ~220ms

Real-time Preview

Low-latency multilingual WebSocket STT for fast feedback across more languages.

Modulate ~220ms

Velma-2 Streaming

Multilingual live transcription with diarisation and signal-detection-oriented provider features.

Apple ~50ms

On-device Speech

Private local recognition for instant, free dictation when cloud accuracy is not required.
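As a rough illustration of the trade-off, the sketch below encodes the approximate latencies listed above and picks the fastest engine for either the on-device or the cloud case. The `Engine` type and the `fastest` helper are hypothetical names for this example, not part of the app, and the latency figures are the indicative ones quoted on this page.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Engine:
    provider: str
    model: str
    latency_ms: int  # approximate figure quoted on this page
    on_device: bool


# The seven live model options listed above.
ENGINES = [
    Engine("Deepgram", "Nova-3 Streaming", 200, False),
    Engine("AssemblyAI", "Universal-3 Pro Streaming", 250, False),
    Engine("OpenAI", "Realtime Whisper", 250, False),
    Engine("ElevenLabs", "Scribe v2 Streaming", 200, False),
    Engine("Soniox", "Real-time Preview", 220, False),
    Engine("Modulate", "Velma-2 Streaming", 220, False),
    Engine("Apple", "On-device Speech", 50, True),
]


def fastest(on_device: bool) -> Engine:
    """Return the lowest-latency engine among on-device or cloud options.

    Ties keep the first engine in list order.
    """
    candidates = [e for e in ENGINES if e.on_device == on_device]
    return min(candidates, key=lambda e: e.latency_ms)


print(fastest(on_device=True).provider)   # Apple
print(fastest(on_device=False).provider)  # Deepgram
```

Latency is only one axis. Accuracy, language coverage, and features such as diarisation may matter more for a given task.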

Preview

See it in action

Dashboard

Monitor usage, check permissions, and start recording in one click.

History

Browse transcriptions with audio playback, cost tracking, and session details.

Features

Built around live words, not waiting

A native app that feels immediate. No Electron, no bloat — just fast streaming transcripts, safe key storage, and polished text insertion.

Streaming-First Dictation

Double-tap Fn, hold-to-record, or use a custom shortcut. Watch live text arrive as you speak, then insert the final transcript where your cursor is.

🔒

Private On-Device Fallback

Use Apple's built-in speech recognition for completely private, offline transcription. Your audio never leaves your device.

🎯

Smart Text Insertion

Transcripts are automatically typed into any app using accessibility APIs. Insert at cursor or replace selection—works with any text field.

🧠

AI Post-Processing

Optional Live Polish and post-processing using GPT-4o, Claude, Gemini, or Mistral: removes filler words, fixes punctuation, and polishes your prose.

💬

OpenClaw AI Chat

Have voice-powered conversations with AI. Speak your questions and hear responses read back—complete with conversation history.

🎙️

Hands-Free Mode

Full voice-to-voice conversation loop: record, send, listen to the response, and auto-resume recording. No tapping required.

🔊

Text-to-Speech

Hear responses with natural voices from OpenAI, ElevenLabs, Deepgram, Azure, or Apple's built-in system voices.

📚

Personal Lexicon

Teach the app your jargon, names, and acronyms. Your custom dictionary improves accuracy across all providers.

☁️

iCloud History Sync

Your transcription history syncs seamlessly between Mac and iOS via iCloud. Pick up right where you left off on any device.

How It Works

Three steps to live text

Activate

Double-tap Fn or click the menu bar icon. A subtle HUD appears with your active live model.

Speak

Just talk naturally. Streaming providers return interim text while you are still speaking.

Done

Stop recording and your polished text appears instantly in any app, right at your cursor.
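The interim-then-final flow in these steps can be sketched as a small state holder. The event shape used here (a text string plus an `is_final` flag) is an assumption for illustration: each provider's streaming API has its own message format, and this is not the app's actual implementation.

```python
from dataclasses import dataclass, field


@dataclass
class LiveTranscript:
    """Accumulates streaming results: finals are committed, interims are replaceable."""

    finals: list[str] = field(default_factory=list)
    interim: str = ""

    def on_result(self, text: str, is_final: bool) -> None:
        text = text.strip()
        if is_final:
            # A final result replaces the pending interim text for that segment.
            if text:
                self.finals.append(text)
            self.interim = ""
        else:
            # Each interim result overwrites the previous guess.
            self.interim = text

    def preview(self) -> str:
        """Text for a live preview: committed finals plus the current guess."""
        parts = self.finals + ([self.interim] if self.interim else [])
        return " ".join(parts)

    def committed(self) -> str:
        """Text that is safe to insert once recording stops."""
        return " ".join(self.finals)


t = LiveTranscript()
t.on_result("hello wor", is_final=False)
t.on_result("Hello world.", is_final=True)
t.on_result("how are", is_final=False)
print(t.preview())    # Hello world. how are
print(t.committed())  # Hello world.
```

Keeping interim text separate from committed text means a live preview can update freely while only finalised segments are inserted at the cursor.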

Bring Your Own Streaming Keys

Use your existing API keys from supported live transcription providers. Pay only for what you use—typically $0.36 per hour or less. No subscriptions, no hidden fees, no data selling.

Platforms

Native apps, native performance

Built with SwiftUI for the best experience on every Apple device.

💻

macOS

  • Menu bar app with global hotkeys
  • Direct text insertion via Accessibility
  • Live transcription preview HUD
  • Deepgram, AssemblyAI, OpenAI, ElevenLabs, Soniox, Modulate, and Apple live models
  • Live Polish + AI post-processing
  • OpenClaw AI chat with hands-free mode
  • Install via Homebrew or direct download
  • Requires macOS 14+
📱

iOS (TestFlight Beta)

  • Live Activity with Dynamic Island
  • Apple Speech, Deepgram, ElevenLabs, and OpenAI live transcription
  • OpenClaw AI chat with voice input
  • Audio recording & playback
  • Home Screen widgets
  • iCloud history sync with Mac
  • Requires iOS 17+
FAQ

Questions? We've got answers.

Is my audio data private?

Yes! When using on-device transcription with Apple Speech, your audio never leaves your device. When using cloud providers like Deepgram, AssemblyAI, Soniox, ElevenLabs, OpenAI, or Modulate, audio is sent directly to their APIs using your own API keys—we never see or store your audio.

How much does it actually cost?

The app is free to download. On-device transcription is completely free. Cloud transcription with your own API keys typically costs around $0.006/minute (about $0.36/hour) with low-cost streaming providers such as Deepgram. You pay the provider directly—we don't add any markup.
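The arithmetic behind that figure is easy to check. The sketch below uses the approximate $0.006/minute rate quoted above; actual provider pricing varies, and the `estimate_cost` helper is a hypothetical name for this example.

```python
from decimal import Decimal

# Approximate USD rate quoted above; check your provider's current pricing.
RATE_PER_MINUTE = Decimal("0.006")


def estimate_cost(minutes: int, rate_per_minute: Decimal = RATE_PER_MINUTE) -> Decimal:
    """Estimated provider charge in USD, rounded to cents."""
    return (Decimal(minutes) * rate_per_minute).quantize(Decimal("0.01"))


print(estimate_cost(60))   # 0.36  (one hour)
print(estimate_cost(600))  # 3.60  (ten hours)
```

`Decimal` is used instead of floats so that the cent rounding is exact.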

What's the difference between on-device and cloud transcription?

On-device (Apple Speech) is instant, free, and private, but may be less accurate for specialized vocabulary. Cloud providers like Deepgram, AssemblyAI, Soniox, ElevenLabs, OpenAI, and Modulate offer higher accuracy, low-latency streaming, broader language support, and provider-specific features like diarisation or formatted turns.

Can I use this for meetings?

On Mac, you can capture system audio from meeting apps like Zoom or Teams. On iOS, the app captures audio from your device's microphone—place your phone near the speaker for best results, or use the hands-free conversation mode to interact with AI directly.

How do I set up my API keys?

Open the app's Settings and enter your API keys in the Providers section. Keys are stored securely in your Mac's Keychain. We never see your keys; the app sends them only to the provider's API.

What is OpenClaw?

OpenClaw is the built-in AI chat feature. You can have voice-powered conversations with AI assistants: speak your questions, get spoken responses, and use hands-free mode for a completely voice-driven experience. Conversations are saved locally and can sync via iCloud.

Ready to speak freely?

Download JustSpeakToIt and start turning your voice into text in seconds.

⬇️ Download for Mac 📱 Join iOS TestFlight

Have a feature request or bug report, or want to contribute? Open an issue on GitHub. We'd love to hear from you!