Most tools do one thing well. LLT does the whole pipeline — translate, transcribe and dictate, 100% local on Apple Silicon (Whisper · Parakeet · Mistral), with optional on-device LLM sentence-polishing. Then send it live to OBS, VRChat or vMix, in your own design. The free iOS companion turns your iPhone into a wireless mic — and can even start LLT on your Mac across the network; in the field without Wi-Fi, switch to direct cloud engines (Pro). GDPR-compliant, no extra costs, all in one system.
Speak straight into any app — text is typed where your cursor is (⌘⌥D)
Floating overlay shows original text and translation in real time
Full session transcripts saved as TXT with timestamps
Export subtitles in SRT format for video post-production
Hear translations read aloud via macOS system voice
Live subtitles to OBS (browser source), VRChat (OSC) and vMix
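The SRT export in the feature list above can be illustrated with a minimal sketch. It is not LLT's actual exporter: the `Segment` shape and the function names are assumptions made for this example. Only the SRT layout itself (cue number, `HH:MM:SS,mmm --> HH:MM:SS,mmm`, text, blank line) follows the standard format.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """One caption: start/end in seconds plus the text to show (hypothetical shape)."""
    start: float
    end: float
    text: str


def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as the SRT timestamp HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, millis = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def to_srt(segments: list[Segment]) -> str:
    """Render segments as numbered SRT cues separated by blank lines."""
    cues = []
    for index, seg in enumerate(segments, start=1):
        cues.append(
            f"{index}\n"
            f"{srt_timestamp(seg.start)} --> {srt_timestamp(seg.end)}\n"
            f"{seg.text.strip()}\n"
        )
    return "\n".join(cues)
```

Calling `to_srt([Segment(0, 3, "Hello"), Segment(3, 6, "Hallo")])` yields two cues that a video editor can import as a subtitle track.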
Most speech tools do one thing well and stop there. LLT takes the same live translation and puts it where your production actually needs it — overlay, stream, VR or broadcast — in your own design.
Live captions straight into the VRChat chatbox via OSC — original, translation or both. Like Kikitan, built in.
A transparent subtitle page on localhost — drop it into OBS, Streamlabs, XSplit or vMix as a Browser Source. Live, transparent, styled like your overlay.
Send translations to a vMix Title input via the Web API. Style the lower third however you want in vMix; a free .xaml template is included.
All of it 100% local — Whisper · Parakeet · Mistral on Apple Silicon, with optional on-device LLM sentence-polishing. GDPR-compliant, no extra costs, one system.
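For readers curious what "captions into the VRChat chatbox via OSC" involves on the wire, here is a rough sketch using only the Python standard library. It assumes VRChat's documented `/chatbox/input` address (string, send-immediately flag, notification flag), its default OSC input port 9000, and a 144-character chatbox limit. The function names are made up for this example and do not describe how LLT implements it.

```python
import socket


def _osc_string(value: str) -> bytes:
    """Encode an OSC string: UTF-8 bytes, NUL-terminated, padded to a multiple of 4."""
    data = value.encode("utf-8") + b"\x00"
    return data + b"\x00" * (-len(data) % 4)


def chatbox_message(text: str, send_now: bool = True, notify: bool = False) -> bytes:
    """Build an OSC packet for /chatbox/input (string, bool, bool)."""
    text = text[:144]  # assumed chatbox limit of 144 characters
    # OSC booleans live in the type tag string (T/F) and carry no argument data.
    tags = ",s" + ("T" if send_now else "F") + ("T" if notify else "F")
    return _osc_string("/chatbox/input") + _osc_string(tags) + _osc_string(text)


def send_caption(text: str, host: str = "127.0.0.1", port: int = 9000) -> None:
    """Send one caption over UDP; port 9000 is assumed to be VRChat's OSC input port."""
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.sendto(chatbox_message(text), (host, port))
```

With VRChat running and OSC enabled, `send_caption("Hello / Hallo")` would show the original and translation together in the chatbox. Because UDP is fire-and-forget, a dropped caption is simply replaced by the next one.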
Choose your audio source and LLT captures it. WebRTC, Teams, a mic on a conference table — it doesn't matter. If your Mac can hear it, LLT can translate it.
Any hardware mic, USB mic, or audio interface. Select your device from the dropdown. Put a mic on a table and translate a meeting room in real time.
Capture audio from one specific app — Zoom, Teams, Chrome, Discord, FaceTime, or any other app. Only that app's audio gets translated, nothing else.
Capture all system audio at once. Everything playing on your Mac gets transcribed and translated. Useful as a catch-all for multi-source scenarios.
The backend starts with the app and runs a Python server on localhost. Audio is captured, chunked, sent via WebSocket, transcribed by Whisper, translated by Mistral, and displayed as an overlay — all without leaving your Mac.
Mic, app, or system audio → 3-second chunks via WebSocket to local backend
Speech-to-text with auto language detection. 5 models from tiny (~75 MB) to large (~3 GB) — choose in Settings > Whisper. Runs locally on Apple Silicon (GPU + CPU).
Mistral 7B Instruct (4-bit MLX) translates transcribed text to target language
Floating overlay with original + translation, live subtitles to OBS (browser source), VRChat (OSC) and vMix, optional text-to-speech, and full transcript save as TXT or SRT subtitles.
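The first pipeline step, cutting captured audio into 3-second chunks, can be sketched as follows. The audio format (16 kHz, mono, 16-bit PCM) is an assumption for illustration, since the page does not state what LLT uses internally. Only the 3-second chunk length comes from the description above.

```python
from typing import Iterator

SAMPLE_RATE = 16_000   # assumed: 16 kHz mono, a common speech-to-text input format
BYTES_PER_SAMPLE = 2   # assumed: 16-bit signed PCM
CHUNK_SECONDS = 3      # chunk length named in the pipeline description


def chunk_pcm(pcm: bytes, seconds: int = CHUNK_SECONDS) -> Iterator[bytes]:
    """Yield consecutive fixed-length chunks; the final chunk may be shorter."""
    size = SAMPLE_RATE * BYTES_PER_SAMPLE * seconds
    for offset in range(0, len(pcm), size):
        yield pcm[offset:offset + size]
```

In a live setup each yielded chunk would go out as one binary WebSocket frame to the local backend. Seven seconds of audio therefore produce two full chunks and one shorter trailing chunk.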
LLT is a menu bar app — no dock icon, no window clutter. Left-click opens the control panel, right-click shows status and quit. The backend starts with the app (or on demand), but translation only begins when you press Start.
Backend auto-starts with the app (configurable) and auto-stops on quit. Loads your selected Whisper model + Mistral on startup. Status visible in the control panel and menu bar icon.
Backend running ≠ translating. You manually press Start when you need translation. This keeps resource usage minimal until you actually need it — no background processing when idle.
New in 1.4: turn LLT into a dictation tool. Speak and the recognized — optionally translated — text is typed straight into whatever app has focus: Mail, Word, Notes, your browser, a code editor. Trigger it globally with ⌘⌥D or from the menu-bar menu.
Each engine pairs a speech-to-text source with a translation provider. Some run fully local, some call the cloud directly from the app, and some stream through the local Python backend. The matrix shows which engines need the backend running.
| Engine | STT | Translation | Backend? |
|---|---|---|---|
| Whisper | Whisper (local) | Mistral (local) | yes |
| Google REST | Google STT (direct) | Google Translate (direct) | no |
| Google gRPC | Google gRPC Streaming (via backend) | Google Translate (direct) | yes |
| Deepgram | Deepgram Streaming (direct) | Google Translate (direct) | no |
| AssemblyAI | AssemblyAI Streaming (direct) | Google Translate (direct) | no |
| Azure | Azure STT + Translation (direct, one call) | Azure | no |
| Vosk | Vosk (local, via backend) | Mistral (local) | yes |
| Parakeet (default) | Parakeet (local, via backend) | Mistral (local) | yes |
Parakeet (NVIDIA TDT v3) is now the default — a very fast local STT engine with continuous streaming; Whisper stays the multilingual, language-pinnable alternative. The local Mistral LLM handles translation — and can optionally polish transcribed sentences (grammar and flow) before they go out. All on-device, GDPR-compliant, at no extra cost.
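The engine matrix maps naturally onto a small lookup table. The sketch below copies the values from the matrix. The dictionary keys, the `Engine` class and the helper functions are illustrative names, not part of LLT.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Engine:
    stt: str
    translation: str
    needs_backend: bool


# Values copied from the engine matrix; the keys are illustrative identifiers.
ENGINES = {
    "whisper": Engine("Whisper (local)", "Mistral (local)", True),
    "google_rest": Engine("Google STT (direct)", "Google Translate (direct)", False),
    "google_grpc": Engine("Google gRPC Streaming (via backend)", "Google Translate (direct)", True),
    "deepgram": Engine("Deepgram Streaming (direct)", "Google Translate (direct)", False),
    "assemblyai": Engine("AssemblyAI Streaming (direct)", "Google Translate (direct)", False),
    "azure": Engine("Azure STT + Translation (direct, one call)", "Azure", False),
    "vosk": Engine("Vosk (local, via backend)", "Mistral (local)", True),
    "parakeet": Engine("Parakeet (local, via backend)", "Mistral (local)", True),
}

DEFAULT_ENGINE = "parakeet"


def can_run(name: str, backend_running: bool) -> bool:
    """An engine is usable if it needs no backend or the backend is up."""
    return backend_running or not ENGINES[name].needs_backend


def engines_without_backend() -> list[str]:
    """Engines that talk to their cloud provider directly from the app."""
    return sorted(name for name, e in ENGINES.items() if not e.needs_backend)
```

A check like `can_run(DEFAULT_ENGINE, backend_running=False)` returns `False`, which matches the matrix: the local engines depend on the Python backend, while the direct cloud engines do not.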
Google gRPC in detail: audio → Python backend → Google gRPC STT (true streaming, no VAD needed) → backend returns the recognized text (skipTranslation: true, so Mistral is skipped) → the app calls Google Translate REST directly from Swift (same API key) → original + translation are displayed.
Requirements: backend running (for gRPC) · google-cloud-speech installed in the Python venv · Service Account JSON · Google API key (for Translate). Restart the backend after enabling.
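The Google gRPC hand-off described above boils down to one branch: if the backend skipped translation, the app translates the text itself. A minimal sketch of that decision follows, written in Python for consistency with the other examples even though the app side is Swift. The message shape (`text`, `translation`, `skipTranslation` keys) and the `cloud_translate` callable are assumptions for illustration. Only the `skipTranslation` flag name comes from the description.

```python
from typing import Callable

# (text, target_language) -> translated text; stands in for a Google Translate call.
Translator = Callable[[str, str], str]


def resolve_translation(result: dict, target: str,
                        cloud_translate: Translator) -> tuple[str, str]:
    """Return (original, translation) for one recognized utterance.

    `result` is a hypothetical backend message:
    {"text": str, "skipTranslation": bool, "translation": str (optional)}.
    """
    original = result["text"]
    if result.get("skipTranslation", False):
        # Backend skipped the local LLM, so the app translates via the cloud provider.
        return original, cloud_translate(original, target)
    # Otherwise the backend already translated locally and sent the result along.
    return original, result["translation"]
```

Passing the translator in as a parameter keeps the routing logic testable without network access or API keys.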
Whisper detects the source language automatically. Mistral translates to any of 101 target languages. Set source to "Auto" and just let it figure out what's being spoken.
Afrikaans, Albanian, Amharic, Arabic, Armenian, Assamese, Azerbaijani, Bashkir, Basque, Belarusian, Bengali, Bosnian, Breton, Bulgarian, Burmese, Cantonese, Catalan, Chinese, Croatian, Czech, Danish, Dutch, English, Estonian, Faroese, Finnish, French, Galician, Georgian, German, Greek, Gujarati, Haitian Creole, Hausa, Hawaiian, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Irish, Italian, Japanese, Javanese, Kannada, Kazakh, Khmer, Korean, Lao, Latin, Latvian, Lingala, Lithuanian, Luxembourgish, Macedonian, Malagasy, Malay, Malayalam, Maltese, Māori, Marathi, Mongolian, Nepali, Norwegian, Nynorsk, Occitan, Pashto, Persian, Polish, Portuguese, Punjabi, Romanian, Russian, Sanskrit, Serbian, Shona, Sindhi, Sinhala, Slovak, Slovenian, Somali, Spanish, Sundanese, Swahili, Swedish, Tagalog, Tajik, Tamil, Tatar, Telugu, Thai, Tibetan, Turkish, Turkmen, Ukrainian, Urdu, Uzbek, Vietnamese, Welsh, Yiddish, Yoruba
LLT runs AI models on your Mac. This requires Apple Silicon and enough RAM for the models.
macOS 13 (Ventura) or newer. Apple Silicon (M1, M2, M3, M4 — any variant). 8 GB RAM minimum (base model), 16 GB recommended (medium/large). Whisper models need 1–10 GB RAM depending on size. Python 3.11+ for the backend.
DMG with the app + Install-Backend.command script. The script creates a Python venv, installs Whisper, MLX, Mistral, and all dependencies. Choose your Whisper model during install (tiny to large) — change anytime in Settings > Whisper. One-time setup.
First launch — Gatekeeper: when you run Install-Backend.command you'll likely get a Gatekeeper warning that macOS “could not verify” the developer. That's completely normal — it's an unsigned shell script (only the app itself is notarized), and since macOS Sequoia the old right-click → Open bypass is gone. Just click Done, then go to System Settings → Privacy & Security → Open Anyway and run it again. Nothing's wrong with the build — it's the same step you see with most script-based installers.
Version 1.6.3 — 14 days full use, then license required
Requires an Apple Silicon Mac (M1/M2/M3/M4). 16 GB+ RAM recommended; 8 GB is the minimum, with the base model. Backend installs automatically. Models download on first run (~4 GB).
LLT — Local Live Translator for macOS is part of the Adelvo family of professional media tools.
Professional vMix rundown & automation — timeline, call sheets, Stream Deck export, 20+ languages.
Route app audio, mic, webcam & line-in to 4 stereo outputs with EQ, delay, pan.
Audio compressor for browser tabs — fixes volume differences on social media and video pages.
Whisper + Mistral running locally on Apple Silicon. Just works.