Getting Started
The Three Modes
Understand Transcribe, Optimize, and Chat modes in Vocoding.
Vocoding operates in three modes, accessible via the mode selector in the header bar or with dedicated hotkeys.
1. Transcribe Mode
What it does: Voice to clean text (no LLM processing)
Hotkey: Cmd + Shift + R (macOS) / Ctrl + Shift + R (Windows)
Use when:
- You want clean dictation
- LLM is not configured
- You need your exact words, cleaned up
2. Optimize Mode
What it does: Voice to text to LLM optimization to structured prompt
Hotkey: Cmd + Shift + O (macOS) / Ctrl + Shift + O (Windows)
Use when:
- You want structured, optimized output from your voice input
- Writing code documentation, emails, blog posts, or any professional content
- You need an AI agent to refine and format your spoken ideas
3. Chat Mode
What it does: Conversational AI interaction with your selected agent
Access: Use the main hotkey (Option + Space / Alt + Space) while in the Chat view, or type directly in the composer.
Use when:
- Planning features
- Brainstorming architecture
- Multi-turn conversations with context
Switching Modes
You can switch between modes at any time using:
- The mode selector in the header bar (Transcribe | Optimize | Chat)
- The dedicated hotkeys listed above
- The mode-specific global hotkeys work from any application
How Modes Interact with Profiles
Each mode processes your input differently:
- Transcribe: Bypasses the LLM entirely. Output is your cleaned-up spoken words.
- Optimize: Sends your transcription through the selected optimization profile and agent for structured output.
- Chat: Full conversational mode where the agent maintains context across multiple messages.