Vocoding
Vocoding Docs
Getting Started

What is Vocoding?

Learn what Vocoding does and get started in 5 minutes.

Vocoding is a voice-first AI workspace — a desktop app that transforms spoken instructions into structured outputs using specialist agents, profiles, and context-aware optimization.

It is NOT just a transcription tool. Vocoding understands your project context, selects the right optimization profile and agent, and delivers structured prompts ready to paste.

The Core Flow

[Your Voice] → [Local Transcription] → [LLM Optimization] → [Structured Prompt] → [Clipboard]
     ↓                ↓                      ↓                     ↓
  Speak naturally   Whisper (100% local)   Cloud optional    Auto-copied, ready to paste

Time from voice to clipboard: < 5 seconds

Quick Start (5 Minutes)

1. Launch Vocoding

The app starts minimized to the menu bar (macOS) or system tray (Windows). Click the icon to open the main window, or use the global hotkey.

2. First-Run Setup

On first launch, Vocoding shows an onboarding wizard that guides you through:

  • Downloading a Whisper model for local transcription
  • Choosing your preferred operating mode
  • Configuring an LLM provider (optional)

3. Press the Global Hotkey

PlatformDefault Hotkey
macOSOption + Space
WindowsAlt + Space

4. Speak Your Idea

Talk naturally. For example:

"Create a React component for a user profile card that shows avatar, name, email, and a follow button. Use TypeScript and Tailwind CSS."

5. Get Your Result

Vocoding will:

  1. Transcribe your voice locally (Whisper — 100% offline)
  2. Optimize with LLM (if configured)
  3. Copy the result to your clipboard automatically

6. Paste Anywhere

Press Cmd+V (macOS) or Ctrl+V (Windows) in your AI tool of choice.

Next Steps