Integrate InterScribe with OBS, vMix, and hardware encoders for advanced setups.
If you're using professional production tools like OBS, vMix, or dedicated hardware encoders, InterScribe allows you to ingest high-quality audio using RTMP, SRT, or WHIP. This method is best suited for:
InterScribe will receive the audio, generate live captions, translations, and (if enabled) AI voice interpretation for attendees.
⚠️ Important: Broadcast protocols introduce latency (typically 3–10 seconds). These methods are not recommended for in-person audiences or interpreters. For low-latency needs, use the Desktop Agent or Web Agent.
💡 You must select an AV Channel to receive the audio. This connects the session to the appropriate incoming feed.
In OBS, vMix, or your hardware encoder:
Ensure you're sending only the speaker’s mic(s) — avoid music or ambient audio.
Use the Streamer Dashboard or Monitor View to verify:
Adjust volume, glossary, or language options as needed
Tip | Why It Matters |
---|---|
Send clean speech only | Avoid music or noise — improves transcription and translation accuracy |
Use an AUX mix bus | Isolate the speaker mic(s) without including ambient or background audio |
Test delay | RTMP/SRT/WHIP have built-in delays. Test alignment between speech and output |
Hardwire your encoder | Use Ethernet for stable audio streaming |
Backup with Desktop Agent | Run a Desktop Agent on the same AV Channel for failover redundancy |
🧪 Pro Tip: If your production includes video, InterScribe can sync captions with embedded YouTube or Vimeo players by adding a configurable delay. The protocol delay comes from the encoder and cannot be removed.
Yes — InterScribe will ingest the audio, and ignore the video for processing. If your session includes an embedded video player (e.g. YouTube or Vimeo), attendees can watch it while receiving synchronized captions and voice interpretation.
No. While broadcast protocols are ideal for livestream workflows, they are not suitable for real-time interpretation or in-person events due to their inherent delay. Use the Desktop Agent for:
Limitation | Notes |
---|---|
Latency (3–10 seconds) | Caused by the streaming protocol itself, not InterScribe |
No real-time fallback | If your RTMP/SRT stream drops, the session stops unless a backup agent is used |
Complex setup | Requires encoder configuration, AV channel mapping, and ingress management |
No multi-language input | All language routing must happen before encoding — only one audio feed accepted |
We respect your privacy. We respect your privacy.
TLDR: We use cookies for language selection, theme, and analytics. Learn more. TLDR: We use cookies for language selection, theme, and analytics. Learn more