Overview
TogetherSTTService provides real-time speech recognition using Together AI’s WebSocket API with OpenAI-compatible speech-to-text endpoints. It supports streaming transcription with interim results and automatic reconnection.
Together AI STT API Reference
Pipecat’s API methods for Together AI STT
Example Implementation
Complete transcription example
Together AI Documentation
Official Together AI Realtime API documentation
Together AI Platform
Access models and manage API keys
Installation
To use Together AI STT services, install the required dependencies:Prerequisites
Together AI Account Setup
Before using Together AI STT services, you need:- Together AI Account: Sign up at Together AI
- API Key: Generate an API key from your account dashboard
- Model Selection: Choose from available transcription models
Required Environment Variables
TOGETHER_API_KEY: Your Together AI API key for authentication
Configuration
Together AI API key for authentication.
Audio sample rate in Hz. When
None, uses the pipeline’s configured sample
rate.WebSocket base URL for Together AI API.
Runtime-configurable settings. See Settings below.
P99 latency from speech end to final transcript in seconds. Override for your
deployment. See
https://github.com/pipecat-ai/stt-benchmark.
Settings
Runtime-configurable settings passed via thesettings constructor argument using TogetherSTTService.Settings(...). These can be updated mid-conversation with STTUpdateSettingsFrame. See Service Settings for details.
| Parameter | Type | Default | Description |
|---|---|---|---|
model | str | "openai/whisper-large-v3" | Model identifier. (Inherited.) |
language | Language | str | Language.EN | Language for transcription. (Inherited) |
Usage
Basic Setup
With Custom Settings
In a Voice Pipeline
Notes
- Together AI’s STT service uses an OpenAI-compatible WebSocket protocol for real-time transcription.
- The service automatically handles reconnection on connection errors.
- Transcription is committed when
VADUserStoppedSpeakingFrameis received.