Voxtral Mini TTS is Mistral's text-to-speech model featuring zero-shot voice cloning and multilingual support. It converts text input into natural-sounding audio output.
Modalities
Price
$16per 1M characters
Context
4K
Weekly Rank
#405on OpenRouter
Voxtral Mini TTS is Mistral's text-to-speech model featuring zero-shot voice cloning and multilingual support. It converts text input into natural-sounding audio output.
Modalities
Price
$16per 1M characters
Context
4K
Weekly Rank
#405on OpenRouter
OpenRouter provides a text-to-speech API that converts text into natural-sounding audio. Send text and a voice selection, and receive raw audio bytes in your chosen format.
The response is a raw audio stream (not JSON). The generation ID is returned in the X-Generation-Id response header for tracking.
For information about using third-party SDKs and frameworks with OpenRouter, please see our frameworks documentation.
See the Request docs for all possible fields, and Parameters for explanations of specific sampling parameters.