Skip to main content

Voices and languages

SignalWire's cloud platform integrates with leading third-party text-to-speech (TTS) providers. This guide describes supported engines, voices, and languages. Refer to each provider's documentation for up-to-date model details and service information.

Compare providers and models​

SignalWire's TTS providers offer a wide range of voice engines optimized for various applications. Select a provider, model, and voice according to the following considerations:

Cost: When cost-efficiency is the top priority, select a Standard-tier voice from Google Cloud or Amazon Polly. Review our pricing information to learn more.

Language support: Amazon Polly, ElevenLabs, Google Cloud, and OpenAI offer a wide range of supported languages. In addition, all ElevenLabs and OpenAI voices are fully multilingual.

Model quality and realism: All supported providers offer high-quality engines: Google Cloud's WaveNet and Neural2, Amazon Polly Neural, ElevenLabs' Multilingual v2, and Deepgram's Aura are all optimized for voice quality.

SSML support: Google Cloud and Amazon Polly support SSML (Speech Synthesis Markup Language) as a string wrapped in <speak> tags. Consult Google Cloud's SSML docs for details. Refer to the Amazon Polly docs for more information on using SSML and supported SSML tags.

Use voice identifier strings​

Compose voice identifier strings using the <engine>.<voice id> format.

First, select your engine using the gcloud, polly, elevenlabs, or deepgram identifier. Append a period (.), followed by the specific voice ID from the TTS provider.

Case insensitivity

Voice identifier strings are case insensitive. For example, gcloud.en-US-Neural2-A, gcloud.en-us-neural2-a, and GCLOUD.EN-US-NEURAL2-A are equivalent.

For detailed instructions for each provider, consult the voice ID references linked in the Usage column of the below table.

TTS providerEngine codeSample voice ID stringUsage
Amazon Pollypollypolly.Joanna-NeuralReference
Cartesiacartesiacartesia.a167e0f3-df7e-4d52-a9c3-f949145efdabReference
Deepgramdeepgramdeepgram.aura-asteria-enReference
ElevenLabselevenlabselevenlabs.thomasReference
Google Cloudgcloudgcloud.en-US-Casual-KReference
Microsoft Azureazureen-US-AvaNeuralReference
OpenAIopenaiopenai.alloyReference
Rimerimerime.luna:arcanaReference


Pricing​

Voices are priced according to model in three tiers. Consult our Voice API Pricing for up-to-date pricing information.

Standard

  • Google Cloud Standard
  • Amazon Polly Standard

Premium

  • Google Cloud Neural2, WaveNet, and Journey
  • Amazon Polly Neural
  • Deepgram Aura

ElevenLabs voices have their own tier.