Voices

Choosing a voice, the system voice catalog, and blending voices

A voice determines the speaker's timbre. You select one by its voice id; if you omit it, the model's default voice is used.

System voices

The TTS model ships with a large catalog of system voices spanning many languages (Mandarin, Cantonese, English, Japanese, Korean, Spanish, Portuguese, French, and more) and styles, including character and personality voices.

A voice id looks like male-qn-qingse, English_Trustworthy_Man, or Chinese (Mandarin)_Warm_Girl. Voice ids are case-sensitive and must match exactly.

You can browse and preview the full list in the PlayKit Dashboard voice picker for the TTS model.

{ "voice": "English_Trustworthy_Man" }

If the voice id is not in the catalog, the request is rejected, so copy ids from the dashboard rather than typing them by hand.
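Because ids are case-sensitive and an unknown id causes the request to be rejected, it can help to check the id on the client before sending. The following is a minimal sketch, not part of the PlayKit SDK: KNOWN_VOICE_IDS and voiceField are hypothetical names, and the allowlist is assumed to be a set of ids you copied from the dashboard yourself.

```typescript
// Hypothetical local allowlist: ids copied by hand from the dashboard voice picker.
// It is an assumption of this sketch, not something the API provides.
const KNOWN_VOICE_IDS: ReadonlySet<string> = new Set([
  "male-qn-qingse",
  "English_Trustworthy_Man",
  "Chinese (Mandarin)_Warm_Girl",
]);

// Builds the voice part of a request body.
// With no id, the field is left out so the model's default voice applies.
function voiceField(voiceId?: string): { voice?: string } {
  if (voiceId === undefined) {
    return {};
  }
  // Exact, case-sensitive comparison: no trimming and no lowercasing.
  if (!KNOWN_VOICE_IDS.has(voiceId)) {
    throw new Error(`Unknown voice id: "${voiceId}"`);
  }
  return { voice: voiceId };
}
```

Calling voiceField("English_Trustworthy_Man") yields the same object as the JSON sample above, while voiceField("english_trustworthy_man") throws locally instead of producing a rejected request.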

Voice mixing

You can blend up to 4 voices into a single custom timbre by giving each a weight from 1 to 100. A higher weight means more of that voice in the result. Voice mixing is mutually exclusive with picking a single voice: a request uses either voice or voice_mix, not both.

{
  "voice_mix": [
    { "voice": "female-chengshu", "weight": 70 },
    { "voice": "female-tianmei", "weight": 30 }
  ]
}
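The mixing rules above (at most 4 voices, weights from 1 to 100, no single voice alongside a mix) can be enforced before a request is sent. This is a sketch under stated assumptions: buildVoicePayload, VoiceMixEntry, and MAX_MIX_VOICES are hypothetical names rather than SDK exports, and requiring at least one entry in a mix is an assumption of this sketch. Whether weights must be integers or sum to 100 is not specified here, so the sketch checks neither.

```typescript
interface VoiceMixEntry {
  voice: string;
  weight: number;
}

interface VoicePayload {
  voice?: string;
  voice_mix?: VoiceMixEntry[];
}

// Limit taken from the text above: up to 4 voices per mix.
const MAX_MIX_VOICES = 4;

// Hypothetical helper: returns the voice-related fields of a request body.
function buildVoicePayload(options: {
  voice?: string;
  voiceMix?: VoiceMixEntry[];
}): VoicePayload {
  const { voice, voiceMix } = options;

  // A single voice and a mix are mutually exclusive.
  if (voice !== undefined && voiceMix !== undefined) {
    throw new Error("Use either voice or voice_mix, not both");
  }

  if (voiceMix !== undefined) {
    // Assumption: an empty mix is treated as an error.
    if (voiceMix.length === 0 || voiceMix.length > MAX_MIX_VOICES) {
      throw new Error(`voice_mix needs 1 to ${MAX_MIX_VOICES} voices`);
    }
    for (const entry of voiceMix) {
      // Written as a negated range check so NaN is rejected too.
      if (!(entry.weight >= 1 && entry.weight <= 100)) {
        throw new Error(`Weight for "${entry.voice}" must be from 1 to 100`);
      }
    }
    return { voice_mix: voiceMix.map((e) => ({ voice: e.voice, weight: e.weight })) };
  }

  // No mix: pass the single voice through, or omit it to get the default voice.
  return voice !== undefined ? { voice } : {};
}
```

Passing the two entries from the sample above (female-chengshu at 70 and female-tianmei at 30) as voiceMix returns the same voice_mix object; passing both voice and voiceMix throws.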

Voice tuning

Independently of which voice you use, you can adjust speed, volume, and pitch (see voice_settings or its equivalent in your SDK) and set an emotion, which is covered in Tone & Markup.

For the exact call in your language, see the JavaScript or Unity TTS guide.