Get Voiceover Tracks
Retrieve a comprehensive list of all available AI voiceover voices with their language, accent, and service provider details
GET
Overview
Retrieve a complete list of all AI voiceover voices available for text-to-speech conversion in your video projects. The endpoint returns detailed information about each voice, including accent, gender, language, service provider (AWS Polly or Google WaveNet/Neural2), sample audio URLs, and SSML support categories.
You need a valid API key to use this endpoint. Get your API key from the API Access page in your Pictory dashboard.
API Endpoint
Request Parameters
Headers
API key for authentication (starts with pictai_)
Response
Returns an array of voice objects with the following properties:
The accent or regional variant of the voice (e.g., “American accent”, “British accent”, “Indian accent”)
Voice quality category, typically “standard”
The text-to-speech engine type (e.g., “neural”, “WaveNet”, “Neural2”, “standard”)
The voice gender: “male” or “female”
Unique numeric identifier for the voice
Language code in IETF format (e.g., “en-US”, “en-GB”, “fr-FR”, “es-ES”)
The display name of the voice (e.g., “Joanna”, “Matthew”, “Amy”)
URL to an MP3 sample of the voice for preview
The voice provider service: “aws” (Amazon Polly) or “google” (Google Cloud Text-to-Speech)
URL to documentation for supported SSML tags for this voice
SSML support level category (A, B, or C) indicating which SSML features are supported
The technical voice identifier used by the service provider
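A typed wrapper can make the properties above easier to work with on the client side. The following is a minimal sketch, not part of the Pictory API. Only service, ssmlHelp, and ssmlSupportCategory are field names that appear on this page, and sample is inferred from it; the remaining keys (id, name, language, accent, gender, engine) are assumed names that should be checked against a real response.

```python
from dataclasses import dataclass
from typing import Any, Mapping


@dataclass(frozen=True)
class Voice:
    """One entry of the voice list. Several key names are assumptions."""
    id: int
    name: str
    language: str               # IETF tag such as "en-US"
    accent: str
    gender: str                 # "male" or "female"
    engine: str                 # "neural", "WaveNet", "Neural2", or "standard"
    service: str                # "aws" or "google"
    sample: str                 # URL of an MP3 preview
    ssml_support_category: str  # "A", "B", or "C"


def parse_voice(raw: Mapping[str, Any]) -> Voice:
    """Build a Voice from one raw JSON object; a missing key raises KeyError."""
    return Voice(
        id=int(raw["id"]),                 # assumed key name
        name=raw["name"],                  # assumed key name
        language=raw["language"],          # assumed key name
        accent=raw["accent"],              # assumed key name
        gender=raw["gender"],              # assumed key name
        engine=raw["engine"],              # assumed key name
        service=raw["service"],
        sample=raw["sample"],
        ssml_support_category=raw["ssmlSupportCategory"],
    )


# Illustrative record only; the values are made up, not real API output.
example = parse_voice({
    "id": 1, "name": "Joanna", "language": "en-US", "accent": "American accent",
    "gender": "female", "engine": "neural", "service": "aws",
    "sample": "https://example.com/joanna.mp3", "ssmlSupportCategory": "B",
})
print(example.name, example.language, example.service)
```

Failing on a missing key is a deliberate choice here: it surfaces a wrong key-name assumption immediately instead of hiding it behind default values.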
Response Examples
Code Examples
Usage Notes
Voice Availability: The endpoint returns all available voices across multiple languages, accents, and service providers. Filter the results based on your project requirements.
SSML Support: Different voices support different SSML (Speech Synthesis Markup Language) features. Check the ssmlSupportCategory and ssmlHelp fields to understand what is available for each voice.
Service Providers:
- AWS Polly voices (service: "aws") use the “neural” or “standard” engine
- Google Cloud voices (service: "google") use “WaveNet” or “Neural2” engines
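One way to see how the two providers are represented in a given response is to tally engine types per service. This is a sketch that works on an already-fetched list of voice dictionaries; service is the field named above, while engine is an assumed key name for the engine type.

```python
from collections import Counter, defaultdict
from typing import Any, Dict, Iterable, Mapping


def engines_by_service(voices: Iterable[Mapping[str, Any]]) -> Dict[str, Counter]:
    """Count how many voices each provider offers per engine type."""
    summary: Dict[str, Counter] = defaultdict(Counter)
    for voice in voices:
        # "engine" is an assumed key name; "service" is documented above.
        summary[voice["service"]][voice["engine"]] += 1
    return dict(summary)


# Made-up records for illustration, not real API output.
sample_voices = [
    {"service": "aws", "engine": "neural"},
    {"service": "aws", "engine": "standard"},
    {"service": "google", "engine": "WaveNet"},
    {"service": "google", "engine": "Neural2"},
    {"service": "google", "engine": "WaveNet"},
]
for service, engines in engines_by_service(sample_voices).items():
    print(service, dict(engines))
```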
Common Use Cases
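Most of the use cases in this section reduce to filtering and grouping the returned list. As a rough sketch, assuming the voice objects are plain dictionaries and that the language and accent properties are stored under keys named language and accent (assumed names, not confirmed on this page):

```python
from collections import defaultdict
from typing import Any, Dict, List, Mapping, Sequence

VoiceRecord = Mapping[str, Any]


def filter_by_language(voices: Sequence[VoiceRecord], language: str) -> List[VoiceRecord]:
    """Return voices whose language tag matches exactly, ignoring case."""
    wanted = language.lower()
    return [v for v in voices if v["language"].lower() == wanted]


def group_by_accent(voices: Sequence[VoiceRecord]) -> Dict[str, List[VoiceRecord]]:
    """Bucket voices by accent label, keeping the original order within each bucket."""
    groups: Dict[str, List[VoiceRecord]] = defaultdict(list)
    for v in voices:
        groups[v["accent"]].append(v)
    return dict(groups)


# Made-up records for illustration, not real API output.
demo_voices = [
    {"name": "Joanna", "language": "en-US", "accent": "American accent"},
    {"name": "Amy", "language": "en-GB", "accent": "British accent"},
    {"name": "Matthew", "language": "en-US", "accent": "American accent"},
]
print([v["name"] for v in filter_by_language(demo_voices, "en-us")])
print({accent: len(vs) for accent, vs in group_by_accent(demo_voices).items()})
```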
1. List All Available Voices
Retrieve and display all available voices.
2. Filter Voices by Language
Get all voices for a specific language.
3. Group Voices by Accent
Organize voices by accent type.
4. Find Best Voice Match
Find voices matching specific criteria.
5. Create Voice Selection UI Data
Prepare voice data for a user interface.
6. Compare Voice Providers
Analyze and compare voices by service provider.
Best Practices
Voice Selection
- Preview Before Use: Always use the sample URLs to let users preview voices before selection
- Filter by Language: Filter voices by the target language to present relevant options to users
- Consider Accent: Match voice accent to your target audience (American, British, Indian, etc.)
- Engine Quality: Neural and WaveNet voices generally sound more natural than standard voices
- SSML Support: Check ssmlSupportCategory if you need advanced SSML features like custom pronunciation or emphasis
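The selection guidelines above can be combined into a single lookup. The following is a minimal sketch, not a Pictory-provided helper: it filters by language and optional gender, sorts non-standard engines first, and produces records suitable for a picker with a preview URL. The keys id, name, language, gender, and engine are assumed names, and sample is inferred from this page.

```python
from typing import Any, Dict, List, Mapping, Optional, Sequence

VoiceRecord = Mapping[str, Any]


def _engine_rank(engine: str) -> int:
    """Lower sorts first. Treating all non-"standard" engines as equally
    preferable is an assumption based on the engine-quality guideline."""
    return 1 if engine.lower() == "standard" else 0


def find_voices(voices: Sequence[VoiceRecord], language: str,
                gender: Optional[str] = None) -> List[VoiceRecord]:
    """Return matching voices, non-standard engines first."""
    matches = [
        v for v in voices
        if v["language"] == language and (gender is None or v["gender"] == gender)
    ]
    # sorted() is stable, so voices with equal rank keep their original order.
    return sorted(matches, key=lambda v: _engine_rank(v["engine"]))


def to_picker_option(voice: VoiceRecord) -> Dict[str, Any]:
    """Shape one voice for a selection UI, including its preview URL."""
    return {
        "value": voice["id"],
        "label": f'{voice["name"]} ({voice["language"]}, {voice["gender"]})',
        "previewUrl": voice["sample"],
    }


# Made-up records for illustration, not real API output.
catalog = [
    {"id": 1, "name": "Amy", "language": "en-GB", "gender": "female",
     "engine": "standard", "sample": "https://example.com/1.mp3"},
    {"id": 2, "name": "Joanna", "language": "en-US", "gender": "female",
     "engine": "standard", "sample": "https://example.com/2.mp3"},
    {"id": 3, "name": "Matthew", "language": "en-US", "gender": "male",
     "engine": "neural", "sample": "https://example.com/3.mp3"},
    {"id": 4, "name": "Joanna", "language": "en-US", "gender": "female",
     "engine": "neural", "sample": "https://example.com/4.mp3"},
]
options = [to_picker_option(v) for v in find_voices(catalog, "en-US", gender="female")]
print(options)
```

Only the preview URL is passed along here; the audio itself would be loaded when a user actually asks to hear a voice.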
Performance Tips
- Cache Voice List: Cache the voice list for 24 hours as it rarely changes
- Client-Side Filtering: Fetch all voices once and filter on the client side
- Lazy Load Samples: Only load audio samples when users preview them
- Index by ID: Create an ID-to-voice mapping for quick lookups
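The caching and indexing tips can be sketched as a small wrapper around whatever function performs the HTTP request. This is an illustration under assumptions: fetch_voices is a hypothetical callable that you supply (the request itself is not shown), and id is an assumed key name for the numeric identifier.

```python
import time
from typing import Any, Callable, Dict, List, Mapping, Optional

VoiceRecord = Mapping[str, Any]
ONE_DAY_SECONDS = 24 * 60 * 60


class VoiceCache:
    """Keeps the voice list in memory and refreshes it once the TTL expires."""

    def __init__(self, fetch_voices: Callable[[], List[VoiceRecord]],
                 ttl_seconds: float = ONE_DAY_SECONDS,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self._fetch = fetch_voices  # hypothetical: performs the actual API call
        self._ttl = ttl_seconds
        self._clock = clock         # injectable so expiry can be tested
        self._voices: List[VoiceRecord] = []
        self._by_id: Dict[int, VoiceRecord] = {}
        self._loaded_at: Optional[float] = None

    def _refresh_if_stale(self) -> None:
        now = self._clock()
        if self._loaded_at is None or now - self._loaded_at >= self._ttl:
            self._voices = list(self._fetch())
            # Build the ID-to-voice index once per refresh ("id" is assumed).
            self._by_id = {v["id"]: v for v in self._voices}
            self._loaded_at = now

    def all(self) -> List[VoiceRecord]:
        """Return the cached list, fetching it first if needed."""
        self._refresh_if_stale()
        return self._voices

    def get(self, voice_id: int) -> Optional[VoiceRecord]:
        """Look up a voice by ID, or return None if it is unknown."""
        self._refresh_if_stale()
        return self._by_id.get(voice_id)


# Usage with a stand-in fetcher; replace the lambda with your real API call.
cache = VoiceCache(lambda: [{"id": 1, "name": "Joanna"}])
print(cache.get(1))
```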
Common Voice Categories
SSML Support Categories:
- Category A: Basic SSML support (standard engines)
- Category B: Advanced SSML support (AWS neural voices)
- Category C: Full SSML support (Google WaveNet/Neural2 voices)
Engine Types:
- standard: Basic quality, faster processing
- neural: High quality, natural-sounding (AWS)
- WaveNet: Premium quality (Google)
- Neural2: Latest generation neural voices (Google)
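If a project depends on particular SSML features, the SSML support categories listed in this section can be compared programmatically. The sketch below assumes the categories form a strict order A < B < C, which the descriptions (basic, advanced, full) suggest but do not state outright; ssmlSupportCategory is the field named on this page.

```python
from typing import Any, Mapping

# Assumed ordering: A (basic) < B (advanced) < C (full).
_SSML_RANK = {"A": 0, "B": 1, "C": 2}


def meets_ssml_category(voice: Mapping[str, Any], minimum: str) -> bool:
    """Return True if the voice's SSML category is at least `minimum`."""
    wanted = minimum.upper()
    if wanted not in _SSML_RANK:
        raise ValueError(f"unknown SSML category: {minimum!r}")
    category = str(voice.get("ssmlSupportCategory", "")).upper()
    # A missing or unrecognized category is treated as not qualifying.
    return _SSML_RANK.get(category, -1) >= _SSML_RANK[wanted]


# Made-up records for illustration, not real API output.
print(meets_ssml_category({"ssmlSupportCategory": "C"}, "B"))
print(meets_ssml_category({"ssmlSupportCategory": "A"}, "B"))
```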
Language Coverage
The API provides voices for multiple languages including:
- English variants: en-US, en-GB, en-AU, en-IN, en-NZ, en-ZA
- European: fr-FR, fr-CA, de-DE, de-AT, it-IT, es-ES, nl-NL, nl-BE, pt-PT, pt-BR
- And many more…
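To see which languages a particular response actually covers, the language codes can be grouped by their primary subtag. A small sketch, assuming each voice stores its language code under a key named language (an assumed name):

```python
from collections import defaultdict
from typing import Any, Dict, Iterable, List, Mapping, Set


def coverage_by_primary_language(voices: Iterable[Mapping[str, Any]]) -> Dict[str, List[str]]:
    """Map each primary subtag (e.g. "en") to the sorted regional codes found."""
    found: Dict[str, Set[str]] = defaultdict(set)
    for voice in voices:
        code = voice["language"]  # assumed key name
        found[code.split("-", 1)[0].lower()].add(code)
    return {primary: sorted(codes) for primary, codes in sorted(found.items())}


# Made-up records for illustration, not real API output.
coverage = coverage_by_primary_language([
    {"language": "en-US"}, {"language": "fr-FR"},
    {"language": "en-GB"}, {"language": "en-US"},
])
print(coverage)
```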
