What You’ll Learn
Video-Level Default
Set a default voice for all scenes
Scene-Level Override
Customize voice for specific scenes
Voice Customization
Control speed and amplification per voice
Multiple Voices
Use different AI voices in one video
Before You Begin
Make sure you have:- A Pictory API key (get one here)
- Node.js or Python installed on your machine
- Basic understanding of voice-over concepts
How Multi-Level Voice-Over Works
Multi-level voice-over gives you granular control over narration:- Video-Level Settings: Define default voice-over configuration for all scenes
- Scene-Level Overrides: Customize voice settings for specific scenes
- Automatic Fallback: Scenes without custom settings use the video-level default
- Flexible Control: Mix and match voices, speeds, and volumes throughout your video
Scene-level voice-over settings always override video-level settings for that specific scene. This allows precise control while maintaining consistency across your video.
Complete Example
Understanding the Configuration
Video-Level Voice-Over (Default Settings)
| Parameter | Type | Default | Description |
|---|---|---|---|
voiceOver.enabled | boolean | false | Enable voice-over for the entire video |
voiceOver.aiVoices | array | - | Array of AI voice configurations |
voiceOver.aiVoices[].speaker | string | - | AI voice name (e.g., “Brian”, “Emma”) |
voiceOver.aiVoices[].speed | number | 100 | Voice speed (50-200) |
voiceOver.aiVoices[].amplificationLevel | number | 0 | Volume level (-1 to 1) |
Scene-Level Voice-Over (Overrides)
| Parameter | Type | Description |
|---|---|---|
scenes[].voiceOver | object | Scene-specific voice-over settings (same structure as video-level) |
scenes[].voiceOver.enabled | boolean | Enable/disable voice-over for this scene |
scenes[].voiceOver.aiVoices | array | Custom voice configuration for this scene |
Voice Speed Reference
| Speed Value | Playback Rate | Best Used For |
|---|---|---|
| 50 | 0.5x (Very slow) | Complex technical content, learning materials |
| 75 | 0.75x (Slower) | Detailed explanations, emphasis |
| 85-90 | 0.85-0.9x (Slightly slower) | Introduction, important points |
| 100 | 1.0x (Normal) | Standard content, most scenes |
| 110-125 | 1.1-1.25x (Slightly faster) | Casual content, transitions |
| 150 | 1.5x (Fast) | Quick summaries, energetic content |
| 200 | 2.0x (Very fast) | Speed reading, urgent calls-to-action |
Amplification Level Reference
| Level | Effect | Best Used For |
|---|---|---|
| -1.0 | Quietest (background) | Subtle narration, ambient voice |
| -0.5 | Quieter than normal | De-emphasized content |
| 0 | Normal volume | Standard narration |
| 0.3 | Slightly louder | Mild emphasis |
| 0.5 | Moderately louder | Important points |
| 1.0 | Loudest | Strong emphasis, calls-to-action |
Common Use Cases
Emphasizing Key Sections
Tutorial Videos
Multi-Language Support
Best Practices
Choose Appropriate Speed Variations
Choose Appropriate Speed Variations
Don’t vary speed too dramatically between scenes - sudden changes can be jarring:
- Good: Vary by 10-20 points (e.g., 90-110)
- Avoid: Extreme jumps (e.g., 50 to 200)
- Tip: Test your video to ensure smooth transitions
Use Amplification Sparingly
Use Amplification Sparingly
Subtle volume changes are more professional than dramatic ones:
- Good: Use 0.1-0.3 for emphasis
- Moderate: Use 0.5 for strong emphasis
- Avoid: Levels above 0.7 unless intentional
- Tip: Let the content quality drive emphasis, not just volume
Maintain Consistency
Maintain Consistency
Keep voice settings consistent across similar scenes:
- Use the same voice for all sections
- Apply similar speed/volume to similar content types
- Create a “voice style guide” for your brand
Test Before Production
Test Before Production
Always review a test video before creating multiple videos:
- Generate a short sample with your settings
- Listen on different devices (phone, desktop, headphones)
- Adjust based on feedback
- Document successful settings for reuse
Troubleshooting
Scene uses wrong voice settings
Scene uses wrong voice settings
Problem: Scene isn’t using your custom voice-over settings.Solution:
- Ensure scene-level
voiceOverobject is properly formatted - Check that you’ve included the
speakername (it must be specified even if using the same voice) - Verify JSON syntax is correct (commas, brackets, quotes)
Voice sounds distorted
Voice sounds distorted
Problem: Audio quality is poor or distorted.Solution:
- Reduce
amplificationLevel(try 0.3 or lower) - Avoid combining high speed (>150) with high amplification (>0.5)
- Check that speed values are within 50-200 range
All scenes use video-level settings
All scenes use video-level settings
Problem: Scene-level overrides aren’t being applied.Solution:
- Verify scene-level
voiceOveris inside the scene object - Check that
enabled: trueis set at scene level - Ensure scene-level configuration is complete (speaker, speed, amplificationLevel)
Next Steps
Enhance your voice-over videos with these features:Background Music
Add music to complement voice-over narration
Custom Captions
Add translated or custom subtitles to your videos
Brand Settings
Apply consistent branding across all your videos
Basic Voice-Over
Learn the basics of AI voice-over narration
