Supported SSML tags

The Text-to-Speech feature in Pictory APIs supports the following SSML tags, with full or limited functionality depending on the underlying text-to-speech platform type.

Supported SSML Tags

<break>
<emphasis>
<phoneme>
<prosody>
<say-as>

SSML tag support is divided into four categories based on the type of text-to-speech platform used for each speaker, as outlined below.

Category A

<break> - Supported
<emphasis> - Supported
<phoneme> - Supported
<prosody> - Supported
<say-as> - Supported

Category B

<break> - Supported
<emphasis> - Not Supported
<phoneme> - Supported
<prosody> - Speakers in this category support the volume and rate attributes but do not support the pitch attribute.
<say-as> - The characters option is not supported by speakers in this category.

Category C

<break> - Supported
<emphasis> - Supported

The <emphasis> tags should only be used around a full sentence. Enclosing words within a sentence may cause unwanted pauses in speech.

This element supports an optional "level" attribute with the following valid values:

  1. strong
  2. moderate
  3. none
  4. reduced

e.g. <emphasis level="moderate">This is an important announcement

<phoneme> - Supported
<prosody> - Supported

The <prosody> tags should only be used around a full sentence. Enclosing words within a sentence may cause unwanted pauses in speech.

e.g. <prosody rate="slow" pitch="-2st">Can you hear me now?</prosody>

<say-as> - Supported

Category D

<break> - Supported
<emphasis> - Not Supported
<phoneme> - Supported
<prosody> - Not Supported
<say-as> - Not Supported