June 16, 2026
FeatureDocumentation
PowerPoint Animated Slides, Multilingual PPT Narration, LLM-Friendly Docs, and OpenAPI Spec
PowerPoint Animated Slides
New scene-level fieldanimatePPT enables AI-generated slide animations (zoom, pan, fade) for image-heavy PowerPoint decks. Only valid when used with pptUrl.- New guide: PowerPoint to Video with Animated Slides — covers
animatePPT, feature combinations, and best practices for when to apply animations. - Existing PowerPoint AI Voiceover and Speaker Notes guides now document
animatePPT. - The field is also documented in the Create Storyboard Preview API reference.
Multilingual PowerPoint Narration — Translate Decks at Render Time
The top-levellanguage field is now documented for the PowerPoint-to-video flow, which means you can generate the video in a different language from the one your PowerPoint deck is written in. Submit an English deck with language: "fr" and Pictory generates the narration directly in French — no need to translate your slides first.- 14 supported languages: Chinese (
zh), Dutch (nl), English (en), French (fr), German (de), Hindi (hi), Italian (it), Japanese (ja), Korean (ko), Marathi (mr), Portuguese (pt), Russian (ru), Spanish (es), and Tamil (ta). - One deck → many language outputs: ship the same source PPT to multiple international audiences without maintaining translated copies.
- Works with both narration sources: whether the AI is reading the slide text or the speaker notes (
useSpeakerNotes: true), the output is translated into the targetlanguage. - Combines with
animatePPT: add slide animations on top of multilingual narration in the same request. - Documented voice per language: each
languagehas a default male STD voice you can use straight away (e.g.,Martinfor English,Gabrielfor French,Wilburfor German). See the End-to-End Recipes guide for the full language → voice mapping.
LLM-Friendly Documentation
New resources that make the Pictory API easier to consume for Large Language Models, code generators, and agentic tools.- OpenAPI 3.1 specification — Available at
docs.pictory.ai/openapi.json. Covers the five most-used endpoints (Create Storyboard Preview, Render Storyboard Video, Render Project, Get Job by ID, Get Video Brands) with 34 component schemas. Use it for SDK code generation, Postman or Insomnia import, or as a structured tool schema for LLMs. - End-to-End Recipes guide — Ten complete, ready-to-run JSON payloads for the most common use cases, including text-to-video, multilingual narration, PowerPoint, avatar walk-throughs, template-based renders, brand kits, webhooks, previews, and job polling.
- Use Pictory with an LLM guide — Recommended system prompt, example natural-language to API-call mappings, and Pictory MCP server pointers.
- AI Tools setup guides — Project rules and configuration templates for Claude Code, Cursor, and Windsurf. Each guide teaches the tool the Pictory API conventions, including the raw-key authentication pattern and mutually-exclusive field constraints.
llms.txtandllms-full.txt— Mintlify auto-generates these atdocs.pictory.ai/llms.txtanddocs.pictory.ai/llms-full.txt. Hand either URL to any LLM to make it Pictory-aware in one shot.
AI Credits — Unified Currency for Generative AI Features
AI Credits are now a single unified currency for Pictory’s generative AI features — AI Image generation, AI Video generation, and AI Avatars. API customers receive AI Credits as part of their plan; unused credits roll over to the next billing cycle and never expire while the subscription stays active.Track consumption and remaining balance with the Get Current Quota and Get AI Credits Usage endpoints.For the full breakdown of per-model rate cards, add-on packs, and Teams pooling, see Understanding AI Credits in the Pictory Help Center.Account and Usage API
Two new endpoints for monitoring your subscription usage and AI credit consumption.Get Current Quota- Retrieve real-time usage for video minutes, transcription minutes, ElevenLabs voice-over minutes, and AI credits.
- Each metric shows
usedandlimitvalues for the current billing term. - Use this to display usage summaries, gate features based on remaining quota, or verify capacity before launching jobs.
- Retrieve a per-transaction ledger of AI credit activity with date, amount, source, and direction (credit or debit).
- Filter by year and month to query specific billing periods.
- Transaction sources include AI Avatar, Text-to-Image, AI Video, Add-on Purchase, and Promotional Credits.
- History available for the last 12 months.
ElevenLabs Voice Auto-Discovery
You can now use any voice from the ElevenLabs catalog without manual setup. Pass the ElevenLabs voice ID directly as thespeaker value in your voiceover configuration, and Pictory will automatically discover and add the voice to your library.- No manual voice registration — the Add Voiceover Track endpoint is no longer needed for ElevenLabs voices.
- Pass any ElevenLabs voice ID as the
speakerfield (e.g.,"pNInz6obpgDQGcFmaJgB"). - Automatic library addition — once discovered, the voice is available by name or track ID in future requests.
- Premium voice settings — fine-tune stability, similarity boost, style, and model selection.
POST /v1/voiceovers/tracks endpoint has been removed. ElevenLabs voices are now added automatically on first use.See the ElevenLabs Voices Guide for full documentation.AI Visuals: Visual Continuity, Reference Images, and Creative Direction
New features for AI-generated scene backgrounds that give you greater creative control over video and image generation.Visual Continuity- Enable
visualContinuityto create seamless transitions between consecutive AI-generated scenes. - Works with both video clips and images.
- The system uses the output of each scene as a reference for the next, producing a cohesive visual flow.
- Continuity applies within the same story and across consecutive stories when enabled.
- Use
firstFrameImageUrlto control the starting frame of an AI-generated video clip. - The AI model generates a video that begins from your provided image and transitions into the motion described by the prompt.
- Use
referenceImageUrlto guide the style and composition of AI-generated images with a reference image. - Influence the color palette, lighting, and visual tone while generating new content from your prompt.
- Use
referenceImageUrlsto provide 1–2 reference images that guide the style of AI-generated video clips. - When using
veo3.1orveo3.1_fastwith reference images, the video duration is automatically set to"8s".
- When a story is split into multiple scenes, the
promptfield acts as a creative direction for the entire video. - The system uses your creative direction to guide the auto-generated prompts for each individual scene.
- Recommended structure: [Action/Movement] + [Scene/Environment] + [Camera Technique] + [Visual Style].
- Job responses now include
aiCreditsUsedwhen AI visuals are generated, reporting total credits consumed across all scenes.
AI Studio: AI Image and Video Generation
Pictory now offers AI Studio, a new set of APIs for generating AI-powered images and videos directly from text prompts. AI Studio gives you access to multiple AI models, aspect ratios, visual styles, and advanced creative input modes.Image Generation- Generate images from text prompts with your choice of AI model and visual style.
- Use a reference image to create variations, replace subjects, or apply transformations.
- Choose from models including
seedream3.0,flux-schnell,nanobanana, andnanobanana-pro. - Apply visual styles such as
photorealistic,artistic,cartoon,minimalist,vintage, andfuturistic.
- Generate videos from text prompts with configurable duration and aspect ratio.
- Start a video from a specific image using the first frame input for precise visual control.
- Extend an existing video with new AI-generated content that continues from the original.
- Provide up to three reference images to guide subject appearance and scene composition.
- Choose from models including
veo3.1,veo3.1_fast, andpixverse5.5.
- Retrieve paginated lists of all your generated images and videos.
- Results are sorted by creation date with the most recent items first.
- Every completed job reports the number of AI credits consumed.
- Image generation is charged per image, and video generation is charged per second of output.
- See the rate card in each model’s documentation for exact pricing.
Dynamic Captions with Word-Level Timing
Dynamic captions now render subtitles word by word, synchronized precisely with the voiceover audio. This produces a more engaging viewing experience where each word appears on screen as it is spoken.- Set
maxSubtitleLinesto a value from1to4to control how many lines of subtitles are displayed at a time. - Word-level timing is applied by default when dynamic captions are active.
