What You’ll Learn
- AI Image Generation: Generate unique images with AI instead of stock visuals
- Image Models: Choose from Flux, Seedream, Nano Banana, and Nano Banana Pro
- Media Styles: Apply styles like photorealistic, artistic, cartoon, and more
- AI Credit Costs: Understand per-image credit costs for each model
Before You Begin
Make sure you have:
- A Pictory API key
- Node.js or Python installed on your machine
- Sufficient AI credits in your account
- Basic understanding of AI image generation concepts
How It Works
When you set background.type to "image" and provide an aiVisual configuration, Pictory generates a unique AI image for the scene background:
- Prompt Processing: Your text prompt (or a prompt auto-generated from the story text) is analyzed
- Style Application: The selected mediaStyle is applied to shape the visual output
- AI Generation: The chosen image model creates a unique image
- Scene Integration: The generated image is used as the scene background
AI image generation takes additional processing time compared to stock visuals. The time varies by model — faster models like Flux generate in seconds, while higher-quality models take longer.
Configuration Reference
Background Object
When using AI-generated images, set the background object on a scene as follows:
| Parameter | Type | Required | Description |
|---|---|---|---|
| type | string | Yes | Must be "image" for AI-generated images |
| aiVisual | object | Yes | AI image generation configuration (see below) |
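As a minimal sketch, the background object for one scene can be written as a Python dictionary and serialized to JSON. Only field names documented on this page are used; the prompt text and the chosen model and style are illustrative placeholders.

```python
import json

# Background object for a single scene, using only the documented fields.
background = {
    "type": "image",  # must be "image" for AI-generated images
    "aiVisual": {
        "model": "flux-schnell",  # required
        # optional; illustrative prompt text
        "prompt": "modern office with glass walls and city skyline view",
        "mediaStyle": "photorealistic",  # optional
    },
}

# Serialize to the JSON form that would be placed on a scene.
background_json = json.dumps(background, indent=2)
print(background_json)
```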
aiVisual Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
| prompt | string | No | Text description of the image to generate (max 250 characters). If omitted, a prompt is auto-generated from the scene’s story text. |
| model | string | Yes | The AI image model to use. See Available Image Models. |
| mediaStyle | string | No | Visual style to apply. See Available Media Styles. |
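The required and optional parameters above could be wrapped in a small helper. This is a sketch only: `build_ai_visual` is a hypothetical name, not part of any Pictory SDK, and it simply enforces the required model and the 250-character prompt limit while leaving out optional fields that are not set.

```python
from typing import Optional

MAX_PROMPT_LENGTH = 250  # documented prompt limit


def build_ai_visual(model: str, prompt: Optional[str] = None,
                    media_style: Optional[str] = None) -> dict:
    """Build an aiVisual object, omitting optional fields that are not set."""
    if not model:
        raise ValueError("model is required")
    if prompt is not None and len(prompt) > MAX_PROMPT_LENGTH:
        raise ValueError(f"prompt exceeds {MAX_PROMPT_LENGTH} characters")

    ai_visual = {"model": model}
    if prompt is not None:
        ai_visual["prompt"] = prompt
    if media_style is not None:
        ai_visual["mediaStyle"] = media_style
    return ai_visual
```

Calling `build_ai_visual("seedream3.0")` yields an object with only the model set, which leaves the prompt to be auto-generated from the story text.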
The videoDuration parameter is not allowed when type is "image". It is only used for AI-generated video clips.
Available Image Models
Each model has different strengths and AI credit costs. Choose based on your quality requirements and budget.
| Model ID | Display Name | AI Credits (per image) | Best For | Supported Aspect Ratios |
|---|---|---|---|---|
| flux-schnell | Flux | 0.6 | Reliable for basic layouts | 1:1, 16:9, 9:16 |
| seedream3.0 | Seedream | 2 | Reliable for text and numbers | 1:1, 16:9, 9:16 |
| nanobanana | Nano Banana | 4 | Excels at details | 1:1, 16:9, 9:16 |
| nanobanana-pro | Nano Banana Pro | 14 | Superior cinematic quality | 1:1, 16:9, 9:16 |
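To make the per-image costs concrete, the sketch below estimates the total credits for a batch of images. The credit values are copied from the table; `estimate_credits` is a hypothetical helper, and the count assumes you know how many images will be generated (for example, one per scene). `Decimal` is used so that 0.6-credit multiples do not pick up floating-point noise.

```python
from decimal import Decimal

# Per-image AI credit costs, copied from the model table.
CREDITS_PER_IMAGE = {
    "flux-schnell": Decimal("0.6"),
    "seedream3.0": Decimal("2"),
    "nanobanana": Decimal("4"),
    "nanobanana-pro": Decimal("14"),
}


def estimate_credits(model: str, image_count: int) -> Decimal:
    """Return the total credits for generating image_count images with model."""
    if model not in CREDITS_PER_IMAGE:
        raise ValueError(f"unknown model: {model}")
    if image_count < 0:
        raise ValueError("image_count must not be negative")
    return CREDITS_PER_IMAGE[model] * image_count


print(estimate_credits("flux-schnell", 10))
print(estimate_credits("nanobanana-pro", 3))
```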
Available Media Styles
The mediaStyle parameter shapes the look and feel of the generated image. It is optional; if omitted, the model uses its default rendering style.
| Style | Visual Characteristics | Best Used For |
|---|---|---|
| photorealistic | Realistic photographs, natural lighting | Corporate videos, professional presentations, realistic scenarios |
| artistic | Artistic renderings, painterly effects | Creative content, brand storytelling, abstract concepts |
| cartoon | Cartoon-style imagery, bold colors | Children’s content, educational videos, fun marketing |
| minimalist | Simple, clean designs, reduced details | Modern branding, tech content, professional minimalism |
| vintage | Retro aesthetic, aged appearance | Nostalgia marketing, historical content, unique branding |
| futuristic | Modern, sci-fi look, high-tech feel | Technology content, innovation topics, forward-thinking brands |
Complete Example
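The following is a hedged sketch of how a multi-scene request body might be assembled, not the official example. The names "videoName", "scenes", and "story" are assumptions made for illustration; only "background", "type", "aiVisual", "prompt", "model", and "mediaStyle" come from this page. Sending the request is left out because the endpoint and authentication details are not covered here.

```python
import json

# ASSUMPTION: "videoName", "scenes", and "story" are illustrative field names.
# The background/aiVisual fields follow the Configuration Reference.
payload = {
    "videoName": "ai_image_demo",
    "scenes": [
        {
            "story": "Our team works from a bright, modern office.",
            "background": {
                "type": "image",
                "aiVisual": {
                    "model": "nanobanana",
                    "prompt": "modern office with glass walls and city skyline view",
                    "mediaStyle": "photorealistic",
                },
            },
        },
        {
            "story": "Sales grew 40% in 2024.",
            "background": {
                "type": "image",
                # seedream3.0 is listed as reliable for text and numbers
                "aiVisual": {"model": "seedream3.0", "mediaStyle": "minimalist"},
            },
        },
    ],
}

print(json.dumps(payload, indent=2))
```

The second scene omits the prompt, so its image prompt would be derived from the story text.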
Auto-Generated Prompts
You can omit the prompt field to let the AI automatically generate a prompt based on your scene’s story text. This is useful when:
- You are creating multi-scene videos with createSceneOnEndOfSentence: true
- You want the AI to visually interpret your story text
- You need quick content creation without manual prompt writing
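A scene that relies on auto-generated prompts might look like the sketch below. The helper name `auto_prompt_scene`, the "story" field name, and the placement of createSceneOnEndOfSentence at the scene level are assumptions for illustration; the point shown is simply that the aiVisual object carries no "prompt" key.

```python
from typing import Optional


def auto_prompt_scene(story: str, model: str,
                      media_style: Optional[str] = None) -> dict:
    """Build a scene whose image prompt is left to be derived from the story."""
    ai_visual = {"model": model}  # deliberately no "prompt" key
    if media_style is not None:
        ai_visual["mediaStyle"] = media_style
    return {
        "story": story,  # ASSUMPTION: field name for the scene's story text
        "createSceneOnEndOfSentence": True,  # ASSUMPTION: scene-level placement
        "background": {"type": "image", "aiVisual": ai_visual},
    }


scene = auto_prompt_scene(
    "Solar panels are getting cheaper. Homes are adopting them faster.",
    model="flux-schnell",
    media_style="artistic",
)
print(scene)
```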
Common Use Cases
Technology Content
Educational Content
Marketing Content
Budget-Friendly Content
Best Practices
Write Effective Prompts
- Be specific: Include details about composition, lighting, and mood
- Use descriptive language: “bright morning sunlight streaming through glass walls” vs “sunny room”
- Mention key elements: “modern office with glass walls and city skyline view”
- Keep under 250 characters: Concise prompts produce more focused results
- Avoid negatives: Describe what you want, not what you don’t want
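Some of these guidelines can be checked mechanically before a prompt is submitted. The sketch below is a local heuristic, not an API feature: `review_prompt` is a hypothetical helper, and both the list of negative words and the four-word minimum are illustrative assumptions.

```python
import re
from typing import List

MAX_PROMPT_LENGTH = 250  # documented prompt limit
# ASSUMPTION: a small, illustrative list of negative phrasings.
NEGATIVE_PATTERN = re.compile(r"\b(no|not|without|don't|never)\b", re.IGNORECASE)


def review_prompt(prompt: str) -> List[str]:
    """Return a list of warnings for a prompt; an empty list means no issues found."""
    warnings = []
    if len(prompt) > MAX_PROMPT_LENGTH:
        warnings.append(f"prompt is longer than {MAX_PROMPT_LENGTH} characters")
    if NEGATIVE_PATTERN.search(prompt):
        warnings.append("prompt uses negative phrasing; describe what you want instead")
    if len(prompt.split()) < 4:
        warnings.append("prompt is very short; add composition, lighting, or mood")
    return warnings


print(review_prompt("sunny room"))
print(review_prompt("bright morning sunlight streaming through glass walls"))
```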
Choose the Right Model for Your Budget
| Scenario | Recommended Model | Cost |
|---|---|---|
| Testing & drafts | flux-schnell | 0.6 credits |
| General content | seedream3.0 | 2 credits |
| Detail-rich scenes | nanobanana | 4 credits |
| Premium final output | nanobanana-pro | 14 credits |
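One way to apply this table is to pick the highest-tier model that still fits a credit budget. The sketch below assumes that a higher per-image cost corresponds to a higher quality tier, following the progression in the table; `best_affordable_model` is a hypothetical helper.

```python
from decimal import Decimal
from typing import Optional

# Models ordered from lowest to highest per-image cost.
MODELS_BY_COST = [
    ("flux-schnell", Decimal("0.6")),
    ("seedream3.0", Decimal("2")),
    ("nanobanana", Decimal("4")),
    ("nanobanana-pro", Decimal("14")),
]


def best_affordable_model(credit_budget, image_count: int) -> Optional[str]:
    """Return the costliest model whose total fits the budget, or None."""
    if image_count <= 0:
        raise ValueError("image_count must be positive")
    budget = Decimal(str(credit_budget))
    choice = None
    for model, cost in MODELS_BY_COST:
        if cost * image_count <= budget:
            choice = model  # later entries are higher tiers
    return choice


print(best_affordable_model(20, 5))
```

With 20 credits and 5 images, nanobanana costs exactly 20 credits and is selected, while nanobanana-pro (70 credits) is out of range.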
Use flux-schnell for iteration, then switch to a higher-quality model for production.
Match Media Style to Your Brand
- Corporate/Professional: photorealistic or minimalist
- Creative/Artistic: artistic or vintage
- Tech/Innovation: futuristic
- Educational/Fun: cartoon
- Consistent branding: Stick to one style across scenes
Use Text-Rendering Models for Text in Images
If your image needs to display readable text or numbers, use seedream3.0. It is specifically optimized for rendering text and numbers clearly within generated images.
Troubleshooting
Generated image doesn't match prompt
- Make your prompt more specific and detailed
- Add descriptive adjectives: “bright”, “modern”, “spacious”
- Specify composition: “aerial view”, “close-up”, “wide angle”
- Include lighting details: “sunset lighting”, “studio lighting”
- Try a different model — each interprets prompts differently
Image quality is low
- Switch to a higher-quality model (nanobanana or nanobanana-pro)
- Avoid overly complex prompts; keep them focused
- Try a different mediaStyle that suits the content
Error: Invalid background configuration
Ensure your configuration follows these rules:
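The rules stated in the Configuration Reference (type must be "image", aiVisual and model are required, the prompt is at most 250 characters, and videoDuration is not allowed) can be checked locally before sending a request. The sketch below is a hypothetical pre-flight check, not the API's own validation: it treats the models and styles listed on this page as the full allowed set, and it looks for videoDuration in both the background and aiVisual objects because the exact location is an assumption.

```python
from typing import List

VALID_MODELS = {"flux-schnell", "seedream3.0", "nanobanana", "nanobanana-pro"}
VALID_STYLES = {"photorealistic", "artistic", "cartoon",
                "minimalist", "vintage", "futuristic"}
MAX_PROMPT_LENGTH = 250


def validate_background(background: dict) -> List[str]:
    """Return a list of rule violations; an empty list means the config looks valid."""
    errors = []
    if background.get("type") != "image":
        errors.append('type must be "image"')
    ai_visual = background.get("aiVisual")
    if not isinstance(ai_visual, dict):
        errors.append("aiVisual is required and must be an object")
        ai_visual = {}
    elif ai_visual.get("model") not in VALID_MODELS:
        errors.append("model is required and must be a listed model ID")
    prompt = ai_visual.get("prompt")
    if prompt is not None and len(prompt) > MAX_PROMPT_LENGTH:
        errors.append(f"prompt must be at most {MAX_PROMPT_LENGTH} characters")
    style = ai_visual.get("mediaStyle")
    if style is not None and style not in VALID_STYLES:
        errors.append("mediaStyle must be a listed style")
    # ASSUMPTION: videoDuration could appear at either level.
    if "videoDuration" in background or "videoDuration" in ai_visual:
        errors.append('videoDuration is not allowed when type is "image"')
    return errors


print(validate_background({"type": "image", "aiVisual": {"model": "flux-schnell"}}))
```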
Error: Insufficient AI credits
Each image generation costs AI credits based on the model used. Check your credit balance and consider using a more economical model like flux-schnell (0.6 credits per image).
Next Steps
- AI-Generated Video Clips: Use AI-generated video clips as scene backgrounds
- AI Voice-Over: Add professional narration to your videos
- Brand Settings: Apply consistent branding automatically
- Background Music: Add music to complement your visuals
