Video Workflow
Create complete videos with AI-generated scripts, slide presentations, text-to-speech narration, and automated video composition.
Video creation wizard in the portal
Overview
The Video Workflow is the most complex workflow, combining multiple AI technologies to produce finished video content:
Script Generation
GPT-4 creates video scripts with scene breakdowns, narration text, and visual descriptions.
Slide Creation
AI generates slide presentations with text, images, and visual elements for each scene.
Voice Synthesis
OpenAI TTS converts narration text to natural-sounding voice audio for each slide.
Video Composition
FFmpeg combines the slides and audio into a final MP4 video with transitions and timing.
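To make the composition step concrete, here is a minimal sketch of how one slide image and its narration audio could be paired into a clip with FFmpeg. The function name and file names are hypothetical, and the source does not state which FFmpeg options the workflow actually uses, so treat the flags below as one plausible choice. Transitions are not covered here.

```python
from pathlib import Path


def build_slide_clip_command(
    slide_image: Path, narration_audio: Path, output_path: Path
) -> list[str]:
    """Return an ffmpeg argument list that pairs one still slide with its narration.

    Hypothetical helper: the real workflow's FFmpeg invocation is not documented.
    """
    return [
        "ffmpeg",
        "-y",                    # overwrite the output file if it already exists
        "-loop", "1",            # repeat the still image as a stream of video frames
        "-i", str(slide_image),
        "-i", str(narration_audio),
        "-c:v", "libx264",       # H.264 video
        "-tune", "stillimage",   # x264 tuning suited to static content
        "-c:a", "aac",           # AAC audio
        "-pix_fmt", "yuv420p",   # pixel format most players accept
        "-shortest",             # stop when the audio ends (the looped image never does)
        str(output_path),
    ]


cmd = build_slide_clip_command(
    Path("slide_01.png"), Path("slide_01.mp3"), Path("clip_01.mp4")
)
print(" ".join(cmd))
```

The list can be passed to `subprocess.run(cmd, check=True)` on a machine where FFmpeg is installed. Building the arguments as a list rather than a shell string avoids quoting problems with file names that contain spaces.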
Stage Breakdown
Discovery
Gather video topic, style, and requirements
Generate Script
AI creates detailed video script with scenes
Script Review Approval
Human reviews and approves script
Create Slides
System creates slide tasks from script
Generate Slides
AI generates visual content for each slide
Slide Review Approval
Human reviews slides with feedback option
Generate Audio
TTS creates voiceover for each slide
Compose Video
FFmpeg combines slides and audio
Complete
Final video available for download
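The stage sequence above can be sketched as a small ordered enumeration. This is an illustration only: the `Stage` enum, `next_stage` function, and the behavior on rejection are assumptions, since the source lists the stages but does not say what happens when a reviewer declines to approve.

```python
from enum import Enum
from typing import Optional


class Stage(Enum):
    DISCOVERY = "Discovery"
    GENERATE_SCRIPT = "Generate Script"
    SCRIPT_REVIEW_APPROVAL = "Script Review Approval"
    CREATE_SLIDES = "Create Slides"
    GENERATE_SLIDES = "Generate Slides"
    SLIDE_REVIEW_APPROVAL = "Slide Review Approval"
    GENERATE_AUDIO = "Generate Audio"
    COMPOSE_VIDEO = "Compose Video"
    COMPLETE = "Complete"


# Enum members iterate in definition order, which matches the documented order.
STAGE_ORDER = list(Stage)

# Stages that wait for a human decision before the workflow advances.
APPROVAL_STAGES = {Stage.SCRIPT_REVIEW_APPROVAL, Stage.SLIDE_REVIEW_APPROVAL}


def next_stage(current: Stage, approved: bool = True) -> Optional[Stage]:
    """Return the stage that follows `current`, or None once the workflow is complete."""
    position = STAGE_ORDER.index(current)
    if current in APPROVAL_STAGES and not approved:
        # Assumption: a rejected review returns to the preceding generation stage.
        return STAGE_ORDER[position - 1]
    if current is Stage.COMPLETE:
        return None
    return STAGE_ORDER[position + 1]


print(next_stage(Stage.DISCOVERY).value)
print(next_stage(Stage.SLIDE_REVIEW_APPROVAL, approved=False).value)
```

Under these assumptions, an approved slide review moves on to Generate Audio, while a rejected one goes back to Generate Slides so the feedback can be applied.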
Configuration Options
| Parameter | Type | Description |
|---|---|---|
| video_title | String | Title of the video |
| topic | Text | Main subject of the video |
| duration_target | Selection | "1-2 min", "3-5 min", "10+ min" |
| video_style | Selection | "explainer", "tutorial", "presentation", "promotional" |
| voice_style | Selection | "alloy", "echo", "fable", "onyx", "nova", "shimmer" |
| slide_style | Selection | "modern", "corporate", "creative", "minimal" |
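As an illustration of how these options fit together, the following sketch validates a configuration dictionary against the allowed values in the table. The `validate_video_config` function is hypothetical, and treating every parameter as required is an assumption; the source does not say which fields are optional or how the portal submits them.

```python
# Allowed values copied from the configuration table.
ALLOWED_VALUES = {
    "duration_target": ("1-2 min", "3-5 min", "10+ min"),
    "video_style": ("explainer", "tutorial", "presentation", "promotional"),
    "voice_style": ("alloy", "echo", "fable", "onyx", "nova", "shimmer"),
    "slide_style": ("modern", "corporate", "creative", "minimal"),
}
FREE_TEXT_FIELDS = ("video_title", "topic")


def validate_video_config(config: dict) -> list[str]:
    """Return a list of problems; an empty list means the config passed every check."""
    problems = []
    for field in FREE_TEXT_FIELDS:
        value = config.get(field)
        if not isinstance(value, str) or not value.strip():
            problems.append(f"{field} must be a non-empty string")
    for field, allowed in ALLOWED_VALUES.items():
        # Assumption: every selection parameter is required.
        if config.get(field) not in allowed:
            problems.append(f"{field} must be one of {list(allowed)}")
    return problems


example_config = {
    "video_title": "Intro to Solar Panels",
    "topic": "How photovoltaic cells convert sunlight into electricity",
    "duration_target": "3-5 min",
    "video_style": "explainer",
    "voice_style": "nova",
    "slide_style": "modern",
}
print(validate_video_config(example_config))  # → []
```

Returning a list of messages rather than raising on the first failure lets a form show all problems at once.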
Requirements
The video workflow requires FFmpeg to be installed on the server for video
composition. Ensure the ffmpeg executable is available in the system PATH.
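One way to verify this requirement before starting a workflow is to look up the executable with Python's standard library. The helper names `find_ffmpeg` and `require_ffmpeg` are hypothetical; only `shutil.which` is a real standard-library call.

```python
import shutil
from typing import Optional


def find_ffmpeg() -> Optional[str]:
    """Return the full path of the ffmpeg executable found on PATH, or None if absent."""
    return shutil.which("ffmpeg")


def require_ffmpeg() -> str:
    """Return the ffmpeg path, raising RuntimeError with a clear message if it is missing."""
    path = find_ffmpeg()
    if path is None:
        raise RuntimeError(
            "ffmpeg was not found in PATH; install FFmpeg before running the video workflow"
        )
    return path
```

Calling `require_ffmpeg()` at server startup surfaces a missing installation early, instead of failing at the Compose Video stage after the script, slides, and audio have already been generated.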