Generate videos from text prompts using the Kling v3.0 model with multi-shot, audio generation, and extended duration support.
first_frame and last_frame images to guide video generationmulti_shot and multi_promptsound: "on"elements with frontal and reference imagesBearer authentication header of the form Bearer <token>, where <token> is your auth token.
First frame image (URL or Base64). Sets the opening frame of the generated video.
Last frame image (URL or Base64). Sets the closing frame of the generated video.
Positive text prompt. Cannot exceed 2500 characters.
Required when multi_shot is false or when shot_type is intelligence.
2500Negative text prompt. Cannot exceed 2500 characters.
2500Whether to generate a multi-shot video.
true: Enable multi-shot mode. prompt is ignored; use shot_type and multi_prompt instead.false: Single-shot mode (default).Shot segmentation method. Required when multi_shot is true.
customize: Custom shots, requires multi_prompt.intelligence: AI-generated shots, requires prompt.customize, intelligence Shot prompt list for multi-shot videos.
Required when multi_shot is true and shot_type is customize.
1 - 6 elementsGenerate audio simultaneously when generating videos.
on: Enable audio generationoff: Disable audio generation (silent video)on, off Video generation mode.
std: Standard Mode (720P), cost-effective.
pro: Professional Mode (1080P), higher quality video output.
std, pro The aspect ratio of the generated video frame (width:height).
16:9, 9:16, 1:1 Video length in seconds (3-15).
Element definitions. Max 3 elements.
Provide frontal and reference images.
Use <<<element_1>>> in prompt to reference elements.
1 - 3 elementsAccepted - Task created successfully