Kling V3.0 Image to Video
Generate videos from images using the Kling v3.0 model with multi-shot, audio, element reference, and extended duration support.
Documentation Index
Fetch the complete documentation index at: https://mulerouter.ai/docs/llms.txt
Use this file to discover all available pages before exploring further.
Overview
Generate videos from reference images using the Kling V3.0 model. In addition to the text-to-video features, image-to-video supports:- First/Last frame control — provide
first_frameandlast_framefor start-to-end frame interpolation - Element references — reference up to 3 subjects via
elementswith frontal and reference images
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Body
First frame image (URL or Base64). Sets the opening frame of the generated video.
Important: When using Base64 encoding, do not add any prefixes such as data:image/png;base64,. Provide only the Base64-encoded string itself.
- Supported image formats: .jpg, .jpeg, .png
- Image file size cannot exceed 10MB
- Width and height dimensions must not be less than 300px
- Aspect ratio should be between 1:2.5 ~ 2.5:1
- At least one of
first_frameandlast_framemust be provided.
Last frame image (URL or Base64). Sets the closing frame of the generated video.
- At least one of
first_frameandlast_framemust be provided. last_frame,dynamic_masks/static_mask, andcamera_controlare mutually exclusive.
Positive text prompt. Cannot exceed 2500 characters.
Required when multi_shot is false or when shot_type is intelligence.
2500Negative text prompt. Cannot exceed 2500 characters.
2500Whether to generate a multi-shot video.
true: Enable multi-shot mode.promptis ignored; useshot_typeandmulti_promptinstead.false: Single-shot mode (default).
Shot segmentation method. Required when multi_shot is true.
customize: Custom shots, requiresmulti_prompt.intelligence: AI-generated shots, requiresprompt.
customize, intelligence Shot prompt list for multi-shot videos.
- Max 6 shots, min 1 shot.
- Each shot prompt max 512 characters.
- Each shot duration must not exceed total duration and must be >= 1.
- Sum of all shot durations must equal total task duration.
Required when multi_shot is true and shot_type is customize.
1 - 6 elementsElement definitions. Max 3 elements.
Provide frontal and reference images.
Use <<<element_1>>> in prompt to reference elements.
1 - 3 elementsGenerate audio simultaneously when generating videos.
on: Enable audio generationoff: Disable audio generation (silent video)
on, off Video generation mode.
std: Standard Mode (720P), cost-effective.
pro: Professional Mode (1080P), higher quality video output.
std, pro Video length in seconds (3-15).
Response
Accepted - Task created successfully

