Edit videos using Alibaba's wan2.1-vace-plus model with multi-modal inputs.
| Property | Value |
|---|---|
| Audio | No (silent video only) |
| First Frame | ✅ Supported |
| First & Last Frame | ✅ Supported |
| Resolution | 720P only |
| Duration | 5 seconds (fixed) |
| Frame Rate | 30fps |
| Format | MP4 (H.264) |

| Property | Requirement |
|---|---|
| Formats | JPEG, JPG, PNG, BMP, WEBP |
| File Size | Max 10MB per image |
| Input | Public URL or Base64 encoded data |
| Multiple Images | Supported for content and style reference |
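
The image requirements in the table above can be checked locally before uploading. The following is a minimal sketch, not part of the API: the helper name `check_image` is hypothetical, the format is inferred from the file extension only, and "10MB" is assumed to mean 10 × 1024 × 1024 bytes.

```python
import os

# Limits taken from the input image requirements table.
ALLOWED_EXTENSIONS = {".jpeg", ".jpg", ".png", ".bmp", ".webp"}
MAX_IMAGE_BYTES = 10 * 1024 * 1024  # assumption: "10MB" means 10 MiB


def check_image(filename, size_bytes):
    """Return a list of problems; an empty list means the image passes.

    Hypothetical helper: it only looks at the file extension and the
    byte count, not at the actual image contents.
    """
    problems = []
    ext = os.path.splitext(filename)[1].lower()
    if ext not in ALLOWED_EXTENSIONS:
        problems.append("unsupported format: " + (ext or "(none)"))
    if size_bytes > MAX_IMAGE_BYTES:
        problems.append("file too large: %d bytes" % size_bytes)
    return problems
```

Calling `check_image("photo.PNG", 2_000_000)` returns an empty list, while an oversized or unsupported file returns one message per violated requirement.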
Bearer authentication header of the form `Bearer <token>`, where `<token>` is your auth token.
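
As an illustration of the header format described above, the sketch below builds the request headers in Python. The function name `build_headers` is hypothetical, and the JSON `Content-Type` is an assumption about the request body, not something this page states; `Authorization` is the standard HTTP header for Bearer tokens.

```python
def build_headers(token):
    """Build HTTP headers carrying a Bearer token.

    Hypothetical helper. The Content-Type value is an assumption.
    """
    token = (token or "").strip()
    if not token:
        raise ValueError("an auth token is required")
    return {
        "Authorization": "Bearer " + token,
        "Content-Type": "application/json",  # assumed request body format
    }
```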
wan2.1-vace-plus video editing entry point. Select a function and supply the parameters supported by that function.
Model name (only wan2.1-vace-plus is supported)
Feature selector for the wan2.1-vace-plus multi-image reference function. Allowed value: "image_reference".
Text description for the synthesized video. Maximum length: 800 characters.
Public URLs of 1 to 3 reference images (JPG/JPEG/PNG/BMP/TIFF/WEBP, 360-2000px, ≤10MB, ASCII URL). Required.
Negative prompt. Maximum length: 500 characters.
Labels for the images in ref_images_url, marking each as an entity ("obj") or a background ("bg"). Required when multiple images are supplied; defaults to ["obj"] for a single image. Allowed values: obj, bg.
Output resolution ("width*height"). Supported: 1280*720 (default), 720*1280, 960*960, 832*1088, 1088*832.
Output duration in seconds. Fixed at 5.
Enable intelligent prompt rewriting (default true).
Random seed. Range: 0 to 2147483647 (inclusive).
Accepted - Task created successfully
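
The limits listed for the multi-image reference function can be checked client-side before a task is submitted. The sketch below is one way to do that under stated assumptions: apart from `ref_images_url`, the model name, and the `"image_reference"` value, every field name (`function`, `prompt`, `negative_prompt`, `obj_or_bg`, `size`, `duration`, `prompt_extend`, `seed`) and the flat layout of the body are hypothetical, because this page does not show them. The rule that there is exactly one label per image is also an assumption.

```python
ALLOWED_SIZES = {"1280*720", "720*1280", "960*960", "832*1088", "1088*832"}
ALLOWED_LABELS = {"obj", "bg"}
MAX_SEED = 2147483647


def build_image_reference_request(prompt, ref_images_url, obj_or_bg=None,
                                  negative_prompt=None, size="1280*720",
                                  prompt_extend=True, seed=None):
    """Validate inputs and return a request body as a dict.

    Field names other than ref_images_url are hypothetical.
    """
    if len(prompt) > 800:
        raise ValueError("prompt exceeds 800 characters")
    if not 1 <= len(ref_images_url) <= 3:
        raise ValueError("1 to 3 reference image URLs are required")
    if any(not url.isascii() for url in ref_images_url):
        raise ValueError("reference image URLs must be ASCII")
    if obj_or_bg is None:
        if len(ref_images_url) > 1:
            raise ValueError("labels are required for multiple images")
        obj_or_bg = ["obj"]  # documented default for a single image
    # Assumption: exactly one label per reference image.
    if len(obj_or_bg) != len(ref_images_url):
        raise ValueError("one label per reference image is expected")
    if not set(obj_or_bg) <= ALLOWED_LABELS:
        raise ValueError('labels must be "obj" or "bg"')
    if negative_prompt is not None and len(negative_prompt) > 500:
        raise ValueError("negative prompt exceeds 500 characters")
    if size not in ALLOWED_SIZES:
        raise ValueError("unsupported size: " + size)
    if seed is not None and not 0 <= seed <= MAX_SEED:
        raise ValueError("seed out of range")

    body = {
        "model": "wan2.1-vace-plus",
        "function": "image_reference",
        "prompt": prompt,
        "ref_images_url": list(ref_images_url),
        "obj_or_bg": list(obj_or_bg),
        "size": size,
        "duration": 5,  # fixed at 5 seconds
        "prompt_extend": prompt_extend,
    }
    if negative_prompt is not None:
        body["negative_prompt"] = negative_prompt
    if seed is not None:
        body["seed"] = seed
    return body
```

Failing early on the client avoids creating a task that the service would reject; the authoritative field names and nesting should be taken from the actual request schema.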