MuleRouter Documentation - AI API Gateway & Routing Platform

This API supports Alibaba Tongyi Wanxiang (Wan2) video editing models. Please refer to Alibaba Cloud’s official documentation for more details.

Overview

Edit videos with the wan2.1-vace-plus model, which accepts text, image, and video inputs. A single entry point exposes the five editing functions listed under Supported Functions.

Supported Functions

Function	Description
`image_reference`	Generate videos from 1-3 reference images
`video_repainting`	Repaint/restyle an existing video
`video_edit`	Local (masked) editing of video regions
`video_extension`	Extend video with new content
`video_outpainting`	Expand video canvas in any direction

Example Requests

Multi-image Reference

{
  "model": "wan2.1-vace-plus",
  "function": "image_reference",
  "prompt": "A cat playing in the garden",
  "ref_images_url": [
    "https://example.com/cat.jpg",
    "https://example.com/garden.jpg"
  ],
  "obj_or_bg": ["obj", "bg"],
  "size": "1280*720",
  "duration": 5
}

Video Repainting

{
  "model": "wan2.1-vace-plus",
  "function": "video_repainting",
  "prompt": "Transform the scene into a cyberpunk style",
  "video_url": "https://example.com/source.mp4",
  "control_condition": "depth",
  "strength": 0.8
}

Local Editing

{
  "model": "wan2.1-vace-plus",
  "function": "video_edit",
  "prompt": "Replace the background with a beach sunset",
  "video_url": "https://example.com/source.mp4",
  "mask_image_url": "https://example.com/mask.png",
  "mask_type": "tracking"
}

Video Extension

{
  "model": "wan2.1-vace-plus",
  "function": "video_extension",
  "prompt": "Continue the scene with the character walking forward",
  "first_clip_url": "https://example.com/clip.mp4",
  "duration": 5
}

Video Outpainting

{
  "model": "wan2.1-vace-plus",
  "function": "video_outpainting",
  "prompt": "Expand to show more of the surrounding environment",
  "video_url": "https://example.com/source.mp4",
  "top_scale": 1.5,
  "left_scale": 1.3,
  "right_scale": 1.3
}

Function Details

image_reference

Required: ref_images_url (1-3 images)
Required when multiple images: obj_or_bg array
Output: 720P resolution, 5s duration
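
The image_reference rules above can be checked on the client before a request is sent. The following is a minimal Python sketch, assuming the field names shown in the example request. The helper name build_image_reference is hypothetical and not part of the API, and the one-label-per-image check is an assumption drawn from the obj_or_bg description.

```python
from typing import Optional


def build_image_reference(prompt: str, ref_images_url: list[str],
                          obj_or_bg: Optional[list[str]] = None) -> dict:
    """Build an image_reference body (hypothetical helper, not part of the API)."""
    if not 1 <= len(ref_images_url) <= 3:
        raise ValueError("ref_images_url must contain 1-3 image URLs")
    # obj_or_bg becomes mandatory as soon as more than one image is supplied.
    if len(ref_images_url) > 1 and obj_or_bg is None:
        raise ValueError("obj_or_bg is required when multiple images are supplied")
    if obj_or_bg is not None:
        # Assumption: one label per image.
        if len(obj_or_bg) != len(ref_images_url):
            raise ValueError("obj_or_bg must have one label per reference image")
        if any(label not in ("obj", "bg") for label in obj_or_bg):
            raise ValueError('obj_or_bg labels must be "obj" or "bg"')
    body = {
        "model": "wan2.1-vace-plus",
        "function": "image_reference",
        "prompt": prompt,
        "ref_images_url": list(ref_images_url),
    }
    if obj_or_bg is not None:
        body["obj_or_bg"] = list(obj_or_bg)
    return body
```

With a single image the obj_or_bg argument can be omitted, in which case the sketch leaves the field out of the body and relies on the documented default.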

video_repainting

Required: video_url, control_condition
Control conditions: posebodyface, posebody, depth, scribble
Strength: 0.0-1.0 (default 1.0)
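
As an illustration of the video_repainting constraints, here is a small Python sketch that rejects an unknown control condition or an out-of-range strength before building the body. The function name build_video_repainting is hypothetical. Passing prompt follows the example request rather than a stated requirement.

```python
CONTROL_CONDITIONS = ("posebodyface", "posebody", "depth", "scribble")


def build_video_repainting(prompt: str, video_url: str,
                           control_condition: str,
                           strength: float = 1.0) -> dict:
    """Build a video_repainting body (hypothetical helper, not part of the API)."""
    if control_condition not in CONTROL_CONDITIONS:
        raise ValueError(f"control_condition must be one of {CONTROL_CONDITIONS}")
    # The documented range is 0.0-1.0, with 1.0 as the default.
    if not 0.0 <= strength <= 1.0:
        raise ValueError("strength must be between 0.0 and 1.0")
    return {
        "model": "wan2.1-vace-plus",
        "function": "video_repainting",
        "prompt": prompt,
        "video_url": video_url,
        "control_condition": control_condition,
        "strength": strength,
    }
```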

video_edit

Required: video_url
Mask options: mask_image_url or mask_video_url
Mask types: tracking (follows motion), fixed (static)
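
The mask options above could be assembled as in the following Python sketch. It only sends the optional fields the caller sets and checks mask_type against the two documented values. The helper name build_video_edit is hypothetical, and the sketch deliberately makes no assumption about whether the two mask fields may be combined, since the source does not say.

```python
from typing import Optional

MASK_TYPES = ("tracking", "fixed")


def build_video_edit(prompt: str, video_url: str,
                     mask_image_url: Optional[str] = None,
                     mask_video_url: Optional[str] = None,
                     mask_type: Optional[str] = None) -> dict:
    """Build a video_edit body (hypothetical helper, not part of the API)."""
    if mask_type is not None and mask_type not in MASK_TYPES:
        raise ValueError(f"mask_type must be one of {MASK_TYPES}")
    body = {
        "model": "wan2.1-vace-plus",
        "function": "video_edit",
        "prompt": prompt,
        "video_url": video_url,
    }
    optional = {
        "mask_image_url": mask_image_url,
        "mask_video_url": mask_video_url,
        "mask_type": mask_type,
    }
    # Only include the optional fields that were actually provided.
    body.update({key: value for key, value in optional.items() if value is not None})
    return body
```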

video_extension

Required: prompt
Optional: first_frame_url, last_frame_url, first_clip_url, last_clip_url
With guidance video: requires control_condition

video_outpainting

Required: video_url
Scale options: top_scale, bottom_scale, left_scale, right_scale (1.0-2.0)
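
To make the scale range concrete, this Python sketch accepts any subset of the four scale fields and rejects values outside 1.0-2.0. The helper name build_video_outpainting is hypothetical. The sketch validates the range only and does not try to interpret what a given scale does to the canvas.

```python
SCALE_FIELDS = ("top_scale", "bottom_scale", "left_scale", "right_scale")


def build_video_outpainting(prompt: str, video_url: str, **scales: float) -> dict:
    """Build a video_outpainting body (hypothetical helper, not part of the API)."""
    body = {
        "model": "wan2.1-vace-plus",
        "function": "video_outpainting",
        "prompt": prompt,
        "video_url": video_url,
    }
    for name, value in scales.items():
        if name not in SCALE_FIELDS:
            raise ValueError(f"unknown scale field: {name}")
        # Each scale must stay within the documented 1.0-2.0 range.
        if not 1.0 <= value <= 2.0:
            raise ValueError(f"{name} must be between 1.0 and 2.0")
        body[name] = value
    return body
```

For example, build_video_outpainting(prompt, url, top_scale=1.5, left_scale=1.3, right_scale=1.3) reproduces the Video Outpainting example request.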

Video Requirements

Property	Requirement
Format	MP4
Max size	50MB
Min FPS	16
Max duration	5s
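
A pre-upload check against the table above might look like the following Python sketch. It assumes the caller has already obtained the video's container, size, frame rate, and duration (for example from a media probing tool). The VideoInfo type and check_video function are hypothetical, and reading "50MB" as 50 * 1024 * 1024 bytes is an assumption.

```python
from dataclasses import dataclass

MAX_SIZE_BYTES = 50 * 1024 * 1024  # Assumption: "50MB" interpreted as 50 MiB.
MIN_FPS = 16
MAX_DURATION_S = 5.0


@dataclass
class VideoInfo:
    """Metadata the caller has already measured (hypothetical type)."""
    container: str     # e.g. "mp4"
    size_bytes: int
    fps: float
    duration_s: float


def check_video(info: VideoInfo) -> list[str]:
    """Return a list of requirement violations; an empty list means the video passes."""
    problems = []
    if info.container.lower() != "mp4":
        problems.append("format must be MP4")
    if info.size_bytes > MAX_SIZE_BYTES:
        problems.append("file exceeds 50MB")
    if info.fps < MIN_FPS:
        problems.append("frame rate is below 16 FPS")
    if info.duration_s > MAX_DURATION_S:
        problems.append("duration exceeds 5s")
    return problems
```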

Authorizations

Authorization (string, header, required)

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
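
The header format can be illustrated with Python's standard library. This sketch only constructs the request object and does not send it. The endpoint URL is deliberately a parameter because this page does not state it, and make_request is a hypothetical helper name.

```python
import json
import urllib.request


def make_request(endpoint_url: str, token: str, body: dict) -> urllib.request.Request:
    """Build (but do not send) a POST request with Bearer authentication.

    endpoint_url is supplied by the caller; this page does not specify it.
    """
    data = json.dumps(body).encode("utf-8")
    return urllib.request.Request(
        endpoint_url,
        data=data,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )
```

The resulting object can be passed to urllib.request.urlopen once the real endpoint and token are known.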

Body

application/json

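wan2.1-vace-plus video editing entry point. Select a function and supply the parameters supported by that function. The request body has one variant per function: Multi-image Reference, Video Repainting, Local Editing, Video Extension, and Video Outpainting. The fields that follow describe the Multi-image Reference variant.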

model (enum<string>, required)

Model name. Only wan2.1-vace-plus is supported.

function (enum<string>, required)

Multi-image reference feature selector. Allowed value: "image_reference".

prompt (string, required)

Text description for the synthesized video. Maximum length: 800 characters.

ref_images_url (string<uri>[], required)

Public URLs for 1-3 reference images. Accepted formats: JPG, JPEG, PNG, BMP, TIFF, WEBP. Each image must be 360-2000px and at most 10MB, and each URL must be ASCII.

negative_prompt (string)

Negative prompt. Maximum length: 500 characters.

obj_or_bg (enum<string>[])

Labels each image in ref_images_url as an entity ("obj") or a background ("bg"); the array matches ref_images_url. Required when multiple images are supplied; defaults to ["obj"] for a single image.

size (enum<string>, default: 1280*720)

Output resolution as "width*height". Available options: 1280*720, 720*1280, 960*960, 832*1088, 1088*832.

duration (enum<integer> | null)

Output duration in seconds. Fixed at 5, which is the only available option.

prompt_extend (boolean, default: true)

Enables intelligent prompt rewriting.

seed (integer)

Random seed. Required range: 0 <= x <= 2147483647.
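
The scalar limits in the field list (prompt lengths, size options, duration, seed range) lend themselves to a single validation pass. The following Python sketch returns a list of error messages. The function name validate_body_fields is hypothetical, and counting characters with len() is an assumption about how the service measures string length.

```python
SIZES = ("1280*720", "720*1280", "960*960", "832*1088", "1088*832")
MAX_SEED = 2147483647


def validate_body_fields(body: dict) -> list[str]:
    """Check scalar fields of a Multi-image Reference body (hypothetical helper)."""
    errors = []
    prompt = body.get("prompt")
    if not isinstance(prompt, str):
        errors.append("prompt is required")
    elif len(prompt) > 800:
        errors.append("prompt exceeds 800 characters")
    if len(body.get("negative_prompt", "")) > 500:
        errors.append("negative_prompt exceeds 500 characters")
    if body.get("size", "1280*720") not in SIZES:
        errors.append("size is not a supported resolution")
    # duration may be omitted or null; otherwise it must be 5.
    if body.get("duration") not in (None, 5):
        errors.append("duration must be 5")
    seed = body.get("seed")
    if seed is not None:
        # bool is a subclass of int in Python, so exclude it explicitly.
        is_int = isinstance(seed, int) and not isinstance(seed, bool)
        if not is_int or not 0 <= seed <= MAX_SEED:
            errors.append("seed must be an integer in [0, 2147483647]")
    return errors
```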

Response

202 - application/json

Accepted - Task created successfully

task_info (object)
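
Because a successful call returns 202 with a task_info object, client code will typically check the status and pull that object out of the decoded JSON. The Python sketch below does only that. The helper name extract_task_info is hypothetical, and the child attributes of task_info are not listed on this page, so the sketch treats the object as opaque.

```python
def extract_task_info(status_code: int, payload: dict) -> dict:
    """Return task_info from an accepted response (hypothetical helper).

    payload is the already-decoded JSON response body.
    """
    if status_code != 202:
        raise RuntimeError(f"task was not accepted (HTTP {status_code})")
    task_info = payload.get("task_info")
    if not isinstance(task_info, dict):
        raise ValueError("response has no task_info object")
    # The attributes inside task_info are not documented here; return it as-is.
    return task_info
```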
