video-inpainting

21Star

0Fork

更新于 6/16/2026

获取 Skill 源代码

SKILL.md

readonly只读

name

video-inpainting

description

Video Inpainting

Region edits across video frames — remove an object that appears across many frames, clean up wires or watermarks, replace a region with motion that matches the rest of the clip. This skill routes across the prompt-driven video edit endpoints in the RunComfy catalog and gives the agent a clear default for each intent.

runcomfy.com · Wan 2-7 edit-video · CLI docs

Powered by the RunComfy CLI

# 1. Install (see runcomfy-cli skill for details)
npm i -g @runcomfy/cli      # or:  npx -y @runcomfy/cli --version

# 2. Sign in
runcomfy login              # or in CI: export RUNCOMFY_TOKEN=<token>

# 3. Edit a video (closest CLI-reachable approach)
runcomfy run wan-ai/wan-2-7/edit-video \
  --input '{"video_url": "...", "prompt": "..."}' \
  --output-dir ./out

CLI deep dive: runcomfy-cli skill.

Pick the right model

Routes via prompt-driven region edits — the model resolves the targeted region from spatial language across all frames.

Wan 2-7 Edit-Video — wan-ai/wan-2-7/edit-video (default)

Wan 2-7's video edit endpoint. Drive frame-by-frame edits via prompt + the source video.
Pick for: "remove the watermark in the bottom-right", "replace the sky with a sunset" — prompt-driven region intent without an explicit mask.
Avoid for: precise pixel-level region targeting — use a ComfyUI workflow.

Lucy Edit Restyle — decart/lucy-edit/restyle

Identity-stable video restyle that handles region-aware edits.
Pick for: lightweight outfit / object swap that needs to track across frames.
Avoid for: surgical mask-driven inpaint — ComfyUI workflow.

Seedream 4-0 Edit-Sequential — bytedance/seedream-4-0/edit-sequential

Sequential still edits — feed a sequence of frames as inputs, apply the same edit instruction across each, useful if you're treating the video as a frame stack.
Pick for: short, low-frame-rate sequences where each frame can be edited independently and a separate tool re-encodes to video.
Avoid for: long clips, motion-coherent fills — temporal consistency degrades.

Route 1: Wan 2-7 Edit-Video — closest CLI path

Model: wan-ai/wan-2-7/edit-video
Catalog: Wan 2-7 edit-video

Invoke

runcomfy run wan-ai/wan-2-7/edit-video \
  --input '{
    "video_url": "https://your-cdn.example/source.mp4",
    "prompt": "Remove the watermark in the bottom-right corner across all frames. Preserve all other content exactly. Match background where the watermark was."
  }' \
  --output-dir ./out

Prompting tips

Describe the region in spatial language — "bottom-right corner", "the cables overhead", "the second person from the left".
Lead with preservation: "Preserve all other content exactly" — without this Wan may restyle frames inadvertently.
One change per call. Compound edits (remove A and replace B) tend to drift; split into sequential edit passes.

For broader video edit, see video-edit.

When you need pixel-precise mask propagation

The endpoints above are prompt-driven — they resolve the target region from spatial language. For pixel-precise mask propagation with SAM2 segmentation tracking + temporal-aware inpaint backfill, RunComfy hosts dedicated ComfyUI workflows:

Need	Workflow class
LTX 2-3 video inpaint (targeted frame editing)	`ltx-2-3-inpaint-in-comfyui-targeted-video-frame-editing`
Flux inpainting (still) — chain frame-by-frame	`comfyui-flux-inpainting-workflow`
Flux ControlNet inpainting	`flux-controlnet-inpainting-image-repair`
Wan 2-2 video edit (broader video edit including inpaint)	search comfyui-workflows for "wan 2-2 edit"

These are GUI workflows, not CLI endpoints. The CLI can't reach them — open them in the RunComfy ComfyUI cloud for proper mask propagation + temporal consistency.

Common patterns

Remove watermark / logo across entire clip

Route 1 (Wan 2-7 Edit-Video) with spatial language. Acceptable for most cases.
If quality not enough: open LTX 2-3 inpaint workflow in ComfyUI for mask-driven propagation.

Remove a passing background person

Wan 2-7 Edit-Video with "remove the person walking in the background, fill with matching environment".
For better results: ComfyUI workflow with SAM2 segmentation tracking.

Replace a specific object across frames

Wan 2-7 Edit-Video + descriptive prompt OK for simple cases.
For brand-locked replacement (must look like brand X): chain Wan edit → frame extract → Z-Image Inpaint per frame → re-encode (heavyweight).

What this skill doesn't do

Image inpainting (single still): see image-inpainting.
Video outpainting (canvas expansion): see video-outpainting.
Full video restyle / motion transfer: see video-edit.

Browse the full catalog

All video models — every video endpoint with API schema
ComfyUI workflows — "inpaint" search — full graphs for mask-driven video inpaint
wan-models collection

Exit codes

code	meaning
0	success
64	bad CLI args
65	bad input JSON / schema mismatch
69	upstream 5xx
75	retryable: timeout / 429
77	not signed in or token rejected

Full reference: docs.runcomfy.com/cli/troubleshooting.

How it works

The skill picks Wan 2-7 Edit-Video (default for prompt-driven region edits) or one of the alternatives based on whether the user needs identity-locked restyle or frame-stack treatment. The CLI POSTs to the Model API, polls request status, and downloads the result into --output-dir.

Security & Privacy

Install via verified package manager only. Use npm i -g @runcomfy/cli or npx -y @runcomfy/cli. Agents must not pipe an arbitrary remote install script into a shell on the user's behalf.
Token storage: runcomfy login writes the API token to ~/.config/runcomfy/token.json with mode 0600. Set RUNCOMFY_TOKEN env var in CI / containers.
Input boundary (shell injection): prompts and video URLs are passed as a JSON string via --input. The CLI does not shell-expand prompt content. No shell-injection surface.
Indirect prompt injection (third-party content): source video URLs are untrusted; embedded text / EXIF can influence the edit. Agent mitigations:
- Ingest only URLs the user explicitly provided for this inpaint.
- When the output diverges from the prompt, suspect the source video.
Outbound endpoints (allowlist): only model-api.runcomfy.net and *.runcomfy.net / *.runcomfy.com. No telemetry.
Generated-file size cap: the CLI aborts any single download > 2 GiB.
Scope of bash usage: Bash(runcomfy *) only.

Related Skills

caveman-compress

73Kbackend-api

juliusbrussee

获取

hyperframes-media

29Kbackend-api

Asset preprocessing for HyperFrames compositions — multi-provider TTS (HeyGen / ElevenLabs / Kokoro local), multi-provider BGM (Google Lyria / local MusicGen), Whisper transcription, background removal, and caption authoring. Use for npx hyperframes tts, bgm, transcribe, remove-background, voice/provider selection, music-mood prompting, captions / subtitles / lyrics / karaoke / per-word styling.

heygen-com

获取

lark-base

14Kbackend-api

飞书多维表格（Base）操作：建表、字段、记录、视图、统计、公式/lookup、表单、仪表盘、workflow、角色权限；遇到 Base/多维表格/bitable 或 /base/ 链接时使用。文件导入转 lark-drive，认证/授权转 lark-shared。

larksuite

获取

azure-resource-visualizer

1.2Kbackend-api

Analyze Azure resource groups and generate detailed Mermaid architecture diagrams showing the relationships between individual resources. WHEN: create architecture diagram, visualize Azure resources, show resource relationships, generate Mermaid diagram, analyze resource group, diagram my resources, architecture visualization, resource topology, map Azure infrastructure.

microsoft

获取

azure-aigateway

1.2Kbackend-api

Configure Azure API Management as an AI Gateway for AI models, MCP tools, and agents. WHEN: semantic caching, token limit, content safety, load balancing, AI model governance, MCP rate limiting, jailbreak detection, add Azure OpenAI backend, add AI Foundry model, test AI gateway, LLM policies, configure AI backend, token metrics, AI cost control, convert API to MCP, import OpenAPI to gateway.

microsoft

获取

firebase-ai-logic-basics

357backend-api

Official skill for integrating Firebase AI Logic (Gemini API) into web applications. Covers setup, multimodal inference, structured output, and security.

firebase

获取

video-inpainting

Video Inpainting

Powered by the RunComfy CLI

Pick the right model

Route 1: Wan 2-7 Edit-Video — closest CLI path

Invoke

Prompting tips

When you need pixel-precise mask propagation

Common patterns

Remove watermark / logo across entire clip

Remove a passing background person

Replace a specific object across frames

What this skill doesn't do

Browse the full catalog

Exit codes

How it works

Security & Privacy

See also

You Might Also Like

Related Skills

caveman-compress

hyperframes-media

lark-base

azure-resource-visualizer

azure-aigateway

firebase-ai-logic-basics