CueFrame

An API-first, agent-native video composition substrate. Your agent brings the intent; CueFrame turns it into video.

CueFrame is a video composition API built for AI agents. The customer is programmatic — an agent connects over MCP, REST, or the CLI, expresses intent (clips, reframing, captions, brand), and CueFrame renders production-quality video in any aspect ratio.

It is the reliable, expressive hands — not the brain. You direct; it does the legwork.

The loop

Every job is the same shape:

Ingest

Pull footage from a public URL or generate it from a prompt. CueFrame transcribes and detects faces so the edit lines up with the content.
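As a rough illustration of the two ingest paths, the sketch below builds a request body that carries either a public URL or a generation prompt, never both. Everything named here is hypothetical and invented for the example: the function `build_ingest_request` and the fields `source_url`, `prompt`, `transcribe`, and `detect_faces` are assumptions, not names taken from the CueFrame API reference.

```python
from typing import Optional
from urllib.parse import urlparse


def build_ingest_request(source_url: Optional[str] = None,
                         prompt: Optional[str] = None) -> dict:
    """Build a hypothetical ingest payload from exactly one footage source."""
    # Exactly one of the two sources must be given.
    if (source_url is None) == (prompt is None):
        raise ValueError("provide exactly one of source_url or prompt")

    if source_url is not None:
        parsed = urlparse(source_url)
        if parsed.scheme not in ("http", "https") or not parsed.netloc:
            raise ValueError("source_url must be an http(s) URL")
        source = {"source_url": source_url}
    else:
        if not prompt.strip():
            raise ValueError("prompt must not be empty")
        source = {"prompt": prompt}

    # Assumed flags mirroring the transcription and face detection
    # described above; the real request shape may differ.
    return {**source, "transcribe": True, "detect_faces": True}
```

Rejecting an ambiguous request on the client keeps the agent from submitting a job that could not be interpreted either way.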

Compose

Author the edit as data (clips, reframe, captions). The Director can draft an ensemble of candidate edits, and a judge scores them. Dry-run validation is free.
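To make "the edit as data" concrete, here is a minimal sketch of a composition described with plain dataclasses, plus a local check in the spirit of dry-run validation. The class names, field names, and the `dry_run` function are all hypothetical; CueFrame's actual schema and validation rules are not shown in this page, so treat this as an illustration of the idea rather than the real format.

```python
from dataclasses import dataclass, field
from typing import List

# Aspect ratios named in the Render step of this page.
ASPECT_RATIOS = {"9:16", "16:9", "1:1", "4:5"}


@dataclass
class Clip:
    start_s: float  # clip start within the source, in seconds
    end_s: float    # clip end within the source, in seconds


@dataclass
class Composition:
    clips: List[Clip]
    reframe: str = "face"   # hypothetical reframing mode name
    captions: bool = True
    outputs: List[str] = field(default_factory=lambda: ["9:16"])


def dry_run(comp: Composition) -> List[str]:
    """Return a list of problems; an empty list means nothing was flagged."""
    errors = []
    if not comp.clips:
        errors.append("composition has no clips")
    for i, clip in enumerate(comp.clips):
        if clip.start_s < 0 or clip.end_s <= clip.start_s:
            errors.append(f"clip {i}: invalid range {clip.start_s}-{clip.end_s}")
    for ratio in comp.outputs:
        if ratio not in ASPECT_RATIOS:
            errors.append(f"unsupported aspect ratio {ratio}")
    return errors
```

Because the edit is plain data, an agent can generate it, diff it, and check it before any render is requested.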

Render

Render to 9:16, 16:9, 1:1, or 4:5 — baked from a single composition, never re-edited per platform.
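The arithmetic behind rendering several ratios from one source can be sketched as follows: for each target ratio, take the largest window of that shape that fits inside the source frame. The function `crop_size` is a hypothetical helper written for this example, and rounding down to even pixel counts is an assumption (many encoders that use 4:2:0 chroma subsampling expect even dimensions); it does not describe how CueFrame sizes its outputs.

```python
from fractions import Fraction
from typing import Tuple


def crop_size(src_w: int, src_h: int, ratio: str) -> Tuple[int, int]:
    """Largest crop of the source with the given W:H ratio, in even pixels."""
    w, h = (int(part) for part in ratio.split(":"))
    target = Fraction(w, h)

    if Fraction(src_w, src_h) > target:
        # Source is wider than the target: keep full height, trim the width.
        out_h = src_h
        out_w = int(src_h * target)
    else:
        # Source is as narrow as or narrower than the target:
        # keep full width, trim the height.
        out_w = src_w
        out_h = int(src_w / target)

    # Round each dimension down to the nearest even number (assumption).
    return out_w - out_w % 2, out_h - out_h % 2


for ratio in ("9:16", "16:9", "1:1", "4:5"):
    print(ratio, crop_size(1920, 1080, ratio))
```

For a 1920x1080 source this gives 606x1080, 1920x1080, 1080x1080, and 864x1080 respectively. The 9:16 window is a pixel narrower than the exact 607.5 because of the even rounding; a real pipeline would typically scale the crop to a standard output size afterwards.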

What makes it different

Content-aware framing: face and active-speaker tracking reframe the shot across the whole cut, rather than applying a static crop.
One compose, every aspect ratio: a single composition renders to every supported format.
Agent-native: a curated MCP surface, async-first with webhooks, with every op declarative and validatable before you spend on a render.
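The difference between a tracked reframe and a static crop can be illustrated with a small sketch: given a face centre per frame, move the crop window to follow it, smooth the motion, and clamp the window inside the frame. This uses plain exponential smoothing, a generic technique chosen for the example; it is not CueFrame's tracker, and the function name and parameters are hypothetical.

```python
from typing import Iterable, List


def crop_left_edges(face_centers_x: Iterable[float], src_w: int,
                    crop_w: int, smoothing: float = 0.8) -> List[int]:
    """Left edge of the crop window for each frame, following a face centre.

    smoothing=0 follows the face exactly; values near 1 move the window slowly.
    """
    if not 0.0 <= smoothing < 1.0:
        raise ValueError("smoothing must be in [0, 1)")
    if crop_w > src_w:
        raise ValueError("crop window is wider than the source frame")

    edges = []
    smoothed = None
    for cx in face_centers_x:
        # Exponential smoothing damps frame-to-frame jitter in the track.
        if smoothed is None:
            smoothed = cx
        else:
            smoothed = smoothing * smoothed + (1 - smoothing) * cx
        # Centre the window on the smoothed position, then clamp to the frame.
        left = smoothed - crop_w / 2
        left = max(0.0, min(left, src_w - crop_w))
        edges.append(round(left))
    return edges
```

A static crop would return the same edge for every frame; here the window drifts with the subject and stops at the frame boundary instead of running off it.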

Next steps

Quickstart

API reference
