CueFrame
An API-first, agent-native video composition substrate. Your agent brings the intent; CueFrame executes it into video.
CueFrame is a video composition API built for AI agents. The customer is a program, not a person: an agent connects over MCP, REST, or the CLI, expresses intent (clips, reframing, captions, brand), and CueFrame renders production-quality video in any aspect ratio.
It is the reliable, expressive hands — not the brain. You direct; it does the legwork.
The loop
Every job is the same shape:
Ingest
Pull footage from a public URL or generate it from a prompt. CueFrame transcribes the audio and detects faces so the edit lines up with the content.
Compose
Author the edit as data (clips, reframe, captions). The Director can draft an ensemble of candidate edits, and a judge scores them. Dry-run validation is free.
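To make "the edit as data" concrete, here is a minimal sketch of what a declarative composition and a local dry-run check might look like. Every name in it (`Clip`, `Composition`, `dry_run`, the `reframe` mode string) is hypothetical and chosen for illustration; it is not CueFrame's actual schema or client, only the general shape of validating an edit before paying for a render.

```python
from dataclasses import dataclass, field

# The four output ratios named in this document.
ASPECT_RATIOS = {"9:16", "16:9", "1:1", "4:5"}


@dataclass
class Clip:
    source: str   # hypothetical: URL or ID of ingested footage
    start: float  # seconds into the source
    end: float    # seconds into the source


@dataclass
class Composition:
    clips: list[Clip]
    reframe: str = "speaker"  # hypothetical mode name
    captions: bool = True
    outputs: list[str] = field(default_factory=lambda: ["9:16"])


def dry_run(comp: Composition) -> list[str]:
    """Return a list of problems; an empty list means the edit looks renderable."""
    errors = []
    if not comp.clips:
        errors.append("composition has no clips")
    for i, clip in enumerate(comp.clips):
        if clip.start < 0 or clip.end <= clip.start:
            errors.append(f"clip {i}: invalid range {clip.start}-{clip.end}")
    for ratio in comp.outputs:
        if ratio not in ASPECT_RATIOS:
            errors.append(f"unsupported aspect ratio: {ratio}")
    return errors


edit = Composition(
    clips=[Clip("https://example.com/interview.mp4", 12.0, 27.5)],
    outputs=["9:16", "1:1"],
)
print(dry_run(edit))  # an empty list: nothing to fix before rendering
```

Because the edit is plain data, an agent can build it, check it, and revise it in a loop without any render being started.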
Render
Render to 9:16, 16:9, 1:1, or 4:5 — baked from a single composition, never re-edited per platform.
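As a worked example of the arithmetic behind rendering one composition to several ratios, the sketch below computes the largest crop of a given ratio that fits inside a source frame. The function name `crop_size` and the round-down-to-even step are assumptions made for illustration; how CueFrame sizes its outputs is not stated here.

```python
def crop_size(src_w: int, src_h: int, ratio: str) -> tuple[int, int]:
    """Largest (width, height) with the given W:H ratio that fits in the source."""
    rw, rh = (int(part) for part in ratio.split(":"))
    if src_w * rh > src_h * rw:
        # Source is wider than the target: keep full height, trim the width.
        w, h = src_h * rw // rh, src_h
    else:
        # Source is as narrow as or narrower than the target: keep full width.
        w, h = src_w, src_w * rh // rw
    # Round down to even dimensions, which many encoders expect for 4:2:0 video.
    return w - w % 2, h - h % 2


for ratio in ("9:16", "16:9", "1:1", "4:5"):
    print(ratio, crop_size(1920, 1080, ratio))
# 9:16 (606, 1080)
# 16:9 (1920, 1080)
# 1:1 (1080, 1080)
# 4:5 (864, 1080)
```

For a 1920x1080 source, the vertical 9:16 output keeps under a third of the frame width, which is why where the crop sits matters as much as its size.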
What makes it different
- Content-aware framing: face and active-speaker tracking reframe the shot across the whole cut, rather than applying a static crop.
- One compose, every aspect ratio: output to every format from one composition.
- Agent-native: a curated MCP surface, async-first with webhooks, with every op declarative and validatable before you spend on a render.
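To illustrate the difference between a static crop and a crop that follows the subject, here is a minimal sketch that places a crop window around a tracked face position, frame by frame. It is an assumption-laden toy, not CueFrame's tracking method: `crop_left_edges` is a hypothetical helper, the face positions are taken as given, and the exponential smoothing is a generic technique swapped in to keep the window from jittering.

```python
def crop_left_edges(
    face_xs: list[float], src_w: int, crop_w: int, alpha: float = 0.2
) -> list[int]:
    """Left edge of the crop window per frame, following a smoothed face position."""
    if not face_xs:
        return []
    max_left = src_w - crop_w
    smoothed = face_xs[0]
    edges = []
    for x in face_xs:
        # Exponential smoothing: move a fraction `alpha` toward the new position.
        smoothed += alpha * (x - smoothed)
        # Center the window on the smoothed position, then clamp it to the frame.
        left = smoothed - crop_w / 2
        edges.append(int(round(max(0.0, min(left, max_left)))))
    return edges


# A face that stays at the center of a 1920-wide frame, with a 606-wide crop:
print(crop_left_edges([960, 960, 960], src_w=1920, crop_w=606))  # [657, 657, 657]
```

A static crop would return the same left edge for every frame regardless of where the speaker moves; here the window drifts toward the subject and stops at the frame boundary.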