AI Suite · Video AI

AI Video Caption

Describe what happens across a clip — per-keyframe captions aggregated into a narrative, bullets, SEO blurb, social caption, or hashtags. Add an optional topic to ground the result.

Model size: ~500 MB·Typical speed: ~15.0 s·Tier: standard
ModeSpeedQualityBest for
Keyframe captioning~keyframe countPer-scene descriptionsSummaries, accessibility

AI Video Caption needs the standard AI tier (0.5 GB model). You’re currently opted into none.

Quick start

See all 5 presets

Drop videos here — add as many as you like

or click to browse · processed one at a time · output text

Tier opt-in required before this capability runs.

Want this capability inside the main editor with layers, history, and the full AI panel?

How it works

  1. 1

    Add your video

    Drop or select the video you want to process — it stays on your device.

  2. 2

    Run the model in-browser

    AI Video Caption loads its model (~500 MB) once, caches it, then runs locally in a worker. No upload.

  3. 3

    Download the text

    Preview the result and download the text. Re-run with different settings anytime.

Common use cases

Summarising a video in textAccessibility descriptionsCataloguing footageGenerating chapter blurbs

Why it’s different

100% Private

Every model runs in your browser. Your files never leave your device — nothing is uploaded to a server.

True Alpha Channel

Exports preserve a real straight-alpha transparency channel (PNG / WebP / AVIF), not a baked-on background.

Free Forever

No account, no watermark, no credits. Open the tool and use it.

Works Offline

After the model downloads once it is cached, so the tool keeps working with no connection.

FAQ

How does it work?

It samples keyframes and captions them with the vision model, then assembles a description.

Is it free to use?

Yes — AI Video Caption is completely free. No account, no watermark, no credits, and no usage limits.

Do my files or prompts ever leave my device?

No. Everything runs locally in your browser via WebAssembly/WebGPU — there is no server that receives your files, prompts, or results.

Which browser and hardware do I need?

A modern browser. Chrome and Edge get WebGPU acceleration for the fastest results; Firefox and Safari run via WebAssembly. The model (~500 MB) downloads once, then is cached for offline use.

Can I use the results commercially?

Yes. You own everything you create — NSS makes no claim to the images, videos, or text you process or export.

Does it work on mobile?

Lightweight tools run on phones; heavier models prefer a desktop with a GPU. The tool picks the best path for your device and falls back gracefully where needed.

Where can I see a step-by-step guide?

Yes — there is a full walkthrough at /how-it-works/ai-describe-video.

Ready to try AI Video Caption?

Free, private, no signup — runs right in your browser.