A cinematic AI platform for solo filmmakers and small studios — generation, color, sound and delivery, in one calm canvas.
A cinematic AI platform that helps creators move from raw idea to finished film. Built around fast generation, intentional editing and a filmic visual language — for solo creators and small studios that ship every week.
Before any pixel got coloured, we built the bones. Low-fi wireframes lock the information architecture and prove the workflow stands up under load — five tools deep, four AI panels, one timeline, no clutter.
Designed through research, wireframing, and final UI execution. Focused on a fast, seamless creator experience.
Creators juggle multiple tools and slow workflows. Lumen solves this by combining generation, editing, and finishing into one fast, AI-powered platform.
Design a workspace where prompt-based generation, frame-accurate editing, and finishing share one mental model — so creators stay in the work, not in the tool-switching.
Solo creators stitch generation, NLE, color, audio, captions, and publishing across half a dozen apps. Every handoff loses momentum and chips away at intent.
A unified studio: generate clips inline, AI-cut to the beat, grade with curated LUTs, and ship, all addressable from one canvas. Native, not glued together.
"I lose 40 minutes a week just moving files between apps." · "AI generations look generic — I can't shape them." · "Captions look like an afterthought." · "Color grading scares me out of trying."
Filmic by default · The prompt is the entry point everywhere · Density is intimacy, not clutter · Generative is a verb, not a feature tab · Earned ornament — every accent has to mean something.
Geist for product surfaces — a clean, modern sans for readability and density. Instrument Serif as the editorial accent, used sparingly for emphasis and emotion. Geist Mono for technical metadata.
A warm cinematic system. Deep black grounds the canvas; violet, magenta and amber form a generative gradient signature; a single editorial gold marks intent.
Used for AI-touched surfaces — generation, suggestions, magic.
Active state, brand pulse. The tone that says "this is alive".
Cinematic warmth — golden hour, render success, ready-to-ship.
The room lights down. Every cinematic frame begins here.
Onboarding → generation → post-production → delivery. Every surface designed for the cinematic workflow.
Four steps from sign-in to first generation. Pick the workflow that matches today's job, import the footage you have, and the studio configures itself accordingly. No empty-canvas paralysis.
The full studio: media library left, preview centred, AI generation panel right, multi-track timeline below. Every region collapses; every action has a keyboard shortcut.
Three screens, one continuous gesture. Type the idea, watch four directions land, choose the one that matches intent — drop it on the timeline. The work never leaves the canvas.
Two tools that traditionally take days, available inline. Curated LUTs honour cinema-grade looks; the audio mixer auto-ducks music under dialogue, isolates stems, and sweetens room tone with one toggle.
Twelve looks named for the films they reference — "Golden Hour", "Neo Tokyo", "Cold Pastoral". A live waveform scope keeps you honest. Per-clip strength, never per-edit.
Dialogue, score, SFX — auto-routed by AI on import. Auto-ducking under speech, room-tone matching across cuts, broadcast-safe loudness — all defaults you can override but rarely need to.
Drop B-roll over a music track and Auto-edit lays cuts on the beat. Captions are transcribed, segmented, and styled on a single timeline alongside the picture; corrections are inline, not modal.
Pick a track, set the energy curve, hit run. Lumen aligns cuts to the rhythm, lifts emphasis on the chorus, and exposes every decision so you can override what the AI got wrong.
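One way to picture beat alignment with visible decisions is the minimal sketch below. It is not Lumen's actual algorithm: the names `CutDecision` and `snap_cuts_to_beats` and the 0.25 s tolerance are assumptions made for illustration. Each proposed cut moves to the nearest beat only if that beat is close enough, and every decision is recorded so it can be reviewed or overridden.

```python
from bisect import bisect_left
from dataclasses import dataclass


@dataclass
class CutDecision:
    original: float  # proposed cut time, in seconds
    snapped: float   # cut time actually used
    moved: bool      # True if the cut was moved onto a beat


def nearest_beat(t, beats):
    """Return the beat closest to t; beats must be sorted and non-empty."""
    i = bisect_left(beats, t)
    candidates = beats[max(i - 1, 0): i + 1]
    return min(candidates, key=lambda b: abs(b - t))


def snap_cuts_to_beats(cuts, beats, tolerance=0.25):
    """Snap each cut to the nearest beat within `tolerance` seconds (hypothetical default)."""
    beats = sorted(beats)
    decisions = []
    for t in cuts:
        target = nearest_beat(t, beats) if beats else t
        if abs(target - t) <= tolerance:
            decisions.append(CutDecision(t, target, target != t))
        else:
            # Too far from any beat: keep the cut where it was.
            decisions.append(CutDecision(t, t, False))
    return decisions


decisions = snap_cuts_to_beats([0.62, 1.4, 2.0], beats=[0.0, 0.5, 1.0, 1.5])
```

Returning a list of decisions rather than bare timestamps is what would let a timeline show which cuts were moved and offer a per-cut undo.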
Word-level timing, speaker diarisation, four caption presets that pass broadcast review. Edit the transcript and the captions follow — never the other way around.
Footage searchable by content, not filename — "wide shot, sunset, no people". Delivery presets for every channel, with adaptive transcoding and per-platform safe-area previews before render.
Auto-tagged on import. Faces, locations, camera moves, even mood. Type a phrase, get a shortlist with timecodes — no more scrubbing through bins at 2am.
Render queue with per-platform presets: TikTok 9:16, YouTube 16:9, Instagram 1:1, and broadcast at 23.976 fps. Safe areas overlay on the final preview so subtitles never end up cropped.
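A rough sketch of how a per-platform preview could check that a caption survives the crop, assuming a simple centred crop to the target aspect ratio. The `PRESETS` table, `center_crop`, and `fits` are hypothetical names for illustration, and real platform safe areas are usually tighter than the raw crop.

```python
# Hypothetical delivery presets: name -> (aspect_w, aspect_h).
PRESETS = {
    "tiktok": (9, 16),
    "youtube": (16, 9),
    "instagram_square": (1, 1),
}


def center_crop(src_w, src_h, aspect_w, aspect_h):
    """Return (x, y, w, h) of the largest centred crop with the target aspect."""
    if src_w * aspect_h > src_h * aspect_w:
        # Source is wider than the target: keep full height, trim the sides.
        w, h = src_h * aspect_w // aspect_h, src_h
    else:
        # Source is taller than (or equal to) the target: keep full width.
        w, h = src_w, src_w * aspect_h // aspect_w
    return ((src_w - w) // 2, (src_h - h) // 2, w, h)


def fits(box, crop):
    """True if rectangle `box` (x, y, w, h) lies fully inside `crop`."""
    bx, by, bw, bh = box
    cx, cy, cw, ch = crop
    return bx >= cx and by >= cy and bx + bw <= cx + cw and by + bh <= cy + ch


vertical = center_crop(1920, 1080, *PRESETS["tiktok"])
caption_ok = fits((700, 900, 500, 100), vertical)
caption_cropped = not fits((100, 900, 1700, 100), vertical)
```

A caption box that fails `fits` for any selected preset is the kind of case the overlay would flag before render.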
Reviewers leave notes that pin to a timecode and a region — not "second 47", but here, on this frame, with context. Multi-cursor presence and version history make async feel synchronous.
See who's looking at what. Pin notes to a frame or a region, not just a timecode. Resolve threads inline — the timeline keeps moving while critique happens.
Auto-snapshots on every render and at smart breakpoints — first cut, after color, after audio. Compare any two versions side-by-side or roll back without confirmation theatre.
Settings and billing don't get to be afterthoughts. They use the same type ramp, the same ratios, the same cinematic restraint as the main canvas — because trust is built where users least expect it.
Model, ratio, render preset, captioning, color science: set them once at the studio level, and every new project inherits them and every junior collaborator gets the same baseline.
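Studio-level inheritance can be sketched as a plain merge of defaults and per-project overrides. Every key and value below is a made-up placeholder, and `resolve_settings` is a hypothetical helper rather than anything in the product.

```python
# Placeholder studio defaults; keys mirror the settings listed above.
STUDIO_DEFAULTS = {
    "model": "example-model",       # hypothetical model name
    "ratio": "16:9",
    "render_preset": "youtube",
    "captioning": "preset-1",       # hypothetical caption preset id
    "color_science": "example-cs",  # hypothetical colour pipeline id
}


def resolve_settings(project_overrides, studio_defaults=STUDIO_DEFAULTS):
    """Project values win; anything not overridden falls back to the studio."""
    return {**studio_defaults, **project_overrides}


new_project = resolve_settings({})
vertical_project = resolve_settings({"ratio": "9:16", "render_preset": "tiktok"})
```

Storing only the overrides on each project means a later change to a studio default reaches every project that has not explicitly opted out.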
Generation credits visible in the topbar at all times. No surprise overages: when you reach 90% of your credits, the studio dims the spark icon and asks before continuing.
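The 90% rule can be expressed as a small state check. This is a sketch only: the state names and the `credit_state` function are assumptions, and the comparison uses integers to avoid floating-point edge cases at the threshold.

```python
def credit_state(used, total, warn_percent=90):
    """Classify credit usage as 'ok', 'confirm', or 'exhausted' (hypothetical states)."""
    if total <= 0 or used >= total:
        return "exhausted"
    if used * 100 >= total * warn_percent:
        # At or past the warning threshold: dim the icon and ask first.
        return "confirm"
    return "ok"


def may_generate(used, total, user_confirmed=False):
    """Allow generation freely below the threshold, and only with consent above it."""
    state = credit_state(used, total)
    return state == "ok" or (state == "confirm" and user_confirmed)
```

For example, `may_generate(95, 100)` is refused until the user confirms, while `may_generate(50, 100)` goes straight through.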
Six weeks from blank canvas to a 28-screen interactive prototype. Solo build, AI-assisted across discovery, IA, and visual design.
A concept exploring the future of cinematic video creation.
Designed & built at Stepikin Lab. ✦