Loom-class Realtime Video SaaS + Desktop App
A Loom-style async video platform with a native desktop recorder, auto-summarisation, searchable transcripts and resumable chunked uploads.
The founding team had a browser-only MVP that dropped recordings on shaky connections and could not handle multi-monitor capture.
Transcription ran as a cron job. Summaries took 30 minutes. Revenue was gated by a painful onboarding flow.
They needed a paid-tier product inside eight weeks to stay on plan for a Series A raise.
Shipped an Electron desktop recorder with camera, system-audio and multi-monitor capture, using native capture APIs and a resumable S3 multipart uploader over WebSocket.
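A minimal sketch of the resume step behind a multipart uploader, not the production code. It assumes the recording is split into fixed-size parts and that the client can learn which part numbers S3 already holds; `planParts`, `remainingParts` and `PartPlan` are hypothetical names.

```typescript
// Hypothetical shape for one planned part; `end` is exclusive.
interface PartPlan {
  partNumber: number; // S3 part numbers start at 1
  start: number;
  end: number;
}

// S3 requires every part except the last to be at least 5 MiB.
const MIN_PART_SIZE = 5 * 1024 * 1024;

// Split a recording of `totalBytes` into consecutive byte ranges.
function planParts(totalBytes: number, partSize: number): PartPlan[] {
  if (partSize < MIN_PART_SIZE) {
    throw new Error("partSize is below the S3 minimum of 5 MiB");
  }
  const parts: PartPlan[] = [];
  let partNumber = 1;
  for (let start = 0; start < totalBytes; start += partSize) {
    parts.push({
      partNumber,
      start,
      end: Math.min(start + partSize, totalBytes),
    });
    partNumber++;
  }
  return parts;
}

// After a reconnect, keep only the parts that were never acknowledged.
function remainingParts(plan: PartPlan[], uploaded: Iterable<number>): PartPlan[] {
  const done = new Set(uploaded);
  return plan.filter((part) => !done.has(part.partNumber));
}
```

On reconnect, the client would fetch the acknowledged part numbers (for example from an S3 ListParts call made server-side) and upload only what `remainingParts` returns, so a dropped connection costs at most one part.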
Introduced an FFmpeg stitching pipeline on a small GPU pool; chunks are transcribed as they land.
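Transcribing chunks as they land means results can come back out of order. A minimal sketch of one way to reassemble them, assuming each chunk carries a zero-based sequence index; `TranscriptAssembler` is a hypothetical name and this is not the pipeline's actual code.

```typescript
// Buffers per-chunk transcripts and releases them in sequence order.
class TranscriptAssembler {
  private pending = new Map<number, string>();
  private next = 0; // index of the next chunk the reader is waiting for

  // Store one chunk's transcript and return every segment that is now
  // contiguous with what has already been released (possibly none).
  add(index: number, text: string): string[] {
    this.pending.set(index, text);
    const ready: string[] = [];
    while (this.pending.has(this.next)) {
      ready.push(this.pending.get(this.next)!);
      this.pending.delete(this.next);
      this.next++;
    }
    return ready;
  }
}
```

Each call to `add` returns only the newly contiguous segments, so a caller can append them to the searchable transcript without waiting for the whole recording.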
Streamed summaries over Server-Sent Events using a two-stage LLM chain: speaker-labelled outline → exec-ready brief.
Rebuilt the dashboard in Next.js 15 with shared-element transitions, presence and realtime collaborative comments via Socket.io.
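A minimal sketch of how a two-stage chain can be pushed over Server-Sent Events, under stated assumptions: the two stages are injected as plain async functions standing in for the LLM calls, and the event names `outline` and `brief` are illustrative. `sseFrame`, `runSummary` and `Stage` are hypothetical names.

```typescript
// Format one Server-Sent Events message. Multi-line payloads need one
// `data:` field per line, and a blank line terminates the event.
function sseFrame(event: string, data: string): string {
  const dataLines = data
    .split(/\r?\n/)
    .map((line) => `data: ${line}`)
    .join("\n");
  return `event: ${event}\n${dataLines}\n\n`;
}

// A stage is any async text-to-text step; here it stands in for an LLM call.
type Stage = (input: string) => Promise<string>;

// Run outline -> brief and emit each result as soon as it is ready.
// `send` is whatever writes to the open HTTP response.
async function runSummary(
  transcript: string,
  outlineStage: Stage,
  briefStage: Stage,
  send: (frame: string) => void,
): Promise<void> {
  const outline = await outlineStage(transcript);
  send(sseFrame("outline", outline));
  const brief = await briefStage(outline); // stage two consumes stage one
  send(sseFrame("brief", brief));
}
```

Sending the outline before the brief starts lets the viewer read something useful while the second stage is still running.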
38 ms median ingestion latency, down from 6.2 seconds.
Zero dropped uploads across 4.9M recorded minutes in the first quarter.
72% MoM growth on paid seats after the desktop launch.
Closed Series A three weeks ahead of plan.
- System design across web + desktop + backend
- LLM pipeline: transcription, outline, exec summary, eval harness
- Observability: OpenTelemetry traces, Grafana dashboards, cost per minute
- Hiring two product engineers and onboarding AI QA