Back to browse
GitHub Repository

goes through a video and captures notes and diagrams from the conversation

17 starsPython

PlanOpticon – Extract structured knowledge from video recordings

by ragelink·Feb 15, 2026·2 points·0 comments

AI Analysis

●●SolidSolve My ProblemShip It

Smart frame filtering plus knowledge graph extraction—useful but Fireflies.ai and Otter.ai exist.

Strengths
  • Change-detection frame extraction avoids redundant analysis of static content.
  • Offline Whisper + multi-provider AI routing (OpenAI, Anthropic, Gemini, Ollama) is flexible.
  • Batch processing with cross-video knowledge graph merging addresses real workflow pain.
Weaknesses
  • Video analysis via LLM APIs scales expensively; no comparison to existing meeting intelligence tools.
  • Face filtering and diagram recreation are nice but not differentiated from tools like Fireflies.
Category
Target Audience

Knowledge workers, meeting organizers, training teams

Similar To

Fireflies.ai · Otter.ai · Sembly AI

Post Description

We built PlanOpticon to solve a problem we kept hitting: hours of recorded meetings, training sessions, and presentations that nobody rewatches. It extracts structured knowledge from video — transcripts, diagrams, action items, key points, and a knowledge graph — into browsable outputs (Markdown, HTML, PDF).

How it works:

- Extracts frames using change detection (not just every Nth frame), with periodic capture for slow-evolving content like screen shares - Filters out webcam/people-only frames automatically via face detection - Transcribes audio (OpenAI Whisper API or local Whisper — no API needed) - Sends frames to vision models to identify and recreate diagrams as Mermaid code - Builds a knowledge graph (entities + relationships) from the transcript - Extracts key points, action items, and cross-references between visual and spoken content - Generates a structured report with everything linked together

Supports OpenAI, Anthropic, and Gemini as providers — auto-discovers available models and routes each task to the best one. Checkpoint/resume so long analyses survive failures.

pip install planopticon planopticon analyze -i meeting.mp4 -o ./output

Also supports batch processing of entire folders and pulling videos from Google Drive or Dropbox.

Example: We ran it on a 90-minute training session: 122 frames extracted (from thousands of candidates), 6 diagrams recreated, full transcript with speaker diarization, 540-node knowledge graph, and a comprehensive report — all in about 25 minutes.

Python 3.10+, MIT licensed. Docs at https://planopticon.dev.

Similar Projects

AI/ML●●Solid

DocMason – Agent Knowledge Base for local complex office files

Provenance-first RAG beats anonymous text chunks, but Cursor and Continue already own this space.

Big BrainNiche Gem
Jet_Xu
1102mo ago