Back to browse
Migas – Meeting copilot with live speaker labels (no bot, no cloud STT)

Migas – Meeting copilot with live speaker labels (no bot, no cloud STT)

by blakers95·Mar 31, 2026·1 point·2 comments

AI Analysis

●●●BangerWizardrySolve My ProblemZero to One

On-device speaker ID without training beats Granola and Otter's cloud-dependent approach.

Strengths
  • TitaNet neural embeddings identify speakers without prior training
  • Real-time AI chat during meetings, not just post-meeting summaries
  • No bot joins calls, audio never leaves device unless explicitly queried
Weaknesses
  • Apple Silicon only excludes Windows and Intel Mac users
  • English-only support limits international team adoption
Category
Target Audience

Remote workers, meeting-heavy professionals

Similar To

Granola · Otter.ai · Fireflies.ai

Post Description

I used to use Granola for my meetings, but the one thing it couldn't do was tell me who said what. So I built Migas, which does voice fingerprinting on device to provide real time speaker labels.

Because it knows who's speaking in real time, the AI can do things that transcript-only tools can't: "What did Sarah commit to?", or "Based on what you know about the CTO from our last three meetings, what question should I ask right now?" It builds speaker profiles across meetings, so the context compounds over time.

It captures system audio via macOS APIs, runs speech-to-text locally using Moonshine on Apple Silicon, and does speaker identification with TitaNet neural embeddings, all on-device. The only thing that touches the cloud is AI chat, and only when you explicitly ask it a question, and then it only sends transcript text, never audio.

Built with Rust/Tauri for the native app, React for the UI, and a Python sidecar for the ML pipeline.

Works with any meeting platform (Zoom, Meet, Teams, whatever) since it just listens to system audio. No calendar access, no integrations, no account required.

Free tier has unlimited transcription. I'm a solo dev and would love feedback on what could be improved.

Similar Projects

SaaS●●Solid

Transcriptum – fast video transcription with speaker labels and summary

It pairs WhisperX-grade transcription (speaker diarization and word-level timestamps) with optional multi-LLM analysis — summaries, Q&A, sentiment, topics and even fact-checking — plus YouTube import and standard export formats. Being vendor-agnostic and offering fact-checking is a smart differentiator, but the space is crowded (Descript/Otter/etc.); clearer accuracy numbers, pricing, or unique workflow hooks would make this stand out.

Solve My ProblemSlick
lpeancovschi
104mo ago