Back to browse
GitHub Repository

CPU-only voice agent approximating Thinking Machines' Interaction Models demo

17 starsPython

Realtime voice agent that sees, hears, and interrupts – on a CPU laptop

by mrkn1·Jun 11, 2026·1 point·1 comment

AI Analysis

●●●BangerWizardryBold Bet

Replicates Thinking Machines' multimodal demo on a CPU laptop with commodity models.

Strengths
  • Four complex behaviors work end-to-end on one CPU without GPU acceleration
  • Single asyncio loop orchestrates webcam, mic, speaker, and LLM calls efficiently
  • Vision triggers like friend detection and slouch alerts run locally with YOLO11
Weaknesses
  • LLM calls still route through DeepInfra API, not fully offline
  • Demo-focused implementation needs more production hardening
Category
Target Audience

Developers experimenting with multimodal AI agents on consumer hardware

Similar To

Thinking Machines Interaction Models · LiveKit Agents · Vapi

Similar Projects

AI/ML●●●Banger

Knotch – a hub-and-spoke voice agent

Inverts Vapi Squads: many humans coordinated by one AI, not one caller handed between bots.

Big BrainZero to One
akshatvasisht
4213d ago