GitHub Repository

State-of-the-art TTS model under 25MB 😻

14,139 stars · Python

Three new Kitten TTS models – smallest less than 25MB

by rohan_joshi·Mar 19, 2026·561 points·181 comments

AI Analysis

●●● Banger · Wizardry · Niche Gem · Zero to One

SOTA expressivity at 14M parameters beats cloud models for on-device TTS.

Strengths
  • 25MB int8 quantized model runs on Raspberry Pi without GPU acceleration.
  • Eight built-in voices with adjustable speech speed via simple API.
  • 11.3k GitHub stars and active community signal real adoption momentum.
Weaknesses
  • English-only for now; multi-lingual support still on the roadmap.
  • Developer preview status means APIs may change between releases.
Target Audience

Embedded developers and mobile app builders needing on-device speech

Similar To

Piper TTS · Coqui TTS · Silero TTS

Post Description

Kitten TTS is an open-source series of tiny and expressive text-to-speech models for on-device applications. (We had a thread last year here: https://news.ycombinator.com/item?id=44807868.) Today we're releasing three new models with 80M, 40M and 14M parameters.

The largest model has the highest quality. The 14M variant reaches new SOTA in expressivity among similarly sized models, despite being <25MB in size. This release is a major upgrade from the previous one and supports English text-to-speech applications in eight voices: four male and four female. Most models are quantized to int8 + fp16, and they use ONNX for runtime. The models are designed to run anywhere, e.g. Raspberry Pi, low-end smartphones, wearables, browsers, etc. No GPU required! This release aims to bridge the gap between on-device and cloud models for TTS applications. A multilingual model release is coming soon.
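As a rough sanity check on the size claim, the back-of-the-envelope sketch below estimates raw weight storage for the three parameter counts named in the post at different precisions. It assumes every weight is stored at a single precision and ignores graph metadata, voice embeddings, and any other file overhead, so the real model files will differ. The helper names (`weights_size_mb`, `size_table`) are hypothetical and not part of Kitten TTS.

```python
# Back-of-the-envelope estimate of raw weight storage.
# Assumption: all weights share one precision; file overhead is ignored.

# Parameter counts as stated in the post.
PARAM_COUNTS = {"80M": 80_000_000, "40M": 40_000_000, "14M": 14_000_000}

# Bytes needed to store one parameter at each precision.
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "int8": 1}


def weights_size_mb(num_params: int, bytes_per_param: int) -> float:
    """Approximate weight size in megabytes (1 MB = 10**6 bytes)."""
    return num_params * bytes_per_param / 1e6


def size_table() -> dict:
    """Map each model name to its estimated size at every precision."""
    return {
        name: {
            precision: weights_size_mb(count, width)
            for precision, width in BYTES_PER_PARAM.items()
        }
        for name, count in PARAM_COUNTS.items()
    }


if __name__ == "__main__":
    for name, sizes in size_table().items():
        row = ", ".join(f"{p}: {mb:.0f} MB" for p, mb in sizes.items())
        print(f"{name} -> {row}")
```

Under these assumptions, 14M parameters take about 14 MB at int8 and about 28 MB at fp16, so a file under 25MB is plausible only if most weights are stored as int8. The post does not say how the int8 and fp16 portions are split.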

On-device AI is bottlenecked by one thing: a lack of tiny models that actually perform. The goal is to open-source more models to run production-ready voice agents and apps entirely on-device. Would love your feedback!

Similar Projects

AI/ML · ●● Solid

TTS.ai

Twenty-seven open-source TTS models in one UI with no signup required for the free tier.

Slick · Crowd Pleaser
nadermx
302mo ago