Back to browse
GitHub Repository

Neutral, reproducible benchmark for local LLMs on Apple Silicon (Mac · iPhone · iPad) — MLX, llama.cpp, CoreML, Apple Foundation Models

29 starsSwift

iPhone ANE holds LLM tok/s while MLX and LiteRT thermal-throttle

by mlboy·Jun 4, 2026·1 point·0 comments

AI Analysis

●●●BangerDark HorseBig BrainSolve My Problem

LiteRT beats MLX on Gemma memory while CoreML sips power on the Neural Engine.

Strengths
  • Automated `devicectl` headless mode removes manual testing friction on iOS devices.
  • Compares Google LiteRT against Apple MLX and CoreML on mobile hardware.
  • Reveals Neural Engine memory efficiency versus GPU throughput tradeoffs clearly.
Weaknesses
  • "iPhone 17 Pro" label raises eyebrows since the device doesn't publicly exist.
  • Limited model coverage favors Gemma and Qwen, needs broader architecture testing.
Category
Target Audience

iOS AI developers, Edge ML engineers

Similar To

MLC Bench · Llama.cpp Benchmarks · Perfetto

Similar Projects

AI/ML●●Solid

Running Gemma 4 on an iPhone 13 Pro

Clean Swift wrapper for Gemma 4 with vision and audio on iPhone.

Niche GemShip It
dengjiuhong
102mo ago
AI/ML●●●Banger

A tiny C program where an LLM rewires its DAG while running

LLM mutates the workflow DAG mid-run via a constrained four-verb grammar.

Big BrainWizardryZero to One
mrkn1
1541mo ago