GitHub Repository

Kubernetes scanner that discovers LLMs running on vLLM and extracts their deployment and runtime facts.

11 stars · Python

Piqc – GPU waste scanner for LLM inference clusters

by paralleliq · Jun 2, 2026 · 3 points · 0 comments

AI Analysis

●● Solid · Solve My Problem · Slick

Read-only GPU waste scanner that finds 20-40% waste in cluster spend without agents or sidecars.

Strengths
  • Detects three waste types standard monitoring tools miss: idle allocation, tier misplacement, dark capacity
  • No permanent installation required - runs as a Job, prints results, exits cleanly
  • Dollar-estimate output makes waste immediately actionable for budget discussions
Weaknesses
  • Only surfaces problems without automated remediation or optimization suggestions
  • Competes with established tools like Kubecost and Run:ai with deeper integrations
Target Audience

ML engineers, DevOps teams running GPU clusters

Similar To

Kubecost · Run:ai · VPA

Similar Projects

AI/ML · ●●● Banger

Browser-Native GPU Sharing

Browser-based GPU cluster for LLM inference with HTTP API and SSE broker coordination.

Wizardry · Zero to One · Bold Bet
bilekas
1213d ago