Save tokens from Claude subscriptions now allowed
Rotates Claude OAuth tokens with auth precedence handling before API key fallback.

Users bring their own API keys so indie devs skip inference bills entirely.
Indie developers building AI side projects without subscription revenue
LiteLLM · Helicone · Portkey
This has pushed me toward cheaper, less capable models and made me hesitate to even explore certain ideas. I don’t want every side project to become another $20/month subscription, but I also can’t compete with VC-backed companies willing to subsidize inference costs.
Then I had this idea: what if users simply paid for their own inference?
This already happens in some apps through locally configured API keys, but could the model be extended? If users bring their own AI account, developers can build AI-powered products without taking on variable inference costs. How many AI applications shouldn’t be subscription businesses at all?
The challenge is that developers don’t want to handle user API keys, users don’t want to hand them out to every app they try, and nobody wants the friction of collecting payment methods just to pass through inference costs.
That’s the backstory to my latest side project, infer0.com.
It's a bit like SSO for AI inference. Users connect their AI providers once, and apps use auth tokens to request inference through infer0. Developers don’t manage API keys or pay model costs themselves, while users can bring the same AI accounts across multiple applications.
This may be a terrible idea, both because nobody will trust it and because I’m sure there are risks around handling user API credentials that I haven’t fully appreciated. But I felt the need to build it. So here’s a rough first pass.
Rotates Claude OAuth tokens with auth precedence handling before API key fallback.
OAuth proxy workaround when OpenAI already sells API access directly.
OAuth relay for AI subscriptions, but relies on unofficial device-code flows.
In-process key rotation with state machine simplicity instead of LiteLLM/Redis overhead.
LAN sharing with per-user keys beats single-machine Claude proxies.
Transparently routes Claude Code traffic to $5 OpenCode Go models with automatic fallback chains.