Cost.dev (YC W21) – making agents cost-aware and 79% cheaper to call

by akh·Jun 4, 2026·48 points·29 comments

Visit Project View on HN

AI Analysis

●●SolidBig BrainSolve My Problem

79% token reduction for agent CLI calls with predicate pushdown and efficient output.

Strengths

•Benchmark harness with 16 questions across 1,171 resources surfaced real cost gaps
•Predicate pushdown into CLI eliminates jq pipelines for agent callers
•Measurable 67% API cost reduction versus bare-Claude baseline

Weaknesses

•Established tool (5 years old) making incremental optimization, not novel product
•Agent-specific optimization may not matter for human-only CLI workflows

Post Description

We launched Infracost on HN five years ago (https://news.ycombinator.com/item?id=26064588) where our CLI generated cost estimates for infra-as-code, e.g. "this Terraform PR adds $400/mo". The idea was to shift cloud costs (FinOps) left so engineers get visibility of costs before deployment and make better decisions.

Earlier this year we started seeing agent traffic in our logs and it looked like coding agents were calling our CLI. But that CLI wasn't designed with coding agents in mind. We went down a philosophical rabbit hole to see if a CLI is even needed anymore given that Claude, Copilot et al. already follow best practices. Ultimately we decided to create a new CLI from the ground up with coding agents in mind for two reasons:

1. We optimized the CLI for agent callers and cut Claude's output token usage by up to 79% and API cost by up to 67% versus a bare-Claude baseline. We wrote a blog documenting our lessons on optimizing user token usage when designing a CLI, e.g. using predicate flags so the agent doesn't compose jq | python | wc pipelines, output format that strips JSON's redundant field names. The blog is here: https://www.infracost.io/resources/blog/we-cut-claude-s-toke...

2. With cloud costs, precision matters. Telling a coding agent "make this Terraform cost-optimized" can be expensive and lossy. You burn tokens loading code and policy context into every conversation. Your agent could make up a price and you wouldn't know because it's difficult to verify that across the ~10M price points that AWS, Azure and Google have. The CLI runs static analysis on the code, uses the latest prices from cloud vendors, and passes that context to the coding agent.

So that's what we're launching today - Cost.dev:

- It runs locally. Your code never leaves your machine, you get a fast feedback loop, and you're not burning API calls per character when you want to fetch prices. - The CLI does the deterministic work. Fetching price points, scanning the code, validating fixes. The coding agent does the natural-language part. You don't have to trust the LLM to remember the rules, and can verify it called the right CLI command. - It provides a consistent rule layer across every tool you use. Get cost estimates in your IDE and your coding agent with a single install. We support Claude Code, GitHub Copilot, Cursor, Windsurf, OpenAI Codex, Gemini CLI, as well as IDEs like VS Code and JetBrains

Before we keep building more in that direction, I want to sanity-check with HN: is "agents writing IaC in prod" actually a thing yet, or am I betting on a future that's still a year out? I know software developers are using coding agents heavily, but are platform/infra folks doing that for prod too? Also, if you have any feedback on Cost.dev, I'd love to hear it.

Similar Projects

Developer Tools●●●Banger

Tilth v0.4.1 – 29% cheaper Sonnet, 22% on Opus (benchmark: 114 runs)

Instruction tuning on tool descriptions cut Sonnet costs 29% without code changes.

Big BrainSolve My Problem

jahala

204mo ago

Developer Tools●●Solid

Sift – save AI tokens in Codex/Claude by summarizing command output

Nested agent summarization cuts token costs ~45% for command-heavy workflows.

Solve My ProblemNiche Gem

piotrwilczek

231mo ago

SaaS●Mid

GlobCall – Agentic AI voice agents that make real international calls

Web-based VoIP with AI agent support competing against Skype and Twilio.

Ship ItSolve My Problem

makepostai

103mo ago

Developer Tools●●●Banger

I built an AI coding agent 50% cheaper than Claude Code (same prompts)

Minifies code before sending to LLM, slashing Claude Code costs by 50% in benchmarks.

Big BrainSolve My Problem

kirby88

422mo ago

Security●●●Banger

Kontext.dev – Runtime Credentials for Agents

Scoped runtime credentials for AI agents replace insecure .env API keys.

Solve My ProblemBig BrainShip It

michiosw

613mo ago

AI/ML●●Solid

LLM-use – cost-effective LLM orchestrator for agents

Smart local‑first routing that only escalates to expensive cloud planners when necessary is the standout idea — combined with per‑run cost accounting and full Ollama offline support it solves a real operational itch. The repo is a pragmatic, CLI/TUI-focused toolkit (scraping + cache, MCP server mode) that feels useful for teams wanting a no‑friction orchestrator, but it’s playing in a crowded space of agent frameworks so the novelty is incremental rather than revolutionary.

Niche GemBig Brain

justvugg

214mo ago