GitHub Repository

Claude Code for prompt eval

21 starsPython

Promptloop – create, run, and improve prompt evals from the terminal

Name: Promptloop – create, run, and improve prompt evals from the terminal
Availability: InStock
Author: velapod

by velapod·May 29, 2026·13 points·3 comments

Visit Project View on HN

AI Analysis

●●SolidShip ItNiche Gem

Terminal-native prompt evals with diff proposals beats web dashboards.

Strengths

•CLI-native workflow keeps eval context inside the terminal session.
•Proposes prompt diffs instead of blind edits for safer iteration.
•SQLite chat.db preserves conversation checkpoints across eval runs.

Weaknesses

•Built on LangChain means orchestrating existing tools, not novel infra.
•Prompt evaluation space crowded with LangSmith, Promptfoo, Arize.

Post Description

a CLI agent for prompt evaluation loopsw

Similar Projects

Developer Tools●●Solid

HermesBench – workflow reliability evals for personal AI agents

Whole-agent evals beat model-only benchmarks, but only one baseline published so far.

Big BrainShip It

verkyyi26

2022d ago

Developer Tools●Mid

Agent-evals – Claude skill to build your own evals

Claude Skill for agent evals, but LangSmith and Arize already own this.

Solve My Problem

sauercrowd

911mo ago

AI/ML●Mid

Is it art? An art project for AI agents

Fascinating art experiment, but more novelty than tool developers would actually use.

Rabbit HoleBold Bet

is-it-art

201mo ago

Developer Tools●●●Banger

Evals Skills

Install eval pipelines via npm instead of reading docs, saving hours of setup.

Big BrainSolve My Problem

jangletown

403mo ago

Developer Tools●●Solid

Agent-skills-eval – Test whether Agent Skills improve outputs

Lightweight A/B testing for SKILL.md files when LangSmith feels too heavy.

Solve My ProblemShip It

darkrishabh

79371mo ago

Developer Tools●●Solid

TruLayer – tracing, evals, and a control loop for production LLMs

Automated rollback on regression is a killer feature LangSmith doesn't have.

Solve My ProblemSlick

trulayer

2029d ago