Digest AI vs HN About

GitHub Repository

Pipevals is the visual pipeline builder for evaluation-driven AI development. Build evaluation graphs. Run them against datasets. Track quality over time.

14 starsTypeScript

Pipevals – a visual pipeline builder for evaluation-driven AI

by tilt·Mar 20, 2026·6 points·2 comments

Visit Project View on HN

AI Analysis

●MidShip ItBold Bet

Early learning project in a crowded eval space dominated by LangSmith and Arize.

Strengths

•Visual drag-and-drop canvas for composing evaluation graphs with typed step nodes
•Human-in-the-loop review steps with configurable rubrics and multi-reviewer aggregation
•Durable execution on Vercel Workflow with parallel branch support

Weaknesses

•Author admits it's early and rough — zero stars, zero forks, learning project energy
•LLM evaluation is saturated with LangSmith, Arize Phoenix, TruLens, and Braintrust

Category

Target Audience

ML engineers, AI product teams building evaluation workflows

Similar To

LangSmith · Arize Phoenix · Braintrust

Post Description

Hey HN! Pipevals is early and rough (this is a learning project), but usable.

It currently lets you: - build evaluation pipelines as graphs - run them against datasets - track how output quality changes over time

Similar Projects

Developer Tools●●●Banger

Blacknode – Visual workflow builder Claude can drive via MCP

Agents build their own workflows through typed MCP tools instead of guessing fragile JSON graphs.

WizardryBig BrainZero to One

temiroff

301mo ago

AI/ML●●Solid

ThinkLLM, A knowledge graph of AI models (HTTPS://thinkllm.dev)

Hugging Face but organized by use case instead of architecture, with model comparisons.

SlickSolve My Problem

gkanellopoulos

1029d ago

AI/ML●●Solid

A tool to create and evaluate document processing pipelines for RAG

LLM-as-judge metrics beat guessing chunk sizes, but Ragas and LangSmith already exist.

Solve My ProblemSlick

martimchaves

202mo ago

Developer Tools●Mid

HelixDB Explorer – A macOS GUI for HelixDB

Pretty graph DB GUI, but HelixDB adoption is niche and unproven market demand.

Eye Candy

jomamax

304mo ago

Developer Tools●●Solid

Incorporator, Turn any API/File into typed Python graph with pipeline

Dynamic Pydantic models beat manual schemas for messy API responses.

Big BrainShip It

PyPlumber

311mo ago

AI/ML●Mid

GDL – I built an AI-powered invention engine

83k LOC is impressive, but chaining LLMs to evaluate LLM output isn't novel architecture.

Bold BetShip It

Whyachi

202mo ago