Hey HN!
Pipevals is early and rough (this is a learning project), but usable. It currently lets you:

- build evaluation pipelines as graphs
- run them against datasets
- track how output quality changes over time