Stop Evaluating LLMs on Vibes(github.com) |
Stop Evaluating LLMs on Vibes(github.com) |
Docs and other resources at: https://www.trulens.org/
Real evaluations (maybe evaluated by LLMs themselves) instead of just vibe checks is the next big step we need to take as an industry.