Trust at scale: Auto-evaluation for high-stakes LLM accuracy(blog.elicit.com)6 points by stuhlmueller 1 year ago | 0 commentsNo comments yet