Adding Error Bars to Evals: A Statistical Approach to Language Model Evaluations(arxiv.org)2 points by mnk47 1 year ago | 0 commentsNo comments yet