MLE-Bench: Evaluating Machine Learning Agents on Machine Learning Engineering(openai.com)3 points by hlynurd 1 year ago | 0 commentsNo comments yet