A benchmark of expert-level academic questions to assess AI capabilities – HLE (nature.com)
2 points by tufo 80 days ago | 0 comments