Zork-bench: An LLM reasoning eval based on text adventure games (lowimpactfruit.com)
2 points by nicholasjbs 43 days ago | 0 comments