Show HN: CATArena – Evaluating LLM agents via dynamic enviroment interactions(github.com)3 points by jinqueeny 139 days ago | 0 commentsNo comments yet