Odysseys: Benchmarking Web Agents on Realistic Long Horizon Tasks (odysseys-website.pages.dev)
1 point by cmitsakis 30 days ago | 0 comments