Ask HN: What are some good benchmarks for different agent harnesses? | Dark Hacker News