Show HN: Open Operator Evals – real-world benchmarks for LLM web agents | Dark Hacker News