Systematically Auditing AI Agent Benchmarks with BenchJack | Dark Hacker News