Agent-evals: Overlap, boundary, and metacognitive scoring for coding agents | Dark Hacker News