Coding evals are broken. CI is green while AI code quality goes unmeasured | Dark Hacker News