GRPO Judge Experiments: Findings and Empirical Observations | Dark Hacker News