PostTrainBench: How well can AI agents post-train language models? | Dark Hacker News