DeepSWE: Measuring coding agents on original, long-horizon engineering tasks | Dark Hacker News