Evaluating the GPT-5 Series on Custom Benchmarks | Dark Hacker News