Benchmarking GPT-4 Turbo – A Cautionary Tale | Dark Hacker News