Benchmarking multilingual long-context language models | Dark Hacker News