MCPMark: A LLM Benchmark based on real-world use cases (in Notion, Playwright..) | Dark Hacker News