Ask HN: How are you evaluating your LLMs in production? | Dark Hacker News