Ask HN: How are engineers evaluating non-deterministic ML/LLM-based deployments?