Ask HN: How are engineers evaluating non-deterministic ML/LLM based deployments? So, we process data as well as documents from various sources, then,
How do engineers evaluate such systems?
Any good approach for writing evaluations for these? |
No comments yet