Testing and Evaluating Large Language Models in AI Applications | Dark Hacker News