Got tired of manually parsing all my chatGPT logs. So I built a real-time hallucination detector for my logs in production. Now instead of manually trying to figure out which, of my hundreds of logs, were bad responses (invented new facts, refused to answer, etc.) I can just get chatGPT to flag them for me. How does it work? Bettershot aims to detect 3 things:
If relevant then,
We do this by using chatgpt (currently gpt-3.5-turbo-16k) to evaluate each prompt-response pair 5 times, sampling the most frequent result (e.g. if it evaluated it to 'True' 4 times out of 5, then it's probably a good response).Check out the repo to know more https://github.com/ClerkieAI/bettershot |
No comments yet