Bootstrapping AI Evals from Context (Why 'Just Asking Claude' Fails)(scorable.ai)1 points by Arimbr 77 days ago | 0 commentsNo comments yet