Show HN: Auto-generate hard evaluation data for LLMs(talc.ai)14 points by matt_lee 1 year ago | 1 comment