Anthropic NLAs translate LLM activations to human-readable text for safety (presciente.com)
1 point by sebastianperezr 9 days ago | 0 comments
No comments yet