OpenAI: Investigating the consequences of accidentally grading CoT during RL(alignment.openai.com)2 points by pretext 9 days ago | 0 commentsNo comments yet