Reinforcement learning towards broadly and persistently beneficial models (alignment.openai.com)
2 points by gmays 4 days ago | 0 comments
No comments yet