Reinforcement learning towards broadly and persistently beneficial models(alignment.openai.com)2 points by spicypete 3 days ago | 0 commentsNo comments yet