| user: | xianshou |
| created: | May 1, 2011 |
| karma: | 2.8k |
| about: | I live in New York, work in finance, drink excessive amounts of coffee, and play chess. |
| 1. | |
| 2. | |
| 3. | |
| 4. | |
| 5. | Unsupervised Elicitation of Language Models(arxiv.org) |
| 6. | DeepSeek V3 0324 is now the best nonthinking model (Reddit)(old.reddit.com) |
| 7. | |
| 8. | Practical RL (Yandex Data School)(github.com) |