| user: | kumama |
| created: | February 19, 2017 |
| karma: | 5 |
| 1. | Designing dev onboarding for an agent-first world(castform.com) |
| 2. | I post-trained a model to reliably roll a die(castform.com) |
| 3. | Open-Weight Models Don't Need to Win(twitter.com) |
| 4. | |
| 5. | Pokegents: Making multi-agent coding feel like a team(castform.com) |
| 6. | |
| 7. | Do RL on a model with your vector db(cgft.io) |
| 8. | What is reinforcement learning finetuning(youtube.com) |
| 9. | |
| 10. | |
| 11. | |
| 12. | 339 days ago | discuss |
| 13. | |
| 14. | |
| 15. | |
| 16. | |
| 17. | 1 year ago | discuss |