user: | veryluckyxyz |
created: | September 30, 2014 |
karma: | 546 |
1. | Hidden drivers of HRM's performance on ARC-AGI(arcprize.org) |
2. | |
3. | Deep Think with Confidence(jiaweizzhao.github.io) |
4. | |
5. | Easily Understand Rdma Technology(naddod.com) |
6. | |
7. | |
8. | Building and better understanding vision-language models (2024)(huggingface.co) |
9. | HF smolagents computer-agent demo(huggingface.co) |
10. | |
11. | |
12. | Retrieval with Learned Similarities(arxiv.org) |
13. | The Curse of Depth in Large Language Models(arxiv.org) |
14. | Looking Back at Speculative Decoding(research.google) |
15. | Long-Context GRPO(unsloth.ai) |
16. | |
17. | |
18. | Process Reinforcement Through Implicit Rewards(curvy-check-498.notion.site) |
19. | |
20. | Phi-4 Technical Report(arxiv.org) |
21. | Alignment Faking in LLMs [pdf](assets.anthropic.com) |
22. | |
23. | |
24. | |
25. | Random Matrix Theory in Machine Learning Tutorial(random-matrix-learning.github.io) |
26. | |
27. | Double Descent Demystified(arxiv.org) |
28. | Synthetic Continued Pretraining(arxiv.org) |
29. |