| user: | matt_d |
| created: | April 21, 2014 |
| karma: | 18.7k |
| 1. | Cloud RAM(mikekohn.net) |
| 2. | Triton Linear Layout: Examples(lei.chat) |
| 3. | When XLA Isn't Enough: From Pallas to VLIW with Splash Attention on TPU(patricktoulme.substack.com) |
| 4. | Warp Specialization in Triton: Design and Roadmap(pytorch.org) |
| 5. | |
| 6. | |
| 7. | |
| 8. | 6 days ago | discuss |
| 9. | |
| 10. | |
| 11. | |
| 12. | Are DBMS Researchers Making Correct Assumptions about Transaction Workloads?(muratbuffalo.blogspot.com) |
| 13. | vLLM: An Efficient Inference Engine for Large Language Models(www2.eecs.berkeley.edu) |
| 14. | |
| 15. | SMTMSMT: Gluing Together CVC5 and Z3 Nelson Oppen Style(philipzucker.com) |
| 16. | |
| 17. | |
| 18. | Oral History of Jeffrey Ullman [video](youtube.com) |
| 19. | CPU Autoscaling with a Kernel of Truth(dl.acm.org) |
| 20. | |
| 21. | |
| 22. | |
| 23. | |
| 24. | |
| 25. | |
| 26. | |
| 27. | |
| 28. | |
| 29. | |
| 30. | Testing and Benchmarking of AI Compilers(broune.com) |