Prune and Distill Llama-3.1 8B to an Nvidia Llama-3.1-Minitron 4B | Dark Hacker News