kashifr | Dark Hacker News

user:	kashifr
created:	March 11, 2011
karma:	750
about:	https://github.com/kashif https://twitter.com/krasul

1.	The ultimate guide to RL environments: building and scaling them in the LLM era (huggingface.co) 7 points by kashifr 13 days ago | 0 comments
2.	Distilling 100B+ Models 40x Faster with TRL (huggingface.co) 13 points by kashifr 35 days ago | 0 comments
3.	Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries (huggingface.co) 2 points by kashifr 66 days ago | 0 comments
4.	Transformers V5 is out! (github.com) 10 points by kashifr 112 days ago | 0 comments
5.	The Smol Training Playbook: The Secrets to Building World-Class LLMs (huggingface.co) 265 points by kashifr 200 days ago | 19 comments
6.	Unlocking On-Policy Distillation for Any Model Family (huggingface.co) 6 points by kashifr 200 days ago | 1 comment
7.	Transformers 4.55 New OpenAI GPT OSS (github.com) 2 points by kashifr 286 days ago | 1 comment
8.	Smollm3: Smol, multilingual, long-context reasoner LLM (huggingface.co) 388 points by kashifr 314 days ago | 79 comments
9.	Epic vs. Apple (twitter.com) 7 points by kashifr 1 year ago | 0 comments
10.	AIMO (AI Math Olympiad) progress prize winning solution (huggingface.co) 9 points by kashifr 1 year ago | 0 comments
11.	MaPO: A reference-free alignment technique for diffusion models (mapo-t2i.github.io) 2 points by kashifr 1 year ago | 1 comment
12.	OpenHermesPreferences: Dataset of ~1M AI preferences from teknium/OpenHermes-2.5 (huggingface.co) 7 points by kashifr 2 years ago | 1 comment
13.	HuggingFace Training Cluster as a Service (huggingface.co) 101 points by kashifr 2 years ago | 45 comments
14.	HuggingFace 235M series D at a $4.5B valuation (twitter.com) 3 points by kashifr 2 years ago | 0 comments
15.	Fine-tune Llama 2 with DPO (huggingface.co) 3 points by kashifr 2 years ago | 0 comments
16.	QLoRA 4-bit finetuning of LLMs (github.com) 7 points by kashifr 2 years ago | 1 comment
17.	StackLlama: A hands-on guide to train LlaMa with RLHF (huggingface.co) 165 points by kashifr 3 years ago | 38 comments
18.	HuggingFace Diffusers 0.2 with Stable Diffusion pipeline (github.com) 2 points by kashifr 3 years ago | 1 comment
19.	Diffusers: Modular Diffusion model library from HuggingFace (github.com) 47 points by kashifr 3 years ago | 5 comments
20.	Generic Neural Elastic Search (gnes.ai) 4 points by kashifr 6 years ago | 0 comments