gpjt | Dark Hacker News

user:	gpjt
created:	January 12, 2009
karma:	1.8k
about:	https://www.gilesthomas.com/

1.	Building a Jax training loop for an LLM training run(gilesthomas.com) 1 points by gpjt 2 days ago \| 0 comments
2.	Thoughts on Role Confusion(gilesthomas.com) 3 points by gpjt 8 days ago \| 0 comments
3.	Flax debugging: making a hash of things(gilesthomas.com) 2 points by gpjt 15 days ago \| 0 comments
4.	10Gb/s Ethernet: switching to a Broadcom SFP+ module(gilesthomas.com) 195 points by gpjt 16 days ago \| 170 comments
5.	Jax: Commitment Issues(gilesthomas.com) 4 points by gpjt 17 days ago \| 0 comments
6.	Jax Back Ends and Devices(gilesthomas.com) 2 points by gpjt 27 days ago \| 0 comments
7.	Using Safetensors with Flax(gilesthomas.com) 2 points by gpjt 27 days ago \| 0 comments
8.	First Looking into Jax(gilesthomas.com) 3 points by gpjt 33 days ago \| 0 comments
9.	33 days ago \| discuss
10.	10Gb/s Ethernet: using mini-heatsinks with a 10GBASE-T SFP+ module(gilesthomas.com) 3 points by gpjt 45 days ago \| 0 comments
11.	10Gb/s Ethernet: what I did to get it working in my home(gilesthomas.com) 232 points by gpjt 64 days ago \| 177 comments
12.	10Gb Ethernet: what I had to (re)learn(gilesthomas.com) 1 points by gpjt 65 days ago \| 1 comment
13.	LLM from scratch, part 33 – what I learned from the appendices(gilesthomas.com) 5 points by gpjt 71 days ago \| 0 comments
14.	LLM from scratch (32l) – Interventions: updated instruction fine-tuning results(gilesthomas.com) 1 points by gpjt 72 days ago \| 0 comments
15.	How an LLM becomes more coherent as we train it(gilesthomas.com) 3 points by gpjt 75 days ago \| 0 comments