user: | gpjt |
created: | January 12, 2009 |
karma: | 1.3k |
about: | https://www.gilesthomas.com/ |
1. | Writing an LLM from scratch, part 22 – training our LLM (gilesthomas.com) |
2. | |
3. | Writing an LLM from scratch, part 21 – perplexed by perplexity (gilesthomas.com) |
4. | |
5. | How Do LLMs Work? (gilesthomas.com) |
6. | The maths you need to start understanding LLMs (gilesthomas.com) |
7. | What AI chatbots are doing under the hood (gilesthomas.com) |
8. | |
9. | The fixed length bottleneck and the feed forward network (gilesthomas.com) |
10. | Writing an LLM from scratch, part 17 – the feed-forward network (gilesthomas.com) |
11. | Writing an LLM from scratch, part 16 – layer normalisation (gilesthomas.com) |
12. | Leaving PythonAnywhere (gilesthomas.com) |
13. | |
14. | |
15. | Writing an LLM from scratch, part 13 – attention heads are dumb (gilesthomas.com) |
16. | Writing an LLM from scratch, part 12 – multi-head attention (gilesthomas.com) |
17. | Writing an LLM from scratch, part 11 – batches (gilesthomas.com) |
18. | The Business of the AI Labs (blog.omega-prime.co.uk) |