Building a vision language model from scratch | Dark Hacker News