Show HN: Jlama – A fast Java inference engine for GPT and Llama models(github.com)7 points by tjake 2 years ago | 1 comment