Pool spare GPU capacity to run LLMs at larger scale (github.com)
11 points by i386 100 days ago | 3 comments