Pool spare GPU capacity to run LLMs at larger scale(github.com)11 points by i386 55 days ago | 3 comments