PFlash: 10x prefill speedup over llama.cpp at 128K on a RTX 3090 (github.com)
3 points by GreenGames 17 days ago | 1 comment