Ask HN: How to store KV cache efficiently on disk? If I have a lot of precomputed KV cache values (basically just tensors), how do I store them efficiently on disk? Ideally, I would want to lookup their values by prefix quickly as well. |
No comments yet