Fooling around with encrypted reasoning blobs (blog.cryptographyengineering.com)
I always wondered how these large AI companies managed access for millions of simultaneous users without having to allocate a dedicated LLM instance for each user. Pushing the complete state down to the user after every call makes perfect sense. The LLM itself stays memoryless and ready to respond to an arbitrary prompt. Very nice.
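To make the idea concrete, here is a rough sketch of the round trip as I picture it, not how any provider actually does it. Everything in it is an assumption for illustration: `SERVER_KEY`, `seal`, `unseal`, and `handle_request` are hypothetical names, the "model" is just an echo, and the blob is only HMAC-signed (tamper-evident) using the standard library. A real service would presumably also encrypt it so the client cannot read the contents.

```python
import base64
import hashlib
import hmac
import json
import secrets
from typing import Optional, Tuple

# Hypothetical server-side secret; in this sketch it never leaves the server.
SERVER_KEY = secrets.token_bytes(32)


def seal(state: dict) -> str:
    """Serialize the state and prepend an HMAC tag so the client cannot alter it."""
    payload = json.dumps(state, sort_keys=True).encode()
    tag = hmac.new(SERVER_KEY, payload, hashlib.sha256).digest()
    return base64.urlsafe_b64encode(tag + payload).decode()


def unseal(blob: str) -> dict:
    """Verify the tag and recover the state; reject anything that was modified."""
    raw = base64.urlsafe_b64decode(blob.encode())
    tag, payload = raw[:32], raw[32:]  # SHA-256 tag is 32 bytes
    expected = hmac.new(SERVER_KEY, payload, hashlib.sha256).digest()
    if not hmac.compare_digest(tag, expected):
        raise ValueError("state blob was modified")
    return json.loads(payload)


def handle_request(prompt: str, blob: Optional[str] = None) -> Tuple[str, str]:
    """Stateless handler: all conversation state arrives in, and leaves with, the blob."""
    state = unseal(blob) if blob else {"history": []}
    reply = f"echo: {prompt}"  # stand-in for the actual model call
    state["history"].append({"prompt": prompt, "reply": reply})
    return reply, seal(state)


# The client just stores the opaque blob and sends it back on the next call.
reply1, blob1 = handle_request("hello")
reply2, blob2 = handle_request("again", blob1)
```

Since the server holds nothing between calls, any instance that has the key can pick up the second request. The price is that the blob grows with every turn and gets shipped back and forth each time.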