Prompt caching but for RL – 7.5x speedup on long-prompt/short-response workloads(castform.com)4 points by kumama 53 days ago | 0 commentsNo comments yet