RightSize runs your prompt against candidate models (Kimi, GLM, Qwen, Gemma etc.) in parallel via OpenRouter. Then it uses a stronger model as a Judge to score accuracy against a baseline model. Happy to answer questions. |
No comments yet
RightSize runs your prompt against candidate models (Kimi, GLM, Qwen, Gemma etc.) in parallel via OpenRouter. Then it uses a stronger model as a Judge to score accuracy against a baseline model. Happy to answer questions. |