We Tested 6 AI Models on 3 Common Security Exploits(blog.kilocode.ai) |
We Tested 6 AI Models on 3 Common Security Exploits(blog.kilocode.ai) |
That's a bit silly, especially since all openai models will share some elements. The points lost meaning there. They could for example use glm for all judging instead. Or go all the way and do a full matrix of everything judging everything else.