Ask HN: Is there a metric for AI code quality?

I've tried many different models, and the code coming out of them clearly differs a lot in "quality". Some of that is subjective for sure, but there are objective sides to "good" code. I wish this were a metric in the AI benchmarks so I could choose a model based on it, because honestly it's one of the things I care most about.

Problem: how can you measure such things, and what are the metrics? Maybe there just isn't a way to do it, since that metric isn't in the charts.
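
To make the "objective sides" a bit more concrete, here is a minimal sketch of one candidate proxy: an approximate cyclomatic complexity score per function, computed with Python's standard `ast` module. This assumes the model output is Python. The names `approx_complexity`, `score_source`, and `SAMPLE` are made up for illustration, and the set of node types counted as branches is my own assumption (real tools differ on the details). It is only one signal and is not a validated measure of "quality".

```python
import ast

# Node types counted as decision points (an assumption; tools differ on the exact set).
_BRANCH_NODES = (
    ast.If, ast.For, ast.While, ast.ExceptHandler, ast.IfExp, ast.comprehension,
)


def approx_complexity(func: ast.AST) -> int:
    """Rough cyclomatic complexity: 1 + number of decision points under `func`."""
    score = 1
    for node in ast.walk(func):
        if isinstance(node, _BRANCH_NODES):
            score += 1
        elif isinstance(node, ast.BoolOp):
            # `a and b and c` has three operands, which adds two extra paths.
            score += len(node.values) - 1
    return score


def score_source(source: str) -> dict[str, int]:
    """Map each function name found in `source` to its approximate complexity."""
    tree = ast.parse(source)
    return {
        node.name: approx_complexity(node)
        for node in ast.walk(tree)
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
    }


# Hypothetical model output to score.
SAMPLE = '''
def flat(x):
    return x + 1

def branchy(x, y):
    if x > 0 and y > 0:
        for i in range(x):
            if i % 2:
                y += i
    return y
'''

if __name__ == "__main__":
    print(score_source(SAMPLE))
```

You could run something like this over the same set of prompts for each model and compare the score distributions, though lower complexity does not by itself mean better code.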