Cohere's First Model for Developers(cohere.com) |
Cohere's First Model for Developers(cohere.com) |
More competition is better.
Regular Qwen 3.6 benchmarks slightly better and has much wider software support though, so this is probably of interest only to organizations which disallow models trained in China.
30B vs 35B isn't nothing either.
If it ends up just being some tweaks to someone else's weights, then meh.