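Model quantization may be the explanation. As Anthropic prepares for a new model launch, quantizing the models it currently serves would be one way to free up capacity: quantization stores weights (and sometimes activations) at lower precision, which reduces the infrastructure cost of serving AI models at the expense of some quality and accuracy.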
This NVIDIA blog explains the concept:
https://developer.nvidia.com/blog/model-quantization-concept...
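To make the trade-off concrete, here is a minimal sketch of symmetric per-tensor int8 quantization in NumPy. It is only an illustration of the general idea, not how any particular provider serves its models; the function names `quantize_int8` and `dequantize` and the random "weights" are made up for this example.

```python
import numpy as np

def quantize_int8(weights):
    """Symmetric per-tensor quantization: float32 -> int8 plus one float scale.

    Hypothetical helper for illustration only.
    """
    max_abs = float(np.max(np.abs(weights)))
    # Map the largest magnitude to 127; guard against an all-zero tensor.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    q = np.clip(np.round(weights / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    """Recover an approximation of the original float32 values."""
    return q.astype(np.float32) * scale

# Stand-in for a weight matrix (random values, not real model weights).
rng = np.random.default_rng(0)
weights = rng.normal(0.0, 0.02, size=(256, 256)).astype(np.float32)

q, scale = quantize_int8(weights)
restored = dequantize(q, scale)

memory_ratio = weights.nbytes / q.nbytes              # float32 vs int8 storage
max_error = float(np.max(np.abs(weights - restored)))  # precision lost

print(f"memory reduction: {memory_ratio:.0f}x")
print(f"max round-trip error: {max_error:.6f} (step size {scale:.6f})")
```

The int8 copy takes a quarter of the memory, but every weight is now rounded to the nearest multiple of `scale`, so each one can be off by up to half a step. That rounding error is where the quality loss comes from.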