o3-mini faster than GPT-4o-mini

1 points by cbowal 1 year ago | 0 comments

In my testing o3-mini is consistently faster than gpt-4o-mini in total response time by 10-20% even as it produces more tokens.

Demo: https://imgur.com/a/o3-faster-than-gpt-4o-mini-uPQo7wK

Do we know why it's faster? And if any of those performance improvements will come to the 4o(-mini) models?

No comments yet