Don’t use ChatGPT to solve problems (thebuild.com)
I am curious whether these tests could be written down and run against several versions of GPT (2, 3, 3.5, 4), not because I think the models will pass them, but to see whether there is a trend and to have a benchmark ready if we see 4.5 or 5. It might even be worth testing with the 'Code Interpreter' plugin.