Open-source Reflection Llama 70B beats Claude 3.5 and GPT-4 on benchmarks (reflectionllama.com)
Reflection 70B, the top open-source model
The link provided leads to a playground for Reflection Llama 70B.
However, he also points out that it has to be included in the initial training; you can't improve a non-backtrack-trained model by fine-tuning it later.
So it seems this is probably the way to go for training new models, but it has limited applicability to those already trained.
Check the questions people posted: