A Plea to the Labs: Let the Models Diagnose(tangent.bearblog.dev) |
A Plea to the Labs: Let the Models Diagnose(tangent.bearblog.dev) |
Posting it here in case anyone is interested in a non-coding perspective on model refusal behaviour.
And because I really want someone at anthropic to read this so I can test the goddamn thing.
So you get to press the big red button anyways, you just get downgraded to stupider advice, increasing the risk of catastrophy and lawsuits.
But the whole point is that the models are so good now anyways that any stupidity will most likely be from the user misunderstanding the output...which already happens all the time with humans.