Given the $10k price tag for tokens and high rate of bugs (several per minute) they mention, it'd be very interesting to see this experiment run with cheaper models too.
I wonder if we get to a world where a full repo sweep like this is a default Github action after commit.