M6 – 10T Parameters at 1% GPT-3’s Energy Cost(towardsdatascience.com)1 points by bloodcarter 4 years agoNo comments yet