Adversarial Soft Advantage Fitting: Imitation Learning Without RL | Dark Hacker News