Fail2Drive: Benchmarking Closed-Loop Driving Generalization | Dark Hacker News