Debugging misaligned completions with sparse-autoencoder latent attribution | Dark Hacker News