Physics of Language Models: The Magic of Canon Layers | Dark Hacker News