T-Mac: Low-bit LLM inference on CPU/NPU with lookup table | Dark Hacker News