How to accelerate GPT-scale models for AI inference | Dark Hacker News