Estimating required GPU memory for serving LLMs | Dark Hacker News