Run Llama locally on CPU with minimal APIs between you and the model | Dark Hacker News