Pure Go hardware accelerated local inference on VLMs using llama.cpp(github.com)1 points by deadprogram 191 days ago | 0 commentsNo comments yet