Dark Hacker News
new
|
best
|
ask
|
show
|
jobs
Practical Llama 3 inference in Java | Dark Hacker News
Practical Llama 3 inference in Java
(github.com)
4 points
by
mukel
2 years ago
| 1 comment
mukel
2 years ago
|
next
[−]
Llama3.java: featuring .GGUF file format support, Q8_0 and Q4_0 quantizations, fast matrix/vector multiplication routines using Java's Vector API; served by a simple CLI with a --chat mode to interact with the Llama 3 models.