Show HN: Nanbeige 4.1-3B running in the browser via WebGPU(huggingface.co) |
Show HN: Nanbeige 4.1-3B running in the browser via WebGPU(huggingface.co) |
I wrapped it in a simple browser demo using Transformers.js + WebGPU. It downloads the q4 ONNX weights (~1.7GB) and runs fully client-side. no server required. Falls back to WASM if WebGPU isn't available.