Hello! Earlier this week Mistral released Voxtral Mini 4B Realtime: https://huggingface.co/mistralai/Voxtral-Mini-4B-Realtime-26...

Last time I ported a TTS model to Rust using candle; this time I ported an ASR model to Rust with burn. I was able to lean on the wgpu backend to get the model running in the browser after sharding it.

Here is the HF Space: https://huggingface.co/spaces/TrevorJS/voxtral-mini-realtime

Here are the model weights (q4 + tokenizer): https://huggingface.co/TrevorJS/voxtral-mini-realtime-gguf

And here is the code: https://github.com/TrevorS/voxtral-mini-realtime-rs

I didn't have a chance to use agent teams with this project, maybe next one! :)