200ms Voice LLM(github.com) |
200ms Voice LLM(github.com) |
I expect once GPT 4o becomes available it will be awesome (even more if they “unlock” it to be asked generic questions and hold a conversation).
Edit: Yes, for sure there will be a delay of a few words or even one or two short sentences. Just like human translators. Not a problem I think.
Edit 2: Very curious why my first comment was downvoted?
HF Transformers is great for prototyping and research, but should not an interactive tool like this be based on something more speed-focused, like llama.cpp?
Any plans for languages beyond English?
Feels like there will be plenty of cases you can't just get around.