Ask HN: Most accurate ML speech-to-text API?

I'm building a project that needs reasonably good transcription with per-word timestamps and, ideally, speaker diarization. Right now I'm using Google Cloud's Speech-to-Text, but its accuracy on a Zoom call is underwhelming (around 50%). Am I likely to fare much better with Azure or AWS? What about Symbl.ai?