A few questions for @Cherian:
1. I see the ASR usage, but where does computer vision come into play?
2. Are you training and/or fine tuning asr models to deal with the speech characteristics of children and new speakers?
3. Is the asr all cloud side, or do you have it running locally in some fashion?
2. The model is primarily focused around kids, with focus on repeated words and partial words
3. ASR is local to device. We are using a focused model per book
We plan to train and fine tune through continuous beta testing with families that opt in through our research groups but not through existing learners/subscribers.
Some other team members may comment to provide more info
Anyone in Bay Area welcome to come and check out demo.
Critique us. We’d love to hear from you.