Ask HN: How can I clone myself using ML? What beginner-friendly tools and models can I use to create a backup of my voice (text 2 speech and the like), appearance (video in real time) etc? |
Ask HN: How can I clone myself using ML? What beginner-friendly tools and models can I use to create a backup of my voice (text 2 speech and the like), appearance (video in real time) etc? |
> video in real time
These two are going to be mutually exclusive for a few years, I'm afraid. Sure, the hardware investment will put you back a few thousand dollars, but running the latest temporally-stable video models is not realistic for most experienced devs, much less a beginner. If you wanted to be corny you could train a few-shot voice model to sound like you and plug it into GPT-3, but that would be neither realtime nor particularly gratifying.
It's not exactly a surprise that this is a bad idea. Now I'm going to get nightmares with my AI doppelganger repeating GPT-3 gobbledygook in monotone...
RE: resources/hardware. I'd rather go for cloud providers like Microsoft's Azure Cognitive services or some pre-trained model that I can run locally or on colab by Google.
I would like some guidance on which models to choose, tutorials and/or perhaps colab links with examples on how to use them.
I like the idea to leave a trained model with my voice to my loved ones at least for when I will be gone. I would love to have one of my dad too!