TokenVerse: Multi-Concept Personalization in Token Modulation Space by Google (token-verse.github.io)
Pretty interesting.
Seems like you could apply similar ideas to text too.
I don’t understand why they keep making these announcements and then just sitting on the results.
This is an immediately commercially useful product even as an API. You could make a mobile app for kids to “create their own cartoon story”.
Someone else will have to reproduce this for it to see the light of day.
So why not automate the attachment of images to concentrate on the more important tasks? I hope the code will be available soon!
Overall the example images still look like overly corporate slop.
It would be a huge invention, but they did not achieve that.
https://token-verse.github.io/results/multi_concepts/06.png
Both of these show a man's face in a source image being used in a newly generated image. I agree that it isn't complicated, but you seem to be drawing different conclusions to everyone else here.
If your point is that it can't perform face transfer, you seem to be wrong: that's what's happening here. If your point is that blurring the photos used for other parts of the input suggests the model may get confused by other faces, that's a fair point. But it seems clear they have demonstrated face transfer, and requiring irrelevant faces to be blurred seems minor compared to transferring the face that's intended. I'm not sure how that would really impact use cases.