Nano Banana 2 Lite(deepmind.google) |
Nano Banana 2 Lite(deepmind.google) |
Unless people prove me wrong, and they really fall for that...
Its like we used to be flooded with fisheye lens pictures of homes, that made the rooms way bigger then reality. I noticed that this trend (on the immo that i follow for years) has heavily reduced. Because nothing beats a sale, as people seeing something looking spacious on pictures and then in person seen its way more small/cramped/compact.
I love that new trend of 3D home viewing... It saves you so much time, and saves time for the immo people, filters out a lot of people with less interest.
Likewise, if I want to see AI renders of what the apartment may one day maybe look like - I can ask for it. Or make the render myself with my tool of choice. But I'll need to know what it actually looks like to do that.
Sadly Google in particular don't obey rule #1 even for machine translation, so it's going to be an uphill battle to get companies to understand.
And plus thats time the real estate agent could have spent prompting claude to cure cancer so its a double win
the actual bedroom could only fit queen size bed ;(
2 months ago while looking for apartments, the majority of the pics shown were generated by AI. The pictures generated by AI often looked much more brighter, cleaner and larger and when I visited them in person, they were the opposite. I wasted so much time visiting due to this.
I understand the intention but the pictures are so wrong most of the time and hide so much imperfection that it should be illegal for false advertisement.
It’s just used to be more expensive to hire someone to do it for you.
The altered images always e free stirs the same bright walls and grey magazine style furniture.
AI is just making it cheaper, but this was bound to happen.
(Images altered this way do have a small watermark stating so)
(Unrelated, my favorite one was getting to the apartment and learning the “bedroom” was a flex wall in the kitchen)
Users can rate how accurate the description was, the real life flaws and even upload their own photos.
Side note: last time I looked for a house I really wasted 95% of my time because every house had one unique major flaw that would have made me not even bother going to see it.
It works as advertised here, and it does behave like a distilled Nano Banana 2 with respect to certain elements such as good text rendering, which Nano Banana 1 does much worse with. It is definitely not at the level of the base Nano Banana 2 of course particularly with highly-nuanced prompts. My main criticism is that you cannot programmatically force aspect ratios with NB2L but you can with NB2.
That said, the price of $0.034/image is higher than expected since price is generally correlated with generation time, and it takes half the time to generate than a Nano Banana 1 image which costs $0.039/image. Google's assertion that you can directly replace NB1 pipelines with NB2L is fair.
Yesterday, Google announced that the Gemini app will allow free image generations (https://blog.google/innovation-and-ai/products/gemini-app/pe...) but did not specify which model would be used: I suspect it's the main motivation for Nano Banana 2 Lite.
I built an app for my kids that generates illustrated stories for them with them as the characters. I wanted to prioritize likeness while still stylizing the illustrations. I tested a bunch of models but none seem to come close to maintaining likeness when stylized. I find the others generate generic looking characters.
I'm excited to incorporate this into the onboarding of my app since I want the users to experience the aha moment as soon as possible and waiting half a minute+ isn't ideal. I'll still be using the main NB2 for the actual illustrations as this lite version still has slight issues with nuance and consistency as others have pointed out.
I want to do a writeup on ChatGPT Image 2 but at this point I don't think people care about nuanced image generation anymore...even though ChatGPT Image 2 crushes all my existing tests.
This is purely about generating images with people in them, I don’t think she’s doing any logic puzzles with gotchas and specific alignments of differently colored blocks and whatnot
Many of these ELO comparative tests (ArtificialAnalysis is guilty as hell on this as well) also have other problems such as a considerable number of "amateur judges" tending to prioritize aesthetics over actual instruction-following given the prompt.
Also (less a critique of Arena.AI necessarily), but the MAI models are so incredibly locked down (e.g. censored) as to be functionally useless. I have a sneaking suspicion its fallout from Tay.
Also a lot of the negative comments are from people who hate the very idea of AI art and want it to fail.
People making images, where the image is the focal point, want to spend more per image.
Where images are parts of reports or throwaways or demos, cheap is the better approach.
However, based off my personal experiences with general images models, Google in my opinion is the best for my workflows. Granted, I haven't tried far-east providers yet.
What does everyone else think?
Probably not for free but tbf, Google did scale "AI Mode" globally to its billion+ users, with its Gemini 3 series. Pretty much broke my habit of searching the web with pplx & Chat.
I mean 3 cents compared to 6 cents doesnt seem like much in my mind-unless youre running a consumer saas
Nano Banana is head and shoulders above the rest, but still too steep for personal use, and half off doesn't really mean much for enterprise if the results are worse. Hopefully this drives the rest to catch up at least.
Pretty soon they'll take the stickers off mowers warning people to not put their hand under it while it's running.
The world isn't in a good place...
I don't particularly mind fake furniture, but if it's very much not to scale I think it's pushing "probably fraud". And when permanent fixtures are fabricated, "blatant fraud, penalize immediately, revoke license on repeats". Using an automated tool does not absolve you of consequences, particularly one nigh-universally well-known to fabricate things.