Nano Banana 2: Google's latest AI image generation model

(blog.google)

255 points | by davidbarker 2 hours ago

38 comments

zug_zug 1 hour ago
I'm sure this has been written about but here's what happens long term - images are commoditized and lose their emotional appeal.
Probably about half of us here remember photos before the cell phone era. They were rare, and special, and you'd have a few photos per YEAR to look back on. The feel of photos back then, was at least 100x stronger than now. They were a special item, could be given as a gift. But once they became freely available that same amount of emotion is now split across many thousands of photos. (not saying this is good or bad, just increased supply reducing value of each item)
With image/art generation the same thing will happen and I can already feel it happening. Things that used to be beautiful or fantastic looking now just feel flat and AI-ish. If claymation scenes can be generated in 1s, and I see a million claymation diagrams a year, then claymation will lose its charm. If I see a million fake Tom Cruise videos, then it oversaturates my desire for desire for all Tom Cruise movies.
What a time to be alive.
[-]
- thewebguyd 27 minutes ago
  I believe this is the reason for a return to interest in analog media with both my generation (millenials) and gen-z. I do wedding photography on the side, and the past ~2 years have seen a huge increase in requests for film photography, either exclusively film or as an add-on to digital. Offering film has been one of the best things I've done for my side hustle.
  Likewise with the sort of resurgence of vinyl, and the obsession over "old" point and shoot digicams.
  [-]
  - giancarlostoro 14 minutes ago
    The best weddings I've been to had a photo booth where you can have photos printed out (any number) and texted to you. I think that's the best way to do it. I agree, people like physical photos still. I've bought my wife several different ways to print photos, including a smaller portable printer, and one of those Instant photo cameras.
  - xnx 17 minutes ago
    > huge increase in requests for film photography
    Also for VHS camcorder footage
- skerit 41 minutes ago
  > They were rare, and special, and you'd have a few photos per YEAR to look back on. The feel of photos back then, was at least 100x stronger than now. [...] But once they became freely available that same amount of emotion is now split across many thousands of photos
  I don't think I fully agree. Sure people make so many photo's that they don't have the time or the will to start looking through them all.
  You can't just whip out your phone and start scrolling through thousands of photo's with friends. It would get so boring so fast.
  But if you put some effort into making a nice little selection of the best photo's, that emotion is 100% still there.
  [-]
  - Bewelge 35 minutes ago
    So now the value is created through curation. Before it was inherent at creation. If you never curate it might seem like it lost value in comparison.
    [-]
    - Daishiman 2 minutes ago
      Curation was implicit when the cost of image creation was high and authors had to consider the photos they were taking beforehand. Now curation comes afterward.
- bryanrasmussen 3 minutes ago
  https://medium.com/luminasticity/art-as-a-tool-for-storing-m...
  "One of the primary properties of anything with Mana is a feeling of uniqueness. That one has never encountered something like this before, and therefore it is important. The uniqueness of the thing is a property that pulls you in to focus more closely, to attempt to understand more closely why the thing is unique."
- torginus 9 minutes ago
  Considering half of the memes are still rage comics drawn with MSPaint i'm kind of skeptical of this statement.
- com2kid 37 minutes ago
  > They were rare, and special, and you'd have a few photos per YEAR to look back on. The feel of photos back then, was at least 100x stronger than now. They were a special item, could be given as a gift. But once they became freely available that same amount of emotion is now split across many thousands of photos. (not saying this is good or bad, just increased supply reducing value of each item)
  I take a hundred photos on a trip, my phone uses AI (not even the new fancy AI, but old 5-10 year old stuff to detect smiling faces and people in frame) to pull out less than a dozen that are worth keeping. Once a month or so I get fed a reminder of some past trip.
  This isn't any different than before. The number of photos taken is greater, but the overall number of worthwhile photos from a given trip is about the same.
- bananaflag 57 minutes ago
  > I'm sure this has been written about
  Scott Alexander has written about it:
  https://www.astralcodexten.com/p/the-colors-of-her-coat
- electrosphere 1 hour ago
  It reminds me of the Star Wars content thats come out recently - before there was the Original Trilogy which we all watched many times and the lines became iconic. Since then it's all become a mismash and blur of mediocrity due to over-exposure.
  (except The Mandalorian, and I can't believe I'm using the word "content" :/)
  edit: Totally forgot about Andor & Rogue One sorry, great film and two seasons of top-notch storytelling.
  [-]
  - mghackerlady 57 minutes ago
    Rogue One was very good, to the point that I consider it on equal standing to the original trilogy and prequels
  - adammarples 29 minutes ago
    It's a blur of mediocrity due to its mediocrity, not its overexposure
  - ex-aws-dude 23 minutes ago
    That is something that annoys me with fandoms
    You could ask "how many more movies should we make?" and the answer would be "there is no limit, I always want more"
    "I like this thing therefore more of it is obviously better"
    I think it takes maturity to say "I like this thing and I don't want more of it."
  - hackyhacky 1 hour ago
    > except The Mandalorian,
    To each their own, but I think Andor is, by far, the best post-ROTJ output.
    [-]
    - mcny 21 minutes ago
      > To each their own,
      And that is the gist of the problem, isn't it? As we approach our forties and beyond, chances are we have lived more than half our lives. So do I really want to spend hours watching something I might hate and might leave a bad taste in my mouth? (See game of thrones season 8 or worse, Westworld the HBO series which I don't even want to know what happened in season 3 or 4). I am sure there are people who will enjoy those but for the average person it is highly unlikely.
  - TaupeRanger 1 hour ago
    Andor is fantastic. The good content still stands out. Mediocre content will have to compete with AI slop at an increasing rate.
- benterix 14 minutes ago
  > The feel of photos back then, was at least 100x stronger than now.
  I dare say, the feel of photos from back then is much stronger than of the photos taken today. See e.g.:
  https://plfoto.com/zdjecie/413363/bez-tytulu?from=autor/beak...
  https://plfoto.com/zdjecie/619173/bez-tytulu?from=autor/beak...
- Aerroon 37 minutes ago
  I don't fully agree. Perhaps you're right when it comes to images as a whole, but I think individual images themselves still capture that emotional value for me.
  Even if there were a million fake Tom Cruise movies I would still like Edge of Tomorrow (even if it had been AI made).
- rootusrootus 36 minutes ago
  > a few photos per YEAR to look back on
  I totally get this, but on the other hand, we have definitely benefited from being able to take more photos. I have some older friends (pushing 80 or so) who sucked at taking photos, so 9 of 10 photos they have from their prime adult years raising their family are blurry to the point of not recognizing the people if you don't already know who they are.
  They have great photos from the last 15-20 years, but of course they do, phone cameras are vastly superior to the point-and-shoot cameras from the 70s, and when you reflexively shoot a dozen photos every time you pose for a picture your odds are way better that one will come out clear, everyone looking at the camera, smiling, etc.
- mrandish 18 minutes ago
  > They were rare, and special, and you'd have a few photos per YEAR to look back on.
  My generation generally only had photos from birthdays, holidays, vacations, weddings, graduations and reunions. We looked at the three albums which contained every family photo often and I know them all by heart.
  My kid was born in 2009 and our family digital album has nearly 1,000 photos per year of her life. And she's seen virtually none of them and seems to have little interest in ever seeing them since she creates so many of her own photos every day which are ephemeral.
- mrbonner 22 minutes ago
  You know, all of a sudden, I am starting to lose interest in meticulously drawn Mermaid diagrams in README, perfect grammar and spelling in doc reviews, or neat generated general photographs. They are all correctly presented, of course. But the ideas are mostly wrong, too.
  I guess my stick figure hand drawn diagrams, a doc with few mistakes in grammar or spelling would be seen as more worthy to read as long as my ideas are sound. Right? :-)
- patwolf 1 hour ago
  The first time I got a photo scanner, I was blown away that I could see myself on a screen. I eventually got a digital camera, and the novelty started to wear off. Now I can make myself the lead in a blockbuster movie, but that feels boring.
- spchampion2 46 minutes ago
  It sounds like you've been reading Susan Sontag. For others, I recommend:
  - https://en.wikipedia.org/wiki/On_Photography
  - https://en.wikipedia.org/wiki/Regarding_the_Pain_of_Others
- thoughtlede 38 minutes ago
  Strictly speaking, I don't think it is the generation or creation that diminishes their value. it is the consumption.
  You said it too:
  > If I see a million fake Tom Cruise videos, then it oversaturates my desire for desire for all Tom Cruise movies.
  The trick of course is to keep yourself from seeing that content.
  The other nuance is that as long as real performance remains unique, which so far it is, we can appreciate more what flesh and blood brings to the table. For example, I can appreciate the reality of the people in a picture or a video that is captured by a regular camera; it's AI version lacks that spunk (for now).
  Note that iPhone in its default settings is already altering the reality, so AI generation is far right on that slippery axis.
  Perhaps, AI and VR would be the reason why our real hangouts would be more appreciated even if they become rare events in the future.
- _trampeltier 25 minutes ago
  A kind of the same happend to music. With a LP or a tape, you had to listen to all songs. Later with a CD you just skipped the not so good songs. And with MP3, you don't even bothered to save not so good songs. And now with TikTok etc. a song just have to be 20sec but has to bang hard for this short time.
- pancakeguy 12 minutes ago
  This is the same argument illustrators made upon the invention of photography.
- fortzi 28 minutes ago
  This.
  Unimaginable abundance may sound good (it does to me), but scarcity has value too. We might just find put that its value is too important. I just hope that if we do, it’s not too late.
  [-]
  - Mars008 16 minutes ago
    There is something that's not easy to scale: humans. Live concerts, performance, etc. They are local
- vunderba 56 minutes ago
  > If I see a million fake Tom Cruise videos, then it oversaturates my desire for desire for all Tom Cruise movies.
  I often call this over-saturation the media equivalent of semantic satiation. Anything commoditized or mass-manufactured isn't going to have emotional appeal.
  https://en.wikipedia.org/wiki/Semantic_satiation
  [-]
  - mannanj 43 minutes ago
    I've often had an "addictive" personality and now I see it as an over satiation, in a semantic way, sort of thing. When I found something I liked I would over saturate my self in it, and lose interest and move on faster than others I knew.
    Feels like what you described describes that inner personality trait better than I have heard before.
    [-]
    - vunderba 29 minutes ago
      As somebody who juggles both figuratively and literally a lot of hobbies, I can definitely relate! One of my friends is a bit like you, they tend to experience the sudden flash of interest in various hobbies, dive extremely deeply and then experience a "bit flip". (a quote describing John Romero's hot/cold personality when he worked at id Software)
      With respect to people with a consumptive addictive personality though - I really feel for them, it's a rough time to be alive.
- ChaitanyaSai 59 minutes ago
  Agree. But there are some use-cases where images can still be of huge help. Making textbooks come alive for instance. We are trying to do that and make a whole bunch of Indian textbooks into comics and free for students. (zerobyheart.com if anyone's interested and would like to make suggestions; the panel-to-panel continuity is still off and something we are working on )
- soperj 53 minutes ago
  > you'd have a few photos per YEAR to look back on
  My parents took way more photos with film than I do with my cellphone camera.
  [-]
  - obscurette 47 minutes ago
    While it wasn't really rare, it was far from common. It was almost full time hobby back then. (I grew up in sixties/seventies.)
- lukol 1 hour ago
  Don't disagree but being the social animals we are, images and videos will never not be important. Things will always feel better when I can connect it with a friendly face.
  [-]
  - EForEndeavour 52 minutes ago
    The source, personal significance, and intent of images and videos will matter a lot, though. I'll cherish photos of my family members forever, regardless of technical excellence.
    Or a photo of my freshman dorm room during exam season. Subpar image quality, lousy lighting, etc. but so many memories, positive and negative, are elicited by that fleeting glimpse from an era of excitement, boredom, stress, uncertainty, and optimism, not knowing where I was going in life, when I'd ever look back at that snapshot, but deciding on a whim to grab it during a break from cramming topics now long forgotten.
    But I roll my eyes at the idea of injecting my likeness into a short clip depicting random over-the-top action sequences, no matter how photorealistic, because I've never wanted to do that.
- 999900000999 1 hour ago
  Your photos of your dog mean nothing to me.
  I have a photo of a friend I’ve since drifted from, it’s her in her army fatigues after basic. She was had just went through a horrible divorce and that was a shining achievement for her.
  The story behind the photo is what makes it matter.
  Not the format.
  However I will agree AI is a poor substitute. You’ll have people creating AI photos of a fake marriage and fake pets in a big fake house, while they sleep in a bunk bed in a halfway house.
- Razengan 12 minutes ago
  Every time in human civilization there's a new technology, existing humans rail against it and want the Good Old Days back, existing children grow up to get used to it, the generation-to-be-born knows it as the normal baseline, then maybe future generations rediscover the past and take the best things about how things used to be without being held back by how bad they were. (see retro games made after retro games died)
  [-]
  - Bombthecat 7 minutes ago
    Yeah, pixel games are huge now.
    But I think it's more because of growing up with it have now pc, money. Not because people rediscover pixel games.
- techterrier 26 minutes ago
  Make Theatre Great Again
- Bratmon 22 minutes ago
  You're presenting this as an argument against AI, but really it's an argument against all human endeavor.
  https://xkcd.com/915/
  [-]
  - Papazsazsa 20 minutes ago
    You're presenting this as an argument against snobbery, but really it's an argument against all humanity.
- seydor 43 minutes ago
  contrary to that i use it to restore old pictures and it has increased their emotional appeal
- Mars008 21 minutes ago
  There is more to that, globalization. Now we have 8 billions humans. They are connected to the same infospace (internet) and share much more and more diverse content. Which means a lot more of emotional/interesting/helpful things. While each of them becomes less emotional.
  Well, world changes dramatically. Connected old folks are like neanderthals in big city now. However not connected are still living locally in their minds. Youngsters are just accepting the world as it is. Nobody is amused by computers and cameras anymore. (at least in developed areas)
  And with all that the worst is yet to come...
- TiredOfLife 22 minutes ago
  Probably some of us here remember paintings before the photography era. They were rare, and special, and you'd have a few painting per YEAR to look back on. The feel of paintings back then, was at least 100x stronger than now. They were a special item, could be given as a gift. But once they became freely available that same amount of emotion is now split across many thousands of photos. (not saying this is good or bad, just increased supply reducing value of each item)
- esafak 27 minutes ago
  There is still room for art. Any photographer sees lots of pictures, but can tell the good from the bad, and find pleasure. They don't dismiss photography altogether.
- gradus_ad 1 hour ago
  Exactly right. A flood of content devalues content, which devalues whatever makes that content. AI fanboys are gushing over every overhyped release and reveal but are really just showing themselves to be incapable of second order thinking and very susceptible to marketing hype.
- dfxm12 31 minutes ago
  I think you're being tricked by nostalgia. It's about the fact that of course older photos you remember have a stronger emotional tie to you (they've had more time to form that bond), and it just so happens that older photos are not digital.
  In my experience, a digital photo of myself and my partner used as the lock screen of my phone has the same emotional weight as the one sitting on my desk (which is a print out of a digital photo). Additionally, printing out a photo of you and your partner and gifting it to them has the same weight as going through childhood photo. A scrapbook of a recent vacation filled with printed digital photos evokes memories just as vividly as one from the 80s. On the flip side of this, a photo in a box in the basement has the same weight as a photo sitting in the cloud.
  I'll offer you some more food for thought: are Aardman Animations films charming because they use claymation? Or is it the creative force of people like Nick Park and Peter Lord?
- blindriver 31 minutes ago
  > images are commoditized and lose their emotional appeal.
  No, ALL CONTENT is asymptotically approaching 0. This includes photos, videos, stories, app features, even code. Code is now worthless. If you want better security from generated code, wait 2 months and it will be better. If you want a photo, you just prompt and it will generate it on the fly.
  AI will be generating movies and videos on the fly, either legally or illegally infringing on IP. Do you want a movie where Deadpool fights The Hulk? Easy. And just like how ad technology knows your preferences, each movie will be individually tailored to YOUR liking just so that your engagement will increase. Do you like happy endings? Deadpool and Hulk will join forces and defeat Thanos. Do you prefer dark endings? Deadpool and Hulk fight until they float off into the Sun and get atomized but keep regenerating for eternity.
  If you want to see a photo of you and your family from 15 years ago, it will generate slightly better versions of yourself and your wife and maximize how cute your kids look. This is the world we are facing now, where authenticity is meaningless. And while YOU may not prefer it, think about the kids who aren't born yet and will grow up in a world where this exists.
  [-]
  - imiric 6 minutes ago
    > AI will be generating movies and videos on the fly, either legally or illegally infringing on IP.
    > If you want to see a photo of you and your family from 15 years ago, it will generate slightly better versions of yourself and your wife and maximize how cute your kids look.
    Sure, but why would any of this media have any emotional significance?
    The reason we enjoy cinema is because a talented group of people had an interesting story to tell and brought it to life in a memorable way. Me, or a random person with no filmmaking talent, prompting a tool to generate a particular scene wouldn't be interesting at all. Talented individuals will also rely on this technology, of course, but a demand for human creativity will still exist, possibly even stronger than today, once everyone is exhausted from the flood of shitty Deadpool vs Hulk videos.
    The reason we enjoy media of friends and family is because it depicts a moment in the life of our loved ones. A fake image or video of them is of absolutely zero value to everyone.
    All of this seems like a neat technology in search of a problem to solve, while actually introducing countless societal problems we haven't even begun to acknowledge, let alone address.
- GaggiX 1 hour ago
  You can still buy a Polaroid, there is one factory left in the world able to produce the film required but they still make them.
  [-]
  - ctmnt 30 minutes ago
    “Still” isn’t the right word. Once Polaroid stopped making the film, closed their factories, and sold or junked their machines, their supplies did the same, and so some of the components stopped being manufactured and available for purchase. What’s sold now as Polaroid film was a reinvention of the same idea. And it’s notably not as good. The dwindling stock of unused true Polaroid film is getting absurdly expensive as a result.
    The one factory you refer to was the last one, and was purchased by the Impossible Project (now Polaroid BV). So they were able to save one set of machines. But the actual process of making the film was lost. So it’s an old set of machines making a new but similar product.
    [-]
    - GaggiX 4 minutes ago
      Yeah I know but it's still incredible that we have something like that in 2026 being produced.
- nathan_compton 1 hour ago
  People here like to say "Commoditize your Compliment" but to a company the size of google or amazon literally EVERYTHING is your compliment. Too bad no philosopher or political scientist or economist every thought about this stuff before or we might have some kind of plan to make the future less miserable and alienating.
  [-]
  - NoGravitas 8 minutes ago
    > Too bad no philosopher or political scientist or economist every thought about this stuff before
    I see what you did there and know exactly the political economist you are talking about, but if you Speak His Name, the shrieking hordes descend.
- sarreph 1 hour ago
  > They were rare, and special, and you'd have a few photos per YEAR to look back on.
  Um yeah I don't know. I fully resonate with the _emotional_ appeal here, but realistically I remember going round to people's houses to be shown analog photo albums that nobody was that bothered about seeing, because they didn't really care -- they weren't their photos.
  The special photos (a few a year) still exists in digital form.
keiferski 17 minutes ago
Some random predictions about what AI image generation tools will do/are doing to art:
1. The narrative/life of the artist becomes a lot more important. The most successful artists are ones that craft a story around their life and art, and don't just create stuff and stop. This will become even more important.
2. Originality matters more than ever. By design, these tools can only copy and mix things that already exist. But they aren't alive, they don't live in the world and have experiences, and they can't create something truly new.
3. Those that bother to learn the actual art skills, and not merely prompting, will increasingly be miles ahead of everyone else. People are lazy, and bothering to put in the time to actually learn stuff will stand out more and more. (Ditto for writing essays and other writing people are doing with AI.)
4. Taste continues to be the single most important thing. The vast, vast majority of AI art out there is...not very good. It's not going to get better, because the lack of taste isn't a technical problem.
5. Art with physical materials will become increasingly popular. That is, stuff that can't be digitized very well: sculpture, installation art, etc. Above all, AI art is uncool, which means it has no real future as a leading art form. This uncoolness will push people away from the screen and towards things that are more material.
nickandbro 1 hour ago
These image gen models are getting so advanced and life like that increasingly the general public are being duped into believing AI images are actually real (ex Facebook food images or fake OF models). Don't get me wrong I will enjoy the benefits of using this model for expressing myself better than ever before, but can't help feeling there's something also very insidious about these models too.
[-]
- WarmWash 1 hour ago
  It's more likely than not that every single person who uses the internet has viewed an AI image and taken it as real by now.
  The obvious ones stand out, but there are so many that are indiscernible without spending lots of time digging through it. Even then there are ones that you can at best guess it's maybe AI gen.
  [-]
  - WD-42 25 minutes ago
    People will continue to retreat into walled, trusted networks where they can have more confidence in the content they see. I can’t even be sure I’m responding to a real person right now.
  - tokai 1 hour ago
    Maybe not an actual argument for anything, but even before these image models everyone that used the internet had seen a doctored image they believed to be real. There was a reason that 'i can tell by the pixels' was a meme.
  - versk 1 hour ago
    At the point now where basically any photo that isn't shared by someone I trust or a reputable news organisation is essentially unverifiable as being real or not
    The positive aspect of this advance is that I've basically stopped using social media because of the creeping sense that everything is slop
  - yieldcrv 1 hour ago
    people only notice when they are prompted to look for AI or scrutinize AI
    a lot of these accounts mix old clips with new AI clips
    or tag onto something emotional like a fake Epstein file image with your favorite politician, and pointing out its AI has people thinking you’re deflecting because you support the politician
    Meanwhile the engagement farmer is completely exempt from scrutiny
    Its fascinating how fast and unexpected the direction goes
- whynotmaybe 1 hour ago
  >fake OF models
  Soon many real OF models will be out of job when everyone will be able to produce content to their personal taste from a few prompts.
  [-]
  - sodacanner 1 hour ago
    People already have access to every form of niche pornography they could dare to imagine (for absolutely free!), I really doubt that 'personal taste' is the part that makes OF models their money. They'll be fine.
    [-]
    - sosodev 1 hour ago
      I think you're under-estimating how much personal taste applies in that industry. Yes, there's a lot of free content but it's often low quality and/or difficult to find for a particular niche. The OF pages, and other paid sites, are curated collections of high quality stuff that can satisfy particular cravings repeatedly with minimal effort.
      A big part of it also the feeling of "connection" with the creator via messages and what not, but that too can be replicated (arguably better) by AI. In fact, a lot of those messages are already being generated haha.
      [-]
      - sodacanner 26 minutes ago
        I was mostly hinting towards the 'connection' part of it, yes - I think that's really where the money is made more than anything else. That's the part that'll start killing the industry once some company tunes it in.
  - sekai 44 minutes ago
    > Soon many real OF models will be out of job when everyone will be able to produce content to their personal taste from a few prompts.
    net positive to society
  - pousada 1 hour ago
    You can’t really because these powerful models are censored. You can create lewd pictures with open models but they aren’t nearly as good or easy to use.
    [-]
    - dragonwriter 51 minutes ago
      Because models can be used to alter existing images, you can use open and commercial models together in content creation workflows (and also the available findings of open models, and the ability to further tube them very specific used, are quite powerful on their own), so the censorship on the commercial models has a lot less effect on what motivated people can produce than you might think.
      I still think, even with that, that like most predictions of AI taking over any content industries, the short-term predictions are overblown.
    - coffeebeqn 1 hour ago
      I’ve seen some very high quality NSFW AI video in the last few months. Those models are not far behind and the search and training space for porn is smaller than being able to generate anything
    - sosodev 1 hour ago
      Doesn't Grok allow users to create lewd content or did they roll that back?
      Also, I suspect that we'll soon see the same pattern of open weights models following several months behind frontier in every modality not just text.
      It's just too easy for other labs to produce synthetic training data from the frontier models and then mimic their behavior. They'll never be as good, but they will certainly be good enough.
    - infecto 47 minutes ago
      Just a matter of time and open models will get there. Not once have we seen a moat across the model spectrums.
  - baal80spam 1 hour ago
    And this can't come soon enough.
    [-]
    - noumenon1111 1 hour ago
      Coming soon... YOU!
  - dfxm12 42 minutes ago
    I don't think so. Talking to people in this space, I've found out about broad camps. There are probably more:
    -They simply aren't into real women/men (so you couldn't even pay a model to do what they're looking for).
    -They want to play out fantasies that would be hard to coordinate even if you could pay models (I guess this is more on the video side of things, but a string of photos can put be together into a comic)
    -They want to generate imagery that would be illegal
    Based on this, I would guess fetish artists are more at risk than OF models. However, AI isn't free. Depending on what you're looking for commissions might be cheaper still for quite a while...
  - coldtea 1 hour ago
    And they might have to gasp! get an honest job!
    [-]
    - switchbak 1 hour ago
      I don't know much about that side of things, but I presume that's hard work! Maybe not always so honest though.
    - xfeeefeee 1 hour ago
      That's a pretty wide brush you are painting with there
- kevincox 1 hour ago
  I actually think this was a good thing. Manipulating images incredibly convincingly was already possible but the cost was high (many hours of highly skilled work). So many people assumed that most images they were seeing were "authentic" without much consideration. By making these fake images ubiquitous we are forcing people to quickly learn that they can't believe what they see on the internet and tracking down sources and deciding who you trust is critically important. People have always said that you can't believe what you see on the internet, but unfortunately many people have managed without major issue ignoring this advice. This wave will force them to take that advice to heart by default.
  [-]
  - slfnflctd 36 minutes ago
    I remember telling my parents at a young age that I couldn't be sure Ronald Reagan was real, because I'd only ever seen him on TV and never in real life, and I knew things on TV could be fake.
    That was the beginning of my journey into understanding what proper verification/vetting of a source is. It's been going on for a long time and there are always new things to learn. This should be taught to every child, starting early on.
  - manuelabeledo 1 hour ago
    > By making these fake images ubiquitous we are forcing people to quickly learn that they can't believe what they see on the internet and tracking down sources and deciding who you trust is critically important.
    Has this thought process ever worked in real life? I know plenty of seniors who still believe everything that comes out of Facebook, be AI or not, and before that it was the TV, radio, newspapers, etc.
    Most people choose to believe, which is why they have a hard time confronting facts.
    [-]
    - rootusrootus 30 minutes ago
      > I know plenty of seniors
      And not just seniors. I see people of all ages who are perfectly happy to accept artificially generated images and video so long as it plays to their existing biases. My impression is that the majority of humanity is not very skeptical by default, and unwilling to learn.
  - lm28469 1 hour ago
    I feel like there is one or two generations of people who are tech savy and not 100% gullible when it comes to online things. Older and younger generations are both completely lost imho, in a blind test you wouldn't discern a monkey from a human scrolling tiktok &co
    [-]
    - manuelabeledo 46 minutes ago
      How so? This "tech savvy and not 100% gullible" generation, gave birth to a political landscape dominated by online ragebait.
      [-]
      - lm28469 15 minutes ago
        Boomers used to tell us to never trust anything online and now they send their life savings to "Brad Pitt"
        New generations gets unlimited brain rot delivered through infinite scroll, don't know what a folder is, think everything is "an app" and keep falling for the "technology will free us from work and cure cancer"
        There was a sweet spot during which you could grow alongside the internet at a pace that was still manageable and when companies and scammers weren't trying so hard to robbyou from your time money and attention
- pancakeguy 10 minutes ago
  Surely this is a problem that we will never be able to solve.
- Havoc 1 hour ago
  Don’t think the demand for real OF is going anywhere
  [-]
  - derwiki 1 hour ago
    How do you know they’re real right now?
    [-]
    - JasonADrury 1 hour ago
      A lot of escorts have OF profiles.
- vunderba 1 hour ago
  Jaded, but if I knew there was a possibility of a bunch of incriminating footage of me (images, video, etc.) out there in the pre-AI days, I would do my absolute best to flood the internet with as many related deepfakes (including of myself) as possible.
- neogodless 47 minutes ago
  > Facebook food images or fake OF models
  What in the world is a fake OF model?
  Does "OF" stand for "of food"?
  [-]
  - bena 44 minutes ago
    It stands for "OnlyFans" a website originally for creators to engage directly with their audiences but quickly became a website where women sold explicit pictures of themselves to subscribers.
    [-]
    - sebzim4500 30 minutes ago
      TIL it wasn't created to be a porn site
- techpression 1 hour ago
  Oh we’ve seen nothing yet of the chaos that generative ai will unleash on the world, looking at Meta platforms it’s already a multi million dollar industry of selling something or someone that doesn’t exist. And that’s just the benign stuff.
- dfxm12 58 minutes ago
  This has been true for a while with digital art, photoshop, etc. Over time, people's BS detectors get tuned. I mean, scrolling by quickly in a feed, yeah, you might miss if an image is "real" or not, but if you see a series of photos side by side of the same subject (like an OF model), you'll figure it out.
  Also, using AI will not allow you to better express yourself. To use an analogy, it will not put your self-expression into any better focus, but just apply one of the stock IG filters to it.
  [-]
  - itintheory 25 minutes ago
    > a series of photos side by side of the same subject
    Cameras are now "enhancing" photos with AI automatically. The contents of a 'real' photo are increasingly generated. The line is blurring and it's only going to get worse.
- fortyseven 1 hour ago
  It's shitty, but I think it's almost as bad that people are calling everything AI. And I can't even blame them, despite how infuriating it is. It's just as insidious that even mundane things literally ARE AI now. I've seen at least twice now (that I'm aware of) where some cute, harmless, otherwise non-outrageous animal video was hiding a Sora watermark. So the crazy shit is AI. The mundane shit is AI. You wonder why everyone is calling everything AI now. :P
  [-]
  - switchbak 1 hour ago
    It seems like a low level paranoia - now I find myself double checking that the youtube video I'm watching isn't some AI slop. All the creators use Getty b-rolls and increasingly AI generated stuff so much that it's not a far stretch to have the voice and script all be auto generated too.
    I suppose if the AI was able to tell me a true and compelling story, I might not even mind so much. I just don't want to be spoon fed drivel for 15 minutes to find it was all complete made up BS.
- throwaway613746 1 hour ago
  [dead]
fasteddie31003 1 hour ago
I'm building my personal home right now. The AI image models have been a game-changer in designing the look of the house. My architect did an OK job, but the details that Nano Banana added really bring the house up a notch. I just do hundreds of renders from the basic 3D models and I find looks that I like and iterate from there. We are implementing the renders from Nano Banana over our Interior Designers designs. We would not have hired the Interior Designers again after using Nano Banana to do our interiors.
I think part of the issue with architects and designers today is that they use CAD too much. It's easy to design boxes and basic roof lines in CAD. It's harder to put in curves and more craftsman features. Nano Banana's renders have more organic design features IMO.
Our house is looking great and we're very happy how it's going so far with a lot of the thanks to Nano Banana.
[-]
- kristjansson 1 hour ago
  Part of the job of interior design is delivering the promised images in … yknow, physical reality? How are you going from nano banana images to actual plans, materials, finishes, products, paint codes, … ?
  [-]
  - fasteddie31003 1 hour ago
    I just gave the renders to the cabinet makers and they had no problems recreating.
    [-]
    - kristjansson 26 minutes ago
      Interesting. I model interior architecture as "here's $xxxK, make it nice" and they do a bunch of work to figure out what you mean by nice, and a bunch more work to codify your definition of nice into, like, SKUs of sconces and so on. Seems like NB helped you figure out your definition of nice, and your subcontractor had a good designer on staff to execute on that.
  - jatari 1 hour ago
    Presumably you give the render to a designer and they recreate it using real materials.
  - PunchTornado 1 hour ago
    not the op, but this is what i did too and bypassed the designer. I iterated with nano banana and gave the result to the company that builds the kitchen. the middleman is gone now.
- soared 1 hour ago
  Same! I redid my backyard entirely and needed ideas. Gemini took a pile of dirt and gave me countless ideas, improved my plans, recommended materials, etc. a designer gave me two out of the box ideas that Gemini didn’t come up with, but it did everything else perfectly. (Designer said, put a patio out in the yard and put your table there, and take your ugly shed and make it the center of attention, since you’ll never succeeed trying to hide it)
  [-]
  - veb 25 minutes ago
    Same thing here. I took a picture of some gravel/grass and asked it to show me what it'd look like with tiles. I showed it another part of the property, and asked it to show me what it would look like with a raised lawn. Super impressive to be able to see a cloudy idea in the physical realm like that.
- bartman 1 hour ago
  Can you write a bit more about your workflow? I've been thinking about doing the same, but since I'm very non-interior-design minded have struggled to ask the right things.
  Like... What are your inputs to the model? Empty renders of the space, or more fully decorated views/ photos? Do you have a light harness around this to help you discover the style you like and then stay consistent with it?
  Do you find that giving a lot of context around the space you're designing helps (it hasn't in my attempts)?
  [-]
  - fasteddie31003 1 hour ago
    I started with sketchup to make basic floor plans and house shapes. I had a rough idea of the style of the home. I picked "Transitional English Estate" since the build site is out on a farm that sorta looks like the Cotswolds. I used AI in this process to get rough renders and feedback on the floorplan. I then took that basic floorplan and house dimensions to a Draftsman who did a lot of tweaking to get it up to code and fix issues. I got his plans and took it to a Sketchup Pro on Fivver . They made a detailed sketchup model. I then took that model and took screenshots from different perspectives and tweaked the prompt to get renders I liked. These changes were reencorprated into the blueprints. I did the same thing with the interior. Took screenshots from sketchup and put them into AI and tweaked the prompt. https://imgur.com/a/lSIYTYr
  - soared 1 hour ago
    Mine was far more lightweight, but u just uploaded pics of my yard and prompted manually a bunch of times. Sometimes id find reference images to give as context, draw on the image to call out specific areas, etc.
    It wouldn’t show me the exact things I wanted, but got close enough that I could test ideas and iterate quickly.
- shostack 1 hour ago
  What tooling are you using to use this and manage it?
CWuestefeld 1 hour ago
What they've chosen as examples to illustrate the strength of the new model surprises me.
The "cubism" example seems like it would be a closer fit to something like stained glass or something. I don't think the thing really understands what cubism was all about. Cubist painters were trying to free themselves from the confines of a single integral plane of perspective by allowing themselves to show various parts of the image from different viewpoints, different times, different styles, etc.
The division of the image into geometric shapes is just a by-product of that quest, whereas the examples here have made it the sum total of the whole piece.
This feels to me like an example of how LLMs still don't "understand" what the art means, and are just aping its facade.
[-]
- kevinsync 1 hour ago
  I had a similar thought before realizing that I'm pretty sure what they were demonstrating wasn't art style, but adherence to correct physical dimensions and construction of the buildings referenced, that was then expressed in an art style (or reasonable facsimile thereof). The before prompts would just conjure a random building out of thin air, the after prompts searched the web for reference material and then used that in image generation.
  And actually, the link I saw a bit ago was this [0] which is more in-depth and has a lot more examples + prompts.
  [0] - https://deepmind.google/models/gemini-image/flash/
jacquesm 1 hour ago
What a great thing this didn't exist in the past. We likely wouldn't have had any of the amazing artworks that we have now. Imagine an AI generated Mona Lisa, Nightwatch or Sistine Chapel ceiling because prompting would have been so much cheaper than paying Leonardo, Rembrandt or Michelangelo...
Now extrapolate to all other artforms. Sculpture seems safe, for now, but only barely so.
[-]
- wordpad 1 hour ago
  I feel like the complete opposite is true.
  Artists aren't doing it for the money. With advanced tools like these they wouldve iterated much faster and created much grander designs.
  Art is about pushing limits of what's possible and AI just raises those limits.
  [-]
  - nluken 1 hour ago
    I hear this often and it's such a strange view of art, like the only thing that matters is scale and speed. It's a perspective so colored by mechanization that it fails to account for other philosophies in art. Think of what, say the Arts and Crafts movement was all about!
  - jacquesm 37 minutes ago
    > Artists aren't doing it for the money.
    That is unlike any artist that I know and I know quite a lot of them. They love their work and the process but they also need to eat. And that included those mentioned above.
  - theappsecguy 35 minutes ago
    Art is about creating something from scratch. This isn't creating anything but cobbling together elements of scraped/stolen content to generate an imitation of prior work.
  - __alexs 1 hour ago
    There is a tremendous amount of "art" that is produced for purely commercial reasons. It employs many thousands of people. These roles are definitely threatened by image generators.
    Agree that if you are Artist this is not going to be a big concern to you.
    [-]
    - gm678 46 minutes ago
      Also, many (I would even venture to say most) of the great artists most people know of earned their bread with intermittent commercial contracts, even rote advertising commissions in the 19th/20th century.
  - NoGravitas 2 minutes ago
    Taste is not scaleable.
  - lm28469 1 hour ago
    Have you talked to "artists"? In my experience the vast majority say the opposite of what you worded here.
  - rdedev 45 minutes ago
    An aspect of art is this pursuit of pushing boundaries within the confines of what is considered good. Would an artist with an infinite image generator be interested in pushing said boundaries? Maybe but they will definitely miss out on getting stuck on an idea and coming up something completely new
  - coldtea 1 hour ago
    >Art is about pushing limits of what's possible
    That's engineering, if that.
    Art isn't, and has never been about that.
    [-]
    - williamcotton 1 hour ago
      Sure it has. See the modernism as a whole.
  - jayd16 21 minutes ago
    The Sistine Chapel was a commission.
    [-]
    - jacquesm 9 minutes ago
      A very large fraction of everything we collect as great art marking our history was made on commission. The GGP is showing their complete ignorance of the history of art.
  - autoexec 41 minutes ago
    Yet somehow with AI art we end up with https://i.redd.it/3v2uwwxxkhkg1.png more often than https://upload.wikimedia.org/wikipedia/commons/7/73/Michelan...
    The only thing AI art makes possible that wasn't possible before is the scale of slop
  - Timpanzee 1 hour ago
    AI isn't a tool for creating art in the same way as a paintbrush or clay. AI is describing a painting you want, then having someone else creating the artwork for you. You aren't doing art in the same way hiring a sculptor isn't doing sculpting.
    AI is well on the way to eliminating human made art since the skills to actually make art will be lost to the skill of being able to describe art. You know, since the only thing that matter is reducing costs.
- nzach 22 minutes ago
  That's true, but you forgot a key piece in this puzzle. The AI can only produce things that already exist. It can combine new things, this is why you can it for a picture of Jesus planting a flag on the Moon. But it only works because Jesus is a concrete concept that already exists in our world. If you ask for a picture of jacquesm planting a flag on the Moon the result will be nonsensical.
- tom1337 1 hour ago
  I'd say these models only exist because we had amazing artworks in the past.
  [-]
  - jacquesm 34 minutes ago
    Absolutely.
- dfxm12 3 minutes ago
  I disagree. On the one hand, yeah, On This Day... 1776 is terrible, and it is sad to compare it to Requiem for a Dream or Pi, but even in this age where AI is available, we see tons of critically successful art being made without the use of AI.
- zackmorris 48 minutes ago
  I think of it more as that AI will destroy the profit motive in all things, not just art. What we used to think of as talent/skill/experience will no longer be scarce, because anyone will be able to make anything with a prompt. The perceived value will be in wholes built of valueless parts (gestalts).
  AI is incompatible with capitalism, but the world isn't ready for that. So we'll have a prolonged period of intense aggregation where more and more value is attributed to systems of control that already have more than they could ever spend, long after the free parts could have provided for basic human needs.
  In other words, the masters existed because they had benefactors and a market for their art and inventions. Today there are better artists and inventors toiling in obscurity, but they won't be remembered because they merely make rent. Which gets harder every day, so there's a kind of deification of the working class hero NPC mindset and simultaneously no bandwidth for ingenuity (what we once thought of as divine inspiration).
  Terence McKenna predicted this paradox that the future's going to get weirder and weirder back in 1998:
  https://www.youtube.com/shorts/KZ2ZtTsHqO0
  [-]
  - jacquesm 35 minutes ago
    On the contrary, the talent will be more scarce because there is no longer a motivation to acquire it in the first place.
- techjamie 1 hour ago
  Ironically we live in a time that, overall, is probably better for artists than the world any of those guys grew up in. People have always valued art but not the artists, and many artists through history, including the famous ones, died broke with their works only posthumously attaining value.
  These days, through commissions, art is a much more viable profession than it ever was.
  [-]
  - jacquesm 31 minutes ago
    It was until ~2021 and it going rapidly downhill. I know some people that are really good at art and they got work on commission from publications, venues and so on. They have seen a significant drop in their bookings and the ones that they do get negotiate hardball because (1) everybody else is desperate too and (2) if they can't get to a deal then AI is now an alternative for the not-so-discerning public which was a fairly large chunk of the usecases.
    So you were making book covers? Ah, so sorry. Nobody really cared that it was you.
    And you can probably extend that to what's between the covers...
  - coffeebeqn 57 minutes ago
    Is it though? It was for the last 20 years but I’d imagine sales of commissions are down immensely and going down every day
- charcircuit 1 hour ago
  We would have tons of great artworks if it existed in the past. The works would be both more numerous and at a higher quality.
  [-]
  - jacquesm 30 minutes ago
    Absolutely not a chance. You see, in the past there was nothing to train it on. And that's sort of the point: the only reason that this AI image generation works at all is because it is lifting on the hard work of the people that had the skills, put the time and the effort in.
- ahtihn 1 hour ago
  Would anyone even care about Mona Lisa if the exact same painting was done by a random nobody? It's just a portrait.
  [-]
  - coldtea 1 hour ago
    Most people no. Then again most people are idiots barely aware of the world they live in, much less culture.
    People who actually care about art, if given a chance to see it, yes.
    Of course, it being done by Davinci is not some random fact about the painting - as if a painting is a mere artifact.
  - __alexs 1 hour ago
    Da Vinci is maybe only the 5th most interesting thing about the Mona Lisa.
- hypeatei 43 minutes ago
  I'll just be extremely candid: a lot of people don't give a shit about these art pieces or art in general. It's okay if you do, there is nothing wrong with that, but it's a myopic view that the world would be worse off if we didn't have a portrait of Mona Lisa.
  [-]
  - jacquesm 28 minutes ago
    Yes, who gives a shit about culture, after all humanity doesn't really need it...
    [-]
    - hypeatei 19 minutes ago
      That's not the point, but okay. I'm simply pointing out the fact that there'd still be art, just not those pieces created by those specific people and the world would be just fine. Humanity would've fared okay if Nano Banana was created 500 years ago.
      [-]
      - jacquesm 8 minutes ago
        What would you have trained it on?
MaxikCZ 7 minutes ago
I have Google AI Ultra. Where can I test this? They say its in aistudio, which says its a paid model and I need to setup billing (as if paying for Ultra isnt enough). They say its available in antigravity, but I cant seem to find it there?
yakattak 1 hour ago
I think this tech is cool, from an engineering perspective. I’m trying to figure out if there’s any justification for using it in a business world outside of: “We don’t want to pay an artist.”
You can argue things like code generation are an extension of the engineer wielding it. Image generation just seems like a net negative overall if it’s used at scale.
Edit: By scale, I mean large corporations putting content in front of millions. I understand the appeal for smaller businesses where they probably weren’t going to pay an artist anyway.
[-]
- alex43578 1 hour ago
  When a company uses a photocopier, they don’t want to pay a scribe.
  When a company sends an email or docu-sign, they don’t want to pay a courier.
  Technology supplements or replaces jobs, often reducing costs. This is no different.
  [-]
  - nindalf 1 hour ago
    Art isn't just a job or a way to make money, like being a courier is.
    [-]
    - progbits 1 hour ago
      For corporate art it is. Nobody draws memphis out of passion.
      [-]
      - dizlexic 1 hour ago
        The real victims here are going to be the graphic designers who worked for firework importers.
- garbawarb 1 hour ago
  Advertising? "We don't want to pay an artist" goes a long way for a small business with a limited budget.
  [-]
  - whynotmaybe 1 hour ago
    We're using voice generation from clipchamp for our promotional videos.
    It's an ethical conundrum because we're not paying anyone, but we don't have the money to pay anyone, and it's good enough for our budget.
    But we're getting used to the process of changing a part of the text in a few seconds without any artist involved and for 0$.
    I guess that soon we'll be able to create voice sample from know personalities for a few $ with prices based on the popularity of the artist and some sanity check based on the artist preferences.
    [-]
    - yakattak 1 hour ago
      I think this is where I see the benefit for small business. I don’t want to speak for you, but I imagine it’s either “no voice over, we can’t afford it.” or “inexpensive AI voice over to make it more accessible and appealing.”
      My thought is the large corps that could afford it, still won’t because it’s a cost they don’t need to incur. For them it’s not even a moral conundrum.
  - rm_-rf_slash 1 hour ago
    It can also backfire. AI slop ads and marketing material imply cut corners and poor quality products. If a bakery isn’t going to bother touching up its AI slop banner, I don’t expect their cookies to be great either.
    [-]
    - gwd 1 hour ago
      FWIW I've never seen a correlation between a small company's website and the quality of their product. Slick website? Maybe they care for their craft, maybe they're all marketing and no content. Website stuck in 1998? Maybe they're sloppy and don't care; maybe they care about their core product, not a slick marketing brochure. I don't see any reason AI would be different in that regard.
      [-]
      - yakattak 1 hour ago
        That’s true. I think it’s more of a problem of getting someone in the door. Anecdotally going to art festivals I’m much more likely to enter the booth of someone who has handcrafted marketing over the person who has generated marketing.
        [-]
        hedora 38 minutes ago
        Basic marketing theory says that spending extra to make your ads signals (term of art) to your potential customers that (1) you are successful, since you can afford it and (2) you are confident your product is superior, since you’re effectively paying people to try it, and expect doing this will generate revenue in return.
        Much like the star bellied sneetches, when the quality of some ad format becomes untethered from the cost of production and placement, then marketers will flock to some alternative.
        YouTube influencers fill[ed] that niche for a while because content milling SEO spam and fake reviews is a lot more expensive if you present the results in video form with good production values. (Not sure how long that will be true, since AI is getting better at short-term video).
    - switchbak 1 hour ago
      Every local business I deal with is completely lacking on the online side. They might have square POS terminals and all that stuff, but their website either doesn't exist, sucks (not updated in years) or they throw me to Facebook (also sucks).
      This is like the last mile for online presence. The average barber out here doesn't use Squarespace, barely knows how to use Facebook and doesn't touch GenAi. But they can still cut your hair pretty well - tech savvyness doesn't have a huge connection to business competence out here.
    - awepofiwaop 1 hour ago
      The amount of lost revenue due to the implication of cut corners needs to be higher than the cost of hiring an artist by enough of a margin that the managers who make the decision start to care, and enough that they're willing to put the effort into hiring an artist.
    - sekai 41 minutes ago
      > It can also backfire. AI slop ads and marketing material imply cut corners and poor quality products. If a bakery isn’t going to bother touching up its AI slop banner, I don’t expect their cookies to be great either.
      Average person won't notice, and would not care either way.
    - hypeatei 1 hour ago
      This assumes that models won't improve and you'll always be able to tell that it's "AI slop" ... that seems like a bad bet. Five years ago you'd be laughed out of the room for suggesting that a computer could produce images from a natural language prompt and that it'd be accessible to everyone -- not just corporations with deep pockets.
      [-]
      - yakattak 1 hour ago
        Yeah if/when it becomes indistinguishable I think most people won’t care. That being said I do think someone finding out something is AI generated will be met with poor response. Does that ultimately matter? Probably not in a business world.
- konschubert 1 hour ago
  I disagree with your premise that everybody should endure friction and cost such that artists can earn a living producing cookie-cutter content.
- jedberg 50 minutes ago
  I've been using it to replace things that I used to do for personal projects in photoshop/gimp. Remove a background, add a person, put a letter in here that looks like the same crayon as the other letters.
  Things that would take me an hour or so the old way takes three minutes with NB.
  But I can see this applying to small businesses. Something that some random person would have to spend on hour photoshopping can be done in a few minutes with NB.
- bonoboTP 1 hour ago
  Drafting, iteration, mockups. Quite useful during ideation.
  [-]
  - yakattak 1 hour ago
    All things traditionally done by artists or artist adjacent roles. I can understand at an individual level, say for a solo gamedev who wasn’t going to pay an artist anyway. That’s not at scale though.
    Larian Studios most recently was under fire for this [1]. Like I can see a director going “what would X look like?” and then speeding over to the concept artists for a proper rendition if they liked it. I don’t think this is at scale though. Any large business is just going to get rid of the concept artists.
    [1]: https://www.pcgamer.com/games/rpg/baldurs-gate-3-developer-l...
    [-]
    - bonoboTP 1 hour ago
      There are many places in general office work where you need some kind of graphics. Slides, reports, info graphics, dataviz. Or academic papers. Some are just illustrations, like a fancy clipart or stock photos, some are drafts for a proper tikz or svg or something that you then redo in draw.io etc. There is much more use for graphics than the use cases where people would ever even consider hiring an actual artist. I've seen good results for iterating on eg model architecture figures quickly between PhD students and supervisors, faster than dragging boxes around and fiddling with tikz. Obviously you don't simply paste the result into the paper. You redo it but it's a good discussion basis. That's for info graphics stuff. But the same can apply to creative stuff, like an event poster, an invitation card to your wedding, storyboards, mood boards, DIY interior design, outfit planning etc etc
      [-]
      - yakattak 1 hour ago
        Yeah that’s a good point. I don’t think that’s what I meant by “at scale” but I can see that being useful day to day.
- sempron64 1 hour ago
  Diagrams! So much documentation lacks diagrams because they are hard to make
  [-]
  - yakattak 1 hour ago
    True! Though I’d argue diagrams as code like PlantUML or Mermaid are better than an image!
    [-]
    - vunderba 1 hour ago
      Agree just from a text search perspective alone that Mermaid even ASCII diagrams are usually preferable.
- jezzamon 1 hour ago
  One major thing is photoreal use cases, which artists can't really do. A lot of that is deep fakes / scams but there are some real use cases
  [-]
  - yakattak 1 hour ago
    Isn’t that what photographers are for?
- zamalek 22 minutes ago
  Sora is already a flop. People are sick of slop and are getting good at identifying it. Grok is the only player that has any semblance of success in the visual gen market, only because they do the one thing that will always make money.
- RickS 1 hour ago
  Same answers you'd use beyond "we don't want to pay an engineer". 100x shorter iteration speed, and the associated workflow (stream of microrevisions and spaghetti throwing), top quartile outputs in many langs/styles/contexts without having to source, hire, and maintain a fleet of separate specialists who can quit when they feel like it.
  I'm torn on the scale thing. It definitely seems net negative. But I think we collectively underestimate just how deeply sick the existing thing already is. We're repulsed by image gen at scale because it breaks our expectation that images are at least somewhat based on reality, that they reflect the natural world or what we can really expect from a product, from a company, from the future. But that was already a bad expectation: when's the last time you saw a mcdonalds meal that looked like the advert? Or a sub-30$ amazon product that wasn't a complete piece of shit? Advertisements were already actively malicious fantasies to exploit the way our brains react to pictures. They're just fantasies that required whole teams of humans doing weird bullshit with lighting and photoshop, and I'm not sure that's much better. It was already slop. All the grieving we do about the loss of truth, or the extent to which corps will gleefully spray us with mind-breaking waterfalls of outright lies, I think those ships sailed a long time ago. The disgust, deceit, the rage we feel about genAI slop is the way we should have felt about all commercials since at least the 80s IMO.
  [-]
  - yakattak 1 hour ago
    > Advertisements were already actively malicious fantasies to exploit the way our brains react to pictures. They're just fantasies that required whole teams of humans doing weird bullshit with lighting and photoshop, and I'm not sure that's much better.
    This is a good point. My gut reaction is “well at least someone was paid to do it and can continue to keep society/the economy going ”.
    I can see the other side where that’s a soulless job. Not sure what’s worse. Soulless job where your skills apply or even less jobs in a competitive industry.
- rafael09ed 1 hour ago
  It is faster as well
- the_mar 1 hour ago
  a friend of mine was a creative director and a big tech co until recently, she was replaced by AI
- testing22321 1 hour ago
  > I’m trying to figure out if there’s any justification for using it in a business world outside of: “We don’t want to pay a human.”
  You could easily say the same about anytime computers or robots or automation have taken a job away. We’ve been going down this road for decades.
  [-]
  - yakattak 1 hour ago
    Those industries (computers, robots) created other jobs though. This doesn’t seem to.
    [-]
    - jedberg 49 minutes ago
      It will. There will be people whose skillset is advanced prompting.
- tantalor 1 hour ago
  Won't somebody think of the window replacers?
neom 28 minutes ago
I did some tests, my education is in digital imaging technology/film from 20 years ago so I find this stuff fun to follow.
Two what I could consider "interesting prompts" for image gen testing. Did pretty well.
https://s.h4x.club/KouXmB1O
"A macro close-up photograph of an old watchmaker's hands carefully replacing a tiny gear inside a vintage pocket watch. The watch mechanism is partially submerged in a shallow dish of clear water, causing visible refraction and light caustics across the brass gears. A single drop of water is falling from a pair of steel tweezers, captured mid splash on the water's surface. Reflect the watchmaker's face, slightly distorted, in the curved glass of the watch face. Sharp focus throughout, natural window lighting from the left, shot on 100mm macro lens." - Only major problem i could find at a glance is the clasps don't make sense probably, and the drop of water inside the watch on the cog doesn't make sense/cog mangled into tweezers.
https://s.h4x.club/7KuGJmNw
"A candid photograph taken from behind an elderly woman sitting alone on a park bench in late autumn. She is gently resting one hand on the empty seat beside her, where a man's weathered flat cap and a folded newspaper sit untouched. Fallen golden leaves cover the path ahead. The low afternoon sun casts her long shadow alongside a second, fainter shadow that almost seems to be there, the suggestion of someone sitting next to her, visible only in the light on the ground. Muted, warm color palette, shallow depth of field on the background trees, photojournalistic style." - I don't know why but it internal errored twice on this one but then got there.
vunderba 1 hour ago
I've only had a brief opportunity to try out NB Pro 2 (`gemini-3.1-flash-image-preview`), so I haven't had a chance to update GenAI Showdown.
Here's some of my captions that tend to trip up even state-of-the-art models.
https://mordenstar.com/other/nb-pro-2-tests
So far it does feel more iterative than an entirely new leap in terms of capabilities, but I haven't run it through the more multimodal aspects such as editing existing images.
That being said, it actually managed the King Louie jump rope test which surprised me.
Scene_Cast2 41 minutes ago
It still seems to have the same pitfalls as all the other image generation models. I ran it through my test prompt (wary of posting it here, lest it gets trained on) - it still cannot generate something along the lines of "object A, but with feature X from Y", where that combo has never been seen in the training data. I wonder how the "astronaut riding unicorn on the moon" was solved...
EDIT: after significant prompting, it actually solved it. I think it's the first one to do so in my testing.
jakub_g 1 hour ago
Since talking images, are there any AI models that can output real transparent gifs/pngs?
And not a (botched) fake white/gray grid background that is commonly used to visualize transparency?
[-]
- dyates 2 minutes ago
  ChatGPT's image generator has been able to do this since last year. That NBP still can't is baffling. They should at least train it to respond to requests for transparency with a solid colour pink background.
- minimaxir 1 hour ago
  You can output to a plain background and use any number of tools to mask it.
  [-]
  - jakub_g 58 minutes ago
    I know. It sounds like a perfect task for AI to do it though (wasn't the whole premise of AI do to mundane things for us), yet they fail to do it, and I need to use an external tool.
    [-]
    - minimaxir 55 minutes ago
      Alpha is a 4th image channel that 99%+ of images in the training data do not use, so it makes more pragmatic sense to just not allow it.
pietz 1 hour ago
I'm officially done with the Nano Banana name. It was fun, but can we go back just calling it Gemini Image?
[-]
- bonoboTP 1 hour ago
  Name recognition has big value. People remember what an advancement the first banana was. Nowadays it's no longer so unique, ChatGPT's and Grok's image editors are also strong.
- PunchTornado 26 minutes ago
  I really like it. Nano banana is like the best product name in AI.
jslakro 31 minutes ago
It'll be great to find a web directory dedicated exclusively to good/useful prompts with nano banana
runamuck 1 hour ago
I saw an item for sale on Ali Express's video and I thought "Wow, they hired some really attractive actors to pitch their little gadget." 30 seconds in, I realized they used GenAI. Not because it looked AI, but because the production values looked too high and professional for the item. I would get in on this if you sell anything online.
[-]
- coffeebeqn 55 minutes ago
  They can even combine the models, create the presenters with nano banana and then use that as the reference for a video model and paste in your product
aliljet 1 hour ago
I really really want to see how these images are starting to form into videos. The stills are clearly getting better and better, but what about when you need the stills to organically conform to a keyed script?
[-]
- Mizza 1 hour ago
  Check out Seedance 2: https://seed.bytedance.com/en/seedance2_0
  Nano Banana was technically impressive the first time, but after Seedance it's not really. It's all just an internet pollution machine anyway.
  [-]
  - rany_ 1 hour ago
    The page looks promising but how can I try it out?
    [-]
    - rabf 24 minutes ago
      They have an API.
- progbits 1 hour ago
  I'm seeing more and more AI video memes and they are getting really good. Still just bunch of short clips, long shots are not working well enough, but typical Hollywood movies have few second cuts anyway so this is almost good enough to make a marvel fanfic.
- vessenes 1 hour ago
  the workflow right now would be to take this images, make a sequence of them for key "shots" and send them to an I2V model. LTX-2 is the model the r/stablediffusion folks are playing with right now, but there are a fair few.
LeoPanthera 1 hour ago
It's notable that this model is less advanced that the previous "Pro" model, and also that the Gemini interface is defaulting all requests to "Fast" even if you've previously changed it to Pro.
I guess even Google is running out of GPUs.
vessenes 1 hour ago
Interesting they get to rev this with the release of a new flash model. I'm speculating part of the distil pipeline includes the image gen stuff; that seems like internal tooling that will pay dividends over time, if true. New frontier model -> automatic new image model. Even if it's just incremental updates, it's good for both the product cadence and compounding improvements.
[-]
- WarmWash 1 hour ago
  The confusion here is dense, 3.1 Flash Image is not 3.1 Flash.
  The banana models (image) are a different than the mainline models, but the confusingly leverage the same naming scheme.
- NitpickLawyer 1 hour ago
  > the distil pipeline
  I don't have inside info, but everything we've seen about gemini3.0 makes me think they aren't doing distillation for their models. They are likely training different arch/sizes in parallel. Gemini 3.0-flash was better than 3.0-pro on a bunch of tasks. That shouldn't happen with distillation. So my guess is that they are working in parallel, on different arches, and try out stuff on -flash first (since they're smaller and faster to train) and then apply the learnings to -pro training runs. (same thing kinda happened with 2.5-flash that got better upgrades than 2.5-pro at various points last year). Ofc I might be wrong, but that's my guess right now.
ozgung 27 minutes ago
Any info or speculation about technical details?
JOJESU 1 hour ago
I’ve been exploring this exact problem space from the angle of extreme constraints (single-digit MB memory, no cloud assumptions). I documented what broke first and why here, in case it’s useful: https://github.com/nullclaw/nullclaw
[-]
- nathan_compton 1 hour ago
  So this is an ultra-minimalist software platform to farm work out to enormous energy chugging AI models?
hmokiguess 1 hour ago
The Chinese are so much ahead in this space, their models are way better at this stuff. For example, https://hunyuan.tencent.com/image/en?tabIndex=0 and https://seed.bytedance.com/en/seedream5_0_lite
[-]
- raincole 3 minutes ago
  When it comes to image prompts, Seedream is far behind Nano Banana. "Far behind" is a ridiculous understatement here, btw.
  Afaik the only real competitor is Riverflow V2.
- sigmar 1 hour ago
  which models?
  we have user-preference rankings that put NB2 on top: https://arena.ai/leaderboard/text-to-image
  [-]
  - hmokiguess 1 hour ago
    interesting, maybe this is just anedoctal experience then and I'm biased but I have been preferring theirs over Nano Banana
- KK7NIL 1 hour ago
  Kind of a pointless comment without a link to such a model.
  [-]
  - hmokiguess 1 hour ago
    sorry, I've edited my original comment with one such example, but they're easily discoverable I assumed it was popular knowledge
    - https://hunyuan.tencent.com/image/en?tabIndex=0
    - https://seed.bytedance.com/en/seedream5_0_lite
    someone shared benchmarks that differ my experience tho, so I may be biased
hedora 45 minutes ago
Open weight? How many parameters?
sync 1 hour ago
Did gemini-2.5-flash-image get an upgrade as well? I just got the following, which is fascinating, and not something I've seen before:
> I'm sorry, but I cannot fulfill your request as it contains conflicting instructions. You asked me to include the self-carved markings on the character's right wrist and to show him clutching his electromancy focus, but you also explicitly stated, "Do NOT include any props, weapons, or objects in the character's hands - hands should be empty." This contradiction prevents me from generating the image as requested.
My prompts are automated (e.g. I'm not writing them) and definitely have contained conflicting instructions in the past.
A quick google search on that error doesn't reveal anything either
minimaxir 1 hour ago
Google updated it early in AI Studio so I've been experimenting:
- Base pricing for a 1024x1024 image is almost 1.6x what normal Nano Banana is ($0.067 vs. $0.039), however you can now get a 512x512 image for cheaper, or a 4k image for cheaper than four 1k images: https://ai.google.dev/gemini-api/docs/pricing#gemini-3.1-fla...
- Thinking is now configurable between `Minimal` and `High` (was not the case with Nano Banana Pro)
- Safety of the model appears to be increased so typical copyright infringing/NSFW content is difficult to generate (it refused to let me generate cartoon characters having taken psychedelics)
- Generation speed is really slow (2-3min per image) but that may be due to load.
- Prompt adherence to my trickier prompts for Nano Banana Pro (https://minimaxir.com/2025/12/nano-banana-pro/) is much worse, unsurprisingly. For example I asked it to make a 5x2 grid with 10 given inputs and it keeps making 4x3 grids with duplicate inputs.
However, I am skeptical with their marquee feature: image search. Anyone who has used Nano Banana Pro for awhile knows that it will strongly overfit on any input images by copy/pasting the subject without changes which is bad for creativity, and I suspect this implementation appears the same.
Additionally I have a test prompt which exploits the January 2025 knowledge cutoff:
```
    Generate a photo of the KPop Demon Hunters performing a concert at Golden Gate Park in their concert outfits.
```
That still fails even with Grounding with Google Search and Image Search enabled, and more charitable variants of the prompt.
tl;dr the example images (https://deepmind.google/models/gemini-image/flash/) seem similar to Nano Banana Pro which is indeed a big quality improvement but even relative to base Nano Banana it's unclear if it justifies a "2" subtitle especially given the increased cost.
[-]
- shostack 58 minutes ago
  The pricing changes are interesting. I wonder if at some point they will deprecate the less expensive model to increase their margins.
  Original Nano Banana (gemini-2.5-flash-image): $0.039 per image (up to 1024×1024px)
  Nano Banana 2 (gemini-3.1-flash-image-preview): $0.045 per 512px image $0.067 per 1K (1024×1024) image $0.101 per 2K image $0.151 per 4K image
  Nano Banana Pro (gemini-3-pro-image-preview): $0.134 per 1K/2K image $0.240 per 4K image
  So at the most common 1K resolution, NB2 is ~72% more expensive than the original NB ($0.067 vs $0.039), but still half the price of NB Pro ($0.134).
- sheept 1 hour ago
  For your knowledge cutoff test, did it failing mean that it generated a generic "Kpop demon hunter" or it rejected the prompt?
  [-]
  - minimaxir 1 hour ago
    Generic "Kpop demon hunter". Nano Banana 2 atleast has fun with it, though.
evrenesat 1 hour ago
I only needed help of this banana boy twice, it managed to disappoint me each time. The most recent one, I was trying different beard and mustache styles on myself, on a photo I imported from my own Google photo gallery, and it consistently rejected me, claiming I'm a public figure. Nobody ever told me that I look like any famous person, so that's googles own bananination. ChatGPT nicely handled the job.
dgtlanml2 1 hour ago
Wow the article narration with Umbriel is silent after the 6 second mark.
riteshyadav02 1 hour ago
Would be interesting to see latency vs quality tradeoffs here. Are they targeting consumer-facing generation speed or prioritizing fidelity for professional workflows?
meowface 1 hour ago
How does it compare to Nano Banana Pro?
sorenjan 1 hour ago
Is this a distillation of Nano Banana Pro?
[-]
- meetpateltech 1 hour ago
  Gemini 3.1 Flash Image is based on Gemini 3 Flash.
  source: https://deepmind.google/models/model-cards/gemini-3-1-flash-...
dyauspitr 7 minutes ago
I really wish they opened a version of this up for adult content. They would make immense amounts of money and it could be fenced off behind some sort of paywall where they could verify the age of the person.
wnevets 1 hour ago
does it still break images with transparent pixels?
throwaway4928ab 1 hour ago
Can we now edit the images it spits out? All prior tests in trying to edit AI images has failed miserably and laughably
danesparza 1 hour ago
Is it just me, or is Nano banana not working in Gemini currently?
hubraumhugo 30 minutes ago
It's working pretty well for generating an xkcd comic for your HN profile: https://hn-wrapped.kadoa.com/
Previous nano banana frequently made speech attribution errors, the new one seems a lot more consistent.
nightski 1 hour ago
[flagged]
[-]
- estearum 1 hour ago
  Quite telling that you think a technology that merely prevents you from passing off an AI-generated image as not-AI-generated makes the model "worthless."
  Good!
  That's the point! Whatever amazing use case you had in mind is bad and I'm glad SynthID (apparently) makes it impossible.
  [-]
  - nightski 54 minutes ago
    Actually no it just makes me use a different model. My uses are not nefarious at all, although it's fine for you to assume so. There are real, legitimate reasons why SynthId is actively harmful that do not involve deceiving or manipulating people at all. SynthId is just a stain on legitimate AI users. People who want to deceive or manipulate are not using Google models anyways. They are going to use a model without safety rails (which is not what I am advocating for per se, just that SynthId is an awful solution).
    It actually reeks of Google, since it's a technical solution to a people problem. Google doesn't seem to understand people.
    [-]
    - estearum 17 minutes ago
      > There are real, legitimate reasons why SynthId is actively harmful that do not involve deceiving or manipulating people at all
      I am legitimately curious: can you name some?
      > Actually no it just makes me use a different model
      Yes, this is a very good thing when "a different model" means "a worse model."
      > People who want to deceive or manipulate are not using Google models anyways. They are going to use a model without safety rails
      That's totally invalid logic. There are plenty of deception and manipulation use cases that don't run afoul of model safety rails at all. Trivially: Creating fake dating profiles to scam people. Fake product images. Fake insurance claims. Fake blackmail (e.g. of a person and another man/woman at a bar).
- DalasNoin 1 hour ago
  Why does SynthID make it worthless? it helps other platforms detect this as ai?
  [-]
  - zardo 1 hour ago
    If the value is in deception.
- csjh 1 hour ago
  What’s the downside of SynthID?
RivieraKid 49 minutes ago
It's extremely slow, takes several minutes to generate an image.
ge96 1 hour ago
My naive question, can image generation make something novel eg. "show me a DNA structure that cures cancer" can it do that, or it has to have seen something before to generate it.
Just think we conceptually know what a brushless motor design looks like and it's just pixels. I guess even if it did produce the image we wouldn't know what it means.
[-]
- minimaxir 1 hour ago
  All image models can generate images that were not in its training dataset, but it can't generate reductive extreme cases like your example.
  [-]
  - ge96 1 hour ago
    What about it is extreme? It's a concept, like "generate an xray image" eventually hopefully the cure to cancer could be represented as a simple molecule or whatever, I'm not saying I know.
    [-]
    - minimaxir 48 minutes ago
      There is currently no knowledge nor progress for what a cure for cancer, and nothing a LLM can draw upon.
      You could generate "pregnant Elon Musk with four arms and three eyes doing yoga poses" because the image models have enough visual concepts of each of those individual things, but that specific image is (likely) not in any training dataset.
      [-]
      - ge96 42 minutes ago
        What I'm saying is if this thing can generate random things (noise) couldn't it make that or new tech like negative mass. Anyway I get it too if we don't know then something we made wouldn't know.