Why we chose not to release Stable Diffusion 1.5 as quickly (danieljeffries.substack.com)

298 points by dwynings 3 years ago | 343 comments

I'm not a data hoarder, but from the moment Stable Diffusion was released I had a gut feeling that I should download everything available while it's there.

Somewhat similar gut feeling to when popcorn time was released, although it might not be exactly the same.

While I really wish I were wrong, my gut tells me that broadly trained machine learning models available to the general public won't last and that intellectual property hawks are going to one day cancel and remove these models and code from all convenient access channels.

That somehow international legislation will converge on the strictest possible interpretation of intellectual property, and those models will become illegal by the mere fact they were trained on copyrighted material.

So reminder to everyone: Download! Get it and use it before they try to close the Stable doors after the horses Diffused. Do not be fooled by the illusion that just because it's open source it will be there forever! Popcorn time lost a similar battle.

Get it now when there are trustworthy sources. Once these kinds of things go underground, it gets much harder to get a trustworthy version.

williamcotton 3 years ago | |

From my research the general consensus is that the processing of copyrighted material will be considered fair use. Here is a lengthy legal discussion:

https://texaslawreview.org/fair-learning/

Here is a short quote from an IP lawyer:

“In terms of the ingestion of publicly accessible code, Ochoa said, there may be software license violations but that's probably protected by fair use. While there hasn't been a lot of litigation about that, a number of scholars have taken that position and he said he's inclined to agree.”

https://www.theregister.com/2022/10/19/github_copilot_copyri...

renonce 3 years ago | | |

It’s very probably fair use under current copyright laws. The thing is that the game is changing very rapidly. Right now it’s drawing criticism over how it affects society and allows people to generate unwanted images, and copyright laws alone may not be sufficient to protect them. And it has already caught regulators’ attention, so even the law could be rewritten around these models.

resoluteteeth 3 years ago | | |

> From my research the general consensus is that the processing of copyrighted material will be considered fair use. Here is a lengthy legal discussion:

IANAL but I would take any opinions on this right now with a huge grain of salt and treat them more as advocacy than actual predictions of any legal outcomes.

Whether there is a good case for it being considered fair use doesn't matter at all until it's actually litigated, and historically the result with fair use in relation to new technologies has always been a crapshoot.

The result could easily be affected by the actual cases that get litigated, and one well chosen lawsuit where machine learning software is shown to produce output that's too close to the material it was trained on could result in a completely different outcome.

notacoward 3 years ago | | |

Both of your sources make the point that the output of such models is separate from the ingestion mentioned in your (carefully selected) quote, and that the legal definition of fair use might well change to preclude such "AI washed" (my term) copying. That's almost the opposite of how you portray the state of legal thought on the matter.

Vetch 3 years ago | |

What's the point of downloading it when it'd just stagnate? This isn't like regular software where people can easily put in hard work and sweat to improve it.

LLMs have the unfortunate limitation of being both powerful and lending themselves to centralized control choke-points due to how resource intensive they are to train. Under this paradigm, I fear commercial entities will be able to easily navigate the legal landmines and continually improve while open efforts perpetually lag far behind.

There are many vested interests who want this control for various reasons they justify as: protection from x-risk, keeping it out of the hands of abusers and bullies, economic advantage. Their reasons for want of control are either well intended but wrong-headed or profit-motivated and disingenuous.

Rather than challenging the likes of GPT-3 and Copilot enabling freedom, I fear folks will be forced to send all their videos, pictures, text and code to the servers of Microsoft, Amazon and Google or lose access to advantages as LLMs continue to improve at a rapid clip.

frognumber 3 years ago | | |

> What's the point of downloading it when it'd just stagnate?

Because it's already good enough to have made its way into many of my workflows.

I do feel that many companies will, ironically, use "ethical" as a pretext to not be open.

visarga 3 years ago | | |

> LLMs have the unfortunate limitation of being both powerful and lending themselves to centralized control choke-points

It was hard to accomplish, but you can finetune SD on your computer. They are working on instruction-tuning LLMs as well. In general ML models are not closed boxes inaccessible to us - they can be finetuned, reprompted, you can even average two versions to get a mix of two models. In the last 2 years lots of papers were written on finetuning and prompting, all of them geared towards low resource AI adaptation to new tasks.
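The weight-averaging idea mentioned above can be sketched in a few lines. This is a minimal, framework-free illustration, assuming two checkpoints with identical parameter names and shapes. The function name `average_weights`, the `alpha` mixing parameter, and the toy checkpoints are hypothetical; real checkpoints would hold tensors rather than flat Python lists.

```python
def average_weights(model_a, model_b, alpha=0.5):
    """Return alpha * model_a + (1 - alpha) * model_b, parameter by parameter."""
    if model_a.keys() != model_b.keys():
        raise ValueError("models must have the same parameter names")
    merged = {}
    for name in model_a:
        a, b = model_a[name], model_b[name]
        if len(a) != len(b):
            raise ValueError(f"shape mismatch for {name!r}")
        # Linear interpolation of each individual weight.
        merged[name] = [alpha * x + (1 - alpha) * y for x, y in zip(a, b)]
    return merged


# Hypothetical toy "checkpoints": flat lists stand in for real tensors.
base = {"layer.weight": [0.0, 2.0, 4.0], "layer.bias": [1.0]}
tuned = {"layer.weight": [2.0, 2.0, 0.0], "layer.bias": [3.0]}

mix = average_weights(base, tuned, alpha=0.5)
```

Setting `alpha` closer to 1.0 keeps the result nearer the first model, which is roughly how a "mix of two models" can be dialed toward one parent or the other.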

deepserket 3 years ago | | |

> I fear commercial entities will be able to easily navigate the legal landmines and continually improve while open efforts perpetually lag far behind

Is it possible to crowdsource AI training with something that looks similar to folding@home?

petercooper 3 years ago | | |

> What's the point of downloading it when it'd just stagnate?

The quality of the output you can get with the models right now has perpetual utility IMO. If you use it to create patterns, backgrounds, or even just for inspiration creations right now, it might be a shame if it didn't progress (depending on your position) but it's fine as-is if you put in the work to compose and refine the raw output.

danuker 3 years ago | | |

> when it'd just stagnate?

While it'd be difficult to improve upon the model, it might be easy enough to finetune it if needed, and it's certainly worth it to USE it as is.

There is a limited number of models that cost six figures (in dollars) to train and are freely available. There is certainly value in preserving them, in a world of artificial scarcity.

RobotToaster 3 years ago | | |

>LLMs have the unfortunate limitation of being both powerful and lending themselves to centralized control choke-points due to how resource intensive they are to train.

I wonder if that will continue.

My understanding is that's partially because it currently relies on GPUs, which until relatively recently there was a limited demand for, and the market is basically controlled by a single company.

Will we see cheaper special purpose AI accelerators? Like happened with crypto mining ASICs.

gauravvij137 3 years ago | | |

The only way to get rid of centralized choke points is to actually go decentralized. At Q Blocks, we're working on making this solution a reality for a lot of the ML devs constrained by the computing costs on cloud.

WheelsAtLarge 3 years ago | |

Companies have a problem with AI now similar to the one the music labels had with Napster and MP3s in the '90s. Music labels tried very hard to legislate the problem away but it failed. I remember Metallica's Lars Ulrich working hard to fight it. They finally embraced the change. If it can't be done in the U.S., it will be done in some other country. That country will have the competitive advantage.

We'll go thru the same with AI but ultimately it won't be stopped. As long as there's no world wide coordination limiting its impact, AI will continue its course.

squokko 3 years ago | | |

They did legislate the problem away. Sure, Spotify and YouTube play a part in the reduced music piracy today. But it also helps that all of the music piracy sites have been killed, and the only ones left are shady enough that you fear malware if you go there.

azinman2 3 years ago | | |

How did he embrace the change? I just remember years of lawsuits and whatnot.

manholio 3 years ago | | |

They won't legislate AI away, just training them on copyrighted works without attribution and license. As it should be.

Countries that don't do that will be just as successful in the world marketplace as are countries that don't respect copyright.

Satam 3 years ago | |

Wow, thank you. That's a very interesting take.

You mention Popcorn time. I wonder if torrents in general could be a great example of how something like this plays out? Torrenting took the world by storm and had an amazing "product-market fit" for the early internet days. Of course, downloading copyrighted material was always illegal but that didn't stop many.

Over time, legal but paid alternatives rose up: Spotify, iTunes, Netflix. These players found their place in the market by balancing the interest of copyright holders and the needs of users looking for cheap and easy access to entertainment.

Just as Netflix acquired large content libraries, same here. With enough money, large training datasets could be acquired in a legally solid manner.

It's interesting to think where this analogy might fail as well, and how the paths of these technologies could differ. For one, torrenting was mostly for entertainment, and thus impacted B2C first. On the other hand, language models are more so for media _creation_ and the B2B sphere.

machina_ex_deus 3 years ago | | |

They can and do fight dirty. They don't only use legal tactics, they use legal options to get the information off from trustworthy sources.

Like torrents, you first have to resort to random websites that get randomly taken down as they acquire reputation. If a person takes the face and responsibility for something, he gets litigated into oblivion.

So you get to the point where trustworthy and untrustworthy sources are indistinguishable. Now what they do is create untrustworthy sources. Like time for popcorn. Sow discord.

Fork several times, create intentionally malwared versions of both the program and the website. Keep kicking off the trustworthy sources of search engines, while magically skipping takedown requests for the less trustworthy websites.

Find ways to break old versions if possible, just to force them to keep moving. (they can make gradio randomly change APIs just to break the old trustworthy versions)

All of this can happen.

pmoriarty 3 years ago | | |

"Over time, legal but paid alternatives rose up: Spotify, iTunes, Netflix. These players found their place in the market by balancing the interest of copyright holders and the needs of users looking for cheap and easy access to entertainment."

You didn't mention one of the largest (perhaps even the largest) distributors of copyrighted content (which happens to also be free, for now): YouTube.

You can watch/listen to endless amounts of copyrighted content (and other types of content) on there completely for free, and to say it's tremendously popular would be an understatement.

Google has made it work through ads. Perhaps something like that will happen with image-generating AI.

pabs3 3 years ago | |

Models that are trained on data under open source licenses (such as Creative Commons) would likely be much safer from copyright claims. I like to use the Debian Deep Learning Team's Machine Learning Policy to evaluate the openness of ML work.

https://salsa.debian.org/deeplearning-team/ml-policy

wongarsu 3 years ago | | |

Unless they carry with them a library of attributions to every source image, that safety comes mostly from anticipating that authors of CC-licensed works won't be too upset about people using them.

prepend 3 years ago | |

I forked deepfake a few years ago because it seemed interesting. I didn’t have a spidey sense, just thought it would be something interesting to look into. But I forked on GitHub rather than doing a proper clone, so now it’s gone.

It reminds me to follow the datahoarder maxim that if you don’t admin the servers, you don’t have the data. So now I clone stuff to a local drive.

spaceman_2020 3 years ago | |

This lines up with my observation of the sudden and complete absence of celebrity deep fakes (the adult rated or otherwise) from the internet.

There is a legal machinery that works behind the scenes which we aren't always aware of.

Huh1337 3 years ago | | |

I think you're just not looking. It might've disappeared from Twitter and Facebook, but it's still on Reddit, 4Chan and many other sites.

gedy 3 years ago | |

I think the savvy media companies realize that we're at the cusp of ai generated media - movies and music included. If we have free/open models trained on the past 100 years of media, they may become obsolete and they will fight this to the death.

Irony is the "NSFW" moral concerns, when the media companies put out such negative and filthy content as it is.

jug 3 years ago | | |

It’s interesting how we already are at a point where home users can in theory make a Star Wars fan film with Luke’s actual face and synthesized voice.

CuriouslyC 3 years ago | | |

The way Disney is churning out rehashed content for its IP they're obsoleting themselves. When your human content is more predictable and stale than something generated by an AI you should hang your head in shame.

fbdab103 3 years ago | |

Any particular repos/artifacts you suggest downloading?

lostmsu 3 years ago | | |

YALM 100B, and that giant codegen model from Salesforce https://huggingface.co/Salesforce

blackoil 3 years ago | |

I am more hopeful. Unlike Popcorn Time/Napster, these models aren't directly impacting the existing bottom line of any company/organization. Most of the models are trained on open source / public datasets, so you won't find any company to sponsor the fight against these models. The cost of these models is an issue right now, but Mr. Moore has always handled that well.

capitalsigma 3 years ago | | |

I don't think that Moore's law is what's driving down ML compute costs in particular, where there seems to be a lot of innovation going on in terms of hardware architecture and compilers (much of which is proprietary). Even just thinking about memory bandwidth, which historically has scaled much slower than compute: the $/second required to push 10+ TB of training data into some piece of hardware that can do useful work on it isn't going to fall by 100x in a decade.

samarthr1 3 years ago | | |

Ah, but SD does compete with OpenAI's DALL-E 2, no? I am not sure that will cause too much trouble, though.

Jevon23 3 years ago | | |

Unfortunately, you’re right. These models are beneficial to large corporations, and they do the most harm to the small individual artists who created the content that made them possible in the first place, so it’s unlikely there will be any serious legal challenges.

2Gkashmiri 3 years ago | |

>Popcorn time lost a similar battle.

i was actively following torrentfreak at the time and there was genuine excitement with something incredible but that only lasted a week :-(

why do you say they lost the battle? the original team threw in the towel within the week but there are people who have taken the fight

https://github.com/popcorn-official/popcorn-desktop/releases... here, the latest release was on 04 Sep 2022 so it is very much in active development with a lot of people contributing https://github.com/popcorn-official/popcorn-desktop/graphs/c...

so while the original team might not be working on it, like a true free software, the code lives.

liuliu 3 years ago | |

Model can be retrained (with some money). But data is harder. I cannot back up LAION 5B unfortunately. If you can, please do! (About 200 TB)

pmoriarty 3 years ago | | |

A distributed backup might be possible.

Get 200 interested people backing up 1 TB each and you have your 200 TB backup.

With redundancy and error correction data added to the mix, you should be able to lose a certain percentage of participants and still have access to the full, error-free backup.
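The redundancy idea above can be illustrated with the simplest possible scheme: one XOR parity shard. This is a sketch under stated assumptions, not a design for a real 200 TB backup. The helper names (`split_with_parity`, `recover_missing`) and the sample data are hypothetical, and XOR parity can only rebuild a single lost shard. Tolerating the loss of "a certain percentage of participants" would need a proper erasure code such as Reed-Solomon.

```python
from functools import reduce


def xor_bytes(a, b):
    """XOR two equal-length byte strings."""
    return bytes(x ^ y for x, y in zip(a, b))


def split_with_parity(data, n_shards):
    """Split data into n equal-size shards (zero-padded) plus one XOR parity shard."""
    shard_len = -(-len(data) // n_shards)  # ceiling division
    padded = data.ljust(shard_len * n_shards, b"\0")
    shards = [padded[i * shard_len:(i + 1) * shard_len] for i in range(n_shards)]
    parity = reduce(xor_bytes, shards)
    return shards, parity


def recover_missing(shards, parity):
    """Rebuild the single shard marked None by XOR-ing parity with the survivors."""
    missing = [i for i, s in enumerate(shards) if s is None]
    if len(missing) != 1:
        raise ValueError("XOR parity can rebuild exactly one missing shard")
    survivors = [s for s in shards if s is not None]
    rebuilt = reduce(xor_bytes, survivors, parity)
    return shards[:missing[0]] + [rebuilt] + shards[missing[0] + 1:]


# Hypothetical example: four volunteers hold data shards, a fifth holds parity.
original = b"LAION-5B metadata, split across volunteers"
shards, parity = split_with_parity(original, 4)

damaged = shards[:2] + [None] + shards[3:]  # volunteer #3 disappears
restored = recover_missing(damaged, parity)
recovered = b"".join(restored)[:len(original)]  # strip the zero padding
```

The storage overhead here is one extra shard for every four, which is the trade-off any such scheme makes: more parity means more participants can vanish before data is lost.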

l33tman 3 years ago | | |

Why would you, though? It's just a list of 5B URLs. Some might go down, some new might go up. But it's not like any government body can suddenly take down all photos on the whole internet...

manholio 3 years ago | |

> That somehow international legislation will converge on the strictest possible interpretation of intellectual property, and those models will become illegal by the mere fact they were trained on copyrighted material.

That's the only possible interpretation, really. AI models algorithmically remix input intellectual property en masse, without any significant amount of human creativity, the only thing copyright law protects. As such, the models themselves are wholly derived works, essentially a compressed and compact representation of the artistic features of the original works.

Legally, an AI model is equivalent to a huge tar.gz of copyrighted thumbnails: very limited fair use applies, only in some countries, and only in certain use contexts that generally don't harm the original author or out-compete them in the market place - the polar opposite of what AI models are.

adamsmith143 3 years ago | |

>That somehow international legislation will converge on the strictest possible interpretation of intellectual property, and those models will become illegal by the mere fact they were trained on copyrighted material.

Just feels absurd to me because how is this different from any Human artist who you could equally say was "trained" on copyrighted material.

>Get it now when there are trustworthy sources. Once these kinds of things go underground, it gets much harder to get a trustworthy version.

People have already reverse engineered most text2image models and given enough hardware can train their own. There is no need for this hysterical take. As long as the internet exists you will be able to train these models.

green_on_black 3 years ago | |

Here's a (not-recommended but amusing) nuclear option:

Tit-for-tat. Regulators and artists don't want this? Okay, include in all open source software licenses that regulators and artists are now barred from using them without payment.

Kerrick 3 years ago | | |

That would be neither an open source license (according to the OSI) nor a free license (according to the FSF).

https://www.gnu.org/licenses/license-list.html

> [...] is a nonfree license because it extends the four freedoms only to some kinds of organizations, not to all. Such a restriction in a software license, in the name of any cause whatsoever, imposes too much power over users. Please don't use this license, and we urge you to avoid any software that has been released under it.

https://opensource.org/osd-annotated

> 1. Free Redistribution

> The license shall not restrict any party from selling or giving away the software as a component of an aggregate software distribution containing programs from several different sources. The license shall not require a royalty or other fee for such sale.

> Rationale: By constraining the license to require free redistribution, we eliminate the temptation for licensors to throw away many long-term gains to make short-term gains. If we didn't do this, there would be lots of pressure for cooperators to defect.

beojan 3 years ago | | |

A lot of open source software authors don't want this either because it can circumvent copyleft and attribution requirements.

Also, discriminating like you suggest would make those licenses closed source by definition.

manholio 3 years ago | | |

What a wonderful excuse for a government to pay another 1 billion dollars on crapware to their cronies, who will outsource it to some incompetent software sweatshop on the other side of the world.

We can barely get governments to use open source even today, without restrictions. Hell, we can barely make them manage source code for commercial products they commission and pay for. I've walked into govt shops that were 100% binary-dependent on the original software author, who never delivered source code and charged them through the nose for basic servicing.

Like it or not, the government and regulators represent us, we need individual accountability but harming the govt. directly harms ourselves firstly. The bureaucrats and the corrupt hardly care.

didibus 3 years ago | |

The UK and the EU have already made it law that text and data mining is excluded from copyright for non-commercial uses, and the UK has even done so for commercial use cases.

Personally, I think commercial use cases should get license agreements from the authors for their training data, but I think non-commercial exemptions to advance the field of AI makes sense.

Regardless of what I think though, the UK has set an international precedent, and the EU is apparently discussing possibly extending it to commercial use cases as well. So there's that.

corndoge 3 years ago | |

I agree that it’s a good idea to download everything now and I agree that the legal powers that be will probably soon force it underground - but I’m less certain the driving reason will be copyright / IP. I think it will be reasons similar to what TA hints at. People are (somewhat understandably) upset with certain classes of output the model is capable of generating and a moral panic is likely to ensue that, historically, has won most cases it’s presented itself in.

datacruncher01 3 years ago | |

I figure these tools fall in a similar category to web scraping which is legal. What you can’t do is copy the file. If you can demonstrate that you are modifying the source data then it’s a new work. Style is not protected by copyright as much as famous artists may want.

Where copyright may be applicable is when the models reproduce original art so closely that a reasonable person couldn’t tell the difference.

speleding 3 years ago | |

> those models will become illegal by the mere fact they were trained on copyrighted material

The blog post says they are worried about the ability to use the model to "use it for illegal purposes and hurting people". I think that they are referring to the ability to create all kinds of compromising pictures (porn) with celebrities, kids, etc. Am I misreading that? They don't mention copyright anywhere.

paulcole 3 years ago | | |

> Am I misreading that? They don't mention copyright anywhere.

The conspiracy theorist would say that if you were doing something you shouldn’t, you wouldn’t mention it. Instead, you’d give a more palatable excuse to buy yourself some time while you figure out how to get away (legally) with the thing you shouldn’t be doing.

stelonix 3 years ago | |

Yesterday I was backing up an old failing HD. I looked at the models I downloaded since 2014 and since I was out of time, I decided to just delete them. But I deleted them with the same thought you just shared: those old models probably don't even exist anymore, they're probably gone. I'm just hoping that time you described isn't happening anytime soon.

EGreg 3 years ago | |

Where can we get Stable Diffusion downloaded?

LawTalkingGuy 3 years ago | |

I think it'll be EU-style privacy regulations that make it illegal to train on the majority of data. Perhaps the requirement to be able to remove a user's impact from an already computed model if they file a right-to-be-forgotten request.

Something that would make any non-trivial model a legal nightmare.

ionwake 3 years ago | |

> close the Stable doors after the horses Diffused

Encapsulates it all well I like this statement, total pottery

dividedbyzero 3 years ago | |

I've followed this sort of thing rather loosely so far, any recommendations what other pre-trained models would be worth looking at?

metadat 3 years ago | |

Is there a torrent available? This is an effective way to ensure the models and information remain available indefinitely.

zakki 3 years ago | |

Can you point to a good source to download it from? Thanks.

Satam 3 years ago |

Based on a Reddit post [1], the author of this is Stability AI's chief information officer.

My very rough take on the situation: the company gained their notoriety by building on OpenAI's pioneering research but with an important twist of releasing their models as unneutered open source. Now, their openness is starting to falter due to strong pressure from outside forces.

If they're unable to continue playing the hardball game they themselves invented, I think their glory days will end as fast as they started. The competitive advantage was always their boldness. If they lose that, quickly others will take their place.

In general, I don't think tech that's as open, powerful and easily reproducible as these language models can be stopped. Sure, maybe regulations will delay it a bit, but give it a few years and any decent hacker or tinkerer will be dabbling with 5x better tech with 5x less effort.

[1] https://archive.ph/Z5sU3

pr337h4m 3 years ago |

"We’ve heard from regulators and the general public that we need to focus more strongly on security to ensure that we’re taking all the steps possible to make sure people don't use Stable Diffusion for illegal purposes or hurting people."

"What we do need to do is listen to society as a whole, listen to regulators, listen to the community."

"So when Stability AI says we have to slow down just a little it's because if we don't deal with very reasonable feedback from society and our own communities then there is a chance open source AI simply won't exist and nobody will be able to release powerful models."

Looks like someone is leaning on them :(

vasco 3 years ago | |

Two days ago: “Nobody has any voting rights except our employees — no billionaires, big funds, governments or anyone else with control of the company or the communities we support. We’re completely independent,” Mostaque told TechCrunch in a previous interview. “We plan to use our compute to accelerate open source, foundational AI.”

hansworst 3 years ago | | |

Big funds and billionaires can influence those employees with hard cash, and governments can influence those employees with threats of incarceration.

blueblimp 3 years ago | |

Plausibly OpenAI/etc. trying to get a competitor shut down.

icelancer 3 years ago | | |

Or their investors now that they raised a ton of money.

thorum 3 years ago |

The author (Stability.AI’s CIO) did an impromptu AMA on Reddit:

https://reddit.com/r/StableDiffusion/comments/y9ga5s/stabili...

His comments regarding RunwayML’s release of 1.5 were especially interesting:

> “No they did not. They supplied a single researcher, no data, not compute and none of the other researchers. So it’s a nice thing to claim now but it’s basically BS. They also spoke to me on the phone, said they agreed about the bigger picture and then cut off communications and turned around and did the exact opposite which is negotiating in bad faith.”

> “I’m saying they are bad faith actors who agreed to one thing, didn’t get the consent of other researchers who worked hard on the project and then turned around and did something else.”

icelancer 3 years ago |

His answers on reddit are downvoted and the redditors are correctly pointing out that most of these "protections" smack of the fact that his investors want to stop giving things away and to close up source / resources for better monetization strategies.

ycombinete 3 years ago | |

Reddit upvotes also doxxed the wrong person in the Boston Bombing.

minimaxir 3 years ago |

> At Stability, we see ourselves more as a classical democracy, where every vote and voice counts, rather than just a company.

After taking $100M in venture capital and two distinct drama events due to disorganization, this is unlikely to last.

jerpint 3 years ago | |

When that much money is at stake, it's hard to keep incentives aligned

aortega 3 years ago |

Powerful people are pulling strings to control AI everywhere. OpenAI is exactly the opposite of open. Now someone is pushing on Stability AI to close it up. I believe those models are more powerful or dangerous than they seem, and it got some people scared in some way.

I read that when some guys from 4chan started running the leaked NovelAI model, they generated porn non-stop for 20 hours or more, no sleep, no eating.

zaptrem 3 years ago | |

This is not at all unusual behavior for 4Chan users.

blackoil 3 years ago | |

>Powerful people

Even without conspiracy theories, these models cost up to tens of millions to generate. It's no surprise investors wouldn't like it if you gave it all away for free; there should be some revenue model.

emikulic 3 years ago | | |

I thought Emad said it cost $600k in GPU time.

SanderNL 3 years ago | |

To be fair, that's the novelty factor and when this hits "the public" it is not unthinkable that there'll be some "productivity issues".

IMO it is like finding a computer in a world without them. It is mind-blowing and it will take over your mind if you let it. For some folks that results in lots of porn, for others it'll be fear. My guess is that it'll wear off eventually.

fsociety999 3 years ago |

While they frame the post as if this is a positive and something they want to do, reading between the lines, it sounds to me like something has them rattled.

They mentioned regulators here, and I would be curious to hear the story behind that.

Don’t want to go too tin foil hat, but it makes you wonder if a certain other AI company that claims to be “open” may be afraid of a company that actually is open and is applying political pressure.

dougmwne 3 years ago | |

Oh that's a certainty. They said regulators twice. It was no accident, they are telegraphing just how hard they got smacked behind the scenes.

Extremely likely that the FAANG lobbyists went into overdrive. The big guys know this will be an extremely important industry for the coming decades and don't want a new competitor swooping in with nothing to lose when established companies are forced to be cautious.

Roark66 3 years ago |

As always in such cases this is 100% bull**. Either something is not working out for them and they have to delay in which case they could've just said so, or this is some sort of pretense to show how "responsibility minded" they are.

The reality is that bad actors have the resources to train their own stable diffusion on a dataset of whatever they want to deep fake and such delays do not slow them down one bit.

What it does slow down is normal people using those models.

From the smallest thing like MobileNetV3 through Whisper, Stable Diffusion, CodeGen, and BLOOM, those are huge productivity equalisers between the huge corpos and the little guy.

Also the same thing can be said about frameworks like Hugging Face's. Just recently I was looking for a way to classify image type (photo or not photo [clip art, cartoon, drawing]) in an Android app. Of course the first hits on Google steer towards Microsoft Azure's paid API service. I was unhappy with having to use an over-the-Internet API (with potentially sensitive end users' private pictures), so in one day of work I managed to download a pretrained MobileNetV3 and a couple of 10k+ image datasets, and I wrote <50 lines of Python to tweak the last layer and fine-tune the network. On an RTX 2070 training took 10 minutes. Resulting accuracy on real data? 90%+. The model loads and infers in a few hundred ms on modern phones (instantiating and loading takes longer than the inference, BTW). This is priceless and 100% secure for end users. For those interested in the details, I use ncnn and Vulkan for GPU (mobile!) inference.
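To make the "tweak the last layer" step concrete, here is a framework-free toy sketch of the general pattern: keep a feature extractor frozen and train only a small classification head on top of it. This is not the commenter's code and does not use MobileNetV3. The `frozen_backbone` function is a hand-written stand-in for a pretrained network, the 2-D toy data replaces real images, and all names and hyperparameters are assumptions chosen for illustration.

```python
import math
import random


def frozen_backbone(x):
    """Stand-in for a pretrained feature extractor; it is never updated."""
    s, d = x[0] + x[1], x[0] - x[1]
    return [max(0.0, s), max(0.0, -s), max(0.0, d), max(0.0, -d)]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def train_head(samples, labels, epochs=200, lr=0.5):
    """Fit only the final linear layer (weights + bias) by gradient descent."""
    # Features are computed once because the backbone is frozen.
    feats = [frozen_backbone(x) for x in samples]
    w = [0.0] * len(feats[0])
    b = 0.0
    n = len(feats)
    for _ in range(epochs):
        grad_w = [0.0] * len(w)
        grad_b = 0.0
        for f, y in zip(feats, labels):
            err = sigmoid(sum(wi * fi for wi, fi in zip(w, f)) + b) - y
            for i, fi in enumerate(f):
                grad_w[i] += err * fi
            grad_b += err
        w = [wi - lr * g / n for wi, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def predict(x, w, b):
    f = frozen_backbone(x)
    return 1 if sum(wi * fi for wi, fi in zip(w, f)) + b > 0 else 0


def make_toy_data(n_per_class, rng):
    """Two well-separated clusters standing in for 'photo' vs 'not photo'."""
    samples, labels = [], []
    for label, centre in ((0, -1.0), (1, 1.0)):
        for _ in range(n_per_class):
            samples.append([centre + rng.uniform(-0.3, 0.3),
                            centre + rng.uniform(-0.3, 0.3)])
            labels.append(label)
    return samples, labels


rng = random.Random(0)
train_x, train_y = make_toy_data(100, rng)
test_x, test_y = make_toy_data(50, rng)

w, b = train_head(train_x, train_y)
accuracy = sum(predict(x, w, b) == y for x, y in zip(test_x, test_y)) / len(test_y)
```

Because only the head's handful of parameters are trained, the loop is cheap, which is the same reason fine-tuning a pretrained network's last layer can finish in minutes on a consumer GPU.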

Every commercial model maker's wet dream is to expose the model through an API, lock it behind a firewall and have people pay for access. This is not just hugely inefficient. It is insecure by design.

Take Copilot for example. I'm perfectly happy for all my hobby-grade code to be streamed to Microsoft, but no chance in hell I'll use it on any of my commercial projects. However FauxPilot run locally is on my list of things to try.

The first AI revolution was creation of those super powerful models, the second is the ability to run them on the edge devices.

fxtentacle 3 years ago |

I think the most important part is this comment:

https://danieljeffries.substack.com/p/why-the-future-of-open...

The people that he discredits as "leak the model in order to draw some quick press to themselves" are the researchers that are named in the Stable Diffusion paper. Yes, Stability.AI gave them lots of money. But no, they are not leaking the model, they are publishing their own work. It's university researchers, after all. And Stability.AI does NOT own the model.

13of40 3 years ago |

Two thoughts I've had about Stable Diffusion:

1. The web UIs I have used are taking advantage of the same mental pathways as an electronic slot machine. Just like you can max out your bet on a slot machine and mash a button until you run out of credits, you can do the same on the hosted stable diffusion apps until you get a shareable hit.

2. Just like the dream you had last night, nobody wants to hear about it at breakfast, no matter how epic it was, because it's not backed by any meaning.

That said, I love stable diffusion and am an addict to it almost every day.

notacanofsoda 3 years ago |

1) Who is Daniel Jeffries? There's no explanation of how he's related to Stability.

2) StabilityAI gave RunwayML compute time for them to train Stable Diffusion (they're also the creators of the original model). It's weird to categorize them as "other groups leak the model". They're the ones that created the model! (Source: https://huggingface.co/runwayml/stable-diffusion-v1-5/discus...)

minimaxir 3 years ago | |

He is the very new CIO at Stability, apparently: https://twitter.com/Dan_Jeffries1/status/1575068030367059968

ZephyrBlu 3 years ago | | |

Why does an AI company have a Chief Investment/Information (?) Officer?

cactusplant7374 3 years ago | |

I looked at Jeffries' Twitter. That did not clear anything up.

lairv 3 years ago |

The discourse has already changed quite a bit since the first release, which was only 2 months ago, and is getting alarmingly close to OpenAI's "we must delay release of XXX for safety reasons". It was probably to be expected: OpenAI are not just morons who decided to freeze open-source progress, there are likely legal reasons behind it. But adding to that last week's dramas, I am not very bullish on StabilityAI; hope I'll be proven wrong.

Beaver117 3 years ago |

So you want it to be open source, but not too open, because then bad people will use it. Good luck with that. If you want to filter everything behind a SaaS like OpenAI go ahead, but then you can't call it open source. And maybe that would have been the right choice. But Pandora's box is open now.

jillesvangurp 3 years ago | |

Exactly, that cat is out of the bag. Right now the hard part is not using the models but creating the models. It requires a significant commitment in resources and there are only a handful of companies with those resources. And you need some skilled people to babysit the software and the algorithms.

However, that will inevitably spread to include more and more companies and will also start happening outside the US. All the research around this is being published and there's a lot of open source code that facilitates this. So, it's just a matter of people optimizing and improving that and hardware getting cheaper.

I expect that once that market is big enough, you'll see cloud providers step up with provisioning infrastructure for this stuff. It will still be expensive to use but it won't have a lot of limitations.

AI driven porn is basically the obvious use-case where there are some big companies with lots of money operating in that space and plenty of incentive to make this happen. Morally that might actually be preferable to exploiting people as is their current way of operating. The likes of OpenAI won't be able to do much to stop that.

dang 3 years ago |

We replaced the title, which has a whiff of corporate press release about it, with what appears to be a representative phrase from the article body. If there's a more representative phrase, we can change it again.

p3opl3s 3 years ago |

You can't comment unless you pay to subscribe.. lol - isn't that a company blog post?

Anyways.. this shit grinds me.. yet another "open source" AI project pretending to be for the people.. finally get a massive valuation and now it's all "we must be security conscious"..

Hypocrites, and here is an interview with the founder of Stable Diffusion stating the exact opposite approach by "having faith in people"!

https://youtu.be/YQ2QtKcK2dA?t=704

moralestapia 3 years ago | |

He just says what he needs to at any given moment to maximize his profit. People nowadays are like the chicken from that (alleged) Stalin anecdote.

jackblemming 3 years ago |

Guys I'm going to release an invention called the car, but my security team needs to make sure it's safe and won't be abused by drunk drivers. Next I plan to release an invention called the gun, but please hold your horses, because it could be abused. I need to double check and make sure it's safe to release this piece of equipment.

machinekob 3 years ago |

All this is PR talk after a few dramas with immoral activities.

They got 100mil USD in funding and I feel like the pressure is squeezing them hard as they are trying to monetise models, but how do you monetise open source models when someone can just fine-tune your weights and make a better/faster/cleaner model and software without losing 10mil+ on training the original?

You are always a few mil behind rivals, and after the past few weeks, which were a PR nightmare, they lost most of the "community driven" advantage.

I feel like they are extremely desperate for attention (drama was artificially created cause it clicks conspiracy) or they are just so chaotic and lack proper leaders that everything is burning.

passion__desire 3 years ago | |

Emad mentioned governing Stability AI as a DAO of DAOs. If they can't even run a traditional company properly, forget about the DAOs chaos.

d3nj4l 3 years ago | |

Can you provide any context on the "past weeks" drama? Haven't been following this space closely so I don't know what happened. Honestly surprised about this turn because I used to follow Emad on twitter and he was always strongly for open models.

lbotos 3 years ago | | |

There are local WebUIs. Someone named AUTOMATIC1111 has a popular one. They added in a "leaked" model. Stability.ai banned AUTO from their discord and accused him of "stealing/promoting piracy". Community enflamed. They then apologized and let him back in.

Stability.ai took over the /r/stablediffusion subreddit. Community enflamed. They then turned the subreddit back over to the community

Stability.ai delayed 1.5 model. And now sent this justification. Community enflamed.

imhoguy 3 years ago |

Well, models will be taken down anyway (at least attempted), save whatever you can put hands on. It is happening, govt is just catching up with this rapid situation:

https://www.federalregister.gov/documents/2022/10/13/2022-21... (AI mentioned 4 times)

https://eshoo.house.gov/sites/eshoo.house.gov/files/9.20.22L... (at the very end "export controls" are mentioned multiple times)

EMIRELADERO 3 years ago | |

Keep in mind that the last link points to a letter that's (allegedly) only from one member of the House of Representatives.

What people need to understand is that the bar for worryingness shouldn't be "government looking into it".

Governments look into things all the time, and in such a diverse environment as the U.S. legislative branch, we cannot just pack every opinion of every member into a single "government" monolith. That is why, in fact, we even have legislative systems with different representatives from different parties at all. This isn't an undesirable effect, this is how it's supposed to work, and in a good way.

charcircuit 3 years ago |

>NSFW policies

Ugh. It feels like so many of these models are trying to censor NSFW material.

blueblimp 3 years ago | |

Imagine if Photoshop were deemed somehow unsafe without a mechanism to prevent the user from creating NSFW images. The panic around image generation models is absurd.

Gigachad 3 years ago | |

The NSFW images coming out of SD are hilarious. Nude people with 3 arms. Another torso coming out of the neck, etc.

kmeisthax 3 years ago | |

Nobody making or studying AI wants to hear "your model is being used to generate copious amounts of CSAM". That's basically a death sentence for the technology - even moreso than "your model is just an unattributed search index of stolen art and code". The easiest way to avoid this is to just ensure the model refuses to generate anything NSFW.

charcircuit 3 years ago | | |

It's not a death sentence and journalists will just find a reason to hate on your project for clicks no matter what you do. Also no children, or adults, are sexually abused when an NSFW image is generated.

f0e4c2f7 3 years ago |

Once upon a time there was a company called OpenAI that was going to do for AI what open source did for software.

I think OpenAI changing their revenue model and corporate structure to better reflect how much money they were about to make really left a mark on the internet around trust in the AI space.

The default is going to be to assume that AI companies like stability have sold out, to that end it would not surprise me if even this minor incident leads to a splitting and a new open model that becomes popular.

I understand the point the author is trying to make. I understand what OpenAI is getting at with safety. I understand what the regulators are getting at.

But it is too late. The genie is already out of the bottle and granting wishes. What are you going to ban at this point? Math education?

It's time to accept that it's not that hard to come up with a few A100s and train models for harm if that's your goal. You can write code that harms people too. The answer is not to ban code. The answer is not to heavily regulate AI (not all countries will regulate it, it will be like banning gunpowder or electricity).

As for this particular release - what is being implied they were going to wait for? Figuring out the model? Regulation? The internet to start acting calm and reasonably? We don't even know what these models fully do yet. It's hard to imagine what you could know in 6 months vs now that would allow you to release with a big thumbs up.

More and more I'm realizing how politically controversial AI will become. Already today we're starting to see that on various axes. I think weirdly in a few years it may be a top issue.

jerpint 3 years ago |

Isn't the whole point of open source, so long as licenses and attributions are respected, that anyone is free to do whatever they please with these models and their redistribution?

eddiewithzato 3 years ago | |

That’s the FSF definition yea, but lots of developers nowadays don’t agree with the FSF. For example some people don’t want companies to use their open source libraries/code for a profit.

wyldfire 3 years ago | | |

> some people don’t want companies to use their open source libraries/code for a profit.

IMO the AGPL goes a long way to solving that problem. But if the AGPL is not for you, I suppose you could use some non-commercial license terms. It seems like "closed source" is a much better fit for folks who want a great deal of control over licensees. In practice, "closed source" code can be published for licensees to see but instead of granting terms to all comers you could force people to ask you for a license, review their use case and only then decide to grant a copyright license -- with or without source.

seba_dos1 3 years ago | | |

That's obviously not an open source code then.

notacanofsoda 3 years ago | |

Sounds like they wanted to remove NSFW training data and re-train their models before release.

bombcar 3 years ago | | |

They’re desperately trying to avoid the “stable diffusion makes child porn” articles from flowing

jaimex2 3 years ago |

Looks like the fun police arrived.

Seriously, fire any coward lawyers erring on the side of caution and get some that are versed in the NRA playbook.

ok123456 3 years ago |

I'm really tired of this infantilizing garbage. People are always going to use new technologies in ways the people didn't anticipate.

So, they're going to delay their release so that if you type a naughty word it won't make a naughty image. You know what happens within hours? Someone releases a modified version of the weights that over corrects it back and makes it even more naughty.

roel_v 3 years ago |

Is there any work being done on trust-less (or maybe trust web) distributed model training? The main problem today is that training the model is being gatekept (is that a word?) by actors with 100's of k's of $. If there would be a way to run a client, like SETI@home, that will train models, then a few thousand unsophisticated users with 30x0's and some weeks/months of time will do to model training what bittorrent did to mp3 distribution. But for this to work, you need some way to feed images to users, ensure that images aren't re-used, somehow guard against malicious actors injecting faulty data etc.
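To make two of those requirements concrete, here is a rough Python sketch, not a description of any existing system. `assign_shards` is a hypothetical helper that hands each image id to exactly one volunteer so nothing is re-used, and `aggregate` combines volunteers' updates with a coordinate-wise median, one common way to blunt a minority of wildly wrong (or malicious) contributions. Real volunteer training would also need verification, networking and far more careful defenses.

```python
from statistics import median


def assign_shards(image_ids, n_workers):
    """Round-robin split: every image id goes to exactly one worker."""
    return {w: image_ids[w::n_workers] for w in range(n_workers)}


def aggregate(updates):
    """Coordinate-wise median of equally sized update vectors.

    A single outlier cannot drag the result arbitrarily far, unlike a
    plain average.
    """
    return [median(coord) for coord in zip(*updates)]
```

For example, with four honest updates near `[1.0, 2.0]` and one bogus update of `[1000.0, -1000.0]`, the median stays within the range of the honest values, whereas a mean would be pulled far off.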

muaytimbo 3 years ago |

This guy already sounds, in his own words, "neutered". "We have to listen to regulators, we have to listen to the community, etc" there are no regulations, and even if there were, imagine if Uber, Lyft, AirBnB, Tesla, or other startups had taken this position. Listening to regulators / the community / anyone without a stake in the company is literally the quickest way to get killed by regulators captured by incumbent competitors.

moneycantbuy 3 years ago |

download while you can. i really hope this isn’t the beginning of the end for stable diffusion or true open ai. it’s too good to not piss off powerful people. we must keep real open source ai alive, otherwise it’ll only be billionaires like zuck and elon force-feeding us poisonous saccharine.

fleddr 3 years ago |

Yeah, you can stop pretending that the neutering is the right thing to do, clearly it's something you somehow are forced to do, due to some serious threat you received.

Retr0id 3 years ago |

Depending on how the legislation plays out, I can foresee a "pirate bay for ML models" popping up.

yellow_lead 3 years ago |

> We are forming an open source committee to decide on major issues like cleaning data, NSFW policies and formal guidelines for model release.

I don't see how NSFW photos can easily be stopped from being generated, with the model being open source. Maybe the model could be heavily pre-filtered to remove any photos that could possibly be used for NSFW images.

lbotos 3 years ago | |

My understanding is it was trained on this dataset: https://rom1504.github.io/clip-retrieval/?back=https%3A%2F%2...

Which has a LOT of NSFW images in it. I suspect if you removed them from the training set it would go a long way to curb NSFW output but as you say people could easily train their own NSFW latent diffusion model.

kmeisthax 3 years ago | | |

My experience with Stable Diffusion is that it has a habit of tripping the NSFW filter - as in, it actually generated NSFW images - on prompts that were entirely innocuous. Would not be surprised if Stability has a huge "how do we get it to stop spitting out porn so easily" problem.

ggm 3 years ago |

This appears to be BOTH an IPR statement and a social policy statement.

I tend to think they are conjoined, but clarity helps.

On the social harms side, I think they need to be careful to under-promise and over-deliver. The likelihood of preventing social harms is frankly close to zero; what they can do is make it more complicated.

Think like this: use stable diffusion to make one "actor" dance a lambada in the left field and save it. In a new state, make a different "actor" dance a lambada in the "right" field. Now using alpha masks combine the two actors. Can this represent sexy dancing? You bet your sweet bippy.

Promising not to release "two person sexy dancing" in this situation would be over-promising. Sure, it was done outside of the AI by masks. Will the law makers care?

(for actor and lambada and sexy dance, substitute whatever contextually means "harm" in a two-actor situation, semantically)
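The mask step is ordinary image compositing and needs no AI at all. As a minimal sketch in Python (frames are assumed to be same-sized nested lists of grayscale floats; the function name is made up for illustration):

```python
def composite(left, right, mask):
    """Blend two frames pixel by pixel.

    mask value 1.0 keeps the `left` pixel, 0.0 keeps the `right` pixel,
    and values in between mix the two linearly.
    """
    return [
        [m * l + (1.0 - m) * r for l, r, m in zip(lrow, rrow, mrow)]
        for lrow, rrow, mrow in zip(left, right, mask)
    ]
```

Any image editor does the same thing, which is the point: the combination happens entirely outside the model.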

visarga 3 years ago | |

AIs, Photoshop, pen and paper are just tools that are handled by people, you can't preemptively prevent people from doing illegal things.

julienreszka 3 years ago |

I really want to be kind but all I see in this article is some corpo speech.

diebeforei485 3 years ago |

> We’ve heard from regulators

Who are these regulators?

nnx 3 years ago | |

It's quite concerning indeed. Is there any country with a regulator for opensource software now? AI?

obert 3 years ago |

all these companies being responsible and protecting us from “bad AI” are just delaying the inevitable.

With hardware prices going down and new GPUs and better algorithms coming to light, it’s only a matter of a few years until anybody will be able to train custom versions as powerful as today’s AI, without protections, probably biased, etc.

Sure, they will be 5-10 years behind big corps, but it won’t matter once poor man's AI is good enough to matter.

marmada 3 years ago |

My hope (for codex, stable diffusion, etc.) is that the models become so popular that it will be impossible to legislate them for issues like copyright. I think there might be a limited window before legal repercussions start happening -- so hopefully the models are in extremely wide spread use by then

yieldcrv 3 years ago |

I think Daniel Jeffries believes everything they just wrote.

Their new handlers can do anything to the contrary and are incentivized to curb release as well. The market is saying their new handlers are going to do that.

So we enjoy you proving us wrong!

c7b 3 years ago |

Realistically, those guys are facing a choice between an option for a very comfortable early retirement and getting roadblocked / litigated into oblivion. Can you really blame them?

pabs3 3 years ago |

I wouldn't call Stable Diffusion "Open Source AI", the training data isn't publicly released under open source licenses. I like the Debian Deep Learning Team's Machine Learning policy for evaluating these things:

https://salsa.debian.org/deeplearning-team/ml-policy

seydor 3 years ago |

Have to find a way for content makers to make money/jobs through the system. Google search solved that by providing ad revenue to content makers, or else they'd have removed all their content by now.

TheArcane 3 years ago |

>Help us make AI truly open, rather than open in name only.

That has to be a dig at OpenAI

habibur 3 years ago |

This should have been expected.

Open source or not they are funded. And that funding needs to generate profit one way or the other.

This first release gave them the popular attention which they needed. It was successful.

can16358p 3 years ago |

> Help us make AI truly open, rather than open in name only.

oth001 3 years ago |

Still don't see them trying to use a dataset that they own the licenses to...

rafaelero 3 years ago |

And so it starts...

bgi_909 3 years ago |

seeks

chatterhead 3 years ago |

So if someone buys the rights to an artist's work and that artist is dead can they start using Stable Diffusion to create new works of art they can claim as "by the artist"?

whywhywhywhy 3 years ago | |

Hollywood is about to start buying actor image rights and performance data to continue producing movies starring them after death, so very likely legislation will be made that does make these things part of the artist canon.

chatterhead 3 years ago | | |

If they bring Chris Farley back I'm going to be very very angry.

wnkrshm 3 years ago | |

You can claim it but the emperor is naked, except if the artist actually made generative models that you can run, then the model can produce new art - but I feel it still would have been made by the original creator, not by whoever buys the model.

isitmadeofglass 3 years ago |

It’s weird they don’t mention their horrendous failures or attempts to take over all the independent social media groups. I expect that slowed them down quite a bit as well.

Use-based restrictions as referenced in paragraph 5 MUST be included as an enforceable provision by You in any type of legal agreement (e.g. a license) governing the use and/or distribution of the Model or Derivatives of the Model, and You shall give notice to subsequent users You Distribute to, that the Model or Derivatives of the Model are subject to paragraph 5. This provision does not apply to the use of Complementary Material. You must give any Third Party recipients of the Model or Derivatives of the Model a copy of this License; You must cause any modified files to carry prominent notices stating that You changed the files; You must retain all copyright, patent, trademark, and attribution notices excluding those notices that do not pertain to any part of the Model, Derivatives of the Model. You may add Your own copyright statement to Your modifications and may provide additional or different license terms and conditions - respecting paragraph 4.a. - for use, reproduction, or Distribution of Your modifications, or for any such Derivatives of the Model as a whole, provided Your use, reproduction, and Distribution of the Model otherwise complies with the conditions stated in this License. Trademarks and related. Nothing in this License permits You to make use of Licensors’ trademarks, trade names, logos or to otherwise suggest endorsement or misrepresent the relationship between the parties; and any rights not expressly granted herein are reserved by the Licensors.

Company StabilityAI has requested a takedown of this published model characterizing it as a leak of their IP. While we are awaiting a formal legal request, and even though Hugging Face is not knowledgeable of the IP agreements (if any) between this repo owner (RunwayML) and StabilityAI, we are flagging this repository as having potential/disputed IP rights.

Hi all, Cris here - the CEO and Co-founder of Runway. Since our founding in 2018, we’ve been on a mission to empower anyone to create the impossible. So, we’re excited to share this newest version of Stable Diffusion so that we can continue delivering on our mission. This version of Stable Diffusion is a continuation of the original High-Resolution Image Synthesis with Latent Diffusion Models work that we created and published (now more commonly referred to as Stable Diffusion). Stable Diffusion is an AI model developed by Patrick Esser from Runway and Robin Rombach from LMU Munich. The research and code behind Stable Diffusion was open-sourced last year. The model was released under the CreativeML Open RAIL M License. We confirm there has been no breach of IP as flagged and we thank Stability AI for the compute donation to retrain the original model.