Facebook LLAMA is being openly distributed via torrents

Facebook LLAMA is being openly distributed via torrents (github.com)

909 points by micro_charm 3 years ago | 693 comments

Tiberium 3 years ago |

It seems that the leak originated from 4chan [1]. Two people in the same thread had access to the weights and verified that their hashes match [2][3] to make sure that the model isn't watermarked. However, the leaker made the mistake of including the original download script, which contained their unique download URL, in the torrent [4], so Meta can easily find them if they want to.

[1]: https://boards.4channel.org/g/thread/91848262#p91850335

[2]: https://boards.4channel.org/g/thread/91848262#p91849717

[3]: https://boards.4channel.org/g/thread/91848262#p91849855

[4]: https://boards.4channel.org/g/thread/91848262#p91850503

narrator 3 years ago | |

It's funny that part of the 4chan excitement over this is that they think they'll get back the AI girlfriend experience of when character.ai was hooked up to uncensored GPT-3. All that has been thoroughly shut down by character.ai and Replika and they just want their girlfriends back.

dmix 3 years ago | | |

The Replika subreddit became one of the weirdest places on the internet when their model got capped for adult content.

https://www.reddit.com/r/replika/

Hundreds of men (and yes women) full on acting like they lost a spouse and posting constantly about it for weeks. AI is going to create some unusual social situations the general public isn't ready to grasp. And we're only in the early alpha stages.

abandonliberty 3 years ago | | |

I'm curious if the blocking of adult content has to do with moralism, commercial interests, or something deeper.

An eager-to-please conversational partner who can generate endless content seems quite dangerous and addictive, especially when it crosses over into romantic areas. There are already posts of people spending entire days interacting with LLMs, using them as their therapist, romantic partner, etc.

Combined with findings like social engineering through prompt injection on Bing [1], the potential for systems that can manipulate people is clear.

While some of us may think that the LLMs appear ultimately limited in their capabilities, there's a ton of specific applications where they're more than sufficient, including customer service chat bots and telephone scams that target vulnerable people. It's only a matter of time until scammers stop using international call centers and switch over to something powered by these technologies.

[1]: https://news.ycombinator.com/item?id=34976886

Stagnant 3 years ago | | |

Correct me if I'm wrong but doesn't character.ai use their own model and isn't associated with OpenAI? At least I can't find any information that would claim so.

Anecdotally, as a roleplaying chat experience, char.ai seems to perform way better than anything else publicly available (doesn't get repetitive, very long memory). It also feels different to GPT3 on how it is affected by prompts.

I've just assumed that char.ai is doing its own thing as it was founded by two engineers who worked on google's LaMDA.

IAmNotACellist 3 years ago | | |

Oh, they will. And they'll exceed it.

Look at what fueled SD's ultimate K.O. of DALL-E 2: extremely high-quality custom-tailored porn images, one sentence away. The top models on civitai are all about it.

im3w1l 3 years ago | | |

I think it's funny that out of all the scifis I know, Chobits of all things is looking to be the most accurate.

xg15 3 years ago | | |

...and of course it's fucking 4chan. Somehow I'm neither surprised they actually got hold of the model - nor that they did so as part of the quest to build their very own virtual anime robot sex slave - I mean "girlfriend" - harem.

It's all somehow par for the course but I'm still wondering when exactly we switched to the satire version of reality.

machiaweliczny 3 years ago | | |

Porn and games move world forward :)

jimbob45 3 years ago | | |

I’m sure the CAI filter will magically stop filtering as much now that they have actual competition.

slowmotiony 3 years ago | | |

I'd want an uncensored GPT-3 too and I don't want an AI girlfriend - I just find that chatgpt has too much moral censorship to be fun to use. Want to ask about a health condition? Nope, forbidden. Have a question related to IT security? That's a big no-no. Anything remotely sexual even in educational context? No can do. Yesterday I finished watching a TV show about French intelligence and asked it to recommend some good books about espionage - it told me I shouldn't be reading such things because it's dangerous.

I ended up deleting my account. I won't allow some chatbot made by a couple of 20-year-old Silicon Valley billionaires to teach me about ethics and morality.

mwill 3 years ago | |

Off topic, but I clicked around /g/, which I haven't done in probably more than a decade, and a thread caught my eye about learning to code. The replies were overwhelmingly of the position that it is useless, and you will be replaced by AI before you can get a job if you start learning now.

I think that's nonsense, and 4chan is bent towards pessimism but it's still surprising to me.

anigbrowl 3 years ago | | |

/g/ is ridiculously overdramatic (and often offensive, though much less so than the political boards where the nazis fester), but regularly interesting. Agree that the pessimism here is misplaced, but not by much. The main change I see is not that AI will render coding or coders superfluous, but that it will massively shift the economics in favor of solo developers and small teams that don't have access to significant capital.

CamperBob2 3 years ago | | |

Yes and no. If you expressed interest in learning to program and were handed a book on x86 assembly language, most people would call that a waste of time. Even if you succeed at learning x86 as your first language, the knowledge will not be especially useful when employers are looking for fluency in modern C++ or Rust or whatever. It never hurts to have a solid grasp of the low-level fundamentals, of course, but it's not the name of the game. Not anymore.

The way I think of it is, all current programming languages are now assembly languages. Coding will not go away -- not by any means -- but the job will be utterly unrecognizable in ten to fifteen years.

And it's about fucking time.

I just picked up a new 13900k / RTX4090 box the other day at the local white-box builder. I was telling my partner how cool it was that it could do almost a trillion calculations per second on the CPU, and maybe 40x that on the graphics card. "How does that compare to the big mainframes from the late 60s?" she asked. "About ten million times faster. But I still program the same way those guys did, using almost the same language and tools. How weird is that?"

boole1854 3 years ago | | |

4chan has been in full doomer mode for years. It didn't used to be, from what I remember, though I was never an active denizen.

I'd love to understand the sociology behind the change in vibe that happened there.

causi 3 years ago | |

Just a warning to readers, I would not recommend clicking 4chan links while at work.

jpeter 3 years ago | | |

Fortune favors the brave

m4jor 3 years ago | | |

magnet:?xt=urn:btih:ZXXDAUWYLRUXXBHUYEMS6Q5CE5WA3LVA&dn=LLaMA

weberer 3 years ago | | |

/g/ is one of the SFW boards

abxytg 3 years ago | | |

if you don't have the leeway to say "I was looking at the 4chan thread where metas LLM was leaked" you shouldnt even be on hacker news tbh. get back to work!

archon1410 3 years ago | |

It has only just occurred to me that 4chan's technology board is /g/ because it's tech-naw-la-G

lolpython 3 years ago | | |

> The board letter /g/ stands for gijutsu (技術), the Japanese word for technology

https://wiki.installgentoo.com/wiki//g/#:~:text=%2Fg%2F%20is....

lukeplato 3 years ago | |

It would be interesting if there was a WikiLeaks-type of organization that facilitates safely leaking large models from big corporations.

Not sure how that would play out for accelerationism and existential risk, but I certainly don't trust the current powers that be.

Name_Chawps 3 years ago | | |

Open sourcing is widely recognized to be a bad thing when it comes to AI existential risk. (For the same reason you don't want simple instructions for how to build bio weapons posted to the internet.)

Modern AI is pretty harmless though, so it doesn't matter yet.

Cipater 3 years ago | |

Why do 4chan users go out of their way to be so offensive in their posts?

gpm 3 years ago | | |

Because we took the set of internet users, and sorted everyone who wants to be intentionally offensive into 4chan. Which means there's not only a high density of people who like being intentionally offensive there, but that being intentionally offensive is socially rewarded, so over time 4chan users grow to want to be more and more intentionally offensive.

IAmNotACellist 3 years ago | | |

I think because you can't be anywhere else on the Internet anymore. It's like the system's pressure relief valve. A blaring steam whistle that's only getting worse and worse the more the Internet squeezes elsewhere.

weberer 3 years ago | | |

Because the bump system combined with the finite number of threads incentivizes threads that get the highest number of replies per second. And the best way to increase replies per second is to start an internet fight.

selfmodruntime 3 years ago | | |

Because it's really the only place left to go if you want to be offensive. First forums, then platforms censored offensive people out of their niche places. Even Cloudflare participates. 4chan remains the only privately owned large forum.

tekla 3 years ago | | |

It keeps people out that are unable to separate the internet from real life.

_blz2 3 years ago | | |

because it is effective in keeping a certain type of people out

orangepurple 3 years ago | | |

The same reason Penicillium molds produce β-lactam antibiotics. There doesn't have to be an intelligent reason, just a survival trick.

garbagecoder 3 years ago | | |

It's to preserve the users' mental health.

yamrzou 3 years ago | |

Archived: https://archive.is/lGrH8

oh_sigh 3 years ago | |

What kind of recourse would meta have here? Sue him for breach of contract?

femi-lab 3 years ago | | |

They almost surely anticipated that this would happen at some point (though perhaps not so soon). They would look like major ass holes for dragging some post doc or whatever through the courts to make a point; would not be good for brand at all.

But it does give them cover for whatever people end up doing with it - they can claim they did all they could to support research while promoting safety.

pavlov 3 years ago |

It’s interesting that these models are both massively expensive to produce and self-contained to a degree that you can distribute the end product in a torrent.

This has not been the case for most commercial software for the past 20 years, during the cloud era. If you could steal a dump of random Facebook source code, it would be 99% useless because it’s so closely tied to the infrastructure. There’s almost nothing you could usefully run on your own PC or server VM.

But these ML models are like neutron stars of computation density. You can’t really peek inside to see what’s going on either. An unknown stolen model’s properties would need to be discovered by experimentation.

ot 3 years ago |

In case it's not clear what's happening here (and from the comments it doesn't seem like it is), someone (not Meta) leaked the models and had the brilliant idea of advertising the magnet link through a GitHub pull request. The part about saving bandwidth is a joke. Meta employees may have not noticed or are still figuring out how to react, so the PR is still up.

(Disclaimer: I work at Meta, but have no relationship with the team that owns the models and have no internal information on this)

espadrine 3 years ago | |

> Meta employees may have not noticed or are still figuring out how to react

Given that the cat is out of the bag, if I were them, I would say that it is now publicly downloadable under the terms listed in the form. It is great PR, which if this was unintentional, is a positive outcome out of a bad situation.

dmix 3 years ago | | |

Facebook fumbling its way into being the better open source AI than OpenAI would be amusing.

koheripbal 3 years ago | | |

How likely is it that there is a larger model that they haven't discussed?

IanCal 3 years ago | |

It's not even clear someone has leaked the models. A random person has put a download link on a PR, it could be anything.

sebzim4500 3 years ago | | |

The folder structure definitely looks like model weights, I didn't download or run it though so for all I know it only generates the words to "Never Gonna Give You Up".

rnosov 3 years ago | | |

An HN user (with >3k karma) seems to confirm the leak. Take it for what it's worth.

ot 3 years ago | | |

Yes you're absolutely right. I went by another comment that seemed to confirm the contents, but that could be trolling too.

ddtaylor 3 years ago |

FWIW this information was already freely available via DHT scrapers like btdig [1]. I think everyone at Facebook knows that torrents aren't secret and the Google form is basically a legal tool to shield them from liability while making litigation against anyone misusing the model easier.

[1]: https://btdig.com/b8287ebfa04f879b048d4d4404108cf3e8014352/l...

riedel 3 years ago | |

The fun question anyway is whether an ML model is copyright-protectable. Probably not, as it is produced by an algorithm (which is even GPL'ed). So the only tools would have been watermarking and NDA-type clauses. However, a Google form does not seem the best way to do that in the first place, and it is close to impossible to identify the leaker (if they are not as stupid as it seems). Or am I missing anything? One backdoor would be if they included copyrighted material in the training data and showed how it can be extracted from the model. Maybe the whole stunt was about trying out how the legal system works in those cases :)

yieldcrv 3 years ago | | |

commercial derivative works have always been legal when you did not agree to other terms.

one person broke their agreement with Meta, they're the only person that has a problem and the only person who gets to find out if the agreement was applicable at all.

if you released a chat bot that could be prompted to regurgitate some copyrighted information, so what? it just proves that you didn't need the $30 million in funding yet to train your own because you are using an existing model. So either use the funding for that or don't sell shares or a product based on that pretext. Nobody else has a problem.

Anything I missed? Now I wouldn't reshare the model, but aside from use and commercial use of its output? Not everyone gets their way, that's not controversial.

hnfong 3 years ago | | |

photos are copyrightable by the person taking the photo only because they decided where and when to press a button. the rest are algorithms and hardware.

I believe the AI models would also be copyrightable as such, subject to arguments that the underlying data was protected and thus it was subject to prior copyrights instead

winterqt 3 years ago | |

Note that this is the leaked copy, not the original -- see 'llama.sh'.

londons_explore 3 years ago | |

btdig is blocked in the UK and many other countries. Use a USA VPN for access.

sebzim4500 3 years ago | | |

I'm in the UK and can view that link without a VPN.

ok123456 3 years ago |

Maybe this is an intentional leak to damage OpenAI.

A supposedly better model by some accounts that strikes right at the heart of their business plan of selling access for $250k/year. One month of access to their service could buy a machine capable of running this leaked model.

Facebook nerfs a potential upstart competitor to keep current big-tech cartel stable.

Maybe this is a bit conspiratorial, but we live in the age of big tech and big conspiracy.

sebzim4500 3 years ago | |

IMO it's way more likely that some random guy on 4chan leaked it than it being some vast conspiracy.

tmalsburg2 3 years ago | |

Why leak it instead of just publishing it along with a press release about openness and democratizing AI and so on?

ok123456 3 years ago | | |

because then you don't need to explain to shareholders why you're giving away something that could potentially be worth a few hundred million dollars.

cosmojg 3 years ago | | |

See: https://pbs.twimg.com/media/FqMiv31aEAAjd-3?format=jpg&name=...

tinyspacewizard 3 years ago | |

Not a conspiracy at all. See also IE, Android, Kubernetes...

GuB-42 3 years ago | | |

I am not aware of Android and Kubernetes being leaks; they were open source from the start. For Android, openness was a big marketing point. I am not aware of IE leaks, and if there were leaks, hackers searching for exploits would probably be the most interested, and that would be a bad thing for Microsoft.

The problem with leaks is that they don't come with a license, you don't have the right to use them for any legitimate purpose. No one who could afford a 250k/year license would touch that leak as it could get them in big trouble.

ok123456 3 years ago | | |

Any links about IE, Android and Kubernetes? I'm not up on these being ops.

slig 3 years ago | | |

I missed the one about K8S, do you have any resources?

underlines 3 years ago |

- how much vRAM needed to run each model parameter size?

- any inference optimization we can use similar to StableDiffusion, to bring down the vRAM requirements?

I only know about these:

- use 8bit precision

- https://github.com/bigscience-workshop/petals

- https://github.com/FMInference/FlexGen

- https://github.com/microsoft/DeepSpeed

Anything that could bring this to a 10GB 3080 or 24GB 3090 without 60s/it per token?

eigenvalue 3 years ago |

I'm not surprised-- I recently suggested that someone might try to pull an Aaron Swartz with the LLAMA weights (i.e., release them in an uncontrolled way similar to how Aaron attempted to release the JSTOR database). It's quite misleading for FB to claim that they are being so open, but then hoard the weights and only release it to a few academics. If the paper is to be believed, this is a major development, allowing you to get close to GPT3 performance on a single GPU (at least for inference on the smallest model). Clearly some renegade academic feels the same way.

WithinReason 3 years ago |

Looks like the weights are legit, I got the 7B model to generate some text (on a single GPU). Using the 1st prompt from the script it generated this:

[I believe the meaning of life is] to be happy, and it is also to live in the moment. I think that is the most important thing. I'm not really a party girl. I'm not a girl's girl. I have a really small group of close girlfriends and that's all I need. I believe in equal rights for everyone. I'm not a rebel. I don't really rebel against anything. I'm a very traditional girl, very loyal. I'm a mum's girl and I'm a dad's girl. People have a right to know what's going on. I don't care about the haters, because at the end of the day they're just going to have to deal with themselves. I've been getting more and more into fashion since I was about 16. I know I'm a little different, but so what? I think that's good. I don't think you should be like everyone else. It's my birthday, and I'll cry if I want to. I've always been a huge fan of fashion, and I've always liked to dress up

Another one:

[Building a website can be done in 10 simple steps:] 1. Defining Goals 2. Your Branding and Web Presence 3. Defining Your Marketing Strategy 4. Creating Your Website 5. Your Website Design 6. Your Website Development 7. Your Website Launch 8. Your Website’s Content 9. Your Website’s Conversion Rate 10. Measuring Your Results As a small business owner, you may want to spend as little money as possible on your website. But if you want to see a positive ROI, you will need to spend some money. Defining goals is critical when building a website. You should know what you want to accomplish with your website. You need to know what you want your website to achieve. You need to know who you want to convert to a customer. You need to know how you want to reach your goals. You need to know what the timeframe is for your website goals. You need to know what you want to get out of your website. When building a website, you need to clearly define your goals. Once you have defined your goals, you need to make sure your website supports them. If you want to reach your goals, you

aent 3 years ago |

For anyone wondering, it includes 4 models: 7/13/30/65 billion parameters. The smallest one is 14GB, the largest one is 131GB, and all four together are 235GB.

q1w2 3 years ago | |

I wonder how many people are scrambling to set this up on their startup infra.

6x24GB VRAM on 6 GPUs linked with NVSwitch is a little pricey, but totally doable.

arthurcolle 3 years ago | | |

I got it running using Colab Pro+ (immediately got a V100 40GB VRAM GPU) - the 7B model works with batch size of 8 and a max seq len of 1024

exo-pla-net 3 years ago | | |

How pricey would you estimate?

mlboss 3 years ago | |

Is it possible to run the smallest one on a consumer gpu with 24gb ram ?

MacsHeadroom 3 years ago | | |

You can do even better! You can run the second smallest one (better than GPT-3 175B) on 24GB of VRAM, i.e. LLaMA-13B. https://github.com/oobabooga/text-generation-webui/issues/14...

Tepix 3 years ago | | |

Running it is easy but you'll probably want to finetune it, too

rihegher 3 years ago | | |

I would be surprised if you can't. The smallest weight file is 14gb apparently

kaszanka 3 years ago |

Here is the magnet link for posterity: magnet:?xt=urn:btih:ZXXDAUWYLRUXXBHUYEMS6Q5CE5WA3LVA&dn=LLaMA

psychphysic 3 years ago | |

Thanks not working for me...

Not that I could run it if I downloaded it.

q1w2 3 years ago | |

Great, now how do I run it? Do I need a GPU with over 65GB RAM?

version_five 3 years ago | | |

Try this, it's for running llms that won't fit in the gpu: https://github.com/FMInference/FlexGen

rnosov 3 years ago | | |

Generally, you'll need to multiply the model size by two to get the required amount of video RAM. There are 4 sizes, so you might get away with an even smaller GPU for, say, the 13B model.
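
The rule of thumb above presumably corresponds to 16-bit weights at 2 bytes per parameter. A minimal sketch of that arithmetic, assuming the parameter counts mentioned elsewhere in the thread (7/13/30/65 billion) and counting the weights only (activations and other runtime overhead are ignored). The function name `weight_memory_gb` is hypothetical:

```python
def weight_memory_gb(params_billions, bytes_per_param=2):
    """Approximate memory for the weights alone, in GB (1 GB = 1e9 bytes).

    params_billions * 1e9 parameters * bytes_per_param / 1e9 bytes per GB
    simplifies to params_billions * bytes_per_param.
    """
    return params_billions * bytes_per_param


for size in (7, 13, 30, 65):
    fp16 = weight_memory_gb(size)      # 16-bit weights, 2 bytes each
    int8 = weight_memory_gb(size, 1)   # 8-bit weights, 1 byte each
    print(f"{size}B: ~{fp16} GB at 16-bit, ~{int8} GB at 8-bit")
```

For the 7B model this gives about 14 GB at 16-bit, which lines up with the 14GB file size reported in the thread.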

bioemerl 3 years ago | | |

Nope, more like 111gb

version_five 3 years ago |

Are there any official checksums available? I'm happy to see this, even if it's an unsanctioned stunt, because I think it's really pathetic of meta to want to gatekeep their "open" model. But ML models generally can execute arbitrary code, I'd want to make sure it's the real version at least.

zb3 3 years ago | |

> But ML models generally can execute arbitrary code

Is it the case if we're only talking about weights? I thought the rest is actually "open".

px43 3 years ago | | |

My understanding is that weights are normally stored as pickled python blobs, which means arbitrary code execution as they are unpickled.
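
To illustrate the concern, here is a harmless sketch using only Python's standard `pickle` module. The `Payload` class and the empty allow-list in `RestrictedUnpickler` are made up for this example, and a real checkpoint loader may behave differently:

```python
import io
import pickle


class Payload:
    """Unpickling this object calls a callable chosen by whoever made the pickle."""

    def __reduce__(self):
        # Harmless stand-in: a malicious file could name os.system here instead.
        return (len, ("abc",))


blob = pickle.dumps(Payload())
# The callable runs during loading; the result is its return value,
# not a Payload instance.
result = pickle.loads(blob)


class RestrictedUnpickler(pickle.Unpickler):
    """Refuse every global lookup that is not explicitly allow-listed."""

    ALLOWED = set()  # e.g. {("collections", "OrderedDict")}

    def find_class(self, module, name):
        if (module, name) in self.ALLOWED:
            return super().find_class(module, name)
        raise pickle.UnpicklingError(f"blocked global: {module}.{name}")


def restricted_loads(data):
    return RestrictedUnpickler(io.BytesIO(data)).load()
```

Plain containers such as dicts and lists still load through `restricted_loads`, while the `Payload` blob is rejected because it needs to look up `builtins.len`.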

TaylorAlexander 3 years ago | |

I am running it in docker to be safe, which works just fine.

Red_Leaves_Flyy 3 years ago | | |

Docker escapes exist and if this was released by spooks then including sandbox escapes is par. Unlikely for sure but your confidence is naïve.

4bpp 3 years ago |

Since the point seems to be lost on some of the early commenters, this appears to be a cheeky PR by someone unaffiliated with Facebook, suggesting that they put a magnet link to (what seems to be) a leak of the model weights along with the previously existing invitation to apply to receive them on their own page.

gpm 3 years ago |

Recent comment in this discussion thread of the PR

> looks like some people have been complaining about the link. it will need more seeders before we can merge into main

from someone claiming to be

> Research Scientist at Facebook AI Research. Working on [...]

and who has previously merged pull requests for a repo under https://github.com/facebookresearch

(I'm going to leave their name out of this... because it feels like that comment might come back to bite them)

kif 3 years ago |

I wonder what the memory requirements would be to run such a large model. I'd love to be able to run this model, alas my MacBook can barely run toy models.

q1w2 3 years ago | |

You would need over 65GB of RAM. There are consumer GPUs that have 48GB of RAM, and can be tethered together with NVLink. I wonder if that would work.

coolspot 3 years ago | | |

Or you can rent per-hour from vast.ai or lambdalabs for like a couple of dollars per hour.

astrange 3 years ago | | |

A Mac Studio should be able to do it since it has unified memory.

px43 3 years ago | |

Hell, I'd love to be able to buy a $30k server to run these models. I think to run BLOOM required something more along the lines of a $200k server.

londons_explore 3 years ago | | |

With code modifications, it should be possible to run this with a very modest machine as long as you're happy for performance to suck. Transformer models typically need to read all the weights per 'word' output, so if your model is 20GB and you have not enough ram or vram, but have an SSD that reads 1GB/sec, expect 3 words per minute output speed.

However, code changes are necessary to achieve that, although they won't be crazy complex.
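
The estimate above is simple arithmetic: if every generated token requires streaming all the weights from disk once, the time per token is the model size divided by the read bandwidth. A minimal sketch with the numbers from the comment (20GB model, 1GB/sec SSD). The function name is hypothetical, and caching and compute time are ignored:

```python
def tokens_per_minute(model_size_gb, read_gb_per_s):
    """Tokens per minute when each token needs one full pass over the weights."""
    seconds_per_token = model_size_gb / read_gb_per_s
    return 60 / seconds_per_token


print(tokens_per_minute(20, 1))  # 3.0
```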

VadimPR 3 years ago | | |

True, and that's why there is a project that is using volunteered, distributed GPUs to run BLOOM/BLOOMZ: https://github.com/bigscience-workshop/petals, http://chat.petals.ml.

DeathArrow 3 years ago | | |

No need to spend $30k, use Azure or AWS.

permo-w 3 years ago | | |

you can - slowly - run Bloom 3b and 7b1 on the free (trial) tiers of Google Cloud Compute if you use the low_cpu_mem_usage parameter of from_pretrained
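
As a rough sketch of what that looks like with the Hugging Face `transformers` library: the model id `bigscience/bloom-3b` and the helper names `load_bloom` and `generate` are assumptions for illustration, and `low_cpu_mem_usage=True` generally also needs the `accelerate` package installed.

```python
def load_bloom(model_id="bigscience/bloom-3b"):
    """Load a BLOOM checkpoint while keeping peak CPU memory low."""
    # Imported lazily so this file can be read without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, low_cpu_mem_usage=True)
    return tokenizer, model


def generate(tokenizer, model, prompt, max_new_tokens=20):
    """Generate a short continuation of `prompt` on whatever device the model is on."""
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `load_bloom()` downloads several GB of weights, so expect it to be slow on a free-tier machine.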

make3 3 years ago | |

you can rent a vm on aws to run it

Aissen 3 years ago |

It's nice that it's downloadable without filling a form (even though it should have been the default), a leak was bound to happen. The license is quite restrictive anyway: see RESTRICTIONS on https://forms.gle/jk851eBVbX1m5TAv5

sebzim4500 3 years ago | |

If someone just decides to use the torrent and ignore those restrictions it might finally establish precedent for whether you can copyright model weights.

dougmwne 3 years ago | | |

But even if you could copyright them, once you do some fine-tuning, they are not the same model weights!

sebzim4500 3 years ago |

Is there anything stopping anyone from using this for commercial purposes? I know that when you fill in the google form you need to agree to noncommercial use, but someone downloading this will never have agreed to that licence agreement.

injidup 3 years ago | |

I don't know. Is there anything stopping you using the latest Miley Cyrus album for commercial purposes if you downloaded it via torrent and never agreed to any licencing terms?

RobotToaster 3 years ago | | |

IANAL, but I imagine it's a legal grey area if the weights can be copyrighted? Works produced by purely mechanical means don't normally meet the threshold of originality.

counttheforks 3 years ago | | |

That's what Facebook and OpenAI are doing. They consumed tons of copyrighted content without permission and are now using it for commercial purposes. So using their model seems fair game.

DesiLurker 3 years ago | | |

IDK, it's more like finding, on the sidewalk, recipes from many great restaurant chains all mushed together by a 5th grader whose uncle stole them. Looks like a grey area to me legally, but IANAL.

unhammer 3 years ago | |

Is there anything stopping Meta (or openai etc) from using The Whole Web for commercial purposes in their LLM's?

EamonnMR 3 years ago | |

Considering ML's tenuous relationship with IP, I can't help but find this situation amusing.

RobotToaster 3 years ago |

For those who didn't check the github discussion, I don't think this pull request came from a Facebook employee, lol.

VadimPR 3 years ago |

Does this mean that with big enough compute capacity - say, Petals https://github.com/bigscience-workshop/petals which distributes the model over the internet over GPUs - we can run LLAMA?

Beaver117 3 years ago |

Funny. iirc some of the big tech companies (I think it was Google?) use torrents internally to deploy very large images to servers. Piracy is not the only use case!

regularfry 3 years ago | |

It was used for years to distribute World of Warcraft updates. No idea if it still is.

jpgvm 3 years ago | |

Ironically, it was Facebook that used torrents for binary distribution (no idea if it's still the case, that was a very long time ago).

ithkuil 3 years ago | |

It's not just for very large images. It's also useful for moderately large images/packages being deployed to many many many servers.

EamonnMR 3 years ago | |

Used it to download linux distro images back when the size of an install CD was huge.

Good times.

gpm 3 years ago | | |

Now that I think about it I wonder why we don't see it being used to distribute packages for linux distros. Seems more flexible than the current mirror system.

madmod 3 years ago |

Would there be some way to “launder” the model to make it plausibly viable for commercial use? Train a new model with the weights of this model with some kind of noise added to make it hard to tell what it is based on?

ImprobableTruth 3 years ago | |

Distillation would be the ideal way (especially because it also has efficiency gains), but as far as I know distillation for LLMs is kinda unproven.

Honestly though, even if you just finetune it, which you will want anyway for any serious commercial application, it's essentially impossible to determine the origin.

bertday 3 years ago | | |

Randomly perturbing the weights and then finetuning would probably make it impossible. If someone had access to the finetune dataset and you didn’t add noise, they could see if the finetuning curves intersect.

I guess in practice, it’ll look suspicious if you have an identical model architecture and have similar performance.

pimterry 3 years ago |

I give it a week before we see tools for subtly watermarking your secret LLM's weights, so you can trace leaks like this later.

Tiberium 3 years ago | |

The original 4chan thread seems to indicate that the leaker verified that his hashes matched with another person who had access to the weights, to make sure that the weights aren't watermarked [0]

[0]: https://boards.4channel.org/g/thread/91848262#p91849855
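
The hash comparison described above can be sketched as follows. This is a minimal illustration, not the procedure the people in that thread actually used: the choice of SHA-256 and the function names are assumptions.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in 1 MiB chunks
    so that multi-GB weight files never have to fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def same_contents(path_a, path_b):
    """True if both files hash to the same digest."""
    return sha256_of_file(path_a) == sha256_of_file(path_b)
```

Two people who obtained the weights independently would each run `sha256_of_file` on their own copy and compare the digests. A per-recipient watermark embedded in the files would make the digests differ.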

optimalsolver 3 years ago | | |

The leaker accidentally doxxed themselves by adding the original download script to the torrent:

https://boards.4channel.org/g/thread/91848262#p91850503

eigenvalue 3 years ago | |

Could already have happened in these weights. Reminds me of when the movie studios started projecting random dot patterns during movies to try to catch which theaters were leading to bootlegs. Their approach was essentially defeated by pirates sourcing multiple versions and combining them. In this case, I suspect you could add a small normally distributed random number to some random subset of the weights and it would have very little impact on performance but would corrupt any watermark beyond recognition.
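
A minimal sketch of the perturbation idea, using plain Python lists as a stand-in for real weight tensors. The function name, the fraction, and the noise scale are illustrative assumptions. The sketch does not demonstrate whether this would actually defeat a given watermark, or how much it would hurt model quality.

```python
import random


def perturb_weights(weights, fraction=0.1, sigma=1e-4, seed=0):
    """Return a copy of `weights` with Gaussian noise added to a random subset.

    fraction: share of the weights to perturb
    sigma:    standard deviation of the added noise
    """
    rng = random.Random(seed)
    noisy = list(weights)  # leave the caller's list untouched
    k = int(len(noisy) * fraction)
    for i in rng.sample(range(len(noisy)), k):
        noisy[i] += rng.gauss(0.0, sigma)
    return noisy
```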

londons_explore 3 years ago | |

Watermarking the weights is trivial.

Watermarking the output is also possible, but more complex and with a statistical success rate Vs performance tradeoff.

avisser 3 years ago | | |

I love the idea that LLMs will get watermarked in a way where you can ask them who they were built for and they just tell you.

xuhu 3 years ago | | |

If you find an AI-generated response online, and ask ChatGPT if it was the author, it says "it was probably written by a human". But we all know there is a split infinitive here, and an archaic form there, and it knows. But it won't tell us.

xg15 3 years ago |

How horrible! Is there a torrent link so I can be sure to never accidentally download it?

AustinDev 3 years ago | |

See github link in OP :p

DesiLurker 3 years ago | | |

there is a magnet link here somewhere, i think

unethical_ban 3 years ago |

Let's say I wanted to use this for... whatever. How do I do it? I bookmarked some "AI for beginners" youtube videos.

No, I'm not trolling. The jargon and the ideas around LLMs are completely foreign to me. I have no idea how they work.

turmeric_root 3 years ago | |

clone this and point the script(s) to your downloaded model files: https://github.com/facebookresearch/llama/

electrosphere 3 years ago | |

I would like to know too.

fancyfredbot 3 years ago |

OPT-175B weights are already openly available, as I understand it. Hugging Face also has openly available weights for a 176B parameter LLM called BLOOM. Is LLAMA offering something over and above these?

controversial97 3 years ago |

The torrent is 224GB total, a load of 13 to 16GB .pth files

alfalfasprout 3 years ago |

Warning: do not use this for commercial purposes. While the weights may be available now, it's a lawsuit waiting to happen if you try to use this at work.

See the original license: "a. Subject to your compliance with the Documentation and Sections 2, 3, and 5, Meta grants you a non-exclusive, worldwide, non-transferable, non-sublicensable, revocable, royalty free and limited license under Meta’s copyright interests to reproduce, distribute, and create derivative works of the Software solely for your non-commercial research purposes. The foregoing license is personal to you, and you may not assign or sublicense this License or any other rights or obligations under this License without Meta’s prior written consent; any such assignment or sublicense will be void and will automatically and immediately terminate this License."

flangola7 3 years ago | |

And where did I sign my name to that agreement?

alfalfasprout 3 years ago | | |

License agreements/terms of use usually don't require a signature. Consent is implied by downloading. That's also the case when you, e.g., clone a repo, download a file, etc.

I'm anti DRM + restrictions as much as the next guy, but I'm just trying to save folks from a bad time if Meta comes knocking after seeing corporate IPs downloading the weights.

grrowl 3 years ago | | |

This might be a bit of an assumption, but it seems likely Meta is willing to lose more on lawyers than you'd be willing to ever spend.

Retr0id 3 years ago |

There seem to be a lot of confused commenters here. This is the content of an as-yet-unmerged pull request, and presumably not something that Facebook approves of.

lopkeny12ko 3 years ago |

If the model is open source, who cares? This is good for the community; no need to go through Meta's opaque approval process.

rnosov 3 years ago |

Just to make it clear, does this torrent include model weights?

generalizations 3 years ago | |

It contains weights for all four model sizes, apparently. This definitely saves on bandwidth costs. :)

WithinReason 3 years ago | |

Folder structure for the 2 smaller models looks like this:

    LLAMA
    │   tokenizer.model
    │   tokenizer_checklist.chk
    │
    ├───13B
    │       checklist.chk
    │       consolidated.00.pth
    │       consolidated.01.pth
    │       params.json
    │
    └───7B
            checklist.chk
            consolidated.00.pth
            params.json
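
The `checklist.chk` files in the listing above look like checksum manifests. A hedged sketch of how one might verify a folder against such a manifest, assuming the file uses the `md5sum` line format (`<hex digest>  <file name>`); that format is an assumption here, and the helper names `md5_of_file` and `verify_checklist` are hypothetical. The demo runs on throwaway stand-in files, not real checkpoints.

```python
import hashlib
import tempfile
from pathlib import Path

def md5_of_file(path, chunk_size=1 << 20):
    """Hash a file in chunks so a multi-gigabyte checkpoint never sits fully in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()

def verify_checklist(folder, checklist_name="checklist.chk"):
    """Assumes md5sum-style lines. Returns {file name: True if the hash matches}."""
    folder = Path(folder)
    results = {}
    for line in (folder / checklist_name).read_text().splitlines():
        if not line.strip():
            continue
        expected, name = line.split(maxsplit=1)
        name = name.lstrip("*")  # md5sum marks binary mode with a leading '*'
        results[name] = md5_of_file(folder / name) == expected.lower()
    return results

# Demo on a temporary directory with small stand-in files.
with tempfile.TemporaryDirectory() as tmp:
    tmp = Path(tmp)
    (tmp / "params.json").write_bytes(b'{"dim": 1}')
    (tmp / "consolidated.00.pth").write_bytes(b"stand-in bytes")
    names = ("params.json", "consolidated.00.pth")
    lines = [f"{md5_of_file(tmp / n)}  {n}" for n in names]
    (tmp / "checklist.chk").write_text("\n".join(lines) + "\n")
    demo_results = verify_checklist(tmp)
    # Modify one file after the manifest was written to show a mismatch.
    (tmp / "params.json").write_bytes(b"tampered")
    tampered_results = verify_checklist(tmp)

print(demo_results)
print(tampered_results)
```

Matching a manifest only shows the files agree with that manifest; it says nothing about where the manifest itself came from.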

HarHarVeryFunny 3 years ago | | |

So what is the content of those various files? Does this include the full models themselves, or just the weights?

ComplexSystems 3 years ago |

The smallest model (7B) is supposed to outperform GPT-3.

Does anyone have any idea what hardware is needed to run this?

throwaway1851 3 years ago | |

No, the 13B model outperforms GPT-3. Judging from the metrics published in the paper, it does look like the 7B model is not far off from GPT-3 however.

JimmyRuska 3 years ago | |

Supposedly double the model size, so 14GB. An RTX 4090 might be able to handle it. You can use lambdalabs to rent a server GPU for one of the larger models.

TaylorAlexander 3 years ago | | |

I don't know if it matters but the 7B parameter checkpoint is 13.5GB in size. Someone with 24GB VRAM struggled to run it:

https://github.com/facebookresearch/llama/issues/55

coolspot 3 years ago | |

7B would require at least 14GB VRAM in 16-bit precision, 28GB in 32-bit precision.
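
As a rough sanity check on figures like these, the memory needed just to hold the weights is the parameter count times the bytes per parameter. A minimal sketch, assuming a nominal 7e9 parameters and counting weights only; activations, the attention cache, and framework overhead are ignored, so real usage will be higher. The function name `weight_memory_gb` is made up for illustration.

```python
def weight_memory_gb(n_params, bits_per_param):
    """Memory for the weights alone, in GB (10**9 bytes)."""
    return n_params * bits_per_param / 8 / 1e9

# Nominal 7B parameters at common precisions.
for bits in (8, 16, 32):
    print(f"7B at {bits}-bit: {weight_memory_gb(7e9, bits):.0f} GB")
```

This prints 7 GB, 14 GB, and 28 GB for 8-, 16-, and 32-bit weights respectively, which lines up with a roughly 13.5GB checkpoint being stored at 16 bits per parameter.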

linearalgebra45 3 years ago |

Hypothetically, what would the consequences be if I ran this on my university's computing cluster?

htrp 3 years ago | |

Either you get a nice invitation to collaborate on research with one of your uni's professors..... or you get sent to academic/disciplinary review and probably suspended for the semester.

cma 3 years ago | | |

Why? Model weights aren't copyrighted and they didn't protect it as a trade secret.

hn_20591249 3 years ago | |

Seems a valid use of resources if you have a way to vaguely associate it with some academic side-project, just don't start monetizing the output and beware the wrath of stressed out PhDs if you use too much capacity.

elcomet 3 years ago | |

None

Felminor 3 years ago |

Was to be expected.

Anyhow, I do remember a post where someone stated this would never happen, but access is just a web form asking you to describe what type of research you do.

Of course it will be leaked

popcorncowboy 3 years ago | |

Yeah, Meta must have had a plan for "when this gets leaked" because they put up only the flimsiest of foils. As per other comments, the most likely explanation is simply that they could shield themselves (and plausibly litigate with grounds) while ensuring that the model escapes into the wild to wreak its chaos against MS (OAI) and big G. This way they can see what's what from the safety of their shielded bubble and make a more informed call about changing the license to something more permissive if it looks like the strategic wins against their enemies would be worthwhile. Win win win. (Except for the leaker, that was an unfortunate own goal, they're going down).

throw14082020 3 years ago |

In case anyone was wondering, the torrent contains 219.01 GiB. More specifically, the 65B-parameter model is 121GB, the 30B-parameter model is 60.59GB, and so on.

mdaniel 3 years ago |

I was expecting it to be a newly created GitHub account, but no, seems they're willing to roll the dice on whatever the outcome is from this

Laaas 3 years ago |

What's the point of the form if it's freely accessible? This might be revolutionary in the LLM field, as Stable Diffusion was to DALL-E.

fwlr 3 years ago | |

The user who submitted the pull request is not part of Meta or Facebook Research, and the users who signed off on reviewing the changes don’t appear to be either. I highly doubt Meta will approve the pull request. The models are being distributed by torrent by someone with access to the models, not by Meta themselves as far as we know. They likely still intend to distribute via the form. This is just someone publicizing the torrent link by being cheeky on GitHub.

(As they didn’t reply to my request for the model - I specified it was for personal use and my use case was “I think it would be fun to run it on my own hardware” - I appreciate this little stunt a great deal!)

rnosov 3 years ago | |

The linked page is just a pull request; the actual repository readme doesn't mention a torrent option at all.

rvnx 3 years ago |

The most logical thing would be for Archive.org to distribute these weights

wunderland 3 years ago |

In case it was unclear, the person who submitted the pull request does not work for Facebook and is teasing them here.

KaiserPro 3 years ago |

old school opensource, which is a bit surprising from meta. I wonder how they managed to square that with legal. Someone must have been very good friends with Zuck.

LilyFrenchPants 3 years ago | |

> old school opensource, which is a bit surprising from meta

Aren't you a cheeky lad? Meta turned out lots of open-source database systems:

* RocksDB

* Hive

* Presto

* Cassandra

* Velox

LFP

KaiserPro 3 years ago | | |

I should have been more precise, I have added an additional comment.

optymizer 3 years ago | | |

and, you know... React.

papruapap 3 years ago | |

is it?[0]

The worst offender is AMZ; the rest of big tech are pretty open-source friendly.

0:https://opensource.fb.com/projects/

KaiserPro 3 years ago | |

I should be more precise:

Getting anything that could produce, look like, or smell anything like misinformation out of meta is very hard (for good reason!)

My friends have had repeated pushback for various papers because they are ML based and could be in the same room as something that could possibly be used by miscreants.

And here we have an LLM that can spit out all sorts of things that are misinformation-like.

If their department tried to launch something like Galactica they would have been slapped down and told to think again about what they were doing in life.

Manjuuu 3 years ago |

That guy does not seem to have anything to do with Facebook... interesting.

BeFlatXIII 3 years ago |

Good. Information deserves to be free.

EamonnMR 3 years ago |

Gonna be interesting to see if Facebook tries to tell people they can't use this because it's stolen (when it was presumably built using data taken without permission.)

ed 3 years ago | |

Unlike many LLMs, this was trained using public training sets (cited in their paper), to let anyone with the $$$ independently generate the weights.

politician 3 years ago |

LLMs invalidate the concept of copyright to such a degree that I find it impossible to see this torrent as theft.

CapShoyo12 3 years ago |

I'm excited, but having trouble running Llama on my local machine, has anyone managed this?

binarymax 3 years ago |

Has anyone managed to download this yet using the magnet link? Is it well seeded?

transitivebs 3 years ago |

Seeding...

speedylight 3 years ago |

Legally speaking is it a good idea to download these models this way?

EamonnMR 3 years ago | |

My compliance brain says no, but the fact that models get trained with data obtained without explicit permission makes me think that finders keepers would be the relevant case law.

Filligree 3 years ago | | |

It's not clear that the model weights can be copyrighted. But of course, I wouldn't want to be the test case.

havkom 3 years ago |

Is this warez?

marginalia_nu 3 years ago | |

Sure I'll download

TeamMysticAvengers-meta-llm-x-cars-movie-model-x-angelina-jolie-naked-xxx-2023.zip.exe.torrent

ocimbote 3 years ago | | |

Thanks for this late 1990s moment. My back stopped hurting while I was reading this :p

grog_tremor 3 years ago | | |

Just a sec, need to find the crack on astalavista

fiat_fandango 3 years ago |

I wonder if anyone is legitimately concerned that mirrored downloads might contain malicious payloads?

m3kw9 3 years ago |

Till someone puts up a site to test it

hsuduebc2 3 years ago |

Can you point me a way to run this locally please?

aghack 3 years ago |

Can this be finetuned?

KierPrev 3 years ago |

Why is Meta open sourcing its AI through torrent?

Or am I understanding it all wrong

gorbypark 3 years ago | |

It seems like the model has been leaked (not by Meta) and is being distributed via a torrent. Someone has created a PR to the repo as a joke, suggesting that instead of filling out a form and waiting to be granted access (which is the official way to get access to the model), you could just download it via the torrent.

IceWreck 3 years ago | |

They're giving it to universities for free. Someone got access and then made a PR with a link to the torrent.

rnosov 3 years ago | |

it's a pull request from ChristopherKing42. He is unlikely to be associated with Meta.

drbscl 3 years ago | |

Sending via HTTP will incur bandwidth costs. Torrents massively reduce this cost in the long run by making it P2P.

Edit: maybe in this case it's a leak though

Technotroll 3 years ago | | |

Torrent can also be really, really fast with enough seeders, even giving CDNs a run for their money.

happycube 3 years ago |

Who here didn't see this leak coming?

LoveMortuus 3 years ago |

~220 GB :O

That's quite big!

londons_explore 3 years ago | |

Needs ~200GB of graphics ram to run... Not many people will get this running!

bioemerl 3 years ago | | |

I need just two more 24GB Teslas and I can do it!

swalsh 3 years ago | | |

depends on which model you choose to run. The 7B model can reasonably run with consumer hardware.

988747 3 years ago | |

What do you mean "big"? fits on the average laptop :)

Madmallard 3 years ago |

There's pretty much no point in downloading this, right? It cannot be run with any fidelity on any consumer-grade GPU.

zoranzv 3 years ago |

Good app

progbloging 3 years ago |

cool

onetokeoverthe 3 years ago |

Good thing I've had decades of real relationships and sex.

Knew the net would probably squash print and privacy the first minute I logged into AOL.

Who knew it would breed a generation of robot loving losers?