Twitch is hacked, and its source code leaked

Twitch is hacked, and its source code leaked(kotaku.com)

556 points by goldenzun 4 years ago | 298 comments

nemothekid 4 years ago |

This is a pretty thorough and high profile hack on a major tech company - this isn't something I'd expect from an Amazon owned property. The hack (allegedly, I haven't downloaded it) includes

* Entire git histories

* Internal/Private AWS SDKs

* Encrypted Password dumps and payout reports

It's so comprehensive I'm very curious into how an attacker got that level of access. I can't think of another, large, corporate web 2.0 startup who's gotten owned in a similar fashion. Could the same attack work on Amazon? YouTube?

It's also strange that someone who has this level of access to what is presumably a multi-billion dollar company decided to just leak the data? Maybe they did try to ransom it, but I'd imagine someone with this kind of access inside Twitch must have had some creative way of making money.

madrox 4 years ago | |

There were no encrypted password dumps. No production secrets were leaked (according to the article). What's here is no more than what your average Twitch engineer has access to.

Yes, that included payout data. Anyone with "staff" access to the site (which any employee can have) has access to any streamer's dashboard, which includes payout data.

I don't think this was an attack. Based on the data so far I think it was a disgruntled engineer. Obviously if more gets leaked later I may revise that opinion.

ergerger 4 years ago | | |

I also worked for Twitch and can confirm what you're saying is true. These repo's any staff member had access to - including non-engineering staff.

Revenue for the longest time was as simple as navigating to a streamers dashboard as staff, but they did finally gate that away from staff who don't need to see that info, however I am sure there are other ways to obtain revenue reporting info.

I am assuming all data - including personal - has been compromised but so far, the data leaked is data that most staff would have access to in some way or another. Some may find that shocking, but this was not a "high level hack"

twistedpair 4 years ago | | |

So much for information compartmentalization. Does the typical engineer need access to payment details for their daily work?

ljm 4 years ago | | |

Why would an intern at Twitch have access to data in production?

Saying that no 'secrets' were leaked is effectively burying the lede.

popotamonga 4 years ago | | |

I worked for a multi billion company and even 6 month contractors had access to basically everything with little effort.

unethical_ban 4 years ago | | |

No one in IT should have access to business data. That's simply best practice. Worst case would be a database engineer who has access to backups or some prod data for troubleshooting, and even that should be under tight control with good access accounting.

weaksauce 4 years ago | | |

Could have been a hack of a twitch engineer's laptop or something like that.

syshum 4 years ago | | |

Sounds like someone in Twitch Security needs to take a course on Least Privileged Access then

63 4 years ago | |

> It's also strange that someone who has this level of access to what is presumably a multi-billion dollar company decided to just leak the data? Maybe they did try to ransom it, but I'd imagine someone with this kind of access inside Twitch must have had some creative way of making money.

Notably, the initial leak didn't actually include the password data which the leaker claims to have, just source code and payment data which has been verified by several affected streamers. It's possible that this first leak was just to establish trust so they can random or auction password hashes later.

ganoushoreilly 4 years ago | | |

Given the torrent is labeled "twitch-leaks-part-one" I'm curious too as to what they have. The torrent breaks out into a lot of compressed volumes, so it's clear this wasn't just a backup file, but a curated collection of files. I'm very curious if we will see any other amazon related leaks come from it.

Either way, I can only imagine the chaos inside as they try to figure out what has transpired here.

nemothekid 4 years ago | | |

>It's possible that this first leak was just to establish trust so they can random or auction password hashes later.

Password hashes are relatively useless though? Once the leak is announced I imagine most of the big targets will rotate their credentials. Then the next thing you need to do is spend possibly thousands in CPU time bruteforcing bcrypt hashes. Then I'm not sure what you can even do with those.

I'm not criminally creative but I imagine you could make more by abusing trust with payment processors or fraudulent invoices.

zinekeller 4 years ago | | |

Maybe that Twitch is competent in the password department so they decided against it? But thinking about it, although it's unclear if two-factor secrets are included in the leak, but maybe the two-factor secrets may be usable to someone who has already the password of a victim. Unless it's the dongle-type one (WebAuthn/FIDO), the secret is common to both the server and the user, so two-factor bypass is almost certain in this case.

mdoms 4 years ago | | |

Doesn't seem likely to me. If the attacker has password hashes then they would want to keep this attack quiet so that the buyer of the hashes would have time to compute the passwords. If Twitch gets wind of this happening then a simple password reset would foil any efforts.

skilled 4 years ago | |

I'm hoping we will get to see a transparent report (from hacker or Twitch) on how this happened.

I think anyone would be excited to hack Twitch as the site alone - or any big platform for that matter - but this is quite literally someone just downloading the entire Twitch ecosystem and publishing it online.

ergerger 4 years ago | | |

Twitch has not been known to be transparent about anything.

leros 4 years ago | |

It something I would expect security hardware to have automatically stopped. Even an employee shouldn't be able to download 125GB of stuff without flipping a safety switch somewhere.

munk-a 4 years ago | | |

Gosh - I've worked at shops where we handled multi-terabyte images and we'd regularly stream large chunks of that while debugging tools. I've also worked at places where data was king and 125GB of stuff might be a reasonable dispatch of data to help someone debug.

The volume of data is irrelevant - source code is usually teensy tiny and of far more value to companies than, say, three months of livestream chat logs.

I'm not certain what security hardware you're thinking of - but I'm pretty sure I hate it already since it doesn't effectively guard anything while making everyone's lives difficult. For effective corporate security you need 1) data use policies and 2) access control lists - both of those are generally more effectively implemented at an entirely software level.

AshamedCaptain 4 years ago | | |

Trying to protect against leaking developers/employees is like trying to protect against lone gunman terrorists: useless. And, if you try anyway, it is likely to cause more annoyance to everyone involved than actual protection (think TSA).

CobrastanJorji 4 years ago | | |

If the bulk of it is a git repo, it's probably expected that every engineer will download it regularly.

com2kid 4 years ago | | |

> Even an employee shouldn't be able to download 125GB of stuff without flipping a safety switch somewhere.

I am trying to recall, but I am pretty sure when I worked in Microsoft Office that a build would pull down many tens of gigabytes of data.

125GB in one day from the build system wouldn't be uncommon!

tptacek 4 years ago | | |

There was a fad for tools that accomplished this in enterprise networks, with much clearer rules for who needs to access what (it was called "data loss prevention", or DLP) and those tools for the most part don't work. This is a harder problem than it looks like.

outworlder 4 years ago | | |

> It something I would expect security hardware to have automatically stopped. Even an employee shouldn't be able to download 125GB of stuff without flipping a safety switch somewhere.

Remember that Twitch handles streams. Good luck implementing this without having all sorts of false alarms everywhere.

Plus, you don't have to exfiltrate 125GB in one go.

cheeze 4 years ago | | |

I feel like once you have it pulled downm, it would be as simple as an upload to s3 (which wouldn't trigger any flags), then making the bucket public whenever you want. Hell, S3 used to (still does?) support being part of a torrent swarm...

ljm 4 years ago | | |

Why would that help? They just have to accumulate work over a period of time and then 'lose' their laptop.

toomuchtodo 4 years ago | | |

That's 6.25GB/day over a 20 day working month. More time, less data per work day, harder to detect.

ABeeSea 4 years ago | | |

ML engineers / data scientists are regularly moving terabytes of data around at Amazon.

yawaworht1978 4 years ago | | |

Indeed , how could this happen, really curious.

So let's say someone with access to all GitHub repos gave the password to someone else, maybe then it was downloaded from another machine?

Or someone stole the credentials and downloaded from another machine?

Or someone got access to such a machine?

It's it not possible to prevent these cases?

How long does such a download take?

stefan_ 4 years ago | | |

Cue monorepo discussion

ArlenBales 4 years ago | |

There are so many indiscreet USB pentesting devices easily purchasable by anyone today, I'm actually surprised this sort of thing doesn't happen more often.

SketchySeaBeast 4 years ago | | |

Shouldn't that be discreet devices? Or do they make a really high pitched whine with a big flashing light when they start transferring data?

aahortwwy 4 years ago | |

ITT: people shocked that something like this could happen at a company the size and profile of Twitch.

Running security at scale in a hypergrowth B2C company is very difficult. It's also completely different from running security at a startup, in a B2B company, or a slower-growth situation. _Every_ security executive and manager I've met has given up in frustration after 12-24 months and gone to take a cushy FAANG job instead.

I'm not surprised at all. My experience in security at a larger SV unicorn was that changes only happened in the immediate aftermath of a security crisis. Otherwise, there was incredible inertia and you just wouldn't be able to get the institutional support you needed to make progress.

xwolfi 4 years ago | | |

It's funny because for me each letter of FAANG is an hypergrowth B2C company...

koolba 4 years ago | |

How much of this is a holdover of lax security practices from before they were acquired? I can’t imagine AWS being managed in a way where local network access gives you keys to the kingdom. Then again, EC2 instance profiles do let you do quite a bit.

lamontcg 4 years ago | | |

Conflating AWS security with twitch security is probably the wrong way to think about it.

Within Amazon those are almost going to be two entirely separate companies, with very different security focuses.

The idea that Amazon is monolithic and uniform wasn't true when I left there in 2006, and I'm certain it is less so now.

And that isn't just that its related to the merger, but that fundamentally its different business orgs with different focus.

this_user 4 years ago | | |

I always had the impression that Twitch were operating in a largely independent fashion. For instance, it had been an open secret for years that one of their executives had been sexually harassing female streamers. Only a year ago he was finally fired. If Amazon had a firmer grip on Twitch, I'm sure they would have stepped in much earlier.

ganoushoreilly 4 years ago | | |

If you go back to the Adobe software breach circa 2013, a large part of their issues were the bolt on connections between acquisitions. It's honestly the most common thing I see in the startup world.

slightwinder 4 years ago | |

> It's also strange that someone who has this level of access to what is presumably a multi-billion dollar company decided to just leak the data?

From what I heard about Twitch-interns over the years, it seems the company is more a third-rate-s**hole that grew too big too fast and accumulated a huge amount of technical debt and fatal security flaws. Making billions doesn't mean anything if you don't invest them back into the important corners of the company. It's considered a miracle that the platform is still working that well in that state. And what comes from the leaks so far supports this view.

Though, said that, it seems they did start to improve one or two years ago, just too late to prevent this critical hit. But considering this was also a strike that avoided the deadly parts (yet), maybe there is a different aim here and the company can grow from this? It will be interesting to see how Amazon will react to this.

superfrank 4 years ago | | |

> From what I heard about Twitch-interns over the years, it seems the company is more a third-rate-s*hole that grew too big too fast and accumulated a huge amount of technical debt and fatal security flaws.

I mean this as a genuine question, but is there any company that didn't end up like this after an exponential growth phase? I'm not saying it's okay, but this feels par for the course. I've now been at two start ups during that hockey stick growth time and both went through this as well.

I'd be curious if anyone here has worked at a large, fast growing tech company where they didn't accumulate a ton of technical debt during growth. If so, what did the company do to prevent that?

yupper32 4 years ago | |

Does anyone know if Twitch employees have two factor auth? Having access to an employee's account would be the easiest way to pull this off.

It'd be strange if they don't have two factor auth, of course, but it's just as strange to have this large of a hack.

I think if it is a simple case of an employee account takeover, then the attack would "work" to some extent at any company. Larger companies typically have strict data access requirements, though. Good luck finding the few employees who have raw access to Google password hashes, for example. And even more luck knowing how to get that data if you do.

some_furry 4 years ago | | |

> Does anyone know if Twitch employees have two factor auth?

Yes, IIRC everyone at Amazon has a hardware security key (which is more secure than the standard mobile app TOTP most of us use everywhere online).

AustinDev 4 years ago | | |

Every Twitch Developer has 2FA even 3rd party developers are required to have 2FA I also think, but don't know, that this applies to Twitch Broadcaster Partners as well in order to have their tax information in the system.

Luckily iirc from a conversation with a senior Twitch engineer the Tax information backend has been migrated to Amazon. So hopefully that did not leak... Because that would be full legal name and addresses of a ton of streamers that likely have stalkers.

gorgoiler 4 years ago | |

Facebook [2011] was pretty bad…

https://www.theguardian.com/technology/2012/feb/17/facebook-...

…except Mangham didn’t ever get to release his spoils to The Internet?

dilyevsky 4 years ago | |

> I can't think of another, large, corporate web 2.0 startup who's gotten owned in a similar fashion

Linkedin, Microsoft, Yahoo, Google

FormerBandmate 4 years ago | |

I mean, it did work on Amazon (a division with poorer security probably, but still). 4chan is a truly special place

kordlessagain 4 years ago | |

From an ethical standpoint, any code that amplifies and profits from radical speech should be fair game for release. If employees or hackers feel the need to release info in that regard, so be it. This is the risk defined in such models and should be mitigated accordingly.

heurisko 4 years ago | | |

Who decides what speech is radical enough to compromise the privacy of users?

And if speech is "radical" meaning to the point of illegality, shouldn't the legal system decide, rather than the court of public opinion?

Hokusai 4 years ago | |

> this isn't something I'd expect from an Amazon owned property

Because you expect Amazon to put security priority over new features and profit? We have very different understandings of what Amazon stands for.

nemothekid 4 years ago | | |

>Because you expect Amazon to put security priority over new features and profit?

I don't know what you think Amazon stands for, but Amazon runs the largest cloud hosting service in the world - AWS, which not only runs a large number of other large companies but governments as well. I know, first hand, that their datacenter security protocols are state of the art.

Amazon has a much larger surface attack area so if they were playing fast and loose with security, chances are we would know already.

adrusi 4 years ago | | |

EC2, Amazon's cash cow, competes with nearly identical offerings from Microsoft and Google, and is not a place where additional features are often all that valuable to customers. Any sort of breach like this on EC2 would seriously hurt Amazon's bottom line and they know it.

dolores_ab 4 years ago |

Someone actually started streaming going through the code ... on twitch.

https://www.twitch.tv/deepfrieddev

dolores_ab 4 years ago | |

They were banned for 14 days -- https://www.reddit.com/user/coder_ent/comments/q2q24x/banned...

kuroguro 4 years ago | | |

On one hand I understand why you'd ban that kind of content, on the other it's essentially public information now... what's the point.

Philip-J-Fry 4 years ago | | |

They = you. It's fine to be honest, you're not exactly making it unobvious.

CoolGuySteve 4 years ago | |

"Sorry. Unless you’ve got a time machine, that content is unavailable."

Too bad, it would be nice to see someone go through and document how Twitch works. I've never worked at "web scale" so I'd probably learn a lot.

yupper32 4 years ago | | |

> I've never worked at "web scale" so I'd probably learn a lot.

As someone who has worked at both large and small companies, you'd probably be disappointed.

mastermojo 4 years ago |

There's something about this sentence that I find hilarious:

The download was posted to 4chan today, described by its unidentified source as “part one” of “an extremely poggers leak,”

wchar_t 4 years ago | |

I find it extremely ironic that they whine about Twitch being a "disgusting cesspool"... on 4chan.

> Calling Twitch a “disgusting toxic cesspool,”

snvzz 4 years ago | | |

Ironic? Why?

jallen_dot_dev 4 years ago | |

This hack was not very xqcL of them.

_qbjt 4 years ago |

More discussion here: https://news.ycombinator.com/item?id=28770590

rasz 4 years ago |

> including its source code

This will help with ad preroll blockers.

I would love to see someone look deep into Twitch recommendation system - last time I tested the thing they call "Feedback" is a rolling buffer and wont let you exclude more than ~100 things, adding more simply removed oldest entries and started spamming you with things you already excluded in the past. This looked like performance optimization (less things to track per user).

mariusor 4 years ago | |

This won't help with preroll ads because the video segments themselves are replaced in the stream data. They're not ads, but it's not the stream either.

You get a "twitch commercial break in progress" video for the time the ads are playing.

You can check this by loading a stream with MPV.

rasz 4 years ago | | |

aaand new ad bypass dropped 4 hours ago :)

>You can check this by loading a stream with MPV

I watch all of my twitch using mplayer. "magic incantations" when generating access token is what produces ad free .m3u8. For example early methods involved setting origin and/or referrer headers to internal Amazon systems.

DavidPeiffer 4 years ago |

I'd be interested if someone could get their own instance of Twitch up and running from this leak. Someone mentioned internal API's, which would have to be reworked to avoid detection, but it'd be interesting to host it on AWS just to see how long it takes to get shut down.

How would current AWS policies hold up? Obviously the code would be illegally acquired, but do they have detection mechanisms in place?

manquer 4 years ago | |

Even with source code it is hard to run a service if not impossible. You would need well written documentation that explains various options and error codes you could potentially get.

Many times there is some magic command only one guy knows and he will share with you on slack.

Rubbing a service of any complexity takes years of institutional knowledge.

BugWatch 4 years ago | | |

Please don't rub the services, it causes unnecessary friction, and wear & tear.

ijcd 4 years ago | |

100s of services and databases to work out and sort through. Good luck building a global real-time video CDN too. You could build your own faster. Microservice architectures mirror the org that built them. You wouldn’t do it the same way for yourself.

personjerry 4 years ago |

The top streamers' earnings were also leaked: https://www.twitchearnings.com/

ChrisArchitect 4 years ago |

lots of discussion and speculation from a few hours ago here:

https://news.ycombinator.com/item?id=28770590

marto1 4 years ago |

We're just walking into a future where these kind of leaks happen every other day, aren't we ?

cblconfederate 4 years ago | |

does it matter? social networks arent some obscure technology, but making them successful is

shapefrog 4 years ago | |

We are already there it seems

luis8 4 years ago |

I wonder how often these "hacks" are just an engineer leaking the info.

fhood 4 years ago |

Hang on, is this just a repo dump or not? Because it looks like a repo dump, in which case I would be very surprised if any passwords or other personal information is included, at least at a reasonable scale.

noncoml 4 years ago |

Anybody took a peek?

What language, and framework if they use one, do they use?

blain 4 years ago | |

Here are a few screenshots of go and php: https://sizeof.cat/post/twitch-leaks/.

WARNING: do not click the link, copy it and paste it in new tab.

whimsicalism 4 years ago | | |

holler 4 years ago | |

I know the original frontend used ember.js but then they switched to react... that's about all I know :D

(twitch used to sponsor and attend local ember.js meetups)

dolores_ab 4 years ago | |

A mix of Go, Ruby, Python, Elixer from what I saw.

doctorshady 4 years ago |

Archive of the original 4chan post from this morning: https://archive.is/8rQNK

imwillofficial 4 years ago |

Is this the first time actual Amazon infrastructure has been hacked? Anyone has Amazon been hacked pervious to this? (Not talking about insecure AWS accounts)

iuri1 4 years ago |

Since the main leaked files are from github, I'm assuming they got it from one of the many reported github auth flaws which don't get fixed and allows access to private repositories. Or more unlikely, via someone getting sloppy with their laptop.

Now I wonder if the commit history has database dumps or sensitive information, which is a common practice, or if any twitch servers have been accessed through a breach or privileged information found in some of their source code.

AustinDev 4 years ago | |

I'm pretty sure a company of Twitch's size uses on-premise GitHub.

ijcd 4 years ago | | |

Yup, and AWS Code*

jmazzi 4 years ago | |

Which Github auth issues are you referring to?

frays 4 years ago |

As an avid Twitch streamer, what do I need to do to protect myself?

INTPenis 4 years ago | |

Change your password obviously, maybe even reset your 2FA if those codes are in the leak.

And if you want to be perfectly safe, don't visit twitch. Because if that source code has any vulnerabilities they might be exploited against twitch visitors as we speak.

ALittleLight 4 years ago | |

Also change any account with a password that's the same as your twitch account. Once they know your twitch password they will try it on your related accounts.

shapefrog 4 years ago | |

Report your earnings on your tax.

andrewstuart 4 years ago |

What language is the main website written in?

ijcd 4 years ago | |

Typescript/React and Go. Ruby and Ember once upon a time.

mfollert 4 years ago | |

A lot of Go and Ruby

rawoke083600 4 years ago |

at we least know their backups were 'complete' ! This hack seems to includes everything and the kitchen sink !

1vuio0pswjnm7 4 years ago |

[deleted]

jackson1442 4 years ago | |

I thought it was pretty obvious that that was a joke.

anthk 4 years ago |

From banned usernames, "Jesus".

Yep. From Mexico to the Pagonia and Iberia, let's screw a few millions of users.

runawaybottle 4 years ago |

Does it take a genius to figure out how to build twitch? It’s a modern crud app with video streaming.

kabdib 4 years ago | |

I figure you could "build a Steam" in a couple of years, with the right engineers hitting the main features. There's very little magic at the technology level, and you can make life simpler and forget about minor things like the hardware survey or the pretty graphs. I'm not saying this is trivial, but it's definitely doable.

This is a far different statement than "You can build something and compete with Steam in a couple of years". Most of the really hard problems are not technical. Success ain't gonna happen without a bunch of pain, sweat, and strategic stumbles on the part of the competition.

runawaybottle 4 years ago | | |

Sir (Madame?), I ask you one simple question:

Was Twitch built in 10 years, or over just a few?

Steam was built since I was in FUCKING high school. Im old now, well over 30.

Apples, and blueberries.

Bluebarry, Drewbarry, tomato, ToMaHtoH.

Fuck their stupid ass streaming code, it’s a giant crud app, only their devops team can take credit for scaling, everyone else is not worth a shit, sorry, thats life, I gotta Leetcode too, and ur code isn’t worth me reading it, leaked or not).

o10449366 4 years ago | |

This is such a Hacker News comment.

It's just a crud app - why do they need more than 10 employees?

runawaybottle 4 years ago | | |

inefficiency.

namrog84 4 years ago | |

A lot of the secret sauce of such things are not that secret but just take a lot of work.

Building and maintaining infrastructure simply takes a lot of people, time, relationships and whatnot.

They get good at it over time which I guess could consider some secret sauce but there isn't like some secret code that makes the whole thing way better that now you'll see tons of competitors.

ashtonkem 4 years ago | |

Everything is easy to build until a small nation state’s worth of people want to use it at once.

ThePadawan 4 years ago | | |

I work in a small nation state.

That doesn't stop CV-hungry engineers from finding ways to overcomplicate it.

(I do agree with you on this topic in general)

lwansbrough 4 years ago | |

Just stream the video, it’s easy!

kinghajj 4 years ago | | |

Netflix & Youtube in shambles.

NelsonMinar 4 years ago | |

I mean doing Youtube is even easier; it's just a wrapper around HTML5 video.

lm28469 4 years ago | |

Everything is just a crud app with a few extra steps.... yet you're not Zuckerberg or Dorsey

nullifidian 4 years ago | | |

One shouldn't aspire to be a Zuckerberg/Dorsey.

runawaybottle 4 years ago | | |

I’m so misread, Twitch is a lot of luck, so is all of these companies. Show me the the source code for luck. I don’t give a fuck if you leaked a video streaming crud app code lol.

Zababa 4 years ago | |

You don't need a genius. You need a few good people, and a lot of hands. I think the best way to look at things like Twitch is to compare them to cathedrals, bridges, things like that. You might be able to have the idea and sketch the plans by yourself, but it's physically impossible to build it yourself.

throw_m239339 4 years ago | |

Like all things web, the problem is scaling the platform and moderation/security. It wouldn't be hard to build a toy Twitch clone no. But it takes tons of people and money to scale it / secure it. And even with all the security, they still got hacked...

mdoms 4 years ago | |

https://news.ycombinator.com/item?id=9224

decebalus1 4 years ago | |

This reminds me of the Albertsons guy on Blind who inadvertently created a meme when he said that Facebook could be rewritten with a small cluster of Oracle dbs. The meme is that Albertsons people are so elite, they work and think in a higher level of existence, way above the scalability bs us commoners are accustomed to.

bradjohnson 4 years ago | |

Right, just like a plane is a car with wings.