Ask HN: Plenty of large sites down; Reddit.com, GNU.org, Discord, coincidence?

103 points by JonathanBouman 7 years ago | 84 comments

IloveHN84 7 years ago |

Is AWS/Azure/GCE down?

Remember: the cloud is someone's else computer. When it's broken, you cannot do anything

nik736 7 years ago | |

So what? I would rather have Google/Amazon employees on the issue than some random DevOps dude.

i_r7al 7 years ago | | |

The random dude is your employee and his/her mani job to get it to work and it’s first proirity. You will never be AWS/GCP/Azure first priority.

zzzcpan 7 years ago | | |

> So what? I would rather have Google/Amazon employees on the issue than some random DevOps dude.

This is fine if three nines of availability is all you need. Doesn't matter much if you prefer a big brand employee fixing things or a small brand employee. It doesn't change the outcome.

However there are a lot of things that simply cannot live with crappy three nines availability. And the only way to do better is to stop relying on any single cloud, which inevitably requires infrastructure engineers aka random devops dudes.

throwawaymath 7 years ago | | |

In fairness one "random DevOps dude" might be equally capable and less expensive for your infrastructure. Generally speaking any software company can succeed without a cloud provider's infrastructure, it's just a matter of cost and developing that competency in-house. There are many site reliability engineers who specialize in high availability and downtime resolution on baremetal hardware. StackExchange notably has this competency internally.

xd 7 years ago | | |

Well I guess the whole essence of HN is now lost... no room for some "random" startup dude to do anything that can be trusted.

apple4ever 7 years ago | | |

I wouldn’t. Hire the right person and you have immediate response instead of waiting or somebody else. A large reason we are not going cloud for our new infrastructure.

simonjgreen 7 years ago | | |

Why would you hire just one person?

zacharycohn 7 years ago | | |

person

jrs95 7 years ago | | |

This just doesn’t make sense. Google/Amazon employees basically are some random DevOps “dudes”. Whereas your own people would be...whoever you decided to hire to work on your infrastructure.

dullgiulio 7 years ago | |

Would be big news to discover that GNU.org runs on some cloud hosting ...

solarkraft 7 years ago | | |

I'd rather expect it to be a problem with some other provider. Proxy, or something.

ajeet_dhaliwal 7 years ago | | |

I’d need to see some spy shots of Stallman in an Uber, talking on an iPhone and tapping out a denial on a Surface to really believe this news was true.

ekr 7 years ago | |

GNU.org relying on a cloud provider? Isn't that going against the GNU philosophy a bit?

pigscantfly 7 years ago | |

Speculating here, but I was woken a half-hour back by an SMS from our prod monitoring system. The people at Azure had required maintenance for some instances scheduled for this morning, which I had had performed during the scheduled window over the last two weeks, but they seem to have brought down two thirds of the instances anyways. Possibly unrelated; just my two cents.

ReverseCold 7 years ago | |

Last time this happened for me it was Cloudflare's fault. Google and some other really large sites worked, but not much else.

throwawaymath 7 years ago | |

That's pretty vacuous. Everyone's computer is someone's computer. The more important point is how capable you are at managing it yourself.

What you're trying to get at is this: would you rather trust your infrastructure to a large organization whose core competency it is to do so, or would you rather manage it yourself? For many companies it makes more sense to have someone else manage it because of division of labor.

If you believe you're better suited to managing your own hardware for cost or capability reasons, you should. But of the arguments in favor of that decision, pointing out that "you cannot do anything" when GCP/AWS/Azure has downtime is a pretty poor one. It's an exceptional circumstance if you're 1) able to achieve better uptime than a cloud provider, 2) at nearly the same cost (in personnel, hardware and software), and 3) while being relatively unaffected by the downtime of major cloud providers anyway.

The companies for which the calculus shifts in favor of managing their own hardware probably don't need to be told "the cloud is just someone else's computer." In contrast, most companies using a cloud provider do not have a readily available alternative because they do not have in-house talent capable of maintaining baremetal hardware (local or colocated).

I consider myself personally capable of maintaining a baremetal distributed system with high availability, because I presently do that. But for the most part I wouldn't encourage companies using a cloud provider to invest in their own infrastructure. It's usually expensive in personnel, time or both.

asdojasdosadsa 7 years ago |

I'm not the best at interpreting this map[0] but seems that something is going on?

[0] http://www.digitalattackmap.com/#anim=1&color=0&country=ALL&...

user5994461 7 years ago | |

Looks like an ongoing DNS reflection attack from all over the world to the US.

sonofblah 7 years ago | |

What's the significance of Poland in the output? A tracking thing?

I noticed problems with Reddit earlier, too.

wildrhythms 7 years ago | | |

Where are you seeing 'Poland in the output'?

>Edit: Nevermind, I see what you mean (on the map). I'd be interested to know too... maybe PL is a big player in their attack monitoring?

mrdrozdov 7 years ago |

Looks like wunderground.com is down too. If you're wondering, high chance of thunderstorms this evening in New York, NY.

slavojastoria 7 years ago | |

I was wondering, thanks. Rain has been wild recently

CPUstring 7 years ago |

Whatever is happening, it got me out of bed instead of browsing endlessly

JonathanBouman 7 years ago |

https://status.discordapp.com states that Discord identified and resolved the problem.

noobermin 7 years ago |

I suppose I came late because only gnu.org is down of those mentioned.

gjvc 7 years ago |

defcon week

fibers 7 years ago | |

But why gnu? They seem like a static site that can only be taken down by simple ddos?

DonHopkins 7 years ago | | |

The Emacs web server had to garbage collect.

https://www.emacswiki.org/emacs/HttpServer

stephengillie 7 years ago | | |

Everyone has to start somewhere - maybe some script kiddies are at their first Defcon and saw an easy target?

aviau 7 years ago | | |

Why does the fact that the site is static make it easier to take down by a simple ddos?

I have a static website at https://alexandreviau.net/. It sits behind AWS CloudFront. Good luck taking it down.

lawnchair_larry 7 years ago | |

lol, no

digi_owl 7 years ago | |

That, and various incidents on twitter etc over the years makes me really question the professionalism of _sec...

DmenshunlAnlsis 7 years ago |

Reddit is up as of this writing, although GNU.org is down.

JonathanBouman 7 years ago | |

Pretty unstable here, it loads but all the user specific pages return 'error code: 503'

digi_owl 7 years ago | | |

And their status page shows all green...

https://reddit.statuspage.io/

zabana 7 years ago | |

Up in Europe (Paris) at 5:08 local time.

a012 7 years ago | |

I use Reddit is fun app and browsing fine

thiscatis 7 years ago |

Turkey?

batuhanicoz 7 years ago | |

I don't understand the relation/reference.

I'm Turkish and have been watching the news but I don't see any reason why someone correlates large websites being down with Turkey. With no explanation too.

Can you elaborate please? This is an honest question and I would like to know if my government is hacking foreign sites in retaliation for sanctions.

thiscatis 7 years ago | | |

Well Trump obliterated any chance of a decent lira value for the next years with his tweets and sanctions.

oneplane 7 years ago | | |

Retaliation would make sense, but I haven't dug deep in Turkeys APT crews lately. Most of the stuff I hear/read about is talking about Iranian, Russian, Chinese and roaming APT groups doing attacks. It also would make attacks from Turkish AS's more logical as the government would not likely do something about 'their own' for free.

based2 7 years ago |

https://kb.isc.org/article/AA-01639/74/CVE-2018-5740%3A-A-fl...

Twirrim 7 years ago | |

What evidence do you have to support this claim?

I could sit here and just pull out random CVEs too, with as much validity.