SSH and User-Mode IP WireGuard

369 points by BCM43 5 years ago | 102 comments

closeparen 5 years ago |

This is such an interesting marketing strategy, I had never thought of selling B2B production infrastructure under the aesthetic of, “Can you believe this shit actually works?”

tptacek 5 years ago | |

Yes! This person gets it.

xbar 5 years ago | | |

I'm a fan of the writing style. It reminds of smart people I know. I haven't bought any fly.io yet, so I don't know that I'm you're target market. Still--well said, repeatedly.

omribahumi 5 years ago | | |

Have you considered using ssh command's ProxyCommand option? It allows you to replace the TCP transport with communication over stdin/stdout.

It could help you replace the TUN with something more cross platform, and possibly with less overhead. You can pass in the hostname using %h, so you can even have virtual DNS.

ryanmarsh 5 years ago | |

This is going to be a great tweet for me and I will totally not give you credit.

majke 5 years ago |

This sounds very similar to my slirpnetstack, which is using gvisor netstack to do, which I call translating L3 (packets) into L7 (userspace syscalls like connect()):

https://github.com/majek/slirpnetstack/

(btw, gvisor netstack, while not without problems, is likely to be faster than libslirp, see benchmarks https://github.com/rootless-containers/rootlesskit/pull/101#... )

tptacek 5 years ago | |

It is extremely similar, and thanks for posting this, I had no idea.

tptacek 5 years ago |

I added some example code to the post, because, again, I kind of can't get over how easy this turns out to be. And if you follow the link into Jason's `wireguard-go` code, until you hit gVisor itself, it's not much more complicated under the hood.

Having complete control of TCP/IP in userland like this, with so little code, is so valuable I feel like there needs to be some special name for the technique.

The whole thing is kind of a vindication for Go's standard library network interface, which I have always hated.

ignoramous 5 years ago | |

> Having complete control of TCP/IP in userland like this, with so little code, is so valuable I feel like there needs to be some special name for the technique.

Yes! Userspace TCP/IP is how we implement firewall for Androids (which don't expose iptables on non-root devices but let you setup TUN interfaces via VPN APIs). Right now, we rely on LwIP (wrapped in golang) and it has worked wonderfully well; especially since it is light-weight without any locking-overheads (single-threaded) and that bodes well for battery-powered devices.

> The whole thing is kind of a vindication for Go's standard library network interface, which I have always hated.

The Fuchsia team at Google is re-implementing netstack3 in Rust (and hence you're probably right to call it "gVisor netstack") due to what I presume are performance and efficiency reasons (which is of interest to us because we develop for smartphones). Of course, flyctl doesn't need that, but since you wrote about pulling in heavy dependencies, I am interested in your take on it.

anderspitman 5 years ago | | |

Don't want to go OT but I'm super curious what your experience developing a network application for non-root Android devices has been?

As a non-Android developer, I've been working on a project the last few months that involves running an HTTP server on the device and tunneling out so it can receive requests from the outside world, and the platform feels nerfed at every level from filesystem access to keeping your server from being battery-killed.

prattmic 5 years ago | |

This is awesome! In the post you mention "For a couple hundred lines of code (not counting the entire user-mode Linux you’ll be pulling in from gVisor, HEY! Dependencies! What are you gonna do!) ..."

I'll note that while all of gVisor's user-mode Linux is in the same Go module, we've actually gone to decent lengths to keep the network stack logically separate from the rest of the user-mode Linux code.

So while go.sum might look a bit frightening, Brad's depaware shows that the extra code you pull in to binaries by using netstack is actually quite minimal: https://github.com/tailscale/tailscale/commit/5aa5db89d6a9a6....

ignoramous 5 years ago | | |

Wait... What is depaware? How do I use it to make sense of go.sum in my projects?

jeffbee 5 years ago | |

I hope people can mentally generalize this enthusiasm for user-mode wireguard in order to understand the value proposition of QUIC.

tptacek 5 years ago | | |

QUIC was my plan B for this feature. :)

anderspitman 5 years ago | |

> The whole thing is kind of a vindication for Go's standard library network interface, which I have always hated.

Curious about this. I've generally found Go's net libs to be pretty pleasant. Can you compare/contrast it with others you like better?

tptacek 5 years ago | | |

I'm just a 1990s BSD sockets, write-my-own-select-loop kind of programmer; the idea of an abstract `Dial` interface always seemed like just a performative Plan-9-ism (I assume?).

Anyways. Wrong about that one! Movin' on!

mwcampbell 5 years ago | |

> Having complete control of TCP/IP in userland like this, with so little code, is so valuable I feel like there needs to be some special name for the technique.

Many years ago, when we could take always-on desktop PCs more or less for granted, I developed a product that let the user connect back to their home PC from another PC, to stream music from home or grab a file (this was also pre-Dropbox). NAT was already ubiquitous by this point, and Windows XP SP2 (first version with Windows Firewall) came out that year, so I knew it couldn't just make a direct TCP connection to the user's home PC. So I did a stupid relay implementation, where both the client and the home server (that's what we actually called the tray applet on the home PC) made outgoing TCP connections to our central server, which would relay packets back and forth. If I'd had access to a TCP-in-userspace thing like the gvisor network stack, I could have run TCP end-to-end, the way it's meant to be used. It almost makes me want to reimplement that old system using Go and WireGuard, even though the functionality is basically irrelevant in today's world.

bluesign 5 years ago | |

Why not something like:

ssh dogmatic-potato-342@jump.fly.io

And tunnel connection over wireguard on jump server

tptacek 5 years ago | | |

Because then there would be some service exposed to the Internet (not over WireGuard; if you have WireGuard, you don't need a jump box) whose job it would be to hop 6PN networks. The only thing we have in our infra now that controls access to 6PN is eBPF code; we keep the system simple so we can reason about it.

azalemeth 5 years ago |

I know it's not really an HN thing to say, but this is just cool. Reverse ssh tunnels on Wireguard through my VPN are cool enough; the amount of magic here (albeit I think perhaps not totally strictly required magic…) is definitely interesting++.

mrkurt 5 years ago | |

It should be an HN thing to say. Unrestrained positivity is much more fun than kneejerk cynicism. :D

willis936 5 years ago | |

This morning I set up openSSH + wsl + minicom to configure my router over LAN. Why? Because I could. Computers are fun.

Also, it saves moving cables around when I break stuff.

vlmutolo 5 years ago |

> Normally, this big balloon thingy would be an elaborate scheme to get you to check out our product, but here it's just pointing out some new source code we haven't talked about elsewhere.

I really enjoy this style of writing from a company.

Regarding the article, it seems like Fly has pulled off some insane networking nonsense, but I don’t know enough about networking yet to understand it. Saving this page for later and gonna get back to the TCP/IP Guide.

ignoramous 5 years ago | |

> Regarding the article, it seems like Fly has pulled off some insane networking nonsense

Fly is essentially building a Tailscale-esque infrastructure to service one part of their cloud offering. It is indeed insane the amount of heavy-lifting they do to make it all work. They seem like a cross between packetfabric, gitops, docker, and hashicorp but with way less engineers on the team.

brianm 5 years ago | | |

The technical heavy lift is rarely the success determinant, so having a company implement half-baked (enough for internal use, but without the edges polished off that are needed to support it with external customers) versions of N related (but not yet mature) technologies is pretty normal (if they are full of good engineers) and getting advantage from it.

Most of the time these implementations are too tightly tied to the rest of the company's infra to be useful standalone. When one of those companies succeeds a common pattern is for engineers to cash out, leave, and build a new startup around one technology from the success story.

I would not be surprised if this is one of the forces that drives the consumer -> infra -> consumer -> infra cycle. A consumer wave leads to inventing lots of interesting but bespoke infra while it is growing like crazy. When it plateaus, folks spin out the interesting infra bits until the next consumer wave (generally larger) starts rising.

mrkurt 5 years ago | | |

To be fair, what Tailscale is doing is much harder than our private networking. They have to deal with NAT, mobile OSes, etc.

We mostly just try to pick the right primitives. And frequently get that wrong. Like that time we wrote our own JS runtime ...

anderspitman 5 years ago |

This is fantastic. I maintain a list[0] of tunneling software. One of the few downsides of WireGuard is the inability to run it in unprivileged situations. The complexity and performance overhead here might still be too much to edge out solutions like SSH tunnels, but I love that the space is being explored.

I'm hopeful we'll also see some robust QUIC-based tunneling tools over the next couple years.

[0]: https://github.com/anderspitman/awesome-tunneling

benjaminl 5 years ago | |

With the coming ubiquity of QUIC, its seems natural to have a QUIC based analog to OpenVPN using packet based QUIC instead of OpenVPN’s UDP/TLS.

It also seems rather obvious to extend WireGuard to run over QUIC in addition to UDP. But the movement on that front has been very limited.

russdill 5 years ago | |

tunsocks[0] might be of interest to you. It's very similar to the software mentioned by OP except in C. It uses the lwIP usermode tcp/ip stack. It doesn't itself have any VPN or tunneling support, but instead relies on raw packets being passed into and out of a pipe. It can then provide access to that network via various proxies, port forwards, and even raw packets via NAT (very useful for VMs).

[0]: https://github.com/russdill/tunsocks

ptomato 5 years ago |

Not having been previously familiar with fly's network setup, I gotta say I find it delightful; derived-prefix IPv6 + WG to give you basically static routing + ability to auth on IP is very elegant. I've actually been working on a toy stupid-simple clustering thing that does something similar, and I'm absolutely going to steal the userspace tcp stack over wireguard thing for API access.

chrisweekly 5 years ago |

Amazing. The client API is profoundly simple.

Also, this post prompted me to look closer at Fly.io, and it's leapfrogged to the top of my shortlist for an imminent client "edge proxy" project.

mrkurt 5 years ago | |

I love proxies and think all problems should be solved with proxies. Which means – if you give Fly.io a try and need any help, you should let me know!

chrisweekly 5 years ago | | |

Right on, MrKurt - will do!

smithclay 5 years ago |

Hacking stuff together using a userspace networking stack is an incredibly fun side project and significantly easier with the gVisor networking libraries written in Go.

Last year I implemented TCP/IP over AWS Cloudwatch. Tons of "can you believe that actually works?" stuff possible with it:

https://medium.com/clog/tcp-ip-over-amazon-cloudwatch-logs-c...

CyberRabbi 5 years ago |

Running networking stacks in user mode really opens up a lot of interesting solutions. Wireguard is sort of an enabling technology for this.

Just realized this was written by security guru tptacek, nice. What is the contextual meaning of “AFFIANT SAYS NOTHING FURTHER.”?

vdqtp3 5 years ago | |

> What is the contextual meaning of “AFFIANT SAYS NOTHING FURTHER.”?

"That's all, folks"

CyberRabbi 5 years ago | | |

Oh so it’s supposed to be a bio line that has no bio? I would assume one would just leave it out if they had nothing to say.

abrookewood 5 years ago |

Man, some people are just next level productive: "How hard could it be to put together a tiny user-mode TCP, just for the purposes of doing pure-userland WireGuard networking, so people could SSH into instances on Fly without installing WireGuard? I made the mistake of musing about this on a Slack channel I share with Jason Donenfeld. I mused about it just before I went to bed. I woke up. Jason had implemented it, using gVisor, and made it part of the WireGuard library."

bluesign 5 years ago |

Is this super complex infrastructure for fairly simple thing or am I missing something?

tptacek 5 years ago | |

No, it is extra-super complex infrastructure for a fairly simple thing.

Normal SSH still works, and is usually going to be what people end up using. You just have to have WireGuard installed and running.

The product feature here is less interesting than how we did it.

Panino 5 years ago | | |

Please keep writing about WireGuard. If it wasn't already magical enough for its stated purpose (VPN), maybe the "truly" interesting thing is how it can enable tech that wasn't previously envisioned. After using WireGuard for a couple years I'm still excited about it because I feel like I've only glimpsed a small piece of the things that can be done with it.

jrockway 5 years ago | | |

It's not clear to me how much day-to-day use of Wireguard being a Fly customer requires, but I can't help but wonder if you guys should collaborate with Tailscale to make all of the micro-VMs appear on a Tailscale network, and authorize access between humans and the VMs that way.

(I admit that I haven't looked much into mesh networking / edge servers, so I don't know what the problems are. I always preferred Internet -> Identity Aware Proxy type thing -> mTLS mesh that is useless to humans. And, I don't ssh to stuff much anymore... I have my software collect debugging information and send it to something I can access through a browser or API, and control that software through an API. So everything is editing config files, basically, not SSHing places ;)

rileymichael 5 years ago |

Pretty cool write up. It mentions that every host is running a DNS server that instances have access to, which is being utilized to store the public key (neat!)... is there any way for customers to consume this for other purposes, say out of the box service (instance) discovery?

tptacek 5 years ago | |

Yes; the original purpose of private DNS at Fly was for service discovery. `your-app.internal` is the AAAA's of every instance for your-app; `nrt.your-app.internal` every instance in Japan, `aws-rds-1._peer.internal` is AAAA for the other side of a WireGuard gateway you created to bridge your apps to an RDS database, etc.

tucif 5 years ago | | |

When you say "the public key for that root certificate is hosted in our private DNS", does that mean the public key is in.. a txt record?

tarasglek 5 years ago |

The prefix for ssh command looks good for commandline. However, is there a way to hide with some settings in .ssh/config so one can have normal-looking "ssh host" cmdline without special prefixes?

mrkurt 5 years ago | |

"flyctl ssh issue" will get you all setup for normal ssh access, and even store credentials in an agent if you have one running.

im3w1l 5 years ago |

I read this but I didn't get it at all. I can't see the forest for all the excited talk about particular trees. In simple words, what problem are they trying to solve?

tptacek 5 years ago | |

You need WireGuard to SSH to machines at Fly (that's a good thing). You don't have WireGuard installed on a particular machine. That's OK, because there's a portable, userland, Golang implementation of not only WireGuard but all of TCP/IP that can be imported into any Go program. Go programs can BYO network stacks. That's crazy. The end.

ash 5 years ago | | |

Why is userland TCP/IP stack needed? I didn't get this part of the story.

londons_explore 5 years ago | | |

Still seems like a downgrade for actual users... I just want to be able to type ssh instance7.service.zone.user.fly.io into my console, and be connected... I don't actually care about compiling my own custom ssh client written in go, however neat its implementation might be...

mrkurt 5 years ago | |

We are a hosting company. Customer apps run in isolated private networks. We let them connect to these private networks with WireGuard. Customers _also_ want to do things like "launch a console", so we give them a mechanism for SSHing into their running containers over their private network (6PN).

WireGuard is dead simple, but setting it up is extra cognitive friction if you've never dealt with it before (or if you're in an environment where you can't create a network interface). Jason Donenfield did some magic with a Google user space networking stack that lets us "hide" the wireguard component. People using our CLI will soon be able to connect to their private network + SSH into a container with one command.

Basically, WireGuard is cool and being able to connect into a wireguard network from a userland program is really helpful for building a straightforward UX.

kerng 5 years ago |

Isn't this bad for privacy? Encoding app, org and such information in IP address?

tptacek 5 years ago | |

No. These are internal IPv6 addresses in the ULA space. They're a part of our network fabric, not something the Internet sees.

spockz 5 years ago |

> We take Docker-type containers from users and transmogrify them into Firecracker micro-VMs

What is the relationship with micro kernels? Is the feature available separate from the deployment/hosting?

tptacek 5 years ago | |

None. A Firecracker micro-vm is just a very small, very quick-to-start-up VM. It uses KVM, eliminates the BIOS, and implements only the minimal devices needed to boot and run server Linux. Amazon built the project for Lambda and Fargate. More about it here:

https://fly.io/blog/sandboxing-and-workload-isolation/

atonse 5 years ago | | |

Is this sort of like what MS is doing with Windows Subsystem for Linux, where they're able to "boot" that Linux in mere seconds?

By the way, as an elixir developer Fly.io looks extremely cool. But my (mostly public sector) customers want to hear something similar to the words "AWS" when asked about hosting – so is it running on top of AWS or Azure or GCP? (instances look like they may be GCP, which is fine too).

0xbadcafebee 5 years ago |

> I’ve written a bunch about private networking at Fly. Long story short: it’s like a simpler, IPv6 version of GCP or AWS “Virtual Private Clouds”; we call it “6PN”. When an app instance (a Firecracker micro-VM) is started at Fly, we assign it a special IPv6 prefix; the prefix encodes the app’s ID, the ID of its organization, and an identifier for the Fly hardware it’s running on. We use a tiny bit of eBPF code to statically route those IPv6 packets along our internal WireGuard mesh, and to make sure that customers can’t hop into different organizations.

My first thought was "Wow, can we make this _more_ complicated please?", and then I read the rest of the post.

I hate technology.

sdevonoes 5 years ago |

Sounds very cool and all but at the same time it sounds like a terrible thing to maintain in the future.

Perhaps it's just me, but this is something I would accept as a "hey, I was bored and worked on something on my free time. It's probably broken but nobody cares because it's a toy thing, but it's sooo cool". I wouldn't accept it as " Fly.io OKR 1.3 (2021): SSH and User-mode iP WireGuard"... it's sounds pretty much like a hack.

tptacek 5 years ago | |

This is called "coming to grips with the insanity that is gVisor, the Docker runtime for GKE that is also inexplicably just a Go import". I feel your pain.

Wait until I find a reason to put a whole virtual memory manager into `flyctl`. I'll probably knock out a whole bunch of MBOs that way, and gVisor has me covered.

tasssko 5 years ago |

Nice work, I love WireGuard, what it needs is more recognition and definitely more integrations like this.

boundlessdreamz 5 years ago |

Off-topic: What's the software used for the blog?

michaeldwan 5 years ago | |

Markdown + middleman. We use it for docs and landing pages in addition to the blog.

resoluteteeth 5 years ago |

Maybe you should put this up on github so everyone can use it rather than just talking about how easy it is?

mrkurt 5 years ago | |

The link to the source code is in the middle of the article, jeez: https://github.com/superfly/flyctl/pull/368

devwastaken 5 years ago | |

I believe all informational blogs/guides should backup to a markdown file on GitHub or other. Over time playing with technologies I've found a lot of dead links to personal websites. This is of course because maintaining your own hosting can be cumbersome, domain name expirey, etc. Some valuable information gets lost with only waybackmachine to save the day. Someday wayback may no longer exist though.

177tcca 5 years ago | | |

Same with GitHub.

Any Git opening will do, for private cloning.