GoBGP: BGP Implemented in Go

GoBGP: BGP Implemented in Go (github.com)

168 points by ryancox 10 years ago | 126 comments

tptacek 10 years ago |

This is the sort of thing Go really shines on: network and infrastructure services that would ordinarily be provided by big ugly C programs, where the latency requirements are significant but not as bad as raw packet forwarding. If your current best alternative is a C program, I'm not sure why you wouldn't seriously consider replacing any of the following with Go (or Rust) programs:

* Authoritative DNS

* DNS caches

* NTP

* SMTP

* SSH

* IMAP (added later)

* SNMP

* PBX/Telephony

Fortunately, as time goes on, fewer and fewer people need to run these services at all.

dsr_ 10 years ago | |

...well, because of the primary shortcoming of the Go system: lack of standard infrastructure to manage the inevitable updates.

Let's say I have replaced bind, unbound, ntpd, postfix, openssh, dovecot, snmpd and asterisk with Go-written equivalents. Three weeks later, there is a bug found in the standard Go TLS library.

My distro ships all the packages noted above, but not their Go-equivalents, so my work load now includes monitoring security-announce lists for eight different products, where before I monitored the security-announce list for my distro.

I need to be able to rebuild all eight systems myself, rather than getting automatic package updates to my test systems, and then promoting the packages through alpha and then production. Go is nicer than some other languages about that, but it builds binaries, not packages.

I'm pretty sure you can't build an snmpd without ASN.1 parsing, and ASN.1 parsing is the very model of a fraught and perilous splatter-fest. Will the Go ASN.1 parser be better maintained than libtasn1? Maybe, maybe not. Repeat this for everything else.

Can these problems be solved? Sure. Are they ready right now? Not that I'm aware of. Please enlighten me, if you have good answers.

tptacek 10 years ago | | |

No. I've written SNMP from scratch in Ruby, Python, and C++. DER ASN.1 for X.509 might be treacherous (simply in the sense that any mistake you make at all will be ruinous), but that's just not the case for SNMP's BER.

The whole point of using Rust or Go instead of C is that the "peril" of implementing things like ASN.1/BER is pretty much eliminated.
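To make the point about BER concrete, here is a minimal sketch of a tag-length-value decoder in Go. It is not taken from any SNMP library: `ParseTLV`, `TLV`, and the two error values are hypothetical names, and the sketch assumes single-byte tags and definite lengths of at most three length octets. What it illustrates is that a length field pointing past the end of the buffer turns into an ordinary error (or, if a check were forgotten, a runtime panic) rather than an out-of-bounds read.

```go
package main

import "errors"

// TLV is one decoded BER element: a tag byte and its content bytes.
type TLV struct {
	Tag   byte
	Value []byte
}

// Hypothetical error values for this sketch.
var (
	ErrTruncated   = errors.New("ber: truncated input")
	ErrUnsupported = errors.New("ber: unsupported tag or length form")
)

// ParseTLV decodes a single BER element from the front of buf and
// returns it together with the bytes that follow it.
func ParseTLV(buf []byte) (TLV, []byte, error) {
	if len(buf) < 2 {
		return TLV{}, nil, ErrTruncated
	}
	tag := buf[0]
	if tag&0x1f == 0x1f {
		// High-tag-number form (multi-byte tag) is left out of this sketch.
		return TLV{}, nil, ErrUnsupported
	}
	first := buf[1]
	rest := buf[2:]
	length := 0
	switch {
	case first < 0x80:
		// Short form: the byte is the length itself.
		length = int(first)
	case first == 0x80:
		// Indefinite length is left out of this sketch.
		return TLV{}, nil, ErrUnsupported
	default:
		// Long form: the low 7 bits give the number of length octets.
		n := int(first & 0x7f)
		if n > 3 {
			return TLV{}, nil, ErrUnsupported
		}
		if len(rest) < n {
			return TLV{}, nil, ErrTruncated
		}
		for _, b := range rest[:n] {
			length = length<<8 | int(b)
		}
		rest = rest[n:]
	}
	if length > len(rest) {
		// The claimed length runs past the buffer: report it, never read it.
		return TLV{}, nil, ErrTruncated
	}
	return TLV{Tag: tag, Value: rest[:length]}, rest[length:], nil
}
```

Feeding it `[]byte{0x04, 0x82, 0x01, 0x00, 0x41}` (an OCTET STRING claiming 256 bytes with only one present) returns `ErrTruncated` instead of touching memory beyond the slice.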

As for your former point: I don't follow. Go's deployment infrastructure is a superset of C's, and, if you're a masochist, almost everything in C's deployment toolkit is available to Go projects as well.

noja 10 years ago | | |

So you install the Go equivalents from your distro. If your distro doesn't have them, you vote for them to be added (or go through whatever process your distro has).

tobz 10 years ago | | |

Having the shared library that can be replaced by itself, instead of deploying N things, is nice. Not sure it's the end of the world, but it's nice.

On the flipside, though, this is probably a case where using something like Rust could be excellent: stronger language support for eliminating entire classes of bugs but also able to be compiled down to something that can be a shared library.

I'm not an ML expert, but reading about the things the Mirage project has done related to TLS, leveraging OCaml to provide compile-time guarantees, sounds very exciting. Rust feels like it could be that bridge. Start rewriting core libraries in it: cleaning up old code, removing dead code, avoiding bugs by virtue of the language/compiler, in one fell swoop.

IshKebab 10 years ago | | |

> Let's say I have replaced bind, unbound, ntpd, postfix, openssh, dovecot, snmpd and asterisk with Go-written equivalents. Three weeks later, there is a bug found in the standard Go TLS library.

> My distro ships all the packages noted above, but not their Go-equivalents, so my work load now includes monitoring security-announce lists for eight different products, where before I monitored the security-announce list for my distro.

But say your distro includes the Go versions, but not the C ones... You're basically complaining that your distro doesn't include everything.

cstrahan 10 years ago | | |

Regarding packaging, I mostly agree. However, I would highly recommend looking into the Nix package manager and our packages (and packaging infrastructure). Updating one of our Go packages is sufficient for all other Go packages to be built with the new version. So that would mostly solve the problem you're talking about.

https://github.com/NixOS/nixpkgs/blob/master/pkgs/top-level/...

(Sorry for being light on details; typing from phone)

traverseda 10 years ago | | |

https://github.com/whyrusleeping/gx is pretty cool.

You could handle it the same way that Python handles it. Don't rely on a distro; have each person package their own stuff.

unethical_ban 10 years ago | |

> Fortunately, as time goes on, fewer and fewer people need to run these services at all.

I'm not sure what you mean here. Are you suggesting it's a positive that the Internet is becoming centralized? Why is it "good" or "bad" that lots of people run SSH, their own email, or their own phone server?

Obviously one part is that it's hard to run them correctly, but I would argue the internet may be a better, more authority-resistant mechanism if some of these services were run from each person's home or VM rather than on Google's platform.

d33 10 years ago | | |

I would say that centralisation isn't so much the problem; the biggest worry is that our core protocols are terribly insecure:

https://security.stackexchange.com/questions/56069/what-secu...

15155 10 years ago | |

Re: the "(or Rust)" comment:

All of these services would be better suited for a language which has blessed async IO, concurrency, parallelism primitives. Go has these, Rust does not. pthreads are not the answer.

I much prefer Rust, but Go's stdlib and concurrency features far exceed those of Rust at the moment.

pcwalton 10 years ago | | |

> All of these services would be better suited for a language which has blessed async IO, concurrency, parallelism primitives. Go has these, Rust does not. pthreads are not the answer.

Go does not have truly async I/O. It has a userspace (M:N) implementation of threaded, synchronous I/O. There is a distinction, and it's not an academic one.

Rust uses the kernel-level (1:1) implementation of threaded, synchronous I/O, with async I/O provided by mio if you want it.

There is no meaningful distinction between Go's language-level primitive channels and Rust's MPSC channels in the standard library. Not supporting generics is a good reason to put channels directly in the language, but that doesn't apply to Rust.
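For readers who have not used either, the multi-producer, single-consumer pattern being compared looks roughly like this on the Go side. This is only an illustrative sketch; `SumFromProducers` is a made-up name, and the Rust equivalent would use `std::sync::mpsc` in much the same shape.

```go
package main

import "sync"

// SumFromProducers starts n producer goroutines that each send the
// values 1..perProducer on one shared channel. The caller acts as the
// single consumer and returns the sum of everything received.
func SumFromProducers(n, perProducer int) int {
	ch := make(chan int)
	var wg sync.WaitGroup
	for p := 0; p < n; p++ {
		wg.Add(1)
		go func() {
			defer wg.Done()
			for v := 1; v <= perProducer; v++ {
				ch <- v
			}
		}()
	}
	// Close the channel once every producer has finished, so the
	// consumer's range loop below terminates.
	go func() {
		wg.Wait()
		close(ch)
	}()
	total := 0
	for v := range ch {
		total += v
	}
	return total
}
```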

Additionally, I don't see how Go provides any parallelism primitives that Rust doesn't. In fact, Rust's parallelism (particularly data parallelism) libraries far exceed the capabilities of Go's, mostly because of SIMD and generics which allow you to build highly optimized data parallel abstractions. If I tried to parallelize the project I work on in Rust using Go's built-in primitives, it would be far slower than sequential.

steveklabnik 10 years ago | | |

In general, almost every stdlib will be "better" than Rust's, as we're taking an "anti-batteries included" approach.

And while Go does have great built-in stuff for a certain kind of concurrency, Rust's approach is more flexible and safer. It's a tradeoff, not a "far exceed" in my mind.

im_down_w_otp 10 years ago | | |

You want ponylang.org

Concurrency & parallelism primitives + safety.

JoachimSchipper 10 years ago | |

Because some C programs are pretty excellent, and rewrites cause bugs? E.g. replacing OpenNTPd, Postfix, OpenSSH - or djbdns or qmail - by a rewrite probably doesn't reduce the number of problems.

(In particular, note that SSH daemons can fail in many ways other than by remote code execution.)

In the long run, C's role is indeed shrinking - but let's not be too hasty.

tptacek 10 years ago | | |

For a full-featured SSH running on a machine I was likely to log into and work interactively on, I'd prefer OpenSSH.

But most of what people do with SSH in a devops context isn't interactive; it's a simple control channel for well-defined sequences of file transfers and commands.

I'd prefer a minimal, Go/Rust-based SSH server for my EC2 servers, for instance.

I don't know why I'd prefer OpenNTPD to a Go/Rust NTP. What's the advantage to it? OpenNTPD is carefully built to avoid a class of bugs that its implementation language is very susceptible to. Go/Rust simply don't have those bugs at all. The latter seems like the safer option.

Same goes for DNS.

nickpsecurity 10 years ago | | |

You can always do it close to original implementation. A straight-up clone. Should preserve at least most of the logical-level countermeasures.

zzzcpan 10 years ago | |

Go is missing a few things for this. There is no good predictable event-driven polling library for networking with proper error handling and no GC pressure (no heap allocations), etc. And it has to implement its own syscall wrappers, because Go's syscall wrappers on non-blocking FDs call into the scheduler and even produce garbage on some errors. TLS library needs to be predictable too, produce no garbage and play well with polling.

Doing it the idiomatic way and dealing with the whole goroutine-per-request model, concurrent memory access and unpredictable GC pauses is simply not worth it. It's going to be safer, but not of a decent quality. Better to live with what we have.

For Rust, I imagine, it's going to take even more work.

tptacek 10 years ago | | |

There's no good predictable event-driven polling library for networking because the whole runtime is a good predictable event-driven polling library for networking. You're not supposed to "event" Go I/O.

Virtually every Go program that anyone has deployed at scale has scaled I/O with goroutines (though not necessarily with "concurrent memory access").

With the exception of NTP, I can't see a single example of a service in the list I provided that is sensitive to "GC pauses" on the scale you'd end up with in an idiomatic Go program.
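As a rough illustration of what "scaling I/O with goroutines" means in practice, here is a sketch of a goroutine-per-connection echo server. The function names are hypothetical and the loopback round trip in `EchoOnce` exists only to exercise the server. Each connection handler is written as plain blocking code; when a read or write would block, the runtime parks that goroutine and resumes it when the socket is ready, so no explicit event loop appears in the program.

```go
package main

import (
	"io"
	"net"
)

// ServeEcho accepts connections until the listener is closed and hands
// each one to its own goroutine.
func ServeEcho(ln net.Listener) {
	for {
		conn, err := ln.Accept()
		if err != nil {
			return // listener closed or failed; stop accepting
		}
		go func(c net.Conn) {
			defer c.Close()
			// Copy everything the client sends straight back to it,
			// until the client closes its side.
			io.Copy(c, c)
		}(conn)
	}
}

// EchoOnce starts an echo server on an ephemeral loopback port, sends
// msg to it, and returns the reply.
func EchoOnce(msg string) (string, error) {
	ln, err := net.Listen("tcp", "127.0.0.1:0")
	if err != nil {
		return "", err
	}
	defer ln.Close()
	go ServeEcho(ln)

	conn, err := net.Dial("tcp", ln.Addr().String())
	if err != nil {
		return "", err
	}
	defer conn.Close()
	if _, err := conn.Write([]byte(msg)); err != nil {
		return "", err
	}
	buf := make([]byte, len(msg))
	if _, err := io.ReadFull(conn, buf); err != nil {
		return "", err
	}
	return string(buf), nil
}
```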

mikecb 10 years ago | | |

Some think it's very appropriate for high performance networking: https://github.com/google/stenographer

Granted, that uses pfring pretty substantially, but still...

kev009 10 years ago | |

You will be doing battle with the GC in many of these applications. Yes, even with 1.5+. 10 ms is an eternity to spend doing no work.

tptacek 10 years ago | | |

The only service in this list that I can see the GC mattering for is NTP, but then, you can design a tight NTP server that virtually eliminates the GC's work in Go, taking advantage of the rest of Go's high-level features that C lacks.
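One way such a design might look, as a sketch only: keep the request and reply in fixed-size arrays owned by the caller and fill the reply in place, so the per-packet path performs no heap allocation for the garbage collector to track. `ToNTP` and `FillReply` are hypothetical names, the stratum value is a placeholder, and a real server would also fill in the precision, root delay, and reference fields.

```go
package main

import "encoding/binary"

// Seconds between the NTP epoch (1900-01-01) and the Unix epoch (1970-01-01).
const ntpEpochOffset = 2208988800

// ToNTP converts Unix seconds plus nanoseconds to the 64-bit NTP
// timestamp format: seconds since 1900 in the high 32 bits and a
// binary fraction of a second in the low 32 bits.
func ToNTP(unixSec, nsec int64) uint64 {
	sec := uint64(unixSec + ntpEpochOffset)
	frac := (uint64(nsec) << 32) / 1000000000
	return sec<<32 | frac
}

// FillReply writes a server-mode reply for req into resp. recv and xmit
// are NTP timestamps for when the request arrived and when the reply is
// sent. Both packets are 48-byte arrays supplied by the caller, so this
// function allocates nothing.
func FillReply(resp, req *[48]byte, recv, xmit uint64) {
	*resp = [48]byte{}
	version := (req[0] >> 3) & 0x7
	resp[0] = version<<3 | 4 // leap indicator 0, client's version, mode 4 (server)
	resp[1] = 1              // stratum: placeholder value for this sketch
	// Origin timestamp is a copy of the client's transmit timestamp.
	copy(resp[24:32], req[40:48])
	binary.BigEndian.PutUint64(resp[32:40], recv)
	binary.BigEndian.PutUint64(resp[40:48], xmit)
}
```

A serving loop would reuse the same two arrays for every packet it reads and writes; whether the socket calls themselves allocate depends on which `net` methods are used.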

infogulch 10 years ago | | |

From 1.5 to 1.6 @brianhatfield saw pauses in an 8GB heap & 150M allocs/min go from 40ms to ~3ms [0].

[0]: https://twitter.com/brianhatfield/status/692778741567721473

shanemhansen 10 years ago | |

Cloudflare mentions they are heavy users of a golang DNS lib: https://blog.cloudflare.com/dns-parser-meet-go-fuzzer/

ntppool.org uses golang for DNS: https://news.ntppool.org/2012/10/new-dns-server/

jsmthrowaway 10 years ago | | |

Cloudflare is a heavy user of Go, period.

signa11 10 years ago | |

> I'm not sure why you wouldn't seriously consider replacing any of the following with Go

just curious about this: are there folks trying out DPDK with Go? implementing these control-plane applications on vanilla sockets (or close-to-zero wrappers on those) doesn't seem fruitful anymore.

fwiw, i have been doing DPDK stuff, but have been mostly using C...

lightcatcher 10 years ago | | |

Seconded on the DPDK point. I've been working with kernel bypass networking in C, but it seems to me that the asynchronous, queue-based APIs would be perfect for Golang or even JavaScript. The only project I've seen so far that makes kernel bypass networking nicer to program is http://www.seastar-project.org/ (which has DPDK support).

nickpsecurity 10 years ago | |

Animats suggested the same kind of services for Rust projects as well, given the huge benefit of fewer memory attacks in critical services. I'm in total agreement, and I'd add that they should preferably be compact so vendors can put them in commercial routers and appliances.

walrus01 10 years ago | |

replacing openssh with something you've written yourself seems like reinventing the wheel, just because you can.

the sheer number of person-hours at developer salary rates in north america would probably amount to at least a few million dollars.

belak 10 years ago | | |

You'd be surprised. Go actually has a supported SSH library which is a fairly decent protocol-level implementation. To actually get a "shell server" you would need to implement handling of SSH sessions (as described in the RFCs) but all the low level stuff is taken care of.

EDIT: As an example, the gogs project has implemented a small ssh server so people running it don't need to hook into OpenSSH, which relies on specific versions of OpenSSH to be performant. See https://github.com/gogits/gogs/blob/master/modules/ssh/ssh.g...

tptacek 10 years ago | | |

Except that OpenSSH is a very large, complicated, and featureful piece of C code that most servers need only a tiny portion of.

hartator 10 years ago | |

I was thinking Go is implemented in C?

fixermark 10 years ago | | |

It used to be. As of 1.5, Go's toolchain (compiler, etc.) is implemented in Go.

dlanouette 10 years ago | | |

That used to be the case. But with v1.4, it's mostly (all?) in golang now.

http://dave.cheney.net/2014/09/01/gos-runtime-c-to-go-rewrit...

steveklabnik 10 years ago | | |

Not anymore; it's in Go these days.

educar 10 years ago |

I look forward to the first programmer friendly SMTP/IMAP implementation. Haraka is the closest friendly SMTP server I have come across.

102030485868 10 years ago | |

I can understand why there hasn't been much movement in the SMTP world. SMTP is pretty hard to get right, as is maybe hinted at by the many RFCs. You really don't want to be making any mistakes because it's a somewhat unforgiving protocol... unless you send an error code.

DanielDent 10 years ago | | |

I think the reason SMTP is hard to get right is not because of the many RFCs. The reason is that it's not documented.

Operational experience at scale is needed to know how to write an effective SMTP implementation, and that experience is half-documented by many people in many different information silos.

But... I'd also say it's an extremely forgiving protocol. In fact, it's the fact that it's so forgiving which makes operational experience required to implement it. A "correct" SMTP implementation has a lot of latitude in the choices it makes - and it's that latitude which makes life difficult.

educar 10 years ago | | |

Agreed. But like most things open source, it just gets better over time once someone makes a good start :-) .

devnull42 10 years ago |

So at the moment I see no reason why a BGP implementation written in Go would be better than standard Quagga/Zebra. There aren't really concurrency or resource issues with large-scale Quagga in my experience.

tptacek 10 years ago | |

Quagga/Zebra is a giant C project. The industry is moving away, as much as it can, from serving critical infrastructure on giant C programs.

Rapzid 10 years ago | | |

I'm not aware of any trend in the area of routing/switching for Linux away from C projects. nftables and Open vSwitch are both new-ish and written in C.

elliotf 10 years ago | |

I would imagine/hope that it's more about integration with other code than using it solely as a BGP daemon. The repo seems to be related to http://osrg.github.io/ryu/ which is a "software-defined networking framework"

Off-hand, you could use GoBGP to do cheap loadbalancing-ish things without external dependencies.

wmf 10 years ago | |

This is not due to being written in Go, but GoBGP looks like it has a nicer (non-Cisco-clone) configuration language.

misframer 10 years ago | |

Go is easier to profile and test.

detaro 10 years ago | |

Nicer to integrate with other stuff maybe. E.g. for simple "just announce these routes" setups or a looking glass, where right now I might use the (Python-based) ExaBGP.

AdamJacobMuller 10 years ago | | |

Indeed, this is great for things where you want to do programmatic manipulation of routing. ExaBGP is good at that but very slow; Quagga and BIRD are quite fast but really poor at it.

rmdoss 10 years ago |

I remember years ago when every new PHP application would have "PHP" before its name. PHP-Nuke, phpMyAdmin, etc.

Seeing the same trend with Go now. Why add the language name to the software name? Real question...

sanderjd 10 years ago | |

Another real question: What is the better approach? Generic names (e.g. "bgpd")? That seems decent if you have an over-arching project to group the generic stuff under (e.g. "Apache httpd"). Making up codenames for everything (e.g. "Zebra")? It's a pain to think of those, and they're rarely descriptive or meaningful.

I don't really like the language-name-prefix thing either. It makes the language seem like the important thing about the project. Sometimes it is the most important thing, but even then, that is mostly only true at the beginning of a project when attracting contributors is most critical. But I'm not sure the other approaches are much better.

malcolmgreaves 10 years ago |

I really dislike projects that assume you know the definition of an acronym and never (1) expand it or (2) explain it. BGP is super important to the GoBGP project. It deserves at least a mention somewhere in the first 4 sentences introducing the project. Gahh!

arca_vorago 10 years ago |

Would Elixir or Erlang also be a good potential language for BGP/Quagga/Zebra?

technion 10 years ago | |

I feel "BGP in Erlang" would be exactly the kind of thing I could implement well - even if it did feel icky having to implement "MD5 Authentication" in 2016.

The problem with those sorts of projects however, is inertia. The average hobbyist rarely ever uses BGP. Large networks and ISPs aren't going to implement my personal project as a critical component to keeping their entire infrastructure online without a very good reason.

This project looks promising, I'm hoping it doesn't suffer this problem.

dragonshed 10 years ago |

I assume BGP == Border Gateway Protocol https://en.wikipedia.org/wiki/Border_Gateway_Protocol

Suggestion: include a quick abstract of what BGP is, with a link for more information.

tptacek 10 years ago | |

It's the routing protocol that computes paths between ISPs and their largest customers, and that associates ranges of IP addresses with those networks.

Even if you're not a huge ISP, it's handy to have a BGP implementation available because you can use it to do network analytics and traffic management.

fogleman 10 years ago | |

This happens fairly often (projects assuming I know the tech they are built on), and I'm no dummy.

I also clicked through several pages on the repo / site and there was no clue as to what BGP was, except some mention of RPC.

misframer 10 years ago | |

Is that necessary? I'm not sure how many people are unaware of what BGP is.

jimbokun 10 years ago | | |

I'm not aware what BGP is, clicked hoping to find out, was sorely disappointed.

LamaOfRuin 10 years ago | | |

I was generally aware of what Border Gateway Protocol was, but it did not immediately spring to mind when I read BGP, and the full name is not mentioned in the repo readme.

voidlogic 10 years ago | | |

This is just classic karma mongering by running Google for other people and posting the result. So no, probably not necessary.

But it can be helpful for topics with ambiguous acronyms or tech names: (Apple) Swift vs. (OpenStack) Swift, for example.

stingraycharles 10 years ago | |

I don't think this is necessary. If this is relevant to you, you will know exactly what it is, in the same way that GoDNS would be obvious to people who know what DNS is.

mc808 10 years ago | | |

You can't get very far in life if you assume every acronym you encounter is irrelevant to you if you don't already know what it stands for.