The Need for Web UDP

The Need for Web UDP(github.com)

100 points by mrmoka 8 years ago | 127 comments

pdkl95 8 years ago |

> Requirements

> 1. Security - it has to benefit from SSL.

There is a lot more to security than transport-layer encryption and authentication.

> Connection based

UDP is too hard, so re-inventing TCP?

> p2p is not reliable ... monetization

Oh, you want require the SAaSS[1] model.

> Simple to use

The already-stated "requirements" are asking for something more complex than WebRTC.

> Minimum header overhead

Wait, are you thinking about using UDP to transport HTTP?! Do you even know what your MTU is?

> WebRTC suffers from complexity

That complexity exists for reason. Nowhere in this document is a discussion of the potential problems of using UDP or the ways tthe new service might be exploited by malicious actors.

[1] Service as a Software Substitute

mrmoka 8 years ago | |

> There is a lot more to security than transport-layer encryption and authentication.

You are welcome to PR.

> Connection based

This could be explored: starting from simple handshake, all the way to fully connection based protocol. Open for discussion based on developer needs.

> The already-stated "requirements" are asking for something more complex than WebRTC.

You are welcome to highlight specifics that makes you think that way.

> Wait, are you thinking about using UDP to transport HTTP?! Do you even know what your MTU is?

UDP is not streamed but message based protocol. As WebSockets implement their transport layer over pure TCP, WebUDP could implement it's own layer over pure UDP for various reasons.

> > WebRTC suffers from complexity

> That complexity exists for reason.

For P2P type communications, this complexity is perhaps reasonable.

For Server-Client type communications not at all.

> Nowhere in this document is a discussion of the potential problems of using UDP or the ways tthe new service might be exploited by malicious actors.

This document is initial effort to bring public discussion to form a reasonable shape of what WebUDP could look like. You are welcome to participate.

pdkl95 8 years ago | | |

> UDP is not streamed but message based protocol.

I'm very[1] familiar with the IP family of protocols.

> Open for discussion

If you don't know what your requirements are, you shouldn't be choosing a transport technology. It sounds like you want an library that wraps WebSockets or WebRTC and handles most of the complexity.

> WebUDP could implement it's own [transport] layer over pure UDP

Then you want TCP. The only reason to use UDP is to avoid the complexities of a transport layer. Transport reliability is very hard; this isn't something that is easy to re-implement by yourself in UDP.

More importantly, I take it you don't know what your MTU is? The Maximum Transmission Unit[2] the maximum packet size. On ethernet-based networks, it's probably ~0-100 octets less than ethernet's 1500 octet MTU. You need to keep UDP packets under this limit, or they will fragment. Fragmented IP packets may not arrive at all and even when they do, the OS will wait until all fragments arrive before passing the data up to the app. If you're insane and send HTTP headers in each packet, you've wasted most of your data space. Each packet? Shouldn't we send headers in the first packet only? Except that every packet IS the "first packet" in stateless protocol like UDP. It's the transport features of TCP that create ordered-data semantics.

[1] I used to write firmware for embedded systems. That included writing - from scratch, in Z80 asm - the entire network layer: ethernet (RealTek), ARP, IP, UDP, TCP, SNMP, HTTP, etc.

[2] https://en.wikipedia.org/wiki/Maximum_transmission_unit

mnarayan01 8 years ago | | |

> UDP is not streamed but message based protocol. As WebSockets implement their transport layer over pure TCP, WebUDP could implement it's own layer over pure UDP for various reasons.

As soon as your message exceeds the MTU, things get complicated. Sure you can layer something to re-assemble, but if packets are dropping, this is going to start getting problematic really fast. And if packets are not dropping, then TCP shouldn't overly increase latency anyway.

kelnos 8 years ago |

This seems to come up every now and then, and I see the same arguments, and they're still not compelling.

WebRTC data channels in unreliable mode will work just fine. Is it as easy as opening up a WebSocket connection? No, it's not. Is it as easy on the server side as accepting a WebSocket connection? Also no.

But it really isn't that hard[0], and people have built libraries to help you out. So just use one, and move on with your life.

And you also benefit from a standard that has been fleshed out over multiple years by some very smart (if imperfect) people.

On the browser side, it's already supported by all major browsers, with the notable exception of iOS Safari (which should change this fall with the release of iOS 11)[1]. Even though it's not ideal, you can fall back to WebSocket for the few holdouts.

[0] Source: I've done it before, building it from scratch.

[1] https://caniuse.com/#feat=rtcpeerconnection

lxtx 8 years ago |

Posted this recently in another thread, but maybe someone will find https://github.com/seemk/WebUdp useful as there have been many WebRTC threads lately. It's a WebRTC DataChannel server implementation for out of order, unreliable UDP traffic for browsers that has built-in SDP signaling and STUN.

mrmoka 8 years ago | |

Just looking at the complexity of "minimal" implementation only highlights the need for different solution.

Wofiel 8 years ago |

Anyone interested in this might find some enlightenment in Glenn Fiddler's whitepaper. [1] It talks about the issues with TCP-based connections, WebSockets, QUIC, WebRTC and provides a solution (with code) to doing UDP in browser.

[1] https://gafferongames.com/post/why_cant_i_send_udp_packets_f...

gafferongames 8 years ago | |

Update: there is now a full browser-based implementation of netcode.io too! https://github.com/RedpointGames/netcode.io-browser

TD-Linux 8 years ago |

I think you're going to need a stronger argument for "Why not WebRTC", or at least concrete proposals about what parts of the API you would change. For example, "SDP parsing is too complex, have a JS API to set up a client-server connection without a SDP".

IMO the most complex parts of WebRTC are the SRTP-DTLS encryption (except you also specified TLS as a requirement for Web UDP), and STUN/TURN (which are optional and not required for client-server).

mrmoka 8 years ago | |

This is valuable input, you are welcome to contribute in form of PR!

TD-Linux 8 years ago | | |

Well, the problem is that I currently think WebRTC 1.0 is sufficient. For the example I gave, I think a simple JS library to write out the SDP is a perfectly fine solution. So I'm more asking, what did you find wrong with WebRTC?

soapdog 8 years ago |

Firefox OS had TCP[0] and UDP[1] sockets as Web APIs. They were quite pleasant to use as a developer and the only way to create a real mail client (needs TCP to implement POP, IMAP, SMTP in JS).

I wish more Firefox OS APIs had become web standards. They would allow for some very powerful PWAs.

[0]: https://developer.mozilla.org/en-US/docs/Archive/B2G_OS/API/... [1]: https://developer.mozilla.org/en-US/docs/Archive/B2G_OS/API/...

jcranmer 8 years ago | |

> Firefox OS had TCP[0] and UDP[1] sockets as Web APIs. They were quite pleasant to use as a developer and the only way to create a real mail client (needs TCP to implement POP, IMAP, SMTP in JS).

The biggest problem with Firefox OS's TCP standard is that it used an event-driven model, which is somewhat at odds with more current promise-based thinking. The more natural version is to use something based on WHATWG Streams, but that spec has been stuck in vaporware-land for years.

> I wish more Firefox OS APIs had become web standards. They would allow for some very powerful PWAs.

The TCP specification actually was undergoing standardization (reformatted to use the streams API): https://www.w3.org/TR/tcp-udp-sockets/ . The problem was the working group ended up closing down, and since the specification wasn't suitable for use on webpages, it ended up with nobody to host it.

mrmoka 8 years ago | |

Those implementations were created to be utilised within FirefoxOS where applications are granted permissions by user when installing them. And access to several API would be strictly regulated by OS it self.

In Web context this approach wouldn't work and would lead to security issues. Just like there was need or WebSockets (TCP), there is need for similar API for UDP but it cannot be pure access for creating UDP connections as this leads to many security concerns.

soapdog 8 years ago | | |

I am a fan of the "just stick a security popup and let people choose" option but I understand that is not the same as having proper security.

mnarayan01 8 years ago |

For this to be taken seriously, I think you need to demonstrate that you're capable of using WebRTC to do what you need. Without a decent nod in that direction, people are going to think that you're just not aware of the potential complexities.

mrmoka 8 years ago | |

Worth mentioning again: this effort is to explore server-client low-latency, not peer-to-peer scenarios which WebRTC solves well.

And this is collaborative effort, not personal. So all input is welcome.

I've used WebRTC for p2p and server-client cases, and it is nightmare for later. And many other developers have expressed very similar experience when it comes to server-client cases.

Even more, after many years we see very little adoption of WebRTC for server-client cases due to it's complexity. WebSockets on the other hand took very little time to get adopted by many back-end platforms as well as browser vendors. I wrote my own WebSockets solution long time ago on .Net before 4.5 .Net was released (includes own WebSockets implementation).

kazinator 8 years ago |

Using UDP with the web on a wide scale will either replicate everything in TCP, inside UDP wrapping, or else cause big problems all over the Internet.

It will be fast only in the beginning when a few clients are participating, but then screw over the infrastructure with degenerative congestive behaviors when "everyone" is on it. And by then, it will be a standard everyone is stuck with, with the only way out being to complicate it with a tirade of hacky refinements based on guesswork combined with crossed fingers.

That's not even considering malicious interference: what sorts of attacks will be discovered on this new UDP based shit, and what sorts of hacks will be required to mitigate them.

tlb 8 years ago | |

I know that was the conventional wisdom, from back when backbone links were the bottleneck. But in the modern internet, almost all the congestion is at the edges.

Since most traffic for games is server->client, most of the congestion will happen when several users are competing for the same customer link (DSL or cable modem). This already happens with streaming services, and people just yell at each other to stop downloading updates while I'm watching Netflix.

kazinator 8 years ago | | |

That could change one fine day when those edges get their stuff together.

Indeed, the subscriber lines and surrounding edge hardware have not kept up with the times. Depending on where you are and who your provider is, chances are you're getting the same shitty line rates you had ten years ago (or more), though you have more memory, a bigger hard drive and a faster CPU, and the backbone is faster.

zlynx 8 years ago | | |

Anywhere with a lot of Gigabit Fiber installed, the congestion is _not_ at the edges. It's further in. If all of those installed Gigabit edges simultaneously used 1 Gbps download, it wouldn't happen. They'd get much less.

xxgreg 8 years ago | |

Too late... Chrome already uses HTTP over UDP. https://en.wikipedia.org/wiki/QUIC

detaro 8 years ago | | |

And choose the first option: replicating all the important parts of TCP inside UDP. Which is fine, but a lot of effort.

kazinator 8 years ago | | |

But it doesn't open a ridiculous number of tabs at the same time quickly and with low resource usage, so it's supposedly dead in the water now. :) :) :)

remline 8 years ago |

SCTP would be the logical choice for supporting both streams and messages.

No one really wants to support a network with the evil of arbitrary UDP from the browser. In SCTP, the handshake combined with crypto tricks can allows a server to make sure the initiator stores a larger cookie than it needs to hold for verification, throttling the DDOS riffraff.

mrmoka 8 years ago | |

SCTP have been mentioned in the repo: https://github.com/Maksims/web-udp-public/issues/1#issuecomm...

remline 8 years ago | | |

The reasoning there is kind of lame, every filter knows how to pass SCTP or UDP and doesn't pass SCTP since it is unpopular and UDP because stopping it is most of why you are filtering.

Making an SCTP web standard would improve endnode support (and actual app use) which is beginning to wane and are SCTPs adoption problems.

fulafel 8 years ago | |

WebRTC uses (UDP encapsulated) SCTP.

jstewartmobile 8 years ago |

So many of these comments are like "UDP is not TCP!" or "Muh TCP guarantees!", then they go on to mention WebRTC?!!!

I mean, if you're streaming audio and video Real Time, is there really any point to TCP? If a few frames get dropped, then bursted back once the connection stabilizes, does that improve the user experience in any way?

WebRTC seems like a perfect candidate for UDP communications for the actual media streams.

mrmoka 8 years ago | |

WebRTC is best option for media streams today for peer-to-peer cases.

The goal of the topic is to explore simple option for server-client communication using low-latency communication, without reliability and without ordered delivery.

WebRTC can be used for such case, although it is not designed for it. Due to that implementation is very complex and not much adopted. This is something we trying to explore, either new API or simplifications to WebRTC to make it simple choice for UDP in server-client scenarios.

detaro 8 years ago | | |

Given the kind of feedback happening (and my initial reaction admittedly was similar), maybe a different name would help with perception (I'm already not a big fan of "WebSockets", but "WebTCP" would have been worse), promoting the goals over the implementation detail of UDP?

jstewartmobile 8 years ago | | |

It just seems like most real-life usages of WebRTC are either media-centric where TCP does more harm than good, or multiplayer scenarios where it's a wash since TCP's ordering is a pro while TCP's overhead is a con.

topranks 8 years ago | |

TCP congestion control can be very beneficial for streaming applications (when combined with adaptive rate codecs).

With UDP you have to create your own feedback mechanism to find the optimal bitrate to stream to the far side at.

mrmoka 8 years ago | | |

This is where DCCP or alternative protocols come in play.

justsomedood 8 years ago |

Wouldn't this make it pretty easy to have browser clients unknowingly participate in DoS attacks?

xg15 8 years ago |

So if the main justification of this proposal is "WebRTC is too complicated", wouldn't that more speak for a WebRTC library and/or server geared towards games?

trelliscoded 8 years ago | |

WebRTC doesn't work for browser control of a lot of IoT type stuff. The really high volume cheapo devices speak things like CoAP or DTLS, and they don't have the horsepower to run something like WebRTC. You'd need a level of control similar to the berkeley socket API to get the browser to speak those protocols.

At the moment, I see a lot of ridiculous stuff like phone apps talking to some cloud instance which tries to jam the packets back through your firewall into your Internet light bulbs. Congratulations, you literally just used thousands of kilometers of fiber and billions of dollars of routing infrastructure to make the world's most expensive how many... light bulb joke.

TD-Linux 8 years ago | |

It sounds like you're describing HumbleNet: https://github.com/HumbleNet/HumbleNet

mrmoka 8 years ago | | |

And with WebUDP life for those guys would be much easier.

z3t4 8 years ago | |

Last time I checked on WebRTC tutorials and examples was either too old and deprecated, or required the latest Chrome with some flags enabled. The best we have right now for real time streaming is HLS with up to 20 seconds or more delay. We had real time video chat 20 years ago, why can't we have it in the browser today !?

TD-Linux 8 years ago | | |

I think you checked too long ago. Appear.in, Jitsi, Discord, and Cisco Spark are all working with WebRTC now.

nitwit005 8 years ago | |

Yes, it seems like a server side complexity issue should be a fixable problem without updating the standards.

I suppose you'll still need to deploy a stun/turn server to deal with the NAT issues unless you're happy with IPv6 only, but that's not really something the standard can fix.

mrmoka 8 years ago | |

As mentioned in the doc, one of the options is to simplify WebRTC by making some components optional to enable it's better adoption and easier to implement on the back-end as well as on front-end.

captainmuon 8 years ago |

This seems to propose that you can only connect to a certain server, similar to the same-origin-policy. What I would really love instead is the possibility to connect to arbitrary IPs to be able to implement real P2P. The linked posts dismiss this early because of the possibility to cause DDOS, but really, you can already do that from a hacked desktop "Quake", so there is no harm in being able to do it from a browser-based "Quake". What you'd have to prevent is drive-by use of UDP, not use of UDP period.

I would propose having two HTML profiles in future, HTML document, and HTML application (and maybe HTML legacy). HTML document would be restricted in what you can do, and would be primarily for reading Hypertext. For HTML application you would have to go through a few clicks to install or activate it - now you are going to say that people will just click it away, but that is already the case with current desktop app installers, so it is not more insecure! An application profile page will be able to access the net just like any other native application. Most importantly, it will be able to bypass same-origin policy and send UDP and TCP anywhere - but not with credientials of course.

You'd still have the problem of being able to probe internal networks, and being able to manipulate UPnP routers. For the first, the network admin could have a group profile setting or similar to disable this kind of access. For the second, browsers could selectively block this on a case-by-case basis if needed.

For the problem of DDOS, I think we should not let that restrict us from implementing useful technologies. Rather we should fix it at the source. For example, maybe one could lock down certain routes if an attack is detected. All traffic along these routes is throttled, unless you send along a proof-of-work token. I'm just making this up, but my point is that I think we haven't exhausted all options here.

geocar 8 years ago |

I've written a WebRTC "server" that can establish such a connection (and also acts as it's own STUN/TURN server) and hand off sockets to a local process.

WebRTC isn't very complicated.

The hardest part is probably ICE, which basically involves each point telling eachother what they see, and potentially consulting a third party (STUN/TURN). I'd love to see more magic there, but once that's in-place, I don't see what's so hard about just using DataChannels.

One idea might be to put signalling into HTTP headers, e.g. have the client and server introduce something like:

    ICE: sdp-desc...

and if so, allow WebRTC to skip the ICE negotiation step if speaking to the server.

Matheus28 8 years ago |

I'd also like to draw some attention to this proposal: https://github.com/Maksims/web-udp-public/issues/6

gafferongames 8 years ago | |

This proposal lacks challenge/response and makes game server vulnerable to being used in DDoS amplification attacks if they use request/response pattern. Also, a proposal without packet encryption in 2017, seriously?!

znpy 8 years ago |

Just use QUIC and advocate for its adoption.

Matthias247 8 years ago | |

That doesn't necessarily solve the problem which is in focus here. The QUIC version which is available in browsers (ok, Chrome only) is not only the QUIC stream layer, but also the QUIC HTTP adaption layer. From javascript side you just interact with HTTP and use QUIC under the hood. However with HTTP you also get all HTTP semantics (headers, reliability, ordering inside of request/response streams, etc.). What is requested is an additional protocol and API which avoids all the overhead and just allows to send and receive messages with best effort - because games and other realtime applications may prefer to build their own reliability mechanism on top.

The only way to use HTTP/QUIC for packetlike communication might be to send each packet inside a seperate HTTP request. But I guess that will have a super high overhead (lifecycle of a whole stream must be managed for a packet which actually has no lifecycle) and will most likely also not deliever the expected results (afaik HTTP on top of QUIC still has head-of-line blocking for request/stream establishment. Request headers must be received in order).

New javascript APIs which are utilizing QUIC could work. However one would need to explore if QUIC is actually helpful for target applications, since it provides a stream semantic, whereas UDP is purely packet based. QUIC might also introduce similar issues like WebRTC to the server side: It's a complex protocol spanning multiple layers (everything from flow-control to encrytion). Therefore it will be hard to integrate into environments where no QUIC library is available. But that's only a feeling, since I haven't yet reviewed the QUIC specification in detail.

RajuVarghese 8 years ago |

One thing that I do not see mentioned here is multicast. Are there any advantages? Watching a live sports game, for instance. Since multicast is connection-less and sent only over UDP, the more distant discussion about introducing multicast into browsers never takes place. Having used a multicast video stream for many years in an enterprise setting I can unequivocally state that this would decrease network utilization. Especially in the years to come as the interwebs get clogged up with broadcast-type data.

mrmoka 8 years ago | |

The main goal of this effort is to explore server-client communication scenarios.

wbl 8 years ago | |

ISPs are not supporting multicast over the public internet.

RajuVarghese 8 years ago | | |

They will if it becomes a standard way of broadcasting. They will save quite a bit of network load.

LinuxBender 8 years ago |

Most home users probably allow UDP out of their network. How many businesses by default allow UDP outbound on any port?

Will the documentation/RFC's encourage folks to fail gracefully if UDP is not supported in their network?

Could this spec include support for SRV records? It isn't allowed in http/1.1.

eecc 8 years ago |

Can't we just rewrite the Linux kernel in JavaScript and boot it off a browser

... /s (hopefully)

trelliscoded 8 years ago | |

Why, yes. Yes you can.

http://jslinux.org/

pdkl95 8 years ago | |

https://www.destroyallsoftware.com/talks/the-birth-and-death...

nomadlogic 8 years ago | |

netbsd beat you to this by 5 years or so: https://blog.netbsd.org/tnf/entry/kernel_drivers_compiled_to...

shurcooL 8 years ago |

Are people aware of https://www.w3.org/TR/tcp-udp-sockets/? I didn't see it mentioned so far.

mrmoka 8 years ago | |

This spec been there for very long time, and has been adopted by FirefoxOS (deprecated platform). Which exposes low-level access to establish pure TCP and UDP connections with permissions flow by environment.

It exposes many security concerns, that's why WebSockets were more favourable over TCPSocket. We want similar for UDP.

LunaSea 8 years ago | | |

Not having raw TCP means you can't speak to any existing TCP protocol through a browser. It might not be a useful feature but I think that not taking it into account is an issue because most use cases can't be implemented with WebSockets.

ksec 8 years ago |

I read 5G is doing something similar, or rather something new to solve this problem. They are completely remaking the TCP/IP stack.

Anyone knows anything about that?

dr_hooo 8 years ago | |

The ETSI NGP ISG[1] is looking at next generation protocols, also in the context of 5G.

"This ISG is seen as a transitional group i.e. a vehicle for the 5G community (and others of interest) to first gather their thoughts and prepare the case for the Internet community’s engagement in a complementary and synchronised modernisation effort."

The efforts seems to be in an quite early stage for now (architecture, models, requirements, etc).

I personally don't see TCP/IP going anywhere with 5G, but we may see more parallel deployments of protocols within isolated 5G network slices.

[1] http://www.etsi.org/technologies-clusters/technologies/next-...

api 8 years ago |

Just tweeted this and plan to publicize. If we had web UDP we could port ZeroTier pretty easily to run in the browser, allowing web apps to coexist with machines on true virtual networks.

It'd probably be lighter weight than WebRTC, which is IMHO an over-engineered nightmare. I'd like to see just the A/V encode/decode parts of WebRTC live on and the rest of it get deprecated in favor of web UDP and open-ended browser based P2P implementations. That's what should have happened, not a monolithic katamari ball of a standard.

smcdow 8 years ago |

Maybe you'd have more interest if you wrote up a proper RFC.

mrmoka 8 years ago | |

This is collaborative effort. I act from my capabilities, but people with certain skillset are welcome to contribute with proper RFC.