Google proposed Web Bundles could threaten the Web as we know it

Google proposed Web Bundles could threaten the Web as we know it (ghacks.net)

147 points by maproot 5 years ago | 79 comments

jasode 5 years ago |

Fyi... Web Bundles and Signed HTTP Exchanges are confusing topics so I think it's worth reading 2 previous threads with comments from 2 Google employees (spankalee, jefftk) [1].

One may still choose to discount their explanations because they may be biased sources but I still think everyone should try to understand what they're saying. Hopefully, being familiar with the technical details will elevate the discussion so people who disagree can point out specific and concrete technical flaws of those explanations rather than just restating a generalized version of "Google is trying to take over the whole web."

[1] previous threads:

https://news.ycombinator.com/item?id=24275752

https://news.ycombinator.com/item?id=24278068

https://news.ycombinator.com/item?id=24324120

drewbug01 5 years ago | |

> Hopefully, being familiar with the technical details will elevate the discussion so people who disagree can point out specific and concrete technical flaws

Asserting that an elevated discussion should center only on technical flaws and disagreements is a myopic way to look at a topic.

There’s more to the web than the technology used to power it. How a technology is used, and what it enables (good or bad) is an appropriate topic for this forum and constitutes elevated discussion.

jasode 5 years ago | | |

>discussion should center only on technical flaws

You misinterpreted what I wrote. My comment did not restrict it to only technical flaws. I already agree with your following statement:

>How a technology is used, and what it enables (good or bad) is an appropriate topic for this forum and constitutes elevated discussion.

Yes. Having us share some armchair anthropology (which I do myself[1]) on the social or secondary effects of technology is constructive dialogue and elevated discussion. My comment was never trying to cut that off.

That said, just rehashing "Google is just trying to own the web!" or slight variations of that meme may feel good for the poster to type out but it does not educate me on this topic. This is especially degrading to the discussion if the poster restating that common sentiment has a mistaken mental model of what Web Bundles actually do or can't do. Instead, share some quality facts so I as a reader can come to the conclusion on my own that this technology forces unblockable ads and lets Google take over my web experience.

[1] https://news.ycombinator.com/item?id=19229225

jhall1468 5 years ago | | |

I think you misunderstood the point you were replying to. The idea is that you can’t really have a discussion about what it enables or how it’s used unless you already understand the technical details. And that shows as a lot of the comments/blog posts about this topic are using underlying technical assumptions that are entirely incorrect.

rektide 5 years ago | | |

I agree that how tech is used & what it enables is a good discussion.

I think folks interested in this topic should get a basic education.

I recommend starting with the intents & desires the project started with, by reading the IETF draft of the use cases:

https://tools.ietf.org/html/draft-yasskin-wpack-use-cases-01

TekMol 5 years ago |

I think their idea is to combine that with signing the bundles, so a page from www.someserver.com can be served by anyone, aka Google. I guess this would mean Google can serve all content on the web.

There seems to be a strong urge in Google to cut the connection between the endpoints of the web and become the central authority. Make all traffic flow through their machines. Let no information arrive at the endpoints.

Right now, requests on the web are kind of p2p. A user requests a website, the publisher serves it any way they see fit. Directly via their servers or via a CDN of their choice.

Google seems to have a strong focus on ending this. Turning the web into Googlebook / AOLoogle.

I wonder why. Do they see their business model threatened on the open web? Or do they see a chance to increase their profit with a closed web?

cflat 5 years ago |

Let’s call a spade a spade. The only real world problem that WebBundles (and Signed Exchanges) really solve is to allow AMP to impersonate your website.

Google wants all the click data and the click through navigation data about users (by way of passive logs) so they can sell more ads.

There are no other real world problems that web bundles solve.

Spivak 5 years ago | |

The real world problem web bundles solve is distributed caching. Right now sites have to pick one or a few CDNs, have a trust relationship with them, and allow them to impersonate the site.

Web bundles change this relationship so that anyone can cache sites if it benefits them to do so. If you share a link on Twitter or Facebook or Discord or Slack they can cache the page on their servers and deliver it through the connection you already have open to them.

Web Bundles also open the door for network-local caches that don’t require MitM or trusting the cache.

cflat 5 years ago | | |

This feels contrived. Rarely do I, as a brand or content creator, want my content circulating without my control. It doesn't make business sense.

judge2020 5 years ago | |

Links on the page are the same as before signing, so the only actual problem with them is not being able to immediately change/delete the documents hosted elsewhere.

cflat 5 years ago | | |

Yea, but the web server delivering them is now google. Google now gets the access logs and using the persistent tls socket can follow the users activity. Sure the content is signed, but the delivery is no longer private.

tmd83 5 years ago |

I am really curious what the general opinion is on Googlers as web developers. A long while ago I saw some nice articles from Google about site optimization.

Do they even follow any of their original advice, or does Google basically keep doing over-engineered stuff, fixed by adding another set of over-engineered stuff?

Let's talk Gmail. I just refreshed the window and it did close to 400 requests, ~8MB download which translates to nearly 40MB of resources. And it keeps making more requests even when I'm not doing anything.

And a refresh of Google.com, the search page, did 33 requests and nearly a MB download.

And they are preaching to the world about optimizing the web?

ffpip 5 years ago |

Yet another thing Google wants to fix by serving everything through their servers instead of asking devs to fix their own sites.

Same problem with AMP. Instead of asking news sites to fix their slow pages, it forced them through AMP by promising better result ranking.

Ask them to make their sites faster within a month or say they'll get booted off search. You'll be surprised at how fast they comply.

jmull 5 years ago |

I think this line of criticism of web bundles misses the mark. It looks to me like the issues raised are perfectly possible and just as easy without web bundles -- that is, these may be legitimate issues, but are independent of web bundles.

My issue with web bundles is that it's yet another pile of complexity with very little incremental value over things that already exist. A poor tradeoff.

There's a substantial on-going cost to each web standard added so each one needs to "pay" for itself with broad or deep usefulness. Web bundles are just another way to skin a cat.

maple3142 5 years ago |

I don't understand why this can't be blocked by content blockers. I tried opening a .wbn from the original article and inspecting the resources using devtools, and it still has the files listed there. So content blockers can still block something like xxx.wbn:/js/ads.js if the browser has such an API.

Also, I think web bundles can be an Electron replacement for some use cases too, so that some totally offline JavaScript webapps don't have to use Electron.
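A rough sketch of what such a blocker rule could look like. Everything here is an assumption for illustration: the `bundle.wbn:/inner/path` addressing just mirrors the notation in the comment above (it is not a real URL scheme), the pattern list is made up, and it presumes the browser exposes the inner paths of a bundle to extensions.

```python
from fnmatch import fnmatchcase

# Hypothetical filter list: glob patterns matched against the path of a
# resource *inside* a bundle.
BLOCKED_PATTERNS = ["/js/ads.js", "/ads/*", "*/tracker*.js"]


def split_bundle_url(url):
    """Split 'https://host/app.wbn:/js/ads.js' into (bundle URL, inner path).

    Returns (url, None) when the URL does not point inside a bundle.
    """
    marker = ".wbn:"
    index = url.find(marker)
    if index == -1:
        return url, None
    bundle = url[: index + len(".wbn")]
    inner = url[index + len(marker):]
    return bundle, inner


def should_block(url, patterns=BLOCKED_PATTERNS):
    """Return True if the bundled resource matches any blocked pattern."""
    _, inner = split_bundle_url(url)
    if inner is None:
        return False  # not a bundled resource; out of scope for this sketch
    return any(fnmatchcase(inner, pattern) for pattern in patterns)


print(should_block("https://example.com/app.wbn:/js/ads.js"))
print(should_block("https://example.com/app.wbn:/js/main.js"))
```

With these made-up patterns, the first call matches the `/js/ads.js` rule and the second matches nothing.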

jakelazaroff 5 years ago |

I was hoping this would be more than a rehash of the article on the Brave blog about the same topic, but alas. Link to that discussion: https://news.ycombinator.com/item?id=24274968

jeroenhd 5 years ago |

Is there anything in the web bundle standard that forces outdated pages to be refreshed? The spec seems to say little more than "detecting stolen keys is not our problem".

I can imagine this being a problem when news stories turn out to be a false alarm and Google happily keeps serving the original content instead of the corrected content.

There's also a risk of vulnerability here, as getting a signed package might very well be used to host phishing pages on web caches.

bogwog 5 years ago |

Google should just fork the web already. Let them create their own private platform and do whatever they want.

The massive control Chrome and Android give them means they can do whatever they want already, but at least with a private platform they won’t have to fight people and deal with the negative PR of doing evil stuff. And then the rest of us who like privacy and competition and ad blockers can use the “legacy” web.

fartcannon 5 years ago | |

Then they'll just pay a few journalists to run a couple hit pieces on the open web, saying it's a place where people who kick puppies reside.

dimitrios1 5 years ago |

Interestingly the main quotation in the article is from a Brave team member -- what does Brave do when this is rolled out? Fork Chromium?

jimbobimbo 5 years ago |

Web bundles look like a great thing for Electron and PWA like scenarios, specifically due to signature support. We had to drop service workers and reinvent the wheel with APPX (basically a signed ZIP file) in one of our apps, to ensure code integrity.

tormeh 5 years ago |

So it's a signed executable, running in a sandbox, served from a federated app store... Isn't the whole JS/CSS/HTML web crap a bit overcomplicated for this purpose?

azangru 5 years ago |

The missing hyphen in the title is really confusing.

rektide 5 years ago |

I for one believe in the specified use cases of Web Bundles, & believe they are worthy.

https://wicg.github.io/webpackage/draft-yasskin-wpack-use-ca...

What we have here is a budding conspiracy theory, not even a theory, just gesticulation. Consensual Delusion, a belief that we are persecuted by secret forces that must be held off, held at bay.

This started months ago with an incoherent rambling ticket by the Brave author that is being cited. He spent months going back & forth with wild accusations & unspecified concerns. After dozens and dozens of exchanges, he finally named one single scenario, that people might "hide" their tracking malware by renaming files as they put them into the bundle.

Color me extremely unimpressed & unscared. Enormous sound & fury, for a capability that is in no way different from the web we already have today. It's not hard to set up a webserver to randomize asset names. Nothing about webbundles is new or changes that.

Consensual Delusions like this hacked up hoax of a story threaten reality as we know it. As the old civic videos say: DONT BE A SUCKER. Anyone selling fear, uncertainty, & doubt is to be met with skepticism. Increasingly, FUD is how Apple/Mozilla/Brave are selling their anti-feature policy. "Trust us, we won't let the web work with midi" doesn't sound that great, but is much more honest than what we get, which is "these engineers & standards groups working on these specs are secretly trying to undermine this treasured web which we must protect & keep as is at all costs". The involved engineers' histories indicate they obviously care enormously about bettering the web, & in this case are combatting sizable transpiling tool bloat for devs, & enabling offline sharing & an offline capable web, and literally fighting censorship, which are all truly worthy goals that will vastly help the web.

This is all super hard to work through. Yes, Google used the web to reap enormous profit by means of enormous information control & inventory systems for ads & eyeballs. But Google also would not exist without the web, & historically the web was a small toy that couldn't do much compared to apps. The tables have turned, & the web is clearly ascendant, much safer, & increasingly we understand that the limitations of ux were largely from lack of will to explore & test what limits there really were, so the situation is no longer so obviously tense. But Google Chrome & Chromium & the spec work Google does are, imo, designed to improve a communal shared resource for all humanity, designed to greaten the web, not subvert it. We can see that here, as the engineers working on webbundle have shown a thousand times over their commitment to honest, above board, clear integrity as they have tried & tried & tried to work with Peter Snyder as he fumbled & plodded his way to a scenario where WebBundles pose any real danger, & Peter has imo failed at presenting anything. We can see the engineers take Peter seriously, try to work with him. And so I feel it is in general. It is intimidating as hell that the web is so big, has so many capabilities, that so much keeps getting added, and so much of that comes from gigantic unimaginably huge pools of capital derived from eyeballs-on-screen. But somehow it has been working out, the engineers have genuinely cared about doing the right thing, & usually the standards bodies & TAG can eventually come to harmony & agree, & the web improves.

Peter's dissent thread:

https://github.com/WICG/webpackage/issues/551

Personally I greatly look forward to WebBundles. It will radically improve the JS module situation, yay, a thousand times yay, & giving people the ability to share content directly with one another, without relying on centralized infrastructure, is one of the most genuine pure & true new expanses for the web & one I am greatly looking forward to.

noisy_boy 5 years ago |

The only answer is to not click on ads. Do your research via review videos/Amazon etc. (I know that they are/could be indirect advertisements but at least the creators get some sponsorship money). Then go to the brick and mortar shop, check it out and then, here is the kicker, pay the extra $5 to buy from them.

samsquire 5 years ago |

This is an idea I had: an alternative to web bundles that solves the same issues.

Inside an HTML file, we introduce an attribute for embedded resources called cache="identifier". Script tags and style tags will have this attribute defined. An embedded image element would also need to be introduced. Inline all your resources. The browser will fetch the HTML and add whatever has cache="identifier" to its cache.

Then when the browser fetches a page, it will send a Cache-Got header, which is a serialized Bloom filter of the cached identifiers.

The server will check the Bloom filter to see whether an item needs to be sent to the client, and replace the contents of already-cached embedded resources with an empty script tag or empty style tag.

EDIT: Why is this being downvoted?
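For anyone who wants to poke at the idea, here is a minimal sketch of the Cache-Got round trip. The filter size, hash count, base64 header encoding, and the `render_page` helper are all assumptions made for illustration; none of it comes from an existing spec.

```python
import base64
import hashlib


class BloomFilter:
    """Tiny Bloom filter backed by a Python int used as a bit array."""

    def __init__(self, size_bits=256, num_hashes=3, bits=0):
        self.size_bits = size_bits
        self.num_hashes = num_hashes
        self.bits = bits

    def _positions(self, item):
        # Derive num_hashes bit positions by salting SHA-256 with an index.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.size_bits

    def add(self, item):
        for pos in self._positions(item):
            self.bits |= 1 << pos

    def __contains__(self, item):
        return all((self.bits >> pos) & 1 for pos in self._positions(item))

    def to_header(self):
        """Serialize the bit array for the (hypothetical) Cache-Got header."""
        raw = self.bits.to_bytes(self.size_bits // 8, "big")
        return base64.b64encode(raw).decode("ascii")

    @classmethod
    def from_header(cls, value, size_bits=256, num_hashes=3):
        bits = int.from_bytes(base64.b64decode(value), "big")
        return cls(size_bits, num_hashes, bits)


def render_page(resources, cache_got_header):
    """Server side: inline what the client lacks, emit empty tags otherwise."""
    cached = BloomFilter.from_header(cache_got_header)
    parts = []
    for identifier, body in resources.items():
        if identifier in cached:
            parts.append(f'<script cache="{identifier}"></script>')
        else:
            parts.append(f'<script cache="{identifier}">{body}</script>')
    return "\n".join(parts)


# Client side: remember what is cached and send it along with the request.
client_cache = BloomFilter()
client_cache.add("jquery-3.5")
print(render_page({"jquery-3.5": "/* lib */", "app-v2": "/* app */"},
                  client_cache.to_header()))
```

One caveat with this design: Bloom filters can return false positives, so the server will occasionally skip a resource the client does not actually have. A real version would need a fallback request for that case.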