Google Declaring War on the Web(tante.cc) |
Google Declaring War on the Web(tante.cc) |
Nobody is stopping you from blocking bot traffic.
You don't need search engines - you can just link between sites or have webrings. Like we used to, pre-2000.
Nobody is stopping you from not using ads on the net.
Nobody can force you to use non-essential cookies (and thus: a cookie-banner).
Imagine there was a war going on, and no-one was showing up.
I would blame trash like Discord more though. Alternative search engines are available, but the crappy little web chat hides info inside.
Well, we had the same problem with IRC. There's value to be had in not everything being discoverable in 5 seconds with a google search.
But if ReCAPTCHA won't consider me human unless i have a certified phone, having search alternatives doesn't matter -- the websites themselves are just gonna block me
It's conspiracy, but it feels like Google is actively making the usual search worse so everyone will use AI overview more.
At the end of the day, is it really all that different to provide a list of links, versus an answer or overview of a few paragraphs with links to lots of different higher-quality sources?
I follow those source links all the time. Not just to "check sources" but because they provide a ton more detail. And the links are usually much better than what I'll get with regular keyword search results.
> It’s about monopolizing access to information.
Not as long as there are competitors like OpenAI and Anthropic. In fact, LLM's have provided Google with stronger competition than it's ever had before. ChatGPT and Claude are doing what Bing was never able to.
I think though a big part of this was YouTube replaced blogs. It's a generational thing.
Totally vibed version of this:
``` { "version": "https://agent-source.org/v1", "canonical_url": "https://ninjasandrobots.com/the-cone", "title": "The Real Reason Nobody Moved the Cone", "source_name": "Ninjas and Robots", "author": "Nathan Kontny", "summary": "An essay about embarrassment, public action, and why obvious fixes go undone.", "preferred_citation": "Ninjas and Robots", "source_card": { "headline": "The Real Reason Nobody Moved the Cone", "description": "People avoid obvious public actions not because they are lazy, but because being seen trying is embarrassing.", "image": "https://ninjasandrobots.com/images/cone-card.jpg", "cta": "Read the full essay" }, "allowed_excerpt": { "max_chars": 500, "preferred_excerpt": "People often avoid obvious public action because embarrassment feels more immediate than danger." }, "commercial_terms": { "ads_allowed": true, "sponsor_card_url": "https://ninjasandrobots.com/.well-known/sponsor-card.json", "licensing_contact": "hello@ninjasandrobots.com" } } ```
But something to get our original source honored better in the LLM. Maybe if one of the LLMs do this, we'd give it more loyalty? Maybe the government needs to compel this kind of behavior? No idea. It does suck though our content is just turned into AI's own tokens and we're left with a tiny "source" link if we're lucky.
You don't write post to reach the biggest amount of people, you do because you're passionate and ultimately you get people following you.
If average Joe doesn't go on your website, what's the big deal ?
I think this feature will be very useful to fight back on the optimized SEO hell that we currently have.
As you want a cookie, i put you in a table, napking, serve you a bag of cookies and hope that you eat/find the cookie you want, while hearing my music, watching my ads, pushing you more foods that I sell and other services. And sometimes, that is the experience you are searching for. But also, many just want a cookie.
That is what a conversational and maybe agentic interface can give you. Have someone a blueberry cookie? Then it gives it to you, and also give pointers to restaurants that give a more complete experience sometimes (while others may try to scam you). It is a shortcut, but also doesn't hide you the traditional way to access that.
They are not saints, but neither are all the ones in the other side. But the new way to access the relevant information you want, in a way that you can use it, have its own value.
They're trying to pivot into AI because they have gobs of "evidence" that the vast majority of people have been typing natural language questions into Google instead of looking for specific terms
The technology for indexing the web was mature enough by then, already then.
I agree that much of the downward spiral was caused by google itself, tho.
The vast majority of people don’t.
We’ve gone from Only links to the source -> Mostly links to the source, with a short summary picked almost verbatim from the source -> AI summary that mangles several sources’ information together and gets top billing -> Only the AI summary with some footnotes linking to the source.
Google has been fairly slowly been turning up the temperature of the pot and we’re only a few degrees away from a full boil. Let’s not pretend or be naive enough to think that’s not what’s happening.
You're right that there are competitors, but those competitors are doing the same thing: hoovering up content and then not giving anything back for it. There are deals in place for some of the largest publishers [2] [3], but that leaves a ton of content out in the cold. That's going to decrease the amount of content that's out there, which will decrease the quality of AI search. I don't know where that ends, but given how leveraged the economy is in AI it seems like a good idea for somebody to figure it out.
[1] https://pressgazette.co.uk/media-audience-and-business-data/...
[2] https://futureweek.com/a-complete-list-of-publishers-strikin...
[3] https://digiday.com/media/a-timeline-of-the-major-deals-betw...
Probably not, but I don't like change.
A lot of the time, the answer itself is good, but the links are spam blogs and Tiktok videos. I don't think there's a real connection between how the text is generated and what "references" are picked for it. I just searched for a math history topic and the reference was a literal TikTok video that's an advertisement for a sketchy mobile calculator app?
So yeah, these references are boosting web content, but it has nothing to do with the high-quality sources used to train the LLMs in the first place.
I've stopped using Google and find I'm not missing anything
1. One that loves AI + Big Business + very fast Innovation and disruption
2. One that loves Artisanal work + Small Business + slower but more sustainable innovation
I personally prefer living in #2, but I can totally see both "societies" continuing to exist and develop in their own ways.
Of course there is always the reality that different societies always end up interacting and affecting eachother.
AI lacks the ability to identify greatness because it's trained on the output of the average person who also lacks this ability.
It's going to create a new elite class of people who have good taste and the masses who have bad taste. Many current elites will end up with the masses. They may retain their wealth on paper, but it will be a cheap, low-quality existence but they will be convinced it's luxury.
I think eventually, everyone will get what they want, but not everyone will get what they need.
Tax excess tech profits that derive from the efforts of others and use the proceeds to fund living artists.
Vaguely analogous to levies on blank cassettes that went to offset piracy. Give the money directly to actual artists, not labels/publishers, though.
The cassette reference was a tax on consumers to send money upward. What you’re describing is the complete inverse.
On the other hand, with the internet inevitably becoming swamped by AI generated content, I can definitely see a de-digitalization of art moving into offline spaces. At least for independent work, you don’t necessarily need mass appeal or exposure, but rather access to individuals and small groups with an actual willingness to pay for art.
Recognition and gratitude keeps me going. Money pays the bills, but if that was the only concern, I'd still be a software developer.
Anonymously feeding the slop machine is nothing like it.
This content must not be used for training or refining LLMs. If it is, rest assured that if and when the regulatory environment around training data shifts in any country in which we have legal standing, we reserve the right to sue.
Maybe even with a class action element: any lawsuit stemming from a violation of this license shall cover all other violations at the same time.
I understand that Google is feeling an existential threat from other AI products that provide answers directly. But they must also understand their symbiotic relationship with the web.
Now that Google is focusing on becoming 'self contained', so to speak, we should find a better way to drive traffic to websites. Ideally one that's not under the control of a single corporation.
Anyone miss StumbleUpon?
I know this is likely to do with the nature of the problem, but that hasn’t stopped us from getting some wildly-unsuitable decentralised nonsense in the past.
If we're nitpicking, you don't what their usage?
I feel this. I asked a developer today a question about how our product is programmed to handle something, and he just sent me a summary from the internal AI assistant they've started using.
He used to provide really good, thoughtful answers, but now it's just copy/paste from the AI.
This hits hard. There’s a senior engineer at my job who is known for well written proposals. Today he shared a doc that had the typical AI formatting, was hard to read, and clearly not his style.
On the other hand, if others use AI to summerize stuff, does it matter anymore?
Google was forced to do this and it's a miracle of their slow organizational gears that they took so long to do it. So many people have already transitioned to using ChatGPT as a replacement for Google. All of this is driven by consumer behavior and the desire to "just get an answer" rather than having to wade through all the sources and try to figure out what is SEO slop vs what is actual reputable information. Google SERP results have been gamed by SEO slop for economically valuable search terms long before the rise of AI. ChatGPT simply solved a huge problem waiting for a solution.
From the web content creator's POV, there are to paths:
1. If you are merely a publisher and rely on eyeballs on ads to drive your own revenue, you are screwed. AI is going to ignore all the ads and only extract the content.
2. If however you are serving helpful information out of the goodness of your heart or if the content itself references a product or service which from which you will derive economic benefit from, you are still good.
I don't see this as a bad thing. Ads on websites were a necessary evil and will be seen as a relic of the first 30 years of the internet. Ads will not go away but they will just migrate to the application layer (youTube, LLM interfaces etc) that will provide a much more targeted experience. There will be winners and losers from this transition but that's normal and healthy.
One could argue that my content production being a hobby lets me be pretty blasé about being intermediated by a platform. That is somewhat true. If I relied upon this as a living, I would probably also conclude that actions that harm my way of living are a war on "the web", though realistically any neutral party observing must conclude that if it is a war, it's one on my kind of participation in the web - content creation for the purpose of revenue / notoriety / some other reward.
As a user, I don't actually care very much for each website and its creator. The information contained therein is useful to me, but the heterogeneity of these sites is mostly an obstacle to the information. I am much happier when my search and summarization agents are able to accurately synthesize what these websites say, in so much as such a synthesis allows me to model reality more accurately.
So I could be convinced that this change from Google makes it less likely for accurate content to be created and that I'll be misled more often. But this is a tool, and my world-model will frequently be tested by reality. If the search-and-synthesis machine fails to produce useful outcomes, I will know. And I'll have to adjust the way I treat knowledge I obtain through it so that I don't get catastrophic outcomes. But that's the same already. I don't really know that Google's search results are not planted ones calibrated to change my opinion. And I don't know that they don't collude with the Internet Archive (with whom they have a pre-existing relationship) to make it look like their constructed consensus is real.
As a user, I have to make a lot of decisions already, and having to painstakingly read search results to synthesize them myself is far less useful than using an agent. So if there is a war on the web, then I am glad to join it, on the side against the web.
How such modals aren’t considered pop-ups is beyond me.
Google dropped it from the index long ago. I had a fun discussion with some google folk where they kept arguing my website was designed wrong and that some pages had tomany links.
Basically, if you write an article about the largest banana companies you have to chose which to link to!
The 10 best movies article is better than the best 100. If you make a list of all the movies you've seen your page gradually turns into something really bad. Others will be punished for linking to it but only if you add the nth entry.
As the website is just for me it is clearly their loss not mine. No way im ging to consider linking a sub set of patents or research papers.
Everything is probably re-traceable fairly easily because Google Analytics is on nearly every web page.
But I understand maintaining your own source of archives, videos, documents, etc.
Sounds like a good vibe coding project actually.. to try and keep it all organized offline.
Why would it be good if Perplexity does it, but bad if Google does it? What are the principles at play here?
A) Google will do a good job of this, people will find their summaries more useful, and the web will evolve into a more closed system that better serves its users
or ...
B) They're gated AI community will suck, and people will start using a different search engine that better serves its users.
My money isn't on A), but they do have a lot of clout so I wouldn't rule it out.
That would allow people to still let Google to access their site, but restrict its usage. Similar for open source projects on GitHub, etc.
So the solution to that would be “change the law”.
The thing everyone needs to ask before advocating for something "government enforced" is "what would happen if this was in the hands of a hostile government?"
And then remember that (a) just because it's not hostile to you today, doesn't mean it won't be tomorrow, and (b) one man's "hostile" is another man's "utopia."
And then remember that just because Google is not hostile to you today, doesn't mean it won't be tomorrow.
So how do you think a meta noai tag would be used by a hostile government?
It would be something the website owner set.
Someone will search for "Kylie Jenner" and they will get some kind of shopping opportunity (with Google getting a commission) and links to her profile on YouTube. And maybe some publisher content on the subject. In all cases they'll probably want to angle to get more of an "aGeNtIc" experience, where Google just reads you the story or buys the lipstick for you, without you leaving google.com.
Communities have moved from public forums to private discords. Most of the major social media’s are unviewable without an account.
Facebook had a huge opportunity in the post-AI world: real humans.
Instead of focusing on connections, they've been optimizing their properties for doomscrolling.
Google, similarly, has lost the plot on what made them trustworthy in the first place: navigating to citable content.
Both companies started on this trend well before AI, but this might be the final nail in their respective coffins[0].
[0]Yes they'll likely still be profitable for a long time, but the Bell Labs-esque downfall has begun (imo).
Facebook may well fail when people don't enjoy it. But all Google ever promised was information, of variously dubious quality, and that's still their draw.
This is a problem Google has been battling forever, with all the SEO click spam.
In either case, Google was the tool that many people used to find "trustworthy" information (citable or not), compared to the other tools online.
I surprised however, that it didn't describe phase 2 of the disaster, where in the models no longer have fresh www content to train on.
It's hard to understand the long term vision of this strategy...
Google’s Vision since they were founded:
Google's mission is to organize the world's information and make it universally accessible and useful
They told everyone what they were doing the whole time
I think the coming years will be pivotal for the web. Facebook attempted a similar strategy back when their apps got traction, but they ultimately failed. Let’s hope Google fails too.
Completely, yes, that destroys the incentive. But they can reduce it 80% or 90% or so, to the point that it's just barely worthwhile to allow their crawlers.
I’m curious how they plan to generate new content in the future, because it seems obvious that simple web pages will become obsolete and eventually stop being filled with fresh data.
It will probably end with a warning every time you click a link, something like: “You are leaving to an external unsafe site.”
Think about it. Pretty much every time they show a search box with someone asking for directions to reach a physical place, what hours is it open, etc.
The greatest thing about the internet is that it has removed distances around the whole world, but Google's major value proposition seems to be that... it can accurately index and query information about local businesses?
My hope is that this will help overflow the proverbial glass for an increasing amount of people and we'll start pushing back towards the "old" web before Google and ad networks have transformed it, or find new modalities of interacting more freely with each other, and the content.
It's not going to be a small or easy fight, though...to a large extent, it's a fight against the current state of capitalism itself, and winning back our attention, critical thinking and choice.
Abrogate their usage.
Step 2: Abruptly switch to iron-fist enforcement where suddenly people get jail time for violations, but only for entities that have been critical of the government.
This is by no means the only or most likely way, just what I could come up with in 30 seconds. There may be much better "evil government" strategies.
1: https://www.ign.com/articles/larian-ceo-responds-to-divinity... 2: https://www.dexerto.com/gaming/clair-obscur-expedition-33-ai...
Expedition 33 nailed music, aesthetics, and narrative, and I am glad that they took a diffusion model for what it is, not for what marketing wants you to believe. although the game itself would benefit from one or two months dedicated exclusively to optimization, it is THE reference of how generative technology can be used - purely internally, to ideate and iterate at the pace of your taste and a bunch of H200s. we are aware of that process detail purely because they slipped in one place and got briefly "owned" by Twitter.
That ain't it.