The Prompt API

The Prompt API (developer.chrome.com)

279 points by gslin 67 days ago | 144 comments

haberman 67 days ago |

This API seems perfect for an idea I've had for a while: a de-snarkifier for social media.

Social media can be intellectually stimulating and educational, but it's also easy to get sucked into ideological sniping and flamewars, even if you didn't go looking for it. The emotional and intellectual energy spent flaming strangers on the Internet is a complete waste of human capital.

With an API like this, I assume you could have a browser extension that could de-snarkify content before showing it to you. You could ask the LLM to preserve all factual content from the post, but to de-claw any aggressive or snarky language. If you really wanted to have fun, you could ask it to turn anything written in an aggressive tone into something that sounds absurd or incompetent, so that the more aggressive the post, the more it would make the author look silly.

This could have a double benefit. For the reader, it insulates them from the personal attacks of random strangers on the Internet. Don't get me wrong, there is a time and a place for real, charged arguments about important issues that affect us all. But there is little to be gained from having those fights with strangers; on the contrary, I think it poisons the body politic when strangers are screaming at each other.

For the writer, it takes away any incentive to be snarky or rude. If other people filter their content this way, there's no point in trying to be mean to them, and no "race to the bottom" for who can be more nasty.
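A minimal sketch of the rewrite step described above, not a working extension. The `PromptSession` interface, the instruction wording, and the function names are all hypothetical; the only assumption about the model is that it exposes some `prompt(text)` method that resolves to a string.

```typescript
// Hypothetical minimal shape of a model session: anything with an async
// prompt(text) method fits. Declared here so the sketch is self-contained.
interface PromptSession {
  prompt(input: string): Promise<string>;
}

// Instruction text is an illustration of the idea in the comment above:
// keep the facts, drop the snark, stay viewpoint neutral.
const DESNARK_INSTRUCTIONS = [
  "Rewrite the post below.",
  "Preserve every factual claim.",
  "Remove aggressive, snarky, or insulting language.",
  "Do not take a side on the views expressed.",
  "Return only the rewritten post.",
].join(" ");

// Build the full prompt for one post. Pure function, so it is easy to test.
function buildDesnarkPrompt(post: string): string {
  return `${DESNARK_INSTRUCTIONS}\n\nPOST:\n${post.trim()}`;
}

// Rewrite one post. If the model throws or returns nothing, show the
// original text instead of hiding the post.
async function desnarkify(session: PromptSession, post: string): Promise<string> {
  try {
    const rewritten = (await session.prompt(buildDesnarkPrompt(post))).trim();
    return rewritten.length > 0 ? rewritten : post;
  } catch {
    return post;
  }
}
```

In Chrome, a session obtained from the Prompt API would presumably be passed in as `session`; check the current documentation for the exact entry point rather than relying on this sketch. Falling back to the original post on failure is a design choice: a broken filter should not silently delete content.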

nsilvestri 67 days ago | |

This is the Soylent of written communication. Full nutritional value with an unremarkable flavor.

haberman 67 days ago | | |

That is unironically exactly what I want from social media.

I want the option to engage with the substance of new developments in the world, technology, etc. without the drama. I don't want to be drawn into the drama of strangers (who could, for all I know, just be bots or ragebaiting AIs).

If I want drama, there's plenty of it on TV, or I could talk to my friends about what is going on with people I actually know.

The anti-pattern, in my mind, is logging on to engage with substantive content and to be inadvertently drawn into flamewars with strangers.

jychang 66 days ago | | |

Are humans supposed to enjoy the "flavor" of diarrhea, as the result of giving every village idiot a microphone so they can spew shit from their mouths?

Sure, you might say this sort of thing is boiling flavor out of your food, but... boiling the bacteria out of what you consume isn't a bad thing.

whatarethembits 66 days ago | |

Kinda looking forward to something like this, as it has the potential to remove empty junk calories from the internet, hopefully leading to SIGNIFICANTLY less use of today's popular platforms.

My wish list:

- Eliminate ALL clickbait titles and ads. I only want to see a dry factual title.

- For any given topic, I only care about the main article (with the option to only see a summary, unless it's a high-quality blog) and a couple of substantive comments; the rest is junk I don't want to see.

The current state of popular social media sites has meant that I don't use it at all (except HN, which is trending in the same direction due to saturation with AI), but every other week or so I end up wasting a few hours, which I'd like to avoid entirely.

Ideally this would lead to 98% of content being filtered/summarised out, and over time I'd only use the internet for looking things up with intention. I want this to remove the majority of the "entertainment" value from the internet (by default) so that time/energy can be refocused on real life and high-quality sources (books) only.

Cider9986 66 days ago | | |

> - Eliminate ALL clickbait titles and ads. I only want to see a dry factual title.

DeArrow works for YouTube at least. uBlock Origin or the Brave browser works for ads. Not sure why you'd need an AI to remove ads...

seanhunter 66 days ago | | |

I actually have built myself a personal AI agent that does this for the main news headlines and for a summary of my personal email (sadly I can’t run it on work email yet). It can extract any actions required from a mail and make them into tasks, and also has a killer feature - a “sort out my email” button that archives all the emails it classifies as FYI, spam, mailing list or moot (it has classifiers for this), first producing a one-pager markdown summary of the whole lot in one shot, leaving all emails marked “action required” or “Urgent”. Email summaries are deliberately dry and factual with all advertising false urgency removed.

I can manually “hold” emails so they don’t go in the “sort out my email” woodchipper. It’s been life-changing.
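The "sort out my email" step can be sketched roughly as follows. This is an illustration only, not the commenter's implementation: the `Email` type, the label names, and `sortOutEmail` are hypothetical, and the labels are assumed to come from a classifier that has already run.

```typescript
// Hypothetical labels, mirroring the categories named in the comment.
type Label = "fyi" | "spam" | "mailing-list" | "moot" | "action-required" | "urgent";

interface Email {
  id: string;
  subject: string;
  label: Label;  // assumed to be set by an upstream classifier
  held: boolean; // manually held emails are never archived
}

// Labels that the button is allowed to archive.
const ARCHIVABLE: ReadonlySet<Label> = new Set<Label>(["fyi", "spam", "mailing-list", "moot"]);

// Split the inbox into archived and kept emails, and produce a short
// markdown summary of everything that was archived.
function sortOutEmail(inbox: Email[]): { archived: Email[]; kept: Email[]; summary: string } {
  const archived = inbox.filter((e) => !e.held && ARCHIVABLE.has(e.label));
  const kept = inbox.filter((e) => e.held || !ARCHIVABLE.has(e.label));
  const summary = ["# Archived", ...archived.map((e) => `- [${e.label}] ${e.subject}`)].join("\n");
  return { archived, kept, summary };
}
```

Checking `held` before the label means a held email stays in the inbox whatever the classifier says, which matches the manual override described above.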

encrux 66 days ago | |

For YouTube, this already exists and I’m using it. The extension is called DeArrow and aims to reduce sensationalism via crowdsourcing, though I wouldn’t be surprised if top contributors are bots using LLMs.

niek_pas 66 days ago | | |

Man, that before-after slider on the home page makes me so sad... YouTube used to just be random people sharing cool stuff, and those de-sensationalized titles really brought me back to that time for a second! Cool stuff.

sebzim4500 66 days ago | | |

For people like me who had tried it in the past and found it annoying, note that it now has a 'casual' mode where it only changes the truly useless titles and leaves reasonable ones alone.

netcan 67 days ago | |

I think it's an interesting idea to explore.

But... It's the type of idea that is unpredictable as it comes into contact with reality. If it works, it probably works very differently from the initial idea of how it will work.

haberman 67 days ago | | |

I 100% agree with this. I am certain that I cannot foresee how this would play out in reality.

jychang 66 days ago | | |

Yeah, I 100% agree with the caution in this comment.

I see the merit in such a proposal. It's the linguistic equivalent to boiling the food you consume, instead of eating it raw with all the associated bad stuff.

The problem is, as you said, that this plan is unlikely to be as rosy as it's portrayed and probably has a lot of drawbacks in real life.

Interesting to think about and explore, though.

kbx 66 days ago | |

Chrome PM for built-in AI APIs here.

I love this "de-snarkifier" idea and it seems to have broad interest. I couldn't resist hacking (well, vibe coding[1]) a "Snarknada" prototype to explore the viability, including patterns for low-latency and accuracy.

You’ve hit on exactly why we think on-device is the right move for this class of use cases. If you tried to "de-snark" an entire infinite-scrolling feed via a cloud API, the token costs would be astronomical for a developer. Plus, people (rightly) don't want to send their private social feeds or DMs to a third-party server just to clean up the tone.

Moving this to the device should make high-frequency "Semantic Mutation" financially and technically viable for the first time. If you (or anyone else) starts building this more seriously than my PM vibe coded toy, and hits specific friction points, I’d love to hear about them: it helps us prioritize the roadmap.

[1]: If you're using a coding agent (Cursor, Claude Code, etc.), I recommend pointing it to https://www.npmjs.com/package/built-in-ai-skills-md-agent-md. Most models were trained on the now-obsolete window.ai namespace, and this skill file helps them use the current APIs correctly.
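A small feature-detection sketch for the namespace issue mentioned in the footnote. It assumes the current entry point is a global named `LanguageModel` and that the obsolete one lived under `ai` (as in `window.ai`); both names should be checked against the current documentation. `detectPromptApi` and `PromptApiSupport` are hypothetical names.

```typescript
type PromptApiSupport = "current" | "legacy-only" | "none";

// Inspect a global-like object and report which entry point it exposes.
// Taking the scope as a parameter keeps the function testable outside a browser.
function detectPromptApi(scope: Record<string, unknown>): PromptApiSupport {
  if (scope["LanguageModel"] !== undefined) return "current"; // assumed current global
  if (scope["ai"] !== undefined) return "legacy-only";        // obsolete window.ai namespace
  return "none";
}
```

In a page this might be called as `detectPromptApi(globalThis as unknown as Record<string, unknown>)`, with any code path that still reaches for the legacy namespace treated as a bug.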

behindsight 65 days ago | | |

been cranking on this too but not just for snark but for spam/scam heuristics too.

it's something I feel is finally viable to combat at zero cost to the user.

This plus webmcp would allow it to serve as a form of automod too on websites that you authenticate with (imagine a world where your social media profile has an automod of its own powered locally. can use this to steer your feed or to mute/block/moderate as need be). Even without WebMCP I have been working on making it autodetect html elements and extract UGC (comments/threads etc.) automatically to moderate (since my initial tests with a small group found some websites with frequent UI changes would break if hardcoded or if they did a lot of AB testing)

Even better, the concept would allow you to also use it to hide certain spoilers (imagine sports or new movies that just came out and you want to not have to hide away from all socials).

didn't find any contacts on your new HN account, but in a few weeks will be able to reach out to you with it fleshed out. :)

We have a community of nearly 14k that we will distribute this to

duskdozer 66 days ago | |

Or just ignore it. Or say you will not engage under [conditions]. Ultimately it will be you who looks foolish when the AI rewrote something incorrectly and you engaged with something that wasn't being said.

Karrot_Kream 66 days ago | |

I've thought about this for HN which, now that it's become so big, just has a lot of aggressive negativity and snark. You'd probably run into the same problem as Usenet Killfiles: the folks that use Killfiles would see random orphaned conversations or would just miss large parts of threads while the people that don't have Killfiles would see a mess of toxicity that would make them want to leave. Likewise if you prompt filter your experience, you'll be separating your experience from everyone else's.

dotancohen 67 days ago | |

Though I hate the idea of this, I can see it becoming popular in some use cases, such as schools with "safe places".

bfeist 66 days ago | |

I would love an app like this. I am a frequent user of https://www.boringreport.org/ for news, which does something like what you’re describing but for news articles.

an5ragchoudhary 65 days ago | | |

thanks for sharing this - quite cool!

altmanaltman 66 days ago | |

Don't you think it's better to just curate your social media and follow communities where the default is not toxicity? This is basically a distortion layer for reality and will just encourage more echo chambers.

Also what is toxic to one person is not toxic to another depending on their subjective choices. How will you solve for this without everyone just seeing what they want to see even if reality is not like that? I feel that will just enhance the problems of social media rather than reduce them.

It kind of falls apart when you start to think of edge cases rather than "hey this tool will keep morons off my feed!" mentality

haberman 66 days ago | | |

I'm inclined to think that this will actually decrease the power of echo chambers. Echo chambers become that way by policing dissent, either through moderation or through aggressive attacks on dissenters. A de-snarkifier would de-fang the latter.

I agree that what is toxic to one person is not toxic to another, but think that this is largely because many people enjoy seeing their perceived enemies attacked. In other words, it comes down to a viewpoint bias: attacking my group/viewpoint is toxic, while attacking other groups/viewpoints is good and noble.

My ideal is that a de-snarkifier would be strongly instructed to be viewpoint neutral; to filter based on whether the comment is being respectful, without regard to the views being expressed.

My idea would backfire if other people program their filter to reinforce their own biases by favoring content that they agree with and creating or amplifying personal attacks on their perceived enemies. That would be unfortunate, but ultimately we can only control what we do; each person gets to make their own decision.

yearesadpeople 66 days ago | |

It is important, however, not to intellectualise repugnant, racist, or inflammatory language; it deserves to be called out for what it is aimed at doing.

whattheheckheck 66 days ago | |

And then we will understand reality even more. Only let the tech giants tell us what other people are expressing. Great idea

jurgenburgen 67 days ago | |

On the other hand it would make all comments sound the same and further dilute internet content into average slop.

whatarethembits 66 days ago | | |

I'm hoping that something like this can condense a 1000+ comment thread to a couple of paragraphs at most.

sidkhanooja 67 days ago | | |

on reflection, i would appreciate average slop more than the occasional heinous slop people say when they are opinionated..

senordevnyc 66 days ago | |

I was literally just thinking that I’d like something like this for HN, which has become an incredibly bitter, cynical, and depressing place in the last decade. On virtually any story, most of the top comments are negative. Every major company is a greedy monster trying to destroy your life, every CEO is a sociopath, everything is terrible, all the time. I wonder how most HN users even get out of bed every day.

domenicd 66 days ago |

I led the design effort on this API, before retiring. Here's my writeup on some of the considerations that went into it: https://domenic.me/builtin-ai-api-design/

avaer 67 days ago |

It works, I've shipped this as a "local inference"/poor person's ollama for low-end llm tasks like search. The main win is that it's free and privacy preserving, and (mostly) transparent to users in that they don't have to do anything, which is great for giving non-technical users local inference without making them do scary native things.

But keep in mind the actual experience for users is not great; the model download is orders of magnitude greater than downloading the browser itself, and something that needs to happen before you get your first token back. That's unfixable until operating systems start reliably shipping their own prebaked models that an API like this could plug into.

meander_water 66 days ago |

This looks like it uses Gemini Nano under the hood. But the latest Gemma4 E2B and E4B models appear to be much better, so you'd probably be better off deploying quantized versions through an extension for now.

- Gemini Nano-1: 46% MMLU, 1.8B

- Gemini Nano-2: 56% MMLU, 3.25B

- Gemma4 E2B: 60.0% MMLU, 2.3B

- Gemma4 E4B: 69.4% MMLU, 4.5B

Sources:

- https://huggingface.co/google/gemma-4-E2B-it

- https://android-developers.googleblog.com/2024/10/gemini-nan...

domenicd 66 days ago | |

I no longer have any inside knowledge, but from my time on this team they were very quick about getting the latest small (Google) models into Chrome. I expect that if Gemma 4 (or its equivalent Gemini Nano) isn't already in Chrome, then it will be soon.

Note that the article here was last updated 2025-09-21, and as of that time it was already on Gemini Nano 3.

meander_water 66 days ago | | |

Thanks for the insider info! Do you know if there are any published benchmarks for Nano 3?

ceejayoz 66 days ago | |

> This looks like it uses Gemini Nano under the hood.

Yes; "With the Prompt API, you can send natural language requests to Gemini Nano in the browser."

Tepix 66 days ago | |

The Prompt API uses the model that's available in your browser. For Edge I believe it's Phi4.

jameslk 67 days ago |

Seems like a good way for a rogue JS script to offload token generation to a bunch of unsuspecting visitors

It would actually be pretty interesting to see if it's possible to decentralize the compute to generate something useful from a larger prompt broken down and sent to a bunch of browsers using a subagent pattern or something like RLM, each working on a smaller part of the prompt.

varun_ch 67 days ago | |

This feels like a lot of work for low reward; the technical/business infrastructure would be wild. And if anyone wants to offload their prompts to users' browsers, they might as well just use the Chrome API correctly? How many server side prompts would realistically be useful to offload to a low end model like this?

Plus even if you really wanted to do that, WebGPU exists and has for a while right?

dotancohen 66 days ago | | |

  > This feels like a lot of work for low reward

Low per-device reward combined with a high user count - either by large legitimate players or by botnets - has been the monetisation strategy of most online enterprises.

jameslk 67 days ago | | |

> How many server side prompts would realistically be useful to offload to a low end model like this?

There's a lot of ways this API could go, e.g. more powerful models eventually, or perhaps integration with cloud models. For example, I could see Google trying to default Gemini as the model for users signed into Chrome

dnnddidiej 66 days ago | | |

Nefarious use cases. Run that on some sucker's machine.

Edit: simple example is a spam bot

Tepix 66 days ago | |

token generation of a tiny model. Hardly worth anything.

rock_artist 67 days ago |

I think it's a step into a future of proper Model API. But it's just a small step. It reminds me of Apple's Foundation Models [1]

While many AI integrations are focused on text communication or chat-style interfaces, a lot of software benefits from non-text interfaces.

I believe at some point OSes and browsers should provide an API to manage models so you'll have access to on-device/remote ones with a simplified interface for the app. Making something standardized that is cross-platform would be fantastic. It also needs to be on mobile devices, so the players that can easily make it happen are mostly Apple and Google. (Meta will follow or vice-versa I guess)

Key-point: it shouldn't be exclusive to promoted models.

So the app would be able to query and get the right model(s).

[1] https://developer.apple.com/documentation/foundationmodels

elpakal 66 days ago | |

Apple's Foundation models seem great on paper until you see the 4k context window. (though I know we are still early in this chapter).

afshinmeh 67 days ago |

https://github.com/mozilla/standards-positions/issues/1067

kurtoid 66 days ago | |

benjaminbenben 66 days ago |

We use this for summarising our hack day write ups: https://remotehack.space/previous-hacks/

It's a tiny script that looks up the rss feed and uses the content to generate summaries; quite a nice fit with our static site. Sometime I'd like to extend it to ask different questions about the content.

tom1337 66 days ago |

The idea of having local LLMs accessible in the browser for privacy reasons is nice I guess, but when each browser has a different model attached to this API, testing becomes even more of a nightmare than it is now. I wonder if this will drive more users towards Chrome because most of the usages of this API might be just tailored to fit the Gemini Nano model?

mudkipdev 66 days ago |

Gemini Nano, unlike Gemma, is not open-weight, right? I would be interested in dumping the model weights, unless someone has done that already

nl 67 days ago |

The model this uses is useless for anything beyond 2 round chat at the most.

If you want to do anything interesting you need transformers.js and a decent model. Qwen 0.9B is where things start working usefully.

gopalv 66 days ago |

The better part of this is having a local-first AI, particularly because it has tool-calling builtin & structured output.

I haven't pushed out a full version[1] which uses ducklake-wasm + this to make a completely local SQL answering machine, but for now all it does is retype prompts in the browser.

[1] - https://notmysock.org/code/voice-gemini-prompt.html

fg137 67 days ago |

"sorry, to use our website, you must have at least 22 GB of free disk space."

cdrini 67 days ago | |

True, but arguably better than "sorry, to use our website, you must have a ChatGPT subscription."

fg137 66 days ago | | |

More like "you need to sign up for our website and pay for a subscription", and I'd much rather do that if it's actually providing value. I am absolutely not going to run a model locally which slowly churns out words at 5 tps while making the computer hot to touch.

jfoster 66 days ago | | |

Also much better than every website wanting its own 22 GB rather than the 22 GB being a shared resource.

_pdp_ 67 days ago | | |

that is ~9% of the total available disk space for baseline phones and laptops for a model that is not that useful.
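A rough check of that figure, assuming a 256 GB baseline device (the comment does not state a capacity, so that number is an assumption) and the 22 GB model size quoted upthread:

$$\frac{22\ \text{GB}}{256\ \text{GB}} \approx 0.086 \approx 9\%$$

On a 128 GB device the same model would take roughly 17% of total storage.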

michaelbuckbee 66 days ago |

Fwiw - I did a fairly large comparison of Gemini Nano (the in browser ai model) vs a comparable free hosted model of Gemma (from OpenRouter) and the hosted model absolutely trashed the local model on every aspect of speed, reliability, availability, etc. [1]

I'm not particularly happy about that outcome as I wish we had more locally run AI models for reasons of privacy and efficiency, so this is more just a warning that at present there are some severe tradeoffs.

1 - https://sendcheckit.com/blog/ai-powered-subject-line-alterna...

kbx 66 days ago | |

Hey, Chrome PM for built-in AI here.

Thanks for the write-up and the comparison, but more importantly for using the API in production!

You’re highlighting the "state of the art" gap we’re working to close. Cloud models will always have the advantage of massive parameter counts, but our bet is that for a huge class of simpler or high-volume tasks, the upsides of on-device (e.g. zero-cost, permission-less start with no quotas/infra, network-resilience, privacy) make it a compelling trade-off.

The models have been getting better at a rapid clip, and the team is heads-down on optimizing performance and reliability. To that end, we're always grateful for feedback. If you hit specific bugs, crashes, or quality regressions, filing a report with repro steps is the best way to help us improve. You can file those on crbug.com under the "Chromium > Blink > AI" component.

skybrian 67 days ago |

Still in origin trial? Looks like they're adding a temperature parameter:

https://chromestatus.com/feature/6325545693478912

kbx 66 days ago | |

It's on track to ship in Chrome 148: https://chromestatus.com/feature/5134603979063296

The parameters are not part of this initial release but can be added back with the origin trial you discovered.

me551ah 66 days ago |

I’m just wondering how much more RAM and VRAM chrome will use after these changes

6thbit 66 days ago |

Is “Ship a 22gb model on your product” the new “put a chat window on your product”?

I agree with others this fits better in the OS, or hey maybe Apple sells a time-machine sort of NAS with neural engine chips.

jussy 64 days ago |

I get the filtering use case here and I'd say the other one is personalisation of generic marketing copy.

david_shi 66 days ago |

Will this API and others like it be a strong enough incentive to move away from Chromium based browsers and back on to Chrome?

Ronsenshi 66 days ago |

Not long before all of the web content will be going through these AI pipelines where the user might not even see the original webpage.

timxtokyo 65 days ago | |

the world of agentic ai!

franze 66 days ago |

trying to wrap it in a unix style CLI

see: https://github.com/Arthur-Ficial/fenster

and: https://news.ycombinator.com/item?id=47923692

hard work so far

izietto 66 days ago |

Can you pass it the current page contents for an AI-based AdBlock / cookie manager / etc.?

solarkraft 65 days ago |

Interesting. Questionable from a web standards POV, but interesting.

Who’s gonna make it call tools?

ilaksh 66 days ago |

Any chance this will be supported by Firefox or other browsers soon?

gorgoiler 67 days ago |

Imagine a Vendor API that adds a way to link from the page straight into a device purchase workflow. As a trial of the API in Chrome you can order a new Google Pixel 9b directly from any page with the word Android in it!

Or a LocalNet API that integrates with trusted hardware devices on your local network. As a trial (Chrome beta programme — strictly limited but here’s 3x signup links to share with your friends) you can adjust your Google Next Mini underfloor heating directly from Chrome!

Or a DirectCast API that lets you stream <video> elements to a device of your choice even over a VPN. As a Chrome trial, you can use your Google Cloud account to stream directly from YouTube Premium to any linked Google Chromecast devices you own!

tethys 66 days ago |

Slightly off-topic: Refreshing to see these two authors link to their Bluesky and Mastodon profiles. No Twitter/X in sight!

oneeyedpigeon 66 days ago |

Every time I see "prompt" nowadays, I'm briefly hopeful that I'm going to read something about $PS1. Then, inevitably, AI disappoints me yet again.

danny_codes 67 days ago |

Domain names are a nice candidate for a Georgist tax