Meta Caps Internal AI Token Spending After Costs Approach Billions in 2026

(mlq.ai)

98 points by typeofhuman 2 hours ago | 84 comments

simonw 1 hour ago |

"The leaderboard, which ranked employees and teams by token consumption, inadvertently incentivized usage volume over productive output."

Who could possibly have predicted that happening?

Aurornis 1 hour ago | |

A past employer thought it was a good idea to put up a leaderboard of who sent the most Slack messages. They celebrated the people at the top for being so active.

Predictably, everyone started talking in Slack like their jobs depended on it. Everyone was responding to everything. Instead of writing out a complete message and pressing enter, they'd send each fragment of the sentence as a new line.

The Slack leaderboard was never shown again. Unfortunately the habit remained because people were afraid they were going to be secretly judged by how much Slack activity they generated.

I expect the same thing is going to happen at companies who had token leaderboards. Once you've instilled that fear in people, they internalize the expectation.

PaulHoule 50 minutes ago | | |

Reminds me of the place I worked at where I got in trouble because I was the only person writing JIRA tickets. Instead of bitching out the product manager or the tester for not writing tickets, they just complained to me. And if I wrote a ticket about how we could speed up the 40 minute build to 15 minutes I'd have to explain "How does this change improve the customer experience?" to which I answered "If the build was faster the customer would have had the product six months ago"

lokar 58 minutes ago | | |

I worked somewhere that made the time between a PR being sent for review and being ready to merge a metric for the reviewers. Not time to add feedback in each round. Total time elapsed.

Insanity

Eridrus 37 minutes ago | | |

This will inevitably be allocated like other budgets, and from talking to Meta folks about GPU budgets, it is going to be brutal.

Loughla 1 hour ago | | |

You have to realize that if you set a measure, you're actually setting a goal for your employees. There is no such thing as a meaningless metric; why else would you measure it?

No amount of "this isn't used for anything" will change that. It's inherent in human nature in the 21st century to believe any and all metrics will be used against them, and therefore must be gamed.

It's why you also have to set UNBELIEVABLY clear goals and have incentives tied to those goals. Incentives meaning money. If you want to measure things, measure them. But have clear, consistent, and meaningful goals tied to bonuses or something if you want a thing done correctly.

morpheos137 57 minutes ago | | |

What is it about Silicon Valley leaders not understanding basic economics or business management? These kinds of cargo cult tactics would not fly in any other industry.

ryanschaefer 56 minutes ago | |

It’s funny how many times the same thing happens at each large company. I think people’s thought process is this:

> Oh wow! If I paid for this myself I would have spent a lot of money! Are other people spending as much as me? I’m going to create a leaderboard!

> Oh no, my misinformed manager is using the leaderboard as a sleight of hand for work. I need to game this now.

Then the leaderboard is banned… I can’t see how this ever really goes up the chain beyond director.

what 28 minutes ago | | |

There is zero chance that this is how the leaderboards came to be.

skizm 50 minutes ago | |

It wasn't leadership doing this though. Any Meta IC can generate internal apps and dashboards. This was unofficial and unsupported. Some random IC just made it for fun. Management is usually pretty lax with stuff like this (plenty of games and joke internal apps) so they left it up until it became a problem.

jghn 1 hour ago | |

> Who could possibly have predicted that happening?

Charles Goodhart :-)

goldenarm 57 minutes ago | | |

https://en.wikipedia.org/wiki/Goodhart%27s_law

giancarlostoro 42 minutes ago | |

I still don't understand how Mark Zuckerberg has any serious investors, he went on this AI tangent and has absolutely nothing to show for it, despite FB / Meta having built some key tech in the space. He needs to stop trying to do something "different" and literally try and build a serious coding agent he can sell, he could have probably had something worthwhile in that space by now.

He got drastically more serious about AI in 2022 and 2023, and he has nothing to show for it.

Heck, he could have rented GPUs the way Elon did at this point and either mended the bleeding or stopped it, not sure how many he has, but it beats losing this badly.

If he doesn't wake up and learn how to business, I suspect he will lose his empire he's built up for himself.

MangoCoffee 36 minutes ago | | |

>he could have rented GPUs the way Elon did at this point

"Meta building cloud business to sell excess AI capacity, Bloomberg News reports"

https://www.reuters.com/business/meta-sell-excess-ai-computi...

cadamsdotcom 28 minutes ago | | |

Haven’t got numbers so I might be wrong, but I suspect it is dwarfed by the present size and future potential of Meta’s ads business.

darth_avocado 1 hour ago | |

> Who could possibly have predicted that happening?

Everyone except the executives who get paid millions to predict exactly that.

Avicebron 56 minutes ago | | |

Not a problem. There are thousands of employees standing by, willing to sacrifice their jobs for their vision.

It's a hard job, someone has to not pay consequences for bad decisions.

qwertytyyuu 1 hour ago | |

I know, right? What did the leadership think would happen when they gave some of the world's greatest software engineers (supposedly) an easily quantifiable metric to target?

VygmraMGVl 1 hour ago | | |

The leaderboard wasn't leadership generated, it was engineer generated from internally available data. The leadership target is "impact" from AI tools.

John23832 45 minutes ago | |

Given that Meta has run 5ish layoffs at this point, and everyone is in survival mode, what did they expect? Everyone wants to juice whatever numbers possible to keep their jobs.

0cf8612b2e1e 1 hour ago | |

Now come on, there was a recent post where the author argued that infallible management knew this would happen, but that it was part of the double-secret-probation strategy to get the cogs to finally start using AI.

SpicyLemonZest 1 hour ago | | |

I still think this is true and it’s not obvious to me from the source article that Meta believes otherwise. I couldn’t find the full memo, do they claim the leaderboard or “tokenmaxxing” era was a mistake?

dzonga 1 hour ago | |

unfortunately at big tech, this shit will keep happening.

people who make it to managers tend to have bozo tendencies & are yes men.

before it was lines of code, Jira tickets closed. Now it's tokens spent.

sharts 1 hour ago | |

How dare you question the most effective allocators of capital.

xnx 38 minutes ago |

It's stories like this that really dispel the genius/merit theory of successful business. The best you can say about Zuck is he didn't prevent Facebook from becoming huge.

dwoosley 1 hour ago |

I’d be curious to see the breakdown on spending by use case. I’ve heard it said that the majority of tokenmaxxing comes from non-technical uses like reading PDFs, creating PowerPoints, generating graphics/images, etc. But I’ve never seen any actual proof of that.

ryanschaefer 58 minutes ago |

Wasn’t this already reported on? FWIW this article links to primary sources from early last month https://www.theinformation.com/articles/tokenminimizing-meta...

bdcravens 59 minutes ago |

And I still can't exhaust the limits on my Claude Max subscription, despite being more productive than I've ever been in terms of real work (i.e., things that actually make money).

jm4 31 minutes ago | |

For real. I've used 8B tokens in the past month and haven't hit my limits even once. In fact, I can't even get close except for the day I used Fable. I've barely stopped. Claude keeps reminding me to sleep.

linzhangrun 42 minutes ago |

This is what you get when token consumption becomes a KPI...

d4rkp4ttern 53 minutes ago |

Ok I’ll ask since nobody else has: are they not giving their devs a Claude Code Max or Codex Pro subscription? If so, why is token cost approaching billions? And if not, why not?

lesuorac 50 minutes ago | |

They can't.

The subscriptions are for personal use not enterprise.

i.e. [1] "This article is about paid Max plans for individual consumers. If you're part of an organization looking to use Claude with your team, refer to Team and Enterprise Plans."

[1]: https://support.claude.com/en/articles/11049741-what-is-the-...

542458 51 minutes ago | |

Enterprise customers don’t get those plans, at the enterprise level you have to pay by the API rate… so people don’t have limited use, but you’re also not getting the heavily discounted rate the “normal” plans are at.

grim_io 48 minutes ago | |

Big enterprises don't get to have those subscriptions. OpenAI or Anthropic simply won't sell them to you if you need a couple thousand of those.

root_axis 1 hour ago |

Not sure if I missed it but I couldn't find any information in the article to explain where the "approaching billions" estimate is coming from.

I could believe it, but I'd want to see something a little more concrete.

nsagent 1 hour ago |

Not surprising. It seems that the comment section of every coding agent thread has at least one person mentioning they use "tokenmaxxing" to increase their token usage because it was brought up during their quarterly review, at a standup, or some other communique from on high.

Just wonder what happens when more and more companies introduce similar restrictions. Will that lead to devaluations of the LLM companies?

andsoitis 1 hour ago |

measure outcomes (impact), not effort (token usage, lines of code, code coverage, hours worked, etc.)

lokar 51 minutes ago | |

The whole phenomenon of metric based Eng evaluations is because leadership does not trust line managers to evaluate individual engineers.

4yfr 1 hour ago | |

What outcomes though? The ones I’ve seen posted are still nonsensical metrics that a publicly traded firm absolutely doesn’t care about.

It wants to see faster R&D, higher revenues from existing assets, greater operating margins, higher sales to invested capital ratio and so on…

The best way to measure that for a software firm is up-time of services, usage and project completion duration

wpasc 1 hour ago | | |

Measuring uptime? I've seen Anthropic's status page, and they are a >$1 trillion company who "largely solved" coding. So clearly you aren't correct. /s

dheera 1 hour ago | |

> measure outcomes (impact)

This is also not easy. In particular proactively preventing bugs is not rewarded

andsoitis 4 minutes ago | | |

> In particular proactively preventing bugs is not rewarded

The main way I think you can proactively prevent bugs in a meaningful way is by crafting and propagating better architecture.

Better (or worse) architecture and adoption of it can be measured through a mix of quantitative and qualitative means so those metrics could be used to evaluate the impact of the engineer driving that architecture.

veber-alex 58 minutes ago | |

It's not flashy.

When shit just works for months or years no one is going to come and praise you for stuff you did a while back.

You are better off breaking stuff and then fixing it to show how useful you are.

felix-the-cat 1 hour ago |

Within a few weeks of telling people at our company that if they don’t use AI they will be replaced by someone who does, they just announced that their allocation with ChatGPT has reset and are now panicking as they blew through their million token allocation for this month in under six hours - you can’t make this shit up.

Atotalnoob 12 minutes ago | |

A million tokens is like $15 with SOTA models… that’s their allocation?

Trasmatta 1 hour ago |

All those billions spent on tokens by Meta, and not a single iota of value generated by any of it

janalsncm 42 minutes ago | |

I can’t tell if you’re complaining that Meta isn’t saving the whales or that their products aren’t good. If it’s the latter, you should double check their financials.

steve-atx-7600 1 hour ago | |

I guess maybe they can crank out more ads in their dystopian ad space of a social network site.

tyre 57 minutes ago | |

I love how confidently you say this, with no evidence provided (and I doubt you have any.)

Just a pristine comment section yap.

countcol 41 minutes ago | | |

The only thing I’ve seen Meta release recently is spy glasses, and every employee who has worked on that product should be in prison (with a live 24/7 feed where the world gets to watch them wallow).

The times I’ve been asked to evaluate a prospective candidate and I see that product on their résumé, it’s been an instant veto, in the same category as working at Palantir.

jazzyjackson 51 minutes ago | | |

If there was a positive return on token spend, they wouldn’t be capping it now, would they?

Barrin92 50 minutes ago | | |

>I love how confidently you say this,

it's not that difficult to say it confidently if you use any of their services and applications because exactly nothing has changed.

For reference, most labor productivity increases over the last 50 years amounted to about 2% per year. If a hypothetical FB engineer had doubled their productivity with their gazillion tokens, that would be roughly 35 years of productivity gains (at 2% compounded) in one year. I'd wager the gains would be quite evident if you opened any of their apps.
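
As a rough check on that arithmetic, here is a minimal sketch. It assumes the 2% figure compounds annually (the comment does not say), and the function name is made up for illustration. Under that assumption a doubling takes about 35 years; with simple, non-compounding growth it would take 50.

```python
import math

def years_to_multiply(factor: float, annual_growth: float) -> float:
    """Years of compound growth at `annual_growth` needed to multiply output by `factor`.

    Solves (1 + annual_growth) ** years == factor for years.
    """
    return math.log(factor) / math.log(1.0 + annual_growth)

# Assumption: 2% per year, compounded, as a stand-in for long-run productivity growth.
compound_years = years_to_multiply(2.0, 0.02)

# For comparison: simple (non-compounding) growth needs (factor - 1) / rate years.
simple_years = (2.0 - 1.0) / 0.02

print(f"compound: {compound_years:.1f} years, simple: {simple_years:.0f} years")
```

Either way, a one-year doubling would be decades of historical gains at once, which is the point being made.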

csomar 49 minutes ago | | |

Was there a new product released by Meta that we are not aware of? The last thing I read about was the Instagram account take-over AI-bug.

wonderwonder 47 minutes ago |

I have never worked there and I am likely very unqualified to ever work there and Zuck has more money than I could dream of so take my comment with that in mind.

Meta sounds like a cluster-F of a place to work. Massive reorgs around wild ideas like the metaverse and everything AI all the time. Employees terrified of being fired. Incentivizing token spending and then cutting it off. While the overall company may be fine, the dev department sounds rudderless and absolutely miserable.

whalesalad 1 hour ago |

Clearly no one is using Meta’s customer-facing AI products. Why aren’t they using their own GPU/compute for development?

wmf 1 hour ago | |

Because Muse isn't good enough and why use Muse if they'll let you use Opus for free?

gordon_freeman 1 hour ago | |

That is a fair point. The contrast between Meta and Apple could not be bigger here. Apple has billions of devices and yet they decided to use 3rd party models from OpenAI and later Google to build their AI features rather than building foundational models in house. Yet Meta did the opposite: they built models (spending billions of $$$ and firing 10% of the company) for billions of users who would rather not use Meta AI features.

smrtinsert 1 hour ago |

That is insane. I'm sure companies will learn the absolute wrong lesson from this, and attempt to centralize and kneecap token usage.

SpicyLemonZest 1 hour ago | |

As many companies do with all their budgets, down to the trivial and clearly positive EV cost of free coffee. So it goes, cost controls are hard and necessarily imprecise.

downrightmike 1 hour ago | |

Tokens are less valuable than the eyeball metric of the Dotcom era. At least the eyeballs were real then.

I'd argue most of the AI value is related to how 'Dead' the internet is.

4yfr 1 hour ago | | |

This talk of tokens is wasteful.

Ultimately the spend on tokens has to benefit the firm financially or it won’t continue spending on it.

conartist6 1 hour ago |

I don't understand though. How will all the AI users replace all the non-AI users if they can't spend money that isn't theirs to win by default?

_heimdall 1 hour ago | |

Don't worry, once we achieve post-scarcity they will have more tokens than they could ever dream of spending.

downrightmike 1 hour ago | |

How soon until this becomes part of the "no one wants to work anymore" argument?