---
title: "From Token-maxxing To Token-panic: Citrini Warns AI Goldilocks Narrative Hitting A Wall"
type: "News"
locale: "en"
url: "https://longbridge.com/en/news/289342597.md"
description: "Citrini Research warns the AI 'goldilocks' narrative is hitting a wall as token spend peaks and costs skyrocket. Major tech leaders like Uber, Anthropic, and Microsoft report surging AI expenses, with companies burning through budgets rapidly. Providers are shifting to usage-based pricing, ending subsidized models. This transition from 'token-maxxing' to 'token-panic' highlights growing corporate pushback against high operational costs as monetization becomes urgent."
datetime: "2026-06-10T14:20:40.000Z"
locales:
  - [zh-CN](https://longbridge.com/zh-CN/news/289342597.md)
  - [en](https://longbridge.com/en/news/289342597.md)
  - [zh-HK](https://longbridge.com/zh-HK/news/289342597.md)
---

# From Token-maxxing To Token-panic: Citrini Warns AI Goldilocks Narrative Hitting A Wall

When the world and their pet rabbit were buying the hype and extrapolating trends to infinity and beyond, we dared to highlight a few 'economic' realities of the new 'tokenomics'.

> _From Singularity To Tokenomics: **The AI Narrative Just Hit A Serious Snag**_
>
> _Was Amazon's Tokenmaxxing Fiasco Behind **Claude's $500M Mystery Bill?**_
>
> _From Singularity To Tokenomics, Part II: **The Subsidy Just Ran Out** \- And GitHub Users Went Splat_

This morning we got confirmation of this AI reality questioning from none other than Goldman Sachs Partner, Rich Privorotsky, who highlighted that **Token Spend had 'peaked'**...
And now, Citrini Research - _who infamously issued a less-than-utopian view of the world under AI back in March_ - has written a follow-up on the status quo of the AI ecosystem, noting that **in just weeks we’ve gone from _tokenmaxxing_ to _tokenpanic_.**

In March, we and many others were writing about the astounding growth in token consumption driven by the release of agents and more intensive models. This was enough to send the infrastructure trade sharply higher – the market value of the semiconductor industry doubled in two months.

**But that goldilocks narrative is beginning to hit a wall.** The corollary of explosive token usage is explosive cost to customers, which is coming just as the US labs and hyperscalers are turning up the dial on monetization.

The public story is increasingly turning to corporate pushback. The first real signs of this shift were from the much-discussed report of Uber burning through its entire AI budget in just four months. Then there was the anonymous report of a $500 million oopsie.

In the past week, the idea has turned into a media avalanche.

According to The Economist’s reporting, Anthropic’s ARR has increased 5x since the start of the year, reaching $45 billion in May. **Great for the lab, but it also means the “AI Opex” line item on P&Ls is going through the roof**.

The issue is not just Anthropic. Sam Altman also confirmed that _all of a sudden_ cost is a huge issue (and acknowledged the virality of the idea).

> _“**Probably the second biggest theme is just around cost.** People are really saying, it’s kind of become a meme now, but, “My company spent my entire 2026 budget in Q1. Can you make this more efficient?” We are continuing to push on that more with models. I think we’ll have a lot of ways we can help people get more value for less spend, but that went from, at the beginning of this year, an issue that never came up.
I know. **People were totally happy with the amount they were spending, to all of a sudden, a huge issue.**”_

Microsoft’s AI Chief added to the unflattery this week after cancelling Claude Code licenses in May.

> _“Anthropic is extremely expensive, and I think many people are urgently looking for alternatives”_

This cost concern didn’t just come out of nowhere.

**First, agents and more advanced reasoning models use orders of magnitude more tokens.** Corporates have widely distributed these tools and encouraged their use just as the average user was gaining the ability to casually run up enormous bills.

**Second, prices for frontier models are increasing as providers are flipping to usage models and preparing for public market debuts.** In a unified front, OpenAI, Anthropic, Microsoft, and Google have all implemented pricing shifts towards usage/tokens, as they simply can’t afford to endlessly subsidize their products for power users.

- April 2: OpenAI changed Codex pricing to align with API token usage instead of per-message pricing
- May 19: Google changed Gemini subscriptions from “daily prompt limits” to a “compute-used” model
- June 1: Microsoft’s GitHub Copilot transitioned to usage-based billing

**And what does a rate sheet really mean if you have no idea what your usage burns in practice?** Claude’s Opus 4.7 & 4.8 have the same “list price” as prior versions, but use a “new tokenizer” that may use up to 35% more tokens for the same fixed text.

**Is this an existential problem or just the VC playbook at unimaginable scale?** Subsidize demand, gain market share and lock-in, then monetize. After all, companies are spending a trillion in capex to make trillions in revenue, right?

**Well, either way, we’ve reached _Monetization_, and maybe not by choice. As fast as lab revenue is growing, the fundraising has grown even faster.** The money going towards building and running AI has exploded.
The deepest pockets in the world – hyperscaler cash flow, venture capital, sovereign wealth, public credit, private credit, public equity – are footing most of the bill. Eventually, customers have to start picking up the tab.

**Free-AI is ending. Tokenomics is beginning.**

What happens when underlying costs of compute become more transparent and directly traceable to outcomes? The ROI debate is about to be answered in real time, across millions of users and use cases.

For the median user, maybe not a whole lot changes. But science projects, freewheeling agents, and curiosities will either get cut or offloaded to open source models. Companies will restrict AI functionality and invest in oversight and observability. Budget constraints will pit AI spend against headcounts. Providers will become more competitive on pricing and will begin to optimize physical and digital architecture for efficiencies.

**In many (most) situations, good enough will do.** The cost of running open-source, discount, or mini models is going down while their capabilities only improve. This week saw another batch of open source models, like Nvidia’s latest Nemotron family, which includes advanced general-purpose models as well as highly efficient, compact versions optimized for local deployment and specialized agentic uses. As the frontier continues to advance, inference costs drop precipitously for a fixed level of intelligence. Why rent a Ferrari when a Vespa does the trick?

Of course, frontier models with highly specialized functions can continue to command an intense premium, but will serve a smaller segment of the market. A top lawyer can still bill at thousands per hour, even if millions of other workers are making minimum wage.

But even across the high end, the gap between US and Chinese offerings is worth noting. Qwen 3.7 and DeepSeek V4 are still behind Opus 4.8 and GPT 5.5 in terms of benchmarks, but they are 10x to 25x cheaper.
Since releasing V4 Pro and V4 Flash in April, DeepSeek has shot past Anthropic to the top of the charts on OpenRouter in terms of tokens processed.

Meanwhile, Cursor, one of the most used coding agents, released its new model, which was post-trained on compute provided by xAI after their $10 billion deal. The base model is a different Chinese open source model by Moonshot, and it was trained on data Cursor gets from its customers. The results are even stronger than DeepSeek’s: it is comparable to Opus 4.7 and GPT 5.5 at 10x lower cost per task and is one of the fastest frontier models.

There are other obvious “considerations” for large US enterprises that may prevent a mass exodus to Chinese alternatives. Plus, greater integration into workflows adds to lock-in. But there is a growing trend of application layer companies that will continue to post-train on open source base models for specialized workflows like coding and legal.

_**But what does this mean for the AI trade?**_

First, to be clear, revenues for labs and hyperscalers are going to grow. Token usage for top Anthropic models continues to go higher. Regardless of the pushback, frontier models can certainly create meaningful value, especially in high-stakes fields like tech and finance, and there are still plenty of levers to pull in the monetization phase. The entire point is for them to start making money. Likewise, this won’t fix near-term compute constraints.

**But we do think that cost and efficiency only become more important as the bills get bigger. Themes of local inference, miniaturization, smart routing, observability, price competition, and efficient model architecture will grow.
Competitive pressures and price competition are likely to stay.**

_Subscribers can read the rest of Citrini's note here..._