Close Menu
  • Home
  • AI
  • Art & Style
  • Economy
  • Entertainment
  • International
  • Market
  • Opinion
  • Politics
  • Sports
  • Trump
  • US
  • World
What's Hot

UK to ban social media for under-16s to ‘give children back their childhood’

June 15, 2026

SpaceX rises 6% in premarket after record debut

June 15, 2026

108th day of Iran war: Iran and US reach interim agreement to end conflict | Conflict News

June 15, 2026
Facebook X (Twitter) Instagram
Smart Breaking News on AI, Business, Politics & Global Trends | WhistleBuzz
Facebook X (Twitter) Instagram
  • Home
  • AI
  • Art & Style
  • Economy
  • Entertainment
  • International
  • Market
  • Opinion
  • Politics
  • Sports
  • Trump
  • US
  • World
Smart Breaking News on AI, Business, Politics & Global Trends | WhistleBuzz
Home » Can tech companies love cheap AI models?
AI

Can tech companies love cheap AI models?

Editor-In-ChiefBy Editor-In-ChiefJune 9, 2026No Comments4 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Telegram Email Copy Link
Follow Us
Google News Flipboard
Share
Facebook Twitter LinkedIn Pinterest Email


The AI ​​boom is built on the basic premise that bigger models are more powerful, and that the most powerful models will win. Now, the industry is learning what happens when that assumption begins to crumble.

Rising costs are already causing users to look back at smaller, cheaper models. This cost-conscious model shopping is new and it is unclear what impact it will have on the industry, but it is likely to be significant.

One of the predictions best described by Coinbase co-founder Brian Armstrong is that the majority of tasks will move to cheaper models.

“The demand for intelligence is nearly limitless, but 80% of workloads will be running on models that are 99% cheaper within 12 to 18 months,” Armstrong wrote in X. “20% of the workload continues to run on the latest generation models where maximizing IQ is key.”

It’s hard to overstate how big a change it will be for the AI ​​industry if Armstrong’s predictions come true.

Until now, most AI companies have competed on quality, which has meant defaulting to the most advanced models available. Being able to do these same jobs with cheaper models without impacting quality would represent a major shift in the economics of AI. And importantly, much of that savings will come out of the pockets of big labs, dealing a financial blow to OpenAI and Anthropic as they prepare for their IPOs.

This could lead to significant changes in the industry, and at the heart of it all is one fundamental question: “Are companies ready to switch to smaller models?”

Initial testing suggests that if the system is placed correctly, cheaper models can be used without sacrificing quality. In recent testing with legal AI tool Harvey, the company was able to reduce inference costs by 3x without reducing quality. This testing was conducted in partnership with the inference platform Fireworks AI, combining Claude Opus with Fireworks’ GLM 5.1, moving to Opus for the most intensive tasks. The result was a significant reduction in load in terms of server time and overall cost.

“Quality is paramount and always has been in legal affairs,” Harvey co-founder Gabe Pereyra told TechCrunch, referring to his startup’s AI legal services offering. “But the definition of quality has evolved from simply using the most powerful model for everything to using the best model that gets the right answer most efficiently.”

This trend is often framed in terms of the big labs and China model, or the promiscuous model, but that misses the bigger point. The real difference is not between proprietary and open models. It is between a large model and a small model. You can save money by switching from GPT-5.5 to DeepSeek’s V4 flash, but switching to GPT-5.4-mini works just as well.

There is an active price competition between in-house inference from major laboratories and independently provided promiscuous models. When it comes to the larger question of small vs. large, it doesn’t really matter which kind of small model wins.

All of this may seem obvious, and of course you shouldn’t use more compute than necessary, but this goes against the scaling-first approach that has dominated the industry to date. Inspired by this bitter lesson, research institutions have worked hard to train the most computationally intensive models possible, pushing the frontiers of what AI models can do. With prices heavily subsidized by investors, customers had no reason to choose anything but the most advanced options.

Users are facing cost pressure for the first time due to rising token prices and slowing subsidies. It remains to be seen whether new cost pressures will actually drive enterprise users to smaller models. You can also easily save money by making fewer calls, using less context, or simply giving up on the least promising deployments.

However, if it turns out that most deployments can be performed just as well with smaller models, it could have serious implications for growing inference demands and raise new questions about how to justify the cost of training frontier models.

If you buy through links in our articles, we may earn a small commission. This does not affect editorial independence.



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Editor-In-Chief
  • Website

Related Posts

Wave of AI layoffs is becoming a powder keg

June 15, 2026

As AI companies race to go public, who else will get in on the action?

June 14, 2026

India debates future of AI as Anthropic suspends access to new models

June 13, 2026
Add A Comment

Comments are closed.

News

108th day of Iran war: Iran and US reach interim agreement to end conflict | Conflict News

By Editor-In-ChiefJune 15, 2026

explainerUS President Donald Trump and Iranian leaders announced a deal had been reached to end…

Trump allies welcome Iran deal announcement as Democrats seek clarity | U.S.-Israel war on Iran News

June 14, 2026

US and Iran announce ceasefire agreement, Trump announces reopening of Strait of Hormuz | Conflict News

June 14, 2026
Top Trending

Wave of AI layoffs is becoming a powder keg

By Editor-In-ChiefJune 15, 2026

Something strange is happening in the world of technology right now. Companies…

As AI companies race to go public, who else will get in on the action?

By Editor-In-ChiefJune 14, 2026

SpaceX went public this week in the largest IPO in history, making…

India debates future of AI as Anthropic suspends access to new models

By Editor-In-ChiefJune 13, 2026

Anthropic’s sudden move to suspend access to its latest AI models at…

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Welcome to WhistleBuzz.com (“we,” “our,” or “us”). Your privacy is important to us. This Privacy Policy explains how we collect, use, disclose, and safeguard your information when you visit our website https://whistlebuzz.com/ (the “Site”). Please read this policy carefully to understand our views and practices regarding your personal data and how we will treat it.

Facebook X (Twitter) Instagram Pinterest YouTube

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Facebook X (Twitter) Instagram Pinterest
  • Home
  • Advertise With Us
  • Contact US
  • DMCA Policy
  • Privacy Policy
  • Terms & Conditions
  • About US
© 2026 whistlebuzz. Designed by whistlebuzz.

Type above and press Enter to search. Press Esc to cancel.