Close Menu
  • Home
  • AI
  • Art & Style
  • Economy
  • Entertainment
  • International
  • Market
  • Opinion
  • Politics
  • Sports
  • Trump
  • US
  • World
What's Hot

Microsoft may be in a slump. But here’s why it’s wrong to give up now

March 23, 2026

Microsoft may be in a slump. But here’s why it’s wrong to give up now

March 23, 2026

Trump’s next Florida representative could be Democrat Emily Gregory.

March 23, 2026
Facebook X (Twitter) Instagram
WhistleBuzz – Smart News on AI, Business, Politics & Global Trends
Facebook X (Twitter) Instagram
  • Home
  • AI
  • Art & Style
  • Economy
  • Entertainment
  • International
  • Market
  • Opinion
  • Politics
  • Sports
  • Trump
  • US
  • World
WhistleBuzz – Smart News on AI, Business, Politics & Global Trends
Home » Startup Gimlet Labs solves AI inference bottlenecks in a surprisingly elegant way
AI

Startup Gimlet Labs solves AI inference bottlenecks in a surprisingly elegant way

Editor-In-ChiefBy Editor-In-ChiefMarch 23, 2026No Comments4 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Telegram Email Copy Link
Follow Us
Google News Flipboard
Share
Facebook Twitter LinkedIn Pinterest Email


Zayn Asghar, an adjunct professor at Stanford University and successful founder, has raised $80 million in Series A for a startup that cleverly solves the bottleneck problem of AI inference. The round was led by Menlo Ventures.

A company called Gimlet Labs has developed the first and only “multi-silicon inference cloud,” software that allows AI workloads to run on different types of hardware simultaneously. You can split the work of your AI apps across both traditional CPUs and AI-tuned GPUs, as well as high-memory systems.

“We basically run into different hardware that is available,” Asghar told TechCrunch.

A single agent may chain multiple steps, each of which “requires different hardware: inference is compute-dependent, decoding is memory-dependent, and tool invocation is network-dependent,” Menlo lead investor Tim Tully wrote in a blog post about the funding.

There isn’t a chip to do it all yet, but as new hardware is rolled out and aging GPUs are redeployed, “a multi-silicon fleet is ready. We’re just missing the software layer to make it work.” That’s what Tully believes the Gimlet Institute will deliver.

If current trends in deploy-more-computing continue, McKinsey estimates that data center spending will reach nearly $7 trillion by 2030. Asghar said the app only uses the existing hardware already deployed “between 15 and 30 percent” of the time.

“The other way to think about this is that you’re wasting hundreds of billions of dollars just by sitting idle resources,” he said. “Our goal was essentially to figure out how to make AI workloads 10x more efficient today than they have ever been before.”

tech crunch event

San Francisco, California
|
October 13-15, 2026

So he and co-founders Michelle Nguyen, Omid Azizi, and Natalie Serrino set out to build orchestration software that could split agent workloads and distribute them across all types of hardware simultaneously.

Gimlet Labs claims that it can reliably speed up AI inference by 3x to 10x for the same cost and power. Gimlet says the underlying model can also be sliced ​​to run across different architectures, using the best chip for each part of the model.

The company already has partnerships with chip makers such as NVIDIA, AMD, Intel, ARM, Cerebras, and d-Matrix.

Gimlet’s products, provided as software or through APIs to our proprietary Gimlet Cloud, are not intended for general AI app developers. For the largest AI model labs and data centers.

The company went public in October and says it has achieved eight-figure revenues (or at least $10 million) since its inception. Asghar said the customer base has more than doubled in the past four months and now includes major model manufacturers and very large cloud computing companies, but declined to name them.

The co-founders previously worked together at Pixie, a startup that developed open source observability tools for Kubernetes. Pixie was acquired by New Relic in 2020, just two months after launching in a $9 million Series A led by Benchmark. (Pixie’s technology is now part of the open source organization that oversees Kubernetes.)

After Mr. Asghar met Mr. Talley by chance about a year ago and received angel investment from Stanford University professors, venture capitalists started calling. After the start, a term sheet arrived on Asgar’s desk. When VCs heard that Asghar was considering an offer, the round quickly maxed out because “we had quite a lot of money,” he said.

With the previous seed, the startup has now raised a total of $92 million from a number of angels, including Sequoia’s Bill Coughran, Stanford professor Nick McKeown, former CEO of VMware Raghu Raghuram, and Intel CEO Lip-Bu Tan. The company currently employs 30 people.

Other investors include Factory, which led the seed, Eclipse Ventures, Prosperity7, and Triatomic.



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Editor-In-Chief
  • Website

Related Posts

Littlebird raises $11 million for AI-assisted ‘recall’ tool to read computer screens

March 23, 2026

Apple sets WWDC 2026 date for June, teases ‘advances in AI’

March 23, 2026

Vibe coding startup Lovable is exploring acquisition

March 23, 2026
Add A Comment

Comments are closed.

News

Iran rejects any talks with US after President Trump insists on ‘productive’ talks | US and Israel’s war on Iran News

By Editor-In-ChiefMarch 23, 2026

Iran’s parliament speaker says the US president is using the idea of ​​talks to “get…

Energy, water and bonds: What will be Iran’s target if President Trump attacks power plants? |US-Israel war against Iran News

March 23, 2026

President Trump sends ICE officers to U.S. airports amid staffing issues and delays | Donald Trump News

March 23, 2026
Top Trending

Littlebird raises $11 million for AI-assisted ‘recall’ tool to read computer screens

By Editor-In-ChiefMarch 23, 2026

There has been a lot of discussion about building context for AI…

Startup Gimlet Labs solves AI inference bottlenecks in a surprisingly elegant way

By Editor-In-ChiefMarch 23, 2026

Zayn Asghar, an adjunct professor at Stanford University and successful founder, has…

Apple sets WWDC 2026 date for June, teases ‘advances in AI’

By Editor-In-ChiefMarch 23, 2026

Apple’s next Worldwide Developers Conference will be held online and at its…

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Welcome to WhistleBuzz.com (“we,” “our,” or “us”). Your privacy is important to us. This Privacy Policy explains how we collect, use, disclose, and safeguard your information when you visit our website https://whistlebuzz.com/ (the “Site”). Please read this policy carefully to understand our views and practices regarding your personal data and how we will treat it.

Facebook X (Twitter) Instagram Pinterest YouTube

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Facebook X (Twitter) Instagram Pinterest
  • Home
  • Advertise With Us
  • Contact US
  • DMCA Policy
  • Privacy Policy
  • Terms & Conditions
  • About US
© 2026 whistlebuzz. Designed by whistlebuzz.

Type above and press Enter to search. Press Esc to cancel.