← 回總覽

AI 推理配给与基础设施受限的时代即将来临

📅 2026-03-14 07:24 Tomasz Tunguz 人工智能 2 分鐘 1595 字 評分: 84
AI 基础设施 推理成本 GPU 短缺 数据中心 AI 战略
📌 一句话摘要 托马斯·通古兹分析了 AI 基础设施(从 GPU 到电力)日益加剧的连锁短缺,并预测将转向推理配给、成本上涨和模型优化。 📝 详细摘要 这篇推文综合了主要科技公司 CEO(OpenAI、甲骨文、微软、Alphabet 和英特尔)的声明,旨在强调 AI 基础设施领域持续且日益深化的危机。除了 GPU 短缺,当前的限制还包括电力、土地和数据中心容量,而缓解可能要到 2028 年才能到来。通古兹认为,这将导致补贴式、按需推理时代的终结,迫使企业对高端模型访问进行配给,优先处理特定工作负载,并转向使用更小、更优化或开源的模型。 📊 文章信息 AI 评分:84 来源:Tomasz

"We've been growing a lot and are out of GPUs."

Sam Altman, OpenAI CEO

Mar 2025 "We are still waving off customers or scheduling them out into the future. This is a situation that we have not seen in our history."

Safra Catz, Oracle CEO

Oct 2025

"You may actually have a bunch of chips sitting in inventory that I can't plug in. I don't have warm shells to plug into."

Satya Nadella, Microsoft CEO

Feb 2026

"What keeps us up at night… The top question is definitely around capacity. All constraints — be it power, land, supply chain constraints — how do you ramp up to meet this extraordinary demand?"

Sundar Pichai, Alphabet CEO

Feb 2026

"There's no relief as far as I know. No relief until 2028."

Lip-Bu Tan, Intel CEO

What happens when your AI doesn’t answer?

Everything is in short supply. It’s no longer just GPUs. It’s power. Data centers. Memory. CPUs.

If there’s no relief for six more quarters, perhaps it’s time to plan for a world where inference isn’t freely available on-demand.

Inference prices, which have been static, will rise. Subsidies will be harder to justify.

Enterprises will need to rationalize workloads, deciding which teams receive state-of-the-art models & which don’t. Not every CRM update requires a trillion-parameter frontier model.

Inference rationing normalizes. Marketing receives this much, sales receives that much, software engineers probably receive a lot more.

Constraint will be the mother of invention. Companies will optimize what they have, adopt open source where they can, and likely move to smaller models for many workloads.

查看原文 → 發佈: 2026-03-14 07:24:50 收錄: 2026-03-14 10:00:25

🤖 問 AI

針對這篇文章提問,AI 會根據文章內容回答。按 Ctrl+Enter 送出。