🚨 Every AI company wants to lock you into their API. Someone just open-sourced the master key that unlocks all of them. One interface. 100+ LLMs.
It's called LiteLLM.
And it just hit 1 billion requests processed.
GPT. Claude. Gemini. Llama. Mistral. Cohere. Plus providers like Azure, Bedrock, and Groq. 100+ models. One line of code. Same format. Same output. Swap any model by changing a single string.
No rewriting your app. No learning new APIs. No vendor lock-in. Ever.
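Here's what the one-string swap looks like in practice. A minimal sketch, assuming `pip install litellm` and a provider key (`OPENAI_API_KEY` or `ANTHROPIC_API_KEY`) in your environment; the model names are illustrative, check LiteLLM's provider list for exact strings:

```python
import os

# One request format for every provider: the OpenAI chat-completion shape.
messages = [{"role": "user", "content": "Say hello in five words."}]

# Swapping providers is just a different model string -- nothing else changes.
model = "gpt-4o" if os.getenv("OPENAI_API_KEY") else "anthropic/claude-3-5-sonnet-20240620"

# Guarded so the sketch is safe to run without keys configured.
if os.getenv("OPENAI_API_KEY") or os.getenv("ANTHROPIC_API_KEY"):
    from litellm import completion  # pip install litellm

    resp = completion(model=model, messages=messages)
    # Every provider's answer comes back in the same OpenAI response shape:
    print(resp.choices[0].message.content)
```

Same call, same response object, whether the string says `gpt-4o` or `anthropic/...` — that's the whole lock-in escape hatch.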
Here's what this thing does:
→ Call 100+ LLMs using the exact OpenAI format — every model responds the same way
→ Built-in retry and fallback — if OpenAI goes down, it auto-switches to Claude or Gemini
→ Cost tracking per user, per team, per project — know exactly what you're spending
→ Rate limiting and budget caps — set a $500/month limit per team and it enforces it automatically
→ Load balancing across multiple deployments — spread traffic across Azure, OpenAI, and Bedrock
→ Virtual API keys for every team member — no sharing master keys
→ Admin dashboard UI for monitoring everything
→ 8ms P95 latency at 1,000 requests per second
→ Guardrails, PII redaction, and caching built in
→ Works as a Python SDK or a self-hosted proxy gateway
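Self-hosted, the proxy gateway is driven by a single YAML config. Here's a sketch of what load balancing plus fallbacks and a budget cap could look like — field names follow LiteLLM's proxy config conventions, but treat the exact keys, deployment names, and model strings as assumptions to verify against the docs:

```yaml
model_list:                        # two deployments behind one public name
  - model_name: gpt-4o             # what clients request
    litellm_params:
      model: azure/my-gpt4o        # hypothetical Azure deployment name
      api_base: https://example.openai.azure.com/
  - model_name: gpt-4o
    litellm_params:
      model: openai/gpt-4o         # traffic is balanced across both entries

litellm_settings:
  fallbacks: [{"gpt-4o": ["claude-3-5-sonnet-20240620"]}]  # auto-switch on failure
  max_budget: 500                  # dollars per window; enforced automatically
  budget_duration: 30d             # budget reset window

general_settings:
  master_key: sk-replace-me        # admin key; mint per-team virtual keys from this
```

Point your OpenAI client at the proxy's URL with a virtual key, and routing, fallback, and spend enforcement all happen server-side.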
Here's the wildest part:
Enterprise AI gateway companies charge $50K-$200K/year for exactly this. Centralized LLM access. Cost controls. Key management. Usage monitoring. Load balancing.
LiteLLM does all of it. Self-hosted. Free. Backed by Y Combinator. Used by teams processing over 1 billion API requests.
240 million Docker pulls. 10.4K GitHub stars. 920 forks. 7,300 commits. MIT License.
100% Open Source.
(Link in the comments)