Local AI

LiteLLM

Call Anthropic, OpenAI, Google and 100+ models through one consistent API, with cost tracking and fallbacks.

View the repo · BerriAI/litellm ↗
Who it is for

Anyone building on more than one model who wants to avoid lock-in and track spend.

Install it
pip install litellm
# run the proxy:
litellm --model claude-3-5-sonnet
Before production

Lets you swap models without rewriting code. Set per-key budgets so costs do not surprise you.

Where Blash AI comes in

We stand up the gateway, set budgets and fallbacks, and route each workflow to the right model for the job.

Run it, then wire it in

When you want this running on your real stack, that is the engagement

Book an AI audit
More from the library