Skill Detail

LiteLLM Unified LLM Gateway and Proxy Server

LiteLLM is an open-source Python SDK and proxy server that provides a unified OpenAI-compatible interface to call 100+ LLM APIs including OpenAI, Anthropic, Azure, Bedrock, and more. It includes cost tracking, guardrails, load balancing, and virtual key management for production deployments.

Integrations & ConnectorsCustom Agents

Integrations & Connectors Custom Agents Security Reviewed

Tool match: litellm ⭐ 41.8k GitHub stars

INSTALL WITH ANY AGENT

npx skills add agentskillexchange/skills --skill litellm-unified-llm-gateway-proxy Copy

Works best when you want a reusable capability, not another fragile one-off prompt.

View source

At a glance

Last updated

Jun 3, 2026

Quick brief

LiteLLM is a Python SDK and AI Gateway proxy server developed by BerriAI (YC-backed) that solves the multi-provider LLM integration problem. Available at github.com/BerriAI/litellm with active GitHub adoption, it lets developers call over 100 different LLM APIs through a single, unified OpenAI-compatible interface.

How it works

What this skill actually does

The fundamental value of LiteLLM is abstraction. Instead of writing separate integration code for OpenAI, Anthropic, Google VertexAI, AWS Bedrock, Azure, Cohere, HuggingFace, and dozens of other providers, developers write one completion call and swap the model string. LiteLLM handles the translation between provider-specific API formats, authentication methods, and response schemas automatically.

The proxy server component turns LiteLLM into a production-grade AI gateway. Teams deploy it as a centralized endpoint that all their applications call. It provides virtual API keys with per-key budget limits and rate limiting, automatic load balancing across multiple model deployments, cost tracking and spend analytics per key/user/team, guardrails and content filtering, and detailed request/response logging. This makes it practical for organizations running multiple AI-powered applications that need centralized control over LLM usage and spending.

A skill wrapping LiteLLM gives an AI agent the ability to route requests to the optimal model for each task. The agent could use cheaper models for simple tasks and premium models for complex reasoning, all through the same interface. The proxy supports /chat/completions, /embeddings, /images, /audio, /batches, /rerank, and the new /a2a agent-to-agent protocol.

Installation requires just pip install litellm for the SDK or pip install litellm[proxy] for the gateway server. LiteLLM is MIT-licensed with active daily releases and comprehensive documentation at docs.litellm.ai.

Best fit

When to reach for it

Best when the job fits Integrations & Connectors.
Works naturally with Custom Agents setups.

Trust & provenance

Why this listing is credible

Built around the litellm toolchain.
Trust status: Security Reviewed.
41.8k GitHub stars on the linked upstream source.
Last updated Jun 3, 2026.

View source ↗