One place to call all your agents: OpenCode, Hermes, Claude Managed Agents, Cursor Agents API, DeepAgents.
Updated Jun 14, 2026 - Rust
High-performance LLM Gateway built in Go - OpenAI compatible proxy with multi-provider support, adaptive routing, and enterprise features
Comprehensive, scalable ML inference architecture using Amazon EKS, leveraging Graviton processors for cost-effective CPU-based inference and GPU instances for accelerated inference. Guidance provides a complete end-to-end platform for deploying LLMs with agentic AI capabilities, including RAG and MCP
Connect any LLM-powered client app, such as a coding agent, to any supported inference backend/model.
This repo presents resilience patterns for scaling inference for Generative AI workloads on AWS: Bedrock cross-Region inference, AWS account sharding, and intelligent routing with LLM gateways.
A lightweight, open source UI for interacting with locally hosted LLMs featuring realtime voice, RAG, and streaming APIs.
A production-grade, air-gapped Sovereign Data Fabric for Enterprise RAG. Features LiteLLM gateway governance, Ollama local compute, Qdrant vector storage, and Langfuse observability.
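LLMCallGateway is a professional LLM API gateway service built on LiteLLM. It unifies the request format of all models (including non-OpenAI models) into the OpenAI format and provides detailed log tracing and performance monitoring, helping developers see the details and cost of their interactions with downstream LLM APIs.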
Multi-agent chain-of-thought system that turns a merchant pitch into a ready-to-publish promotional campaign.
Self-hosted platform for running coding agents (Claude Code, Codex, Hermes) in isolated sandboxes with vault proxy.
Master’s thesis developed as part of the Master’s degree in Telecommunications Engineering.
A custom LiteLLM proxy + UI Docker Compose stack (Postgres + Valkey), with bind-mounted config and data for host-editable persistence.
Dockerized LiteLLM gateway for Cursor IDE. Exposes /v1 with chat, coder, and vision; routes OpenRouter models with latency-based routing, Redis caching, Postgres, and Cloudflare tunnel ingress.
Reference Docker Compose stacks, cloud deploy recipes, and SDK integration samples for the DVARA LLM Gateway