They sound like the same thing. They are not. Here is the difference, when each matters, and which one your team actually needs.
An LLM proxy forwards requests from your app to LLM providers. It sits in the request path and can modify, log, cache, or reroute traffic in real time. Think of it as a smart relay.
An LLM gateway is a proxy plus a control plane. It adds authentication, rate limiting, billing, multi-tenant isolation, and policy enforcement on top of the basic routing. Think of it as the proxy plus everything around it.
Every gateway contains a proxy. Not every proxy is a gateway.
You have one team, one app, and one or two providers. You want to swap between OpenAI and Anthropic without changing application code. You want cost tracking and maybe caching. You do not need user management, billing, or multi-tenant isolation.
A proxy-only setup gets you model aliasing, provider failover, request logging, and basic cost tracking with minimal overhead. Stockyard calls this proxy-only mode.
Multiple teams share the same LLM infrastructure. You need per-team API keys, spend caps, rate limits, and audit trails. You are building a platform where internal or external users consume LLM capabilities through your API.
A gateway gives you the proxy layer plus identity, authorization, metering, and policy. Stockyard ships both modes in the same binary. Start with proxy-only, toggle on gateway features when you need them.
| Capability | LLM Proxy | LLM Gateway |
|---|---|---|
| Request routing | ✓ | ✓ |
| Provider failover | ✓ | ✓ |
| Model aliasing | ✓ | ✓ |
| Cost tracking | ✓ | ✓ |
| Caching | ✓ | ✓ |
| Multi-tenant API keys | ✗ | ✓ |
| Per-user rate limits | ✗ | ✓ |
| Spend caps & billing | ✗ | ✓ |
| Audit trail | Basic | ✓ |
| PII redaction | Optional | ✓ |
Stockyard ships both in a single binary with embedded SQLite. There is no separate "gateway edition." You start with the proxy, and every gateway feature is a module you toggle on at runtime. No config file changes, no restart, no redeploy.
This means you do not have to predict your architecture up front. Start routing traffic through the proxy today. Add team isolation, spend caps, and audit logs when the need arises. The binary is the same either way.
Start with the proxy. Grow into the gateway. Same binary, same deploy.
Install in 5 minutes Learn about proxy-only mode