The Agent Gateway. Compress, route, observe.
Edgee sits between your agents and the LLM provider.
Same code, fewer tokens, lower bills.
How to use Edgee
Whether you’re using a coding agent or building an app, Edgee compresses your LLM traffic in minutes.
For coding agents
Start saving tokens in 1 minute
Install Edgee CLI and connect it to your coding agent. No code changes required.
- No code changes: works as a transparent proxy for your agent
- Instant savings: token compression kicks in on the first request
- Works with any agent: Claude Code, Codex, Cursor and more
Configure your coding agent
Connect Edgee to your AI coding assistant and start saving tokens in 1 minute.
curl -fsSL https://edgee.ai/install.sh | bashedgee launch claudeWhy Edgee Agent Gateway?
An edge intelligence layer for your coding agents
Edgee sits between your coding agents and LLM providers. It applies three pillars to every request, Compress (input + output), Route (with automatic fallback), and Observe (per session, per team), so you cut token costs and extend context windows without changing a line of application code.
Token compression
Layer 1 (Input) tool-result trimming and Layer 2 (Output) brevity. Cut tool-result payloads 60–90% at the edge. Semantically lossless for coding tasks. Same model output, fewer tokens billed.
Learn moreTeam Management
Get full visibility into how your team uses coding agents. Track cost per repo and PR, manage team seats, and keep your team unblocked with automatic OSS model fallback.
Learn moreBring Your Own Keys
Use Edgee’s keys for convenience, or plug in your own provider keys for billing control and custom models.
Learn moreObservability
Monitor latency, errors, usage, and cost per model, per app, and per environment.
Learn moreRetry & Fallback
When a provider request fails, Edgee automatically retries and falls back to the next available provider, transparently, without any changes to your code.
Learn moreThe vision behind Edgee
Every technological shift creates a new foundation: the web had bandwidth, the cloud had compute, and AI has tokens. In a world powered by models, intelligence has a cost: tokens flow through every interaction, decision, and response.
At Edgee, we believe intelligence should move efficiently, closer to users, intent, and action. It should be compressed, routed, and optimized so decisions happen instantly. Hear from Sacha, Edgee’s co-founder, on how AI scales by mastering how intelligence moves.