Skip to content
Mantis logo

Mantis

A self-hosted LLM gateway for routing, caching, guardrails, and observability across model providers.
One API Stable chat completions endpoint in front of multiple model targets.
AWS-native Deployable with Terraform, ECS, ElastiCache, Bedrock, and CloudWatch.
Policy driven Routing, retry, fallback, timeout, cooldown, and cache behavior live in config.

Start with the quick start to run or deploy the gateway, then read the case study for the project background and design decisions.