Overview
Version 0.27.0 - November 2025

One API. Every Provider. Through the Architectural Passage.

Zaguán is the architectural gateway that lets OpenAI-compatible clients connect to every major AI provider - no rewrites, no glue code. Point your SDK at a single endpoint, swap any `provider/model` identifier, and ship in minutes instead of burning days on one-off integrations.

November 2025 Release: 9 major features including OpenAI Realtime API, audio output, xAI stateful conversations, Anthropic prompt caching (90% cost savings), Google Gemini thinking, and our unique extra_body translation layer for accessing provider-specific features.

November 2025 Release Highlights

🎭

Realtime Voice API

WebSocket-based voice conversations with ultra-low latency

🔊

Audio Output

Generate spoken audio directly from chat completions

🔗

xAI Stateful Conversations

30-day server-side storage with encrypted thinking

💡

Prompt Caching

90% cost savings with Anthropic Claude caching

🧑‍💻

Gemini Thinking

See the model's reasoning process in real-time

🎯

Extra Body Translation

Access any provider-specific feature seamlessly

🌐

Virtual Models

Multi-provider routing with automatic failover and load balancing

What you get

  • Unified API surface: Use familiar OpenAI endpoints and request schemas to reach hundreds of models. Swap openai/gpt-4o-mini foranthropic/claude-3.5-sonnet - nothing else changes.
  • Rapid provider switching: Swap models or vendors by changing a single namespaced model identifier.
  • Managed infrastructure: Hosted routing layer with observability, authentication, and traffic controls built-in.

What we do not do

  • No hidden model tuning: Zaguán forwards your payloads as-is; custom prompting strategies stay under your control.
  • No automatic fallback: We expose consistent errors instead of silently retrying so you can implement the failover logic you prefer.
  • No proprietary lock-in: All endpoints follow documented standards so you can migrate at any time.

Platform highlights

  • Protocol compatibility: Native support for OpenAI-compatible clients with rock-solid request translation to every upstream provider we support.
  • Security posture: Incoming requests are authenticated with Zaguán API keys and proxied through dedicated provider integrations - your provider credentials never leave our environment.
  • Transparent pricing: 15 € / month for early adopters on the Founder's Plan, 39 € / month standard pricing once the first 200 people are onboarded.

Next steps

  1. Review the API guide to point your OpenAI SDKs at https://api.zaguanai.com/v1.
  2. Browse available Models and choose the namespaced identifier that fits your workload.
  3. Explore Virtual Models for multi-provider routing with automatic failover and load balancing.
  4. Request early access through the Founder's Plan CTA on the homepage to receive your API key.