How do I pay for OpenRouter?

OpenRouter uses pay-as-you-go credits. You top up your account (typically $10-$100) and draw down as you make API calls. Credit-card top-ups incur a ~5% Stripe processing fee on top of the model markup. Crypto payments avoid the Stripe fee.

OpenRouter vs direct API: cost, markup & latency comparison (May 2026)

Q: What is OpenRouter's exact markup percentage?

Approximately 5% on most major models as of May 2026. GPT-5, Claude Sonnet 4.6, Claude Opus 4.6, and Gemini 2.5 Pro all sit at roughly +5% over their direct provider rates. Open-source models hosted via Together, Fireworks, or DeepInfra vary more widely depending on host.

Q: When is OpenRouter actually worth the markup?

OpenRouter is worth the markup when you need to test multiple models without separate provider accounts, when you mix open-source and frontier models in one stack, or when consolidating team billing. For single-model production at scale, going direct is cheaper.

Q: Does OpenRouter add latency compared to direct API calls?

Yes, OpenRouter adds approximately 50-150ms of routing overhead per request versus calling the underlying provider directly. For interactive applications this is usually imperceptible; for high-throughput batch workloads or strict-latency production paths, the overhead compounds.

Q: Does OpenRouter log my prompts or training data?

By default OpenRouter does not log request content, though prompt logging can be enabled in user settings. Your prompts still pass through OpenRouter's infrastructure as a middleman before reaching the underlying provider. For HIPAA or strict-compliance workloads, the simpler data flow of going direct is usually preferred.

Q: Does OpenRouter support GPT-5 and Claude Sonnet 4.6?

Yes. Both GPT-5 and Claude Sonnet 4.6 are accessible through OpenRouter using model identifiers openai/gpt-5 and anthropic/claude-sonnet-4.6. Claude Opus 4.6, Gemini 2.5 Pro, and open-source alternatives like LLaMA 3.3 70B are also available through the same API.

Short answer: OpenRouter charges roughly a 5% markup over direct OpenAI and Anthropic API pricing, plus about a 5% Stripe fee on credit-card top-ups. It's worth the markup when you need multi-model access without juggling separate accounts. For single-model production workloads above $500/month, going direct is meaningfully cheaper.

At a glance: OpenRouter vs direct API (May 2026)

Factor	OpenRouter	Direct API
Cost markup	~5% over provider rate	0% (base rate)
Payment fee	~5% Stripe on top-ups (free with crypto)	None (direct billing)
Accounts needed	1 (OpenRouter)	1 per provider (OpenAI, Anthropic, etc.)
Latency overhead	+50-150ms routing	Direct (baseline)
Multi-model access	One key, hundreds of models	Separate setup per provider
Open-source models	Included (Together, Fireworks, DeepInfra)	Requires separate signups
Volume discounts	Usually not passed through	Available at enterprise tier
Data privacy	Extra middleman hop	Direct to provider
Best for	Exploration, multi-model, indie	Single-model production at scale

Is OpenRouter cheaper than going direct to OpenAI or Anthropic?

No. OpenRouter charges roughly a 5% markup over the underlying provider's per-token pricing, plus about a 5% Stripe fee on credit-card top-ups. For a $1,000/month workload on Claude Sonnet 4.6, that's about $50/month extra versus going direct to Anthropic.

What is OpenRouter's exact markup percentage?

Approximately 5% on most major models as of May 2026. The exact per-model breakdown:

Model	Direct (per 1M in/out)	OpenRouter (per 1M in/out)	Markup
GPT-5	$5.00 / $15.00	~$5.25 / $15.75	~5%
Claude Sonnet 4.6	$3.00 / $15.00	~$3.15 / $15.75	~5%
Claude Opus 4.6	$15.00 / $75.00	~$15.75 / $78.75	~5%
Gemini 2.5 Pro	$2.50 / $10.00	~$2.65 / $10.50	~5%
LLaMA 3.3 70B (open)	varies by host	~$0.59 / $0.79	varies

Pricing checked May 15, 2026. OpenRouter also charges a ~5% credit card fee on top-ups (Stripe processing). Crypto payments avoid this. Verify current rates at openrouter.ai/models.

Does OpenRouter add latency compared to direct API calls?

Yes — OpenRouter adds approximately 50-150ms of routing overhead per request versus calling the underlying provider directly. For interactive chat applications and one-off completions this is usually imperceptible. For high-throughput batch workloads, real-time voice applications, or strict-latency production paths, the overhead compounds. If you measure your API latency at p95 and find it sits around 1.2-1.5 seconds, OpenRouter typically adds enough to make that 1.3-1.7s.

When is OpenRouter actually worth the markup?

Multi-model experimentation

You're building something and want to test the same prompt against GPT-5, Claude Sonnet 4.6, Gemini, and a couple of open-source alternatives. Without OpenRouter, that's 4 separate accounts, 4 separate API keys, 4 separate billing relationships, 4 different SDKs. With OpenRouter, it's one key and a model name swap. The 5% premium pays for itself in account-management time alone.

Open-source + frontier in one place

If your stack uses LLaMA 3.3 (cheap, self-hosted-grade quality) for some calls and GPT-5 (expensive, smart) for others, OpenRouter consolidates that into one bill. Going direct means signing up with Together AI or Fireworks for the open models AND OpenAI for GPT-5 — three vendor relationships.

Team billing consolidation

For a small team, OpenRouter as a single billing relationship + per-user spend tracking via API key labels is meaningfully simpler than managing OpenAI org billing + Anthropic org billing + a Google Cloud project.

Avoiding wait lists / regional access

Direct API access has historically had geographic and capacity gates. OpenRouter, as a customer of all the providers, often has access on day one. If you're in a region OpenAI hasn't fully rolled out to, OpenRouter is sometimes the only path.

When going direct is the better call

Single-model production workloads

If your product runs 90% on Claude Sonnet 4.6 in production, the 5% markup adds up. On a $1,000/month Anthropic bill, OpenRouter costs you an extra $50/month for nothing you actually need. Sign up with Anthropic directly, save the markup.

Cost-optimized scale

At scale, every percentage point matters. A team doing $10K/month on API calls should not be giving away $500/month for routing convenience. Direct relationships also unlock volume pricing tiers that OpenRouter usually can't pass through.

Compliance-sensitive data

Going direct: your prompt → OpenAI/Anthropic → response. With OpenRouter: your prompt → OpenRouter → underlying provider → response. That's an extra hop. For HIPAA, financial services, or anything with a data residency requirement, the simpler path is usually mandatory.

Fine-tuning, batch API, and provider-specific features

OpenRouter exposes the standard chat completions interface. Provider-specific features — fine-tuning, batch API discounts, prompt caching with custom TTLs, vision-specific models, audio models like Whisper — often require going direct.

Does OpenRouter log my prompts or training data?

By default, OpenRouter does not log the content of requests. Prompt logging is an opt-in setting users can enable in their account. However, your prompts still pass through OpenRouter's infrastructure as a middleman before reaching the underlying provider. For most use cases this is acceptable. For HIPAA, financial services, or anything with strict compliance requirements, the simpler data flow of going direct is usually preferred — "we use OpenAI's API" is a sentence a security review accepts more easily than "we use OpenRouter, which uses OpenAI's API."

Does OpenRouter support GPT-5 and Claude Sonnet 4.6?

Yes. Both are accessible through OpenRouter:

openai/gpt-5 — GPT-5 via OpenAI
anthropic/claude-sonnet-4.6 — Claude Sonnet 4.6
anthropic/claude-opus-4.6 — Claude Opus 4.6
google/gemini-2.5-pro — Gemini 2.5 Pro
meta/llama-3.3-70b — LLaMA 3.3 70B (open source, hosted)

OpenRouter typically gets access to new frontier models within days of public availability, sometimes faster than the underlying provider's direct API in certain regions.

How to decide: OpenRouter or direct?

Are you exploring or shipping? Exploring → OpenRouter. Shipping production → direct, unless you need genuinely multi-model routing.
How many providers do you actually call? 1-2 → direct is simpler in the long run. 3+ → OpenRouter's consolidation pays.
Is data sensitivity a factor? Yes → direct, by default. The simpler data flow is worth the slight extra setup.
What's your scale? Under $200/month total spend → don't optimize, use whatever's easiest. Over $1K/month → the markup matters, go direct on your biggest providers.

The hybrid pattern (what most pros end up doing)

Direct OpenAI account with a $50/month cap for production GPT-5 / DALL-E / Whisper calls.
Direct Anthropic account with a $50/month cap for production Claude Sonnet 4.6 / Opus calls.
OpenRouter account funded with $20-50 in credit for testing and exploring new models.
When a new model becomes a production candidate, sign up direct, get the volume discount, retire the OpenRouter route.

This gives you the cost optimization of direct relationships for the calls that matter and the experimentation surface area of OpenRouter for the rest. Total overhead: 3 accounts, 3 keys, manageable.

Two specific cases worth flagging

The "free $1" testing pattern

OpenRouter occasionally offers $1 in free credits for new accounts. That's enough to run dozens of test calls across many models for free. If you're just trying to feel out which model handles your specific prompt best, the free credit is genuinely useful even if you never become a paying user.

OpenRouter for indie / hobbyist projects

If you're building a side project that might need different models for different features (chat with Claude, summarization with Haiku, image with DALL-E), and total monthly spend is under $30, OpenRouter is the easiest setup. The 5% premium on $30 is $1.50 — not worth the complexity of three vendor accounts.

The verdict

For exploration, multi-model needs, and small-scale projects: OpenRouter is the right default. The markup is real but small, and the consolidation is worth more than the savings.

For production workloads on one or two specific models, especially at over $500/month spend: go direct. The markup compounds and the simpler data flow makes compliance reviews shorter.

For everyone else: do both. Direct accounts on what you ship, OpenRouter on what you're testing. The two patterns aren't competitors — they solve different problems.