Platform pricing

Pay for tokens.
Nothing else.

The Platform is priced purely on usage. You pay each provider's published token rate, with no Cudator markup, and routing, load-balancing, and failover are included. No seats, no monthly minimum.

Usage-only today; seat & volume plans coming later

How it's priced

One meter: the tokens you route.

Fund a wallet, generate a key, and start routing. Every prompt and response is metered at provider rates and drawn down from prepaid or invoiced credit.

Pass-through token rates

You pay what the provider charges, per million tokens — input and output — with no platform margin added on top.

  • No per-call or per-seat fee
  • Live per-model rates in the catalog
  • Self-hosted models route at $0
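As a rough illustration of per-million-token pricing, the sketch below computes the cost of one request. The model names and rates are hypothetical placeholders, not catalog values; real rates come from the catalog.

```python
from decimal import Decimal

# Hypothetical USD rates per million tokens; these are NOT real catalog prices.
RATES = {
    "example-model": {"input": Decimal("3.00"), "output": Decimal("15.00")},
    "self-hosted-model": {"input": Decimal("0"), "output": Decimal("0")},
}

MILLION = Decimal(1_000_000)


def request_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Cost of one request: tokens times the per-million rate, no markup added."""
    rate = RATES[model]
    total = input_tokens * rate["input"] + output_tokens * rate["output"]
    return total / MILLION


# 12,000 input tokens and 800 output tokens on the hypothetical model.
cost = request_cost("example-model", 12_000, 800)
```

With these placeholder rates, the example request works out to (12,000 × 3.00 + 800 × 15.00) / 1,000,000 = $0.048, and the same request on a self-hosted model costs $0.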

Routing is free

Policy routing, weighted load-balancing, automatic failover, and residency enforcement are part of the platform — not a line item.

  • Cost / latency / quality policies
  • Failover across a credential pool
  • Region & jurisdiction pinning
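For readers who want a mental model of weighted load-balancing with failover, here is a minimal sketch. It is not Cudator's implementation: the `route` function, the `(credential, weight)` pool shape, and the `send` callback are all assumptions made for illustration.

```python
import random


def route(request, pool, send, rng=random):
    """Pick a credential by weight; on failure, drop it and try the rest.

    pool: list of (credential, weight) pairs with positive weights (hypothetical shape).
    send: callable(credential, request) that raises an exception on failure.
    """
    remaining = list(pool)
    errors = []
    while remaining:
        credentials = [c for c, _ in remaining]
        weights = [w for _, w in remaining]
        choice = rng.choices(credentials, weights=weights, k=1)[0]
        try:
            return choice, send(choice, request)
        except Exception as exc:
            # Record the failure and fail over to the remaining credentials.
            errors.append((choice, exc))
            remaining = [(c, w) for c, w in remaining if c != choice]
    raise RuntimeError(f"all credentials failed: {errors}")
```

A caller would pass a pool such as `[("cred-a", 3), ("cred-b", 1)]`, so that `cred-a` receives roughly three quarters of the traffic while it is healthy.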

One wallet, one invoice

Spend with every provider, across every key, team, and country, settles to one Cudator wallet, prepaid or invoiced, in the currency you choose.

  • Consolidated multi-currency billing
  • Itemised usage export
  • VAT / GST handled per jurisdiction

Rates are pass-through provider prices per million tokens. See every model's live rate in the catalog →

Stay in control

Usage-based, never a surprise.

Caps are enforced before a request ever leaves the gateway, so a runaway loop can't run up the bill. Set limits at any level of the hierarchy: API key, workspace, or legal entity.

Per-key & per-workspace caps

Hard and soft spend limits per API key, workspace, and legal entity — with alerts before you hit them.
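One way to picture a pre-flight cap check is sketched below. The `Cap` type and `preflight` function are hypothetical, and the sketch assumes that a soft cap only raises an alert while a hard cap blocks the request, and that a cost estimate is available before the request is sent.

```python
from dataclasses import dataclass
from decimal import Decimal


@dataclass
class Cap:
    scope: str      # e.g. "key", "workspace", or "entity"
    spent: Decimal  # spend so far in the current period
    soft: Decimal   # assumed behaviour: alert only
    hard: Decimal   # assumed behaviour: block the request


def preflight(caps, estimated_cost):
    """Return (allowed, alerts) for a request before it is sent."""
    alerts = []
    for cap in caps:
        projected = cap.spent + estimated_cost
        if projected > cap.hard:
            # Any hard cap in the hierarchy blocks the request outright.
            return False, alerts + [f"{cap.scope}: hard cap {cap.hard} would be exceeded"]
        if projected > cap.soft:
            alerts.append(f"{cap.scope}: soft cap {cap.soft} crossed")
    return True, alerts
```

Checking every level in one pass means the tightest limit wins, whichever level it sits at.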

Real-time usage ledger

Every call logged with model, provider, region, and cost — drill down by key or roll up to one invoice.
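The drill-down and roll-up described here amount to grouping ledger rows by a field. The sketch below assumes a hypothetical row shape with the attributes listed above; the keys, models, providers, and costs are made-up sample data.

```python
from collections import defaultdict
from decimal import Decimal

# Hypothetical ledger rows; the field names and values are illustrative only.
LEDGER = [
    {"key": "key-a", "model": "model-1", "provider": "provider-x", "region": "eu", "cost": Decimal("0.048")},
    {"key": "key-a", "model": "model-2", "provider": "provider-y", "region": "us", "cost": Decimal("0.120")},
    {"key": "key-b", "model": "model-1", "provider": "provider-x", "region": "eu", "cost": Decimal("0.010")},
]


def total_by(entries, field):
    """Sum cost per distinct value of `field` (key, model, provider, or region)."""
    totals = defaultdict(Decimal)  # Decimal() starts each bucket at 0
    for entry in entries:
        totals[entry[field]] += entry["cost"]
    return dict(totals)


by_key = total_by(LEDGER, "key")                     # drill down by key
invoice_total = sum(by_key.values(), Decimal("0"))   # roll up to one figure
```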

Prepaid or invoiced

Top up a wallet as you go, or settle monthly on invoice with terms — multi-currency either way.

Global billing

One invoice for AI, across every country you operate in.

Run AI across dozens of subsidiaries, currencies, and tax regimes. Cudator meters every entity locally, handles FX and tax, and rolls it into a single consolidated bill — settled in the currency your finance team chooses.

USD · EUR · GBP · JPY · SGD +40
SEPA · ACH · Wire · Card
VAT / GST handled

Acme Global Industries
consolidated · 5 entities · Mar 2026 · settles in USD

  • US · Acme US Inc. · United States · $48,200.00 USD
  • GB · Acme UK Ltd. · United Kingdom · £28,100.00 ≈ $35,640
  • DE · Acme Deutschland GmbH · Germany, incl. VAT · €19,540.00 ≈ $21,100
  • JP · Acme KK · Japan, incl. JCT · ¥2,410,000 ≈ $16,180
  • SG · Acme APAC Pte. · Singapore, incl. GST · S$12,800.00 ≈ $9,520

Total due · all entities: $130,640.00
FX & tax reconciled by Cudator
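
The consolidated total in the sample invoice is the sum of each entity's USD equivalent. The sketch below reproduces that arithmetic from the figures shown; the implied FX rates are derived from the sample numbers and are not published rates.

```python
from decimal import Decimal

# (entity, currency, local amount, USD equivalent) taken from the sample invoice.
LINES = [
    ("Acme US Inc.", "USD", Decimal("48200.00"), Decimal("48200")),
    ("Acme UK Ltd.", "GBP", Decimal("28100.00"), Decimal("35640")),
    ("Acme Deutschland GmbH", "EUR", Decimal("19540.00"), Decimal("21100")),
    ("Acme KK", "JPY", Decimal("2410000"), Decimal("16180")),
    ("Acme APAC Pte.", "SGD", Decimal("12800.00"), Decimal("9520")),
]

# Consolidated total in the settlement currency (USD).
total_usd = sum((usd for *_, usd in LINES), Decimal("0"))

# FX rate implied by each line: USD equivalent divided by local amount.
implied_rates = {
    currency: (usd / local).quantize(Decimal("0.0001"))
    for _, currency, local, usd in LINES
}
```

Summing the five USD equivalents gives 130,640, which matches the total due on the sample invoice.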
Questions

How Platform billing works.

Is there a platform fee or markup?

No. You pay each provider's published token rate. Routing, failover, and residency enforcement are included — there's no per-call or per-seat charge today.

Are there seats or plans?

Not for the Platform. It's usage-only for now — fund a wallet and pay per token. Seat and volume plans are on the roadmap. (Cudator Chat, the team product, is priced per seat — see seat pricing.)

How do self-hosted models bill?

Requests routed to your own vLLM or VPC-hosted models incur no token charge from Cudator — you're running the compute. They still appear in the usage ledger for a complete audit trail.

Can I cap spend?

Yes. Set hard and soft caps per key, workspace, and entity. Caps are enforced before a request leaves the gateway, so usage can't run past your limit. Top up prepaid or settle on invoice.

Start building

Route your first request in minutes.

Generate a key, fund a wallet, and point your SDK at the gateway. No card required to start.

Prepaid or invoiced · multi-currency · cancel anytime