One smart key. Clear savings.
We route for quality, cost, and speed—so you don't have to. Three lanes, simple units, and a calculator that tells the truth.
Three lanes. One experience.
BeneCloud blends models behind the scenes, then shows you exactly why.
Economy
Lowest cost for scale
- ✓Optimized open models
- ✓Fast cold start
- ✓Great for high-volume tasks
Balanced
Value meets velocity
- ✓Best overall trade-off
- ✓Consistent latency
- ✓Ideal for most apps
Frontier
Top-tier quality
- ✓State-of-the-art models
- ✓Reasoning-ready
- ✓For mission-critical paths
Switch lanes without changing your code. The Meta-Router honors your SLA and budget.
Plans
Pick the control you need. Keep the freedom you want.
Essential
- ✓Smart Key access
- ✓Usage dashboard
- ✓Standard support
- ✓SLA 99.5%
Pro
- ✓Meta-Router policies
- ✓RAG Orchestrator
- ✓FinOps reports
- ✓SLA 99.9%
Studio
- ✓RAG Harness & evals
- ✓Budgets & alerts
- ✓Vector quotas included
- ✓SLA 99.95%
Enterprise
- ✓Data residency (GCC/global)
- ✓Dedicated serving / RTU
- ✓SSO/SCIM, DPA & SLA 99.99%
- ✓Named success manager
Usage pricing (per 1M tokens)
Smart Key
| Lane | Input | Output | Typical use |
|---|---|---|---|
| Economy | $0.20 | $0.40 | High-volume prompts, automations |
| Balanced | $1.50 | $6.00 | Everyday apps with great UX |
| Frontier | $4.50 | $18.00 | Reasoning, premium outputs |
Actual billing follows your Pricebook. Model mix and discounts are shown in your ledger.
Optimizations that stack
Batch
Up to −50% for non-interactive runs.
Prompt Cache
Up to −70% on cached input share.
RTU Reservations
20–35% off with monthly throughput commitments.
Prompt Enhancer
~12% fewer tokens by design.
RAG & Vector
RAG query: $0.25 per 1,000
RAG node: $0.20 per 1,000
Vector storage: $0.10/GB-month (global), $0.16/GB-month (GCC)
Vector writes: $0.05 per 1,000
Numbers you can trust.
Open the calculator, tune your workload, and see the truth in minutes.
Common questions
All prices are indicative; your Pricebook and contracts prevail. SLAs vary by plan.