Hosted Model Pricing for Agents
Compare the frontier models you can assign to agents inside the CloudAxis Web OS. Rates, context windows, and notes tuned for long-running agent workloads.
| Model | Input / 1M | Output / 1M | Context | Notes for agents |
|---|---|---|---|---|
| Claude 3.5 Sonnet | $3 | $15 | 200K | Excellent balance for research + synthesis agents |
| Claude 3 Opus | $15 | $75 | 200K | Max quality for final reports or complex reasoning |
| GPT-4o | $2.50 | $10 | 128K | Fast and capable for browser and workflow agents |
| DeepSeek V3 / R1 | $0.14 / $0.55 | $0.28 / $2.19 | 128K+ | Best price/performance for high-volume or monitoring agents |
| Moonshot | $1 | $4 | 200K–1M | Great for very long file or research contexts |
Prices are indicative list rates for hosted access inside CloudAxis. Confirm current rates before high-volume production use. All models are available without you managing API keys.
Choosing models for collaborating agents
In the Web OS you don't have to pick one model for everything. Assign a fast, cheap model to high-volume monitoring or data extraction agents, and reserve a higher-quality model for the final synthesis or customer-facing output step. This reference helps you make those trade-offs with real numbers.
5 use cases
- Cost-optimize a 24/7 price monitor using DeepSeek, then escalate interesting findings to a Sonnet agent for analysis.
- Use Moonshot for ingesting large research reports or codebases, then hand off to GPT-4o for action planning.
- Run cheap daily digests on a lightweight model, reserve Opus for the weekly executive summary.
- Browser-heavy workflows often benefit from GPT-4o's speed on tool use steps.
- Long-running file processing pipelines can stay economical on DeepSeek while still producing high-quality output when needed.
Frequently asked questions
The table shows the underlying hosted rates. Your effective cost inside a CloudAxis plan depends on the plan tier, included credits, and volume. Use the cost estimator tool to model your actual spend.
Yes — that's a core feature of the OS. Each agent in a workflow or on the home screen can have its own model preference.
Vision and tool-specific pricing are handled inside the platform. This reference focuses on the text model rates most relevant for planning agent text workflows.
It is a maintained reference. For production budgeting, always cross-check the current public pricing pages of the model providers.
The estimator helps you model volume. This table helps you choose which model(s) to use for each role in the volume. Use both together.