NVIDIA: Nemotron 3 Ultra
🇺🇸 NVIDIA · Nemotron 3
Overview
NVIDIA: Nemotron 3 Ultra is a large language model API from NVIDIA, part of its Nemotron 3 model family. Priced at $0.500 per million input tokens and $2.50 per million output tokens, it occupies the mid-range, balancing capability against running cost. Output tokens cost about 5× as much as input, so prompt-heavy workloads run noticeably cheaper than generation-heavy ones. An exceptionally large 1M-token context window (≈1,500 pages of text) means entire repositories or document collections can be processed without chunking. On Artificial Analysis's Intelligence Index it scores 48 (A grade), a useful proxy for its general reasoning strength relative to the other models tracked here. All prices on this page reflect OpenRouter's routed rates and are re-synced automatically every day; confirm against the provider's official pricing before committing to production.
| Dimension | Unit | Price (USD) |
|---|---|---|
| Input | per 1M tokens | $0.500 |
| Output | per 1M tokens | $2.50 |
| Cached Input | per 1M tokens | $0.150 |
- Provider
- NVIDIA
- Model Family
- Nemotron 3
- Version String
- nvidia/nemotron-3-ultra-550b-a55b
- Status
- Active
- Modality
- Text
- Context Window
- 1,000,000 tokens
- Output Limit
- 16,384 tokens
Index Metrics
Cross-domain capability indexes evaluated by Artificial Analysis — Artificial Analysis
Benchmark Scores
Data source: Artificial Analysis
Performance Metrics
Real-world benchmarks, updated every 72 hours by Artificial Analysis — Artificial Analysis
90-Day Price Trend
Input / Output price (USD per 1M tokens)
Past 90 days of records; every price change is shown here
| Date | Dimension | Price (USD) | Source |
|---|---|---|---|
| Cached Input | $0.150 | OpenRouter | |
| Output | $2.50 | OpenRouter | |
| Input | $0.500 | OpenRouter | |
| Cached Input | $0.150 | OpenRouter | |
| Output | $2.50 | OpenRouter | |
| Input | $0.500 | OpenRouter | |
| Cached Input | $0.150 | OpenRouter | |
| Output | $2.50 | OpenRouter | |
| Input | $0.500 | OpenRouter | |
| Cached Input | $0.150 | OpenRouter | |
| Output | $2.50 | OpenRouter | |
| Input | $0.500 | OpenRouter | |
| Cached Input | $0.150 | OpenRouter | |
| Output | $2.50 | OpenRouter | |
| Input | $0.500 | OpenRouter | |
| Cached Input | $0.150 | OpenRouter | |
| Output | $2.50 | OpenRouter | |
| Input | $0.500 | OpenRouter | |
| Cached Input | $0.150 | OpenRouter | |
| Output | $2.50 | OpenRouter | |
| Input | $0.500 | OpenRouter | |
| Cached Input | $0.150 | OpenRouter | |
| Output | $2.50 | OpenRouter | |
| Input | $0.500 | OpenRouter |
Key Insights
Key data points from this page for quick reference and citation.
- NVIDIA: Nemotron 3 Ultra Input price: $0.5/M tokens
- NVIDIA: Nemotron 3 Ultra Output price: $2.5/M tokens
- Context window: 1,000,000 tokens
- Provider: NVIDIA
- Model family: Nemotron 3
- Modalities: Text
- Data source: OpenRouter, updated daily