← All providers
Vultr
7 models · routes through the OpenAI-Compatible adapter
API endpoint
https://api.vultrinference.com/v1
Required environment variables
- VULTR_API_KEY
Models
| Model | Context | Input | Output | Capabilities |
|---|---|---|---|---|
| nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-BF16 | 262k | $0.13 / 1M | $0.38 / 1M | tools, reasoning, open-weights |
| moonshotai/Kimi-K2.6 | 262k | $0.15 / 1M | $0.60 / 1M | tools, reasoning, open-weights |
| zai-org/GLM-5.1-FP8 | 200k | $0.85 / 1M | $3.10 / 1M | tools, reasoning, open-weights |
| MiniMaxAI/MiniMax-M2.7 | 205k | $0.30 / 1M | $1.20 / 1M | tools, reasoning, open-weights |
| nvidia/DeepSeek-V3.2-NVFP4 | 131k | $0.55 / 1M | $1.65 / 1M | tools, reasoning, open-weights |
| nvidia/Nemotron-Cascade-2-30B-A3B | 262k | $0.15 / 1M | $0.60 / 1M | tools, reasoning, open-weights |
| nvidia/Llama-3.1-Nemotron-Safety-Guard-8B-v3 | 8k | $0.01 / 1M | $0.01 / 1M | open-weights |
Sourced from models.dev. Pricing reflects the catalog at the time this page was rendered.