← All providers
Nvidia
94 models · routes through the OpenAI-Compatible adapter
API endpoint
https://integrate.api.nvidia.com/v1
Required environment variables
- NVIDIA_API_KEY
Models
| Model | Context | Input | Output | Capabilities |
|---|---|---|---|---|
| nvidia/nemotron-3-ultra-550b-a55b | 1000k | $0.50 / 1M | $2.50 / 1M | tools, reasoning, open-weights |
| stepfun-ai/step-3.7-flash | 256k | free | free | tools, reasoning, open-weights, vision |
| nvidia/nemotron-3-nano-omni-30b-a3b-reasoning | 256k | free | free | tools, reasoning, open-weights, vision |
| deepseek-ai/deepseek-v4-flash | 1049k | $0.14 / 1M | $0.28 / 1M | tools, reasoning, open-weights |
| deepseek-ai/deepseek-v4-pro | 1049k | $0.43 / 1M | $0.87 / 1M | tools, reasoning, open-weights |
| moonshotai/kimi-k2.6 | 262k | free | free | tools, reasoning, open-weights, vision |
| nvidia/active-speaker-detection | — | free | free | open-weights |
| nvidia/nemotron-3-content-safety | 128k | free | free | open-weights |
| nvidia/synthetic-video-detector | — | free | free | open-weights |
| google/gemma-4-31b-it | 256k | free | free | tools, reasoning, open-weights, vision |
| nvidia/llama-nemotron-rerank-vl-1b-v2 | 128k | free | free | open-weights, vision |
| z-ai/glm-5.1 | 131k | free | free | tools, reasoning, open-weights |
| minimaxai/minimax-m2.7 | 205k | free | free | tools, reasoning, open-weights |
| mistralai/mistral-small-4-119b-2603 | 128k | free | free | tools, open-weights |
| nvidia/nemotron-voicechat | 128k | free | free | tools, open-weights |
| nvidia/nemotron-3-super-120b-a12b | 262k | $0.20 / 1M | $0.80 / 1M | tools, reasoning, open-weights |
| nvidia/gliner-pii | 128k | free | free | open-weights |
| nvidia/cosmos-transfer2_5-2b | — | free | free | open-weights, vision |
| qwen/qwen3.5-122b-a10b | 262k | free | free | tools, reasoning, open-weights, vision |
| qwen/qwen3.5-397b-a17b | 262k | free | free | tools, reasoning, open-weights, vision |
| minimaxai/minimax-m2.5 | 205k | free | free | tools, reasoning, open-weights |
| nvidia/llama-nemotron-embed-vl-1b-v2 | 33k | free | free | open-weights, vision |
| stepfun-ai/step-3.5-flash | 256k | free | free | tools, reasoning, open-weights |
| nvidia/nemotron-content-safety-reasoning-4b | 128k | free | free | reasoning, open-weights |
| black-forest-labs/flux_2-klein-4b | 41k | free | free | open-weights, vision |
| nvidia/usdcode | 128k | free | free | — |
| z-ai/glm4.7 | 205k | free | free | tools, reasoning, open-weights |
| nvidia/riva-translate-4b-instruct-v1_1 | 128k | free | free | open-weights |
| mistralai/devstral-2-123b-instruct-2512 | 262k | free | free | tools, reasoning, open-weights |
| mistralai/mistral-large-3-675b-instruct-2512 | 262k | free | free | tools, open-weights, vision |
| deepseek-ai/deepseek-v3.2 | 164k | free | free | tools, reasoning |
| nvidia/streampetr | 128k | free | free | open-weights |
| moonshotai/kimi-k2-thinking | 262k | free | free | tools, reasoning, open-weights |
| nvidia/llama-3_1-nemotron-safety-guard-8b-v3 | 128k | free | free | open-weights |
| mistralai/magistral-small-2506 | 33k | free | free | — |
| mistralai/mistral-medium-3-instruct | 131k | free | free | vision |
| deepseek-ai/deepseek-v3.1-terminus | 128k | free | free | tools, reasoning |
| moonshotai/kimi-k2-instruct-0905 | 262k | free | free | tools, open-weights |
| bytedance/seed-oss-36b-instruct | 262k | free | free | tools |
| qwen/qwen-image-edit | — | free | free | vision |
| nvidia/nvidia-nemotron-nano-9b-v2 | 131k | free | free | tools, reasoning, open-weights |
| black-forest-labs/flux_1-kontext-dev | 41k | free | free | open-weights, vision |
| qwen/qwen-image | — | free | free | vision |
| openai/gpt-oss-20b | 131k | free | free | tools, reasoning, open-weights |
| openai/gpt-oss-120b | 128k | free | free | reasoning |
| microsoft/phi-4-multimodal-instruct | 128k | free | free | — |
| nvidia/llama-3_3-nemotron-super-49b-v1_5 | 131k | free | free | tools, reasoning, open-weights |
| sarvamai/sarvam-m | 128k | free | free | tools, open-weights |
| nvidia/llama-3_2-nemoretriever-300m-embed-v1 | 33k | free | free | open-weights |
| qwen/qwen3-coder-480b-a35b-instruct | 262k | free | free | tools |
| nvidia/cosmos-transfer1-7b | — | free | free | open-weights, vision |
| google/gemma-3n-e2b-it | 128k | free | free | tools, open-weights, vision |
| mistralai/mistral-nemotron | 128k | free | free | tools, open-weights |
| google/gemma-3n-e4b-it | 128k | free | free | tools, open-weights, vision |
| nvidia/magpie-tts-zeroshot | — | free | free | open-weights |
| nvidia/llama-3_3-nemotron-super-49b-v1 | 131k | free | free | tools, reasoning, open-weights |
| meta/llama-guard-4-12b | 128k | free | free | open-weights, vision |
| meta/llama-4-maverick-17b-128e-instruct | 128k | free | free | tools, open-weights, vision |
| mistralai/mistral-7b-instruct-v03 | 66k | free | free | tools, open-weights |
| nvidia/bevformer | 128k | free | free | open-weights |
| nvidia/cosmos-predict1-5b | — | free | free | open-weights, vision |
| nvidia/sparsedrive | 128k | free | free | open-weights |
| nvidia/nv-embedcode-7b-v1 | 33k | free | free | open-weights |
| meta/llama-3.1-8b-instruct | 16k | free | free | tools, open-weights |
| moonshotai/kimi-k2-instruct | 128k | free | free | tools, reasoning |
| google/gemma-3-27b-it | 131k | free | free | tools, reasoning, vision |
| microsoft/phi-4-mini-instruct | 131k | free | free | tools, reasoning, vision |
| qwen/qwen3-next-80b-a3b-instruct | 262k | free | free | tools |
| qwen/qwen3-next-80b-a3b-thinking | 262k | free | free | tools, reasoning, open-weights |
| nvidia/nemotron-3-nano-30b-a3b | 131k | free | free | tools, reasoning, open-weights |
| meta/llama-3.3-70b-instruct | 128k | free | free | tools, open-weights |
| qwen/qwen2.5-coder-32b-instruct | 128k | free | free | tools, open-weights |
| nvidia/studiovoice | 128k | free | free | open-weights |
| meta/llama-3.2-90b-vision-instruct | 128k | free | free | tools, open-weights, vision |
| meta/llama-3.2-11b-vision-instruct | 128k | free | free | tools, open-weights, vision |
| meta/llama-3.2-1b-instruct | 128k | free | free | tools, open-weights |
| meta/llama-3.2-3b-instruct | 33k | free | free | open-weights |
| abacusai/dracarys-llama-3_1-70b-instruct | 128k | free | free | tools, open-weights |
| meta/esm2-650m | 128k | free | free | open-weights |
| nvidia/nemotron-mini-4b-instruct | 128k | free | free | tools, open-weights |
| black-forest-labs/flux_1-schnell | 0k | free | free | open-weights |
| black-forest-labs/flux.1-dev | 4k | free | free | — |
| nvidia/usdvalidate | — | free | free | open-weights |
| google/gemma-2-2b-it | 128k | free | free | tools, open-weights |
| meta/llama-3.1-70b-instruct | 128k | free | free | tools, open-weights |
| nvidia/nv-embed-v1 | 33k | free | free | open-weights |
| upstage/solar-10_7b-instruct | 128k | free | free | tools, open-weights |
| google/google-paligemma | 128k | free | free | open-weights, vision |
| mistralai/mixtral-8x22b-instruct | 66k | free | free | tools, open-weights |
| nvidia/rerank-qa-mistral-4b | 128k | free | free | open-weights |
| meta/esmfold | 128k | free | free | open-weights |
| baai/bge-m3 | 8k | free | free | open-weights |
| mistralai/mixtral-8x7b-instruct | 33k | free | free | tools, open-weights |
| openai/whisper-large-v3 | — | free | free | open-weights |
Sourced from models.dev. Pricing reflects the catalog at the time this page was rendered.