← All providers
Weights & Biases
18 models · routes through the OpenAI-Compatible adapter
API endpoint
https://api.inference.wandb.ai/v1
Required environment variables
- WANDB_API_KEY
Models
| Model | Context | Input | Output | Capabilities |
|---|---|---|---|---|
| zai-org/GLM-5.1 | 200k | $1.40 / 1M | $4.40 / 1M | tools, reasoning, open-weights |
| nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-FP8 | 262k | $0.20 / 1M | $0.80 / 1M | tools, open-weights |
| MiniMaxAI/MiniMax-M2.5 | 197k | $0.30 / 1M | $1.20 / 1M | tools, open-weights |
| zai-org/GLM-5-FP8 | 200k | $1.00 / 1M | $3.20 / 1M | tools, open-weights |
| moonshotai/Kimi-K2.5 | 262k | $0.50 / 1M | $2.85 / 1M | tools, reasoning, open-weights, vision |
| deepseek-ai/DeepSeek-V3.1 | 161k | $0.55 / 1M | $1.65 / 1M | tools, open-weights |
| openai/gpt-oss-120b | 131k | $0.15 / 1M | $0.60 / 1M | tools |
| openai/gpt-oss-20b | 131k | $0.05 / 1M | $0.20 / 1M | tools |
| Qwen/Qwen3-30B-A3B-Instruct-2507 | 262k | $0.10 / 1M | $0.30 / 1M | tools, open-weights |
| Qwen/Qwen3-235B-A22B-Thinking-2507 | 262k | $0.10 / 1M | $0.10 / 1M | tools, reasoning, open-weights |
| Qwen/Qwen3-Coder-480B-A35B-Instruct | 262k | $1.00 / 1M | $1.50 / 1M | tools, open-weights |
| OpenPipe/Qwen3-14B-Instruct | 33k | $0.05 / 1M | $0.22 / 1M | tools, open-weights |
| Qwen/Qwen3-235B-A22B-Instruct-2507 | 262k | $0.10 / 1M | $0.10 / 1M | tools, open-weights |
| meta-llama/Llama-4-Scout-17B-16E-Instruct | 64k | $0.17 / 1M | $0.66 / 1M | tools, reasoning, open-weights, vision |
| microsoft/Phi-4-mini-instruct | 128k | $0.08 / 1M | $0.35 / 1M | tools, reasoning, open-weights |
| meta-llama/Llama-3.3-70B-Instruct | 128k | $0.71 / 1M | $0.71 / 1M | tools, reasoning, open-weights |
| meta-llama/Llama-3.1-70B-Instruct | 128k | $0.80 / 1M | $0.80 / 1M | tools, open-weights |
| meta-llama/Llama-3.1-8B-Instruct | 128k | $0.22 / 1M | $0.22 / 1M | tools, reasoning, open-weights |
Sourced from models.dev. Pricing reflects the catalog at the time this page was rendered.