← All providers
Inference
9 models · routes through the OpenAI-Compatible adapter
API endpoint
https://inference.net/v1
Required environment variables
- INFERENCE_API_KEY
Models
| Model | Context | Input | Output | Capabilities |
|---|---|---|---|---|
| google/gemma-3 | 125k | $0.15 / 1M | $0.30 / 1M | tools, open-weights, vision |
| meta/llama-3.1-8b-instruct | 16k | $0.03 / 1M | $0.03 / 1M | tools, open-weights |
| meta/llama-3.2-11b-vision-instruct | 16k | $0.06 / 1M | $0.06 / 1M | tools, open-weights, vision |
| meta/llama-3.2-1b-instruct | 16k | $0.01 / 1M | $0.01 / 1M | tools, open-weights |
| meta/llama-3.2-3b-instruct | 16k | $0.02 / 1M | $0.02 / 1M | tools, open-weights |
| mistral/mistral-nemo-12b-instruct | 16k | $0.04 / 1M | $0.10 / 1M | tools, open-weights |
| osmosis/osmosis-structure-0.6b | 4k | $0.10 / 1M | $0.50 / 1M | tools, open-weights |
| qwen/qwen-2.5-7b-vision-instruct | 125k | $0.20 / 1M | $0.20 / 1M | tools, open-weights, vision |
| qwen/qwen3-embedding-4b | 32k | $0.01 / 1M | free | open-weights |
Sourced from models.dev. Pricing reflects the catalog at the time this page was rendered.