Nebius AI Provider
Nebius AI Studio - OpenAI-compatible API for large language models
Available Models
Gemma 3 27B
gemma-3-27bProviders
Nebius AI
nebius/gemma-3-27bContext Size
128k
Stability
STABLEPricing
Input
$0.27/M
Cached
—/M
Output
$0.27/M
Capabilities
Streaming
Vision
Llama 3.1 8B Instruct
llama-3.1-8b-instructProviders
Nebius AI
nebius/llama-3.1-8b-instructContext Size
128k
Stability
STABLEPricing
Input
$0.02/M
Cached
—/M
Output
$0.06/M
Capabilities
Streaming
Llama 3.1 Nemotron Ultra 253B
llama-3.1-nemotron-ultra-253bProviders
Nebius AI
nebius/llama-3.1-nemotron-ultra-253bContext Size
128k
Stability
STABLEPricing
Input
$0.60/M
Cached
—/M
Output
$1.80/M
Capabilities
Streaming
JSON Output
Llama 3.3 70B Instruct
llama-3.3-70b-instructProviders
Nebius AI
nebius/llama-3.3-70b-instructContext Size
128k
Stability
STABLEPricing
Input
$0.13/M
Cached
—/M
Output
$0.40/M
Capabilities
Streaming
Tools
JSON Output
Llama 3.1 405B Instruct
llama-3.1-405b-instructProviders
Nebius AI
nebius/llama-3.1-405b-instructContext Size
128k
Stability
STABLEPricing
Input
$1.00/M
Cached
—/M
Output
$3.00/M
Capabilities
Streaming
Tools
JSON Output
DeepSeek V3
deepseek-v3Providers
Nebius AI
nebius/deepseek-v3Context Size
64k
Stability
unstablePricing
Input
$0.50/M
Cached
—/M
Output
$1.50/M
Capabilities
Streaming
DeepSeek R1 (0528)
deepseek-r1-0528Providers
Nebius AI
nebius/deepseek-r1-0528Context Size
64k
Stability
unstablePricing
Input
$0.80/M
Cached
—/M
Output
$2.40/M
Capabilities
Streaming
Kimi K2
kimi-k2Providers
Nebius AI
nebius/kimi-k2Context Size
131.1k
Stability
STABLEPricing
Input
$0.50/M
Cached
—/M
Output
$2.40/M
Capabilities
Streaming
Tools
JSON Output
Qwen QwQ 32B
qwen-qwq-32bProviders
Nebius AI
nebius/qwen-qwq-32bContext Size
32.8k
Stability
STABLEPricing
Input
$0.15/M
Cached
—/M
Output
$0.45/M
Capabilities
Streaming
JSON Output
Qwen3 235B A22B Instruct 2507
qwen3-235b-a22b-instruct-2507Providers
Nebius AI
nebius/qwen3-235b-a22b-instruct-2507Context Size
262k
Stability
STABLEPricing
Input
$0.20/M
Cached
—/M
Output
$0.60/M
Capabilities
Streaming
Tools
JSON Output
Qwen3 235B A22B Thinking 2507
qwen3-235b-a22b-thinking-2507Providers
Nebius AI
nebius/qwen3-235b-a22b-thinking-2507Context Size
262k
Stability
unstablePricing
Input
$0.20/M
Cached
—/M
Output
$0.60/M
Capabilities
Streaming
Tools
Reasoning
JSON Output
Qwen3 14B
qwen3-14bProviders
Nebius AI
nebius/qwen3-14bContext Size
32.8k
Stability
STABLEPricing
Input
$0.08/M
Cached
—/M
Output
$0.24/M
Capabilities
Streaming
Tools
JSON Output
Qwen3 32B
qwen3-32bProviders
Nebius AI
nebius/qwen3-32bContext Size
32.8k
Stability
STABLEPricing
Input
$0.10/M
Cached
—/M
Output
$0.30/M
Capabilities
Streaming
Tools
JSON Output
Qwen3 30B A3B
qwen3-30b-a3bProviders
Nebius AI
nebius/qwen3-30b-a3bContext Size
32.8k
Stability
STABLEPricing
Input
$0.10/M
Cached
—/M
Output
$0.30/M
Capabilities
Streaming
Tools
JSON Output
Qwen2.5 Coder 7B
qwen25-coder-7bProviders
Nebius AI
nebius/qwen25-coder-7bContext Size
32.8k
Stability
STABLEPricing
Input
$0.01/M
Cached
—/M
Output
$0.03/M
Capabilities
Streaming
JSON Output
Qwen2.5 32B Instruct
qwen25-32b-instructProviders
Nebius AI
nebius/qwen25-32b-instructContext Size
32.8k
Stability
STABLEPricing
Input
$0.06/M
Cached
—/M
Output
$0.20/M
Capabilities
Streaming
Tools
JSON Output
Qwen2.5 72B Instruct
qwen25-72b-instructProviders
Nebius AI
nebius/qwen25-72b-instructContext Size
32.8k
Stability
STABLEPricing
Input
$0.13/M
Cached
—/M
Output
$0.40/M
Capabilities
Streaming
Tools
JSON Output
Qwen2 VL 72B Instruct
qwen2-vl-72b-instructProviders
Nebius AI
nebius/qwen2-vl-72b-instructContext Size
32.8k
Stability
STABLEPricing
Input
$0.13/M
Cached
—/M
Output
$0.40/M
Capabilities
Streaming
Vision
JSON Output
Qwen2.5 VL 72B Instruct
qwen2-5-vl-72b-instructProviders
Nebius AI
nebius/qwen2-5-vl-72b-instructContext Size
32.8k
Stability
STABLEPricing
Input
$0.13/M
Cached
—/M
Output
$0.40/M
Capabilities
Streaming
Vision
JSON Output
Qwen3 Coder 480B A35B Instruct
qwen3-coder-480b-a35b-instructProviders
Nebius AI
nebius/qwen3-coder-480b-a35b-instructContext Size
262k
Stability
STABLEPricing
Input
$0.40/M
Cached
—/M
Output
$1.80/M
Capabilities
Streaming
Tools
JSON Output
Qwen3 Coder 30B A3B Instruct
qwen3-coder-30b-a3b-instructProviders
Nebius AI
nebius/qwen3-coder-30b-a3b-instructContext Size
262k
Stability
STABLEPricing
Input
$0.10/M
Cached
—/M
Output
$0.30/M
Capabilities
Streaming
Tools
JSON Output
Qwen3 30B A3B Instruct 2507
qwen3-30b-a3b-instruct-2507Providers
Nebius AI
nebius/qwen3-30b-a3b-instruct-2507Context Size
262k
Stability
STABLEPricing
Input
$0.10/M
Cached
—/M
Output
$0.30/M
Capabilities
Streaming
Tools
JSON Output
Qwen3 30B A3B Thinking 2507
qwen3-30b-a3b-thinking-2507Providers
Nebius AI
nebius/qwen3-30b-a3b-thinking-2507Context Size
262k
Stability
STABLEPricing
Input
$0.10/M
Cached
—/M
Output
$0.30/M
Capabilities
Streaming
Tools
Reasoning
JSON Output
Hermes 3 Llama 405B
hermes-3-llama-405bProviders
Nebius AI
nebius/hermes-3-llama-405bContext Size
131.1k
Stability
STABLEPricing
Input
$1.00/M
Cached
—/M
Output
$3.00/M
Capabilities
Streaming
JSON Output