Inference.net Provider
Inference.net is a platform for running large language models in the cloud.
Available Models
Llama 3.1 8B Instruct
llama-3.1-8b-instructProviders
Inference.net
inference.net/llama-3.1-8b-instructContext Size
128k
Stability
unstablePricing
Input
$0.07/M
Cached
—/M
Output
$0.33/M
Capabilities
Streaming
Llama 3.2 11B Instruct
llama-3.2-11b-instructProviders
Inference.net
inference.net/llama-3.2-11b-instructContext Size
128k
Stability
unstablePricing
Input
$0.07/M
Cached
—/M
Output
$0.33/M
Capabilities
Streaming
JSON Output