Models
The public models endpoint is GET /api/inference/models. Use ?type=llm, ?type=media, or ?type=both. If omitted, it defaults to llm.
LLM models
GET /api/inference/models?type=llm returns pricing and input types. Costs are per million tokens.
json
{
"type": "llm",
"models": [
{
"id": "gemma-4-31b",
"name": "Gemma 4",
"input_cost_per_million": 0.13,
"output_cost_per_million": 0.38,
"input_types": ["text", "image"]
}
]
}Media models
GET /api/inference/models?type=media returns only id and name. Costs depend on several factors and are obtained by filling in the desired parameters on the respective media generation pages.
json
{
"type": "media",
"models": [
{
"id": "gpt-image-1.5",
"name": "GPT Image 1.5"
}
]
}