Skip to content

Available Models

AI Foundation Services provides access to a wide range of open-source and proprietary LLMs, embedding models, vision models, and audio models — all through a single OpenAI-compatible API.

Available Models (45)

Gemma 4 – Preview

Cloud: T-Cloud Server Location: Germany Data Processing: Germany

GPT-OSS 120B

Cloud: T-Cloud Server Location: Germany Data Processing: Germany

Qwen3-Next 80B

Cloud: T-Cloud Server Location: Germany Data Processing: Germany

Qwen3-VL 30B

Cloud: T-Cloud Server Location: Germany Data Processing: Germany

Qwen3 Coder 30B

Cloud: T-Cloud Server Location: Germany Data Processing: Germany

Llama 3.3 70B

Cloud: T-Cloud Server Location: Germany Data Processing: Germany

Mistral Small 3

Cloud: T-Cloud Server Location: Germany Data Processing: Germany

Mistral Medium 3

Cloud: GCP Server Location: Europe Data Processing: EU

GPT-5.2

Cloud: Azure Server Location: Sweden Data Processing: EU

GPT-5

Cloud: Azure Server Location: Sweden Data Processing: EU

GPT-5 mini

Cloud: Azure Server Location: Sweden Data Processing: EU

GPT-5 Codex

Cloud: Azure Server Location: Sweden Data Processing: EU

o4 mini

Cloud: Azure Server Location: Sweden Data Processing: EU

o3

Cloud: Azure Server Location: Sweden Data Processing: EU

o3 mini

Cloud: Azure Server Location: Sweden Data Processing: EU

o1 mini

Cloud: Azure Server Location: Sweden Data Processing: EU

Claude 4.6 Opus

Cloud: GCP Server Location: Europe Data Processing: EU

Claude 4.6 Sonnet

Cloud: GCP Server Location: Europe Data Processing: EU

Claude 4.5 Opus

Cloud: GCP Server Location: Europe Data Processing: EU

Claude 4.5 Sonnet (>200k)

Cloud: GCP Server Location: Europe Data Processing: EU

Claude 4.5 Sonnet (≤200k)

Cloud: GCP Server Location: Europe Data Processing: EU

Claude 4.5 Haiku

Cloud: GCP Server Location: Europe Data Processing: EU

Claude 4 Sonnet

Cloud: GCP Server Location: Europe Data Processing: EU

Claude 3.7 Sonnet

Cloud: GCP Server Location: Europe Data Processing: EU

Gemini 3 Pro (>200k)

Cloud: GCP Server Location: Europe Data Processing: EU

Gemini 3 Pro (≤200k)

Cloud: GCP Server Location: Europe Data Processing: EU

Gemini 3 Flash

Cloud: GCP Server Location: Europe Data Processing: EU

Gemini 2.5 Pro (>200k)

Cloud: GCP Server Location: Europe Data Processing: EU

Gemini 2.5 Pro (≤200k)

Cloud: GCP Server Location: Europe Data Processing: EU

Gemini 2.5 Flash

Cloud: GCP Server Location: Europe Data Processing: EU

Gemini 2.5 Flash Image

Cloud: GCP Server Location: Europe Data Processing: EU

GPT-4.1

Cloud: Azure Server Location: Sweden Data Processing: EU

GPT-4.1 mini

Cloud: Azure Server Location: Sweden Data Processing: EU

GPT-4.1 nano

Cloud: Azure Server Location: Sweden Data Processing: EU

GPT-4o

Cloud: Azure Server Location: France Data Processing: EU

GPT-4o mini

Cloud: Azure Server Location: Sweden Data Processing: EU

GPT-Image-1 (Text→Image)

Cloud: Azure Server Location: Sweden Data Processing: EU

GPT-Image-1 (Image→Image)

Cloud: Azure Server Location: Sweden Data Processing: EU

BGE-M3

Cloud: T-Cloud Server Location: Germany Data Processing: Germany

Jina Embeddings v2 Base DE

Cloud: T-Cloud Server Location: Germany Data Processing: Germany

Whisper Large v3

Cloud: T-Cloud Server Location: Germany Data Processing: Germany

Whisper Large v3 Turbo

Cloud: T-Cloud Server Location: Germany Data Processing: Germany

o1

Cloud: Azure Server Location: Sweden Data Processing: EU

Qwen 3 Coder 30B

Cloud: T-Cloud Server Location: Germany Data Processing: Germany

GPT 5.2

Cloud: Azure Server Location: Sweden Data Processing: EU

For a complete list with pricing, see the Plans & Pricing page.

Terminal window
curl "$OPENAI_BASE_URL/models" \
-H "Authorization: Bearer $OPENAI_API_KEY"

Each model includes metadata you can inspect:

models = client.models.list()
print(models.data[0].meta_data)
{
"model_type": "LLM",
"source_type": "OPEN SOURCE",
"max_sequence_length": 128000,
"hidden_size": 0,
"max_output_length": 0,
"deployment_region": "otc-germany",
"location": "otc-germany",
"license": "https://www.llama.com/llama3_3/license/",
"display_name": "Meta LLama 3.3 70B",
"deployment_country": "Germany, EU",
"input_modalities": ["text"],
"output_modalities": ["text"],
"model_lifecycle_stage": "Stable",
"is_externally_hosted": false
}

Key fields:

  • model_typeLLM, EMBEDDING, STT (audio)
  • max_sequence_length — Maximum input + output tokens
  • hidden_size — Embedding vector dimensions (embedding models only)
  • max_output_length — Maximum output tokens the model can generate
  • deployment_region — Where the model is hosted (e.g., otc-germany for T-Cloud Germany)
CategoryExamplesUse Case
LLMLlama 3.3 70B, GPT 4.1, Claude Sonnet 4, Qwen 3Text generation, chat, reasoning
Embeddingjina-embeddings-v2-base-de, text-embedding-bge-m3Semantic search, RAG
VisionQwen3-VL-30B-A3B-Instruct-FP8, Gemini 2.5 FlashImage analysis, multimodal chat
Audio/STTwhisper-large-v3, whisper-large-v3-turboSpeech-to-text, translation
Image Generationgpt-image-1Image creation from text prompts
  • Open-source models are hosted on the T-Cloud in Germany, fully GDPR-compliant.
  • Proprietary models (GPT, Claude, Gemini) are hosted via MS Azure, AWS, or GCP in GDPR-compliant configurations.

All data processing stays within the EU.