Skip to main content

LLM & embedding models

Providers

  • OpenRouter — Unified gateway to Claude, Gemini, GPT (default)
  • Ollama — Local LLM inference

LLM aliases

AliasModel ID
gemini-flashgoogle/gemini-3-flash-preview
claude-sonnetanthropic/claude-sonnet-4.5
gpt-proopenai/gpt-5.2-pro

Configure via shipspec.json (llm.provider, llm.modelName) or ship-spec model set.

Embeddings

ProviderDefault ModelDimensions
OpenRoutermistralai/codestral-embed-2505Auto
Ollamanomic-embed-text768

Token budgeting:

  • maxContextTokens: 16000
  • reservedOutputTokens: 4000
  • RAG allocation ~70% of available context; low-relevance chunks pruned to fit.