singularity-forge/gitbook/getting-started/choosing-a-model.md
ace-pm 35dc87ef53 chore: sync workspace state after rebrand
- Rebrand commits already in history (gsd → forge)
- Sync pre-existing doc, docker, and CI config updates
- All rebrand artifacts verified in place:
  * Native crates: forge-engine, forge-ast, forge-grep
  * Log prefixes: [forge] across 22+ files
  * Binary: ~/bin/sf-run
  * Workspace scopes: @sf-run/*, @singularity-forge/*
  * Nix flake: Rust toolchain ready

System ready for: nix develop && bun run build:native

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-04-15 14:54:20 +02:00

2.7 KiB

Choosing a Model

SF auto-selects a default model after you log in to a provider. You can switch models at any time.

Switch Models

Inside a SF session, type:

/model

This opens an interactive picker showing all available models from your configured providers.

Per-Phase Models

Different phases of work have different requirements. You can assign specific models to each phase in your preferences:

models:
  research: claude-sonnet-4-6        # scouting and research
  planning: claude-opus-4-6          # architectural decisions
  execution: claude-sonnet-4-6       # writing code
  execution_simple: claude-haiku-4-5 # simple tasks (docs, config)
  completion: claude-sonnet-4-6      # summaries and wrap-up
  subagent: claude-sonnet-4-6        # delegated sub-tasks

Omit a key to use whatever model is currently active for that phase.

Model Fallbacks

If a model is unavailable (provider down, rate limited, credits exhausted), SF can automatically fall back to another:

models:
  planning:
    model: claude-opus-4-6
    fallbacks:
      - openrouter/z-ai/glm-5
      - openrouter/moonshotai/kimi-k2.5

Fallbacks are tried in order until one works.

Token Profiles

Token profiles coordinate model selection, phase skipping, and context compression with a single setting:

Profile Cost Savings Best For
budget 40-60% Prototyping, small projects, well-understood codebases
balanced 10-20% Most projects, day-to-day development (default)
quality 0% (baseline) Complex architectures, greenfield projects, critical work
token_profile: balanced

See Token Optimization for details.

Dynamic Model Routing

When enabled, SF automatically picks cheaper models for simple tasks and reserves expensive ones for complex work:

dynamic_routing:
  enabled: true

A documentation fix gets Haiku. An architectural refactor gets Opus. Your configured model is always the ceiling — routing never upgrades beyond what you've set.

See Dynamic Model Routing for the full guide.

Supported Providers

SF supports 20+ providers out of the box. See Provider Setup for setup instructions:

Provider Auth Method
Anthropic (Claude) OAuth or API key
OpenAI API key
Google Gemini API key
OpenRouter API key
Groq API key
xAI (Grok) API key
Mistral API key
GitHub Copilot OAuth
Amazon Bedrock IAM credentials
Vertex AI ADC
Azure OpenAI API key
Ollama Local (no auth)
LM Studio Local (no auth)
vLLM / SGLang Local (no auth)