Choosing a Model
SF auto-selects a default model after you log in to a provider. You can switch models at any time.
Switch Models
Inside an SF session, type:
/model
This opens an interactive picker showing all available models from your configured providers.
Per-Phase Models
Different phases of work have different requirements. You can assign specific models to each phase in your preferences:
models:
  research: claude-sonnet-4-6         # scouting and research
  planning: claude-opus-4-6           # architectural decisions
  execution: claude-sonnet-4-6        # writing code
  execution_simple: claude-haiku-4-5  # simple tasks (docs, config)
  completion: claude-sonnet-4-6       # summaries and wrap-up
  subagent: claude-sonnet-4-6         # delegated sub-tasks
Omit a key to use whatever model is currently active for that phase.
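The lookup rule can be pictured as a dictionary read with a default. This is only an illustrative sketch: `model_for_phase` and `active_model` are hypothetical names, not part of SF, and the real resolution logic may differ.

```python
def model_for_phase(phase: str, models: dict, active_model: str) -> str:
    """Return the model configured for a phase, or the active model if the key is omitted."""
    return models.get(phase, active_model)


# Only two phases are overridden; every other phase keeps the active model.
prefs = {"planning": "claude-opus-4-6", "execution_simple": "claude-haiku-4-5"}

print(model_for_phase("planning", prefs, "claude-sonnet-4-6"))  # claude-opus-4-6
print(model_for_phase("research", prefs, "claude-sonnet-4-6"))  # claude-sonnet-4-6
```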
Model Fallbacks
If a model is unavailable (provider down, rate-limited, or credits exhausted), SF can automatically fall back to another:
models:
  planning:
    model: claude-opus-4-6
    fallbacks:
      - openrouter/z-ai/glm-5
      - openrouter/moonshotai/kimi-k2.5
Fallbacks are tried in order until one works.
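As a rough sketch of "tried in order until one works", the loop below walks the primary model and then each fallback. The helper names (`candidates`, `first_available`) and the `is_available` check are assumptions made for illustration; how SF actually detects an unavailable model is not described here.

```python
from typing import Callable, List, Optional, Union

PhaseConfig = Union[str, dict]


def candidates(entry: PhaseConfig) -> List[str]:
    """Flatten a phase entry into an ordered list: primary model first, then fallbacks."""
    if isinstance(entry, str):
        return [entry]  # short form: a bare model name with no fallbacks
    return [entry["model"], *entry.get("fallbacks", [])]


def first_available(entry: PhaseConfig, is_available: Callable[[str], bool]) -> Optional[str]:
    """Return the first candidate accepted by the availability check, or None."""
    for model in candidates(entry):
        if is_available(model):
            return model
    return None


planning = {
    "model": "claude-opus-4-6",
    "fallbacks": ["openrouter/z-ai/glm-5", "openrouter/moonshotai/kimi-k2.5"],
}
down = {"claude-opus-4-6", "openrouter/z-ai/glm-5"}  # pretend these two are unavailable

print(first_available(planning, lambda m: m not in down))  # openrouter/moonshotai/kimi-k2.5
```

Returning `None` when every candidate fails leaves the decision about what to do next (retry, stop, ask the user) to the caller; that choice is part of the sketch, not documented SF behavior.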
Token Profiles
Token profiles coordinate model selection, phase skipping, and context compression with a single setting:
| Profile | Cost Savings | Best For |
|---|---|---|
| budget | 40-60% | Prototyping, small projects, well-understood codebases |
| balanced | 10-20% | Most projects, day-to-day development (default) |
| quality | 0% (baseline) | Complex architectures, greenfield projects, critical work |
token_profile: balanced
See Token Optimization for details.
Dynamic Model Routing
When enabled, SF automatically picks cheaper models for simple tasks and reserves expensive ones for complex work:
dynamic_routing:
  enabled: true
A documentation fix gets Haiku. An architectural refactor gets Opus. Your configured model is always the ceiling — routing never upgrades beyond what you've set.
See Dynamic Model Routing for the full guide.
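The ceiling rule can be read as a clamp: the router may step down to a cheaper model but never above the configured one. The sketch below assumes a simple three-tier ordering and hypothetical complexity labels (`simple`, `moderate`, `complex`); SF's real classifier and tier list are not specified in this page.

```python
# Hypothetical cost tiers, cheapest first; the real ordering is an assumption.
TIERS = ["claude-haiku-4-5", "claude-sonnet-4-6", "claude-opus-4-6"]

# Hypothetical mapping from task complexity to the tier the router would like to use.
WANTED_TIER = {"simple": 0, "moderate": 1, "complex": 2}


def route(task_complexity: str, configured_model: str) -> str:
    """Pick a model for the task, never exceeding the configured model's tier."""
    wanted = WANTED_TIER[task_complexity]
    ceiling = TIERS.index(configured_model)
    return TIERS[min(wanted, ceiling)]


print(route("simple", "claude-opus-4-6"))     # claude-haiku-4-5
print(route("complex", "claude-opus-4-6"))    # claude-opus-4-6
print(route("complex", "claude-sonnet-4-6"))  # claude-sonnet-4-6
```

The last call shows the ceiling in action: a complex task would prefer the top tier, but with Sonnet configured the router stays at Sonnet.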
Supported Providers
SF supports 20+ providers out of the box. See Provider Setup for setup instructions:
| Provider | Auth Method |
|---|---|
| Anthropic (Claude) | OAuth or API key |
| OpenAI | API key |
| Google Gemini | API key |
| OpenRouter | API key |
| Groq | API key |
| xAI (Grok) | API key |
| Mistral | API key |
| GitHub Copilot | OAuth |
| Amazon Bedrock | IAM credentials |
| Vertex AI | ADC |
| Azure OpenAI | API key |
| Ollama | Local (no auth) |
| LM Studio | Local (no auth) |
| vLLM / SGLang | Local (no auth) |