- Rebrand commits already in history (gsd → forge) - Sync pre-existing doc, docker, and CI config updates - All rebrand artifacts verified in place: * Native crates: forge-engine, forge-ast, forge-grep * Log prefixes: [forge] across 22+ files * Binary: ~/bin/sf-run * Workspace scopes: @sf-run/*, @singularity-forge/* * Nix flake: Rust toolchain ready System ready for: nix develop && bun run build:native Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
94 lines
2.7 KiB
Markdown
94 lines
2.7 KiB
Markdown
# Choosing a Model
|
|
|
|
SF auto-selects a default model after you log in to a provider. You can switch models at any time.
|
|
|
|
## Switch Models
|
|
|
|
Inside a SF session, type:
|
|
|
|
```
|
|
/model
|
|
```
|
|
|
|
This opens an interactive picker showing all available models from your configured providers.
|
|
|
|
## Per-Phase Models
|
|
|
|
Different phases of work have different requirements. You can assign specific models to each phase in your preferences:
|
|
|
|
```yaml
|
|
models:
|
|
research: claude-sonnet-4-6 # scouting and research
|
|
planning: claude-opus-4-6 # architectural decisions
|
|
execution: claude-sonnet-4-6 # writing code
|
|
execution_simple: claude-haiku-4-5 # simple tasks (docs, config)
|
|
completion: claude-sonnet-4-6 # summaries and wrap-up
|
|
subagent: claude-sonnet-4-6 # delegated sub-tasks
|
|
```
|
|
|
|
Omit a key to use whatever model is currently active for that phase.
|
|
|
|
## Model Fallbacks
|
|
|
|
If a model is unavailable (provider down, rate limited, credits exhausted), SF can automatically fall back to another:
|
|
|
|
```yaml
|
|
models:
|
|
planning:
|
|
model: claude-opus-4-6
|
|
fallbacks:
|
|
- openrouter/z-ai/glm-5
|
|
- openrouter/moonshotai/kimi-k2.5
|
|
```
|
|
|
|
Fallbacks are tried in order until one works.
|
|
|
|
## Token Profiles
|
|
|
|
Token profiles coordinate model selection, phase skipping, and context compression with a single setting:
|
|
|
|
| Profile | Cost Savings | Best For |
|
|
|---------|-------------|----------|
|
|
| `budget` | 40-60% | Prototyping, small projects, well-understood codebases |
|
|
| `balanced` | 10-20% | Most projects, day-to-day development (default) |
|
|
| `quality` | 0% (baseline) | Complex architectures, greenfield projects, critical work |
|
|
|
|
```yaml
|
|
token_profile: balanced
|
|
```
|
|
|
|
See [Token Optimization](../features/token-optimization.md) for details.
|
|
|
|
## Dynamic Model Routing
|
|
|
|
When enabled, SF automatically picks cheaper models for simple tasks and reserves expensive ones for complex work:
|
|
|
|
```yaml
|
|
dynamic_routing:
|
|
enabled: true
|
|
```
|
|
|
|
A documentation fix gets Haiku. An architectural refactor gets Opus. Your configured model is always the ceiling — routing never upgrades beyond what you've set.
|
|
|
|
See [Dynamic Model Routing](../features/dynamic-model-routing.md) for the full guide.
|
|
|
|
## Supported Providers
|
|
|
|
SF supports 20+ providers out of the box. See [Provider Setup](../configuration/providers.md) for setup instructions:
|
|
|
|
| Provider | Auth Method |
|
|
|----------|-------------|
|
|
| Anthropic (Claude) | OAuth or API key |
|
|
| OpenAI | API key |
|
|
| Google Gemini | API key |
|
|
| OpenRouter | API key |
|
|
| Groq | API key |
|
|
| xAI (Grok) | API key |
|
|
| Mistral | API key |
|
|
| GitHub Copilot | OAuth |
|
|
| Amazon Bedrock | IAM credentials |
|
|
| Vertex AI | ADC |
|
|
| Azure OpenAI | API key |
|
|
| Ollama | Local (no auth) |
|
|
| LM Studio | Local (no auth) |
|
|
| vLLM / SGLang | Local (no auth) |
|