ace-pm 35dc87ef53 chore: sync workspace state after rebrand

- Rebrand commits already in history (gsd → forge)
- Sync pre-existing doc, docker, and CI config updates
- All rebrand artifacts verified in place:
  * Native crates: forge-engine, forge-ast, forge-grep
  * Log prefixes: [forge] across 22+ files
  * Binary: ~/bin/sf-run
  * Workspace scopes: @sf-run/*, @singularity-forge/*
  * Nix flake: Rust toolchain ready

System ready for: nix develop && bun run build:native

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2026-04-15 14:54:20 +02:00

2.7 KiB

Raw Blame History

Choosing a Model

SF auto-selects a default model after you log in to a provider. You can switch models at any time.

Switch Models

Inside a SF session, type:

/model

This opens an interactive picker showing all available models from your configured providers.

Per-Phase Models

Different phases of work have different requirements. You can assign specific models to each phase in your preferences:

models:
  research: claude-sonnet-4-6        # scouting and research
  planning: claude-opus-4-6          # architectural decisions
  execution: claude-sonnet-4-6       # writing code
  execution_simple: claude-haiku-4-5 # simple tasks (docs, config)
  completion: claude-sonnet-4-6      # summaries and wrap-up
  subagent: claude-sonnet-4-6        # delegated sub-tasks

Omit a key to use whatever model is currently active for that phase.

Model Fallbacks

If a model is unavailable (provider down, rate limited, credits exhausted), SF can automatically fall back to another:

models:
  planning:
    model: claude-opus-4-6
    fallbacks:
      - openrouter/z-ai/glm-5
      - openrouter/moonshotai/kimi-k2.5

Fallbacks are tried in order until one works.

Token Profiles

Token profiles coordinate model selection, phase skipping, and context compression with a single setting:

Profile	Cost Savings	Best For
`budget`	40-60%	Prototyping, small projects, well-understood codebases
`balanced`	10-20%	Most projects, day-to-day development (default)
`quality`	0% (baseline)	Complex architectures, greenfield projects, critical work

token_profile: balanced

See Token Optimization for details.

Dynamic Model Routing

When enabled, SF automatically picks cheaper models for simple tasks and reserves expensive ones for complex work:

dynamic_routing:
  enabled: true

A documentation fix gets Haiku. An architectural refactor gets Opus. Your configured model is always the ceiling — routing never upgrades beyond what you've set.

See Dynamic Model Routing for the full guide.

Supported Providers

SF supports 20+ providers out of the box. See Provider Setup for setup instructions:

Provider	Auth Method
Anthropic (Claude)	OAuth or API key
OpenAI	API key
Google Gemini	API key
OpenRouter	API key
Groq	API key
xAI (Grok)	API key
Mistral	API key
GitHub Copilot	OAuth
Amazon Bedrock	IAM credentials
Vertex AI	ADC
Azure OpenAI	API key
Ollama	Local (no auth)
LM Studio	Local (no auth)
vLLM / SGLang	Local (no auth)

2.7 KiB Raw Blame History