singularity/singularity-forge

Author	SHA1	Message	Date
Mikael Hugo	2ed1638153	fix: add headless heartbeat output	2026-04-29 19:29:43 +02:00
Mikael Hugo	0d6eca9cdd	fix: preserve subagent debate mode details	2026-04-29 17:50:26 +02:00
Mikael Hugo	d78c5ac198	feat: add SF skills and subagent debate mode	2026-04-29 17:44:30 +02:00
Mikael Hugo	d02d33aa70	feat: add repo harness profiler	2026-04-29 17:39:52 +02:00
Mikael Hugo	fb4885b757	prompt(execute-task): add parallel-tool-call rule Adds step 0a: when independent reads/greps are needed, batch them in a single assistant turn instead of one-at-a-time. The existing step 0 already pushed for terse narration, but didn't address the bigger waste — sequential tool calls when parallel would work. Common case: reading handler + test + schema to triangulate a bug — three reads in one turn, not three turns. Also nudges away from "talking-then-doing": if the next action is unambiguous, just take it. Describing intent before every call is the dead weight that adds up to 30-50% extra round-trips. Behavior fix only (prompt-level). Model can still narrate inside its thinking channel since that's a model property; this targets the chat/tool-use channel where the user pays per turn. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-29 15:42:22 +02:00
Mikael Hugo	c5df4b46a6	fix(headless): await auto loop in headless mode	2026-04-29 15:37:17 +02:00
Mikael Hugo	df614a3e47	fix(headless): split idle-timeout role from deadlock-backstop role The single IDLE_TIMEOUT_MS constant was conflating two different jobs: "are we done?" vs "is the agent stuck?". For multi-turn commands (auto, next, discuss, plan), the first question is wrong — those signal completion explicitly via "auto-mode stopped" terminal notifications, and child-process exit catches crashes. The 120s I'd just bumped multi-turn to was still in idle-detection mindset; that's not what we need from this timer. New semantics: - IDLE_TIMEOUT_MS = 15s — quick commands (status, queue, …); idle really does mean done. - NEW_MILESTONE_IDLE_TIMEOUT_MS = 120s — bounded creative task with pauses for thinking between bootstrap steps. - MULTI_TURN_DEADLOCK_BACKSTOP_MS = 30 minutes — auto/next/discuss/plan. Not a "done" detector; a deadlock recovery bound. Long enough to never bother slow LLM reasoning or chained tool calls; short enough to recover from a true hang within a reasonable window. Real completion comes from terminal notifications + child-process exit, both already wired. Code reads cleaner too: effectiveIdleTimeout selection now mirrors the three-way conceptual split. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-29 15:18:58 +02:00
Mikael Hugo	c239ad6c9d	fix(headless): use long idle timeout for auto/next/discuss/plan The 15s IDLE_TIMEOUT_MS was killing auto-mode prematurely. Symptom: sf headless auto would dispatch a task, the LLM would make 1-2 tool calls, pause to reason about the next step, exceed 15s of "no events", and headless would declare "Status: complete" — exiting at ~35s with the task barely started (123 events but only 2 tool calls). The 120s NEW_MILESTONE_IDLE_TIMEOUT_MS already exists for the same reason ("LLM may pause between tool calls e.g. after mkdir, before writing files"). The same applies to auto/next/discuss/plan — all multi-turn commands where the LLM thinks longer between actions, especially on non-trivial tasks. isMultiTurnCommand was already defined for related logic; this just wires it into the idle-timeout decision. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-29 15:13:43 +02:00
Mikael Hugo	2afe2ac6f1	feat(prefs): self-aligning template upgrades — sf keeps its own files synced Companion to the earlier schema-versioning framework. Where that handles data-shape evolution via forward migrations, this handles file-template evolution via silent self-rewrite. The user shouldn't have to know: - ensurePreferences() now stamps `last_synced_with_sf: <semver>` in the frontmatter when seeding a new project's PREFERENCES.md, recording the sf version that wrote the template. - New module preferences-template-upgrade.ts: - detectTemplateDrift(prefs) — pure check, returns { fromVersion, toVersion, needsUpgrade }. - upgradePreferencesFileIfDrifted(path, prefs) — silently re-renders the file's frontmatter when fromVersion ≠ toVersion. Body (anything after the closing `---`) is preserved verbatim, so user notes stay. - Wired into loadPreferencesFile() — every read self-aligns. No human warnings, no opt-in flow; sf keeps its own house in order. - last_synced_with_sf added to SFPreferences + KNOWN_PREFERENCE_KEYS so it round-trips through validatePreferences without "unknown key" warnings. Failure modes are non-fatal: missing file, malformed frontmatter, or read-only filesystem all leave the file alone and return the in-memory prefs unchanged. SF_VERSION env var (set by loader.ts) is the source of truth for "current sf"; "0.0.0" sentinel skips upgrade so atypical entry points don't stamp incorrect values. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-29 15:05:37 +02:00
Mikael Hugo	a2b709f669	fix(gitignore): write sf runtime patterns to .git/info/exclude, not .gitignore ensureGitignore was re-adding `.sf`, `.sf-id`, `.bg-shell/` to the project's .gitignore on every sf run, causing two issues: 1. Working-tree churn — every invocation dirtied .gitignore, forcing a commit just to silence "uncommitted changes" warnings. Pattern flagged by user: "is this the right way with its own every run". 2. False-positive duplicate-add — the literal-string check (`existingLines.has(".sf")`) didn't recognize user-equivalent patterns like `/.sf` (root-only) or `.sf/` (with trailing slash), so an explicit user entry got duplicated by the auto-add on next run. Fix: move sf-specific runtime patterns to `.git/info/exclude` via new `ensureGitInfoExclude()`. That file is per-clone (not committed), so re-writing is invisible to git status. The project's `.gitignore` stays human-curated and sf doesn't opinionate on it. `ensureGitignore()` now calls `ensureGitInfoExclude()` first so callers don't need to update — backwards compatible. Generic OS/IDE/lang patterns (.DS_Store, node_modules/, target/, etc.) stay in BASELINE_PATTERNS for .gitignore since those genuinely belong in version control. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-29 14:58:14 +02:00
Mikael Hugo	9b718f8e36	fix(headless): repair missing sf project symlink	2026-04-29 14:43:30 +02:00
Mikael Hugo	3b6cbcd79f	feat(prefs): schema versioning with forward-migration registry Adds the framework for evolving the prefs schema without silently breaking projects pinned to older versions. Each PREFERENCES.md declares `version: N`; sf declares CURRENT_PREFERENCES_SCHEMA_VERSION in code. On load: - prefs.version === current → no-op - prefs.version < current → run registered migrations in chain (forward only, pure functions). Missing migration in the chain throws — bumping the schema version requires a matching Migration entry, by construction. - prefs.version > current → warn "prefs from a newer sf, fields may be ignored", preserve the value so a later upgrade reads correctly. - prefs.version undefined → assume v1 (legacy file pre-versioning) and warn so the user adds an explicit pin. Migration registry is empty for now (current schema version stays at 1) — the framework is in place so the first real schema bump is a one-line addition, not a refactor. Drift detection (`checkPreferencesDrift`) is also the natural surface for future deprecated-key / missing-required-field checks when CLAUDE.md / template comparisons are added. Wired into validatePreferences() so every load path gets the new behavior automatically — no caller changes needed. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-29 14:38:43 +02:00
Mikael Hugo	6248e79a7a	feat(init): auto-seed PREFERENCES.md with detected verification_commands Without this, every fresh project inherits sf's user-level dogfooding defaults (npm run typecheck:extensions, test:sf-light) — which run sf's own dev scripts against unrelated repos and produce universal false negatives. Hit in dr-repo (Go): T01-VERIFY.json showed all_fail because those npm scripts don't exist there, even though T01's actual work passed verification per its SUMMARY. - ensurePreferences() now calls detectProjectSignals() and embeds the auto-detected commands in the YAML frontmatter on first init. Detection failure is non-fatal — falls back to the bare template. - detectVerificationCommands() Go branch now handles multi-module repos (no root go.mod, only nested ones — common pattern for repos like dr-repo/{dr-agent,portal,gateway,installer,cmd/installer}). Generates a per-module loop instead of running go vet/test from the repo root, which would fail since each subdir is its own Go module. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>	2026-04-29 14:26:49 +02:00
Mikael Hugo	a8cf2cd941	feat(workflow): add product-audit (slim port) Milestone-end workflow that compares declared product intent (VISION.md, RUNBOOKS.md, etc.) against actual code/test/deploy/docs evidence and emits structured gaps with severity. Soft gates — adds follow-up slices but doesn't hard-block merge. Slim port (4 new files + 1 registration) — extracts only the audit feature itself, not bunker's parallel rewrite of dispatch/prompts/ benchmark-selector that came with it in commit 2aa785475. Created: - prompts/product-audit.md — prompt verbatim, gsd_→sf_ and .gsd→.sf - tools/product-audit-tool.ts — slim file-write implementation, atomicWriteAsync to .sf/active/{mid}/ PRODUCT-AUDIT.{json,md}; no DB deps - bootstrap/product-audit-tool.ts — pi-coding-agent tool registration, TypeBox schema for sf_product_audit - workflow-templates/product-audit.md — workflow template Modified: - bootstrap/register-extension.ts — 2 lines: import + add to nonCriticalRegistrations - workflow-templates/registry.json — registry entry - package.json — version 2.75.0 → 2.75.1 Verdict logic (no-gaps | gaps-found | contract-underspecified) is the load-bearing innovation: contract-underspecified forces the auditor to flag unverifiable docs as a real gap rather than rubber-stamping no-gaps when the product contract is silent. Out of scope: phase enum changes, dispatch hookup. Wire-up to the phase machine is a follow-up; the prompt + tool + template stand alone. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 13:55:23 +02:00
Mikael Hugo	2eebeccb93	feat(search): add MiniMax web search provider New search backend alongside tavily/brave/serper/exa/ollama. API key resolution: MINIMAX_CODE_PLAN_KEY → MINIMAX_CODING_API_KEY → MINIMAX_API_KEY (fallback order matches MiniMax's documented aliases). Wired through every existing seam: - type union: SearchProvider = 'tavily' | 'minimax' | 'brave' | 'ollama' - VALID_PREFERENCES set + selection logic in provider.ts - native-search routing (Anthropic native web_search delegates correctly) - /search-provider CLI command (tab completion, select UI, parser) - tool-search.ts: search execution path - tool-llm-context.ts: prefetch / context-builder path - preferences-types + preferences-validation - configuration.md user docs - extension-manifest description Tests not added in this commit — the bunker reference tests don't match our preferences/provider export shape (we have serper/exa/combosearch that bunker doesn't). Tests for getMiniMaxSearchApiKey priority order, resolveSearchProvider returning "minimax", /search-provider minimax CLI behavior, no-key error messages, and executeMiniMaxSearch request shape are TODO. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 13:55:04 +02:00
Mikael Hugo	dff0df5fdc	fix(headless): suppress notification spam, categorize messages, distinguish phase vs status Three small UX fixes for headless / autopilot logs: 1. Add `zz-notifications` to TUI_FOOTER_STATUS_KEYS — these are sticky notification dots from the interactive TUI footer; they have no meaning in headless and were spamming the log. 2. Categorize notification messages by prefix so headless output is scannable: [mcp] for MCP-client-ready, [search] for web search status, [parallel] for slice-parallel/subagent dispatch. Falls through to the existing important/non-important formatting for everything else. 3. Distinguish phase transitions from generic status updates: phase:/ milestone:/slice:/task: prefixed keys get [phase]; everything else gets [status]. Previously both used [phase], which was misleading. Patterns based on bunker commits 14ec4d97f / c15afb45f (which were the research source) but written fresh against our existing TUI_FOOTER_STATUS_KEYS structure rather than cherry-picked. The assistant-text-preview commit (cf0274c63) is a separate, larger refactor in headless.ts and is deferred to v3.1. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 13:43:40 +02:00
Mikael Hugo	c41912ff55	fix(prompts): tell agents about Serena (repo-intelligence MCP) for code exploration We have .serena/ configured (cache, memories, project.local.yml) but no prompt mentioned Serena anywhere. Agents weren't using it for symbol lookup or cross-file architecture mapping; they fell straight to rg/find. Added a one-sentence Serena hint to the code-exploration step in: - research-slice.md - research-milestone.md - plan-slice.md - plan-milestone.md - guided-research-slice.md Phrased generically ("If a repo-intelligence MCP (e.g. Serena) is configured...") so it degrades cleanly when Serena isn't set up. Pattern based on bunker commit 4ba746888 but written fresh against our post-rename prompt structure rather than cherry-picked. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 13:41:33 +02:00
Mikael Hugo	b24f426f2b	batch: snapshot of in-flight v2 work This commit captures uncommitted modifications that accumulated in the working tree across multiple in-progress workstreams. It is a snapshot to clear the deck before sf v3 work begins; individual workstreams should land separately on top of this. Notable additions: - trace-collector.ts, traces.ts, src/tests/trace-export.test.ts — trace export plumbing - biome.json — Biome linter configuration - .gitignore — exclude native/npm/*/.node compiled binaries The bulk of the diff is across src/resources/extensions/sf/ (301 files) and src/resources/extensions/sf/tests/ (277 files), reflecting the ongoing sf extension work. Specific feature commits should follow this snapshot rather than being archaeology'd out of it. The 76MB native/npm/linux-x64-gnu/forge_engine.node compiled binary was left out of the commit — it's now gitignored and built locally. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-29 12:42:31 +02:00
Mikael Hugo	6eaf5926ad	sf snapshot: uncommitted changes after 248m inactivity	2026-04-28 21:10:17 +02:00
Mikael Hugo	d30d91bf2f	sf snapshot: uncommitted changes after 41m inactivity	2026-04-28 17:01:26 +02:00
Mikael Hugo	5d3c204006	fix(git-merge): no auto-flip from approved to declined; cached approval is sticky Codex-rescue output (a299c461 / bnr88iy59) — the 'Git merge approved once' followed seconds later by 'Git merge declined by user' bug we hit on M002 complete-milestone. Same gate, same agent run, opposite verdicts. Single source of truth for the merge-gate state in guardrails/index.ts. Approval is now sticky — re-asks return the cached approval until consumed or explicitly revoked, never auto-flip to decline. Timeout converts to pause+log instead of decline. Adds tests/safe-git-merge-gate.test.ts. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Co-Authored-By: OpenAI Codex <noreply@openai.com>	2026-04-28 16:20:08 +02:00
Mikael Hugo	d38e5ea092	fix(schema): auto-coerce string → [string] for sf_* list fields + provider_model_allow tests Two codex-rescue tasks landed together: 1. Auto-coerce JSON-schema validator: when a tool field declares {type:"array", items:{type:"string"}} and the model sends a single string, wrap it in [string] before validation instead of hard-rejecting. Fixes the recurring "keyDecisions: must be array" rejection on sf_complete_task that wasted retries. 2. Provider_model_allow filter (proper implementation with helpers): - resolveProviderModelAllowList / isProviderModelAllowed / filterModelsByProviderModelAllow helpers in preferences-models - Wired into model-registry and auto-model-selection - New tests/provider-model-allow.test.ts Tools coerced: sf_complete_task, sf_complete_milestone, sf_plan_milestone, sf_plan_slice, sf_replan_slice, sf_reassess_roadmap (key list fields). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Co-Authored-By: OpenAI Codex <noreply@openai.com>	2026-04-28 12:30:55 +02:00
Mikael Hugo	f98a1e360e	batch: codex-rescue session output (multiple in-flight tasks) Combined output of multiple parallel codex-rescue runs that produced working-tree edits but didn't commit. Tasks contributing: - prefs: per-provider model allow-list (provider_model_allow) — manual - TUI scroll + unresponsive (a7884d1a / bt3fpn4y2) - planningMeeting required (aa09e904 / br127l763) - Logs UX 4-pack (a5c65314 / btcplhu7f) - Gate auto-resolve + completion nudge (ae4c8b64 / bw1w1fjkp) - sf_task_complete atomic + retry (a7a079b4 / b20cy5owv) - Multi-model meeting + minimax M2.7 + draft promotion (a756faac / task-moifjknd-lwjc98) - Per-role slice prompts (a94c3e1a) - Per-role vision-meeting prompts (afd165a0 / task-moifple5-lcwtjl) - Schema sweep (ac994b1e / task-moifq7pu-83coqz) - Flow audit (ad26ecfd / bttj4vrqm) Typecheck passes. Tests not run as a full suite — spot-check after merge. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> Co-Authored-By: OpenAI Codex <noreply@openai.com>	2026-04-28 11:52:42 +02:00
Mikael Hugo	66ff949c11	cherry-pick(security): harden project-controlled surfaces (PR #4755 partial) Cherry-pick of gsd-build/gsd-2 65ca5aa2e — applies the security hardening hunks that conflicted minimally: - mcp-server/env-writer: validate writes against a strict allowlist - web/api/files: enforce path containment via web/lib/secure-path - vscode-extension: read binaryPath/autoStart only from trusted global/default scopes (resolveTrustedSfStartupConfig), avoiding workspace-controlled override (renamed Gsd → Sf for sf naming) - New regression tests: mcp-client-security, vscode-startup-security, web-files-symlink Skipped hunks (drifted): mcp-server/server.ts, mcp-client/index.ts, mcp-server/README.md. Co-Authored-By: Jeremy <jeremy@fluxlabs.net> Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-28 05:37:07 +02:00
Mikael Hugo	bf727173e7	cherry-pick(file-lock): make file-lock actually lock and throw on contention Cherry-pick of gsd-build/gsd-2 a09e01640 — withFileLockSync now actually acquires a proper-lockfile (was previously a no-op when proper-lockfile wasn't required) and throws on ELOCKED contention by default. Adds onLocked: 'skip' option for best-effort callers that tolerate dropped entries (audit, journal). Modernizes import style (createRequire/join from imports rather than ad-hoc require). Path-renames preserved (gsd-pi → sf-run). Co-Authored-By: Jeremy <jeremy@fluxlabs.net> Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-28 05:28:36 +02:00
Mikael Hugo	22d4579690	cherry-pick(state): lock-wrapped appends for journal, audit, workflow-logger Cherry-pick of gsd-build/gsd-2 53babec29 — lock-wrapped append half. Wraps appends to .sf/journal/, .sf/audit/events.jsonl, and the workflow-logger error log in withFileLockSync (onLocked: skip), preserving best-effort semantics while preventing torn writes under contention. Companion to the atomic-write half landed in `3df56cb94`. Path-renames (gsdRoot→sfRoot, gsd-db→sf-db) preserved during conflict resolution. Co-Authored-By: Jeremy <jeremy@fluxlabs.net> Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-28 05:27:44 +02:00
Mikael Hugo	f1f4b840e1	cherry-pick(doctor): self-heal symlinked .sf staging to prevent silent data loss Cherry-pick of gsd-build/gsd-2 9340f1e9b (#4423) — doctor self-heal detection for symlinked staging directories that can cause silent data loss. Skips native-git-bridge.ts and git-service test (drifted). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-28 05:25:56 +02:00
Mikael Hugo	7fd4672e55	cherry-pick(auto): handle worktree context fallback + sanitize paused session paths Cherry-pick of gsd-build/gsd-2 a4f78731f — handles worktree context fallback and sanitizes paths in paused session resumption. Skips uok-plan-v2-wiring test hunk (drifted in sf). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-28 05:25:40 +02:00
Mikael Hugo	93402643f4	cherry-pick(sf-db): tolerate corrupt task arrays in milestone rows Cherry-pick of gsd-build/gsd-2 851507913 (#4056) — defensive parsing so a corrupt or non-array tasks blob in a milestone row doesn't crash sf-db reads. Test hunk skipped (sf-db.test.ts has drifted). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-28 05:25:21 +02:00
Mikael Hugo	3df56cb94f	cherry-pick(state): atomic-writes for guided-flow-queue and reports Cherry-pick of gsd-build/gsd-2 53babec29 (Jeremy <jeremy@fluxlabs.net>) — atomic-write half only. Eliminates torn-write risk on PROJECT.md queue sync and reports.json/HTML index regeneration by switching writeFileSync → atomicWriteSync (tmp+rename). The companion lock-wrapped-append changes (journal.ts, uok/audit.ts, workflow-logger.ts) are deferred — they need proper-lockfile + withFileLockSync helper introduced first. Co-Authored-By: Jeremy <jeremy@fluxlabs.net> Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-28 05:16:39 +02:00
Mikael Hugo	8e827147c9	feat(code-intelligence): add sift indexer backend alongside project-rag Generalize the code-intelligence hook to support multiple indexer backends, with sift (rupurt/sift) as a new option next to the existing project-rag MCP server. Backend is selected via CodebaseMapPreferences. - code-intelligence.ts: new abstraction + sift backend (detect, resolve, status, context-block contribution) - preferences-types.ts: codebaseIndexer field (project-rag | sift | none) - preferences-validation.ts: validate the new field - bootstrap/system-context.ts, commands-codebase.ts: dispatch on backend - tests/code-intelligence.test.ts: sift detection/resolution/status tests (19 pass, 0 fail) project-rag path unchanged and continues to work. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-28 05:05:26 +02:00
Mikael Hugo	0606983d97	feat(subagent): add background job manager and tests SubagentBackgroundJobManager tracks long-running subagent jobs with status, abort support, and TTL-based eviction of completed results. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-28 04:18:17 +02:00
Mikael Hugo	efd5e14e0a	feat: add FEATURES.md capability map and generator Human-oriented documentation of SF capabilities, with a script that keeps it in sync with workflow-tools.ts and extension manifests. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-28 04:18:12 +02:00
Mikael Hugo	0d286b991b	sf snapshot: pre-dispatch, uncommitted changes after 2902m inactivity	2026-04-27 23:42:51 +02:00
Mikael Hugo	f0da5b6d21	fix: bind getProviderAuthMode to registry instance to avoid undefined 'this' Extracting a class method as a bare reference loses its 'this' context, causing 'Cannot read properties of undefined' when minimax (or any provider) triggers the flat-rate auth-mode lookup. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-25 19:22:39 +02:00
Mikael Hugo	7289933909	fix: populate memoriesSection in execute-task prompt and fix stale dist buildExecuteTaskPrompt was not passing memoriesSection to loadPrompt, causing headless auto to fail with a template variable error. Also updated plan-slice-prompt.test.ts to supply the four template variables (memoriesSection, runtimeContext, phaseAnchorSection, gatesToClose) that were missing from the test fixture. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-25 18:46:55 +02:00
Mikael Hugo	a30a7692e3	fix: dist-redirect.mjs incorrectly rewrites .js→.ts for node_modules paths containing /src/ The resolver guarded on context.parentURL.includes('/src/') to identify in-repo source files, but @google/gemini-cli-core installs to node_modules/@google/gemini-cli-core/dist/src/ which also contains '/src/'. Relative imports from that dist package (e.g. './config/config.js') were incorrectly rewritten to './config/config.ts', causing ERR_MODULE_NOT_FOUND on every test that transitively imports the google-gemini provider. Fix: add !context.parentURL.includes('/node_modules/') guard. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-25 18:04:23 +02:00
Mikael Hugo	2e32c96fa0	Port gsd2 functional parity: turn-epoch, abandon-detect, reapplyThinking, exec chain, memory chain, onboarding-state - auto/turn-epoch.ts: AsyncLocalStorage-backed stale-write dropping for timeout recovery - journal.ts: isStaleWrite() guard drops superseded turn writes - auto/run-unit.ts: wrap agent_end Promise.race in runWithTurnGeneration - auto/session.ts: ThinkingLevelSnapshot type + autoModeStartThinkingLevel/originalThinkingLevel fields - auto-model-selection.ts: reapplyThinkingLevel() called after every successful setModel() - auto/phases.ts: pass autoModeStartThinkingLevel to selectAndApplyModel + hook override restore - abandon-detect.ts: two-signal milestone abandon detection in rewrite-docs overrides - auto-post-unit.ts: use detectAbandonMilestone + parkMilestone in rewrite-docs handler - preferences-types.ts: ContextModeConfig + isContextModeEnabled - exec-sandbox.ts: sandboxed bash/node/python subprocess with .sf/exec/ persistence - exec-history.ts: read-side scan of .sf/exec/*.meta.json - compaction-snapshot.ts: ≤2 KB markdown digest written before context compaction - tools/exec-tool.ts: sf_exec MCP tool executor - tools/exec-search-tool.ts: sf_exec_search MCP tool executor - tools/resume-tool.ts: sf_resume MCP tool executor - bootstrap/exec-tools.ts: registers sf_exec/sf_exec_search/sf_resume - memory-relations.ts: knowledge-graph edges between memories (traverseGraph) - tools/memory-tools.ts: capture_thought/memory_query/sf_graph executors - bootstrap/memory-tools.ts: registers capture_thought/memory_query/sf_graph - bootstrap/register-extension.ts: wire exec-tools + memory-tools into registration - onboarding-state.ts: onboarding completion record at ~/.sf/agent/onboarding.json Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-25 10:58:39 +02:00
Mikael Hugo	5887ea3fd1	port gsd2: blocked-models gate, milestone-summary classifier, unsupported-model recovery blocked-models.ts (new): Persistent per-project blocklist at .sf/runtime/blocked-models.json. loadBlockedModels / isModelBlocked / blockModel (file-lock-safe write). milestone-summary-classifier.ts (new): classifyMilestoneSummaryContent → "success" | "failure" | "unknown". isTerminalMilestoneSummaryContent: failure summaries are NOT terminal — lets auto-mode re-enter a milestone after a failed recovery summary. state.ts: Phase 1 (completeMilestoneIds) and Phase 2 (registry) now check isTerminalMilestoneSummaryContent before treating a SUMMARY as complete. A failure SUMMARY no longer prematurely parks a milestone. error-classifier.ts: Add "unsupported-model" ErrorClass kind with regex detection (model + not-supported/unavailable/no-access + account/plan/tier). Checked before "permanent" so /account/i in PERMANENT_RE doesn't swallow it. auto-model-selection.ts: Wire isModelBlocked() gate in selectAndApplyModel candidate loop: skips provider-rejected models and continues to fallbacks. bootstrap/agent-end-recovery.ts: Handle cls.kind === "unsupported-model": blockModel(), try fallback chain skipping already-blocked models, pause if no usable fallback. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-25 10:13:27 +02:00
Mikael Hugo	6cb6de4fd2	perf: parallelize I/O, add runtime cache, extend nix devenv - unit-context-composer: resolve artifact keys in parallel (Promise.all) - unit-runtime: add in-memory cache to avoid repeated disk reads per dispatch - auto-timers: share 15s idle watchdog tick with context-pressure check - auto-prompts: 1s TTL budget cache to coalesce repeated loadEffectiveSFPreferences calls - native-git-bridge: extend nativeHasChanges TTL 10s→30s - auto-dashboard: remove pulsing dot animation (CPU churn, no UX value) - flake.nix: add nodePackages.typescript to dev shell Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-25 10:12:32 +02:00
Mikael Hugo	12aabd863e	port gsd2 #4769 : worktree telemetry, slice-cadence, canonical-root fix + /sf scan Ports commit 7fb35ca58 from gsd2 (PR #4769) covering four issues: #4761 — resolveCanonicalMilestoneRoot in worktree-manager.ts routes validate-milestone through the live worktree path instead of stale project-root state when a milestone worktree is active. #4762 — auditOrphanedMilestoneBranches in auto-start.ts now surfaces in-progress milestone branches with unmerged commits ahead of main (previously only complete milestones were audited). Gated on isClosedStatus so parked/other closed statuses are unaffected. #4764 — worktree-telemetry.ts: typed emit helpers (emitWorktreeCreated, emitWorktreeMerged, emitWorktreeOrphaned, emitAutoExit, emitWorktreeSync, emitCanonicalRootRedirect, emitSliceMerged, emitMilestoneResquash) plus summarizeWorktreeTelemetry aggregator and nearest-rank percentile(). Wired in: worktree-resolver.ts (create/merge events), auto-start.ts (orphan telemetry), auto.ts stopAuto (auto-exit with normalized reason), worktree-manager.ts (canonical-root-redirect). Surfaced in forensics.ts via detectWorktreeOrphans and Worktree Telemetry sections. #4765 — slice-cadence.ts: mergeSliceToMain squash-merges each slice's commits onto main as soon as the slice passes validation (opt-in via git.collapse_cadence: "slice"). resquashMilestoneOnMain collapses N per-slice commits into one milestone commit at completion. Wired in auto-post-unit.ts (slice merge after complete-slice with stopAuto on conflict/error) and worktree-resolver.ts (resquash at mergeAndExit). AutoSession.milestoneStartShas tracks the pre-first-slice SHA. GitPreferences and preferences-validation.ts extended with collapse_cadence and milestone_resquash fields. Also ports /sf scan command: commands-scan.ts with parseScanArgs, resolveScanDocuments, buildScanOutputPaths, and handleScan dispatching a focused codebase assessment prompt to .sf/codebase/. journal.ts: 9 new JournalEventType values for the telemetry events. All changes are additive; default behavior (cadence="milestone") unchanged. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-25 09:03:56 +02:00
Mikael Hugo	2911d3b93d	port gsd2: reassess-roadmap opt-in (ADR-003 §4) + prefer toolDefinition.label reassess-roadmap: flip default from true → false. Most reassess units conclude "roadmap is fine" burning a session for no change; the plan-slice prompt now carries a JIT preamble at zero cost. (#4778) tool-execution: always prefer toolDefinition.label when non-empty, even when label === name — allows tools to display their canonical name explicitly. (#4758) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-25 08:33:50 +02:00
Mikael Hugo	d4cdcb582d	port gsd2 #3338 : ecosystem plugin loader for .sf/extensions/ Adds support for project-local SF extension plugins dropped in .sf/extensions/. Trust-gated (requires pi trust), symlink-escape safe. - ecosystem/sf-extension-api.ts: SFExtensionAPI wrapper exposing getPhase() and getActiveUnit() to third-party handlers; updateSnapshot refreshes state before_agent_start so handlers see current phase/unit - ecosystem/loader.ts: discovers .sf/extensions/*.js, loads them via dynamic import, dispatches factory(api) for each - register-extension.ts: initializes ecosystemHandlers array, wires loader - register-hooks.ts: before_agent_start refreshes snapshot then dispatches ecosystem handlers before returning SF system prompt - types.ts: SFActiveUnit interface (milestoneId/sliceId/taskId + titles) - workflow-logger.ts: "ecosystem" added to LogComponent union Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-25 08:27:55 +02:00
Mikael Hugo	6c36d62f35	port gsd2 #4961 : stop using active-tool snapshot as model-policy gate
Fixes a bug where per-unit tool narrowing poisoned the policy gate for subsequent units, causing "Model policy denied dispatch before prompt send" errors on complete-slice and discuss-milestone (100% Win repro).
Four-part port from gsd2@817031b2a:
- ModelPolicyDispatchBlockedError class with per-model deny reasons
- TOOL_BASELINE WeakMap + clearToolBaseline/restoreToolBaseline lifecycle
- auto-model-selection: use getRequiredWorkflowToolsForAutoUnit as requiredTools
- auto/loop: catch ModelPolicyDispatchBlockedError as non-retryable (pause)
- auto.ts: wire clearToolBaseline at startAuto (fresh only) and stopAuto
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-25 08:15:04 +02:00
Mikael Hugo	4fdd8700a3	port gsd2 upstream features: scope classifier, composer v2, GPT-5.5, test timeout
- milestone-scope-classifier: add getMilestonePipelineVariant + milestoneRowToScopeInput wired into auto-dispatch trivial-skip for research/validation phases (#4781)
- auto-prompts: rename GSD→SF identifiers, add isSummaryCleanForSkip, prefs param on checkNeedsReassessment, buildExtractionStepsBlock from commands-extract-learnings
- unit-context-manifest + unit-context-composer: port v2 typed computed artifacts (#4924)
- skill-manifest: per-unit-type skill filter resolver (#4788, #4792)
- escalation: stub for ADR-011 mid-execution escalation (full port deferred)
- auto-start: extract decideSurvivorAction for testability (#4832)
- models: add gpt-5.5 + gpt-5.4-mini to cost table, router, and models.generated.ts
- types: EscalationArtifact, context_window_override, skip_clean_reassess, mid_execution_escalation, sketch_scope on SliceRow
- tool-execution: add visibleWidth import (was undefined)
- package.json: add --test-timeout=30000 to prevent parallel tests from freezing machine
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-25 08:08:11 +02:00
Mikael Hugo	e2147c0694	sf snapshot: pre-dispatch, uncommitted changes after 43m inactivity	2026-04-25 06:34:49 +02:00
Mikael Hugo	7b6c9dd099	sf snapshot: pre-dispatch, uncommitted changes after 4703m inactivity	2026-04-25 05:51:29 +02:00
ace-pm	c744bdf6c1	fix: atomic writes, parse radix, lossy json, silent worker spawn
8 fixes from 3rd-pass scan:
1. web/components/sf/tempCodeRunnerFile.tsx: remove orphan VS Code 'Code Runner' artifact (850+ lines duplicated from shell-terminal.tsx). Unreferenced but compiled into tsc project.
2. sf/phase-anchor.ts: writePhaseAnchor used plain writeFileSync; a crash mid-write would corrupt the handoff checkpoint that readPhaseAnchor then silently returns null for, losing cross-phase context. Switched to atomicWriteSync (already used by sibling files).
3. sf/forensics.ts: same non-atomic writeFileSync on active-forensics.json marker. Race with a concurrent reader produces an empty object and the forensics session is lost. Switched to atomicWriteSync.
4. web/auto-dashboard-service.ts: paused-session.json existence was the intended signal but a corrupt body silently dropped the paused flag so the UI showed active. Now reports paused on file existence regardless of body integrity, and warns on corruption.
5. sf/visualizer-data.ts: doctor-history.jsonl parser did .map(JSON.parse) inside an outer catch. One corrupt line discarded 19 valid entries. Per-line try/catch preserves the valid rows.
6. sf/files.ts: three parseInt calls without radix (step, total_steps, totalSteps), also missing || 0 fallback for NaN.
7. cli.ts: parseInt(process.versions.node) without radix. Split on '.' and use radix 10 explicitly.
8. sf/slice-parallel-orchestrator.ts: silent 'catch {}' around spawn() masked worker-spawn failures as 'no workers available'. Matches sibling parallel-orchestrator.ts pattern; now logs via logWarning.
Skipped from the scan (need a real lock mechanism, not safe as a one-line fix):
- sf/auto-dispatch.ts:164 (UAT counter race)
- sf/captures.ts:107 (CAPTURES.md append race)
Deferred (low-value):
- preferences-models.ts, key-manager.ts, auto-timers.ts silent catches
- dead variable in visualizer-data.ts
- google-gemini-cli.ts maxTokens clamp interaction
tsc --noEmit green at root.	2026-04-21 02:13:10 +02:00
ace-pm	51b65fd490	fix: symlink extensions + silent catches masking real errors
Real bugs from 2nd-pass scan:
1. extension-registry.ts: discoverAllManifests skipped symlinked extension dirs because Dirent.isDirectory() returns false for symlinks. Dev-workflow symlinks under ~/.sf/agent/extensions/ were invisible to list/enable/disable/info. Matches the regression documented in symlink-extension-discovery.test.ts: the test inlines the correct logic, but this callsite still had the buggy form. Now accepts isDirectory() || isSymbolicLink().
2. headless.ts SIGINT handler: client.stop() failures were double-silenced (inner .catch(()=>{}), outer try{}catch{}). Interactive mode logs stop errors to stderr. Restored head/headless parity: still fire-and-forget (exit code is forced via process.exit) but failures are observable.
3. openai-codex-responses.ts SSE parser: malformed data frames were silently dropped so broken streams looked identical to clean ones. Now debug-logs the parse error with the chunk context so broken streams are distinguishable in logs. Stream continues on bad chunk (one bad frame shouldn't kill the whole generation).
4. web/cleanup-service.ts generated script: bare 'catch {}' around four native git calls (nativeBranchList, nativeDetectMainBranch, nativeBranchListMerged, nativeForEachRef). A failed main-branch detection silently left mainBranch undefined-shaped, then the next native call operated on garbage. Now emits console.warn so failures surface in the subprocess log.
5. web/undo-service.ts generated script: git revert failure was silenced; when --no-commit failed, user saw commitsReverted=0 with no reason. Now logs the revert error before attempting --abort (abort itself remains best-effort silent).
False positives from the same scan (investigated and dismissed):
- auto-worktree.ts #2505: code uses ':(exclude).sf/milestones' pathspec + shelter-and-restore, which is a better fix than the 'drop --include-untracked' approach the test comment describes. Test comment is stale; source is correct.
- Lifecycle handler unhandled rejections across 5 extensions: extensions/runner.ts already try/catches handler invocations and routes to emitError. Wrapping the individual handlers would be redundant.	2026-04-21 02:01:41 +02:00
ace-pm	0f94341b43	fix(loader): fall back to src/resources when SF-WORKFLOW.md missing from dist Build sometimes copies dist/resources/extensions/ without the top-level markdown files (observed: SF-WORKFLOW.md absent in dist/resources/ while extensions/ was present). existsSync(distRes) was true either way, so SF_WORKFLOW_PATH pointed at a non-existent path and /sf failed with ENOENT. Check for the specific file instead of the directory.	2026-04-21 01:39:18 +02:00
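
Commit c744bdf6c1 (item 5) describes replacing a whole-file `.map(JSON.parse)` with a per-line try/catch so that one corrupt line in doctor-history.jsonl no longer discards the valid entries. The sketch below illustrates that pattern only. `parseJsonLines` and its return shape are hypothetical names chosen for this example, not the code in sf/visualizer-data.ts.

```typescript
// Hypothetical helper illustrating per-line JSONL parsing; it is not the
// actual implementation in sf/visualizer-data.ts.
function parseJsonLines(text: string): { rows: unknown[]; skipped: number } {
  const rows: unknown[] = [];
  let skipped = 0;
  for (const line of text.split("\n")) {
    const trimmed = line.trim();
    if (trimmed === "") continue; // blank lines are not entries
    try {
      rows.push(JSON.parse(trimmed));
    } catch {
      skipped += 1; // a corrupt line is counted, the other rows survive
    }
  }
  return { rows, skipped };
}

// The second line is truncated JSON; the other two still parse.
const sample = '{"ok":true}\n{"ok":tr\n{"ok":false}\n';
const result = parseJsonLines(sample);
console.log(result.rows.length, result.skipped); // prints: 2 1
```

Returning the skipped count, rather than swallowing it, keeps corruption observable, which matches the direction of the other fixes in the same commit.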

2522 commits
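
Commit 12aabd863e mentions a nearest-rank percentile() in worktree-telemetry.ts. As a rough illustration of the nearest-rank method itself, assuming a 0 to 100 percentile scale and an `undefined` result for empty input (both are assumptions, and the real signature may differ):

```typescript
// Hypothetical sketch of the nearest-rank method; the signature and edge-case
// handling in worktree-telemetry.ts are not shown in the log and may differ.
function percentile(values: number[], p: number): number | undefined {
  if (values.length === 0) return undefined;
  if (p < 0 || p > 100) throw new RangeError("p must be between 0 and 100");
  const sorted = [...values].sort((a, b) => a - b); // numeric sort on a copy
  // Nearest rank: smallest 1-based rank that covers at least p percent of samples.
  const rank = Math.max(1, Math.ceil((p / 100) * sorted.length));
  return sorted[rank - 1];
}

const durationsMs = [120, 15, 40, 300, 75];
console.log(percentile(durationsMs, 50)); // 75
console.log(percentile(durationsMs, 90)); // 300
```

Nearest-rank always returns an observed sample rather than an interpolated value, which is convenient when reporting telemetry over small sample counts.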