singularity/singularity-forge

Author	SHA1	Message	Date
Mikael Hugo	59a37c1080	fix(sf): move scan.md skillActivation from dangling end to Instructions step 0 The {{skillActivation}} placeholder was at the very bottom of scan.md, after the 'Report sf-internal observations' section, with no header or context. Since the default prompt-loader provides a one-sentence 'use the SF Skill Preferences block...' instruction, it landed as an orphan footer the agent only encountered AFTER finishing the scan. Move it to step 0 of the numbered Instructions so the agent activates skills before exploring the codebase, matching the research-slice and plan-milestone pattern. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 17:53:54 +02:00
Mikael Hugo	61485c5bef	fix(sf): remove legacy completion tool aliases	2026-05-02 17:51:38 +02:00
Mikael Hugo	1891ccbdcd	chore(sf): delete orphaned commands-debug + debug-session-store `/sf debug` was ported in `360208cba` but never wired up: - handleDebug exported but no caller anywhere in the tree - not in commands/catalog.ts - loadPrompt("debug-session-manager") and loadPrompt("debug-diagnose") referenced prompts that never existed in prompts/ — guaranteed runtime crash if the dispatch path were ever hit - debug-session-store.ts only consumed by commands-debug.ts - no tests reference any of it 887 LOC of dead code with a latent crash. Removing both files eliminates the orphan-prompt callsite that gap-audit kept flagging and the broken dispatch path. Resolves sf-moohvyzc-ll5bd0. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 17:42:28 +02:00
Mikael Hugo	e07f2bc225	fix(sf): add depth calibration to research-milestone prompt Mirror the tiered Deep/Targeted/Light breakdown that research-slice.md already had — same structure, milestone-scoped wording. Add explicit '## Steps' header so the numbered steps no longer flow visually out of the calibration paragraph. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 17:39:23 +02:00
Mikael Hugo	8032ee6144	fix(sf): gap-audit detects prompts loaded by direct filesystem read Orphan-prompt detection only checked loadPrompt() callsites. Three prompts (heal-skill, product-audit, review-migration) are loaded by direct readFileSync of "<name>.md" — they got false-flagged as orphans. Add a literal-filename check so any source file containing "<name>.md" counts as a load. Cheap one-pass grep, same shape as the existing loadPrompt patterns. Verified with live runGapAudit: 0 new findings (was previously logging the 3 false positives every session_start). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 17:29:44 +02:00
Mikael Hugo	617608347d	fix(sf): align auto-mode prompts to canonical sf_task_complete / sf_slice_complete Auto-mode prompts called legacy aliases (sf_complete_task, sf_complete_slice) while guided used canonical (sf_task_complete, sf_slice_complete). The divergence was locked in by the test 'auto execute-task requires legacy completion alias until prompt contract is aligned' — explicit tech debt marker. Migrated: - workflow-mcp.ts getRequiredWorkflowToolsForAutoUnit: returns canonical - prompts/execute-task.md: 4 callsites - prompts/complete-slice.md: 3 callsites - prompts/reactive-execute.md: any (none on this file) - workflow-mcp.test.ts: assertion + transport-error fixtures - Test rename: 'requires legacy completion alias' → 'requires canonical' The aliases stay registered (sf_complete_task → sf_task_complete) so external callers and old session resumes don't break. Tool-naming.test.ts still asserts both names route to the same handler. Resolves: sf-moohqbza-yyq8sd. Tests: workflow-mcp + tool-naming 29/29 pass. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 17:25:53 +02:00
Mikael Hugo	9d13c7ef49	chore(sf): delete orphan templates/reassessment.md 29-line template with zero callers. inlineTemplate("reassessment") isn't called anywhere; reassess-roadmap.md prompt has its own inline structure. Removing prevents drift between dead template and live prompt. Resolves: orphan-template-reassessment. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 17:21:42 +02:00
Mikael Hugo	21663be282	fix(sf): add depth calibration to plan-milestone prompt Mirror plan-slice + research-slice + research-milestone: 3-tier Calibrate Depth (Deep / Targeted / Light) with explicit Light tier authorizing 1-2-slice decompositions for focused well-scoped work. Prevents the synthesized over-decomposition pattern where every milestone produced 4-5 slices regardless of scope. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 17:18:14 +02:00
Mikael Hugo	012862fc9a	fix(sf): add depth calibration to plan-slice prompt plan-slice was force-deep on every dispatch — full multi-task decomposition + long architectural narration regardless of slice complexity. research-slice has a 3-tier Calibrate Depth section (Deep / Targeted / Light) that lets the agent right-size; plan-slice now mirrors it. Light tier explicitly authorizes 1-task plans for well-understood work (CRUD, config changes, established-pattern wiring) — preventing the synthesized 4-task decompositions that were a likely contributor to recurring runaway-guard pauses on planning units. Resolves: sf-moohebyg-y0hnhq. Tests: plan-slice-prompt 16/16 still pass. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 17:13:52 +02:00
Mikael Hugo	b9375656ca	fix(sf): stop on contradictory roadmap slice counts	2026-05-02 17:13:06 +02:00
Mikael Hugo	8133ba9003	fix(sf): avoid parallel research redispatch loops	2026-05-02 17:08:36 +02:00
Mikael Hugo	71ce87b981	fix(sf): await scoped dispatch messages	2026-05-02 16:57:41 +02:00
Mikael Hugo	364a1e000e	fix(sf): compact feedback view and animate progress	2026-05-02 16:43:54 +02:00
Mikael Hugo	fbee428196	fix(sf): record sessionId+sessionFile in auto.lock at acquire time acquireSessionLock now accepts an optional sessionInfo arg (sessionId, sessionFile) and writes both into the initial lockData JSON. The caller in auto-start.ts:382 reads them from ctx.sessionManager. updateSessionLock already writes these fields per-dispatch; this closes the gap at acquire time. Lets observers correlate the live auto.lock with the .sf/sessions/ event log (e.g. flow-auditor agents, dashboard, doctor). Resolves: sf-moocx6lv-9grpvt (active-auto-session-pointer-missing). Tests: 32/32 in session-lock + auto-start. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 16:27:08 +02:00
Mikael Hugo	8814e0b8ce	fix(sf): avoid self-reload on dirty inline fixes	2026-05-02 16:26:06 +02:00
Mikael Hugo	f5290e41aa	fix(sf): reload after self-feedback inline fixes	2026-05-02 16:12:23 +02:00
Mikael Hugo	a4059e5871	fix(sf): add 'hook' to LogComponent + use it in hook-emitter The auto-drain shipped hook-emitter.ts:80,93 logWarning calls with component "hook-emitter" but that string wasn't in the LogComponent union, blocking tsc compilation. Add 'hook' to the union (consistent with the existing short component names like 'tool', 'dispatch', 'timer') and update the two callsites. Without this, tsc fails and dist/resource-loader.js (which contains the new verifyManifestFilesExist fix) can't update — leaving the ask-user-questions.js boot failure unresolved despite the source-side fix landing in aa7d3f10a. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 16:08:16 +02:00
Mikael Hugo	644187c73e	fix: resolve 10 high-severity self-feedback inline-fix issues - gap-audit prompt detection: Add DYNAMICALLY_LOADED_PROMPTS set for prompts loaded through wrappers (research-slice, plan-slice, execute-task, etc.) and detect loadPrompt calls with comma-separated args (#sf-moobj36l-ewu7js) - gap-audit command detection: Detect exact match, prefix match, and switch/case patterns for command dispatch (#sf-moobj36o-n8b7g9) - empty task summary: Add isValidTaskSummary() to require non-empty content with frontmatter or H1 before reconciliation marks task complete (#sf-moobj36o-6rxy6e) - journal write failures: Emit bounded health warning to .write-failures.jsonl on journal write failure with per-session dedup (#sf-moobj36p-ikq3b2) - resource sync manifest divergence: Add verifyManifestFilesExist() to check all manifest-listed files exist on disk after hash match (#sf-moody5qi-8gbwp2) - self-feedback markdown stale: Regenerate SELF-FEEDBACK.md from jsonl on markResolved with resolved entries section (#sf-moobj36p-rlo95i) - self-feedback context bloat: Cap entries to 20 max, 4000 chars, inject compact summaries only with pointer to jsonl for full evidence (#sf-moobj36p-ko6snt) - hook-emitter types: Replace unknown with EventResult discriminated union, implement emitExtensionEvent call with fallback warning when _pi missing (#sf-moobmhwt-bxejb6, #sf-moobmhx4-gk9g83) - export visualizer types: Add VisualizerExportData interface with proper PhaseAggregate/SliceAggregate/ModelAggregate/ProjectTotals types replacing any (#sf-moobmhx0-ow5fhy) - native-edit-bridge: Already resolved (artifact removed from repo) (#sf-moobj36q-z4id3u)	2026-05-02 16:03:52 +02:00
Mikael Hugo	c61f848f79	fix(sf): make reload work in interactive sessions	2026-05-02 15:52:31 +02:00
Mikael Hugo	a48cf9beb0	refactor(sf): rename sift cache env to SIFT_SEARCH_CACHE Switches the per-project sift warmup runtime dir field from cacheHome (generic XDG_CACHE_HOME) to searchCache (specific SIFT_SEARCH_CACHE). Narrower env var only redirects sift's search index, leaving sift's other XDG_CACHE_HOME consumers (model downloads etc.) on the global ~/.cache/sift path so models are shared across projects. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 15:33:41 +02:00
Mikael Hugo	c4ac851187	fix(sf): isolate sift warmup cache per project	2026-05-02 15:24:14 +02:00
Mikael Hugo	f21890addb	fix(sf): cap sift warmup and add minimax coverage	2026-05-02 15:13:16 +02:00
Mikael Hugo	7e1eff46a2	fix(sf): remove unwired native edit bridge	2026-05-02 15:02:26 +02:00
Mikael Hugo	9f773815d1	fix(sf): repair doctor orphan cleanup	2026-05-02 14:34:16 +02:00
Mikael Hugo	f990ce1048	test(sf): cover manual rate command	2026-05-02 14:26:31 +02:00
Mikael Hugo	d4e094b408	fix(sf): surface agent-end ordering failures	2026-05-02 14:25:44 +02:00
Mikael Hugo	c19d987894	fix(sf): wire /sf rate to manual ops dispatcher /sf rate was advertised in commands/catalog.ts and reachable from auto-mode but had no branch in the manual ops handler — typing /sf rate outside auto-mode silently no-op'd because ops.ts had no trimmed.startsWith("rate ") branch. Add the dispatch alongside the existing /sf todo branch using the same lazy-import pattern. handleRate from commands-rate.ts already exists. Resolves: sf-monzctqn-m42nlq (command-dispatch-gap). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 14:24:23 +02:00
Mikael Hugo	a8f0c63b0a	fix(sf): contain research unit dispatch	2026-05-02 14:23:01 +02:00
Mikael Hugo	64b46fcb8a	fix(sf): self-heal stale auto locks before resume	2026-05-02 14:10:16 +02:00
Mikael Hugo	bba5a7f143	fix(headless): ignore pasted prose on orchestrator stdin	2026-05-02 14:08:08 +02:00
Mikael Hugo	3d0ebd981f	fix(sf): drain self-feedback into repair turns	2026-05-02 13:59:22 +02:00
Mikael Hugo	983a2e0a44	refactor(sf): rename BACKLOG.md → SELF-FEEDBACK.md (matches jsonl SoT) The forge-local human-readable file was misnamed — it's sf-internal self- reports, not a generic project backlog. The jsonl source-of-truth is already self-feedback.jsonl; the markdown should match. Renames: - File: BACKLOG.md → SELF-FEEDBACK.md - Constant: BACKLOG_HEADER → SELF_FEEDBACK_HEADER - Constant: BACKLOG_MAX_CHARS → SELF_FEEDBACK_MAX_CHARS - Function: appendBacklogRow → appendSelfFeedbackRow - Function: loadBacklogBlock → loadSelfFeedbackBlock (parallel session) - Prompt file: prompts/triage-backlog.md → prompts/triage-self-feedback.md (parallel session) - Module: triage-backlog.ts → triage-self-feedback.ts (parallel session) - Header: "# SF Self-Feedback Backlog" → "# SF Self-Feedback" Doc/text refs across prompts (execute-task, complete-milestone, triage-self-feedback) and helper modules (gap-audit, requirement-promoter, db-tools, system-context) updated to .sf/SELF-FEEDBACK.md. Migration: new exported migrateLegacyBacklogFilename() in self-feedback.ts runs at session_start (wired in register-hooks.ts) — renames the legacy BACKLOG.md → SELF-FEEDBACK.md once, idempotent + non-fatal. system-context's loadSelfFeedbackBlock also reads either name during the transition. system-context.ts: BACKLOG_MAX_CHARS retained but raised earlier from 2000 to 8000 with all-entries-fit-or-truncate-tail (separate commit). The SoT mtime-cache and per-severity rendering remain as before. Tests: 77/77 pass across UOK + upstream-bridge + triage-self-feedback. Not done in this commit (next iteration): - Direct-drain dispatch at session_start for high/critical (subprocess spawn). - Queue promotion for medium severity. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 13:52:57 +02:00
Mikael Hugo	6a492079b9	fix(sf): speed resource sync and expand backlog context	2026-05-02 13:42:50 +02:00
Mikael Hugo	51aec5616f	feat(sf): surface high/critical inline-fix candidates at session_start When SF starts and the still-blocked self-feedback drain finds entries at severity high/critical, emit a separate warning notification listing the candidate IDs + kinds. Visible in the SF UI on session start; operator (or a follow-up auto-dispatcher) can drain them without leaving the session. Read-only signal for now — no auto-dispatch yet. The hook lives next to the existing still-blocked summary in register-hooks.ts session_start. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 13:37:09 +02:00
Mikael Hugo	7053938f7d	fix(gemini): keep cli tools in pi harness	2026-05-02 13:32:05 +02:00
Mikael Hugo	98fe3b605d	fix(gemini): route cli retry and quota through core	2026-05-02 13:20:10 +02:00
Mikael Hugo	14c0412ee4	test(sf): re-align two static-analysis tests with refactored sources deferred-commit.test.ts: stagedPendingCommit-to-commitStaged proximity threshold bumped 500 → 1500 chars. Recent refactors added ~95 chars of pre-commit code between the false-assignment and the call. Invariant preserved (false assigned BEFORE commit); the proximity check is informational, not load-bearing. skipped-validation-completion.test.ts: regex assertion updated to match the source's [\s-] character class (no \\-). The test was checking for [\\s\\-] but the actual regex at auto-dispatch.ts:1369 uses [\s-] (legal — hyphen at end of char class). Same semantic, correct shape. UOK + skip-by-preference behavior unchanged. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 13:14:46 +02:00
Mikael Hugo	3c3000c25f	fix(auth): use gemini cli credentials outside sf store	2026-05-02 13:08:41 +02:00
Mikael Hugo	cb2ab66d4f	feat(sf): UOK production hardening — diff capture, exit symmetry, commit-gate Three production gaps Codex's adversarial review flagged are now closed: 1. Real legacy-vs-UOK parity diff (per turn, per plane): - parity-diff-capture.ts captures plan / graph / model-policy / audit-envelope / gitops decisions for both paths and emits ParityDiffEvent records to .sf/runtime/uok-parity.jsonl. - parity-report.ts aggregates divergencesByPlane, populates criticalMismatches with real divergence summaries, and tracks enterEvents / exitEvents / missingExitEvents for symmetry. 2. Exit-event symmetry: - sessionId / turnId now flow through enter+exit parity events. - writeParityHeartbeat lets kernel/loop-adapter emit best-effort diagnostics on plane failure paths so missing-exit gaps shrink. 3. Commit-gating on divergence or missing-exit: - resolveParitySafeGitAction (in uok/gitops.ts) reads the parity report and downgrades turn_action to status-only when divergence count > 0 or missing-exit count > 0 — UOK can no longer commit on top of unverified state. - auto-post-unit.ts now resolves a configuredTurnAction from UOK flags then asks the parity gate for the safe action; the gate's decision is what flows to the actual git op. - new test: tests/uok-gitops-commit-gate.test.ts. - existing gitops-wiring assertion updated for the renamed configuredTurnAction (semantic preserved). Tests: 53/53 UOK pass. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 12:57:48 +02:00
Mikael Hugo	85a0188fe1	fix(sf): stabilize auto notices and package checks	2026-05-02 12:39:27 +02:00
Mikael Hugo	ed2c4af729	test(sf): align verification-gate + workflow-mcp tests with current reality verification-gate "real lint fails → gate fails with exit code 1" was asserting biome exits 1, but biome currently exits 0 (warnings only, no errors). Reframe to verify the gate captures the lint exit code faithfully regardless of biome's verdict — that's the contract we actually care about, not whether the codebase happens to have lint errors. workflow-mcp client timeouts bumped 30s → 60s. Test passes in isolation in 8.5s but flakes under full-suite cold-cache load when the MCP stdio round-trip exceeds 30s. 60s gives breathing room without losing real-bug signal. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 12:16:44 +02:00
Mikael Hugo	e0fd2076d3	test: Investigated R102 symlink dedup: canonicalizePath already exists… SF-Task: S01/T07	2026-05-02 12:00:56 +02:00
Mikael Hugo	3915dfda3a	chore(vitest): bump testTimeout 30s→60s to absorb cold-import latency Cold vitest+esbuild module-graph imports take 16-25s on this repo (dynamic imports of captures.js and friends). The 30s testTimeout was racing the import phase, producing 30s spurious failures across dev-engine-wrapper, ensure-db-open, workflow-mcp, sf-tools, verification-gate, hook-key-parsing, visualizer-overlay, and others — all timing out at exactly ~30s with no real assertion failure. Also bumps hookTimeout symmetrically. Re-running the affected files: 147/147 pass. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 11:53:31 +02:00
Mikael Hugo	44204e0424	chore(sf): add optional token telemetry	2026-05-02 11:50:34 +02:00
Mikael Hugo	ff60f5f62f	test(sf): make worktree suites explicit	2026-05-02 11:40:18 +02:00
Mikael Hugo	26be0b4153	fix(sf): stabilize headless auto flow	2026-05-02 11:34:41 +02:00
Mikael Hugo	12538bbfa3	sf snapshot: pre-dispatch, uncommitted changes after 32m inactivity	2026-05-02 11:25:51 +02:00
Mikael Hugo	3edc35a7ea	feat(sf): UOK parity safety + verification gate hard-kill Three small fixes for UOK rollout debuggability and gate reliability: 1. parity-report.ts: writeParityReport now writes via atomic temp+rename so the report file is never partially written on disk full / crash. parseParityEvents now skips whitespace-only lines without recording error events. 2. verification-gate.ts: spawnSync gate commands use killSignal: SIGKILL so npm/node grandchildren actually exit when the deadline fires (default SIGTERM was being caught by shell wrappers, leaving lingering children that out-lived the deadline). 3. session_start drain (bootstrap/register-hooks.ts) now reads .sf/runtime/uok-parity-report.json and notifies the operator on criticalMismatches, fallbackInvocations, or status errors. New helper module uok-parity-summary.ts encapsulates the read+summarize logic with 8 tests. Tests: parity-report 5/5, parity-summary 8/8, verification-gate 87/87. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 10:52:52 +02:00
Mikael Hugo	75a4f35ea5	test(sf): fix zombie-cleanup test pollution from sibling-stop changes Adding the new "cancelled" worker state in `1fdaae5c7` didn't itself break the test, but the existing afterEach hooks (placed inside each test body) weren't reliably resetting the orchestrator singleton between runs. M002 leftover from test #2 was leaking into test #3, breaking the "all cached workers in error state" assertion. Add a top-level beforeEach that always resets the orchestrator before each test so the shared module-level state can't leak across the file. afterEach blocks remain for tmpdir cleanup. All 4 tests now pass. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 10:48:45 +02:00
Mikael Hugo	1fdaae5c77	feat(sf): parallel sibling-stop opt-in When one parallel worker fails, siblings keep running (and burn budget) by default. Add an opt-in cascade so dependent parallel work stops on first failure instead of producing wasted output. - CLI: /sf parallel start --stop-on-failure - Pref: parallel.stop_on_failure (default false) - Journal: parallel-cancelled-by-sibling event (workerId, triggeringWorkerId, kind) - State: cancelled (vs error) so post-hoc reporting distinguishes "I failed" from "a sibling failed and I was cancelled" - Cancellation: graceful via existing file-IPC stop signal + SIGTERM Side fix: after → afterAll in worktree-bugfix.test.ts (vitest API). Tests: 10/10 in parallel-stop-on-failure.test.ts; 38/38 across the worktree + parallel test set. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-02 09:39:13 +02:00

1 2 3 4 5 ...

3875 commits