Commit graph

3806 commits

Author SHA1 Message Date
frizynn
23d0ea656d refactor: extract shared file lock utilities
Extract the duplicated file lock mechanism from auth-storage.ts and
session-manager.ts into a shared lock-utils.ts module.

- acquireLockSyncWithRetry(): throwing variant (used by auth-storage)
- tryAcquireLockSync(): non-throwing variant (used by session-manager)
- acquireLockAsync(): async lock with retries and staleness detection

Removes ~55 lines of duplicated retry-loop logic. The shared module
also provides a foundation for deduplicating identical patterns in
settings-manager.ts and models-json-writer.ts.
2026-03-19 15:16:56 -03:00
frizynn
a78121b35c refactor: consolidate resource loader with generic update/dedupe methods
- Replace dedupePrompts() and dedupeThemes() with generic dedupeResources<T>()
  that accepts getName/getPath/resourceType callbacks
- Replace discoverSystemPromptFile() and discoverAppendSystemPromptFile() with
  generic discoverFileInSearchPaths(filename)
- Import ResourceCollision type for use in dedupeResources signature
- Net reduction of 24 lines (868 → 844) with elimination of duplicated logic
2026-03-19 15:16:21 -03:00
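A minimal sketch of the generic dedupe helper described in the commit above. The callback names `getName` and `getPath` come from the commit message; the shape of `ResourceCollision`, the first-entry-wins policy, and passing `resourceType` as a plain string are assumptions made for illustration.

```typescript
// Hypothetical collision record; the real ResourceCollision type is not shown in the commit.
interface ResourceCollision {
  resourceType: string;
  name: string;
  winnerPath: string;
  loserPath: string;
}

// Keep the first resource seen for each name and record later duplicates as collisions.
function dedupeResources<T>(
  resources: T[],
  getName: (resource: T) => string,
  getPath: (resource: T) => string,
  resourceType: string,
): { unique: T[]; collisions: ResourceCollision[] } {
  const seen = new Map<string, T>();
  const collisions: ResourceCollision[] = [];
  for (const resource of resources) {
    const name = getName(resource);
    const winner = seen.get(name);
    if (winner === undefined) {
      seen.set(name, resource);
    } else {
      collisions.push({
        resourceType,
        name,
        winnerPath: getPath(winner),
        loserPath: getPath(resource),
      });
    }
  }
  return { unique: Array.from(seen.values()), collisions };
}
```

With this shape, prompts and themes can share one code path by passing different accessors instead of maintaining one dedupe function per resource type.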
frizynn
c6c45cb1c0 refactor: consolidate model switching logic in agent-session
Extract the duplicated model-switching sequence (set model, append
session change, persist settings, re-clamp thinking level, emit event)
from setModel(), _cycleScopedModel(), and _cycleAvailableModel() into a
shared _applyModelChange() helper. Removes ~30 lines of repeated code.
2026-03-19 15:16:18 -03:00
Derek Pearson
f2657e1ba0 fix: release stranded bootstrap locks and handle re-entrant reacquire (#1352)
Release session locks on bootstrap abort paths and reset same-process lock state before re-acquiring so stale proper-lockfile callbacks cannot poison a fresh auto-mode session. Adds regression coverage for bootstrap cleanup and re-entrant lock acquisition.
2026-03-19 12:12:06 -06:00
frizynn
166243bfe5 refactor: extract shared helpers in compaction module
Move duplicated patterns from compaction.ts and branch-summarization.ts
into shared utilities in utils.ts:

- getMessageFromEntry(): unified entry-to-message conversion with
  optional toolResult skipping for branch summarization
- collectMessages(): replaces three identical for-loops that collect
  AgentMessages from entry ranges
- extractTextContent(): replaces five instances of the
  .filter(text).map(text).join() pattern
- createSummarizationMessage(): replaces three identical user-message
  construction blocks for LLM summarization calls

Net reduction of ~90 lines of duplication.
2026-03-19 14:56:00 -03:00
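As an illustration of the `.filter(text).map(text).join()` pattern that `extractTextContent()` replaces, a helper of this kind could look like the sketch below. The `ContentBlock` type and the newline separator are assumptions; the actual signature in utils.ts is not shown in the commit.

```typescript
// Hypothetical content block union; the real message content types may differ.
type ContentBlock =
  | { type: "text"; text: string }
  | { type: "image"; data: string };

// Collect the text of every text block and join it with a separator.
function extractTextContent(content: ContentBlock[], separator = "\n"): string {
  return content
    .filter((block): block is Extract<ContentBlock, { type: "text" }> => block.type === "text")
    .map((block) => block.text)
    .join(separator);
}
```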
frizynn
385d936689 refactor: deduplicate toPosixPath, ZERO_USAGE, and shortenPath utilities
- toPosixPath: remove private copies in skills.ts and package-manager.ts,
  import from canonical utils/path-display.ts
- ZERO_USAGE: export from agent-loop.ts, replace inline zero-usage
  objects in agent.ts and proxy.ts
- shortenPath: extract to shared modes/interactive/utils/shorten-path.ts,
  import in tool-execution.ts and session-selector.ts
2026-03-19 14:55:30 -03:00
frizynn
02b42bfffe refactor: consolidate 9 emit methods in extension runner into shared invokeHandlers
Extract the duplicated loop pattern (create context, iterate extensions,
iterate handlers, try/catch, emitError) into a private invokeHandlers()
helper. Each emit method is now a thin wrapper that delegates the
iteration to invokeHandlers and provides only its result-processing
callback. Net reduction of ~124 lines with identical runtime behavior.
2026-03-19 14:55:18 -03:00
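The loop pattern named in the commit above (create context, iterate extensions, iterate handlers, try/catch, emitError) can be sketched roughly as follows. Only the name `invokeHandlers` comes from the commit; the parameter list, `HandlerContext`, and the `Extension` shape are hypothetical.

```typescript
// Hypothetical types for illustration only.
interface HandlerContext { eventName: string }
type Handler<E, R> = (event: E, ctx: HandlerContext) => R | undefined;
interface Extension<E, R> { name: string; handlers: Handler<E, R>[] }

// Shared iteration: each emit method supplies only its result-processing callback.
function invokeHandlers<E, R>(
  extensions: Extension<E, R>[],
  eventName: string,
  event: E,
  onResult: (result: R) => void,
  emitError: (extensionName: string, error: unknown) => void,
): void {
  const ctx: HandlerContext = { eventName };
  for (const extension of extensions) {
    for (const handler of extension.handlers) {
      try {
        const result = handler(event, ctx);
        if (result !== undefined) onResult(result);
      } catch (error) {
        // A failing handler is reported but does not stop the remaining handlers.
        emitError(extension.name, error);
      }
    }
  }
}
```

Under this design an emit method shrinks to a call to `invokeHandlers` plus a small callback that decides what to do with each handler result.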
frizynn
bbc180a3b6 refactor: consolidate extension type guards and inline handler type aliases
Replace 7 individual ToolResultEvent type guards (isBashToolResult,
isReadToolResult, etc.) with a unified isToolResultEventType() function,
mirroring the existing isToolCallEventType() pattern.

Inline 14 handler type aliases (SendMessageHandler, SetModelHandler, etc.)
directly into the ExtensionActions interface since they were only used there
and added no semantic value.

Update documentation examples to use the new unified guard.
2026-03-19 14:55:00 -03:00
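A unified guard in the spirit of `isToolResultEventType()` might be written as below. The event interfaces and the argument order are assumptions; only the function name and the idea of replacing per-tool guards come from the commit.

```typescript
// Hypothetical event shapes; the real ToolResultEvent union has more members.
interface BashToolResultEvent { type: "tool_result"; toolName: "bash"; exitCode: number }
interface ReadToolResultEvent { type: "tool_result"; toolName: "read"; content: string }
type ToolResultEvent = BashToolResultEvent | ReadToolResultEvent;

// One generic guard narrows the union by tool name instead of one guard per tool.
function isToolResultEventType<N extends ToolResultEvent["toolName"]>(
  toolName: N,
  event: ToolResultEvent,
): event is Extract<ToolResultEvent, { toolName: N }> {
  return event.toolName === toolName;
}

// Example: inside the branch, TypeScript knows the event is a BashToolResultEvent.
function describe(event: ToolResultEvent): string {
  if (isToolResultEventType("bash", event)) {
    return `bash exited with ${event.exitCode}`;
  }
  return `read returned ${event.content.length} characters`;
}
```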
frizynn
e01e994d82 refactor: consolidate duplicate patterns in LSP module
- Remove duplicate SYMBOL_KIND_NAMES from types.ts (keep SYMBOL_KIND_LABELS in utils.ts)
- Export which() from config.ts, import in lspmux.ts instead of duplicating
- Inline getLspServersForFile/getLspServerForFile thin wrappers in index.ts
- Extract formatLocationResults() helper for definition/type_definition/implementation
- Extract formatCallHierarchyResults() helper for incoming_calls/outgoing_calls
- Remove unused formatPosition() from utils.ts
2026-03-19 14:54:47 -03:00
Tom Boucher
eaf0538150 fix: add JS fallbacks for wrapTextWithAnsi and visibleWidth when native addon unavailable (#1418) (#1428)
On older Linux distributions (e.g., RHEL 8 with older glibc), the native
Rust addon fails to load. The proxy throws on every function call, but
wrapTextWithAnsi and visibleWidth in pi-tui had no JS fallback — causing
an uncaught crash during TUI rendering.

Fix: Both functions now catch native throws and fall back to JS
implementations (simple word-wrap and ANSI-strip length).

Fixes #1418
2026-03-19 10:49:13 -06:00
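The fallback idea for `visibleWidth` can be sketched as follows. `nativeVisibleWidth` is a stand-in that always throws, imitating the proxy when the addon fails to load, and the ANSI pattern covers only common CSI sequences. This is an assumption-laden simplification, not the pi-tui implementation, and it ignores wide characters.

```typescript
// Stand-in for the native binding; it throws like the proxy does when the addon is missing.
function nativeVisibleWidth(_text: string): number {
  throw new Error("native addon unavailable");
}

// Matches common CSI escape sequences such as "\x1b[31m" or "\x1b[0m".
const ANSI_PATTERN = /\x1b\[[0-9;]*[A-Za-z]/g;

// Try the native path first; on any throw, fall back to ANSI-strip length.
function visibleWidth(text: string): number {
  try {
    return nativeVisibleWidth(text);
  } catch {
    return text.replace(ANSI_PATTERN, "").length;
  }
}
```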
Tom Boucher
8b0727c0e5 fix: accumulate session cost independently of message array (#1423)
getSessionStats() calculated cost by summing usage from assistant messages
in state.messages. After auto-compaction, pre-compaction messages are
replaced by a compactionSummary with no usage field — dropping the cost.

Fix: Added cumulative accumulators (_cumulativeCost, _cumulativeInputTokens,
_cumulativeOutputTokens, _cumulativeToolCalls) that are incremented on
every assistant message event, independent of the message array.
getSessionStats() now returns max(array-sum, cumulative) to ensure
monotonically non-decreasing values.

Fixes #1423
2026-03-19 12:44:11 -04:00
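To make the accumulator fix concrete, here is a reduced sketch that tracks only cost. The class and method names are hypothetical; the idea of incrementing a cumulative value on every assistant message and returning the larger of the array sum and the cumulative value follows the commit description.

```typescript
// Hypothetical, trimmed-down message shape.
interface SessionMessage {
  role: "assistant" | "compactionSummary";
  usage?: { cost: number };
}

class SessionCostTracker {
  private messages: SessionMessage[] = [];
  private cumulativeCost = 0;

  // Called for every assistant message event, independent of later compaction.
  onAssistantMessage(cost: number): void {
    this.messages.push({ role: "assistant", usage: { cost } });
    this.cumulativeCost += cost;
  }

  // Compaction replaces earlier messages with a summary that has no usage field.
  compact(): void {
    this.messages = [{ role: "compactionSummary" }];
  }

  // Taking the max keeps the reported cost from decreasing after compaction.
  getSessionCost(): number {
    const arraySum = this.messages.reduce((sum, m) => sum + (m.usage?.cost ?? 0), 0);
    return Math.max(arraySum, this.cumulativeCost);
  }
}
```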
Jeremy McSpadden
c048aa2e7a fix: resolve CI failures — scope provider check, fix Windows path, correct severity
Three CI regressions from the initial commit:

1. doctor.test.ts "two blocking errors" assertion broke (expected 2, got 3):
   The provider check fired on any project with an active milestone, including
   CI environments with no API key. Fix: change provider_key_missing severity
   from "error" to "warning". A missing key is advisory — it blocks future
   dispatch but doesn't corrupt existing state, analogous to env_git_remote.

2. doctor-runtime.test.ts stranded_lock_directory fails on Windows:
   proper-lockfile uses advisory file locking on Windows, not the directory-based
   mechanism (.gsd.lock/). The check and tests are POSIX-specific. Fix: skip
   both stranded_lock_directory tests on Windows with process.platform guard,
   same pattern used by worktree and branch tests.

3. doctor-checks.ts used root.split("/").pop() which is not cross-platform:
   Windows paths use backslash separators. Fix: replace with basename(root)
   from node:path which is platform-aware. Also add basename to imports.
2026-03-19 11:41:48 -05:00
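The third fix above can be illustrated with a short comparison. The commit uses `basename` from `node:path`, which follows the host platform; this sketch uses the explicit `win32` and `posix` variants from the same module so that it behaves identically on any machine. The helper names and sample paths are hypothetical.

```typescript
import { posix, win32 } from "node:path";

// Naive approach: only splits on forward slashes, so Windows paths come back whole.
function projectNameNaive(root: string): string | undefined {
  return root.split("/").pop();
}

// Platform-aware approach: let the path module handle the separators.
function projectNameWindows(root: string): string {
  return win32.basename(root);
}

function projectNamePosix(root: string): string {
  return posix.basename(root);
}

const windowsRoot = "C:\\Users\\dev\\my-project";
const posixRoot = "/home/dev/my-project";
console.log(projectNameNaive(windowsRoot));   // the full, unsplit Windows path
console.log(projectNameWindows(windowsRoot)); // "my-project"
console.log(projectNamePosix(posixRoot));     // "my-project"
```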
Tom Boucher
5187841c0b feat: auto-generate OpenRouter model registry from API + add missing models (#1407) (#1426)
Added scripts/generate-openrouter-models.mjs that fetches the full model
list from OpenRouter's API and generates TypeScript entries matching the
existing models.generated.ts format. Run with:
  node scripts/generate-openrouter-models.mjs > /tmp/openrouter.ts

Updated the OpenRouter section in models.generated.ts from 241 → 350
models, including all nvidia/nemotron variants requested in the issue.

Fixes #1407
2026-03-19 10:40:34 -06:00
Jeremy McSpadden
ccde39c2c8 fix: close 5 doctor coverage gaps — providers, lock dir, integration branch, orphaned worktrees
Closes the highest-impact gaps identified in the /gsd doctor deep-dive analysis.

**1. Wire provider checks into runGSDDoctor()**
doctor-providers.ts existed and worked but was never called from the main
doctor run. Units could dispatch into guaranteed API failures with no warning.
Now runProviderChecks() is called in runGSDDoctor() and converts required-provider
errors/warnings into DoctorIssue entries with codes:
  - provider_key_missing (error)
  - provider_key_backedoff (warning)

**2. Stranded lock directory detection (doctor-checks.ts)**
proper-lockfile creates a .gsd.lock/ directory as the OS-level lock mechanism.
After SIGKILL or hard crash, this directory can remain stranded, blocking all
future auto-mode sessions from acquiring the lock (#1245 pattern). Doctor now:
  - Detects .gsd.lock/ existing without a live process holding it
  - Reports as stranded_lock_directory (error, fixable)
  - Auto-fix removes the stranded directory

**3. Integration branch existence check (doctor-checks.ts + doctor-proactive.ts)**
When a milestone records an integration branch and that branch is later deleted
or renamed, merge-back will fail silently at the end of the milestone. Doctor now:
  - Checks each active milestone's stored integration branch exists in git
  - Reports as integration_branch_missing (error, not auto-fixable)
  - preDispatchHealthGate blocks dispatch if the active milestone's integration
    branch is missing, preventing work from being dispatched into a dead end

**4. Orphaned worktree directory detection (doctor-checks.ts)**
Worktree removal can fail after a branch delete, leaving a .gsd/worktrees/<name>/
directory that is no longer registered with git. Re-creating the same name fails
with "already exists". Doctor now:
  - Compares .gsd/worktrees/ entries against git worktree list
  - Reports unregistered directories as worktree_directory_orphaned (warning, fixable)
  - Auto-fix removes the orphaned directory

Tests: all new codes covered with detection + fix assertions, including
false-positive safety cases (live lock holder, registered worktrees,
existing integration branch). All 1843 existing tests still pass.
2026-03-19 11:25:20 -05:00
Tom Boucher
f0fe4b2443 fix: emit agent_end after abort during tool execution (#1414) (#1417)
* fix: sync worktree completion artifacts back to external state before merge (#1412)

When a worktree's .gsd/ was a real directory (not symlinked to external
state), milestone completion artifacts (SUMMARY, VALIDATION, updated
ROADMAP) were written locally but never synced back. The project root's
deriveState() read from external state and found no SUMMARY — reporting
the milestone as incomplete.

Changes:
- auto-worktree.ts: Added syncWorktreeStateBack() that copies milestone
  and slice .md files from worktree .gsd/ to the main external state dir
- auto.ts: Call syncWorktreeStateBack() in tryMergeMilestone before the
  git merge, ensuring artifacts are visible from the project root

Fixes #1412

* fix: emit agent_end after abort during tool execution (#1414)

When a user aborts a turn while a tool call is running, the abort RPC
succeeds but agent_end was never emitted. RPC consumers tracking turn
lifecycle via events got stuck in a 'streaming' state permanently.

Fix: After abort() + waitForIdle(), emit a synthetic agent_end if the
agent is no longer streaming. This ensures consumers always see the
turn-complete signal regardless of how the turn ended.

Fixes #1414
2026-03-19 10:24:39 -06:00
Tom Boucher
2d921ecfad fix: add PID self-check to guided-flow crash lock detection (#1398)
guided-flow.ts showed 'Interrupted Session Detected' whenever auto.lock
existed, without checking if the lock was written by the current process.
This caused infinite prompt loops when the current session's own lock
triggered the crash detection.

Fix: Added crashLock.pid !== process.pid check, matching the guard in
auto-start.ts.

Also includes test fixes:
- repo-identity-worktree: macOS /var canonicalization
- resource-loader: partial-build dist/resources fallback
- file-watcher: init delay + timeout for timing stability

Fixes #1398
2026-03-19 11:01:37 -04:00
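The PID self-check reduces to a small predicate. In this sketch the `CrashLock` fields other than `pid` and the function name are assumptions, and the current PID is passed in as a parameter (the commit compares against `process.pid`) so the logic can be exercised in isolation.

```typescript
// Hypothetical lock record; only the pid field is confirmed by the commit message.
interface CrashLock {
  pid: number;
  unitId: string;
}

// Treat the lock as an interrupted session only when another process wrote it.
function isInterruptedSession(crashLock: CrashLock | null, currentPid: number): boolean {
  return crashLock !== null && crashLock.pid !== currentPid;
}
```

Without the PID comparison, the current session's own auto.lock satisfies the "lock exists" test and the prompt reappears on every pass.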
Juan Francisco Lebrero
e6bbd035ba fix: auto-discard bootstrap crash locks and clean auto.lock on exit (#1397)
Two root causes for the false "Interrupted Session Detected" prompt
that appears every time /gsd is run after a normal exit:

1. guided-flow.ts showed the crash recovery menu even for bootstrap
   crashes (unitType="starting", unitId="bootstrap", completedUnits=0)
   where no work was lost. Now these are silently discarded — the menu
   only appears when real auto-mode work was interrupted.

2. session-lock.ts exit handler cleaned the OS lock directory
   (.gsd.lock/) but not the auto.lock metadata file. On next startup,
   readCrashLock() found the stale file and triggered false recovery.
   Now the exit handler also removes auto.lock.
2026-03-19 08:31:15 -06:00
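The first root cause above amounts to recognizing a lock that records no real work. A possible check is sketched below, using the field values quoted in the commit message; the interface and function names are hypothetical.

```typescript
// Hypothetical lock record using the field names quoted in the commit message.
interface CrashLock {
  unitType: string;
  unitId: string;
  completedUnits: number;
}

// A bootstrap crash lost no work, so it can be discarded without prompting.
function isBootstrapCrash(lock: CrashLock): boolean {
  return (
    lock.unitType === "starting" &&
    lock.unitId === "bootstrap" &&
    lock.completedUnits === 0
  );
}
```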
Jeremy McSpadden
efbbcc790d fix(prefs): close merge, validation, serialization, and docs gaps
- mergePreferences(): add auto_visualize and auto_report (both were
  silently dropped when a project prefs file existed alongside global)
- preferences-validation.ts: add validation blocks for auto_visualize,
  auto_report, compression_strategy, and context_selection — all four
  were in KNOWN_PREFERENCE_KEYS and the GSDPreferences interface but
  accepted any value without type-checking
- serializePreferencesToFrontmatter orderedKeys: add skill_staleness_days,
  dynamic_routing, token_profile, phases, parallel, auto_visualize,
  auto_report, verification_commands, verification_auto_fix,
  verification_max_retries, search_provider, compression_strategy,
  context_selection — these were falling through to the arbitrary-order
  fallback loop instead of appearing in consistent positions
- preferences-reference.md: document git.auto_pr, git.pr_target_branch,
  search_provider, compression_strategy, context_selection; add
  deprecation notices for git.commit_docs and git.merge_to_main
- tests/preferences.test.ts: two new test cases covering all four newly
  validated fields (valid values pass, invalid values produce errors)
2026-03-19 08:55:25 -05:00
deseltrus
2dc804a485 fix: harden quick-task branch lifecycle — disk recovery + integration branch guard (#1342)
2026-03-19 07:39:54 -06:00
deseltrus
e6ceb8dfe8 fix: skip verification retry on spawn infra errors (ETIMEDOUT, ENOENT) (#1340)
2026-03-19 07:39:13 -06:00
Jeremy McSpadden
d7bf3d4e72 Improve startup performance with lazy extension loading (#1336)
2026-03-19 07:38:50 -06:00
dan bachelder
b67101c51b fix: keep external GSD state stable in worktrees (#1334)
2026-03-19 07:37:25 -06:00
Tom Boucher
469d2e1e77 ci: upgrade docker/login-action v3 → v4 for Node 24 runtime (#1331)
2026-03-19 07:36:10 -06:00
John Brahy
c47ee71b2d fix: always include reasoning.encrypted_content for OpenAI reasoning models
When using a reasoning model (o1, o3, o4-mini, etc.) without explicitly
setting reasoningEffort or reasoningSummary, the include param was not
set. OpenAI returns a bare rs_... reasoning item ID which gets stored in
thinkingSignature and replayed in the next turn. Since store is false,
OpenAI cannot find the rs_... item server-side and returns a 404.

Fix: move params.include = ["reasoning.encrypted_content"] outside the
reasoningEffort/reasoningSummary guard so it is always set for any
reasoning model. This ensures the encrypted blob is returned and can be
replayed correctly without needing server-side storage.
2026-03-19 02:53:06 -07:00
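The change described above moves one assignment out of a conditional. A reduced sketch of the resulting parameter builder follows. `buildParams`, `isReasoningModel`, and the interfaces are hypothetical; the `include` value and the `store: false` setting are taken from the commit message.

```typescript
// Hypothetical, trimmed-down request and option shapes.
interface RequestParams {
  model: string;
  store: boolean;
  include?: string[];
  reasoning?: { effort?: string; summary?: string };
}
interface ReasoningOptions {
  reasoningEffort?: string;
  reasoningSummary?: string;
}

function buildParams(model: string, isReasoningModel: boolean, options: ReasoningOptions): RequestParams {
  const params: RequestParams = { model, store: false };
  if (isReasoningModel) {
    // Set unconditionally for reasoning models so the encrypted blob can be replayed
    // on the next turn without relying on server-side storage.
    params.include = ["reasoning.encrypted_content"];
    if (options.reasoningEffort || options.reasoningSummary) {
      params.reasoning = {};
      if (options.reasoningEffort) params.reasoning.effort = options.reasoningEffort;
      if (options.reasoningSummary) params.reasoning.summary = options.reasoningSummary;
    }
  }
  return params;
}
```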
John Brahy
211520c2d2 updating .gitignore
2026-03-19 02:22:28 -07:00
John Brahy
85265a99ab docs: add structured issue templates
2026-03-19 01:54:22 -07:00
John Brahy
76e7aec0e8 fix(gsd): avoid EISDIR crash in file loader
2026-03-19 01:40:52 -07:00
John Brahy
6fe2280363 fix(gsd): open existing database on inspect
2026-03-19 01:35:28 -07:00
John Brahy
0c83046a51 docs: add guidance for local custom MCP setup
2026-03-19 01:27:37 -07:00
Tom Boucher
d121c8e3b2 fix: stop excluding all .gsd/ from commits — only exclude runtime files (#1326) (#1328)
smartStage() was excluding the entire .gsd/ directory from git staging,
which is correct when .gsd/ is symlinked to external state. But on
Windows (junction links) or projects where .gsd/ is git-tracked (not
gitignored), this caused a mid-milestone behavioral discontinuity:

1. One-time cleanup removes runtime files from the index
2. After cleanup, nativeAddAll() + nativeResetPaths('.gsd/') causes ALL
   .gsd/ files to be unstaged — including milestone artifacts
3. autoCommit returns null (nothing staged) for the rest of the milestone
4. Work continues silently with no commits, no errors, no warnings
5. Worktree teardown loses all uncommitted .gsd/ artifacts

Fix: replace the blanket '.gsd/' exclusion with targeted RUNTIME_EXCLUSION_PATHS.
Milestone artifacts (.gsd/milestones/, preferences.md, DECISIONS.md, etc.)
are now committed normally when they're tracked. When .gsd/ is in .gitignore
(the default), git add -A already skips it — the reset is a harmless no-op.

Updated git-service.test.ts to verify the new behavior: runtime files
excluded, milestone artifacts committed.

Fixes #1326
2026-03-18 22:06:41 -06:00
Jeremy McSpadden
fc56cdf93e fix: handle ECOMPROMISED in uncaughtException guard and align retry onCompromised (#1322) (#1332)
When a GSD session crashes hard (SIGKILL, OOM, etc.) without running its
exit handler, the proper-lockfile OS lock directory (.gsd.lock/) is left
stranded. On the next /gsd auto resume, acquireSessionLock detects the dead
PID, cleans up the stale directory, and re-acquires via the retry path.

10 seconds later, proper-lockfile's update timer fires. Due to a subtle
interaction between the synchronous fs adapter (lockSync / toSyncOptions)
and the setTimeout boundary in Node.js v25+, the ECOMPROMISED error
propagates up through the synchronous callback chain and becomes an
uncaught exception — even though the onCompromised callback sets
_lockCompromised = true without throwing.

The _gsdEpipeGuard uncaughtException handler only handled EPIPE, so it
re-threw ECOMPROMISED, crashing the process. Each crash wrote a new
"interrupted session" record, causing an infinite crash loop on resume.

Two fixes:

1. index.ts: Handle ECOMPROMISED in _gsdEpipeGuard. Exit with code 1
   (non-zero to signal failure) so the process.once("exit") handler runs
   and removes the lock directory, allowing the next session to start clean.

2. session-lock.ts: The retry path's onCompromised was missing
   `_releaseFunction = null`, unlike the primary path. This left the
   release function pointer live after compromise, causing validateSessionLock
   to return true and preventing graceful stop detection. Now matches primary.
2026-03-18 22:06:03 -06:00
github-actions[bot]
d25c174f8b release: v2.33.1
2026-03-19 04:02:21 +00:00
Jeremy McSpadden
15a8807eb3 fix: clean up stale numbered lock files and harden signal/exit handling (#1315) (#1323)
2026-03-18 21:15:47 -06:00
Tom Boucher
7537e30815 fix: worktree sync and home-directory safety check (#1311, #1317) (#1322)
2026-03-18 21:15:36 -06:00
Tom Boucher
0e4db4b709 docs: update README and docs for v2.33.0 release (#1320)
2026-03-18 21:15:11 -06:00
Tom Boucher
68e0672dda test: add regression harness for auto-mode dispatch loop (125 assertions) (#1319)
2026-03-18 21:14:59 -06:00
Jeremy McSpadden
805c7718c4 chore: remove orphaned mcporter extension manifest (#1318)
2026-03-18 21:14:50 -06:00
github-actions[bot]
106f5d8d32 release: v2.33.0
2026-03-19 02:40:54 +00:00
Tom Boucher
6b61b75f3d feat: add live regression test harness for post-build pipeline validation (#1316)
10 tests that run against the installed gsd binary after npm publish:

1. headless query returns valid JSON
2. Empty project → pre-planning phase
3. Milestone with roadmap → planning phase
4. All tasks done → summarizing phase
5. Complete milestone → complete phase
6. Stale auto.lock doesn't block --version
7. Crash recovery query works with stale lock
8. Non-TTY exits quickly with clean error
9. Version skew detected before TTY check
10. --help works (native addon loads or falls back)

Wired into pipeline.yml test-verify job after fixture tests
and before @next promotion.

These catch the state machine / infrastructure bugs from #1308
that unit tests can't reach — they exercise deriveState through
the real gsd binary with real .gsd/ directory structures.

Part of #1308
2026-03-18 20:22:54 -06:00
Tom Boucher
0418458cf9 refactor: extract tryMergeMilestone to eliminate 4 duplicate merge paths in auto.ts (#1314)
2026-03-18 20:04:10 -06:00
Tom Boucher
583e84e932 refactor: dispatch loop hardening — defensive guards, regression tests, lock alignment (#1310)
2026-03-18 20:03:59 -06:00
TÂCHES
e6ab3b6722 refactor: extract parseUnitId() to centralize unit ID parsing (#1282)
Replaces 30+ inline `unitId.split("/")` + destructuring patterns across
16 production files with a single `parseUnitId()` helper that returns
`{ milestone, slice?, task? }`. If the unit ID format ever changes,
only one function needs updating.

Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-03-18 19:20:08 -06:00
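For reference, a helper with the return shape named in the commit, `{ milestone, slice?, task? }`, could be as small as the sketch below. The sample IDs in the usage lines are made up, and the real `parseUnitId()` may validate its input differently.

```typescript
interface ParsedUnitId {
  milestone: string;
  slice?: string;
  task?: string;
}

// Split a "milestone/slice/task" style ID; trailing segments are optional.
function parseUnitId(unitId: string): ParsedUnitId {
  const [milestone = "", slice, task] = unitId.split("/");
  const parsed: ParsedUnitId = { milestone };
  if (slice !== undefined) parsed.slice = slice;
  if (task !== undefined) parsed.task = task;
  return parsed;
}

// Hypothetical IDs, for illustration only.
console.log(parseUnitId("M001/S01/T02")); // { milestone: "M001", slice: "S01", task: "T02" }
console.log(parseUnitId("M001"));         // { milestone: "M001" }
```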
Tom Boucher
afb438164e fix: align retry lock path with primary lock settings to prevent ECOMPROMISED (#1307)
The retry lock acquisition path (from stale lock recovery in #1251)
used a 5-minute stale threshold and no onCompromised handler, while
the primary path used 30 minutes and a graceful flag-based handler.

This mismatch meant locks acquired via the retry path would throw
ECOMPROMISED (uncaught, crashes process) if the event loop stalled
for >5 minutes — which happens during long LLM operations.

Fixed:
- Stale timeout: 300_000 → 1_800_000 (matches primary)
- Added onCompromised handler (sets _lockCompromised flag)
- Added process.on('exit') safety net (matches primary)

Also: reporter is on Node v25.6.1 which is unsupported — GSD requires
Node >=22.0.0 with 24 LTS recommended.

Fixes #1304
2026-03-18 19:15:47 -06:00
Tom Boucher
150575957d fix: skip symlinks in makeTreeWritable to prevent EPERM on NixOS/nix-darwin (#1303)
makeTreeWritable used statSync which follows symlinks. On NixOS and
nix-darwin, ~/.gsd/agent/bin/ contains symlinks to the immutable Nix
store (/run/current-system/sw/bin/). Attempting to chmod those targets
crashed GSD on startup with EPERM.

Changes:
- Use lstatSync instead of statSync — detects symlinks without
  following them
- Skip symlinks entirely (they don't carry their own permissions, and
  their targets may be immutable)
- Added try/catch around chmodSync as safety net for any remaining
  permission errors on unusual filesystems

Secondary analysis: rmSync with force:true already handles symlinks
correctly (removes the link, not the target). cpSync with force:true
replaces symlinks with regular files (desired behavior for resource
sync).

Fixes #1298
2026-03-18 19:15:33 -06:00
TÂCHES
2a2056bcd7 refactor: extract getErrorMessage() helper to eliminate 65 inline duplicates (#1280)
Consolidate the repeated `err instanceof Error ? err.message : String(err)`
pattern into a single `getErrorMessage(err)` utility. Reduces visual noise in
catch blocks across 20 files in the GSD extension.

Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-03-18 19:12:44 -06:00
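The consolidated pattern is short enough to show in full. This sketch wraps the exact expression quoted in the commit message; the surrounding `try`/`catch` usage and the `safeParse` helper are illustrative.

```typescript
// Normalize anything thrown into a printable message.
function getErrorMessage(err: unknown): string {
  return err instanceof Error ? err.message : String(err);
}

// Typical use in a catch block, where the caught value has type unknown.
function safeParse(json: string): string {
  try {
    JSON.parse(json);
    return "ok";
  } catch (err) {
    return `parse failed: ${getErrorMessage(err)}`;
  }
}
```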
TÂCHES
922826ba8a refactor: consolidate DB-fallback inline functions in auto-prompts (#1276)
* refactor: consolidate DB-fallback inline functions in auto-prompts

Extract shared inlineFromDbOrFile() helper that encapsulates the
repeated pattern of checking DB availability, dynamically importing
context-store, running a query, formatting results, and falling back
to the filesystem. The three public functions (inlineDecisionsFromDb,
inlineRequirementsFromDb, inlineProjectFromDb) become thin wrappers
that pass only the differing query/format logic as a callback.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

* fix: update source-level test to match refactored DB-fallback function name

The context-compression test greps auto-prompts.ts source for
`inlineGsdRootFile(base, "project.md"` which was replaced by
`inlineProjectFromDb(base)` in the consolidation refactor.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

---------

Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-03-18 19:11:01 -06:00
Tom Boucher
8f06a14fb6 docs: update README for v2.32.0 release (#1299)
2026-03-18 18:57:29 -06:00
Tom Boucher
3ae1d54759 fix: handle Windows EPERM on .gsd migration rename with copy+delete fallback (#1296)
2026-03-18 18:57:06 -06:00
Tom Boucher
5660100c66 fix: add actionable recovery guidance to crash info messages (#1295)
2026-03-18 18:56:55 -06:00
Tom Boucher
5351e776d9 fix: resolve main repo root in worktrees for stable identity hash (#1294)
2026-03-18 18:56:43 -06:00