singularity/singularity-forge

Author	SHA1	Message	Date
Lex Christopherson	2d974dfb59	chore: update state — S01 planned, ready for execution	2026-03-18 00:25:26 -06:00
Lex Christopherson	26451ffc2f	docs(S01): add slice plan	2026-03-18 00:24:33 -06:00
Lex Christopherson	1e3d70f06c	chore(M001-1ya5a3/S01): auto-commit after state-rebuild	2026-03-18 00:19:32 -06:00
Lex Christopherson	c6f4cd826b	chore(M001-1ya5a3/S01): auto-commit after research-slice	2026-03-18 00:19:30 -06:00
Lex Christopherson	79052b418e	docs(M001-1ya5a3): context, requirements, and roadmap	2026-03-18 00:15:19 -06:00
Lex Christopherson	2615473dab	fix: update tests for god-file decomposition - token-profile.test.ts: read preferences-types, preferences-models, and preferences-validation alongside preferences.ts for structural checks - triage-dispatch.test.ts: search auto-post-unit.ts for triage/dispatch markers that moved during extraction, update comment markers to match actual code - none-mode-gates.test.ts: skip "no prefs default" test when global preferences file exists (cannot control ~/.gsd/preferences.md) - preferences.test.ts: skip getIsolationMode default test (same reason) Reduces test failures from 48 to 3 (all pre-existing: doctor-git, worktree-e2e, stopAutoRemote). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-18 00:09:34 -06:00
Jeremy McSpadden	155c32e01b	fix: strip model variant suffix for API key auth (#1097 ) (#1099 ) * fix: strip model variant suffix for all auth methods, not just OAuth (#1097) The model ID variant suffix (e.g., `[1m]` in `claude-opus-4-6[1m]`) was only stripped for OAuth token auth. When using an API key, the suffix was sent to the Anthropic API as-is, causing a 400 "upstream_error" because `claude-opus-4-6[1m]` is not a valid API model ID. The default Anthropic model is `claude-opus-4-6[1m]` (1M context variant), so every API key user hits this on every request. Fix: strip `[...]` suffix unconditionally for all auth methods. * fix: update source-reading tests for post-refactor file locations triage-dispatch.test.ts: read auto-post-unit.ts (dispatch logic moved from auto.ts) and update comment string matches to reflect renamed section headers. token-profile.test.ts: read preferences-types.ts, preferences-validation.ts, and preferences-models.ts (GSDPreferences interface and validation logic split from preferences.ts).	2026-03-17 23:31:40 -06:00
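The suffix stripping described in the commit above can be sketched as follows. This is a minimal illustration, not the repository's code: the helper name `stripVariantSuffix` and the exact regular expression are assumptions.

```typescript
// Hypothetical helper: removes one trailing bracketed variant suffix such as "[1m]".
// The commit only states that the `[...]` suffix is stripped for all auth methods;
// the precise pattern used in the project is not shown in the log.
function stripVariantSuffix(modelId: string): string {
  return modelId.replace(/\[[^\]]*\]$/, "");
}

console.log(stripVariantSuffix("claude-opus-4-6[1m]")); // claude-opus-4-6
console.log(stripVariantSuffix("claude-opus-4-6"));     // claude-opus-4-6
```

Applying the helper unconditionally, rather than only on the OAuth path, is the behavior the fix describes.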
Jeremy McSpadden	b20e7b065a	feat: cache-ordered prompt assembly and dashboard cache hit rate (#1094 ) * feat: cache-ordered prompt assembly and dashboard cache hit rate Add prompt section reordering for better Anthropic cache hit rates. Sections are classified as static/semi-static/dynamic and reordered so stable content appears first in the prefix. - prompt-ordering.ts: section extraction, classification, and reordering by cache stability (static -> semi-static -> dynamic) - auto.ts: wire reorderForCaching into dispatch with logged warnings on failure (not silent catch) - auto-dashboard.ts: show cache hit rate percentage in progress widget - dashboard-overlay.ts: show aggregate cache hit rate in status overlay - auto-prompts.ts: respect compression_strategy preference before compressing carry-forward sections Includes 12 tests for reorderForCaching and analyzeCacheEfficiency. Split from #1083 per review feedback. * fix: update source-reading tests for post-refactor file locations triage-dispatch.test.ts: read auto-post-unit.ts (dispatch logic moved from auto.ts) and update comment string matches to reflect renamed section headers. token-profile.test.ts: read preferences-types.ts, preferences-validation.ts, and preferences-models.ts (GSDPreferences interface and validation logic split from preferences.ts).	2026-03-17 23:31:20 -06:00
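The static -> semi-static -> dynamic ordering in the commit above can be illustrated with a stable sort. The `Section` shape and the function name `reorderSections` are assumptions made for this sketch; the real `reorderForCaching` in prompt-ordering.ts also extracts and classifies sections, which is omitted here.

```typescript
// Assumed section model: each prompt section already carries a stability class.
type Stability = "static" | "semi-static" | "dynamic";

interface Section {
  name: string;
  stability: Stability;
  body: string;
}

// Lower rank means "more stable", so it should appear earlier in the prompt prefix.
const RANK: Record<Stability, number> = { static: 0, "semi-static": 1, dynamic: 2 };

function reorderSections(sections: Section[]): Section[] {
  // Array.prototype.sort is stable (ES2019+), so sections within the same
  // class keep their original relative order.
  return [...sections].sort((a, b) => RANK[a.stability] - RANK[b.stability]);
}

const ordered = reorderSections([
  { name: "task", stability: "dynamic", body: "..." },
  { name: "system", stability: "static", body: "..." },
  { name: "roadmap", stability: "semi-static", body: "..." },
  { name: "conventions", stability: "static", body: "..." },
]);
console.log(ordered.map((s) => s.name).join(" > "));
```

Placing stable content first keeps the leading portion of the prompt identical across requests, which is what prefix-based prompt caches match on.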
Jeremy McSpadden	76a834cdf6	feat: add comprehensive API key manager (/gsd keys) (#1089 ) * feat: add comprehensive API key manager (/gsd keys) Add /gsd keys command with 6 subcommands for full API key lifecycle management: list, add, remove, test, rotate, and doctor. - list/status: Dashboard grouped by category (LLM, search, tool, remote) with masked key previews, OAuth expiry, env var source detection - add: Interactive provider picker with OAuth vs API key choice, prefix validation, and env var activation - remove: Multi-key support with individual or bulk removal - test: Lightweight API validation per provider with latency reporting and error classification (401/429/5xx/timeout) - rotate: Remove-and-replace flow with optional pre-save validation - doctor: Health checks for expired OAuth, empty keys, duplicates, env var conflicts, file permissions, missing LLM provider Includes unified provider registry (22 providers), tab completions, and redirect from /gsd setup keys. 44 unit tests. * fix: convert key-manager tests from vitest to node:test for CI typecheck Extension tests use node:test + node:assert/strict (not vitest) since tsconfig.extensions.json includes test files and vitest types are not available in the CI typecheck step.	2026-03-17 22:32:26 -06:00
TÂCHES	5ef52b8a59	refactor: decompose doctor.ts into types, format, and checks modules (#1096 ) Extract three modules from the 1,348-line doctor.ts god file: - doctor-types.ts: DoctorSeverity, DoctorIssueCode, DoctorIssue, DoctorReport, DoctorSummary - doctor-format.ts: summarizeDoctorIssues, filterDoctorIssues, formatDoctorReport, formatDoctorIssuesForPrompt - doctor-checks.ts: checkGitHealth, checkRuntimeHealth All public exports are re-exported from doctor.ts so existing imports from "./doctor.js" continue to work unchanged. Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-17 22:27:38 -06:00
TÂCHES	51c259e778	refactor: extract milestone-ids and guided-flow-queue from guided-flow.ts (#1095 ) - Extract milestone ID utilities (MILESTONE_ID_RE, generateMilestoneSuffix, nextMilestoneId, extractMilestoneSeq, parseMilestoneId, milestoneIdSort, maxMilestoneNum, findMilestoneIds) into milestone-ids.ts (~95 lines) - Extract queue management (showQueue, handleQueueReorder, showQueueAdd, buildExistingMilestonesContext) into guided-flow-queue.ts (~445 lines) - Add re-exports from guided-flow.ts to preserve public API - Fix circular dependency: queue-order.ts now imports milestoneIdSort from milestone-ids.js instead of guided-flow.js - guided-flow.ts reduced from 1611 to 1144 lines Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-17 22:27:35 -06:00
TÂCHES	bf3c17c8de	refactor: decompose preferences.ts, populate skills and models modules (#1091 ) Extract types/interfaces/constants to preferences-types.ts (~200 lines), validation logic to preferences-validation.ts (~490 lines), move skill resolution into preferences-skills.ts (~160 lines), and model resolution into preferences-models.ts (~270 lines). The retained preferences.ts (~330 lines) handles loading, merging, rendering, hooks, and re-exports all symbols so existing imports remain unmodified. Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-17 22:27:21 -06:00
TÂCHES	25d5f60836	refactor: decompose auto.ts into 6 focused modules (#1088 ) Extract 6 cohesive modules from the 3,476-line auto.ts god file, reducing it to 1,732 lines while preserving all external import paths. New modules: - auto-timers.ts (223 lines): Unit supervision timers — soft timeout, idle watchdog, hard timeout, context-pressure monitor - auto-idempotency.ts (150 lines): Completed-key checks, skip loop detection, phantom loop handling, fallback persistence - auto-stuck-detection.ts (220 lines): Dispatch count tracking, lifetime cap, MAX_UNIT_DISPATCHES loop detection, stub recovery. Uses return values instead of calling stopAuto/dispatchNextUnit. - auto-verification.ts (195 lines): Post-unit typecheck/lint/test gate, runtime error capture, dependency audit, auto-fix retry logic - auto-post-unit.ts (585 lines): Split into postUnitPreVerification and postUnitPostVerification — commit, doctor, state rebuild, worktree sync, DB dual-write, hooks, triage, quick-tasks - auto-start.ts (472 lines): Fresh session bootstrap — git/state init, crash lock detection, debug init, worktree setup, DB lifecycle All extracted functions receive AutoSession + context as parameters. No circular dependencies — new modules import from leaf dependencies only, never from ./auto.js. All public exports from auto.ts are preserved so external import paths continue to work unchanged. Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-17 22:26:05 -06:00
TÂCHES	05fa939c11	Merge pull request #1092 from jeremymcs/fix/export-html-blocker-filter fix: match both milestoneId and sliceId in blocker card filter	2026-03-17 22:25:56 -06:00
TÂCHES	79303deb30	Merge pull request #1093 from gsd-build/refactor/decompose-commands-phase2 refactor: decompose commands.ts into 5 focused modules	2026-03-17 22:25:45 -06:00
TÂCHES	03a8fcc0dd	Merge pull request #1090 from gsd-build/ci-cd feat(ci): add Dockerfile for CI builder and runtime images	2026-03-17 22:23:35 -06:00
Lex Christopherson	f7a03946f3	refactor: decompose commands.ts into 5 focused modules Extract cohesive groups of functions from the 2,247-line commands.ts god-file into focused modules: - commands-prefs-wizard.ts (~600 lines): TUI preferences wizard, all configure* functions, serialization helpers - commands-config.ts (~80 lines): TOOL_KEYS, loadToolApiKeys, API key management - commands-inspect.ts (~80 lines): InspectData type, formatInspectOutput, DB diagnostics - commands-maintenance.ts (~200 lines): cleanup branches/snapshots, skip, dry-run handlers - commands-handlers.ts (~335 lines): doctor, steer, capture, triage, knowledge, run-hook, update, skill-health handlers commands.ts retains registerGSDCommand (router), dispatchDoctorHeal, fireStatusViaCommand, projectRoot, handleStatus, handleVisualize, handleSetup, showHelp, and re-exports from all sub-modules to preserve the public API surface. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-17 22:21:00 -06:00
Jeremy McSpadden	2d8fdcc0ab	fix: match both milestoneId and sliceId when filtering duplicate blocker cards The high-risk card filter in buildBlockersSection only compared sliceId, causing false positives when different milestones had slices with the same ID (e.g. M001/S01 and M002/S01). Now matches on both milestoneId and sliceId to correctly deduplicate.	2026-03-17 23:19:42 -05:00
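The false positive described above (M001/S01 versus M002/S01) comes from comparing only part of a composite key. A minimal sketch, assuming a hypothetical `SliceRef` shape and helper name:

```typescript
// Assumed shape: a slice is identified by its milestone and its own ID together.
interface SliceRef {
  milestoneId: string;
  sliceId: string;
}

// Hypothetical helper: a card is a duplicate only when BOTH IDs match.
function isDuplicateBlocker(card: SliceRef, existing: SliceRef[]): boolean {
  return existing.some(
    (e) => e.milestoneId === card.milestoneId && e.sliceId === card.sliceId,
  );
}

const shown: SliceRef[] = [{ milestoneId: "M001", sliceId: "S01" }];
console.log(isDuplicateBlocker({ milestoneId: "M002", sliceId: "S01" }, shown)); // false
console.log(isDuplicateBlocker({ milestoneId: "M001", sliceId: "S01" }, shown)); // true
```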
TÂCHES	d3242c4c6e	Merge pull request #1021 from trek-e/ci-cd CI/CD: Three-stage promotion pipeline (Dev → Test → Prod)	2026-03-17 22:18:51 -06:00
Lex Christopherson	de618be8f3	feat(ci): add multi-stage Dockerfile for CI builder and runtime images Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-17 22:14:53 -06:00
Lex Christopherson	c1bc65bcca	fix: switch alibaba-coding-plan to OpenAI-compat endpoint with proper compat flags (#1003) (#1057) Co-Authored-By: Tom Boucher <trek-e@users.noreply.github.com>	2026-03-17 22:09:55 -06:00
TÂCHES	965028e219	Merge pull request #1077 from jeremymcs/feat/token-optimization-suite feat: token optimization suite — caching, compression, smart context selection	2026-03-17 22:07:52 -06:00
Tom Boucher	41186ad9b0	docs: document /gsd config for global API keys (#1079 ) * docs: add Node LTS pinning guide for macOS Homebrew users New doc (docs/node-lts-macos.md) explains how to pin Node 24 LTS via Homebrew to avoid running on odd-numbered development releases. Covers brew install/link/pin, version managers as alternatives, and verification steps. Added notice banner in README linking to the guide. * docs: document /gsd config for global API keys Added Global API Keys section to configuration.md explaining: - /gsd config saves keys to ~/.gsd/agent/auth.json - Keys apply to all projects automatically - Three supported keys: Tavily, Brave, Context7 - How precedence works (env vars > saved keys) - Anthropic models don't need search keys Updated commands.md gsd config entry to link to the new section. Added Set up API keys section to getting-started.md for first-run.	2026-03-17 22:03:10 -06:00
Jeremy McSpadden	288b399f88	fix: add dispatch stall guards to prevent auto-mode pause after slice completion (#1073 ) (#1076 ) * fix: prevent summarizing phase stall by retrying dropped agent_end events (#1072) When handleAgentEnd dispatches a sub-unit (via hooks, triage, or quick-task early-dispatch paths) and that unit completes before handleAgentEnd returns, the resulting agent_end event is silently dropped by the reentrancy guard. This leaves auto-mode active but permanently stalled — no unit running, no watchdog set, process at high CPU doing nothing. Add a pendingAgentEndRetry flag to AutoSession that the reentrancy guard sets when it drops an agent_end event. The finally block in handleAgentEnd checks this flag and schedules a deferred retry via setImmediate, ensuring the completed unit's agent_end is always processed. * fix: add dispatch stall guards to prevent auto-mode pause after slice completion (#1073) After a slice completes all tasks, auto-mode can stall if newSession() hangs or dispatchNextUnit gets permanently blocked at any await point. The existing gap watchdog only fires AFTER dispatchNextUnit returns, so it cannot recover from hangs inside the function itself. - Wrap newSession() with Promise.race timeout (30s) to prevent permanent hangs from session manager deadlocks or network issues - Add pre-dispatch hang guard (60s) in handleAgentEnd that starts the gap watchdog if dispatchNextUnit hasn't completed — catches hangs at any await point (model selection, session creation, etc.) - Add better diagnostics: notify user when session creation times out or fails, with specific unit type/ID for debugging	2026-03-17 22:02:10 -06:00
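The `Promise.race` timeout wrapper mentioned in the commit above can be sketched like this. The helper name `withTimeout` and its signature are assumptions; only the technique (racing the real work against a timer) comes from the commit message.

```typescript
// Hypothetical helper: rejects if `work` has not settled within `ms` milliseconds.
async function withTimeout<T>(work: Promise<T>, ms: number, label: string): Promise<T> {
  let timer: ReturnType<typeof setTimeout> | undefined;
  const timeout = new Promise<never>((_, reject) => {
    timer = setTimeout(() => reject(new Error(`${label} timed out after ${ms}ms`)), ms);
  });
  try {
    // Whichever promise settles first decides the outcome.
    return await Promise.race([work, timeout]);
  } finally {
    // Clear the timer so a fast success does not leave a pending timeout behind.
    clearTimeout(timer);
  }
}

// Usage: a stand-in for a session call that may hang (30s in the commit; 50ms here).
const neverSettles = new Promise<string>(() => {});
withTimeout(neverSettles, 50, "newSession").catch((err: Error) => {
  console.log(err.message);
});
```

Note that the race only stops waiting; it does not cancel the underlying operation, so the caller still has to decide how to recover.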
Jeremy McSpadden	b2befe3628	fix: prevent summarizing phase stall by retrying dropped agent_end events (#1072 ) (#1074 ) When handleAgentEnd dispatches a sub-unit (via hooks, triage, or quick-task early-dispatch paths) and that unit completes before handleAgentEnd returns, the resulting agent_end event is silently dropped by the reentrancy guard. This leaves auto-mode active but permanently stalled — no unit running, no watchdog set, process at high CPU doing nothing. Add a pendingAgentEndRetry flag to AutoSession that the reentrancy guard sets when it drops an agent_end event. The finally block in handleAgentEnd checks this flag and schedules a deferred retry via setImmediate, ensuring the completed unit's agent_end is always processed.	2026-03-17 22:01:58 -06:00
Tom Boucher	38b79d75a7	refactor: remove redundant test file, identify consolidation targets (#1070 ) * docs: add Node LTS pinning guide for macOS Homebrew users New doc (docs/node-lts-macos.md) explains how to pin Node 24 LTS via Homebrew to avoid running on odd-numbered development releases. Covers brew install/link/pin, version managers as alternatives, and verification steps. Added notice banner in README linking to the guide. * refactor: remove auto-draft-pause.test.ts — redundant with auto-dashboard.test.ts auto-draft-pause.test.ts tested describeNextUnit() for needs-discussion, pre-planning, and executing phases. All of these are already covered by auto-dashboard.test.ts which has proper node:test structure. The removed file also had fragile structural tests (string-matching source code) that break on refactors. The behavioral coverage is complete in the existing file. 1296 tests pass, 0 fail.	2026-03-17 22:01:20 -06:00
Jeremy McSpadden	60dfaabe03	fix: use atomic writes for completed-units.json and invalidate caches in db-writer (#1069 ) Addresses state safety issues found during #1062 deep dive: 1. completed-units.json writes in auto-worktree.ts and auto-worktree-sync.ts used plain writeFileSync which could produce truncated/corrupt files on crash, losing completion keys and causing unit re-dispatch. Switched to atomicWriteSync (temp file + rename) for crash safety. 2. Plan file checkbox reconciliation in auto-worktree.ts also switched to atomicWriteSync to prevent partial PLAN.md writes on crash. 3. db-writer.ts functions (saveDecisionToDb, updateRequirementInDb, saveArtifactToDb) wrote markdown files via saveFile() without invalidating caches afterward. Added targeted cache invalidation (state + path + parse) so deriveState() always sees fresh data. Uses individual invalidation functions rather than invalidateAllCaches() to avoid clearing the artifacts table that was just written to.	2026-03-17 22:01:08 -06:00
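The "temp file + rename" approach named in the commit above can be sketched as follows. This is not the project's `atomicWriteSync`; the function name, temp-file naming, and example path are assumptions, and the sketch omits `fsync` calls that a stricter implementation might add.

```typescript
import { mkdtempSync, readFileSync, renameSync, writeFileSync } from "node:fs";
import { tmpdir } from "node:os";
import { basename, dirname, join } from "node:path";

// Hypothetical sketch: write to a sibling temp file, then rename over the target.
// On POSIX filesystems a rename within one directory replaces the target atomically,
// so readers see either the old file or the complete new one, never a partial write.
function atomicWriteSketch(path: string, data: string): void {
  const tmp = join(dirname(path), `.${basename(path)}.${process.pid}.tmp`);
  writeFileSync(tmp, data, "utf8");
  renameSync(tmp, path);
}

// Usage with a throwaway directory; the file name mirrors the one in the commit.
const dir = mkdtempSync(join(tmpdir(), "atomic-write-"));
const target = join(dir, "completed-units.json");
atomicWriteSketch(target, JSON.stringify(["M001/S01/T01"]));
console.log(readFileSync(target, "utf8"));
```

Keeping the temp file in the same directory as the target matters: a rename across filesystems is not atomic and may fail outright.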
Jeremy McSpadden	668f12b97f	fix: reject prose Verify: fields from being executed as shell commands (#1066 ) (#1068 ) The verification gate's discoverCommands() was passing prose descriptions from task plan Verify: fields through sanitizeCommand(), which only checked for shell injection characters. English prose like "Document exists, contains all 5 scale names..." passed the filter and was executed via spawnSync, causing exit code 127 false negatives. Added isLikelyCommand() heuristic that distinguishes executable commands from prose descriptions by checking: - Known command prefixes (npm, node, tsc, eslint, etc.) - Path-like first tokens (./script.sh, /usr/bin/check) - Flag-like tokens (-v, --check) - Uppercase-initial words with 4+ tokens (prose pattern) - Comma-space clause separators (prose pattern) Prose Verify: fields now fall through to package.json scripts or "none" instead of being executed. Valid commands continue to work as before. Closes #1066	2026-03-17 22:00:52 -06:00
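The command-versus-prose checks listed in the commit above can be approximated as below. This is a sketch under assumptions: the function name `looksLikeCommand`, the prefix list, the order of checks, and the default for unrecognized input are all illustrative and may differ from the real `isLikelyCommand()`.

```typescript
// Assumed prefix list; the commit names npm, node, tsc, eslint "etc.".
const KNOWN_PREFIXES = new Set(["npm", "npx", "node", "tsc", "eslint"]);

function looksLikeCommand(text: string): boolean {
  const trimmed = text.trim();
  if (trimmed === "") return false;

  const tokens = trimmed.split(/\s+/);
  const first = tokens[0] ?? "";

  // Positive signals: known tool name or a path-like first token.
  if (KNOWN_PREFIXES.has(first)) return true;
  if (first.startsWith("./") || first.startsWith("/")) return true;

  // Prose signals: comma-space clause separators, or a capitalized
  // first word followed by at least three more tokens.
  if (trimmed.includes(", ")) return false;
  if (/^[A-Z]/.test(first) && tokens.length >= 4) return false;

  // Flag-like tokens such as -v or --check suggest a command line.
  if (tokens.some((t) => /^--?[A-Za-z]/.test(t))) return true;

  // Assumption for this sketch: anything unrecognized is not executed.
  return false;
}

console.log(looksLikeCommand("npm test"));                                    // true
console.log(looksLikeCommand("Document exists, contains all 5 scale names")); // false
```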
Jeremy McSpadden	9083d86766	fix: restore session model on error instead of reading stale global prefs (#1065 ) (#1067 ) When a model fails during auto-mode and the fallback chain is exhausted (or absent), the error recovery path previously fell through to pause without attempting to restore the session's original model. Meanwhile, the fallback chain itself was read fresh from disk via loadEffectiveGSDPreferences(), which could pick up models configured by a different concurrent GSD session sharing the same global preferences file. This adds a session model recovery step between fallback exhaustion and pause. After the existing fallback chain logic, we now check whether the current model has diverged from the model captured at auto-mode start (autoModeStartModel). If so, we restore the session model and retry before giving up and pausing. Changes: - auto.ts: export getAutoModeStartModel() getter for the session's captured start model - index.ts: add session model recovery block after fallback chain exhaustion, using the session-scoped model instead of re-reading global preferences from disk - model-isolation.test.ts: add 4 tests covering cross-session leakage detection, divergence checks, and null safety	2026-03-17 22:00:33 -06:00
Jeremy McSpadden	306c205dfc	fix: prevent run-uat re-dispatch loop when roadmap checkbox update fails (#1063 ) (#1064 ) Two compounding bugs caused auto-mode to re-dispatch run-uat indefinitely after UAT passed: 1. markSliceDoneInRoadmap regex required dash at line start (^-) but the roadmap parser accepts optional leading whitespace (^\s*-). When LLMs indented checklist items, the doctor could never mark them done. 2. After run-uat completed, handleAgentEnd ran doctor with fixLevel:"task" which explicitly excluded slice-level completion transitions. Since run-uat is the terminal unit for a slice, the roadmap checkbox stayed unchecked, causing deriveState to return the same slice indefinitely. Fix: Update markSliceDoneInRoadmap and markTaskDoneInPlan regexes to accept leading whitespace (matching the parser), preserving indentation in the replacement. Add run-uat to the set of unit types that use fixLevel:"all" in handleAgentEnd closeout.	2026-03-17 22:00:19 -06:00
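The first bug above is a mismatch between `^-` and `^\s*-`. A minimal sketch of a replacement that tolerates indentation and preserves it is shown below; the checklist line format (`- [ ] S01: ...`) and the helper name `markItemDone` are assumptions, since the log does not show the actual roadmap syntax.

```typescript
// Hypothetical helper: ticks the checkbox of the first list item whose text starts with `id`.
function markItemDone(markdown: string, id: string): string {
  // Escape regex metacharacters in the ID before embedding it in the pattern.
  const escapedId = id.replace(/[.*+?^${}()|[\]\\]/g, "\\$&");
  // Group 1 captures optional indentation plus the dash, so the replacement keeps it intact.
  const pattern = new RegExp(`^([ \\t]*-[ \\t]+)\\[ \\]([ \\t]+${escapedId}\\b)`, "m");
  return markdown.replace(pattern, "$1[x]$2");
}

const roadmap = "- [ ] S01: parser\n  - [ ] S02: renderer";
console.log(markItemDone(roadmap, "S02"));
```

With a pattern anchored at `^-`, the indented `S02` line would never match, which is the stuck state the commit describes.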
Tom Boucher	55769392af	refactor: batch 2 — consolidate preferences, convert 8 more files to node:test (#1061) * docs: add Node LTS pinning guide for macOS Homebrew users New doc (docs/node-lts-macos.md) explains how to pin Node 24 LTS via Homebrew to avoid running on odd-numbered development releases. Covers brew install/link/pin, version managers as alternatives, and verification steps. Added notice banner in README linking to the guide. * refactor: batch 2 — consolidate preferences tests, convert 7 more files to node:test Preferences (6 files → 1): preferences-{git,hooks,mode,models,schema-validation,wizard-fields}.test.ts → preferences.test.ts (28 tests) Converted to node:test (custom runner → node:test): - discuss-prompt.test.ts (1 test) - auto-preflight.test.ts (1 test) - next-milestone-id.test.ts (4 tests) - plan-slice-prompt.test.ts (3 tests) - workspace-index.test.ts (1 test) - roadmap-slices.test.ts (5 tests) - in-flight-tool-tracking.test.ts (5 tests) Net: -933 lines, -6 files. Full suite: 1325 pass, 0 fail. * refactor: convert dispatch-guard.test.ts to node:test Net: 1 more file converted. Total this branch: 14 files converted/consolidated, 6 deleted. * fix: add null guards for parsePreferencesMarkdown in tests Add assert.ok(prefs) after each parsePreferencesMarkdown() call to narrow the GSDPreferences | null return type before property access. Fixes TS18047 errors in CI typecheck.	2026-03-17 22:00:04 -06:00
Tom Boucher	8dfa7d058c	refactor: consolidate tests by area, standardize on node:test (#1059 ) * docs: add Node LTS pinning guide for macOS Homebrew users New doc (docs/node-lts-macos.md) explains how to pin Node 24 LTS via Homebrew to avoid running on odd-numbered development releases. Covers brew install/link/pin, version managers as alternatives, and verification steps. Added notice banner in README linking to the guide. * refactor: consolidate tests by area, standardize on node:test Consolidated 10 test files into 4, standardizing on node:test. Provider errors (3 files → 1): provider-errors.test.ts (34 tests) Metrics (2 files → 1): metrics.test.ts (13 tests, converted from custom runner) Activity log (2 files → 1): activity-log.test.ts (11 tests, converted from custom runner) Complexity (2 files → 1): removed redundant structural string checks Net: -694 lines, -6 files.	2026-03-17 21:59:50 -06:00
Jeremy McSpadden	3d4f77b2ee	fix: inline compareSemver in gsd extension to fix broken relative import (#1058) The /gsd update command imported compareSemver from ../../../update-check.js, a relative path that resolves correctly in the source tree (src/resources/extensions/gsd/ → src/update-check.js) but breaks when extensions are synced to ~/.gsd/agent/extensions/gsd/ (where ../../../ points to ~/.gsd/ which has no update-check.js). This caused the error: Extension "command:gsd" error: Cannot find module '../../../update-check.js' Fix: inline a local compareSemverLocal() function in commands.ts, eliminating the cross-tree import. The function is small (10 lines) and already well-tested via update-check.test.ts.	2026-03-17 21:59:25 -06:00
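A small inlined comparison of the kind described above might look like the sketch below. It is not the project's `compareSemverLocal()`; the name `compareSemverSketch` is hypothetical, and the sketch assumes plain dot-separated numeric versions with no pre-release or build tags.

```typescript
// Hypothetical stand-in: returns 1 if a > b, -1 if a < b, 0 if equal.
// Assumes numeric segments only (e.g. "2.28.0"); missing segments count as 0.
function compareSemverSketch(a: string, b: string): number {
  const pa = a.split(".").map(Number);
  const pb = b.split(".").map(Number);
  const length = Math.max(pa.length, pb.length);
  for (let i = 0; i < length; i++) {
    const x = pa[i] ?? 0;
    const y = pb[i] ?? 0;
    // Compare numerically, so "10" sorts after "9".
    if (x !== y) return x > y ? 1 : -1;
  }
  return 0;
}

console.log(compareSemverSketch("2.28.0", "2.27.9")); // 1
console.log(compareSemverSketch("2.9.0", "2.10.0"));  // -1
```

Duplicating a function this small trades a little redundancy for independence from the source-tree layout, which is the trade the commit makes.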
Tom Boucher	41ebc6b643	docs: recommend pi-dashscope extension for DashScope models (#1056 ) * docs: add Node LTS pinning guide for macOS Homebrew users New doc (docs/node-lts-macos.md) explains how to pin Node 24 LTS via Homebrew to avoid running on odd-numbered development releases. Covers brew install/link/pin, version managers as alternatives, and verification steps. Added notice banner in README linking to the guide. * docs: recommend pi-dashscope extension for DashScope models The built-in alibaba-coding-plan provider uses the Anthropic-compat endpoint and lacks per-model thinking format and compatibility flags, causing issues like #1003 (MiniMax-M2.5 thinking loop). The community pi-dashscope extension uses the correct OpenAI-compat endpoint, sets thinkingFormat per model (qwen/zai), includes compat flags (supportsDeveloperRole, supportsReasoningEffort), and provides an interactive /dashscope-configure command. Added Community Provider Extensions section to configuration docs recommending pi-dashscope over the built-in provider.	2026-03-17 21:59:01 -06:00
Tom Boucher	85d48d3c97	fix: disable reasoning for MiniMax-M2.5 in alibaba-coding-plan provider (#1003 ) (#1055 ) * docs: add Node LTS pinning guide for macOS Homebrew users New doc (docs/node-lts-macos.md) explains how to pin Node 24 LTS via Homebrew to avoid running on odd-numbered development releases. Covers brew install/link/pin, version managers as alternatives, and verification steps. Added notice banner in README linking to the guide. * fix: disable reasoning for MiniMax-M2.5 in alibaba-coding-plan provider (#1003) MiniMax-M2.5 via Dashscope's Anthropic-compatible API does not properly support extended thinking, causing the model to get stuck in a thinking loop. Set reasoning: false for this model entry in the alibaba-coding-plan provider.	2026-03-17 21:58:38 -06:00
Tom Boucher	004f0ac861	docs: update README and docs for v2.28.0 release (#1054 ) * docs: add Node LTS pinning guide for macOS Homebrew users New doc (docs/node-lts-macos.md) explains how to pin Node 24 LTS via Homebrew to avoid running on odd-numbered development releases. Covers brew install/link/pin, version managers as alternatives, and verification steps. Added notice banner in README linking to the guide. * docs: update README and docs for v2.28.0 release - README: add 'What's New in v2.28' section with key features - commands.md: add /gsd update, /gsd export --html --all, and Export section with usage examples - auto-mode.md: add --all flag to export, add Failure Recovery (v2.28) section documenting reliability hardening - getting-started.md: mention /gsd update as in-session option	2026-03-17 21:58:07 -06:00
Jeremy McSpadden	45bff3456c	feat(gsd): add directory safeguards for system/home paths (#1053 ) * feat(gsd): add directory safeguards to prevent running in system/home paths GSD previously had no protection against being launched from dangerous directories like $HOME, /, /usr, or /etc. This adds layered validation: - Blocked system paths (hard stop): /, /usr, /etc, /var, $HOME, tmpdir, etc. - High entry count heuristic (>200 entries triggers confirmation dialog) - Symlink resolution via realpathSync to prevent bypass - Integrated at three chokepoints: projectRoot(), showSmartEntry(), bootstrapGsdDirectory() Includes 19 tests covering all blocked categories, boundary conditions, and the assertSafeDirectory throw/return behavior. * fix: make directory safeguard tests cross-platform (Windows CI) - Skip Unix-specific blocked path tests on Windows (/, /usr, /etc, etc.) - Add Windows-specific blocked path tests (C:\, C:\Windows) - Use platform-appropriate path separator in trailing slash test - Fix root path normalization for Windows drive letters (C:\ not C:)	2026-03-17 21:57:53 -06:00
Jeremy McSpadden	ce1ad35706	perf: skip initResources when version matches, consolidate startup I/O (#1052 ) - Add version-match early return to initResources() — skips ~800ms of synchronous rmSync + cpSync when managed-resources.json already matches the running GSD version (steady-state on every launch) - Consolidate package.json reads in loader.ts from 3 to 1 — single read reused for --version, --help, banner, and GSD_VERSION env var - Replace blocking checkAndPromptForUpdates() with passive checkForUpdates() to avoid blocking startup on npm registry fetch + user prompt (up to 5s) - Cache bundled extension keys in resource-loader to avoid redundant filesystem scan in buildResourceLoader() - Use GSD_VERSION env var in getBundledGsdVersion() to skip package.json re-read from resource-loader.ts - Add test verifying version-skip behavior: marker file survives when versions match, gets cleaned on mismatch	2026-03-17 21:57:13 -06:00
Jeremy McSpadden	326cef0b2d	feat: enhance HTML report with derived metrics, visualizations, and interactivity (#1078 ) * feat: enhance HTML report with derived metrics, visualizations, and interactivity Add 13 features to the HTML report generator across 6 implementation waves: Wave 1 - Summary enhancements: - Executive summary paragraph with project completion %, cost, and budget context - ETA calculation based on completion rate and remaining slices - Cost/slice and Tokens/tool efficiency metrics in KV grid - Cache hit ratio percentage - Milestone scope indicator when scoped to a milestone Wave 2 - Metrics visualizations: - Cost over time inline SVG area chart with grid lines and axis labels - Duration by slice bar chart (third chart using existing buildBarChart) - Budget burndown horizontal stacked bar (spent/projected/overshoot) - Chart row CSS changed to auto-fit for flexible multi-column layout Wave 3 - Blockers section: - New section with card-based layout for blocker verifications and high-risk incomplete slices, added to sections array and TOC nav Wave 4 - Gantt chart: - SVG horizontal bar timeline grouped by slice with done/active/pending coloring and time axis labels Wave 5 - Interactive JS features: - Timeline filter input for text-based row filtering - Collapsible sections with toggle buttons (localStorage persisted) - Dark/light theme toggle in header (localStorage persisted) Wave 6 - Mobile responsiveness: - 768px and 480px breakpoints with stacked layouts and compressed padding All changes in a single file (export-html.ts). No data layer changes needed. 30 new tests covering all features and edge cases. * fix: correct Phase type literal in export-html-enhancements test Change "execution" to "executing" to match the Phase type definition.	2026-03-17 21:46:51 -06:00
Tom Boucher	50bea6e73a	feat: auto-extract lessons to KNOWLEDGE.md on slice/milestone completion (#711 ) (#1081 ) * docs: add Node LTS pinning guide for macOS Homebrew users New doc (docs/node-lts-macos.md) explains how to pin Node 24 LTS via Homebrew to avoid running on odd-numbered development releases. Covers brew install/link/pin, version managers as alternatives, and verification steps. Added notice banner in README linking to the guide. * feat: auto-extract lessons to KNOWLEDGE.md on slice/milestone completion (#711) Added knowledge extraction steps to completion prompts: - complete-slice.md step 9: review task summaries for patterns, gotchas, and non-obvious lessons → append to KNOWLEDGE.md - complete-milestone.md step 9: review all slice summaries for cross-cutting insights → append to KNOWLEDGE.md Combined with the existing execute-task step 13 (which already tells agents to append discoveries during execution), this creates a three-layer extraction pipeline: task → slice → milestone.	2026-03-17 21:45:55 -06:00
Tom Boucher	c5739f1282	feat: auto-create PR on milestone completion (#687 ) (#1084 ) * docs: add Node LTS pinning guide for macOS Homebrew users New doc (docs/node-lts-macos.md) explains how to pin Node 24 LTS via Homebrew to avoid running on odd-numbered development releases. Covers brew install/link/pin, version managers as alternatives, and verification steps. Added notice banner in README linking to the guide. * feat: auto-create PR on milestone completion (#687) New git preferences: - git.auto_pr (boolean, default false): create a PR when a milestone completes via gh CLI - git.pr_target_branch (string, default main branch): target branch for auto-created PRs (e.g. develop, qa, staging) Implementation: - GitPreferences: added auto_pr and pr_target_branch fields - preferences.ts: added validation for both fields - auto-worktree.ts: after push, pushes milestone branch and creates PR via 'gh pr create' (non-fatal on failure) Documentation: - configuration.md: added fields to git config block, table, and new git.auto_pr section with requirements and flow - git-strategy.md: added Automatic Pull Requests section with Gitflow example config	2026-03-17 21:45:29 -06:00
Tom Boucher	792b166ce6	fix: improve LSP diagnostics when no servers detected (#1082 ) (#1086 ) * docs: add Node LTS pinning guide for macOS Homebrew users New doc (docs/node-lts-macos.md) explains how to pin Node 24 LTS via Homebrew to avoid running on odd-numbered development releases. Covers brew install/link/pin, version managers as alternatives, and verification steps. Added notice banner in README linking to the guide. * fix: improve LSP diagnostics when no servers detected (#1082) When lsp status returns 'No language servers configured', the output now includes diagnostics: - Which project markers were detected (e.g. package.json found) - Which server commands are missing (e.g. typescript-language-server) - Install instructions Also added LSP troubleshooting section to docs/troubleshooting.md with common install commands per language.	2026-03-17 21:45:11 -06:00
Jeremy McSpadden	4e7b3d486f	test: add end-to-end token optimization benchmark Benchmark validates all optimization modules with realistic GSD content: - Structured data: 20% decisions savings, 7% requirements savings - Prompt compression: 5-17% across light/moderate/aggressive levels - Semantic chunking: 73% content reduction via TF-IDF selection - Summary distillation: 73% savings preserving structured fields - Combined pipeline: 43% total savings on realistic dispatch prompt - Cache efficiency: 94% cacheable prefix, 85% estimated Anthropic savings - Provider-aware: 14% budget accuracy improvement for Anthropic vs OpenAI	2026-03-17 22:10:58 -05:00
Jeremy McSpadden	d65da6c927	feat: wire semantic chunking, add preferences, metrics, and docs - Wire semantic chunker into inlineFileSmart() for large file context selection - Use inlineFileSmart for knowledge file in buildExecuteTaskPrompt (TF-IDF relevance) - Add compression_strategy and context_selection preferences with profile defaults - Add resolveCompressionStrategy() and resolveContextSelection() resolvers - Add cacheHitRate and compressionSavings to UnitMetrics - Add aggregateCacheHitRate() for session-wide cache performance - Update token-optimization.md with compression, chunking, and distillation docs - Add 12 integration tests for optimization preferences and modules	2026-03-17 22:07:05 -05:00
Jeremy McSpadden	39b3daee6f	feat: add token optimization suite for prompt caching, compression, and smart context selection Introduces six new modules that work together to reduce token usage across the dispatch pipeline while preserving semantic content quality: - Provider-aware token counting with per-provider char/token ratios - Prompt cache optimizer for maximizing Anthropic/OpenAI cache hit rates - Structured data formatter (compact notation for decisions/requirements/tasks) - Deterministic prompt compressor (light/moderate/aggressive levels) - Semantic chunker with TF-IDF relevance scoring for context selection - Summary distiller for condensed dependency summaries Integration points: - inlineDependencySummaries uses distillation before truncation (3+ deps) - inlineDecisionsFromDb/inlineRequirementsFromDb use compact format at non-full levels - buildExecuteTaskPrompt compresses carry-forward when it exceeds 40% of budget - context-budget.reduceToFit combines compression with section-boundary truncation - computeBudgets accepts optional provider for accurate char/token ratios All existing 1475 unit tests + 30 integration tests pass with zero regressions. 157 new tests cover all optimization modules.	2026-03-17 22:02:27 -05:00
Jeremy McSpadden	68a999ebde	fix: prevent summarizing phase stall by retrying dropped agent_end events (#1072) When handleAgentEnd dispatches a sub-unit (via hooks, triage, or quick-task early-dispatch paths) and that unit completes before handleAgentEnd returns, the resulting agent_end event is silently dropped by the reentrancy guard. This leaves auto-mode active but permanently stalled — no unit running, no watchdog set, process at high CPU doing nothing. Add a pendingAgentEndRetry flag to AutoSession that the reentrancy guard sets when it drops an agent_end event. The finally block in handleAgentEnd checks this flag and schedules a deferred retry via setImmediate, ensuring the completed unit's agent_end is always processed.	2026-03-17 21:49:39 -05:00
Tom Boucher	d252168de5	fix: switch alibaba-coding-plan to OpenAI-compat endpoint with proper compat flags (#1003) The alibaba-coding-plan provider was using the Anthropic-compatible endpoint (/apps/anthropic) with anthropic-messages API, which caused issues with thinking mode on several models (MiniMax-M2.5 thinking loop, missing thinkingFormat for Qwen/GLM models). Changes for all 8 models: - API: anthropic-messages → openai-completions - Endpoint: /apps/anthropic → /v1 (OpenAI-compatible) - Added per-model compat flags: - Qwen models: thinkingFormat: 'qwen', supportsDeveloperRole: false - GLM models: thinkingFormat: 'qwen', supportsDeveloperRole: false - MiniMax-M2.5: supportsReasoningEffort: true, maxTokensField: 'max_tokens' - Kimi K2.5: thinkingFormat: 'zai', supportsDeveloperRole: false - Enabled reasoning for qwen3-max (was incorrectly false) - Fixed context windows to match tested values - Fixed MiniMax-M2.5 maxTokens: 24576 → 65536	2026-03-17 21:11:18 -04:00
Tom Boucher	7377a08d8e	docs: add Node LTS pinning guide for macOS Homebrew users New doc (docs/node-lts-macos.md) explains how to pin Node 24 LTS via Homebrew to avoid running on odd-numbered development releases. Covers brew install/link/pin, version managers as alternatives, and verification steps. Added notice banner in README linking to the guide.	2026-03-17 20:49:51 -04:00
TÂCHES	94be09482f	fix: add barrel files for remote-questions, ttsr, and shared extensions (#1048) * fix: add barrel files for remote-questions, ttsr, and shared extensions Centralizes public API surface for three extension directories behind index.ts barrel files. External consumers now import from the barrel instead of reaching into internal module files, reducing coupling and making future refactors safer. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix: rename barrel files to mod.ts to avoid extension loader auto-discovery The extension loader auto-discovers extensions by looking for index.ts files inside extensions/*/ directories. remote-questions/ and shared/ are utility directories, not extensions — their index.ts barrel files caused load failures. Renamed to mod.ts which the loader ignores, and updated all import paths. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-17 18:48:32 -06:00
TÂCHES	27e79f76b3	refactor: centralize magic numbers into constants.ts (#1044) Extracts 11 hardcoded timeout, retry, compaction, and tool-default values from 9 source files into a single constants.ts module. Each source file now imports from the central definition, eliminating duplicated literals and making tuning a single-file change. Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-17 18:45:43 -06:00

1284 commits
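
The stall fix described in commit 68a999ebde (a reentrancy guard that records dropped agent_end events and retries them) can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the repository's code: the class name AutoSessionSketch, the log array, the onProcess callback, and the injectable schedule parameter are all hypothetical. Only handleAgentEnd, pendingAgentEndRetry, and the use of setImmediate for the deferred retry come from the commit message.

```typescript
// Hypothetical sketch of the "retry dropped agent_end" pattern.
// Only handleAgentEnd and pendingAgentEndRetry are names from the commit message.
type Scheduler = (fn: () => void) => void;

class AutoSessionSketch {
  private handling = false;             // reentrancy guard
  private pendingAgentEndRetry = false; // set when the guard drops an event
  private schedule: Scheduler;          // assumption: injected so the example is deterministic
  readonly log: string[] = [];

  constructor(schedule: Scheduler) {
    // Real code would presumably pass something like (fn) => setImmediate(fn).
    this.schedule = schedule;
  }

  handleAgentEnd(onProcess?: () => void): void {
    if (this.handling) {
      // A sub-unit finished while we were still handling the previous event.
      // Remember it instead of silently dropping it.
      this.pendingAgentEndRetry = true;
      this.log.push("dropped");
      return;
    }
    this.handling = true;
    try {
      this.log.push("processed");
      onProcess?.(); // may dispatch a sub-unit that completes re-entrantly
    } finally {
      this.handling = false;
      if (this.pendingAgentEndRetry) {
        this.pendingAgentEndRetry = false;
        // Deferred retry so the dropped event is handled after this call unwinds.
        this.schedule(() => this.handleAgentEnd());
      }
    }
  }
}

// Usage: a manual queue stands in for setImmediate.
const deferred: Array<() => void> = [];
const session = new AutoSessionSketch((fn) => { deferred.push(fn); });

// The callback simulates a sub-unit completing before handleAgentEnd returns.
session.handleAgentEnd(() => session.handleAgentEnd());
const logBeforeRetry = [...session.log];

// Run the deferred retry, as the event loop would.
deferred.forEach((fn) => fn());
const logAfterRetry = [...session.log];
```

Injecting the scheduler is a design choice made only for this sketch: it keeps the example synchronous and easy to check, while the flag-then-retry-in-finally structure mirrors what the commit message describes.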