singularity/singularity-forge

Author	SHA1	Message	Date
Jeremy	e4987f5337	fix(db): delete orphaned WAL/SHM files alongside empty gsd.db (#2478 ) syncProjectRootToWorktree deleted empty gsd.db but left companion -wal and -shm files on disk. On Node 24, node:sqlite attempts WAL recovery from orphaned files, triggering a synchronous CPU spin loop (227% CPU, 1.4GB RSS). Now deletes gsd.db-wal and gsd.db-shm when the main DB is deleted or already missing.	2026-04-04 19:36:43 -05:00
Jeremy	fd96a1a30b	fix(gsd): prevent auto-wrapup from interrupting in-flight tool calls (#3512 ) Gate triggerTurn behind getInFlightToolCount() === 0 for both soft timeout and context-pressure wrapup messages. Add clearQueue() to stopAuto() and pauseAuto() to flush late async follow-ups.	2026-04-04 19:14:23 -05:00
Jeremy McSpadden	dbaf37ae78	Merge pull request #3491 from Tibsfox/fix/claude-code-skill-directory-support fix(gsd): add Claude Code official skill directories to skill resolution	2026-04-04 18:24:51 -05:00
Jeremy McSpadden	677ca806df	Merge pull request #3494 from Tibsfox/fix/decision-save-transaction-race fix(gsd): wrap decision and requirement saves in transaction to prevent ID races	2026-04-04 18:23:15 -05:00
Tibsfox	c70eacea89	fix(gsd): wrap decision and requirement saves in transaction to prevent ID races nextDecisionId() and nextRequirementId() compute the next ID via SELECT MAX then pass it to a separate upsertDecision/upsertRequirement call. When parallel tool calls hit these functions concurrently, both read the same MAX value and produce the same ID — the second insert silently overwrites the first. Move the SELECT MAX + INSERT into a single transaction() call from gsd-db.ts, which uses BEGIN/COMMIT/ROLLBACK and works on both better-sqlite3 and node:sqlite providers. The transaction is re-entrant safe (nested calls skip the BEGIN). Same fix applied to saveRequirementToDb for consistency. Closes #3326, closes #3339, closes #3459	2026-04-04 15:16:52 -07:00
Tibsfox	e107828363	fix(gsd): add Claude Code official skill directories to skill resolution GSD-2 only searches ~/.agents/skills/ and .agents/skills/ for skills. Claude Code's official skill directories (~/.claude/skills/ and .claude/skills/) are not included in the search path, making GSD-2 blind to any skills managed there. The skills.sh CLI (npx skills list -g) already recognises both ~/.agents/skills/ and ~/.claude/skills/ as valid global skill directories. This commit aligns GSD-2's resolution logic with that behaviour. Affected functions: - getSkillSearchDirs(): adds ~/.claude/skills/ and .claude/skills/ - captureAvailableSkills(): includes Claude Code dir in telemetry - detectStaleSkills(): includes Claude Code dir in staleness checks - detectNewSkills(): resolves SKILL.md from either directory - isPackInstalled(): checks both dirs before recommending installs - formatSkillDetail(): finds SKILL.md in either directory	2026-04-04 15:15:29 -07:00
Jeremy McSpadden	099e6f3120	Merge pull request #3511 from jeremymcs/fix/steer-worktree-path fix(gsd): steer writes overrides to worktree when active	2026-04-04 16:10:30 -05:00
Jeremy	ee87924636	fix(gsd): gate steer worktree routing on active session, fix messaging Address adversarial review findings: 1. [high] Override routing now requires an active auto-mode session (in-process or remote via checkRemoteAutoSession) before writing to a worktree path. Previously, any existing worktree directory would receive the override even if no agent was running there — a leftover worktree from a previous session would silently eat the override. 2. [medium] Success messages now report the actual resolved override location (worktree vs project root .gsd/OVERRIDES.md) so operators know exactly where to look during recovery or manual rewrite. Additional tests cover: inactive worktree fallback, double-gate (autoRunning + valid .git), and getAutoWorktreePath null on missing .git. Closes #3476	2026-04-04 15:37:13 -05:00
Jeremy McSpadden	82906524a8	Merge pull request #3482 from NilsR0711/fix/remote-questions-interactive-mode fix(remote-questions): fire configured channels in interactive mode	2026-04-04 15:32:19 -05:00
Jeremy	bd863e3e21	fix(gsd): resolve steer overrides to worktree path when worktree is active handleSteer used process.cwd() as the base path for appendOverride, which writes to project/.gsd/OVERRIDES.md. When auto-mode runs in a worktree, it reads from worktree/.gsd/ — so overrides written from a second terminal were never seen by the agent. Now checks for an active worktree via getAutoWorktreePath and writes the override there when one exists, falling back to the project root when no worktree is active. Closes #3476	2026-04-04 15:25:26 -05:00
Jeremy McSpadden	af82c37041	Merge pull request #2755 from jeremymcs/feat/capability-aware-model-routing-pr feat: capability-aware model routing (ADR-004)	2026-04-04 15:23:38 -05:00
Jeremy McSpadden	2acf5292d0	Merge pull request #3508 from jeremymcs/fix/audit-log-hardening fix(gsd): harden audit log persistence and demote probe warnings	2026-04-04 15:12:14 -05:00
Jeremy	4ddb9ca8a5	fix(gsd): add codebase validation in validatePreferences so preferences are not silently dropped The codebase preferences block was accepted as a known key but never validated or assigned in validatePreferences(), causing all user-configured codebase defaults to be silently discarded. Adds validation for exclude_patterns (string[]), max_files (positive int), and collapse_threshold (positive int) with unknown-key warnings and 4 new tests.	2026-04-04 15:01:15 -05:00
Jeremy	bbe67da02c	feat(gsd): enhance /gsd codebase with preferences, --collapse-threshold, and auto-init Add configurable codebase map options via preferences.md (exclude_patterns, max_files, collapse_threshold), expose --collapse-threshold as a CLI flag, and auto-generate CODEBASE.md during project init for instant agent orientation. Closes #3509	2026-04-04 14:51:51 -05:00
Jeremy McSpadden	5cd25cf5df	Remove copyright notice from test file Removed copyright notice from capability-router tests.	2026-04-04 14:33:50 -05:00
Jeremy	a290708573	fix(test): update db-path-worktree-symlink test for simplified diagnostic logging The ensureDbOpen catch block now logs via logWarning with error message instead of structured diagnostic object. Update source-level assertion to match the new pattern.	2026-04-04 14:33:12 -05:00
Jeremy	6eb532bf9d	fix(gsd): update tests for errors-only audit persistence, fix empty catch blocks Update existing workflow-logger tests to use logError for audit persistence assertions (warnings are now ephemeral). Add void expression to empty catch blocks in detectMainBranch to satisfy the no-empty-catch CI check.	2026-04-04 14:29:00 -05:00
Jeremy	10cd4a12c5	test(gsd): add workflow-logger audit persistence tests Covers error-only persistence policy, warning ephemeral behavior, message truncation, context field allowlist sanitization, and mixed severity filtering.	2026-04-04 14:22:56 -05:00
Jeremy	2396ecf1db	fix(gsd): harden audit log persistence — errors-only, sanitized, demote probe warnings Only persist error-severity entries to audit-log.jsonl (warnings stay ephemeral in stderr + buffer). Sanitize persisted entries with message truncation and context field allowlisting. Demote expected main/master branch probe failures to silent control flow. Remove JSON.stringify of diagnostic objects embedding cwd/paths in warning messages. Addresses Codex adversarial review findings on workflow-logger migration.	2026-04-04 14:19:36 -05:00
Jeremy McSpadden	7a1c6213a0	Merge pull request #3507 from jeremymcs/refactor/workflow-logger-migration refactor(gsd): migrate all catch blocks to centralized workflow-logger	2026-04-04 14:04:26 -05:00
Jeremy McSpadden	1a21915572	Merge pull request #3505 from jeremymcs/pr-3496 fix(gsd): fail-closed stop guard, harden backtrack parsing, fix prompt params	2026-04-04 13:59:04 -05:00
Jeremy	64fe364fdb	fix(gsd): address adversarial review findings on workflow-logger migration workflow-events.ts: stop logging raw event line content to audit log — log byte length only to avoid persisting potentially sensitive payload fragments to .gsd/audit-log.jsonl. parallel-orchestrator.ts: revert worker NDJSON parse failure to silent drop — non-JSON lines (progress text, tool output) are expected in worker stdout and logging each one creates I/O pressure and audit log bloat in the parallel execution hot path.	2026-04-04 13:53:16 -05:00
Jeremy	3d6d72c04d	refactor(gsd): migrate all catch blocks to centralized workflow-logger Replace raw process.stderr.write(), console.error(), and empty catch blocks across 50 GSD files with structured logWarning/logError calls from the centralized workflow-logger system. Add 13 new LogComponent types to cover all subsystems: recovery, session, prompt, dashboard, timer, worktree, command, parallel, fs, bootstrap, guided, registry, renderer. Every migrated catch block now automatically: - Shows in terminal (stderr) with component tag - Gets buffered for auto-loop stuck-detection summary - Persists to .gsd/audit-log.jsonl for post-mortem analysis Update regression test to verify catch blocks use workflow-logger instead of raw stderr/console, covering auto-mode files and all explicitly migrated infrastructure files. Closes #3506 Supersedes the approach in #3496	2026-04-04 13:42:55 -05:00
Jeremy	abe887de10	fix(gsd): fail-closed stop guard, harden backtrack parsing, fix prompt params - Stop/backtrack guard now calls pauseAuto before marking captures executed, and returns break on any exception to prevent silently dropping user halt intent - Backtrack target parsing excludes current milestone ID and rejects ambiguous multi-target strings instead of guessing first match - Fixed gsd_skip_slice parameter names in rethink prompt (milestone_id → milestoneId)	2026-04-04 13:09:16 -05:00
Tibsfox	4f896cc561	fix(gsd): add diagnostic logging to empty catch blocks in auto-mode Auto-mode has empty catch blocks across 11 files that silently swallow errors. When these operations fail (DB writes, git commands, file sync, worktree operations), the error is lost and downstream systems see stale or inconsistent state — leading to stuck loops, phantom milestones, and silent data loss. Replace every empty catch with a process.stderr.write() call that logs the operation context and error message. Format: gsd [filename]: <operation> failed: <error.message> For catches already annotated with /* non-fatal / or / best-effort */ comments, the logging is added alongside the annotation to preserve the original intent while making failures observable. Adds a regression test that scans all auto-mode source files and asserts no empty catch blocks remain. Files modified (11): auto-worktree.ts, auto.ts, auto-recovery.ts, auto-prompts.ts, auto-dashboard.ts, auto-start.ts, auto-timers.ts, auto-post-unit.ts, auto-dispatch.ts, auto-unit-closeout.ts, auto/phases.ts No behavioral changes — only diagnostic output added. Addresses #3348, addresses #3345	2026-04-04 10:38:54 -07:00
Jeremy McSpadden	d07f573799	Merge pull request #3499 from jeremymcs/test/state-machine-edge-cases test(gsd): fill state machine E2E verification gaps	2026-04-04 11:57:17 -05:00
Jeremy	e0884375e6	test: add regression test for interview-ui notes loop (#3502 ) Exercises the goNextOrSubmit → notes auto-open path to ensure: - Enter after typing a note advances instead of looping - Empty notes still trigger the auto-open - Normal option selection is unaffected Fixes #3502	2026-04-04 11:22:15 -05:00
Jeremy	f153745c4f	fix: break infinite notes loop when selecting "None of the above" goNextOrSubmit() unconditionally reopened the notes field whenever the cursor sat on the "None of the above" slot, even after the user had already typed a note and pressed Enter. This trapped users in an endless loop where Enter always bounced back to notes mode. Add a `!states[currentIdx].notes` guard so the auto-open only fires when notes are still empty. Fixes #3502	2026-04-04 11:12:17 -05:00
Jeremy	e3288e8dad	fix: align defaultRoutingConfig capability_routing to true The feature branch intends capability_routing to default to true when routing is enabled. Conflict resolution incorrectly kept the false default from the earlier commit.	2026-04-04 11:04:27 -05:00
Jeremy	6dc7c0ec1d	test(01-05): add capability-aware routing integration tests - Full pipeline with capability_routing: true returns capability-scored decision - capability_routing: false falls back to tier-only with no capabilityScores - Single eligible model (pinned) skips scoring and uses tier-only - Unknown model gets uniform score of 50 and competes in scoring - capabilityOverrides change scoring outcome in scoreEligibleModels - capabilityOverrides pass through resolveModelForComplexity to STEP 2 - Regression guards: routing disabled, unknown model, no-downgrade-needed all pass - All 51 tests pass (42 existing + 9 new integration)	2026-04-04 10:56:23 -05:00
Jeremy	1645be072c	feat(01-05): fire before_model_select hook, add verbose scoring output, load capability overrides - Fire pi.emitBeforeModelSelect() in selectAndApplyModel before resolveModelForComplexity - Hook override bypasses capability scoring entirely with tier-only selectionMethod - Verbose output shows capability-scored breakdown: model scores sorted descending - Add loadCapabilityOverrides() to model-router.ts for deep-merge with built-in profiles - Extend resolveModelForComplexity signature with optional capabilityOverrides parameter - Pass capabilityOverrides through to scoreEligibleModels in STEP 2	2026-04-04 10:56:22 -05:00
Jeremy	6cc42bb504	feat(01-04): register before_model_select placeholder handler in GSD hooks - Add before_model_select handler registration inside registerHooks() - Handler returns undefined (no override) to let capability scoring proceed - Comment references ADR-004 for traceability - Serves as documentation and ensures event type is registered for Plan 05 wiring	2026-04-04 10:56:06 -05:00
Jeremy	1866ccf781	feat(01-03): wire taskMetadata from selectAndApplyModel to resolveModelForComplexity - Pass unitType and classification.taskMetadata as 5th and 6th args to resolveModelForComplexity - Completes end-to-end data pipeline: classifier extracts metadata, attaches to ClassificationResult, auto-model-selection passes through to router for capability scoring	2026-04-04 10:56:06 -05:00
Jeremy	accee43563	feat(01-03): insert STEP 2 capability scoring into resolveModelForComplexity - Add unitType and taskMetadata optional params to resolveModelForComplexity - Replace findModelForTier with getEligibleModels for multi-model eligible set - Insert STEP 2 scoring block: activates when capability_routing enabled, eligible.length > 1, unitType provided - Add buildFallbackChain helper to deduplicate fallback assembly logic - Scoring returns capability-scored selectionMethod with capabilityScores and taskRequirements - Single-model and zero-model paths fall through to tier-only behavior - All 42 existing tests pass unchanged (backward compat via optional params)	2026-04-04 10:55:28 -05:00
Jeremy	bf918d30d5	test(01-02): add unit tests for scoring functions and taskMetadata passthrough - Add scoreModel, computeTaskRequirements, scoreEligibleModels, getEligibleModels describe blocks to model-router.test.ts (27 new tests) - Add ClassificationResult taskMetadata describe block to complexity-classifier.test.ts (4 new tests: execute-task populated, hook undefined, plan-slice undefined, extractTaskMetadata export) - Add getModelTier unknown-default tests verifying standard tier (not heavy) per D-15 - All 42 model-router tests pass, all 32 complexity-classifier tests pass - All 36 pre-existing capability-router tests continue to pass	2026-04-04 10:54:02 -05:00
Jeremy	409cd77cbc	feat(01-01): add taskMetadata to ClassificationResult and export extractTaskMetadata - Add taskMetadata?: TaskMetadata to ClassificationResult in complexity-classifier.ts - Add taskMetadata?: TaskMetadata to ClassificationResult in types.ts (duplicate interface) - Export extractTaskMetadata() so it can be imported by model-router.ts - Refactor classifyUnitComplexity() to extract metadata once for execute-task (eliminates double-extraction at adaptive learning step) - Populate taskMetadata field on ClassificationResult for execute-task units - Set taskMetadata: undefined explicitly on hook unit fast-path	2026-04-04 10:53:45 -05:00
Jeremy	0ccd3fd8a4	feat(01-01): add capability types, data tables, and scoring functions to model-router - Import TaskMetadata from complexity-classifier - Add capability_routing?: boolean to DynamicRoutingConfig - Add capabilityScores, taskRequirements, selectionMethod fields to RoutingDecision - Add ModelCapabilities interface (7 dimensions: coding, debugging, research, reasoning, speed, longContext, instruction) - Add MODEL_CAPABILITY_PROFILES data table with 9 model profiles - Add BASE_REQUIREMENTS data table with 11 unit type vectors - Add exported scoreModel() pure function (weighted average, 0-100 range) - Add exported computeTaskRequirements() with metadata-driven vector refinement - Add exported scoreEligibleModels() with cost-preferring tie-break sorting - Add exported getEligibleModels() extracted from findModelForTier() logic - Add selectionMethod: "tier-only" to all 5 return sites in resolveModelForComplexity() - Change getModelTier() unknown default from "heavy" to "standard" (per D-15) - Add capability_routing: true to defaultRoutingConfig()	2026-04-04 10:53:45 -05:00
Jeremy	e89bf7d18e	test(01-01): add failing tests for capability-aware model routing - Tests for scoreModel weighted average, edge cases (empty/unknown dims) - Tests for computeTaskRequirements with all branch paths (docs, concurrency, migration, large-file) - Tests for MODEL_CAPABILITY_PROFILES (9 models, 7 dimensions each) - Tests for BASE_REQUIREMENTS (all 11 unit types) - Tests for scoreEligibleModels (sorting, tie-breaking, unknown models, overrides) - Tests for getEligibleModels (tier filtering, explicit config, empty result) - Tests for DynamicRoutingConfig.capability_routing and RoutingDecision.selectionMethod	2026-04-04 10:51:31 -05:00
Jeremy	3f9fa9351f	fix(test): use correct RequirementCounts type fields in edge case tests Replace non-existent `invalidated` field with the correct type fields (`outOfScope`, `blocked`, `total`) to pass typecheck.	2026-04-04 10:25:00 -05:00
Jeremy	62cc474002	test(gsd): fill state machine E2E verification gaps (#3498 ) Add 102 integration tests across two new files covering state machine edge cases, runtime failures, and boundary conditions not exercised by the existing live-validation suite. Closes #3498	2026-04-04 10:00:07 -05:00
Tom Boucher	a061e3c276	feat: stop/backtrack capture classifications for milestone regression (#3488 ) * feat: add stop/backtrack capture classifications for milestone regression (#3487) Adds 4-layer methodology for halting auto-mode and backtracking to previous milestones when captures indicate the user wants to stop or that a milestone missed critical features: 1. Type layer: "stop" and "backtrack" classification types in captures.ts 2. Guard layer: pre-dispatch stop check in runGuards() pauses auto-mode before the next unit dispatches 3. Resolution layer: executeBacktrack() writes BACKTRACK-TRIGGER.md and milestone regression markers for state machine detection 4. Protection layer: revertExecutorResolvedCaptures() detects and reverts captures silenced by non-triage agents (resolved without classification) Also adds fast-path stop detection in auto-post-unit.ts that pattern-matches pending capture text for stop keywords without waiting for triage. Closes #3487 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * feat: add slice-level skip with gsd_skip_slice tool (#3477) Adds "skipped" as a closed status alongside "complete" and "done": - status-guards.ts: isClosedStatus() recognizes "skipped" - state.ts: isStatusDone() recognizes "skipped" - gsd-db.ts: getActiveSliceFromDb() skips slices with status "skipped" - db-tools.ts: new gsd_skip_slice tool for rethink and manual use - rethink.md: added "Skip a slice" operation to rethink prompt - rethink.ts: buildRethinkData shows skipped slice counts Skipped slices satisfy dependencies for downstream slices, allowing auto-mode to advance past them. Slice data is preserved for reference. Relates to #3477 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: resolve 4 issues found in adversarial review of PR #3488 1. triage-ui.ts: Restore stop/backtrack entries in CLASSIFICATION_LABELS and ALL_CLASSIFICATIONS — the Record<Classification, ...> type requires all union members, and runtime lookups would crash on stop/backtrack. Also auto-confirm stop/backtrack in the triage confirmation flow (matching the triage-captures.md prompt directive). 2. triage-resolution.ts: Replace require("node:fs") in clearBacktrackTrigger with ESM import of unlinkSync — consistent with the rest of the codebase. 3. auto-post-unit.ts: Anchor STOP_PATTERN regex to start-of-string (^) to prevent false positives on captures like "add a pause button" or "stop the timer from re-rendering" which are feature descriptions, not halt directives. 4. status-guards.test.ts: Add missing test case for isClosedStatus("skipped") to cover the new status value. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: update tool-naming test count for gsd_skip_slice The new gsd_skip_slice tool (no alias) brings the total from 29 to 30. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-04 01:40:33 -04:00
Tom Boucher	7d5bf63b2d	feat: GSD context optimization with model routing and context masking * docs: add context optimization design spec, implementation plan, and pi-layer research - Spec: 6-change design for GSD extension context optimization - Plan: 9-task TDD implementation plan with exact file paths and code - Pi-layer doc: 10 infrastructure opportunities (research only, not planned) Part of #3171, #3406, #3452, #3433. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * feat(context): add observation masking for auto-mode sessions Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * feat(context): add phase handoff anchors for auto-mode Introduces PhaseAnchor read/write utilities so downstream agents can inherit decisions, blockers, and intent written at phase boundaries without re-inferring from conversation history. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * feat(context): add capability-aware model routing and context management preferences Implement ADR-004 Phase 2 capability scoring with 7-dimension model profiles, task requirement vectors, and weighted scoring. Add ContextManagementConfig preferences for observation masking thresholds. Wire capability scoring into auto-model-selection dispatch path. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * feat(context): wire observation masking, phase anchors, and tool truncation Register observation masker in before_provider_request hook to replace old tool results with placeholders during auto-mode. Add tool result truncation (configurable via context_management.tool_result_max_chars). Inject phase handoff anchors into prompt builders so downstream phases inherit decisions from research/planning. Write anchors after successful phase completion. Update ADR-004 status to Implemented. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * chore: remove internal planning artifacts from PR Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * docs: add capability routing, observation masking, and context management Update dynamic-model-routing.md with capability-aware scoring section. Update token-optimization.md with observation masking, tool truncation, and phase handoff anchor documentation. Update configuration.md with context_management preference block and capability_routing flag. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * Merge branch 'main' into feat/gsd-context-optimization * fix: add context_management to known keys and prevent tool truncation state corruption - Add missing 'context_management' to KNOWN_PREFERENCE_KEYS set so users don't get spurious unknown-key warnings when configuring it. - Replace in-place mutation of tool result content with immutable spread to prevent corrupting shared conversation message objects. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: add stop and backtrack to triage-ui classification labels The Classification type gained stop and backtrack variants from main but triage-ui.ts was not updated, causing a TypeScript build failure. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: context masker and tool truncation operate on correct pi-ai message format The observation masker and tool result truncation in before_provider_request were checking m.type === "toolResult" but the actual pi-ai payload uses m.role === "toolResult" with content as TextContent[] arrays (not strings). bashExecution messages are converted to {role:"user"} by convertToLlm before the hook fires, so checking m.type === "bashExecution" was a no-op. - Fix context-masker to match on role, handle array content, detect bash results by their "Ran `" prefix - Fix register-hooks truncation to operate on role:"toolResult" with array content blocks - Update tests to use correct pi-ai LLM payload format Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-04 01:02:35 -04:00
Nils Reeh	131cb1bed2	fix(remote-questions): fire configured channels in interactive mode tryRemoteQuestions was gated behind if (!ctx.hasUI), so Telegram/Slack/ Discord were never contacted when GSD ran with a terminal UI. The test message sent during setup always worked (direct API call, no guard), which made the feature appear configured but non-functional in practice. Move the remote call before the hasUI guard so configured channels fire regardless of UI availability. When no remote channel is configured, tryRemoteQuestions returns null and the local UI is used as before. Adds a source-level regression test asserting that tryRemoteQuestions is called before the !ctx.hasUI branch. Closes #3480 Verified with AI.	2026-04-04 01:51:18 +02:00
Jeremy McSpadden	bb47f5a087	Merge pull request #2287 from jeremymcs/worktree-btw-implementation feat: add /btw skill — ephemeral side questions from conversation context	2026-04-03 15:18:03 -05:00
Justin Wyer	95875c41c5	refactor(test): consolidate regression and override tests into #666 test files Move regression tests and override tests from standalone files into the existing test files introduced by PR #666: - resolve-config-value.test.ts: add REGRESSION #666 describe block and setAllowedCommandPrefixes override tests - url-utils.test.ts: add REGRESSION #666 describe block and setFetchAllowedUrls override tests - Delete: regression-666.test.ts, resolve-config-value-override.test.ts, url-utils-override.test.ts Same 59 tests, fewer files, tests live next to the code they test.	2026-04-02 14:06:19 +02:00
Justin Wyer	d5f581fe6b	test: add regression tests for #666 (fails on main, passes on fix) Two regression tests that prove the bug introduced by PR #666: 1. Non-default credential tool (sops) is silently blocked by the hardcoded SAFE_COMMAND_PREFIXES with no way to override. 2. Private IP URL is silently blocked by isBlockedUrl() with no way to allowlist. Both tests use dynamic import to check for the override functions, so they run cleanly on both main (where they fail) and this branch (where they pass). Verified in a git worktree of main.	2026-04-02 14:03:34 +02:00
Justin Wyer	71caa18552	fix(security): add configurable overrides for command allowlist and SSRF blocklist PR #666 introduced hardcoded SAFE_COMMAND_PREFIXES and SSRF URL blocklists with no override mechanism. Users with non-standard credential tools (sops, doppler, age, infisical) or needing to fetch from internal URLs (self-hosted docs, VPN services) were silently blocked with no recourse. Add two global-only settings (ignored in project-level settings.json to preserve the security property against malicious repos): - allowedCommandPrefixes: replaces the built-in command allowlist - fetchAllowedUrls: exempts hostnames from SSRF blocking Both also support env var overrides (GSD_ALLOWED_COMMAND_PREFIXES, GSD_FETCH_ALLOWED_URLS) for CI/container environments. Env vars take precedence over settings.json. Security model: global-only keys are stripped from project settings at load time via stripGlobalOnlyKeys(), applied at all three assignment points for this.projectSettings. The merge function stays untouched — no future caller can accidentally skip stripping. 15 new tests covering override behavior, cache invalidation, allowlist exemptions, and global-only enforcement.	2026-04-02 13:45:05 +02:00
Jeremy McSpadden	46d5fa56af	Merge pull request #2312 from jeremymcs/fix/tui-review fix(tui): comprehensive TUI review — layout, flow, rendering, and state fixes	2026-04-01 16:38:31 -05:00
Jeremy	a3a2f2e3b3	test(tui): update provider-manager tests for confirmation-based removal Tests now match the new hasAuth guard and double-press r confirmation flow introduced in the TUI review PR.	2026-04-01 16:24:14 -05:00
Jeremy McSpadden	d0555857c2	Merge pull request #2976 from jeremymcs/splash-header-updates-clean feat(splash): add remote channel indicator to tools row	2026-04-01 16:14:23 -05:00

1 2 3 4 5 ...

1678 commits