singularity/singularity-forge

Author	SHA1	Message	Date
Jeremy McSpadden	2cafd66bed	Merge pull request #4184 from jeremymcs/claude/refactor-code-cleanup-078AQ fix: preserve image blocks in Claude Code SDK prompt path	2026-04-14 09:39:03 -05:00
Jeremy	01857ea180	fix(claude-code-cli): forward image blocks in SDK query prompt (#4183 )	2026-04-14 09:30:02 -05:00
mastertyko	5474e99ae2	feat(claude-code): pass thinking level as effort	2026-04-13 18:05:19 +02:00
Jeremy	937aef2c71	fix(claude-code): default GSD subagents to bypassPermissions and pre-authorize safe built-ins (#4099 follow-up) The first pass at #4099 only pre-authorized `mcp__<server>__` tools, but in `acceptEdits` mode the SDK still gates Read, Write, Glob/Grep, and basic shell inspection commands like `ls`. GSD subagents need the full workflow toolset and were still hitting "This command requires approval" prompts on every tool call. Two changes: 1. `resolveClaudePermissionMode` now returns `bypassPermissions` for all GSD subagent runs (auto + interactive), dropping the `acceptEdits` branch and the `isAutoActive` dynamic import. The host Claude Code session's permission model is the user-visible gate; the inner SDK process re-prompting on every tool was approval fatigue with no net safety benefit. `GSD_CLAUDE_CODE_PERMISSION_MODE` env override stays so security-conscious users can opt back into a stricter mode. 2. Expanded the pre-authorized `allowedTools` list to include Read, Write, Edit, Glob, Grep, `Bash(ls:)`, and `Bash(pwd)` alongside the MCP server globs. Acts as a belt-and-suspenders safety net for users who set the env override to `acceptEdits`. Tradeoff documented inline: bypass means a prompt-injection payload read from an untrusted file could trigger tool calls without a second gate. Accepted because the workflow is explicit user intent and the alternative is continuous approval fatigue that blocks real work. Tests updated for the new allowedTools shape; permission-mode tests already accepted bypass as the default. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-13 07:13:00 -05:00
Jeremy	b92fdc7b6f	fix(claude-code): pre-authorize workflow MCP tools so interactive acceptEdits mode stops blocking GSD commands Since 2.72.0 the interactive permission default is acceptEdits, which auto-approves built-in Edit/Write/Bash but leaves the SDK permission gate up for MCP tools. Without a canUseTool handler, every mcp__gsd-workflow__* call surfaces as "This command requires approval" and blocks GSD actions (#4099). Add allowedTools entries (mcp__<server>__*) for each registered workflow MCP server in buildSdkOptions so they run unattended while the rest of the acceptEdits safety gate stays intact. Env-overridden server names are handled by deriving the glob list from the built mcpServers keys. Fixes #4099	2026-04-13 06:34:59 -05:00
Jeremy	ad2211b218	fix(claude-code): wrap prompt history in XML tags to stop transcript fabrication Closes #4102. buildPromptFromContext previously serialized multi-turn history using literal [User] / [Assistant] / [System] bracket labels. Those tokens are the exact pattern the anti-fabrication rule in system.md and discuss.md forbids — the model saw its own input framed as a bracket- labeled transcript and mirrored the format in its output, inventing both sides of the conversation during /gsd discuss turns. Replace the bracket labels with XML-tag structure: - <conversation_history> wraps the whole turn sequence - <user_message> / <assistant_message> per turn - <prior_system_context> for the system prompt (renamed from <system_prompt> to avoid overlap with Claude Code's reserved <system-reminder> convention) Prepend a directive telling the model to respond only to the final user message and not emit the XML tags in its own response. Keep system.md and discuss.md in sync by documenting that prior context is delivered in those tags. Add regression tests asserting: - no literal [User]/[Assistant]/[System] substrings in the prompt - history wrapped in <conversation_history> with per-turn tags - directive leads the prompt - empty-history edge cases still render correctly	2026-04-13 01:23:47 -05:00
Jeremy McSpadden	3a529f7a95	Merge pull request #4100 from jeremymcs/claude/cleanup-mcp-stream-output-9uCeK Improve MCP tool rendering with name parsing and compact args	2026-04-13 00:54:38 -05:00
Claude	2d1081f1cc	fix: clean up MCP tool rendering in Claude Code CLI stream Strip the `mcp__<server>__` prefix from tool_use blocks emitted by the Claude Agent SDK so registered GSD extension renderers (gsd_plan_milestone, gsd_task_complete, etc.) match instead of falling through to the generic JSON-dump fallback. The original server name is preserved on the toolCall block under `mcpServer` for downstream rendering. Tighten the generic ToolExecutionComponent fallback for any remaining prefixed names (third-party MCP servers): show a muted `server·tool` title, render primitive args as compact `key=value` pairs, and truncate output to 10 lines when collapsed.	2026-04-13 05:46:35 +00:00
Jeremy	dc489f0a07	fix(mcp): resolve rebase regressions in stream-adapter Rename intermediateToolCalls → intermediateToolBlocks to match upstream rename, and pass onElicitation via extraOptions (4th arg) instead of overrides (3rd arg) in buildSdkOptions test.	2026-04-12 20:09:36 -05:00
Claude	1be15758ec	fix(mcp): thread abort signals, restore tool fidelity, and fix subpath imports Audit-driven fixes across the two MCP server surfaces and the Claude Code streaming adapter: - src/mcp-server.ts: propagate `extra.signal` into `tool.execute` so MCP clients can actually cancel long-running Bash/WebFetch/grep calls, and route the remaining `/server` subpath import through `createRequire` for #3603 consistency. - src/tests/mcp-createRequire.test.ts: extend regression coverage to the `/server` subpath. - claude-code-cli/stream-adapter.ts: (a) classify aborts as `aborted` instead of the retry-eligible `stream_exhausted_without_result`, (b) merge final-turn toolCall blocks from the builder into the AssistantMessage via the new `mergePendingToolCalls` helper so a turn ending in `tool_use` stop_reason no longer drops its tool calls, and (c) resolve the SDK permission mode via `resolveClaudePermissionMode` (auto-mode → bypass, interactive → acceptEdits, env override). - packages/mcp-server/src/server.ts: make `gsd_query` actually respect its `query` argument with known categories + forward-compatible fallback, and thread `extra.signal` into `gsd_execute` so an aborted RPC request cancels the newly-created session instead of leaking a background RpcClient process. - stream-adapter test suite: add regression tests for abort classification, final-turn tool-call merging, and permission mode resolution. Verified via: mcp-createRequire, stream-adapter (27), partial-builder, mcp-server package (31), workflow-tools (13) — 83 tests green. https://claude.ai/code/session_0174sYny3VvdwYTdCNTmY4Do	2026-04-12 20:04:47 -05:00
mastertyko	4189afe8a0	fix(claude-code-cli): surface result text for success errors	2026-04-12 14:03:29 +02:00
Jeremy	bf4bcfadde	fix(claude-code): harden MCP elicitation schema handling	2026-04-11 13:26:24 -05:00
Jeremy	1495e711e1	fix(claude-code): accept secure_env_collect MCP elicitation forms	2026-04-11 13:18:27 -05:00
Jeremy	74fee9ed48	fix(interactive): keep MCP tool output ordered and restore secure prompt fallback	2026-04-11 12:47:41 -05:00
Jeremy McSpadden	23f1e72868	Merge pull request #3975 from jeremymcs/fix/windows-portability-sweep fix(gsd): harden Claude Code workflow MCP bootstrap and guidance	2026-04-11 10:49:23 -05:00
Jeremy	4301a72522	fix(gsd): harden claude-code workflow MCP bootstrap Ensure startup/session/init paths auto-prepare workflow MCP for Claude Code. Disable native AskUserQuestion in Claude Code SDK options to avoid broken host prompts. Add explicit /gsd mcp init . guidance when workflow MCP is missing. Refs #3964	2026-04-11 08:35:19 -05:00
Jeremy	d64056f833	fix claude code mcp elicitation bridge	2026-04-10 19:24:51 -05:00
Jeremy	ac1a51ef55	fix: Claude Code MCP tool output rendering and real-time streaming - Stream tool results in real-time during Claude Code SDK sessions instead of deferring until session end. Tool calls (read, bash, write, etc.) now show their output as they complete, not collapsed as "..." - Stop suppressing toolcall_start/delta/end events from stream adapter so the TUI can render tool call progress during streaming - On SDK turn boundary (user message with tool results), push synthetic toolcall_end events with externalResult attached for immediate rendering - Chat controller checks for externalResult on toolcall_end message updates and calls updateResult on pending ToolExecutionComponents - Fix case-sensitive tool name matching (Read vs read, Bash vs bash) in TUI ToolExecutionComponent rendering - Auto-discover and pass GSD_WORKFLOW_EXECUTORS_MODULE and GSD_WORKFLOW_WRITE_GATE_MODULE env vars in MCP server launch config - Add /gsd mcp init command and auto-bootstrap .mcp.json for Claude Code provider during auto-start - Add tool_execution_update event type for web UI streaming updates - Add setStderrLoggingEnabled toggle for workflow logger	2026-04-10 06:12:44 -05:00
Jeremy McSpadden	6c708f7795	Merge pull request #3890 from jeremymcs/feat/workflow-mcp-provider-parity feat: expose GSD workflow tools over MCP for provider parity	2026-04-09 15:09:55 -05:00
Jeremy	20cbc1ed37	fix(gsd): enforce workflow write gates over MCP	2026-04-09 14:42:38 -05:00
Jeremy	c297559211	fix(mcp): harden workflow tool boundary	2026-04-09 14:29:15 -05:00
Jeremy	4ea87a33d6	feat: expose core GSD workflow tools over MCP	2026-04-09 11:30:02 -05:00
Jeremy	9b20b28a25	fix(claude-code-cli): suppress streamed internal tool noise	2026-04-09 09:51:28 -05:00
Jeremy	b4a0392464	fix(claude-code-cli): suppress internal tool call noise	2026-04-09 08:27:23 -05:00
Jeremy McSpadden	535d3a29da	Merge pull request #3775 from mastertyko/fix/3770-claude-code-windows-lookup fix(claude-code): use native Windows claude lookup	2026-04-08 21:53:42 -05:00
mastertyko	304b1bf329	fix(pi-ai): recover XML parameters trapped in JSON strings	2026-04-08 17:43:28 +02:00
Jeremy	ea456d4cdb	fix(providers): route Anthropic subscription users through Claude Code CLI (#3772 ) Anthropic now blocks third-party apps from using Pro/Max subscription quotas via direct API calls. This change makes the claude-code provider (which delegates to the local claude CLI binary) the default path for Anthropic subscription users — TOS-compliant because requests flow through Anthropic's own infrastructure. Changes: - Enhanced readiness check to verify CLI auth status (not just binary) - Startup migration: auto-switch anthropic → claude-code when CLI ready - Error recovery: auto-switch on third-party 400 block error - Onboarding: removed Anthropic from OAuth, added Claude CLI option - Added claude-code to flat-rate providers (no dynamic routing benefit) Closes #3772	2026-04-08 07:20:20 -05:00
mastertyko	120949db19	fix(claude-code): use native Windows claude lookup	2026-04-08 09:12:53 +02:00
Tom Boucher	2f3ffbfc10	fix: repair YAML bullet lists in malformed tool-call JSON (#3090 ) * fix: repair YAML bullet lists in malformed tool-call JSON (#2660) When LLMs copy YAML template formatting into tool-call arguments, they produce `"key": - item` instead of `"key": ["item"]`, causing JSON parse errors that block milestone completion. Add a repairToolJson() utility that detects and converts YAML-style bullet lists into JSON arrays before parsing. Integrated into both the PartialMessageBuilder (claude-code-cli) and the anthropic-shared streaming provider, with fallback in parseStreamingJson for all other providers. Closes #2660 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: use .js import extension in repair-tool-json test Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-30 14:37:09 -06:00
Tom Boucher	348c399c9d	fix: make claude-code provider stateful with full context and sidechain events (#2859 ) (#3254 ) The claude-code provider in gsd-pi was effectively stateless: it sent only the last user message, disabled session persistence, and filtered out all sidechain/subagent events. This made multi-turn conversations feel isolated and caused incomplete responses. - Replace extractLastUserPrompt with buildPromptFromContext that serialises the full conversation history (system prompt + all turns) - Change persistSession from false to true for session continuity - Remove parent_tool_use_id filtering so delegated/sidechain outputs are included in the final response - Extract buildSdkOptions for testability Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-30 13:52:14 -06:00
mastertyko	f2113f1353	fix: surface exhausted Claude SDK streams as errors (#2719 ) Treat Claude SDK generator exhaustion without a terminal result as a stream interruption instead of a successful completion. This prevents phantom-success auto-mode advances, keeps the failure classifiable as transient provider recovery, and adds regression tests for the fallback message plus provider classification. Closes #2575	2026-03-26 16:11:23 -06:00
mastertyko	43b1de6d59	fix: signal malformed tool arguments in toolcall_end event (#2647 ) When the API stream is truncated mid-tool-call, PartialMessageBuilder emits a toolcall_end event with { _raw: "<broken json>" } in the arguments — but the event looks identical to a healthy tool completion. Downstream consumers (error classifiers, tool handlers, activity log) have no way to distinguish a truncated call from a completed one. Add a malformedArguments: boolean flag to the toolcall_end event variant in AssistantMessageEvent. The flag is set to true only in the JSON parse catch path, so existing consumers (which do not check for it) are unaffected. New consumers like classifyProviderError can use it to handle truncated tool calls appropriately. Closes #2574	2026-03-26 08:15:16 -06:00
Lex Christopherson	bbea8460b5	fix(claude-code-cli): render tool calls above text response - Filter toolcall_start/delta/end events from streaming to prevent out-of-order rendering in the TUI's accumulated message content - Collect tool calls from intermediate SDK turns and include them BEFORE text content in the final AssistantMessage - The agent loop's externalToolExecution path emits proper tool_execution_start/end events for each intermediate tool call - Result: tool activity renders above the text response, not below Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-25 12:57:47 -06:00
Lex Christopherson	a0ee03d331	feat(agent-core): add externalToolExecution mode for external providers Adds `externalToolExecution` flag to AgentLoopConfig. When true, the agent loop emits tool_execution_start/end events for TUI rendering but skips local tool dispatch. Used by providers that handle tool execution internally (e.g., Claude Code CLI via Agent SDK). The flag is dynamically evaluated per-loop via a callback on AgentOptions, so model switches mid-session are handled correctly. Providers with authMode "externalCli" automatically use this mode. Also updates the Claude Code CLI stream adapter to preserve tool call blocks in the final message instead of stripping them. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-25 12:57:47 -06:00
Lex Christopherson	e8a7881307	fix(claude-code-cli): resolve SDK executable path and update model IDs - Add pathToClaudeCodeExecutable to SDK query options, resolving the system `claude` binary via `which claude`. Without this, the SDK looks for a bundled cli.js that doesn't exist when installed as a library dependency. - Remove env option that was replacing the subprocess environment and stripping auth credentials, causing "Not logged in" errors. - Update model IDs to current versions: claude-opus-4-6 (1M ctx), claude-sonnet-4-6 (1M ctx), claude-haiku-4-5 (200K ctx). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-25 12:22:40 -06:00
Lex Christopherson	c55d409991	feat(provider): add Claude Code CLI provider extension Implements Phase 1 of the Claude Code subscription-as-provider integration (issue #2509). Users with a Claude Code subscription (Pro/Max/Team) can use subsidized inference through GSD's UI via the official Agent SDK. The extension registers a provider with authMode: "externalCli" that delegates to the user's locally-installed claude CLI. The SDK runs the full agentic loop (multi-turn, tool execution) in one streamSimple call. Tool calls stream in real-time for TUI visibility but are stripped from the final AssistantMessage so the agent loop ends cleanly without local tool dispatch. Zero core changes — pure extension-based implementation. Closes #2509 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-25 12:07:08 -06:00

36 commits