Replace the OpenAI-compat shim with a native Ollama /api/chat streaming
provider that exposes all commonly used Ollama options and surfaces
inference performance metrics.
Key changes (illustrative sketches of the new behavior follow the list):
- Native NDJSON streaming from /api/chat (no more OpenAI shim)
- Known models send num_ctx from a capability table; unknown models defer
to Ollama's default to avoid OOM on constrained hosts
- Exposes temperature, top_p, top_k, repeat_penalty, seed, num_gpu,
keep_alive, and num_predict via per-model providerOptions
- Extracts <think>...</think> blocks for reasoning models (deepseek-r1, qwq)
- Surfaces InferenceMetrics (tokens/sec, durations) on AssistantMessage
- Adds remove and show actions to the ollama_manage LLM tool
- Adds "ollama-chat" to KnownApi, providerOptions to Model<TApi>
- NDJSON parser uses strict mode for chat (fails on malformed frames)
- Chunks carrying both content and tool_calls are handled independently
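
A minimal sketch of the strict NDJSON loop, assuming illustrative
onText/onToolCall callbacks (not the real interfaces); the frame shape
follows Ollama's documented /api/chat response:

```ts
async function streamChat(
  body: ReadableStream<Uint8Array>,
  onText: (text: string) => void,
  onToolCall: (call: unknown) => void,
): Promise<void> {
  const decoder = new TextDecoder();
  const reader = body.getReader();
  let buf = "";

  const handleLine = (line: string) => {
    let frame: any;
    try {
      frame = JSON.parse(line);
    } catch {
      // Strict mode: abort on a malformed frame rather than skip it.
      throw new Error(`malformed NDJSON frame: ${line}`);
    }
    // One chunk may carry both content and tool_calls; emit each on its own.
    if (frame.message?.content) onText(frame.message.content);
    for (const call of frame.message?.tool_calls ?? []) onToolCall(call);
  };

  for (;;) {
    const { done, value } = await reader.read();
    if (done) break;
    buf += decoder.decode(value, { stream: true });
    let nl: number;
    while ((nl = buf.indexOf("\n")) >= 0) {
      const line = buf.slice(0, nl).trim();
      buf = buf.slice(nl + 1);
      if (line) handleLine(line);
    }
  }
  if (buf.trim()) handleLine(buf.trim()); // final frame without newline
}
```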
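
The num_ctx policy, sketched against a hypothetical MODEL_CAPS table
(entries here are illustrative, not the shipped values):

```ts
// Known models get an explicit num_ctx; unknown models send none, so
// Ollama falls back to its own default (safer on low-RAM hosts).
const MODEL_CAPS: Record<string, { numCtx: number }> = {
  "llama3.1:8b": { numCtx: 131072 },
  "qwen2.5-coder:7b": { numCtx: 32768 },
};

function buildOptions(model: string, opts: Record<string, unknown>) {
  const caps = MODEL_CAPS[model];
  return caps ? { ...opts, num_ctx: caps.numCtx } : { ...opts };
}
```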
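
The exposed fields map onto Ollama's /api/chat "options" object plus the
top-level keep_alive parameter; a sketch of the providerOptions shape:

```ts
interface OllamaProviderOptions {
  temperature?: number;
  top_p?: number;
  top_k?: number;
  repeat_penalty?: number;
  seed?: number;
  num_gpu?: number;       // layers offloaded to GPU
  num_predict?: number;   // max tokens to generate
  keep_alive?: string | number; // e.g. "5m", or seconds; sent top-level
}
```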
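
Reasoning extraction, sketched as a post-hoc split (the shipped code may
instead do this incrementally during streaming):

```ts
// Pull <think>...</think> blocks (emitted by deepseek-r1 / qwq) out of
// the response text, returning reasoning and answer separately.
function splitThinking(text: string): { reasoning: string; answer: string } {
  const blocks: string[] = [];
  const answer = text.replace(/<think>([\s\S]*?)<\/think>/g, (_, body) => {
    blocks.push(body.trim());
    return "";
  });
  return { reasoning: blocks.join("\n\n"), answer: answer.trim() };
}
```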
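
Metrics derive from the counters Ollama reports on the final frame, where
durations are in nanoseconds; the InferenceMetrics field names here are
assumed for illustration:

```ts
interface InferenceMetrics {
  promptTokens: number;
  completionTokens: number;
  tokensPerSecond: number;
  totalDurationMs: number;
  loadDurationMs: number;
}

// Map Ollama's final-frame counters to the metrics attached to
// AssistantMessage. eval_count / (eval_duration in seconds) = tokens/sec.
function toMetrics(final: {
  prompt_eval_count: number;
  eval_count: number;
  eval_duration: number;
  total_duration: number;
  load_duration: number;
}): InferenceMetrics {
  return {
    promptTokens: final.prompt_eval_count,
    completionTokens: final.eval_count,
    tokensPerSecond: final.eval_count / (final.eval_duration / 1e9),
    totalDurationMs: final.total_duration / 1e6,
    loadDurationMs: final.load_duration / 1e6,
  };
}
```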
Closes #3544