singularity-forge/docs/plans/todo-triage-2026-05-06-plan.md
Mikael Hugo 553ba23b89 integrate: hook quick wins into UOK dispatch loop
Integration of 3 quick wins into existing UOK infrastructure:

1. Model Learning (Quick Win #2) → metrics.js
   - Record outcomes to model-learner for per-task-type performance tracking
   - Hook: recordUnitOutcome() now calls ModelLearner.recordOutcome()
   - Fire-and-forget: never blocks outcome recording on learning failure
   - Enables adaptive model routing decisions in downstream gates

2. Self-Report Fixing (Quick Win #1) → triage-self-feedback.js
   - Auto-fix high-confidence reports (>0.85) in applyTriageReport()
   - Hook: After triage and requirement promotion, apply auto-fixes
   - Fire-and-forget: never blocks report application on fix failure
   - Returns reportsAutoFixed count for triage metrics

3. Knowledge Injection (Quick Win #3) → already integrated in auto-prompts.js
   - Already active in execute-task prompt template
   - Semantic matching with graceful degradation

All integration points:
- Fire-and-forget: learning/fixing failures never block dispatch
- UOK-native: use existing outcome recording, db, gates
- Backward compatible: applyTriageReport now async, but callers handle it
- No new dependencies: all modules already in codebase

Testing: 2934 tests pass (no regressions from integration)

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
2026-05-06 22:34:41 +02:00

4.4 KiB

TODO Inbox Triage Plan — 2026-05-06

Summary

Root TODO.md is a raw dump inbox, not a roadmap. The 2026-05-06 dump has been promoted into this durable plan and cross-referenced with existing roadmap/design documents. Future agents should use this plan and the referenced docs instead of treating the old raw dump as instruction.

Existing Durable Homes

These raw notes already had a suitable home and should be continued there:

Raw note Durable home Disposition
Product-audit phase auto-fire BUILD_PLAN.md Tier 1+ active follow-up Existing roadmap item
Extended config-overlay keys BUILD_PLAN.md Tier 1.4 Existing v3 blocker
Architecture doc auto-update BUILD_PLAN.md Tier 2.2 Existing strong item
Semantic checkpoint chapters BUILD_PLAN.md Tier 2.3 Existing v3.1 item
Custom Anthropic SSE parser BUILD_PLAN.md Tier 0 Existing deferred port
Symlinked package dedup BUILD_PLAN.md Tier 0 Existing port item
Extension API setWorkingVisible() BUILD_PLAN.md Tier 0 Existing port item
Cloudflare Workers AI provider BUILD_PLAN.md Tier 0 Existing provider item
Azure Cognitive Services base URL BUILD_PLAN.md Tier 0 Existing provider item
Local LLM SSE timeout BUILD_PLAN.md Tier 0 Already marked done
Bedrock inference profile normalization BUILD_PLAN.md Tier 0 Already marked done
gsd-2 safety/correctness ports BUILD_PLAN.md Tier 0.5 Existing critical-port list
Self-report feedback loop docs/dev/UOK-SELF-EVOLUTION.md quick win 1 Existing self-evolution plan
Continuous model learning docs/dev/UOK-SELF-EVOLUTION.md quick win 2 Existing self-evolution plan
Automated knowledge injection docs/dev/UOK-SELF-EVOLUTION.md quick win 3 Existing self-evolution plan
Gate pattern aggregation, adaptive thresholds, hypothesis testing, regression detection docs/dev/UOK-SELF-EVOLUTION.md medium-term roadmap Existing self-evolution plan

Newly Promoted Roadmap Items

These were not clearly represented as durable roadmap items and should be planned as slices before implementation:

Item Why Suggested tier Implementation note
Typed SF environment schema SF_* env vars should fail early with actionable diagnostics instead of late runtime surprises. Tier 1 Add an SF-owned env schema module and route startup/tool validation through it.
Autonomous-path coverage ratchet Global coverage thresholds are too broad; autonomous/recovery paths need higher targeted confidence. Tier 2 Started with focused DB-authority/UOK runtime suites; continue with dispatch and recovery families before changing global thresholds.
End-to-end milestone lifecycle tests DB-only runtime state needs integration proof across plan, execute, validate, and complete. Done Added runtime-state regression coverage proving SQLite slice/task order stays authoritative over stale markdown/JSON projections, and DB-backed runtime refuses implicit roadmap, plan, and summary imports.
Fault-injection recovery tests Stuck-loop, timeout, runaway, stale lock, and projection drift recovery are high-risk paths. Tier 2 Add deterministic fault fixtures before adding broader chaos coverage.
MCP server residue/docs cleanup SF currently ships the MCP client extension only; tracked MCP server source was removed. Done Removed untracked packages/mcp-server/ residue and updated durable docs so future work never recreates an SF MCP server.
Biome schema version cleanup Tooling drift creates noisy lint/config failures. Tier 3 Run biome migrate as a focused tooling cleanup.
Headless assistant-text preview completion Prior headless work deferred buffer separation. Tier 2 Finish assistantTextBuffer / thinkingBuffer separation and preview flushing.

Explicitly Deferred

Item Decision
auto-dispatch.js / auto-prompts.js decomposition Known design debt, but explicitly out of scope until requested as a dedicated refactor. Do not start it while fixing DB authority, UOK safety, or roadmap triage.

Acceptance Criteria

  • TODO.md contains no untriaged raw notes.
  • New work starts from this plan, BUILD_PLAN.md, or docs/dev/UOK-SELF-EVOLUTION.md, not from deleted raw dump text.
  • Items that need implementation are converted into SF milestone/slice/task state before code changes begin.