Mikael Hugo 553ba23b89 integrate: hook quick wins into UOK dispatch loop

Integration of 3 quick wins into existing UOK infrastructure:

1. Model Learning (Quick Win #2) → metrics.js
   - Record outcomes to model-learner for per-task-type performance tracking
   - Hook: recordUnitOutcome() now calls ModelLearner.recordOutcome()
   - Fire-and-forget: never blocks outcome recording on learning failure
   - Enables adaptive model routing decisions in downstream gates

2. Self-Report Fixing (Quick Win #1) → triage-self-feedback.js
   - Auto-fix high-confidence reports (>0.85) in applyTriageReport()
   - Hook: After triage and requirement promotion, apply auto-fixes
   - Fire-and-forget: never blocks report application on fix failure
   - Returns reportsAutoFixed count for triage metrics

3. Knowledge Injection (Quick Win #3) → already integrated in auto-prompts.js
   - Already active in execute-task prompt template
   - Semantic matching with graceful degradation

All integration points:
- Fire-and-forget: learning/fixing failures never block dispatch
- UOK-native: use existing outcome recording, db, gates
- Backward compatible: applyTriageReport now async, but callers handle it
- No new dependencies: all modules already in codebase

Testing: 2934 tests pass (no regressions from integration)

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

2026-05-06 22:34:41 +02:00

4.4 KiB

Raw Permalink Blame History

TODO Inbox Triage Plan — 2026-05-06

Summary

Root TODO.md is a raw dump inbox, not a roadmap. The 2026-05-06 dump has been promoted into this durable plan and cross-referenced with existing roadmap/design documents. Future agents should use this plan and the referenced docs instead of treating the old raw dump as instruction.

Existing Durable Homes

These raw notes already had a suitable home and should be continued there:

Raw note	Durable home	Disposition
Product-audit phase auto-fire	`BUILD_PLAN.md` Tier 1+ active follow-up	Existing roadmap item
Extended config-overlay keys	`BUILD_PLAN.md` Tier 1.4	Existing v3 blocker
Architecture doc auto-update	`BUILD_PLAN.md` Tier 2.2	Existing strong item
Semantic checkpoint chapters	`BUILD_PLAN.md` Tier 2.3	Existing v3.1 item
Custom Anthropic SSE parser	`BUILD_PLAN.md` Tier 0	Existing deferred port
Symlinked package dedup	`BUILD_PLAN.md` Tier 0	Existing port item
Extension API `setWorkingVisible()`	`BUILD_PLAN.md` Tier 0	Existing port item
Cloudflare Workers AI provider	`BUILD_PLAN.md` Tier 0	Existing provider item
Azure Cognitive Services base URL	`BUILD_PLAN.md` Tier 0	Existing provider item
Local LLM SSE timeout	`BUILD_PLAN.md` Tier 0	Already marked done
Bedrock inference profile normalization	`BUILD_PLAN.md` Tier 0	Already marked done
gsd-2 safety/correctness ports	`BUILD_PLAN.md` Tier 0.5	Existing critical-port list
Self-report feedback loop	`docs/dev/UOK-SELF-EVOLUTION.md` quick win 1	Existing self-evolution plan
Continuous model learning	`docs/dev/UOK-SELF-EVOLUTION.md` quick win 2	Existing self-evolution plan
Automated knowledge injection	`docs/dev/UOK-SELF-EVOLUTION.md` quick win 3	Existing self-evolution plan
Gate pattern aggregation, adaptive thresholds, hypothesis testing, regression detection	`docs/dev/UOK-SELF-EVOLUTION.md` medium-term roadmap	Existing self-evolution plan

Newly Promoted Roadmap Items

These were not clearly represented as durable roadmap items and should be planned as slices before implementation:

Item	Why	Suggested tier	Implementation note
Typed SF environment schema	`SF_*` env vars should fail early with actionable diagnostics instead of late runtime surprises.	Tier 1	Add an SF-owned env schema module and route startup/tool validation through it.
Autonomous-path coverage ratchet	Global coverage thresholds are too broad; autonomous/recovery paths need higher targeted confidence.	Tier 2	Started with focused DB-authority/UOK runtime suites; continue with dispatch and recovery families before changing global thresholds.
End-to-end milestone lifecycle tests	DB-only runtime state needs integration proof across plan, execute, validate, and complete.	Done	Added runtime-state regression coverage proving SQLite slice/task order stays authoritative over stale markdown/JSON projections, and DB-backed runtime refuses implicit roadmap, plan, and summary imports.
Fault-injection recovery tests	Stuck-loop, timeout, runaway, stale lock, and projection drift recovery are high-risk paths.	Tier 2	Add deterministic fault fixtures before adding broader chaos coverage.
MCP server residue/docs cleanup	SF currently ships the MCP client extension only; tracked MCP server source was removed.	Done	Removed untracked `packages/mcp-server/` residue and updated durable docs so future work never recreates an SF MCP server.
Biome schema version cleanup	Tooling drift creates noisy lint/config failures.	Tier 3	Run `biome migrate` as a focused tooling cleanup.
Headless assistant-text preview completion	Prior headless work deferred buffer separation.	Tier 2	Finish `assistantTextBuffer` / `thinkingBuffer` separation and preview flushing.

Explicitly Deferred

Item	Decision
`auto-dispatch.js` / `auto-prompts.js` decomposition	Known design debt, but explicitly out of scope until requested as a dedicated refactor. Do not start it while fixing DB authority, UOK safety, or roadmap triage.

Acceptance Criteria

TODO.md contains no untriaged raw notes.
New work starts from this plan, BUILD_PLAN.md, or docs/dev/UOK-SELF-EVOLUTION.md, not from deleted raw dump text.
Items that need implementation are converted into SF milestone/slice/task state before code changes begin.

4.4 KiB Raw Permalink Blame History