Integration of 3 quick wins into existing UOK infrastructure: 1. Model Learning (Quick Win #2) → metrics.js - Record outcomes to model-learner for per-task-type performance tracking - Hook: recordUnitOutcome() now calls ModelLearner.recordOutcome() - Fire-and-forget: never blocks outcome recording on learning failure - Enables adaptive model routing decisions in downstream gates 2. Self-Report Fixing (Quick Win #1) → triage-self-feedback.js - Auto-fix high-confidence reports (>0.85) in applyTriageReport() - Hook: After triage and requirement promotion, apply auto-fixes - Fire-and-forget: never blocks report application on fix failure - Returns reportsAutoFixed count for triage metrics 3. Knowledge Injection (Quick Win #3) → already integrated in auto-prompts.js - Already active in execute-task prompt template - Semantic matching with graceful degradation All integration points: - Fire-and-forget: learning/fixing failures never block dispatch - UOK-native: use existing outcome recording, db, gates - Backward compatible: applyTriageReport now async, but callers handle it - No new dependencies: all modules already in codebase Testing: 2934 tests pass (no regressions from integration) Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
4.4 KiB
4.4 KiB
TODO Inbox Triage Plan — 2026-05-06
Summary
Root TODO.md is a raw dump inbox, not a roadmap. The 2026-05-06 dump has been promoted into this durable plan and cross-referenced with existing roadmap/design documents. Future agents should use this plan and the referenced docs instead of treating the old raw dump as instruction.
Existing Durable Homes
These raw notes already had a suitable home and should be continued there:
| Raw note | Durable home | Disposition |
|---|---|---|
| Product-audit phase auto-fire | BUILD_PLAN.md Tier 1+ active follow-up |
Existing roadmap item |
| Extended config-overlay keys | BUILD_PLAN.md Tier 1.4 |
Existing v3 blocker |
| Architecture doc auto-update | BUILD_PLAN.md Tier 2.2 |
Existing strong item |
| Semantic checkpoint chapters | BUILD_PLAN.md Tier 2.3 |
Existing v3.1 item |
| Custom Anthropic SSE parser | BUILD_PLAN.md Tier 0 |
Existing deferred port |
| Symlinked package dedup | BUILD_PLAN.md Tier 0 |
Existing port item |
Extension API setWorkingVisible() |
BUILD_PLAN.md Tier 0 |
Existing port item |
| Cloudflare Workers AI provider | BUILD_PLAN.md Tier 0 |
Existing provider item |
| Azure Cognitive Services base URL | BUILD_PLAN.md Tier 0 |
Existing provider item |
| Local LLM SSE timeout | BUILD_PLAN.md Tier 0 |
Already marked done |
| Bedrock inference profile normalization | BUILD_PLAN.md Tier 0 |
Already marked done |
| gsd-2 safety/correctness ports | BUILD_PLAN.md Tier 0.5 |
Existing critical-port list |
| Self-report feedback loop | docs/dev/UOK-SELF-EVOLUTION.md quick win 1 |
Existing self-evolution plan |
| Continuous model learning | docs/dev/UOK-SELF-EVOLUTION.md quick win 2 |
Existing self-evolution plan |
| Automated knowledge injection | docs/dev/UOK-SELF-EVOLUTION.md quick win 3 |
Existing self-evolution plan |
| Gate pattern aggregation, adaptive thresholds, hypothesis testing, regression detection | docs/dev/UOK-SELF-EVOLUTION.md medium-term roadmap |
Existing self-evolution plan |
Newly Promoted Roadmap Items
These were not clearly represented as durable roadmap items and should be planned as slices before implementation:
| Item | Why | Suggested tier | Implementation note |
|---|---|---|---|
| Typed SF environment schema | SF_* env vars should fail early with actionable diagnostics instead of late runtime surprises. |
Tier 1 | Add an SF-owned env schema module and route startup/tool validation through it. |
| Autonomous-path coverage ratchet | Global coverage thresholds are too broad; autonomous/recovery paths need higher targeted confidence. | Tier 2 | Started with focused DB-authority/UOK runtime suites; continue with dispatch and recovery families before changing global thresholds. |
| End-to-end milestone lifecycle tests | DB-only runtime state needs integration proof across plan, execute, validate, and complete. | Done | Added runtime-state regression coverage proving SQLite slice/task order stays authoritative over stale markdown/JSON projections, and DB-backed runtime refuses implicit roadmap, plan, and summary imports. |
| Fault-injection recovery tests | Stuck-loop, timeout, runaway, stale lock, and projection drift recovery are high-risk paths. | Tier 2 | Add deterministic fault fixtures before adding broader chaos coverage. |
| MCP server residue/docs cleanup | SF currently ships the MCP client extension only; tracked MCP server source was removed. | Done | Removed untracked packages/mcp-server/ residue and updated durable docs so future work never recreates an SF MCP server. |
| Biome schema version cleanup | Tooling drift creates noisy lint/config failures. | Tier 3 | Run biome migrate as a focused tooling cleanup. |
| Headless assistant-text preview completion | Prior headless work deferred buffer separation. | Tier 2 | Finish assistantTextBuffer / thinkingBuffer separation and preview flushing. |
Explicitly Deferred
| Item | Decision |
|---|---|
auto-dispatch.js / auto-prompts.js decomposition |
Known design debt, but explicitly out of scope until requested as a dedicated refactor. Do not start it while fixing DB authority, UOK safety, or roadmap triage. |
Acceptance Criteria
TODO.mdcontains no untriaged raw notes.- New work starts from this plan,
BUILD_PLAN.md, ordocs/dev/UOK-SELF-EVOLUTION.md, not from deleted raw dump text. - Items that need implementation are converted into SF milestone/slice/task state before code changes begin.