Commit graph

30 commits

Author SHA1 Message Date
Mikael Hugo
19bfc3d3f6 feat(sf): align node sqlite uok runtime 2026-05-08 03:01:20 +02:00
Mikael Hugo
b5893d1c28 Make SF direct command surface baseline 2026-05-08 01:34:07 +02:00
Mikael Hugo
6fc054e7c3 sf snapshot: uncommitted changes after 49m inactivity 2026-05-08 01:07:24 +02:00
Mikael Hugo
89677b7e9b sf snapshot: uncommitted changes after 110m inactivity 2026-05-08 00:17:47 +02:00
Mikael Hugo
b0fce94f9e feat: record retrieval evidence across context tools 2026-05-07 18:17:41 +02:00
Mikael Hugo
a2184a0a0e feat: store judgment log in db 2026-05-07 06:22:07 +02:00
Mikael Hugo
2178aa8803 fix: isolate uok message bus db per project 2026-05-07 06:09:32 +02:00
Mikael Hugo
5c32d91124 feat: promote schedule and self-feedback state to db 2026-05-07 05:34:42 +02:00
Mikael Hugo
5bc3895586 feat: expose uok message bus metrics 2026-05-07 05:19:41 +02:00
Mikael Hugo
c0973ac287 fix: complete gate cost micro-usd migration 2026-05-07 05:07:57 +02:00
Mikael Hugo
7c39165c81 Tier 2.7: Migrate cost_usd to cost_micro_usd for accurate accounting
- Schema version bumped to 36
- Add migrateCostUsdToMicroUsd() helper for safe migration
- Convert cost_usd REAL to cost_micro_usd INTEGER in gate_runs
- Migration: multiply USD values by 1,000,000 to avoid float drift
- Update insertGateRun() to support cost_micro_usd field
- Old cost_usd column retained for backward compatibility

Benefits:
- Eliminates floating-point drift on accumulated costs
- Easier reasoning about cost totals
- Integer arithmetic is faster and more predictable
- Idempotent migration (safe to re-run)

Migration runs automatically on first database open for schema < 36.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
2026-05-07 05:04:35 +02:00
Mikael Hugo
9ceb0bf229 fix: store backlog items in db 2026-05-07 04:50:13 +02:00
Mikael Hugo
ffde54e05a fix: persist live planning specs in db 2026-05-07 04:44:09 +02:00
Mikael Hugo
79896b4377 Tier 1.3 Phase 4: Add evidence recording to plan and complete tools
- Updated plan-milestone, plan-slice, plan-task to record planning evidence
- Updated complete-milestone, complete-slice, complete-task to record completion evidence
- All evidence includes relevant spec fields (goals, narratives, decisions, etc.)
- Evidence recorded atomically within transactions
- Enables audit trail queries to reconstruct planning and completion decisions

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
2026-05-07 04:35:03 +02:00
Mikael Hugo
076e8c4894 Tier 1.3 Phase 3: Add evidence management API
Implements data layer functions for managing and querying spec/evidence data.

New export functions:
- insertMilestoneEvidence(): Append evidence for milestone
- insertSliceEvidence(): Append evidence for slice
- insertTaskEvidence(): Append evidence for task
- getMilestoneAuditTrail(): Query full audit trail (spec + evidence + runtime)
- getSliceAuditTrail(): Query slice audit trail with joined spec/evidence
- getTaskAuditTrail(): Query task audit trail with joined spec/evidence
- getMilestoneSpec(): Get spec only (immutable intent)
- getSliceSpec(): Get slice spec only
- getTaskSpec(): Get task spec only

Key properties:
- Evidence functions use timestamp for recording time (set at insertion)
- Audit trail queries JOIN runtime, spec, and evidence tables
- All queries support data archaeology (reconstruct decision history)
- Spec-only queries useful for validation and re-planning
- All functions include JSDoc with purpose and consumer

This completes Phase 3 of Tier 1.3 implementation. Phase 4 (tool updates) and
Phase 5 (integration tests) follow in next PRs.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
2026-05-07 04:24:31 +02:00
Mikael Hugo
f3761d7f46 Tier 1.3 Phase 2: Migrate existing data to spec tables
Implements automatic population of new spec tables from existing milestone/slice/task columns.

Migration function: populateSpecTablesFromExisting()
- Runs during schema v32 migration (first database open)
- Populates milestone_specs from existing milestone table spec columns
- Populates slice_specs from existing slice table spec columns
- Populates task_specs from existing task table spec columns
- Uses INSERT OR IGNORE to safely handle existing data
- Sets spec_version to 1 for all migrated specs
- Uses current timestamp for created_at if missing

Key properties:
- Non-destructive: existing runtime rows preserved
- Idempotent: safe to re-run (INSERT OR IGNORE)
- Evidence tables left empty: populated as tools create new evidence
- Evidence populated retroactively in future phase

This completes Phase 2 of Tier 1.3. Phases 3-5 (data layer updates, tool updates, tests) follow.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
2026-05-07 04:22:41 +02:00
Mikael Hugo
87aa04cf05 Tier 1.3: Add spec/runtime/evidence schema separation (v32)
Implements the 3-table normalization model for milestone, slice, and task entities:

- 9 new tables: {milestone,slice,task}_{specs,evidence} + runtime tables
- milestone_specs: immutable record of intent (vision, goals, risks, proof strategy)
- slice_specs: immutable slice-level intent
- task_specs: immutable task verification criteria
- {entity}_evidence: append-only audit trail with timestamps and phase metadata
- Indices on evidence tables for efficient chronological queries

Key improvements:
- Spec immutability: Write-once specs preserve original intent
- Audit trail: Evidence chain enables data archaeology and decision history
- Query efficiency: Each table contains only relevant columns
- Re-planning clarity: Multiple spec versions can exist for same entity ID
- Forensic capability: Timestamp + phase metadata on evidence rows

Migration:
- Schema version bumped to 32
- Migration runs on first open of existing databases
- No data loss; existing milestone/slice/task rows preserved
- Creates spec and evidence tables from existing columns (future work)

This is Phase 1 of Tier 1.3 implementation (schema definition + basic setup).
Phases 2-5 (migration, data layer updates, tool updates, tests) follow in next PRs.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
2026-05-07 04:20:32 +02:00
Mikael Hugo
a14cd0df29 chore: ignore generated sf eval outputs 2026-05-07 03:57:08 +02:00
Mikael Hugo
4cefa6de2a feat: persist SF runtime signals 2026-05-07 03:07:51 +02:00
Mikael Hugo
2d465b11fd test: add comprehensive Phase 1 coverage for dispatch loop (48 tests)
- Add metrics.test.ts: 21 tests for unit outcome recording, model performance tracking, fire-and-forget safety, persistence, error handling
- Add triage-self-feedback.test.ts: 27 tests for report classification, confidence thresholds, auto-fix, deduplication, severity categorization, async safety

Purpose: Increase coverage of critical autonomous dispatch paths from 40% to 60%+.
Covers fire-and-forget patterns (metrics recording and auto-fix application must not
block dispatch), concurrent recording safety, graceful degradation on error.

Tests validate:
  ✓ Unit outcome recording without blocking
  ✓ Per-task-type model performance tracking
  ✓ Fire-and-forget error handling (metrics/fixes don't break dispatch)
  ✓ Concurrent metric recording race conditions
  ✓ Persistence atomicity
  ✓ Report classification by type/severity
  ✓ Confidence thresholds (0.85-0.95 per type)
  ✓ Auto-fix deduplication and prioritization
  ✓ Async triage without blocking dispatch

Phase 1 complete: 48 tests, all passing.
Phase 2: Recovery path hardening (recovery/forensics)
Phase 3: Property-based FSM testing (fast-check)

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
2026-05-07 00:38:19 +02:00
Mikael Hugo
30f8738585 test: harden uok self-evolution paths 2026-05-06 22:55:35 +02:00
Mikael Hugo
6471e10245 sf snapshot: uncommitted changes after 64m inactivity 2026-05-06 16:28:31 +02:00
Mikael Hugo
d8570d059e sf snapshot: uncommitted changes after 38m inactivity 2026-05-06 14:48:15 +02:00
Mikael Hugo
a73ea845e7 sf snapshot: uncommitted changes after 61m inactivity 2026-05-06 10:04:20 +02:00
Mikael Hugo
95726c1789 sf snapshot: uncommitted changes after 39m inactivity 2026-05-06 09:02:38 +02:00
Mikael Hugo
7a13dd82b1 feat: persist solver eval evidence in db 2026-05-06 03:49:32 +02:00
Mikael Hugo
fbb61026fc fix: stabilize uok ledger and steering 2026-05-06 01:47:21 +02:00
Mikael Hugo
00a118ea71 chore: commit current workspace state 2026-05-05 14:46:18 +02:00
Mikael Hugo
f11c877224 style: format repository with biome 2026-05-05 14:31:16 +02:00
Mikael Hugo
47c806d733 fix: version sf extension runtime sources 2026-05-04 23:27:20 +02:00