Commit graph

7 commits

Author SHA1 Message Date
Mikael Hugo
7287490cfd fix: enhance missing-checkpoint repair with better low-confidence guidance
- Add explicit low-confidence reconstruction guidance for no-transcript cases
- Clarify when to use outcome='decide' when confidence < 0.98
- Fix typo in repair prompt ('what was was expected' -> 'what was expected')
- Strengthen final human-acceptance-gate guidance to prefer outcome='decide'
- Addresses solver-missing-checkpoint self-feedback entry acceptance criteria

Resolves: sf-mowykewh-3ehn5p
2026-05-08 15:47:00 +02:00
Mikael Hugo
89677b7e9b sf snapshot: uncommitted changes after 110m inactivity 2026-05-08 00:17:47 +02:00
Mikael Hugo
deeb4dbd4e sf snapshot: uncommitted changes after 61m inactivity 2026-05-07 16:39:39 +02:00
Mikael Hugo
4f217cc88c docs: promote sf state guidance 2026-05-07 03:59:38 +02:00
Mikael Hugo
a14cd0df29 chore: ignore generated sf eval outputs 2026-05-07 03:57:08 +02:00
Mikael Hugo
14c59a7583 Phase 3: Property-based FSM tests (17 passing tests)
- Created src/resources/extensions/sf/tests/phases-fsm.test.ts
- 17 comprehensive property-based tests using fast-check
- FSM invariants verified: terminal states, no invalid transitions, dispatch termination
- State transition correctness validated for all paths (pending→running→done, etc.)
- Performance tests confirm sub-1s processing for 500+ concurrent units
- Tests confirm BLOCKED state is non-terminal (can retry after unblock)
- All tests passing 

Phase 3 completes test coverage roadmap: 40% → 60%+ coverage target
- Phase 1: 48 tests (metrics + triage) ✓
- Phase 2: 31 tests (crash recovery) ✓
- Phase 3: 17 tests (property-based FSM) ✓

Total this session: 104 new tests, all passing

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
2026-05-07 01:01:04 +02:00
Mikael Hugo
fec30b8278 chore: init sf 2026-05-06 23:03:20 +02:00