Commit History

update: FAL results report with final eval numbers + conclusions
99c09f5
verified

rtferraz committed on

add: FAL demo results report (preliminary - eval crash pending fix)
f46c90b
verified

rtferraz committed on

add: Modal deployment lessons learned β€” TRL dependency hell postmortem
81195a7
verified

rtferraz committed on

add: ADR-003 Future-as-Label demo β€” detailed implementation plan with research validation
d75cbbf
verified

rtferraz committed on

add: V4.2 Final Report β€” complete project retrospective with evidence-based analysis
22cca8b
verified

rtferraz committed on

Create v4_2-handoff.md
d1385b0
verified

rtferraz committed on

docs: add V4.1 run report β€” detailed evaluation with per-task analysis and V4.2 roadmap
482efc4
verified

rtferraz committed on

Create v4_1-handoff.md
958f6d7
verified

rtferraz committed on

docs: add V4 run assessment with lessons learned and improvement roadmap
cfaf49c
verified

rtferraz committed on

ADR-002: V4 Instruct-Only GRPO - revises dual-model plan based on model repo audit
50e0e4d
verified

rtferraz committed on

Add comprehensive investigation report β€” performance audit, unexplored alternatives, literature-backed recommendations
4312bfd
verified

rtferraz committed on

Add session checkpoint: v3 launch decision with full context
bead5cb
verified

rtferraz committed on

Add v3 thinking control patch - task-aware system prompts + think efficiency reward
0f39df7
verified

rtferraz committed on

docs: add ADR-001 next steps with detailed execution plans
b47b36b
verified

rtferraz committed on

docs: add project documentation
aa71b0c
verified

rtferraz committed on