espressovi/BODHI-rlvr
Viewer • Updated • 19k • 90
Artifacts for the paper titled "BODHI: Can LLMs Branch Out and Discover Heterogeneous Inferences?"
Note RLVR dataset, derived from "open-r1/DAPO-Math-17k-Processed" and https://github.com/understanding-search/maze-dataset.
Note Distillation dataset, derived from "open-r1/OpenThoughts-114k-math" and https://github.com/understanding-search/maze-dataset.