Collection of models and datasets for Beyond Binary Rewards: Training LMs to Reason about their Uncertainty
Mehul Damani PRO
mehuldamani
AI & ML interests
Reinforcement Learning, Large Language Models
Recent Activity
published a dataset about 23 hours ago
mehuldamani/neurips-story-test-v3 published a dataset about 23 hours ago
mehuldamani/neurips-bug-eval-v4 published a dataset about 23 hours ago
mehuldamani/neurips-grammarly-eval-v1Organizations
None yet