None defined yet.
SciPredict: Can LLMs Predict the Outcomes of Scientific Experiments in Natural Sciences?
Agentic Rubrics as Contextual Verifiers for SWE Agents