MA-ProofBench: A Two-Tiered Evaluation of LLMs for Theorem Proving in Mathematical Analysis Paper • 2606.13782 • Published 14 days ago • 2
MA-ProofBench: A Two-Tiered Evaluation of LLMs for Theorem Proving in Mathematical Analysis Paper • 2606.13782 • Published 14 days ago • 2
UltraEval-Audio: A Unified Framework for Comprehensive Evaluation of Audio Foundation Models Paper • 2601.01373 • Published Jan 4 • 1
FactNet: A Billion-Scale Knowledge Graph for Multilingual Factual Grounding Paper • 2602.03417 • Published Feb 3
MiniCPM-SALA: Hybridizing Sparse and Linear Attention for Efficient Long-Context Modeling Paper • 2602.11761 • Published Feb 12 • 8
UltraData Collection Ultra Scale, Ultra Quality, Ultra Coverage • 11 items • Updated 28 days ago • 98