SPEAR: A Unified SSL Framework for Learning Speech and Audio Representations Paper • 2510.25955 • Published Oct 29, 2025 • 1
OccuBench: Evaluating AI Agents on Real-World Professional Tasks via Language World Models Paper • 2604.10866 • Published 18 days ago • 63
Representation-Regularized Convolutional Audio Transformer for Audio Understanding Paper • 2601.21612 • Published Jan 29 • 1
Typhoon ASR Real-time: FastConformer-Transducer for Thai Automatic Speech Recognition Paper • 2601.13044 • Published Jan 19 • 12
GigaSpeech Series Collection Evolving, Large-Scale, and Multi-domain ASR Corpus • 5 items • Updated Mar 28
k2SSL Collection A Faster and Better Framework for Self-Supervised Speech Representation Learning • 5 items • Updated Jan 20