GigaSpeech Series Collection Evolving, Large-Scale, and Multi-domain ASR Corpus • 6 items • Updated about 24 hours ago
UAT: Unified Audio-Text Diffusion for Audio Generation, Editing, and Captioning Paper • 2606.04939 • Published 24 days ago
Evaluating the Expressive Appropriateness of Speech in Rich Contexts Paper • 2605.09413 • Published May 10 • 5
WavCube: Unifying Speech Representation for Understanding and Generation via Semantic-Acoustic Joint Modeling Paper • 2605.06407 • Published May 7
Representation-Regularized Convolutional Audio Transformer for Audio Understanding Paper • 2601.21612 • Published Jan 29 • 1
GigaSpeech Series Collection Evolving, Large-Scale, and Multi-domain ASR Corpus • 6 items • Updated about 24 hours ago
GigaSpeech Series Collection Evolving, Large-Scale, and Multi-domain ASR Corpus • 6 items • Updated about 24 hours ago