SpeechColab

non-profit

Activity Feed Request to join this org

AI & ML interests

Machine Learning for Audio/Speech

Recent Activity

yfyeung new activity about 3 hours ago

speechcolab/gigaspeech-test:[bot] Conversion to Parquet

yfyeung updated a collection about 24 hours ago

GigaSpeech Series

yfyeung updated a dataset about 24 hours ago

speechcolab/gigaspeech-test

View all activity

in speechcolab/gigaspeech-test about 3 hours ago

[bot] Conversion to Parquet

#1 opened about 11 hours ago by

parquet-converter

updated a collection about 24 hours ago

GigaSpeech Series

Evolving, Large-Scale, and Multi-domain ASR Corpus • 6 items • Updated about 24 hours ago

updated a dataset about 24 hours ago

speechcolab/gigaspeech-test

Viewer • Updated about 23 hours ago • 19.9k • 7

published a dataset about 24 hours ago

speechcolab/gigaspeech-test

Viewer • Updated about 23 hours ago • 19.9k • 7

in speechcolab/gigaspeech2 1 day ago

Can't load the `dev` and `test`

#8 opened 3 months ago by

authored a paper 19 days ago

MMAE: A Massive Multitask Audio Editing Benchmark

Paper • 2606.07229 • Published 22 days ago • 46

authored a paper 23 days ago

UAT: Unified Audio-Text Diffusion for Audio Generation, Editing, and Captioning

Paper • 2606.04939 • Published 24 days ago

authored a paper about 1 month ago

Evaluating the Expressive Appropriateness of Speech in Rich Contexts

Paper • 2605.09413 • Published May 10 • 5

authored a paper about 2 months ago

WavCube: Unifying Speech Representation for Understanding and Generation via Semantic-Acoustic Joint Modeling

Paper • 2605.06407 • Published May 7

authored a paper 3 months ago

Representation-Regularized Convolutional Audio Transformer for Audio Understanding

Paper • 2601.21612 • Published Jan 29 • 1

authored a paper 3 months ago

Voxtral TTS

Paper • 2603.25551 • Published Mar 26 • 63

updated a collection 3 months ago

GigaSpeech Series

Evolving, Large-Scale, and Multi-domain ASR Corpus • 6 items • Updated about 24 hours ago

in speechcolab/gigaspeech2 3 months ago

[Help Wanted] Support for GigaSpeech 2 Splits

#4 opened about 2 years ago by

updated 2 datasets 3 months ago

speechcolab/gigaspeech2

Viewer • Updated Mar 26 • 27.2M • 37.8k • 64

speechcolab/gigaspeech2-test

Updated Mar 26 • 42

authored a paper 4 months ago

Voxtral Realtime

Paper • 2602.11298 • Published Feb 11 • 28

in speechcolab/gigaspeech 5 months ago

Convert dataset to Parquet

#20 opened 5 months ago by

in speechcolab/gigaspeech 5 months ago

test

#14 opened 8 months ago by

updated a collection 5 months ago

GigaSpeech Series

Evolving, Large-Scale, and Multi-domain ASR Corpus • 6 items • Updated about 24 hours ago