
PanChanghao

DavidPigeon · https://david-pigeon.github.io/

AI & ML interests

audio synthesis

Recent Activity

upvoted a paper 5 days ago
Towards Streaming Synchronized Spatial Audio Generation via Autoregressive Diffusion Transformer
upvoted a paper 5 days ago
Comprehensive Benchmarking of Long-Form Speech Generation in Diverse Scenarios
upvoted a paper 5 days ago
SwanVoice: Expressive Long-Form Zero-Shot Speech Synthesis for Both Monologue and Dialogue

Organizations

Zhejiang University

liked a Space 12 days ago

ACL Pubcheck 📝

Running • 85
Check your PDF for ACL guidelines

liked a Space 4 months ago

Qwen3-TTS Demo 🎙

Paused • Agents • Featured • 1.94k
Generate custom speech from text, voice descriptions, or samples

liked a model 5 months ago

stepfun-ai/Step-Audio-R1.1

Audio-Text-to-Text • 33B • Updated Feb 14 • 231 • 181
liked a Space 5 months ago

Fun-ASR-Nano 🚀

Running • Agents • 21
LLM-powered ASR: 31 languages, Chinese dialects, timestamps

liked a model 5 months ago

nvidia/bigvgan_v2_24khz_100band_256x

Audio-to-Audio • Updated Sep 5, 2024 • 98.8k • 22
liked a dataset 10 months ago

OpenSound/CapSpeech

Viewer • Updated Jun 4, 2025 • 20.8M • 782 • 24