shraey/zonos2-mlx

Text-to-Speech
MLX
Safetensors
Zonos
English
apple-silicon
tts
voice-cloning
zonos2
Mixture of Experts

Instructions to use shraey/zonos2-mlx with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

  • Libraries
  • MLX

    How to use shraey/zonos2-mlx with MLX:

    # Download the model from the Hub
    pip install "huggingface_hub[hf_xet]"   # quoted so shells such as zsh do not glob the brackets
    
    huggingface-cli download --local-dir zonos2-mlx shraey/zonos2-mlx
  • Zonos

    How to use shraey/zonos2-mlx with Zonos:

    # pip install git+https://github.com/Zyphra/Zonos.git
    import torchaudio
    from zonos.model import Zonos
    from zonos.conditioning import make_cond_dict
    
    model = Zonos.from_pretrained("shraey/zonos2-mlx", device="cuda")
    
    wav, sr = torchaudio.load("speaker.wav")           # 5-10s reference clip
    speaker = model.make_speaker_embedding(wav, sr)
    
    cond  = make_cond_dict(text="Hello, world!", speaker=speaker, language="en-us")
    codes = model.generate(model.prepare_conditioning(cond))
    
    audio = model.autoencoder.decode(codes)[0].cpu()
    torchaudio.save("sample.wav", audio, model.autoencoder.sampling_rate)
    
  • Notebooks
  • Google Colab
  • Kaggle
  • Local Apps
  • LM Studio
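
The Zonos snippet above notes that the speaker reference should be a 5-10 s clip. A minimal sketch of checking a clip's length before embedding it is shown here. It assumes the reference is an uncompressed PCM WAV file (the standard-library `wave` module reads nothing else), and the helper names `clip_seconds` and `is_good_reference` are hypothetical, not part of Zonos.

```python
import wave


def clip_seconds(path):
    """Return the duration of a PCM WAV file in seconds."""
    with wave.open(path, "rb") as wf:
        return wf.getnframes() / wf.getframerate()


def is_good_reference(seconds, shortest=5.0, longest=10.0):
    """True if the duration falls inside the 5-10 s window from the snippet's comment."""
    return shortest <= seconds <= longest


# Example (hypothetical file name):
# if not is_good_reference(clip_seconds("speaker.wav")):
#     print("Reference clip is outside the suggested 5-10 s range")
```

For other formats (MP3, FLAC), the same check could be derived from the tensor that `torchaudio.load` already returns in the snippet: `wav.shape[-1] / sr`.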
zonos2-mlx
30.3 GB
  • 1 contributor
History: 6 commits
shraey
Model card: use upstream's 'voice cloning' wording
c5a034e verified 10 days ago
  • bf16: Add bf16 tier (10 days ago)
  • int4: Add int4 tier (10 days ago)
  • int8: Add int8 tier (10 days ago)
  • .gitattributes (1.52 kB): initial commit (10 days ago)
  • README.md (4.72 kB): Model card: use upstream's 'voice cloning' wording (10 days ago)
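
The repository totals 30.3 GB across three tiers, so fetching a single tier may be preferable to downloading everything. The sketch below uses `snapshot_download` from `huggingface_hub` with `allow_patterns`. It assumes `bf16`, `int4`, and `int8` are folders whose contents match `<tier>/*`, which the listing suggests but does not confirm. The helper names `tier_patterns` and `download_tier` are hypothetical.

```python
def tier_patterns(tier, extras=("README.md",)):
    """Glob patterns selecting one tier folder plus a few top-level files.

    Assumption: each tier (bf16, int4, int8) is a folder in the repo.
    """
    return [f"{tier}/*", *extras]


def download_tier(tier="int4", local_dir="zonos2-mlx"):
    """Download only the files for one tier and return the local path."""
    # Imported lazily so tier_patterns() works without the package installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id="shraey/zonos2-mlx",
        allow_patterns=tier_patterns(tier),
        local_dir=local_dir,
    )


# Example: fetch just the int4 weights into ./zonos2-mlx
# path = download_tier("int4")
```

Files that match none of the patterns are skipped, so the download size depends on the chosen tier rather than on the whole repository.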