NVIDIA / BigVGAN
Official PyTorch implementation of BigVGAN (ICLR 2023)
☆1,051 Updated 9 months ago
Alternatives and similar repositories for BigVGAN
Users who are interested in BigVGAN are comparing it to the libraries listed below:
- Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch ☆656 Updated 9 months ago
- The Open Source Code of UniAudio ☆568 Updated 11 months ago
- Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis ☆944 Updated 10 months ago
- Audio Dataset for training CLAP and other models ☆688 Updated last year
- A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022) ☆473 Updated last year
- This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab. ☆589 Updated last year
- This repository is an implementation of this article: https://arxiv.org/pdf/2107.03312.pdf ☆395 Updated 3 years ago
- unofficial vits2-TTS implementation in pytorch ☆529 Updated last year
- Official Implementation of StyleTTS ☆436 Updated 5 months ago
- Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation ☆631 Updated last month
- This toolbox aims to unify audio generation model evaluation for easier comparison. ☆347 Updated 9 months ago
- An Open-source Streaming High-fidelity Neural Audio Codec ☆477 Updated 3 months ago
- Unified automatic quality assessment for speech, music, and sound. ☆522 Updated 3 weeks ago
- Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand". ☆440 Updated last year
- State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio. ☆1,491 Updated last week
- Learning audio concepts from natural language supervision ☆567 Updated 9 months ago
- FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion ☆671 Updated 5 months ago
- This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples a… ☆576 Updated last year
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates @ INTERSPEECH 2022 ☆294 Updated last year
- A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (… ☆437 Updated 2 years ago
- Voice Conversion With Just Nearest Neighbors ☆492 Updated last year
- Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate ☆618 Updated 7 months ago
- Keep track of big models in audio domain, including speech, singing, music etc. ☆485 Updated 9 months ago
- Contrastive Language-Audio Pretraining ☆1,710 Updated last month
- Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images. ☆765 Updated 9 months ago
- Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation ☆390 Updated last week
- Pytorch implementation of the CREPE pitch tracker ☆451 Updated last month
- General Speech Restoration ☆280 Updated last year
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration… ☆326 Updated 2 years ago
- AudioLDM training, finetuning, evaluation and inference. ☆256 Updated 6 months ago