haoheliu / versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
☆1,408 Updated 2 months ago
Alternatives and similar repositories for versatile_audio_super_resolution:
Users who are interested in versatile_audio_super_resolution are comparing it to the libraries listed below.
- General Speech Restoration ☆1,127 Updated 2 months ago
- Model for MDX23 music separation contest ☆720 Updated 2 weeks ago
- Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs ☆532 Updated 3 months ago
- Official implementation of "Separate Anything You Describe" ☆1,725 Updated 4 months ago
- AI powered speech denoising and enhancement ☆1,756 Updated 4 months ago
- Repository for training models for music source separation. ☆710 Updated 2 weeks ago
- State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio. ☆1,394 Updated 9 months ago
- Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis ☆918 Updated 8 months ago
- Versatile AI-driven audio upscaler to enhance the quality of any audio. ☆108 Updated 3 months ago
- Code for the paper Hybrid Spectrogram and Waveform Source Separation ☆1,452 Updated 9 months ago
- Easy to use stem (e.g. instrumental/vocals) separation from CLI or as a python package, using a variety of amazing pre-trained models (pr… ☆719 Updated this week
- Official PyTorch implementation of BigVGAN (ICLR 2023) ☆1,004 Updated 7 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆596 Updated 8 months ago
- The official implementation of HierSpeech++ ☆1,219 Updated last year
- WavJourney: Compositional Audio Creation with LLMs ☆535 Updated last year
- Colab adaptation of MVSep Model for MDX23 music separation contest ☆302 Updated 7 months ago
- Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch ☆648 Updated 6 months ago
- Generative models for conditional audio generation ☆3,028 Updated last month
- Text-to-Audio/Music Generation ☆2,407 Updated 6 months ago
- A family of diffusion models for text-to-audio generation. ☆1,161 Updated 3 months ago
- FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion ☆662 Updated 3 months ago
- VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design ☆557 Updated last year
- The Open Source Code of UniAudio ☆556 Updated 9 months ago
- A GUI for music separation AI demucs ☆757 Updated 2 months ago
- Music repair method to convert lossy MP3 compressed music to lossless music. ☆227 Updated last month
- TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5,… ☆2,119 Updated this week
- An easy to understand TTS / SVS / SVC framework ☆696 Updated last month
- ☆187 Updated 3 months ago
- 🚀 RVC + UVR = A perfect set of tools for voice cloning, easily and free! ☆198 Updated 4 months ago
- Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images. ☆756 Updated 7 months ago