AudiogenAI / agc
Audiogen Codec
☆131Updated 7 months ago
Alternatives and similar repositories for agc:
Users that are interested in agc are comparing it to the libraries listed below
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆93Updated 6 months ago
- Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"☆132Updated 5 months ago
- A DDSP-based neural voice synthesiser.☆113Updated 3 months ago
- ☆43Updated 8 months ago
- Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.☆188Updated 5 months ago
- Robust Singing Voice Transcription and MIDI Extraction☆69Updated 3 months ago
- Code for the paper "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆54Updated last month
- ☆119Updated last month
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆76Updated last month
- logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source…☆35Updated 6 months ago
- AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model☆154Updated last month
- SSR-Speech: Towards Stable, Safe and Robust Zero-shot Speech Editing and Synthesis☆121Updated last month
- VoiceLDM: Text-to-Speech with Environmental Context☆169Updated 6 months ago
- The open source code for SimpleSpeech series☆127Updated 4 months ago
- The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tu…☆79Updated 5 months ago
- Official Implementation of EnCLAP (ICASSP 2024)☆90Updated 8 months ago
- Evaluation Protocol for Large-Scale Zero-Shot TTS Literature☆73Updated 4 months ago
- Pitch Estimating Neural Networks (PENN)☆242Updated 6 months ago
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆51Updated last year
- The official Implementation of PeriodWave and PeriodWave-Turbo☆162Updated last week
- Unofficial implementation JEN-1: Text-Guided Universal Music Generation with Omnidirectional Diffusion Models(https://arxiv.org/abs/2308.…☆52Updated last year
- Object-oriented handling of audio data, with GPU-powered augmentations, and more.☆257Updated last month
- A simple library for Fréchet Audio Distance (FAD) calculation☆179Updated last week
- Pytorch implementation of BigVSAN☆200Updated 10 months ago
- Code for the paper: GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities☆109Updated 2 months ago
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis☆146Updated 2 years ago
- ☆51Updated last year
- Unofficial download repository for MusicCaps☆45Updated last year
- Implementation of SoundStorm built upon SpeechTokenizer.☆108Updated last year
- GOMIN; Gaudio Open Mel-spectrogram Inversion Network☆110Updated last year