AMAAI-Lab / SonicMasterLinks
SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering
☆32Updated last week
Alternatives and similar repositories for SonicMaster
Users that are interested in SonicMaster are comparing it to the libraries listed below
Sorting:
- Speech Resynthesis and Language Modeling☆26Updated 2 months ago
- ☆13Updated 5 months ago
- Official Repository for "Training-Free Multi-Step Audio Source Separation"☆49Updated 2 months ago
- ☆17Updated last year
- Spherical residual vector quantization (SRVQ)☆30Updated 11 months ago
- PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind☆67Updated last month
- Landing Page for Divide and Remaster v3☆18Updated 3 weeks ago
- Audio-to-Audio Schrodinger Bridges is a diffusion-based audio restoration model for bandwidth extension and inpainting.☆43Updated last week
- A Singing Style Conversion Framework Based On Audio Infilling☆26Updated 3 months ago
- DiFlow-TTS: Discrete Flow Matching with Factorized Speech Tokens for Low-Latency Zero-Shot Text-to-Speech☆37Updated 2 weeks ago
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz☆24Updated last year
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Updated 2 years ago
- Evaluation tool used in the BigVSAN paper☆14Updated last year
- Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation☆16Updated last month
- Bilingual Singing Voice Synthesis☆18Updated last year
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆34Updated last year
- ☆48Updated 4 months ago
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆36Updated 3 months ago
- Streaming Vocos☆29Updated 2 months ago
- Zero-Shot Blind Audio Bandwidth Extension☆24Updated 2 years ago
- Variable Bitrate Residual Vector Quantization for Audio Coding☆49Updated 3 months ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆29Updated last year
- ☆13Updated last year
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆27Updated last year
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆21Updated last month
- Official Implementation for the paper: A Variational Framework for Improving Naturalness in Generative Spoken Language Models☆22Updated 2 months ago
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS☆22Updated 11 months ago
- ☆28Updated 3 weeks ago
- The open source code of ALMTokenizer2: Towards Low bit-rate and Semantic-rich Audio Tokenizer with Flow-based Scalar Diffusion Transforme…☆26Updated 3 months ago
- A neural speech codec based on discrete WavLM representations☆24Updated 11 months ago