cifkao / ss-vq-vae
Self-supervised VQ-VAE for One-Shot Music Style Transfer
☆98Feb 24, 2025Updated 11 months ago
Alternatives and similar repositories for ss-vq-vae
Users who are interested in ss-vq-vae are comparing it to the repositories listed below.
- Code for "Groove2Groove: One-Shot Music Style Transfer with Supervision from Synthetic Data"☆172Sep 16, 2024Updated last year
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)☆47Dec 3, 2024Updated last year
- Repository for the paper "Combining audio control and style transfer using latent diffusion", accepted at ISMIR 2024☆61Feb 19, 2025Updated 11 months ago
- Official Implementation of "Towards Automatic Instrumentation by Learning to Separate Parts in Symbolic Multitrack Music" (ISMIR 2021)☆59Jun 26, 2023Updated 2 years ago
- Codes and MIDI demos of ISMIR 2022 paper: Domain Adversarial Training on Conditional Variational Auto-Encoder for Controllable Music Gene…☆21Mar 28, 2023Updated 2 years ago
- "Music Style Transfer with Time-Varying Inversion of Diffusion Models"☆60Jul 23, 2024Updated last year
- Official PyTorch implementation for "Towards Lightweight Controllable Audio Synthesis with Conditional Implicit Neural Representations".☆21Dec 3, 2021Updated 4 years ago
- ☆15May 8, 2021Updated 4 years ago
- A collection of metrics for evaluating timbre dissimilarity using the TorchMetrics API☆30Dec 30, 2021Updated 4 years ago
- Deep Performer: Score-to-audio music performance synthesis☆44Jun 26, 2023Updated 2 years ago
- Implementation of Differentiable Digital Signal Processing (DDSP) in Pytorch☆503Oct 28, 2023Updated 2 years ago
- Repo for the IDESSAI 2024 course on modeling audio with discrete tokens.☆13Sep 13, 2024Updated last year
- ☆32Nov 25, 2023Updated 2 years ago
- NANSY++: Unified Voice Synthesis with Neural Analysis and Synthesis☆150Feb 11, 2023Updated 3 years ago
- source code of "End-to-end Music Remastering System Using Self-supervised and Adversarial Training"☆47Sep 7, 2023Updated 2 years ago
- Hierarchical fast and high-fidelity audio generation☆75Jul 25, 2024Updated last year
- Source code for "MusCaps: Generating Captions for Music Audio" (IJCNN 2021)☆85Dec 3, 2024Updated last year
- [INTERSPEECH 2025 Oral] Official code for "Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment"☆64Jun 16, 2025Updated 8 months ago
- The implementation of "Symbolic Music Loop Generation with Neural Discrete Representations"☆34Aug 24, 2022Updated 3 years ago
- DeepAFx-ST - Style transfer of audio effects with differentiable signal processing. Please see https://csteinmetz1.github.io/DeepAFx-ST/☆403May 30, 2023Updated 2 years ago
- SelfRemaster: SSL Speech Restoration☆93Jan 5, 2024Updated 2 years ago
- [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing☆26Aug 30, 2024Updated last year
- ☆22Feb 22, 2024Updated last year
- Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.☆140Sep 25, 2024Updated last year
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)☆119Feb 7, 2024Updated 2 years ago
- A Python Implementation of Driedger's "Let It Bee" Technique for Audio Mosaicing☆25Sep 14, 2024Updated last year
- A duration-invariant audio-to-lyrics alignment pipeline with low memory footprint which segments long music recordings via a recursive bi…☆15Oct 13, 2022Updated 3 years ago
- ☆12Feb 9, 2021Updated 5 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated 9 months ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆77Jul 16, 2023Updated 2 years ago
- ICASSP 2023 Accepted☆189May 6, 2024Updated last year
- The implementation of "Systematic Analysis of Music Representations from BERT"☆27May 23, 2023Updated 2 years ago
- A unified model for zero-shot singing voice conversion and synthesis☆22Nov 30, 2022Updated 3 years ago
- The code repository for our paper "Interpreting Song Lyrics with a Music-Informed Pre-trained Language Model".☆24Dec 12, 2022Updated 3 years ago
- PyTorch implementation of MuseMorphose (published at IEEE/ACM TASLP), a Transformer-based model for music style transfer.☆193Dec 19, 2022Updated 3 years ago
- Speaker embedding for VI-SVC and VI-SVS, also for VITS; use this to replace the ID to implement voice cloning.☆30Sep 16, 2022Updated 3 years ago
- Self-supervised neural network for music recommendations.☆18Jul 6, 2023Updated 2 years ago
- Evaluation tool used in the BigVSAN paper☆14Mar 22, 2024Updated last year
- Complete implementation of MusicNet in Pytorch☆12Apr 15, 2020Updated 5 years ago