☆12Feb 3, 2026Updated 3 weeks ago
Alternatives and similar repositories for d3rm
Users that are interested in d3rm are comparing it to the libraries listed below
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- ☆15Aug 22, 2025Updated 6 months ago
- ☆20Feb 19, 2026Updated last week
- LightVoc: An Upsampling-Free GAN Vocoder Based on Conformer and Inverse Short-Time Fourier Transform☆18May 17, 2024Updated last year
- PDMX: A Large-Scale Public Domain MusicXML Dataset for Symbolic Music Processing☆104Jun 1, 2025Updated 9 months ago
- Speech Resynthesis and Language Modeling☆27Jun 11, 2025Updated 8 months ago
- ☆22Apr 4, 2023Updated 2 years ago
- Singing voice conversion based on FreeVC☆21Dec 16, 2022Updated 3 years ago
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.☆12Nov 7, 2024Updated last year
- ☆25Jan 24, 2023Updated 3 years ago
- Phoneme and duration labeling based on Whisper small☆11Jul 7, 2024Updated last year
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- A neural speech codec based on discrete WavLM representations☆24Aug 28, 2024Updated last year
- AudioCodec-Hub is a Python library for encoding and decoding audio data, supporting various neural audio codec models☆25Sep 26, 2023Updated 2 years ago
- Unofficial PyTorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 2 years ago
- [ICMR 2025] Official Repository for The Paper, Let Network Decide What to Learn: Symbolic Music Understanding Model Based on Large-scale …☆18Aug 17, 2025Updated 6 months ago
- Official code for SongEcho☆41Feb 21, 2026Updated last week
- Open Source code for our paper, Steering Autoregressive Music Generation with Recursive Feature Machines (Zhao et al., 2025). aka MusicRF…☆35Oct 26, 2025Updated 4 months ago
- DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors☆36Feb 11, 2025Updated last year
- ☆32Nov 25, 2023Updated 2 years ago
- My vocoder experiments☆31Jul 26, 2025Updated 7 months ago
- This is the official repository of ISMIR 2024 paper "Emotion-driven Piano Music Generation via Two-stage Disentanglement and Functional R…☆60Sep 17, 2024Updated last year
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation☆13Feb 18, 2026Updated last week
- ☆13Sep 1, 2023Updated 2 years ago
- ☆12Apr 1, 2024Updated last year
- ☆16Dec 12, 2023Updated 2 years ago
- Code from blog 'Searching by Music: Leveraging Vector Search for Music Information Retrieval'☆16Nov 16, 2023Updated 2 years ago
- [AAAI'24] Official dataset & demo code for MID-FiLD: MIDI Dataset for Fine-Level Dynamics☆20Mar 31, 2024Updated last year
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆22Feb 7, 2026Updated 3 weeks ago
- ☆15Apr 13, 2025Updated 10 months ago
- PyTorch implementation of Miipher-2 [2025] which is a speech restoration model by Google DeepMind☆64Sep 22, 2025Updated 5 months ago
- MusicYOLO framework uses the object detection model, YOLOx, to locate notes in the spectrogram.☆15Jan 29, 2022Updated 4 years ago
- A neural vocoder supporting Ring Attention, Conformer and NSF.☆24Aug 1, 2025Updated 7 months ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Apr 25, 2023Updated 2 years ago
- ☆15Sep 20, 2023Updated 2 years ago
- Official implementation of the paper: "NeoBabel: A Multilingual Open Tower for Visual Generation"☆23Aug 4, 2025Updated 6 months ago
- Estimating musical surprisal/information content in Audio☆23Jan 19, 2026Updated last month
- Source code for EfficientTTS 2☆20Feb 18, 2024Updated 2 years ago
- Real-time Timbre Remapping with Differentiable DSP.☆20Nov 12, 2024Updated last year