☆18Sep 22, 2025Updated 5 months ago
Alternatives and similar repositories for DOSE
Users who are interested in DOSE are comparing it to the libraries listed below
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model"☆15Jun 28, 2024Updated last year
- Latent Space Sound Design Tool based on the VAE of stable-audio-open☆15Aug 23, 2024Updated last year
- Code for ChordSync, a conformer-based audio-to-chord synchroniser☆13Oct 17, 2025Updated 4 months ago
- This is a subset of the DALI set consisting of 240 polyphonic recordings that is used to benchmark lyrics transcription evaluation.☆12Nov 30, 2021Updated 4 years ago
- A piano music dataset with Audio, Symbolic and Text labels☆34Mar 6, 2025Updated 11 months ago
- ☆15Feb 6, 2026Updated 3 weeks ago
- A web app for annotating Freesound loops, and the tools to analyse the dataset created.☆20Jul 6, 2023Updated 2 years ago
- ☆17Jan 20, 2025Updated last year
- ☆18Oct 20, 2023Updated 2 years ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆19Feb 9, 2026Updated 3 weeks ago
- Code for paper "Network Bending of Diffusion Models for Audio-Visual Generation" at DAFx 2024☆16Aug 26, 2025Updated 6 months ago
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆23Oct 30, 2024Updated last year
- ☆18Nov 8, 2024Updated last year
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…☆38Jan 6, 2024Updated 2 years ago
- ☆29Mar 19, 2025Updated 11 months ago
- Official Code of "A Semi-Supervised Deep Learning Approach to Dataset Collection for Query-by-Humming Task" (ISMIR 2023)☆18Nov 7, 2023Updated 2 years ago
- Streaming Audio Models Examples in JS☆19Mar 29, 2024Updated last year
- ☆18May 4, 2025Updated 9 months ago
- Survey on speech generation work.☆21Nov 26, 2023Updated 2 years ago
- Audio Prompt Adapter: Unleashing music editing abilities for text-to-music with lightweight finetuning [ISMIR 2024]☆58Nov 10, 2025Updated 3 months ago
- FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation☆29Dec 19, 2024Updated last year
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆21May 26, 2025Updated 9 months ago
- Time-varying subtractive synth experiments source code for "Differentiable All-pole Filters for Time-varying Audio Systems".☆30Jun 19, 2024Updated last year
- Polyphonic generalisation of DDSP☆22Apr 30, 2024Updated last year
- Searching for Music Mixing Graphs: A Pruning Approach☆25Feb 13, 2025Updated last year
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆48Jan 19, 2026Updated last month
- Repository for the ISMIR 2024 Paper "STONE: Self-supervised Tonality Estimator".☆28Oct 24, 2025Updated 4 months ago
- Composer's Assistant for REAPER☆64Jun 16, 2025Updated 8 months ago
- ☆70Jan 25, 2025Updated last year
- AudioStretchy is a Python wrapper around the `audio-stretch` C library, which performs fast, high-quality time-stretching of WAV/MP3 file…☆61Sep 24, 2025Updated 5 months ago
- fast, precise tempo prediction in python☆65Updated this week
- "Automatic DJ Transitions with Differentiable Audio Effects and Generative Adversarial Networks", ICASSP 2022☆106Nov 7, 2025Updated 3 months ago
- Source code of APNet2, a vocoder☆58Nov 23, 2023Updated 2 years ago
- ☆33Dec 23, 2025Updated 2 months ago
- Companion repository which facilitates the creation of Gradio endpoints which are accessible from within Digital Audio Workstations (DAWs…☆28Sep 13, 2025Updated 5 months ago
- Toward Deep Drum Source Separation☆86Sep 10, 2024Updated last year
- a new family of super small music generation models focusing on experimental music and latent space exploration capabilities☆36May 9, 2024Updated last year
- XMIDI Dataset: A large-scale symbolic music dataset with emotion and genre labels.☆33Jan 16, 2025Updated last year
- Understanding and Tackling Hallucinations in Large Audio-Language Models | ICASSP 2025, Interspeech 2024☆32Mar 14, 2025Updated 11 months ago