Repo for the IDESSAI 2024 course on modeling audio with discrete tokens.
☆13Sep 13, 2024Updated last year
Alternatives and similar repositories for audio_mod_idessai
Users that are interested in audio_mod_idessai are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A spoken version of the textual story cloze benchmark☆21Aug 6, 2023Updated 2 years ago
- Demucs Lightning: A PyTorch lightning version of Demucs with Hydra and Tensorboard features☆84May 3, 2023Updated 2 years ago
- ☆33Dec 23, 2025Updated 3 months ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆15Jun 28, 2024Updated last year
- Official Repository for "Music Source Restoration"☆32Jun 1, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated last year
- Official repository for Aria-MIDI: a MIDI dataset of 1,186,253 transcribed solo-piano recordings.☆79Jun 19, 2025Updated 9 months ago
- Supervoice diffusion enhance☆28Jul 15, 2024Updated last year
- The implementation of "Systematic Analysis of Music Representations from BERT"☆27May 23, 2023Updated 2 years ago
- My Master's Project, a function/system/program that gives the structure of a given song (The pattern of repetition of verse, chorus, etc.…☆14Jun 21, 2019Updated 6 years ago
- Deep Performer: Score-to-audio music performance synthesis☆45Jun 26, 2023Updated 2 years ago
- The official code repo for "Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled Data", in AAAI 2022☆210Jul 14, 2022Updated 3 years ago
- Unconditional music synthesis using a diffusion model in the STFT domain☆12May 31, 2022Updated 3 years ago
- Frontend filterbank learning module with HVQT initialization capabilities.☆21Feb 27, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆34Jul 31, 2024Updated last year
- Profile your CoreML models directly from Python 🐍☆30Sep 8, 2025Updated 6 months ago
- ☆61Nov 4, 2023Updated 2 years ago
- LibriVoc is a new open-source, large-scale dataset for vocoder artifact detection. LibriVoc is derived from the LibriTTS speech corpus, w…☆16Nov 6, 2025Updated 4 months ago
- https://arxiv.org/abs/2111.00195☆16Mar 30, 2022Updated 4 years ago
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆49Jan 19, 2026Updated 2 months ago
- A pytorch implementation of LCGNN☆11Jun 1, 2020Updated 5 years ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆101Jul 24, 2024Updated last year
- [DEPRECIATED] [PyTorch 2.0] [638M] [85.33% acc] Full-attention multi-instrumental music transformer for supervised music generation, opti…☆32Nov 23, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 基于FreeVC的歌声转换☆21Dec 16, 2022Updated 3 years ago
- Codebase and project page for EDMSound☆35Nov 20, 2023Updated 2 years ago
- Empirical study that reports the correlation between complexity measures and generalization performance by deep learning models on medica…☆13Nov 29, 2021Updated 4 years ago
- Multimodal Music Generation with Explicit Bridges and Retrieval Augmentation: A framework for generating multimodal music by bridging dif…☆28Jan 21, 2025Updated last year
- ☆20Oct 5, 2025Updated 5 months ago
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Oct 25, 2021Updated 4 years ago
- [ICML 2022] Official implementation of "Score-Guided Intermediate Layer Optimization: Fast Langevin Mixing for Inverse Problems".☆12Jul 19, 2022Updated 3 years ago
- ☆14Mar 1, 2021Updated 5 years ago
- Code for the paper "Timbre-Trap: A Low-Resource Framework for Instrument-Agnostic Music Transcription"☆40May 5, 2024Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Minimal implementation of a Byte Pair Encoding (BPE) tokenizer in Zig☆14Apr 7, 2025Updated 11 months ago
- A toolset for easy formant extraction and visualization from wav files and TTS models☆33Sep 2, 2022Updated 3 years ago
- Code for GBK-GNN (paper accepted by WWW2022)☆17Jun 16, 2022Updated 3 years ago
- Music Demixing Challenge Submission Repo☆15Sep 8, 2023Updated 2 years ago
- ☆20Jul 17, 2023Updated 2 years ago
- A duration-invariant audio-to-lyrics alignment pipeline with low memory footprint which segments long music recordings via a recursive bi…☆15Oct 13, 2022Updated 3 years ago
- Pragmatic approach to parsing import profiles for CI's☆12Jul 1, 2024Updated last year