A repository for generating and training short audio samples with unconditional waveform diffusion on accessible consumer hardware (<2GB VRAM GPU)
☆183 Jun 6, 2024 Updated last year
Alternatives and similar repositories for tiny-audio-diffusion
Users who are interested in tiny-audio-diffusion are comparing it to the libraries listed below.
- ☆11 Nov 7, 2024 Updated last year
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder ☆12 Mar 11, 2025 Updated 11 months ago
- MAX/MSP objects for audio and rhythmic synthesis using networks of coupled oscillators ☆13 May 5, 2023 Updated 2 years ago
- music generation with masked transformers! ☆350 May 16, 2025 Updated 9 months ago
- Encode and decode audio samples to/from compressed latent representations! ☆248 Sep 19, 2025 Updated 5 months ago
- Audio generation using diffusion models, in PyTorch. ☆2,095 Jun 12, 2023 Updated 2 years ago
- AFTER: Audio Features Transfer and Exploration in Real-time ☆104 Sep 8, 2025 Updated 5 months ago
- Differentiable audio signal processors in PyTorch ☆283 Dec 4, 2023 Updated 2 years ago
- Binaural Spatializer Audio Plugin ☆23 Jun 25, 2024 Updated last year
- ☆25 Jan 24, 2023 Updated 3 years ago
- Unofficial implementation of JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation (https://arxiv.org/abs/2310.1… ☆32 Jan 19, 2024 Updated 2 years ago
- ☆12 Feb 3, 2026 Updated last month
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech … ☆28 Nov 7, 2025 Updated 3 months ago
- A Neural Recorder plug to make the process of cloning external soft/hardware a bit more comfortable ☆31 Nov 25, 2023 Updated 2 years ago
- A neural speech codec based on discrete WavLM representations ☆24 Aug 28, 2024 Updated last year
- Rhythm generator using Variational Autoencoder (VAE) ☆39 May 19, 2022 Updated 3 years ago
- ☆33 Dec 23, 2025 Updated 2 months ago
- Official code for SongEcho ☆41 Feb 21, 2026 Updated last week
- An unofficial PyTorch implementation of Music Source Separation with Band-split RNN for MDX-23 ("Label Noise" Track) ☆192 Jun 19, 2023 Updated 2 years ago
- ☆18 May 4, 2025 Updated 9 months ago
- ☆87 Jan 29, 2023 Updated 3 years ago
- ☆18 Sep 22, 2025 Updated 5 months ago
- Feed-forward compressor experiments source code for "Differentiable All-pole Filters for Time-varying Audio Systems". ☆22 Jun 10, 2024 Updated last year
- Singing voice conversion based on FreeVC ☆21 Dec 16, 2022 Updated 3 years ago
- Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stabili… ☆216 Jul 25, 2024 Updated last year
- Fine-tune your own MusicGen with LoRA ☆158 Apr 26, 2024 Updated last year
- Audiogen Codec ☆144 Jul 9, 2024 Updated last year
- Fine-tune Stable Audio Open with DiT ControlNet. ☆249 May 16, 2025 Updated 9 months ago
- Low-latency timbre transfer models for instrumental interaction. ☆92 Oct 10, 2025 Updated 4 months ago
- Vocal Tract Area Estimation by Gradient Descent ☆38 Jul 16, 2023 Updated 2 years ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation ☆26 Dec 12, 2024 Updated last year
- Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images. ☆786 Sep 25, 2024 Updated last year
- Self-supervised learning for real-time pitch estimation ☆280 Oct 15, 2025 Updated 4 months ago
- Flexible LoRA Implementation to use with stable-audio-tools ☆80 Sep 9, 2024 Updated last year
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch ☆19 Feb 9, 2026 Updated 3 weeks ago
- Conformer block with Rotary Position Embedding, modified from lucidrains' implementation ☆17 Sep 13, 2024 Updated last year
- AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI mo… ☆909 Jul 8, 2025 Updated 7 months ago
- The Song Describer dataset is an evaluation dataset made of ~1.1k captions for 706 permissively licensed music recordings. ☆167 Dec 22, 2023 Updated 2 years ago
- Bandwidth Extension of Historical Recordings using Generative Adversarial Networks ☆35 May 25, 2023 Updated 2 years ago