A toolbox that provides hackable building blocks for generic 1D/2D/3D UNets, in PyTorch.
☆89Jun 12, 2023Updated 2 years ago
Alternatives and similar repositories for a-unet
Users that are interested in a-unet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A collection of audio autoencoders, in PyTorch.☆44Mar 7, 2023Updated 3 years ago
- A collection of useful audio datasets and transforms for PyTorch.☆144Feb 11, 2023Updated 3 years ago
- Audio generation using diffusion models, in PyTorch.☆2,096Jun 12, 2023Updated 2 years ago
- Unofficial implementation for the paper 'Improving Diffusion Models for Inverse Problems using Manifold Constraints'[https://arxiv.org/ab…☆12Aug 21, 2022Updated 3 years ago
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆72Dec 9, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris S…☆14Feb 15, 2023Updated 3 years ago
- This is the implementation of the manuscript "Learning General All-Neural Speech Enhancement based on Taylor's Approximation Theory", whi…☆14Nov 25, 2022Updated 3 years ago
- A collection of pre-trained audio models, in PyTorch.☆116Jan 27, 2023Updated 3 years ago
- Open implementation of UNIVERSE and UNIVERSE++ diffusion-based speech enhancement models.☆112Aug 29, 2024Updated last year
- State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.☆1,745Jan 26, 2026Updated 2 months ago
- million song dataset split for extended clean tag & artist-level stratified☆52Aug 12, 2023Updated 2 years ago
- ☆24Feb 28, 2023Updated 3 years ago
- Trainer for audio-diffusion-pytorch☆129Jan 13, 2023Updated 3 years ago
- Collection of audio-focused loss functions in PyTorch☆856Jul 30, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)☆43Jun 13, 2024Updated last year
- Audiogen Codec☆144Jul 9, 2024Updated last year
- Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022)☆57Mar 12, 2024Updated 2 years ago
- MusicYOLO framework uses the object detection model, YOLOx, to locate notes in the spectrogram.☆16Jan 29, 2022Updated 4 years ago
- A lightweight library for Frechet Audio Distance calculation.☆312Feb 11, 2026Updated last month
- Notebook examples using mirdata☆12Dec 5, 2023Updated 2 years ago
- A repository for generating and training short audio samples with unconditional waveform diffusion on accessible consumer hardware (<2GB …☆183Jun 6, 2024Updated last year
- music generation with masked transformers!☆351May 16, 2025Updated 10 months ago
- video cut powered by AI☆24Nov 15, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Official implementation for MGN☆20Dec 22, 2022Updated 3 years ago
- ☆48Jul 20, 2024Updated last year
- A collection of common functionality to simplify the design, training and evaluation of machine learning models based on pytorch with an …☆72Feb 26, 2026Updated last month
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.☆196Apr 27, 2023Updated 2 years ago
- ☆23Jun 30, 2023Updated 2 years ago
- Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generati…☆193Mar 25, 2024Updated 2 years ago
- GUI-application for fast and easy HRTF measurements with head tracking.☆14Nov 14, 2022Updated 3 years ago
- Contrastive Language-Audio Pretraining☆2,066May 15, 2025Updated 10 months ago
- ERB representation of an audio file implemented in Python☆27Oct 21, 2018Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Unified automatic quality assessment for speech, music, and sound.☆694Jun 5, 2025Updated 9 months ago
- Python bindings of speexdsp noise suppression library☆48Nov 18, 2022Updated 3 years ago
- Codebase and project page for EDMSound☆35Nov 20, 2023Updated 2 years ago
- This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples a…☆650Jun 9, 2024Updated last year
- An Open-source Streaming High-fidelity Neural Audio Codec☆500Mar 4, 2025Updated last year
- Tensorflow implementation of DiffWave: A Versatile Diffusion Model for Audio Synthesis☆42Oct 3, 2020Updated 5 years ago
- The source code for the paper CrossSinger (asru2023)☆18Oct 12, 2023Updated 2 years ago