lihenryhfl / diffusion_optimal_control
Solving Inverse Problems with Diffusion Optimal Control [NeurIPS 2024]
☆10Updated 4 months ago
Alternatives and similar repositories for diffusion_optimal_control:
Users that are interested in diffusion_optimal_control are comparing it to the libraries listed below
- Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025☆28Updated 2 months ago
- Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters☆30Updated last year
- ☆38Updated 2 months ago
- Implementation of Multi-Source Music Generation with Latent Diffusion.☆24Updated 7 months ago
- Official repo for DiscoDiff: Coarse-to-Fine Text-to-Music Latent Diffusion presented at ICASSP 2025☆11Updated 2 weeks ago
- ☆13Updated last year
- ☆12Updated 2 years ago
- Art2Mus is a system that generates music based on digitized artworks and text by using the AudioLDM2 architecture with an added projectio…☆16Updated 4 months ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆25Updated last year
- ☆40Updated 5 months ago
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆30Updated 4 months ago
- This is the accompanying repository to the paper - Automatic Estimation of Singing Voice Musical Dynamics☆13Updated 5 months ago
- Test code disclosure for the research paper "UnDiff: Unsupervised Voice Restoration with Unconditional Diffusion Model", as a supplementa…☆20Updated last year
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)☆38Updated 10 months ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆33Updated last year
- ☆22Updated 6 months ago
- DiffPhase: Generative Diffusion-based STFT Phase Retrieval☆14Updated last year
- ☆10Updated 5 months ago
- Event Relation in Text-to-Audio (TTA) Generation☆17Updated last month
- Feed-forward compressor experiments source code for "Differentiable All-pole Filters for Time-varying Audio Systems".☆19Updated 10 months ago
- An AR+AR TTS attempt.☆15Updated 3 months ago
- Adversarial Training of Denoising Diffusion Model Using Dual Discriminators for High-Fidelity Multi-Speaker TTS☆38Updated last year
- Project for MIDI to Audio Synthesis☆23Updated 2 years ago
- code for "DDD: A Perceptually Superior Low-Response-Time DNN-Based Declipper"☆23Updated last year
- VAE modified from Descript Audio Codec, which replaces the RVQ with VAE☆69Updated last year
- Viterbi decoding in PyTorch☆30Updated 3 weeks ago
- Synthesis of percussion sounds using sinusoidal modelling, DDSP noise synthesis, and a neural source filter approach.☆29Updated 3 months ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆36Updated 2 years ago
- DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors☆19Updated 2 months ago
- The official implementation of DMEL the method presented in the paper "DMEL: The differentiable log-Mel spectrogram as a trainable layer …☆19Updated 4 months ago