zlab-princeton / SoFlowLinks
SoFlow: Solution Flow Models for One-Step Generative Modeling
☆108Updated 3 weeks ago
Alternatives and similar repositories for SoFlow
Users that are interested in SoFlow are comparing it to the libraries listed below
Sorting:
- [Official Implementation] Acoustic Autoregressive Modeling 🔥☆74Updated last year
- unofficial Split Mean Flow Implementation from bytedance☆63Updated 5 months ago
- An official pytorch implementation of AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching"☆51Updated last year
- Official Repository of IJCAI 2024 Paper: "BATON: Aligning Text-to-Audio Model with Human Preference Feedback"☆32Updated 10 months ago
- ☆47Updated 8 months ago
- Implementation of Multi-Source Music Generation with Latent Diffusion.☆27Updated last year
- Official implemention for Diffusion Models Are Innate One-Step Generators☆26Updated 6 months ago
- Official code implementation for the work Preference Alignment with Flow Matching (NeurIPS 2024)☆63Updated last year
- Official Code Implementation for 'A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models'☆20Updated last year
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)☆43Updated last year
- Official code for "Semantic-VAE: Semantic-Alignment Latent Representation for Better Speech Synthesis"☆104Updated 3 weeks ago
- Official codebase for "Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis" (https://arxiv.org/abs/2312.03491).☆128Updated last year
- Official implementation of our paper "Bidirectional Consistency Models"; and reproduced Improved Consistency Models (iCT).☆21Updated 8 months ago
- Towards Fine-grained Audio Captioning with Multimodal Contextual Cues☆86Updated last week
- A unified tokenizer that is capable of both extracting semantic information and enabling high-fidelity audio reconstruction.☆131Updated 3 months ago
- Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers☆117Updated 7 months ago
- ☆25Updated last year
- ☆43Updated 11 months ago
- Pytorch implementation of SoundCTM☆100Updated 9 months ago
- This is the repository for the work "BridgeVoC: Revitalizing Neural Vocoder from a Restoration Perspective".☆61Updated 2 months ago
- ☆41Updated 9 months ago
- Official Implementation (Pytorch) of "Constant Acceleration Flow", NeurIPS 2024☆35Updated 2 months ago
- [INTERSPEECH 2025 Oral]Official code for "Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment"☆64Updated 6 months ago
- The official implementation of V-AURA: Temporally Aligned Audio for Video with Autoregression (ICASSP 2025) (Oral)☆31Updated last year
- ☆49Updated 5 months ago
- small audio language model for reasoning☆84Updated last month
- Variable Bitrate Residual Vector Quantization for Audio Coding☆50Updated 8 months ago
- [ICML 2025 Spotlight] Direct Discriminative Optimization: Supercharging Diffusion/Autoregressive with GAN-type Discrimination☆109Updated 5 months ago
- The official repository for the paper "Optimal Flow Matching: Learning Straight Trajectories in Just One Step" (NeurIPS 2024)☆101Updated last year
- The codebase of our paper "Improving the Training of Rectified Flows", NeurIPS 2024☆127Updated last year