Annotated Flow Matching paper
☆228Sep 14, 2024Updated last year
Alternatives and similar repositories for flow-matching
Users that are interested in flow-matching are comparing it to the libraries listed below
Sorting:
- Educational implementation of the Discrete Flow Matching paper☆132Aug 26, 2024Updated last year
- TorchCFM: a Conditional Flow Matching library☆2,331Nov 11, 2025Updated 3 months ago
- A summary of related works about flow matching, stochastic interpolants☆641Feb 4, 2026Updated 3 weeks ago
- A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes…☆4,167Jan 5, 2026Updated last month
- Official PyTorch implementation of the paper: Flow Matching in Latent Space☆336Jan 20, 2025Updated last year
- Official Implementation of Rectified Flow (ICLR2023 Spotlight)☆1,552Jul 20, 2024Updated last year
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)☆43Jun 13, 2024Updated last year
- ☆23Feb 8, 2025Updated last year
- Normalizing flows in PyTorch☆444Dec 1, 2025Updated 3 months ago
- Simple tool for speech dataset augmentation for modeling various prosodies.☆14Jan 14, 2021Updated 5 years ago
- Aligned Diffusion Schroedinger Bridges (UAI 2023)☆13Sep 18, 2025Updated 5 months ago
- DiffPhase: Generative Diffusion-based STFT Phase Retrieval☆16Sep 21, 2023Updated 2 years ago
- A lightweight audio codec based on a single quantizer☆69Aug 15, 2025Updated 6 months ago
- Official Implementation for "Consistency Flow Matching: Defining Straight Flows with Velocity Consistency"☆259Jan 17, 2025Updated last year
- MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows☆124Sep 2, 2025Updated 6 months ago
- ☆14May 21, 2024Updated last year
- million song dataset split for extended clean tag & artist-level stratified☆52Aug 12, 2023Updated 2 years ago
- [NeurIPS 2024] Simple and Effective Masked Diffusion Language Model☆639Sep 29, 2025Updated 5 months ago
- ☆70Sep 3, 2024Updated last year
- Official Implementation of EnCLAP (ICASSP 2024)☆94Jun 2, 2024Updated last year
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated last year
- Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models☆22Jul 10, 2024Updated last year
- Implementation of Differentiable Molecular Simulations with torchMD.☆16Oct 9, 2023Updated 2 years ago
- Official implementation of DNSMOS Pro (accepted at INTERSPEECH 2024).☆78Jun 8, 2025Updated 8 months ago
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions☆268Jan 13, 2025Updated last year
- ☆31Nov 24, 2023Updated 2 years ago
- [INTERSPEECH 2025 Oral]Official code for "Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment"☆64Jun 16, 2025Updated 8 months ago
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆48Jan 19, 2026Updated last month
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆348Jul 21, 2025Updated 7 months ago
- ICML 2023: Reduce, Reuse, Recycle: Composing Energy-Based Diffusion Models with MCMC☆150Oct 18, 2024Updated last year
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆107Jan 17, 2025Updated last year
- CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech☆366Aug 14, 2025Updated 6 months ago
- Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers☆119May 19, 2025Updated 9 months ago
- TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages☆18May 23, 2024Updated last year
- ☆80Aug 11, 2025Updated 6 months ago
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- Sequence alignement methods with helpers for PyTorch.☆24Nov 30, 2022Updated 3 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 2 years ago
- Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"☆1,096Dec 22, 2025Updated 2 months ago