Jhomanik/Optimal-Flow-Matching

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Jhomanik/Optimal-Flow-Matching)

Jhomanik / Optimal-Flow-Matching

The official repository for the paper "Optimal Flow Matching: Learning Straight Trajectories in Just One Step" (NeurIPS 2024)

☆110

Alternatives and similar repositories for Optimal-Flow-Matching

Users that are interested in Optimal-Flow-Matching are comparing it to the libraries listed below

Sorting:

Audio-Foundation-Models / ConversationTTS
View on GitHub
☆99Jan 19, 2026Updated last month
reppy4620 / convnext_tts
View on GitHub
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18Oct 20, 2024Updated last year
innnky / descript-audio-vae
View on GitHub
VAE modified from Descript Audio Codec, which replaces the RVQ with VAE
☆88Apr 2, 2024Updated last year
huutuongtu / Lightvoc
View on GitHub
LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM
☆18May 17, 2024Updated last year
archinetai / aligner-pytorch
View on GitHub
Sequence alignement methods with helpers for PyTorch.
☆24Nov 30, 2022Updated 3 years ago
luotianze666 / WaveFM
View on GitHub
[NAACL 2025] WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching
☆121Mar 27, 2025Updated 11 months ago
milenagazdieva / U-NOTBarycenters
View on GitHub
PyTorch implementation of "Robust Barycenter Estimation using Semi-unbalanced Neural Optimal Transport" (ICLR 2025)
☆17Jul 7, 2025Updated 7 months ago
Mddct / transformer-vocos
View on GitHub
☆36Sep 6, 2025Updated 5 months ago
zhai-lw / SQCodec
View on GitHub
A lightweight audio codec based on a single quantizer
☆69Aug 15, 2025Updated 6 months ago
yuyang-shi / dsbm-pytorch
View on GitHub
PyTorch Implementation of Diffusion Schrodinger Bridge Matching
☆202Feb 19, 2025Updated last year
vtuber-plan / FlowVAE
View on GitHub
☆16Dec 12, 2023Updated 2 years ago
jishengpeng / Languagecodec
View on GitHub
[ACL 2025 Oral] Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models
☆210Jun 25, 2025Updated 8 months ago
kkapusniak / metric-flow-matching
View on GitHub
Official implementation of Metric Flow Matching (NeurIPS 2024)
☆55Aug 14, 2025Updated 6 months ago
fakerybakery / utmos
View on GitHub
A toolkit to calculate speech audio quality. Not affiliated with the original authors
☆69Aug 13, 2024Updated last year
lucadellalib / discrete-wavlm-codec
View on GitHub
A neural speech codec based on discrete WavLM representations
☆24Aug 28, 2024Updated last year
winddori2002 / DEX-TTS
View on GitHub
DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability
☆107Jan 17, 2025Updated last year
yukara-ikemiya / wavefit-pytorch
View on GitHub
PyTorch implementation of WaveFit [2022, Google] which is one of SOTA lightweight/fast speech vocoders.
☆63Sep 8, 2025Updated 5 months ago
YangLing0818 / consistency_flow_matching
View on GitHub
Official Implementation for "Consistency Flow Matching: Defining Straight Flows with Velocity Consistency"
☆259Jan 17, 2025Updated last year
Aria-K-Alethia / BigCodec
View on GitHub
Official implementation of the paper "BigCodec: Pushing the Limits of Low-Bitrate Neural Speech Codec"
☆213Sep 19, 2024Updated last year
mcf330 / efts2code
View on GitHub
source code of EfficientTTS 2
☆20Feb 18, 2024Updated 2 years ago
BakerBunker / FreeV
View on GitHub
[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter
☆93Jul 4, 2024Updated last year
seahore / PPG-GradVC
View on GitHub
A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis
☆44Jul 24, 2023Updated 2 years ago
Mddct / cosyvoice2-flow-optimized
View on GitHub
faster inference
☆28Jan 20, 2025Updated last year
yzGuu830 / efficient-speech-codec
View on GitHub
[EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers
☆125Mar 20, 2025Updated 11 months ago
AI-S2-Lab / FluentEditor
View on GitHub
[InterSpeech'2024] FluentEditor:Text-based Speech Editing by Considering Acoustic and Prosody Consistency
☆59Oct 23, 2024Updated last year
philgzl / brever
View on GitHub
Speech enhancement in noisy and reverberant environments using deep neural networks
☆22Oct 10, 2025Updated 4 months ago
revsic / torch-diffusion-wavegan
View on GitHub
Parallel waveform generation with DiffusionGAN
☆17Mar 26, 2022Updated 3 years ago
CODEJIN / XiaoiceSing2
View on GitHub
☆19Feb 2, 2023Updated 3 years ago
Daniil-Selikhanovych / ASBM
View on GitHub
PyTorch implementation of "Adversarial Schrödinger Bridge Matching" (NeurIPS 2024)
☆28Jun 2, 2025Updated 9 months ago
yangdongchao / SimpleSpeech
View on GitHub
The open source code for SimpleSpeech series
☆145Oct 8, 2024Updated last year
b-sigpro / sed-hsmm
View on GitHub
Onset-and-Offset-Aware Sound Event Detection
☆21Feb 10, 2025Updated last year
shengcanxu / canoSpeech
View on GitHub
text to speech
☆10Mar 19, 2024Updated last year
Many0therFunctions / MaskGCT-Text-To-Semantic-Finetune
View on GitHub
This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …
☆13Dec 4, 2024Updated last year
yoongi43 / VRVQ
View on GitHub
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Apr 10, 2025Updated 10 months ago
zhenye234 / FlashSpeech
View on GitHub
ACM MM 2024 FlashSpeech: Efficient Zero-Shot Speech Synthesis
☆154Sep 20, 2024Updated last year
milenagazdieva / LightUnbalancedOptimalTransport
View on GitHub
PyTorch implementation of "Light Unbalanced Optimal Transport" (NeurIPS 2024)
☆22Dec 23, 2024Updated last year
adelacvg / DPTTS
View on GitHub
An AR+AR TTS attempt.
☆18Jan 13, 2025Updated last year
seastar105 / pflow-encodec
View on GitHub
Implementation of TTS model based on NVIDIA P-Flow TTS Paper
☆77May 12, 2024Updated last year
yangdongchao / ALMTokenizer2
View on GitHub
The open source code of ALMTokenizer2: Towards Low bit-rate and Semantic-rich Audio Tokenizer with Flow-based Scalar Diffusion Transforme…
☆45Sep 5, 2025Updated 5 months ago