☆322Dec 17, 2024Updated last year
Alternatives and similar repositories for ml-tarflow
Users who are interested in ml-tarflow are comparing it to the libraries listed below.
- The codebase of our paper "Improving the Training of Rectified Flows", NeurIPS 2024☆130Oct 18, 2024Updated last year
- [NAACL 2025] WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching☆121Mar 27, 2025Updated 11 months ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆12Mar 11, 2025Updated 11 months ago
- Official Implementation for "Consistency Flow Matching: Defining Straight Flows with Velocity Consistency"☆259Jan 17, 2025Updated last year
- The official Implementation of PeriodWave and PeriodWave-Turbo☆220Apr 14, 2025Updated 10 months ago
- Variable Bitrate Residual Vector Quantization for Audio Coding☆51May 1, 2025Updated 10 months ago
- EDM2 and Autoguidance -- Official PyTorch implementation☆822Dec 9, 2024Updated last year
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆22Feb 7, 2026Updated 3 weeks ago
- [ICCV 2025] SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer☆318Dec 29, 2024Updated last year
- [ICLR 2025] Rectified Diffusion: Straightness Is Not Your Need☆245Mar 11, 2025Updated 11 months ago
- Official implementation of Inductive Moment Matching☆574Jul 11, 2025Updated 7 months ago
- ☆54Jul 16, 2025Updated 7 months ago
- Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"☆1,096Dec 22, 2025Updated 2 months ago
- ☆11Nov 7, 2024Updated last year
- "FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching" FlowAR employs the simplest scale design and is compatible with an…☆170May 1, 2025Updated 10 months ago
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆107Jan 17, 2025Updated last year
- A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes…☆4,167Jan 5, 2026Updated last month
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆110May 20, 2025Updated 9 months ago
- ☆19May 2, 2024Updated last year
- ☆22Jul 30, 2025Updated 7 months ago
- ☆82Jan 22, 2025Updated last year
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆68Nov 1, 2024Updated last year
- ☆52Jun 24, 2025Updated 8 months ago
- Multiband version of HIFIGAN☆19Nov 6, 2020Updated 5 years ago
- Official PyTorch Implementation of "Diffusion Autoencoders are Scalable Image Tokenizers"☆166Jan 31, 2025Updated last year
- ☆740Dec 5, 2024Updated last year
- ☆59Oct 22, 2025Updated 4 months ago
- Voice conversion with just linear regression.☆35Sep 25, 2025Updated 5 months ago
- Official repository of Wavehax vocoder☆66Dec 20, 2025Updated 2 months ago
- [CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models☆1,402Dec 16, 2025Updated 2 months ago
- LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning☆159Jun 13, 2024Updated last year
- PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838☆1,863Feb 20, 2026Updated last week
- ☆68Jul 22, 2025Updated 7 months ago
- Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis (ICCV, 2025)☆52Jan 14, 2026Updated last month
- A pitch detection model trained to be robust against noise and reverberation environments.☆27Jan 21, 2025Updated last year
- TorchCFM: a Conditional Flow Matching library☆2,331Nov 11, 2025Updated 3 months ago
- ☆318Nov 17, 2025Updated 3 months ago
- Code for vec2wav 2.0, a speech token vocoder for VC. Paper: https://arxiv.org/abs/2409.01995☆78Dec 3, 2024Updated last year
- Consistency Models Made Easy☆325Oct 13, 2024Updated last year