Official pytorch implementation of "AlphaFlow: Understanding and Improving MeanFlow Models"
☆99Oct 24, 2025Updated 4 months ago
Alternatives and similar repositories for alphaflow
Users that are interested in alphaflow are comparing it to the libraries listed below
Sorting:
- ☆39Oct 29, 2025Updated 4 months ago
- Interface Design for Self-Supervised Speech Models, Accepted to Interspeech2024☆16Nov 19, 2024Updated last year
- ☆42Sep 15, 2025Updated 5 months ago
- ☆14Dec 28, 2022Updated 3 years ago
- ☆34Aug 4, 2025Updated 6 months ago
- UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios☆117Dec 17, 2025Updated 2 months ago
- Maximize the Resolution Potential of Pre-trained Rectified Flow Transformers☆65Oct 16, 2024Updated last year
- FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…☆38Feb 17, 2026Updated 2 weeks ago
- Cut2Next: Generating Next Shot via In-Context Tuning☆31Aug 21, 2025Updated 6 months ago
- Unofficial implementation of E-LatentLPIPS in Diffusion2GAN☆19Sep 5, 2024Updated last year
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆31Aug 30, 2025Updated 6 months ago
- SR-DiT Speedrunning ImageNet Diffusion☆126Dec 31, 2025Updated 2 months ago
- Chinese-native image generation while compatible with SD eco-system, 1st-gen, AAAI2025☆13Jun 25, 2024Updated last year
- Speech Resynthesis and Language Modeling☆27Jun 11, 2025Updated 8 months ago
- ☆43Jan 13, 2025Updated last year
- Official Implementation of NAF: Zero-Shot Feature Upsampling via Neighborhood Attention Filtering☆69Dec 1, 2025Updated 3 months ago
- Generative Modeling with Bayesian Sample Inference☆24May 17, 2025Updated 9 months ago
- Source code for the EMNLP 2025 paper “DM-Codec: Distilling Multimodal Representations for Speech Tokenization”☆56Jun 1, 2025Updated 9 months ago
- Orienting Latent Actions for Video World Modeling☆77Feb 11, 2026Updated 2 weeks ago
- UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model☆22Aug 5, 2024Updated last year
- [ICCV25] TACA: Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers☆41Jul 23, 2025Updated 7 months ago
- Official code of SenSE.☆74Oct 30, 2025Updated 4 months ago
- Official Code Release for Pix2NPHM☆52Dec 22, 2025Updated 2 months ago
- Collection of works for evaluating (and analyzing) large audio-language models (LALMs)☆40Aug 11, 2025Updated 6 months ago
- An End-to-End Pipeline for Enhanced French Text-to-Speech with SSML Prosody Control☆31Jan 13, 2026Updated last month
- Official PyTorch/Diffusers implementation of "RectifiedHR: Enable Efficient High Resolution Image Generation via Energy Rectification"☆30Oct 11, 2025Updated 4 months ago
- [ICLR 2026] 🐻 Uniform Discrete Diffusion with Metric Path for Video Generation☆106Feb 6, 2026Updated 3 weeks ago
- [CVPR 2025] Diff2Flow: Training Flow Matching Models via Diffusion Model Alignment☆104Jun 4, 2025Updated 8 months ago
- Lightweight Speech Representation Learning for One-Shot Voice Conversion☆24Dec 12, 2024Updated last year
- [ICLR 2026] Code for our paper "Next Visual Granularity Generation".☆49Jan 26, 2026Updated last month
- ☆29Jan 15, 2025Updated last year
- ☆59Oct 22, 2025Updated 4 months ago
- Official code for "Semantic-VAE: Semantic-Alignment Latent Representation for Better Speech Synthesis"☆108Dec 20, 2025Updated 2 months ago
- Pytorch implementation for MeanFlow☆322Jul 30, 2025Updated 7 months ago
- RePlan: Reasoning-Guided Region Planning for Complex Instruction-Based Image Editing☆58Dec 26, 2025Updated 2 months ago
- Chirpy3D: Continuous Part Latents for Creative 3D Bird Generation☆28Apr 11, 2025Updated 10 months ago
- [ICLR 2026] SparseD: Sparse Attention for Diffusion Language Models☆59Feb 22, 2026Updated last week
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale☆28Aug 4, 2023Updated 2 years ago
- Official Repository of Personalized Visual Instruct Tuning☆34Mar 6, 2025Updated 11 months ago