YuchuanTian/DiC

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/YuchuanTian/DiC)

YuchuanTian / DiC

[CVPR 2025] "DiC: Rethinking Conv3x3 Designs in Diffusion Models", a performant & speedy Conv3x3 diffusion model.

☆249

Alternatives and similar repositories for DiC

Users that are interested in DiC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

YuchuanTian / U-REPA
View on GitHub
[NeurIPS 2025] U-REPA: Aligning Diffusion U-Nets to ViTs
☆38Dec 15, 2025Updated 7 months ago
shallowdream204 / DiCo
View on GitHub
[NeurIPS 2025 Spotlight] DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling
☆72Feb 12, 2026Updated 5 months ago
YuchuanTian / U-DiT
View on GitHub
[NeurIPS 2024] The official code of "U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers"
☆240Jun 21, 2026Updated last month
rishikksh20 / MiniMax-TTS-pytorch
View on GitHub
Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report
☆47Sep 2, 2025Updated 10 months ago
star-kwon / FCDM
View on GitHub
[CVPR 2026] Official repository for "Reviving ConvNeXt for Efficient Convolutional Diffusion Models"
☆71Mar 26, 2026Updated 3 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
hustvl / LightningDiT
View on GitHub
[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
☆1,508Dec 16, 2025Updated 7 months ago
bytetriper / RAE
View on GitHub
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
☆1,977Feb 25, 2026Updated 4 months ago
seongho608 / RingFormer
View on GitHub
☆52Jun 24, 2025Updated last year
yoongi43 / VRVQ
View on GitHub
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Apr 10, 2025Updated last year
hustvl / Turbo-VAED
View on GitHub
[AAAI 2026] Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices
☆131Jul 10, 2026Updated last week
bfs18 / armel
View on GitHub
poorman's ar-dit tts
☆45Dec 31, 2025Updated 6 months ago
NVlabs / DDO
View on GitHub
[ICML 2025 Spotlight] Direct Discriminative Optimization: Reinforcing Diffusion/Autoregressive with GAN Discrimination
☆124Jan 27, 2026Updated 5 months ago
MCG-NJU / PixNerd
View on GitHub
[ICLR 2026] PixNerd: Pixel Neural Field Diffusion
☆182Dec 10, 2025Updated 7 months ago
lonzi / mrflow_dpo
View on GitHub
☆22Jan 3, 2026Updated 6 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
aqtq314 / VogenSVS
View on GitHub
☆15Apr 16, 2026Updated 3 months ago
DIYer22 / sddn
View on GitHub
Core Library of Discrete Distribution Networks (ICLR 2025)
☆15Oct 12, 2025Updated 9 months ago
Tencent / SongBench
View on GitHub
☆50Apr 30, 2026Updated 2 months ago
LTH14 / JiT
View on GitHub
PyTorch implementation of JiT https://arxiv.org/abs/2511.13720
☆2,462Dec 8, 2025Updated 7 months ago
dinhoitt / BemaGANv2
View on GitHub
☆21Mar 3, 2026Updated 4 months ago
meaningTeam / tidy-tunes
View on GitHub
Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …
☆23May 19, 2026Updated 2 months ago
ShoufaChen / PixelFlow
View on GitHub
Pixel-Space Generative Models
☆316May 11, 2025Updated last year
Audio-Foundation-Models / ConversationTTS
View on GitHub
☆101Jan 19, 2026Updated 6 months ago
samsad35 / code-ancogen
View on GitHub
[ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder
☆14Mar 11, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ictnlp / SLED-TTS
View on GitHub
Streamable Text-to-Speech model using a language modeling approach, without vector quantization
☆108May 20, 2025Updated last year
primepake / dac_vae
View on GitHub
Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder
☆38Aug 30, 2025Updated 10 months ago
HaoyiZhu / MeanFlow-PyTorch
View on GitHub
PyTorch re-implementation for MeanFlow
☆126Jul 17, 2025Updated last year
cszn / ConverseNet
View on GitHub
Reverse Convolution and Its Applications to Image Restoration (ICCV, 2025)
☆134Aug 15, 2025Updated 11 months ago
zelaki / ReDi
View on GitHub
[NeurIPS'25 Spotlight] Boosting Generative Image Modeling via Joint Image-Feature Synthesis
☆120Nov 3, 2025Updated 8 months ago
PKU-YuanGroup / WF-VAE
View on GitHub
[CVPR 2025🔥] Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model
☆205May 11, 2025Updated last year
chomeyama / wavehax
View on GitHub
Official repository of Wavehax vocoder
☆75Dec 20, 2025Updated 7 months ago
Zehong-Ma / DeCo
View on GitHub
[CVPR2026 Highlight] Official repository for “DeCo: Frequency-Decoupled Pixel Diffusion for End-to-End Image Generation”
☆233Feb 27, 2026Updated 4 months ago
KdaiP / DC-Speech-VAE
View on GitHub
5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs
☆57Nov 19, 2025Updated 8 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Martinser / REG
View on GitHub
[NeurIPS 2025 Oral] Representation Entanglement for Generation: Training Diffusion Transformers Is Much Easier Than You Think
☆274Oct 4, 2025Updated 9 months ago
Visual-AI / JoVA
View on GitHub
JoVA: Unified Multimodal Learning for Joint Video-Audio Generation
☆33Dec 22, 2025Updated 6 months ago
duzw9311 / LDA-AQU
View on GitHub
[MM2024] LDA-AQU: Adaptive Query-guided Upsampling via Local Deformable Attention
☆13Dec 24, 2024Updated last year
redredsheep / PrismLayers
View on GitHub
PrismLayers: Open Data for High-Quality Multi-Layer Transparent Image Generative Models
☆37Jan 14, 2026Updated 6 months ago
ali-vilab / FACM
View on GitHub
FACM: Flow-Anchored Consistency Models
☆147Aug 6, 2025Updated 11 months ago
haidog-yaqub / MeanFlow
View on GitHub
PyTorch implementation of MeanFlow & iMF (one-step generative modeling).
☆1,177Jul 1, 2026Updated 3 weeks ago
the-bird-F / Expressive-Vectors
View on GitHub
[ICASSP 2026] Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis
☆40Dec 24, 2025Updated 6 months ago