thunlp/ACDiT

ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer

☆42

Alternatives and similar repositories for ACDiT

Users interested in ACDiT are comparing it to the repositories listed below.

OpenCausaLab / MORE
☆15 · Jan 9, 2026 · Updated 6 months ago
OpenCausaLab / CELLO
☆22 · Nov 5, 2024 · Updated last year
chenmeiqii / Teach-LLM-LR
☆32 · Aug 30, 2024 · Updated last year
chenmeiqii / ERGO
☆14 · Jun 26, 2023 · Updated 3 years ago
AlanBaade / SyllableLM
Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models
☆63 · Jul 1, 2025 · Updated last year
cnaigithub / SpeechDewarping
Official implementation of "Unsupervised Pre-training for Data-Efficient Text-to-Speech on Low Resource Languages", ICASSP 2023
☆27 · Apr 27, 2023 · Updated 3 years ago
justinlovelace / SESD
☆61 · Oct 28, 2024 · Updated last year
choiHkk / nix-tts
End-To-End SpeechSynthesis system with knowledge distillation
☆18 · Jul 16, 2022 · Updated 4 years ago
CyberZHG / torch-position-embedding
Position embedding in PyTorch
☆16 · Jul 11, 2020 · Updated 6 years ago
alibaba-damo-academy / K-Forcing
Official implementation for "K-Forcing: Joint Next-K-Token Decoding via Push-Forward Language Modeling"
☆16 · Jun 14, 2026 · Updated last month
furkanyesiler / move
PyTorch code for training and evaluating MOVE, musically-motivated version embeddings
☆50 · Jul 6, 2023 · Updated 3 years ago
cloneofsimo / repa-rf
☆32 · Nov 4, 2024 · Updated last year
MarkXCloud / CSpD
The official repo of continuous speculative decoding
☆36 · Mar 28, 2025 · Updated last year
showlab / FQGAN
FQGAN: Factorized Visual Tokenization and Generation
☆59 · Mar 29, 2025 · Updated last year
jiahaolu97 / anything-unsegmentable
(CVPR 2024) "Unsegment Anything by Simulating Deformation"
☆29 · May 27, 2024 · Updated 2 years ago
xinshengwang / robpitch
A pitch detection model trained to be robust against noise and reverberation environments.
☆27 · Jan 21, 2025 · Updated last year
wenqsun / Real-Play
Code implementation for: From Virtual Games to Real-World Play
☆48 · Jun 23, 2025 · Updated last year
Picsart-AI-Research / FlowDIS
[CVPR 2026] FlowDIS: Language-Guided Dichotomous Image Segmentation with Flow Matching
☆33 · May 15, 2026 · Updated 2 months ago
lxa9867 / ImageFolder
High-performance Image Tokenizers for VAR and AR
☆307 · Apr 25, 2025 · Updated last year
Clybius / Personalized-Optimizers
A collection of niche / personally useful PyTorch optimizers with modified code.
☆28 · Apr 14, 2026 · Updated 3 months ago
TIGER-AI-Lab / Context-Forcing
Context Forcing: Consistent Autoregressive Video Generation with Long Context [ICML26]
☆98 · Jun 29, 2026 · Updated 3 weeks ago
andres-fr / iamusica_training
ONSETS&VELOCITIES real-time piano detection - PyTorch training [EUSIPCO2023]
☆32 · Aug 31, 2023 · Updated 2 years ago
wskbest / MFC-Bench
☆12 · Oct 17, 2024 · Updated last year
mattfoster / sofle-lefty-keymap
Keymap for my SofleKeyboard
☆10 · Apr 8, 2021 · Updated 5 years ago
CODEJIN / XiaoiceSing2
☆19 · Feb 2, 2023 · Updated 3 years ago
lucidrains / autoregressive-diffusion-pytorch
Implementation of Autoregressive Diffusion in Pytorch
☆438 · Dec 4, 2025 · Updated 7 months ago
AMEERAZAM08 / sam-sdxl-inpainting
☆15 · May 13, 2024 · Updated 2 years ago
xjuspeech / YOLOPitch
☆10 · Jun 11, 2024 · Updated 2 years ago
mzsun01 / MM-LDM
☆11 · Apr 12, 2024 · Updated 2 years ago
youngsheen / GPST
[ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer
☆70 · Nov 1, 2024 · Updated last year
zxxwxyyy / sonique
Video Background Music Generation Using Unpaired Audio-Visual Data
☆32 · Oct 8, 2024 · Updated last year
kahache / video_packaging_platform
Video packaging platform - this will build a Docker with a web API that will let you upload, encrypt and serve videos as MPEG DASH files
☆10 · Jul 1, 2026 · Updated 2 weeks ago
zouzx / TriplaneGaussian
☆10 · Jan 18, 2024 · Updated 2 years ago
bethgelab / supersanity
A critical analysis of the Cambrian-S model and VSI-Super benchmarks
☆16 · Nov 20, 2025 · Updated 8 months ago
bentherien / mu_learned_optimization
[Poster; ICLR 2026] [Oral; Neurips OPT2024] μLO: Compute-Efficient Meta-Generalization of Learned Optimizers
☆16 · Apr 15, 2026 · Updated 3 months ago
yue-zhongqi / tif
CVPR 2024 Official Repository
☆13 · Mar 27, 2024 · Updated 2 years ago
scutcsq / Neural-Transducers-for-Two-Stage-Text-to-Speech-via-Semantic-Token-Prediction
Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…
☆60 · Apr 4, 2024 · Updated 2 years ago
nii-yamagishilab / midi-to-audio
Project for MIDI to Audio Synthesis
☆27 · Mar 13, 2023 · Updated 3 years ago
yfyeung / DS-WED
[ICASSP 2026] Official code for "Measuring Prosody Diversity in Zero-Shot TTS: A New Metric, Benchmark, and Exploration"
☆17 · Apr 16, 2026 · Updated 3 months ago