ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer
☆42Jan 29, 2026Updated 4 months ago
Alternatives and similar repositories for ACDiT
Users that are interested in ACDiT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Jan 9, 2026Updated 5 months ago
- ☆22Nov 5, 2024Updated last year
- FleVRS: Towards Flexible Visual Relationship Segmentation, NeurIPS 2024☆22Dec 9, 2024Updated last year
- Official implementation of "Unsupervised Pre-training for Data-Efficient Text-to-Speech on Low Resource Languages", ICASSP 2023☆27Apr 27, 2023Updated 3 years ago
- Art2Mus is a system that generates music based on digitized artworks and text by using the AudioLDM2 architecture with an added projectio…☆20Oct 20, 2025Updated 7 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆61Oct 28, 2024Updated last year
- End-To-End SpeechSynthesis system with knowledge distillation☆18Jul 16, 2022Updated 3 years ago
- Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025☆40Feb 24, 2025Updated last year
- ☆25May 23, 2025Updated last year
- ☆33Nov 4, 2024Updated last year
- PyTorch code for training and evaluating MOVE, musically-motivated version embeddings☆50Jul 6, 2023Updated 2 years ago
- UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation☆48Aug 26, 2025Updated 9 months ago
- An AR+AR TTS attempt.☆18Jan 13, 2025Updated last year
- (CVPR 2024) "Unsegment Anything by Simulating Deformation"☆29May 27, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- FQGAN: Factorized Visual Tokenization and Generation☆59Mar 29, 2025Updated last year
- This repository is for an implementation of the accepted paper "Sketching the Expression: Flexible Rendering of Expressive Piano Performa…☆22Dec 15, 2022Updated 3 years ago
- ☆121Jun 2, 2026Updated last week
- High-performance Image Tokenizers for VAR and AR☆307Apr 25, 2025Updated last year
- A Chinese Character BERT Trained with Multi-Level Masking☆12Sep 24, 2023Updated 2 years ago
- Consistent Autoregressive Video Generation with Long Context☆88Feb 6, 2026Updated 4 months ago
- ☆12Oct 17, 2024Updated last year
- A collection of niche / personally useful PyTorch optimizers with modified code.☆28Apr 14, 2026Updated last month
- Unofficial PyTorch implementation of "Autoregressive Speech Synthesis without Vector Quantization (MELLE)"☆41Jun 28, 2025Updated 11 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Unofficial PyTorch implementation of "Step-unrolled Denoising Autoencoders for Text Generation"☆24Nov 19, 2022Updated 3 years ago
- Code accompanying paper "SharpDepth: Sharpening Metric Depth Predictions Using Diffusion Distillation"☆27May 8, 2026Updated last month
- ONSETS&VELOCITIES real-time piano detection - PyTorch training [EUSIPCO2023]☆29Aug 31, 2023Updated 2 years ago
- ☆19Aug 1, 2025Updated 10 months ago
- IROS☆17Aug 10, 2025Updated 9 months ago
- [CVPR 2025] A Unified Image-Dense Annotation Generation Model for Underwater Scenes☆58Apr 9, 2025Updated last year
- Implementation of Autoregressive Diffusion in Pytorch☆437Dec 4, 2025Updated 6 months ago
- [CVPR 2025] GEAL: Generalizable 3D Affordance Learning with Cross-Modal Consistency☆45Nov 2, 2025Updated 7 months ago
- ☆11Apr 12, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This repository is an offical PyTorch implementation of SD-GAN: Semantic Decomposition for Face Image Synthesis with Discrete Attribute.☆13Mar 18, 2024Updated 2 years ago
- ☆15May 13, 2024Updated 2 years ago
- CVPR 2024 Official Repository☆13Mar 27, 2024Updated 2 years ago
- ☆188Jun 27, 2025Updated 11 months ago
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆70Nov 1, 2024Updated last year
- ☆10Jun 11, 2024Updated last year
- [MM'22 Oral] AI Illustrator: Translating Raw Descriptions into Images by Prompt-based Cross-Modal Generation☆11Apr 3, 2023Updated 3 years ago