Yuanshi9815 / LiteFocus
[Interspeech 2024] LiteFocus is a tool designed to accelerate diffusion-based TTA model, now implemented with the base model AudioLDM2.
☆33Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for LiteFocus
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)☆54Updated 3 weeks ago
- ☆101Updated 4 months ago
- official code for Diff-Instruct algorithm for one-step diffusion distillation☆46Updated 7 months ago
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching☆71Updated 3 months ago
- Score identity Distillation with Long and Short Guidance for One-Step Text-to-Image Generation☆33Updated 2 months ago
- Vico: Compositional Video Generation as Flow Equalization☆50Updated 4 months ago
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective☆37Updated last week
- An official pytorch implementation of AAAI 2024 paper "Latent Space Editing in Transformer-based Flow Matching"☆27Updated 6 months ago
- [ICLR 2024] Official code for the paper 'Elucidating the Exposure Bias in Diffusion Models'☆39Updated 5 months ago
- FORA introduces simple yet effective caching mechanism in Diffusion Transformer Architecture for faster inference sampling.☆28Updated 4 months ago
- ☆25Updated last year
- Official PyTorch Implementation of "Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models"☆30Updated last month
- The codebase of our paper "Improving the Training of Rectified Flows"☆79Updated 3 weeks ago
- ☆29Updated last week
- [Arxiv 2024] Official code for MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions☆24Updated this week
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆29Updated 4 months ago
- ☆44Updated 2 months ago
- This repo contains the official PyTorch implementation of vLMIG: Improving Visual Commonsense in Language Models via Multiple Image Gener…☆16Updated 4 months ago
- "SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow", Yuanzhi Zhu, Xingchao Liu, Qiang Liu☆38Updated 2 weeks ago
- Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner"☆60Updated last month
- This repo contains the official PyTorch implementation of AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image …☆76Updated 4 months ago
- ☆44Updated 7 months ago
- Official code for Accelerating Diffusion Sampling with Optimized Time Steps (CVPR 2024)☆22Updated 7 months ago
- The official PyTorch implementation of Fast Diffusion Model☆91Updated last year
- [ICLR 2024 Spotlight] This is the official PyTorch implementation of "EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Di…☆50Updated 5 months ago
- Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding☆21Updated this week
- Official Repository of Personalized Visual Instruct Tuning☆23Updated this week
- Rectified Diffusion: Straightness Is Not Your Need☆117Updated last week
- STAR: Scale-wise Text-to-image generation via Auto-Regressive representations☆117Updated 4 months ago