Gengzigang / TokenSet
Official PyTorch implementation of TokenSet.
☆88Updated last week
Alternatives and similar repositories for TokenSet:
Users that are interested in TokenSet are comparing it to the libraries listed below
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!☆78Updated 3 weeks ago
- [CVPR2025] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project/☆127Updated last week
- Inference-time scaling of diffusion-based image and video generation models.☆117Updated 3 weeks ago
- [CVPR2025] PyTorch-based reimplementation of CrossFlow, as proposed in 'Flowing from Words to Pixels: A Noise-Free Framework for Cross-Mo…☆143Updated last week
- ☆191Updated last month
- Official Implementation of weights2weights☆140Updated 2 weeks ago
- Official Implementation of Video-T1: Test-Time Scaling for Video Generation☆70Updated this week
- ☆68Updated 5 months ago
- The official implementation of OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows☆57Updated 2 weeks ago
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner"☆66Updated 5 months ago
- [ECCV 2024] Official PyTorch implementation of "Getting it Right: Improving Spatial Consistency in Text-to-Image Models"☆100Updated 8 months ago
- [arXiv] On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices☆104Updated last month
- [ICLR 2025] Official PyTorch implmentation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stit…☆100Updated last year
- 🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic…☆68Updated 3 months ago
- Official implemention of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects" (CVPR 2025)☆69Updated 2 weeks ago
- Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified …☆68Updated 3 months ago
- Learning Motion from Low-Rank Adaptation☆44Updated 9 months ago
- [NeurIPS 2024] RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance☆123Updated 5 months ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening☆50Updated last month
- [NeurIPS 2024] ReNO: Enhancing One-step Text-to-Image Models through Reward-based Noise Optimization☆132Updated 2 months ago
- [ICLR 2024] Official repo. for Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis☆104Updated last year
- ☆147Updated 3 months ago
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).☆71Updated 9 months ago
- Paint by Inpaint: Learning to Add Image Objects by Removing Them First☆98Updated last week
- Pytorch implementation of MIMO, Controllable Character Video Synthesis with Spatial Decomposed Modeling, from Alibaba Intelligence Group☆133Updated 5 months ago
- ☆83Updated 7 months ago
- Official implementation for "pOps: Photo-Inspired Diffusion Operators"☆79Updated 8 months ago
- ☆68Updated 4 months ago
- Implementation of SmoothCache, a project aimed at speeding-up Diffusion Transformer (DiT) based GenAI models with error-guided caching.☆41Updated this week
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…☆69Updated 3 months ago