LemonTwoL / ReNeg
ReNeg: Learning Negative Embedding with Reward Guidance
☆25Updated last week
Alternatives and similar repositories for ReNeg:
Users that are interested in ReNeg are comparing it to the libraries listed below
- ☆37Updated last year
- Open implementation of "RandAR"☆46Updated last week
- ☆17Updated this week
- Sora Generates Videos with Stunning Geometrical Consistency☆47Updated 9 months ago
- [Neurips 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller☆31Updated 3 weeks ago
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆83Updated 5 months ago
- ☆42Updated last week
- Liquid: Language Models are Scalable Multi-modal Generators☆57Updated 3 weeks ago
- ☆58Updated last year
- Official PyTorch implementation - Video Motion Transfer with Diffusion Transformers☆32Updated last month
- Official Implementation of ICLR'24: Kosmos-G: Generating Images in Context with Multimodal Large Language Models☆57Updated 7 months ago
- IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model☆26Updated last month
- Official Implementation of VideoDPO☆31Updated last week
- 🔥 Aurora Series: A more efficient multimodal large language model series for video.☆61Updated last month
- Repo of HawkLlama.☆13Updated last week
- [NeurIPS 2024 D&B Track] Official Repo for "LVD-2M: A Long-take Video Dataset with Temporally Dense Captions"☆45Updated 2 months ago
- 🔥 [CVPR 2024] Official implementation of "See, Say, and Segment: Teaching LMMs to Overcome False Premises (SESAME)"☆30Updated 6 months ago
- Code for ROICtrl: Boosting Instance Control for Visual Generation☆99Updated last month
- VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation☆77Updated this week
- Towards training VQ-VAE models robustly!☆42Updated this week
- The official implementation for "MonoFormer: One Transformer for Both Diffusion and Autoregression"☆80Updated 2 months ago
- FQGAN: Factorized Visual Tokenization and Generation☆39Updated this week
- Diffusion Powers Video Tokenizer for Comprehension and Generation☆38Updated last month
- Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)☆74Updated 6 months ago
- DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention☆117Updated last month
- ☆33Updated 2 months ago
- Implementation of Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding☆26Updated 2 months ago
- T2VScore: Towards A Better Metric for Text-to-Video Generation☆77Updated 9 months ago
- [NeurlPS 2024] One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos☆94Updated 2 weeks ago
- ICCV2023-Diffusion-Papers☆109Updated last year