Owen718 / LongPrompt-LLamaGenLinks
This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompts. And it's also powered by additional prompt refining features for improved performance.
☆30Updated 7 months ago
Alternatives and similar repositories for LongPrompt-LLamaGen
Users that are interested in LongPrompt-LLamaGen are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT☆98Updated last month
- Autoregressive Image Generation with Randomized Parallel Decoding☆63Updated 2 months ago
- [CVPR 2025 AI4CC Workshop] Official Implementation of HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editin…☆30Updated 3 weeks ago
- official code repo of CVPR 2025 paper PhyT2V: LLM-Guided Iterative Self-Refinement for Physics-Grounded Text-to-Video Generation☆31Updated 2 months ago
- [NeurIPS 2024] The official implement of research paper "FreeLong : Training-Free Long Video Generation with SpectralBlend Temporal Atten…☆44Updated 3 months ago
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆31Updated 3 months ago
- Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆117Updated 2 weeks ago
- No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves☆53Updated this week
- ☆50Updated 5 months ago
- VeriThinker: Learning to Verify Makes Reasoning Model Efficient☆38Updated last week
- UniCon: A Simple Approach to Unifying Diffusion-based Conditional Generation (ICLR 2025)☆29Updated last week
- ☆36Updated 2 weeks ago
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis☆55Updated last month
- [CVPR 25] A framework named B^2-DiffuRL for RL-based diffusion model fine-tuning.☆29Updated 2 months ago
- The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation"☆50Updated 2 months ago
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project☆158Updated 2 months ago
- Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?☆49Updated this week
- Vico: Compositional Video Generation as Flow Equalization☆58Updated 6 months ago
- Official implementation of LaVin-DiT☆32Updated 4 months ago
- Official Repository of paper: Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing☆53Updated last week
- ☆26Updated 3 months ago
- ☆52Updated last month
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement"☆47Updated 6 months ago
- An Empirical Study of GPT-4o Image Generation Capabilities☆20Updated last month
- The official code of "Weak-to-Strong Diffusion with Reflection".☆45Updated last month
- ☆87Updated this week
- GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning☆70Updated last week
- Dimple, the first Discrete Diffusion Multimodal Large Language Model☆60Updated last week
- [CVPR 2025] PatchDPO: Patch-level DPO for Finetuning-free Personalized Image Generation☆33Updated 3 months ago
- ☆43Updated last month