Official respository for ReasonGen-R1
☆74Jun 23, 2025Updated 8 months ago
Alternatives and similar repositories for ReasonGen-R1
Users that are interested in ReasonGen-R1 are comparing it to the libraries listed below
Sorting:
- [TOG 2025] Order Matters: Learning Element Ordering for Graphic Design Generation☆20Aug 5, 2025Updated 6 months ago
- [ICLR 2025] MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow☆26Apr 9, 2025Updated 10 months ago
- Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning☆237May 30, 2025Updated 9 months ago
- Official implementation of Next Block Prediction: Video Generation via Semi-Autoregressive Modeling☆41Feb 12, 2025Updated last year
- Official implementation of "LOCATEdit: Graph Laplacian Optimized Cross Attention for Localized Guided Image Editing☆15May 27, 2025Updated 9 months ago
- [NIPS24] Official Implementation of Unsupervised Modality Adaptation with Text-to-Image Diffusion Models for Semantic Segmentation☆20Oct 31, 2024Updated last year
- ☆168Nov 26, 2025Updated 3 months ago
- ☆141Oct 15, 2025Updated 4 months ago
- [ICCV 2025] Official implementation of "Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing"☆28Apr 15, 2025Updated 10 months ago
- The official repo for “Unleashing the Reasoning Potential of Pre-trained LLMs by Critique Fine-Tuning on One Problem” [EMNLP25]☆34Sep 1, 2025Updated 6 months ago
- [ICML 2025] Official Code for "ADHMR: Aligning Diffusion-based Human Mesh Recovery via Direct Preference Optimization"☆42Feb 13, 2026Updated 2 weeks ago
- [Arxiv 2025] ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions☆45Jun 11, 2025Updated 8 months ago
- [CVPR 2025] The First Investigation of CoT Reasoning (RL, TTS, Reflection) in Image Generation☆857May 23, 2025Updated 9 months ago
- An Efficient Text-to-Image Generation Pretrain Pipeline☆130Apr 18, 2025Updated 10 months ago
- [NeurIPS 2025] HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation☆76Sep 19, 2025Updated 5 months ago
- Official Implementation of "UniFlow: A Unified Pixel Flow Tokenizer for Visual Understanding and Generation"☆137Oct 17, 2025Updated 4 months ago
- UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation☆46Aug 26, 2025Updated 6 months ago
- [AAAI 2026] Few-step Flow for 3D Generation via Marginal-Data Transport Distillation☆50Jan 9, 2026Updated last month
- RepText: Rendering Visual Text via Replicating 🔥☆141Jun 7, 2025Updated 8 months ago
- A toolbox for benchmarking Multimodal LLM Agents trustworthiness across truthfulness, controllability, safety and privacy dimensions thro…☆63Jan 9, 2026Updated last month
- Explore how to get a VQ-VAE models efficiently!☆68Jul 24, 2025Updated 7 months ago
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"☆198Jan 7, 2026Updated last month
- [NeurIPS 2025] T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT☆430Sep 18, 2025Updated 5 months ago
- Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"☆426Jun 20, 2025Updated 8 months ago
- [CVPR 2025] Code for Deformable Radial Kernel Splatting☆200May 20, 2025Updated 9 months ago
- High-Resolution Visual Reasoning via Multi-Turn Grounding-Based Reinforcement Learning☆53Jul 23, 2025Updated 7 months ago
- Official implementation of UnifiedReward & [NeurIPS 2025] UnifiedReward-Think & UnifiedReward-Flex☆723Updated this week
- [CVPR2024] Inverse Rendering of Glossy Objects via the Neural Plenoptic Function and Radiance Fields☆35Dec 19, 2025Updated 2 months ago
- [ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation☆186May 21, 2025Updated 9 months ago
- ☆10Dec 21, 2022Updated 3 years ago
- The code repository of UniRL☆51May 30, 2025Updated 9 months ago
- ☆41Aug 16, 2024Updated last year
- [NeurIPS 2025]Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency☆76Sep 19, 2025Updated 5 months ago
- [NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory☆338Feb 21, 2026Updated last week
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation☆218Dec 9, 2025Updated 2 months ago
- ☆291Jul 29, 2025Updated 7 months ago
- Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).☆420Aug 26, 2025Updated 6 months ago
- [ICCV 2025] Official code for paper: Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMs☆68Jul 1, 2025Updated 8 months ago
- A sample project for using iTextSharp to create pdf from picture folder or draw ECG from specfied format of text file, support Android, i…☆11Jun 3, 2015Updated 10 years ago