Training Autoregressive Image Generation models via Reinforcement Learning
☆50Nov 26, 2025Updated 3 months ago
Alternatives and similar repositories for AR-GRPO
Users that are interested in AR-GRPO are comparing it to the libraries listed below
Sorting:
- PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation☆53Jan 5, 2026Updated 2 months ago
- [TPAMI 2023] Object Affinity Learning: Towards Annotation-free Instance Segmentation☆14Sep 14, 2023Updated 2 years ago
- This is the official PyTorch implementation of TBSR. Our team received 2nd place (real data track) and 3rd place (synthetic track) in NTI…☆14Jun 11, 2022Updated 3 years ago
- ☆15Mar 30, 2025Updated 11 months ago
- Codes of PostEdit☆23Apr 28, 2025Updated 10 months ago
- ☆31Jul 16, 2025Updated 7 months ago
- PhotoVerse is a text-to-image generation system that produces personalized images from text prompts using a single facial photograph.☆33May 23, 2024Updated last year
- EditAR: Unified Conditional Generation with Autoregressive Models (CVPR 2025)☆39Jun 13, 2025Updated 8 months ago
- Accelerating Multi-Reference Virtual Try-On via Cacheable Diffusion Models☆56Jan 3, 2026Updated 2 months ago
- A Multi-channel CNN for Blind 360-Degree Image Quality Assessment☆27Mar 8, 2023Updated 2 years ago
- SimCMF: A Simple Cross-modal Fine-tuning Strategy from Vision Foundation Models to Any Imaging Modality☆35Nov 25, 2024Updated last year
- Official implementation of the paper ``W2N: Switching From Weak Supervision to Noisy Supervision for Object Detection"☆29Jul 26, 2022Updated 3 years ago
- Retrieval-based Spatially Adaptive Normalization for Semantic Image Synthesis(CVPR2022)☆26May 3, 2022Updated 3 years ago
- Self-Pair: Synthesizing Changes from Single Source for Object Change Detection in Remote Sensing Imagery (official)☆26Dec 23, 2022Updated 3 years ago
- Official repository for "Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models", https://arxiv.org/abs/2601.1983…☆78Feb 13, 2026Updated 3 weeks ago
- Official code of "Edit Transfer: Learning Image Editing via Vision In-Context Relations"☆88Jun 6, 2025Updated 9 months ago
- An official implementation of Coefficients-Preserving Sampling for Reinforcement Learning with Flow Matching☆66Sep 11, 2025Updated 5 months ago
- UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture☆94Feb 5, 2026Updated last month
- ☆37Mar 21, 2025Updated 11 months ago
- ☆18Aug 1, 2025Updated 7 months ago
- pix2pix and Cycle GAN architectures for image style transfer☆13May 27, 2021Updated 4 years ago
- [AAAI2026] Bring Your Dreams to Life: Continual Text-to-Video Customization☆36Dec 9, 2025Updated 2 months ago
- Knowledge Guided Multi-instance Multi-label Networks (KG-MIML-Net) for Medicines Prediction☆13Oct 2, 2018Updated 7 years ago
- Create realistic looking handwritten text PDFs from text files.☆15Jun 19, 2021Updated 4 years ago
- ☆25Aug 19, 2025Updated 6 months ago
- (CVPR 2026) Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation☆27Feb 28, 2026Updated last week
- Source code used in the blog☆12Feb 6, 2024Updated 2 years ago
- https://demo-web.reflex.run☆12Apr 25, 2024Updated last year
- ☆43Dec 1, 2025Updated 3 months ago
- Automate your blogging with AI-powered tools for creating, optimizing, and deploying content. Generate SEO-optimized articles effortlessl…☆12Aug 16, 2024Updated last year
- ☆10Oct 13, 2024Updated last year
- Open Translator: Speech To Speech and Speech to text Translator with voice cloning and other cool features☆14Feb 24, 2026Updated last week
- Non-disruptive collagen characterization in clinical histopathology using cross-modality image synthesis☆10Apr 25, 2025Updated 10 months ago
- Wikimedia Enterprise - client SDK in Python☆20Nov 11, 2025Updated 3 months ago
- This is the official PyTorch implement of MW-ISPNet. Our team received a winner award in the AIM 2020 Learned Image ISP Challenge (ECCVW …☆41Jun 11, 2022Updated 3 years ago
- Official code for our CVPR 2025 paper: "Toward Generalized Image Quality Assessment: Relaxing the Perfect Reference Quality Assumption"☆66Sep 15, 2025Updated 5 months ago
- [ECCV 2024, Oral] Self-Supervised Video Desmoking for Laparoscopic Surgery☆47Nov 18, 2024Updated last year
- [ICCV 2021] Learning RAW-to-sRGB Mappings with Inaccurately Aligned Supervision☆104May 5, 2025Updated 10 months ago
- [ICCV 2021] Click to Move: Controlling Video Generation with Sparse Motion☆11Apr 14, 2023Updated 2 years ago