a unified reinforcement learning toolbox for joint RL on language models and diffusion models
☆75Feb 7, 2026Updated last month
Alternatives and similar repositories for UniRL
Users that are interested in UniRL are comparing it to the libraries listed below
Sorting:
- ☆22May 11, 2025Updated 9 months ago
- [CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts☆21Dec 22, 2025Updated 2 months ago
- ☆41Jan 4, 2026Updated 2 months ago
- Exploring Representation-Aligned Latent Space for Better Generation☆17Feb 4, 2025Updated last year
- https://huggingface.co/datasets/multimodal-reasoning-lab/Zebra-CoT☆125Jan 30, 2026Updated last month
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization☆25Apr 14, 2025Updated 10 months ago
- ☆40Dec 16, 2025Updated 2 months ago
- BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models☆40Oct 30, 2025Updated 4 months ago
- Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]☆24Aug 13, 2024Updated last year
- Extend BoxDiff to SDXL (SDXL-based layout-to-image generation)☆26May 23, 2024Updated last year
- Official repository for “PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss”☆205Feb 3, 2026Updated last month
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization☆109Apr 10, 2024Updated last year
- Official implementation of "Art-Free Generative Models: Art Creation Without Graphic Art Knowledge"☆32Nov 30, 2025Updated 3 months ago
- Consistent Autoregressive Video Generation with Long Context☆72Feb 6, 2026Updated last month
- [CVPR 2025] A Hierarchical Movie Level Dataset for Long Video Generation☆92Mar 16, 2025Updated 11 months ago
- [ICLR 2026 Oral] DiffusionNFT: Online Diffusion Reinforcement with Forward Process☆669Feb 10, 2026Updated 3 weeks ago
- MoCapDeform: Monocular 3D Human Motion Capture in Deformable Scenes☆31Jul 2, 2024Updated last year
- ☆157Jan 16, 2026Updated last month
- The repository of VG-Refiner paper☆17Dec 9, 2025Updated 2 months ago
- Code used in the 2023 Alert Geomaterials doctoral school on Machine in Geomechanics☆13Oct 2, 2023Updated 2 years ago
- 哈尔滨工业大学2023春季学期编译系统课程实验、习题、课件以及期末复习材料☆11Jul 30, 2023Updated 2 years ago
- ☆78May 8, 2025Updated 9 months ago
- Official implementation of HPSv3: Towards Wide-Spectrum Human Preference Score (ICCV2025)☆269Dec 5, 2025Updated 3 months ago
- CycleReward is a reward model trained on cycle consistency preferences to measure image-text alignment.☆55Nov 3, 2025Updated 4 months ago
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆43Mar 11, 2025Updated 11 months ago
- [EAAI 2024] Template-based Feature Aggregation Network for industrial anomaly detection☆11Mar 6, 2025Updated last year
- Mohr Coulomb Model☆15Aug 17, 2021Updated 4 years ago
- Prior Information based NEural neTwork (PiNet) is tailored for constitutive modelling and solving partial differential equations in soil …☆15Nov 4, 2025Updated 4 months ago
- The official implementation of "2025ICLR Dynamic Diffusion Transformer" and "2025ArXivDyDiT++: Dynamic Diffusion Transformers for Efficie…☆47Apr 10, 2025Updated 10 months ago
- code for paper "Ju Xu, Zhanxing Zhu. Reinforced Continual Learning. NIPS 2018."☆37Feb 17, 2019Updated 7 years ago
- [ACM MM 2023] The released code of paper "Deconfounded Visual Question Generation with Causal Inference"☆11Sep 3, 2024Updated last year
- Official Implementation of MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models☆12Nov 1, 2025Updated 4 months ago
- ☆13Nov 5, 2024Updated last year
- Towards Memorization-Free Diffusion Models (CVPR2024) Codebase☆12Jun 2, 2024Updated last year
- [ICML 2025 Oral] An official implementation of VideoRoPE & VideoRoPE++☆219Feb 2, 2026Updated last month
- The official code of "Thinking With Videos: Multimodal Tool-Augmented Reinforcement Learning for Long Video Reasoning"☆87Oct 15, 2025Updated 4 months ago
- Diffusion Model(DDPM)来生成MNIST数字☆13Jul 29, 2023Updated 2 years ago
- JsonTuning: Towards Generalizable, Robust, and Controllable Instruction Tuning☆10Nov 3, 2024Updated last year
- Graph Convolutional Networks for Unstructured Flow Fields☆13Sep 5, 2022Updated 3 years ago