Official Implementation of "Maximum Likelihood Reinforcement Learning (MaxRL)"
☆134Feb 26, 2026Updated last week
Alternatives and similar repositories for maxrl
Users who are interested in maxrl are comparing it to the libraries listed below
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.☆28Dec 30, 2025Updated 2 months ago
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆14Jun 28, 2025Updated 8 months ago
- Official Implementation for the paper "VisCodex: Unified Multimodal Code Generation via Merging Vision and Coding Models"☆21Aug 14, 2025Updated 6 months ago
- ☆14Apr 25, 2025Updated 10 months ago
- ☆16Jun 10, 2025Updated 8 months ago
- ☆34Jan 25, 2026Updated last month
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best…☆59Mar 17, 2025Updated 11 months ago
- (ICML 2025) Rethinking Chain-of-Thought from the Perspective of Self-Training☆13Feb 15, 2025Updated last year
- MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs☆38Feb 19, 2026Updated last week
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆34Jan 16, 2026Updated last month
- More reliable Video Understanding Evaluation☆14Sep 23, 2025Updated 5 months ago
- ☆42Sep 15, 2025Updated 5 months ago
- A video game made with PyGame, turned into an OpenAI Gym learning environment for reinforcement learning agents.☆15Jan 3, 2023Updated 3 years ago
- Improving large language models with concept-aware fine-tuning (CAFT)☆29Jan 31, 2026Updated last month
- ☆24May 23, 2025Updated 9 months ago
- A collaborative agent-based workflow designed for the NL2Vis task☆19Mar 6, 2025Updated 11 months ago
- ☆355Feb 20, 2026Updated last week
- ☆19Jun 29, 2025Updated 8 months ago
- "FusionFactory: Fusing LLM Capabilities with Routing Data", Tao Feng, Haozhen Zhang, Zijie Lei, Pengrui Han, Mostofa Patwary, Mohammad Sh…☆19Dec 30, 2025Updated 2 months ago
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.☆19Oct 14, 2024Updated last year
- OmniGAIA: Towards Native Omni-Modal AI Agents☆46Updated this week
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆29Oct 9, 2025Updated 4 months ago
- [ICLR 2026] Official PyTorch implementation for "ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding"☆57Dec 26, 2025Updated 2 months ago
- ☆17Feb 4, 2025Updated last year
- ☆93Dec 30, 2025Updated 2 months ago
- Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"☆33Jul 25, 2025Updated 7 months ago
- vMCP - Virtual Model Context Protocol☆50Dec 24, 2025Updated 2 months ago
- [NeurIPS25 Spotlight] Official Implementation for CBSA (Contract-and-Broadcast Self-Attention)☆35Dec 9, 2025Updated 2 months ago
- Official Code for "Rethinking Diffusion Model in High Dimension"☆24May 20, 2025Updated 9 months ago
- ☆20Oct 25, 2022Updated 3 years ago
- A platform for building configurable, database-backed generative AI agentic assistants.☆25Feb 11, 2025Updated last year
- Codes for the paper "BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping…☆91Jan 29, 2026Updated last month
- CFR implementation of a poker bot.☆12Feb 17, 2023Updated 3 years ago
- [ICML 2025] Official Implementation of Hierarchical Masked Autoregressive Models with Low-Resolution Token Pivots☆30May 28, 2025Updated 9 months ago
- [ICLR 2026] PixNerd: Pixel Neural Field Diffusion☆170Dec 10, 2025Updated 2 months ago
- Updating curated list of research advancements on item identification in generative recommender systems. The survey is titled "A Survey o…☆58Feb 18, 2026Updated 2 weeks ago
- Official implementation for Diffusion Models Are Innate One-Step Generators☆26Jun 25, 2025Updated 8 months ago
- The official implementation of Mantis: A Versatile Vision-Language-Action Model with Disentangled Visual Foresight☆83Jan 16, 2026Updated last month
- Reinforcement Learning Seminar at the Chinese University of Hong Kong, Shenzhen, China.☆21Nov 17, 2023Updated 2 years ago