aeroplanepaper/GRPO-LEAD

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/aeroplanepaper/GRPO-LEAD)

aeroplanepaper / GRPO-LEAD

☆40

Alternatives and similar repositories for GRPO-LEAD

Users that are interested in GRPO-LEAD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

NineAbyss / S2R
View on GitHub
This is the official implementation of the paper "S²R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning"
☆75Apr 22, 2025Updated last year
JingyangYi / ShorterBetter
View on GitHub
☆18Jul 31, 2025Updated 11 months ago
multimodal-art-projection / TreePO
View on GitHub
☆65Mar 30, 2026Updated 3 months ago
pzs19 / LEMMA
View on GitHub
☆16Sep 4, 2025Updated 10 months ago
GX-XinGao / GRA
View on GitHub
The Code and Script of "David's Slingshot: A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis"
☆34Jun 13, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
tengxiao1 / SimPER
View on GitHub
SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025)
☆17Aug 22, 2025Updated 10 months ago
IBM / ColPret
View on GitHub
Efficient Scaling laws and collaborative pretraining.
☆22Sep 18, 2025Updated 9 months ago
horseee / CoT-Valve
View on GitHub
CoT-Valve: Length-Compressible Chain-of-Thought Tuning
☆91Feb 14, 2025Updated last year
nick7nlp / FastCuRL
View on GitHub
FastCuRL: Curriculum Reinforcement Learning with Stage-wise Context Scaling for Efficient LLM Reasoning (EMNLP 2025)
☆61Oct 10, 2025Updated 9 months ago
opendatalab / REST
View on GitHub
☆34Jul 15, 2025Updated 11 months ago
nuochenpku / COMEDY
View on GitHub
This is the official project of paper: Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conver…
☆25Nov 18, 2024Updated last year
arctanxarc / GENIUS
View on GitHub
☆42May 9, 2026Updated 2 months ago
tongxuluo / LeaP
View on GitHub
Code, Data and Model for Paper "Learning from Peers in Reasoning Models"
☆26May 13, 2025Updated last year
euxcet / thulearn2018
View on GitHub
Tools for Web Learning of Tsinghua University.
☆10Sep 17, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
hemingkx / TokenSkip
View on GitHub
[EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMs
☆225Nov 30, 2025Updated 7 months ago
lzhxmu / CPPO
View on GitHub
CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models (NeurIPS 2025)
☆181Nov 4, 2025Updated 8 months ago
XiaokunFeng / MemVLT
View on GitHub
[NeurIPS'24] MemVLT: Vision-Language Tracking with Adaptive Memory-based Prompts
☆19Oct 7, 2024Updated last year
ritzz-ai / PACS
View on GitHub
☆31Sep 12, 2025Updated 10 months ago
Vision-CAIR / Infinibench
View on GitHub
Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows
☆20Nov 4, 2025Updated 8 months ago
abdelfattah-lab / SplitReason
View on GitHub
☆20Mar 18, 2026Updated 3 months ago
scalable-model-editing / unified-model-editing
View on GitHub
We introduce EMMET and unify model editing with popular algorithms ROME and MEMIT.
☆29Dec 16, 2024Updated last year
pangjh3 / AnLLM
View on GitHub
☆20Jun 17, 2024Updated 2 years ago
TrustedLLM / UnKE
View on GitHub
☆24Feb 18, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
NJUNLP / PATS
View on GitHub
☆46May 27, 2025Updated last year
THU-KEG / AdaptThink
View on GitHub
☆186Dec 5, 2025Updated 7 months ago
TIGER-AI-Lab / Hierarchical-Reasoner
View on GitHub
Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning [ICLR26]
☆64Apr 11, 2026Updated 3 months ago
InternLM / OREAL
View on GitHub
Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning
☆190Mar 20, 2025Updated last year
TergelMunkhbat / concise-reasoning
View on GitHub
Code for paper called Self-Training Elicits Concise Reasoning in Large Language Models
☆44Apr 22, 2025Updated last year
multimodal-art-projection / CriticLean
View on GitHub
☆49Aug 5, 2025Updated 11 months ago
katiekang1998 / reasoning_generalization
View on GitHub
☆33Jan 7, 2025Updated last year
steven-ccq / VisualReasoner
View on GitHub
[EMNLP 2024] Official repository for paper "From the Least to the Most: Building a Plug-and-Play Visual Reasoner via Data Synthesis"
☆22Oct 15, 2024Updated last year
RUCKBReasoning / CodeRM
View on GitHub
Official code implementation for the ACL 2025 paper: 'Dynamic Scaling of Unit Tests for Code Reward Modeling'
☆27May 16, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
plageon / HierSearch
View on GitHub
HierSearch: A Hierarchical Enterprise Deep Search Framework Integrating Local and Web Searches
☆40Oct 9, 2025Updated 9 months ago
THU-KEG / ReaRAG
View on GitHub
ReaRAG: Knowledge-guided Reasoning Enhances Factuality of Large Reasoning Models with Iterative Retrieval Augmented Generation
☆28Aug 24, 2025Updated 10 months ago
Adaxry / Unified_Layer_Skipping
View on GitHub
☆15Apr 11, 2024Updated 2 years ago
AI45Lab / DEAN
View on GitHub
☆11Oct 25, 2024Updated last year
alchemistyzz / PeRL
View on GitHub
[NeurIPS'25] The official code of "PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning"
☆30Mar 30, 2026Updated 3 months ago
shiqichen17 / SPA
View on GitHub
Github repository for "Internalizing World Models via Self-Play Finetuning for Agentic RL"
☆36Nov 1, 2025Updated 8 months ago
amazon-science / ContextualUnderstanding-ContrastiveDecoding
View on GitHub
Enhancing contextual understanding in large language models through contrastive decoding
☆19May 3, 2024Updated 2 years ago